Publication Date
2025
Document Type
Article
Description
Ancient DNA (aDNA) is fragmented and degraded due to post-mortem processes, leading to low coverage and incomplete sequences. Mitochondrial DNA (mtDNA), despite its high copy number, is similarly subject to degradation processes, requiring statistical imputation for whole mtDNA reconstruction. While nuclear genome imputation is well-established, ancient mtDNA imputation remains underexplored. Here, this study benchmarks two mtDNA imputation pipelines—MitoIMP, and a novel pipeline integrating Minimac4’s Hidden Markov Model (HMM) with a k-Nearest Neighbors (k-NN) algorithm. Using the largest mtDNA imputation panel to date (n = 46,000), reconstruction accuracy was tested on 100 simulated ancient mtDNAs (≥0.25X coverage) generated with Gargammel and processed via EAGER (Efficient Ancient Genome Reconstruction). Imputation performance was evaluated using HaploGrep3 haplogroup classifications. The results show that combining a HMM with k-NN refinement significantly improves imputation accuracy, particularly at ultra-low coverage, and increases the likelihood of assigning accurate haplotypes to highly degraded aDNA and forensic DNA samples.
Files
Download Full Text (918 KB)
Recommended Citation
Plummer, Nathaniel; Fang, Yuan; Shamoon-Pour, Michel; Cozzarelli, Ari; Ghafoor, Suhail; Spake, Laure; and Emery, Matthew W., "Imputation and k-NN based Haplotype Refinement of Simulated Ancient Mitochondrial Genomes" (2025). Research Days Posters 2025. 140.
https://orb.binghamton.edu/research_days_posters_2025/140
