MitoH3: Mitochondrial Haplogroup and Homoplasmic/Heteroplasmic Variant Calling Pipeline for Alzheimer's Disease Sequencing Project.

IF 2.8 Q2 NEUROSCIENCES Journal of Alzheimer's disease reports Pub Date : 2024-04-08 eCollection Date: 2024-01-01 DOI:10.3233/ADR-230120
Congcong Zhu, Tong Tong, John J Farrell, Eden R Martin, William S Bush, Margaret A Pericak-Vance, Li-San Wang, Gerard D Schellenberg, Jonathan L Haines, Kathryn L Lunetta, Lindsay A Farrer, Xiaoling Zhang
{"title":"MitoH3: Mitochondrial Haplogroup and Homoplasmic/Heteroplasmic Variant Calling Pipeline for Alzheimer's Disease Sequencing Project.","authors":"Congcong Zhu, Tong Tong, John J Farrell, Eden R Martin, William S Bush, Margaret A Pericak-Vance, Li-San Wang, Gerard D Schellenberg, Jonathan L Haines, Kathryn L Lunetta, Lindsay A Farrer, Xiaoling Zhang","doi":"10.3233/ADR-230120","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Mitochondrial DNA (mtDNA) is a double-stranded circular DNA and has multiple copies in each cell. Excess heteroplasmy, the coexistence of distinct variants in copies of mtDNA within a cell, may lead to mitochondrial impairments. Accurate determination of heteroplasmy in whole-genome sequencing (WGS) data has posed a significant challenge because mitochondria carrying heteroplasmic variants cannot be distinguished during library preparation. Moreover, sequencing errors, contamination, and nuclear mtDNA segments can reduce the accuracy of heteroplasmic variant calling.</p><p><strong>Objective: </strong>To efficiently and accurately call mtDNA homoplasmic and heteroplasmic variants from the large-scale WGS data generated from the Alzheimer's Disease Sequencing Project (ADSP), and test their association with Alzheimer's disease (AD).</p><p><strong>Methods: </strong>In this study, we present MitoH3-a comprehensive computational pipeline for calling mtDNA homoplasmic and heteroplasmic variants and inferring haplogroups in the ADSP WGS data. We first applied MitoH3 to 45 technical replicates from 6 subjects to define a threshold for detecting heteroplasmic variants. Then using the threshold of 5% ≤variant allele fraction≤95%, we further applied MitoH3 to call heteroplasmic variants from a total of 16,113 DNA samples with 6,742 samples from cognitively normal controls and 6,183 from AD cases.</p><p><strong>Results: </strong>This pipeline is available through the Singularity container engine. For 4,311 heteroplasmic variants identified from 16,113 samples, no significant variant count difference was observed between AD cases and controls.</p><p><strong>Conclusions: </strong>Our streamlined pipeline, MitoH3, enables computationally efficient and accurate analysis of a large number of samples.</p>","PeriodicalId":73594,"journal":{"name":"Journal of Alzheimer's disease reports","volume":"8 1","pages":"575-587"},"PeriodicalIF":2.8000,"publicationDate":"2024-04-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11091720/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Alzheimer's disease reports","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3233/ADR-230120","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/1/1 0:00:00","PubModel":"eCollection","JCR":"Q2","JCRName":"NEUROSCIENCES","Score":null,"Total":0}
引用次数: 0

Abstract

Background: Mitochondrial DNA (mtDNA) is a double-stranded circular DNA and has multiple copies in each cell. Excess heteroplasmy, the coexistence of distinct variants in copies of mtDNA within a cell, may lead to mitochondrial impairments. Accurate determination of heteroplasmy in whole-genome sequencing (WGS) data has posed a significant challenge because mitochondria carrying heteroplasmic variants cannot be distinguished during library preparation. Moreover, sequencing errors, contamination, and nuclear mtDNA segments can reduce the accuracy of heteroplasmic variant calling.

Objective: To efficiently and accurately call mtDNA homoplasmic and heteroplasmic variants from the large-scale WGS data generated from the Alzheimer's Disease Sequencing Project (ADSP), and test their association with Alzheimer's disease (AD).

Methods: In this study, we present MitoH3-a comprehensive computational pipeline for calling mtDNA homoplasmic and heteroplasmic variants and inferring haplogroups in the ADSP WGS data. We first applied MitoH3 to 45 technical replicates from 6 subjects to define a threshold for detecting heteroplasmic variants. Then using the threshold of 5% ≤variant allele fraction≤95%, we further applied MitoH3 to call heteroplasmic variants from a total of 16,113 DNA samples with 6,742 samples from cognitively normal controls and 6,183 from AD cases.

Results: This pipeline is available through the Singularity container engine. For 4,311 heteroplasmic variants identified from 16,113 samples, no significant variant count difference was observed between AD cases and controls.

Conclusions: Our streamlined pipeline, MitoH3, enables computationally efficient and accurate analysis of a large number of samples.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
MitoH3:阿尔茨海默病测序项目线粒体单倍群和同质/异质变异调用管道。
背景:线粒体 DNA(mtDNA)是一种双链环状 DNA,在每个细胞中有多个拷贝。过多的异质性(细胞内 mtDNA 副本中不同变体的共存)可能会导致线粒体损伤。在全基因组测序(WGS)数据中准确确定异质性是一项重大挑战,因为在文库制备过程中无法区分携带异质性变体的线粒体。此外,测序错误、污染和核 mtDNA 片段也会降低异质变体调用的准确性:从阿尔茨海默病测序项目(ADSP)产生的大规模 WGS 数据中高效、准确地调用 mtDNA 同质和异质变体,并检验它们与阿尔茨海默病(AD)的关联:在本研究中,我们介绍了MitoH3--一种用于在ADSP WGS数据中调用mtDNA同质和异质变异并推断单倍群的综合计算管道。我们首先将 MitoH3 应用于来自 6 个受试者的 45 个技术重复,以确定检测异质变异的阈值。然后,利用5%≤变异等位基因比例≤95%的阈值,我们进一步应用MitoH3调用了16113个DNA样本中的异质变异,其中6742个样本来自认知正常对照组,6183个样本来自AD病例:该管道可通过 Singularity 容器引擎使用。对于从 16,113 份样本中鉴定出的 4,311 个异质变体,AD 病例和对照组之间没有观察到显著的变体数量差异:我们的简化管道 MitoH3 可以对大量样本进行高效、准确的计算分析。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
CiteScore
2.80
自引率
0.00%
发文量
0
期刊最新文献
Biomarker Assessment in Parkinson's Disease Dementia and Dementia with Lewy Bodies by the Immunomagnetic Reduction Assay and Clinical Measures. Cognitive Reserve Relationship with Physical Performance in Dementia-Free Older Adults: The MIND-China Study. Are Opioids Agitating? A Data Analysis of Baseline Data from the STAN Study. Cognitive Function After Stopping Folic Acid and DHA Intervention: An Extended Follow-Up Results from the Randomized, Double Blind, Placebo-Controlled Trial in Older Adults with Mild Cognitive Impairment. Individualized and Biomarker-Based Prognosis of Longitudinal Cognitive Decline in Early Symptomatic Alzheimer's Disease.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1