MalKinID: A Classification Model for Identifying Malaria Parasite Genealogical Relationships Using Identity-by-Descent.

IF 3.3 | Zone 3 (Biology) | JCR Q2, GENETICS & HEREDITY | Genetics | Pub Date: 2024-11-23 | DOI: 10.1093/genetics/iyae197
Wesley Wong, Lea Wang, Stephen S Schaffner, Xue Li, Ian Cheeseman, Timothy J C Anderson, Ashley Vaughan, Michael Ferdig, Sarah K Volkman, Daniel L Hartl, Dyann F Wirth
Citations: 0

Abstract


Pathogen genomics is a powerful tool for tracking infectious disease transmission. In malaria, identity-by-descent (IBD) is used to assess the genetic relatedness between parasites and has been used to study transmission and importation. In theory, IBD can be used to distinguish genealogical relationships in order to reconstruct transmission history or to identify parasites for quantitative-trait-locus experiments. MalKinID (Malaria Kinship Identifier) is a new classification model designed to identify genealogical relationships among malaria parasites based on genome-wide IBD proportions and IBD segment distributions. MalKinID was calibrated to the genomic data from three laboratory-based genetic crosses (yielding 440 parent-child [PC] and 9060 full-sibling [FS] comparisons). MalKinID identified lab-generated F1 progeny with >80% sensitivity and showed that 0.39 (95% CI 0.28, 0.49) of the second-generation progeny of an NF54 and NHP4026 cross were F1s and 0.56 (0.45, 0.67) were backcrosses of an F1 with the parental NF54 strain. In simulated outcrossed importations, MalKinID reconstructed genealogical history with high precision and sensitivity, with F1-scores exceeding 0.84. However, when importation involved inbreeding, such as during serial co-transmission, the precision and sensitivity of MalKinID declined, with F1-scores (the harmonic mean of precision and sensitivity) of 0.76 (0.56, 0.92) for PC and 0.23 (0.0, 0.4) for FS, and <0.05 for second-degree and third-degree relatives. Disentangling inbred relationships required adapting MalKinID to perform multi-sample comparisons. Genealogical inference is best powered when 1) outcrossing is the norm or 2) multi-sample comparisons based on a predefined pedigree are used. MalKinID lays the foundations for using IBD to track parasite transmission history and for separating progeny for quantitative-trait-locus experiments.
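
The abstract defines the F1-score as the harmonic mean of precision and sensitivity. A minimal Python sketch of that calculation is given here for readers unfamiliar with the metric. The function name `f1_score` and the counts in the usage line are hypothetical illustrations; they are not taken from the paper or from the MalKinID code.

```python
def f1_score(true_pos: int, false_pos: int, false_neg: int) -> float:
    """Return the harmonic mean of precision and sensitivity (recall)."""
    # With no true positives, precision and sensitivity are both zero
    # (or undefined), so the score is reported as 0.0 by convention.
    if true_pos == 0:
        return 0.0
    precision = true_pos / (true_pos + false_pos)
    sensitivity = true_pos / (true_pos + false_neg)
    return 2 * precision * sensitivity / (precision + sensitivity)


# Hypothetical counts for one relationship class (not data from the paper).
print(round(f1_score(true_pos=80, false_pos=20, false_neg=10), 3))  # 0.842
```

Because it is a harmonic mean, the score is pulled toward the lower of the two inputs, which is why a classifier that loses either precision or sensitivity under inbreeding shows a sharp drop in F1.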

Source journal
Genetics (GENETICS & HEREDITY)
CiteScore: 6.90
Self-citation rate: 6.10%
Articles published: 177
Review time: 1.5 months
Journal description: GENETICS is published by the Genetics Society of America, a scholarly society that seeks to deepen our understanding of the living world by advancing our understanding of genetics. Since 1916, GENETICS has published high-quality, original research presenting novel findings bearing on genetics and genomics. The journal publishes empirical studies of organisms ranging from microbes to humans, as well as theoretical work. While it has an illustrious history, GENETICS has changed along with the communities it serves: it is not your mentor's journal. The editors make decisions quickly, in around 30 days, without sacrificing the excellence and scholarship for which the journal has long been known. GENETICS is a peer-reviewed, peer-edited journal with an international reach and increasing visibility and impact. All editorial decisions are made through collaboration of at least two editors who are practicing scientists. GENETICS is constantly innovating: expanded types of content include Reviews, Commentary (current issues of interest to geneticists), Perspectives (historical), Primers (to introduce primary literature into the classroom), Toolbox Reviews, plus YeastBook, FlyBook, and WormBook (coming spring 2016). For particularly time-sensitive results, we publish Communications. As part of our mission to serve our communities, we've published thematic collections, including Genomic Selection, Multiparental Populations, Mouse Collaborative Cross, and the Genetics of Sex.