FnR:用于计算近亲繁殖和分子关系系数的 R 软件包。

IF 2.3 Q2 ECOLOGY BMC ecology and evolution Pub Date : 2024-07-18 DOI:10.1186/s12862-024-02285-4
Mohammad Ali Nilforooshan
{"title":"FnR:用于计算近亲繁殖和分子关系系数的 R 软件包。","authors":"Mohammad Ali Nilforooshan","doi":"10.1186/s12862-024-02285-4","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Inbreeding and relationship coefficients are essential for conservation and breeding programs. Whether dealing with a small conserved population or a large commercial population, monitoring the inbreeding rate and designing mating plans that minimize the inbreeding rate and maximize the effective population size is important. Free, open-source, and efficient software may greatly contribute to conservation and breeding programs and help students and researchers. Efficient methods exist for calculating inbreeding coefficients. Therefore, an efficient way of calculating the numerator relationship coefficients is via the inbreeding coefficients. i.e., the relationship coefficient between parents is twice the inbreeding coefficient of their progeny. A dummy progeny is introduced where no progeny exists for a pair of individuals. Calculating inbreeding coefficients is very fast, and finding whether a pair of individuals has a progeny and picking one from multiple progenies is computationally more demanding. Therefore, the R package introduces a dummy progeny for any pair of individuals whose relationship coefficient is of interest, whether they have a progeny or not.</p><p><strong>Results: </strong>Runtime and peak memory usage were benchmarked for calculating relationship coefficients between two sets of 250 and 800 animals (200,000 dummy progenies) from a pedigree of 2,721,252 animals. The program performed efficiently (200,000 relationship coefficients, which involved calculating 2,721,252 + 200,000 inbreeding coefficients) within 3:45 (mm:ss). Providing the inbreeding coefficients (for real animals), the runtime was reduced to 1:08. Furthermore, providing the diagonal elements of D in <math><mrow><mi>A</mi> <mo>=</mo> <msup><mi>TDT</mi> <mo>'</mo></msup> </mrow> </math> (d), the runtime was reduced to 54s. All the analyses were performed on a machine with a total memory size of 1 GB.</p><p><strong>Conclusions: </strong>The R package FnR is free and open-source software with implications in conservation and breeding programs. It proved to be time and memory efficient for large populations and many dummy progenies. Calculation of inbreeding coefficients can be resumed for new animals in the pedigree. Thus, saving the latest inbreeding coefficient estimates is recommended. Calculation of d coefficients (from scratch) was very fast, and there was limited value in storing those for future use.</p>","PeriodicalId":93910,"journal":{"name":"BMC ecology and evolution","volume":"24 1","pages":"99"},"PeriodicalIF":2.3000,"publicationDate":"2024-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11256478/pdf/","citationCount":"0","resultStr":"{\"title\":\"FnR: R package for computing inbreeding and numerator relationship coefficients.\",\"authors\":\"Mohammad Ali Nilforooshan\",\"doi\":\"10.1186/s12862-024-02285-4\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Background: </strong>Inbreeding and relationship coefficients are essential for conservation and breeding programs. Whether dealing with a small conserved population or a large commercial population, monitoring the inbreeding rate and designing mating plans that minimize the inbreeding rate and maximize the effective population size is important. Free, open-source, and efficient software may greatly contribute to conservation and breeding programs and help students and researchers. Efficient methods exist for calculating inbreeding coefficients. Therefore, an efficient way of calculating the numerator relationship coefficients is via the inbreeding coefficients. i.e., the relationship coefficient between parents is twice the inbreeding coefficient of their progeny. A dummy progeny is introduced where no progeny exists for a pair of individuals. Calculating inbreeding coefficients is very fast, and finding whether a pair of individuals has a progeny and picking one from multiple progenies is computationally more demanding. Therefore, the R package introduces a dummy progeny for any pair of individuals whose relationship coefficient is of interest, whether they have a progeny or not.</p><p><strong>Results: </strong>Runtime and peak memory usage were benchmarked for calculating relationship coefficients between two sets of 250 and 800 animals (200,000 dummy progenies) from a pedigree of 2,721,252 animals. The program performed efficiently (200,000 relationship coefficients, which involved calculating 2,721,252 + 200,000 inbreeding coefficients) within 3:45 (mm:ss). Providing the inbreeding coefficients (for real animals), the runtime was reduced to 1:08. Furthermore, providing the diagonal elements of D in <math><mrow><mi>A</mi> <mo>=</mo> <msup><mi>TDT</mi> <mo>'</mo></msup> </mrow> </math> (d), the runtime was reduced to 54s. All the analyses were performed on a machine with a total memory size of 1 GB.</p><p><strong>Conclusions: </strong>The R package FnR is free and open-source software with implications in conservation and breeding programs. It proved to be time and memory efficient for large populations and many dummy progenies. Calculation of inbreeding coefficients can be resumed for new animals in the pedigree. Thus, saving the latest inbreeding coefficient estimates is recommended. Calculation of d coefficients (from scratch) was very fast, and there was limited value in storing those for future use.</p>\",\"PeriodicalId\":93910,\"journal\":{\"name\":\"BMC ecology and evolution\",\"volume\":\"24 1\",\"pages\":\"99\"},\"PeriodicalIF\":2.3000,\"publicationDate\":\"2024-07-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11256478/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"BMC ecology and evolution\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1186/s12862-024-02285-4\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"ECOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"BMC ecology and evolution","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1186/s12862-024-02285-4","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ECOLOGY","Score":null,"Total":0}
引用次数: 0

摘要

背景:近交系数和关系系数对保护和育种计划至关重要。无论是处理小规模的保护种群还是大规模的商业种群,监测近亲繁殖率并设计交配计划以最大限度地降低近亲繁殖率并最大限度地提高有效种群数量都是非常重要的。免费、开源、高效的软件可极大地促进保护和育种计划,并为学生和研究人员提供帮助。计算近交系数的有效方法是存在的。因此,计算分子关系系数的有效方法是通过近交系数,即亲本之间的关系系数是其后代近交系数的两倍。如果一对个体没有后代,则引入一个虚拟后代。计算近交系数的速度非常快,而查找一对个体是否有后代并从多个后代中挑选一个后代的计算要求更高。因此,R 软件包为任何一对关系系数值得关注的个体引入了一个虚拟后代,无论他们是否有后代:对运行时间和内存峰值使用情况进行了基准测试,以计算由 2,721,252 头动物组成的血统中 250 头和 800 头动物(200,000 个虚拟后代)两组动物之间的关系系数。程序在 3:45 (mm:ss) 内高效运行(200,000 个关系系数,包括计算 2,721,252 + 200,000 个近交系数)。提供近亲繁殖系数(真实动物)后,运行时间缩短为 1:08。此外,如果提供 A = TDT ' (d) 中 D 的对角元素,运行时间将缩短至 54 秒。所有分析都是在总内存为 1 GB 的机器上进行的:R 软件包 FnR 是免费的开源软件,对保护和育种计划具有重要意义。事实证明,对于大型种群和许多假后代来说,它既节省时间,又节省内存。对于血统中的新动物,可以重新计算近交系数。因此,建议保存最新的近交系数估计值。计算 d 系数(从头开始)的速度非常快,而存储这些系数以供将来使用的价值有限。
本文章由计算机程序翻译,如有差异,请以英文原文为准。

摘要图片

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
FnR: R package for computing inbreeding and numerator relationship coefficients.

Background: Inbreeding and relationship coefficients are essential for conservation and breeding programs. Whether dealing with a small conserved population or a large commercial population, monitoring the inbreeding rate and designing mating plans that minimize the inbreeding rate and maximize the effective population size is important. Free, open-source, and efficient software may greatly contribute to conservation and breeding programs and help students and researchers. Efficient methods exist for calculating inbreeding coefficients. Therefore, an efficient way of calculating the numerator relationship coefficients is via the inbreeding coefficients. i.e., the relationship coefficient between parents is twice the inbreeding coefficient of their progeny. A dummy progeny is introduced where no progeny exists for a pair of individuals. Calculating inbreeding coefficients is very fast, and finding whether a pair of individuals has a progeny and picking one from multiple progenies is computationally more demanding. Therefore, the R package introduces a dummy progeny for any pair of individuals whose relationship coefficient is of interest, whether they have a progeny or not.

Results: Runtime and peak memory usage were benchmarked for calculating relationship coefficients between two sets of 250 and 800 animals (200,000 dummy progenies) from a pedigree of 2,721,252 animals. The program performed efficiently (200,000 relationship coefficients, which involved calculating 2,721,252 + 200,000 inbreeding coefficients) within 3:45 (mm:ss). Providing the inbreeding coefficients (for real animals), the runtime was reduced to 1:08. Furthermore, providing the diagonal elements of D in A = TDT ' (d), the runtime was reduced to 54s. All the analyses were performed on a machine with a total memory size of 1 GB.

Conclusions: The R package FnR is free and open-source software with implications in conservation and breeding programs. It proved to be time and memory efficient for large populations and many dummy progenies. Calculation of inbreeding coefficients can be resumed for new animals in the pedigree. Thus, saving the latest inbreeding coefficient estimates is recommended. Calculation of d coefficients (from scratch) was very fast, and there was limited value in storing those for future use.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Greater risk-taking by non-native than native shrimp: an advantage in a human-disturbed environment? Survival cost sharing among altruistic full siblings in Mendelian population. Next-generation phylogeography reveals unanticipated population history and climate and human impacts on the endangered floodplain bitterling (Acheilognathus longipinnis). Repeated evolution on oceanic islands: comparative genomics reveals species-specific processes in birds. Unravelling spatial scale effects on elevational diversity gradients: insights from montane small mammals in Kenya.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1