The power of next-generation sequencing and machine learning for causal gene finding and prediction of phenotypes.

A. Sowa, L. Dußling, J. Hagmann, Sebastian J. Schultheiss
{"title":"The power of next-generation sequencing and machine learning for causal gene finding and prediction of phenotypes.","authors":"A. Sowa, L. Dußling, J. Hagmann, Sebastian J. Schultheiss","doi":"10.1079/9781789249095.0041","DOIUrl":null,"url":null,"abstract":"Abstract\n The wide application of next-generation sequencing (NGS) has facilitated and accelerated causal gene finding and breeding in the field of plant sciences. A wide variety of techniques and computational strategies is available that needs to be appropriately tailored to the species, genetic architecture of the trait of interest, breeding system and available resources. Utilizing these NGS methods, the typical computational steps of marker discovery, genetic mapping and identification of causal mutations can be achieved in a single step in a cost- and time-efficient manner. Rather than focusing on a few high-impact genetic variants that explain phenotypes, increased computational power allows modelling of phenotypes based on genome-wide molecular markers, known as genomic selection (GS). Solely based on this genotype information, modern GS approaches can accurately predict breeding values for a given trait (the average effects of alleles over all loci that are anticipated to be transferred from the parent to the progeny) based on a large training population of genotyped and phenotyped individuals (Crossa et al., 2017). Once trained, the model offers great reductions in breeding speed and costs. We advocate for improving conventional GS methods by applying advanced techniques based on machine learning (ML) and outline how this approach can also be used for causal gene finding. Subsequent to genetic causes of agronomically important traits, epigenetic mechanisms such as DNA methylation play a crucial role in shaping phenotypes and can become interesting targets in breeding pipelines. We highlight an ML approach shown to detect functional methylation changes sensitively from NGS data. We give an overview about commonly applied strategies and provide practical considerations in choosing and performing NGS-based gene finding and NGS-assisted breeding.","PeriodicalId":287197,"journal":{"name":"Mutation breeding, genetic diversity and crop adaptation to climate change","volume":"2 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Mutation breeding, genetic diversity and crop adaptation to climate change","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1079/9781789249095.0041","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Abstract The wide application of next-generation sequencing (NGS) has facilitated and accelerated causal gene finding and breeding in the field of plant sciences. A wide variety of techniques and computational strategies is available that needs to be appropriately tailored to the species, genetic architecture of the trait of interest, breeding system and available resources. Utilizing these NGS methods, the typical computational steps of marker discovery, genetic mapping and identification of causal mutations can be achieved in a single step in a cost- and time-efficient manner. Rather than focusing on a few high-impact genetic variants that explain phenotypes, increased computational power allows modelling of phenotypes based on genome-wide molecular markers, known as genomic selection (GS). Solely based on this genotype information, modern GS approaches can accurately predict breeding values for a given trait (the average effects of alleles over all loci that are anticipated to be transferred from the parent to the progeny) based on a large training population of genotyped and phenotyped individuals (Crossa et al., 2017). Once trained, the model offers great reductions in breeding speed and costs. We advocate for improving conventional GS methods by applying advanced techniques based on machine learning (ML) and outline how this approach can also be used for causal gene finding. Subsequent to genetic causes of agronomically important traits, epigenetic mechanisms such as DNA methylation play a crucial role in shaping phenotypes and can become interesting targets in breeding pipelines. We highlight an ML approach shown to detect functional methylation changes sensitively from NGS data. We give an overview about commonly applied strategies and provide practical considerations in choosing and performing NGS-based gene finding and NGS-assisted breeding.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
下一代测序和机器学习在因果基因发现和表型预测中的作用。
新一代测序技术(NGS)的广泛应用促进和加速了植物科学领域因果基因的发现和育种。各种各样的技术和计算策略都是可用的,需要根据物种、感兴趣的性状的遗传结构、育种系统和可用资源进行适当的调整。利用这些NGS方法,标记发现、遗传作图和因果突变鉴定的典型计算步骤可以在一个步骤中以成本和时间效率的方式实现。计算能力的提高使得基于全基因组分子标记(即基因组选择(GS))的表型建模成为可能,而不是专注于解释表型的几个高影响遗传变异。仅基于该基因型信息,现代GS方法可以基于大量基因型和表型个体的训练群体,准确预测给定性状的育种值(等位基因在所有位点上的平均效应,预计将从亲本转移到后代)(Crossa等人,2017)。经过训练后,该模型大大降低了育种速度和成本。我们提倡通过应用基于机器学习(ML)的先进技术来改进传统的GS方法,并概述了如何将这种方法也用于因果基因发现。继农学上重要性状的遗传原因之后,表观遗传机制如DNA甲基化在塑造表型中起着至关重要的作用,并且可以成为育种管道中的有趣目标。我们强调了一种ML方法,可以从NGS数据中敏感地检测功能性甲基化变化。我们概述了常用的策略,并提供了选择和执行基于ngs的基因发现和ngs辅助育种的实际考虑。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Production of haploid embryos and plants in Iranian melon (Cucumis melo L.) through irradiated pollen-induced parthenogenesis. Progress of mutant resource development and tilling on starch biosynthesis in wheat. Scandinavian mutation research during the past 90 years - a historical review. Impact of mutant varieties in Malaysia: challenges and future perspectives for mutation breeding. Induced mutagenesis for improvement of bean (Phaseolus vulgaris L.) production in Bulgaria.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1