Pub Date : 2024-09-01Epub Date: 2024-08-01DOI: 10.1002/tpg2.20488
Paul Adunola, Luis Felipe V Ferrão, Juliana Benevenuto, Camila F Azevedo, Patricio R Munoz
Genomic prediction is a modern approach that uses genome-wide markers to predict the genetic merit of unphenotyped individuals. With the potential to reduce the breeding cycles and increase the selection accuracy, this tool has been designed to rank genotypes and maximize genetic gains. Despite this importance, its practical implementation in breeding programs requires critical allocation of resources for its application in a predictive framework. In this study, we integrated genetic and data-driven methods to allocate resources for phenotyping and genotyping tailored to genomic prediction. To this end, we used a historical blueberry (Vaccinium corymbosun L.) breeding dataset containing more than 3000 individuals, genotyped using probe-based target sequencing and phenotyped for three fruit quality traits over several years. Our contribution in this study is threefold: (i) for the genotyping resource allocation, the use of genetic data-driven methods to select an optimal set of markers slightly improved prediction results for all the traits; (ii) for the long-term implication, we carried out a simulation study and emphasized that data-driven method results in a slight improvement in genetic gain over 30 cycles than random marker sampling; and (iii) for the phenotyping resource allocation, we compared different optimization algorithms to select training population, showing that it can be leveraged to increase predictive performances. Altogether, we provided a data-oriented decision-making approach for breeders by demonstrating that critical breeding decisions associated with resource allocation for genomic prediction can be tackled through a combination of statistics and genetic methods.
{"title":"Genomic selection optimization in blueberry: Data-driven methods for marker and training population design.","authors":"Paul Adunola, Luis Felipe V Ferrão, Juliana Benevenuto, Camila F Azevedo, Patricio R Munoz","doi":"10.1002/tpg2.20488","DOIUrl":"10.1002/tpg2.20488","url":null,"abstract":"<p><p>Genomic prediction is a modern approach that uses genome-wide markers to predict the genetic merit of unphenotyped individuals. With the potential to reduce the breeding cycles and increase the selection accuracy, this tool has been designed to rank genotypes and maximize genetic gains. Despite this importance, its practical implementation in breeding programs requires critical allocation of resources for its application in a predictive framework. In this study, we integrated genetic and data-driven methods to allocate resources for phenotyping and genotyping tailored to genomic prediction. To this end, we used a historical blueberry (Vaccinium corymbosun L.) breeding dataset containing more than 3000 individuals, genotyped using probe-based target sequencing and phenotyped for three fruit quality traits over several years. Our contribution in this study is threefold: (i) for the genotyping resource allocation, the use of genetic data-driven methods to select an optimal set of markers slightly improved prediction results for all the traits; (ii) for the long-term implication, we carried out a simulation study and emphasized that data-driven method results in a slight improvement in genetic gain over 30 cycles than random marker sampling; and (iii) for the phenotyping resource allocation, we compared different optimization algorithms to select training population, showing that it can be leveraged to increase predictive performances. Altogether, we provided a data-oriented decision-making approach for breeders by demonstrating that critical breeding decisions associated with resource allocation for genomic prediction can be tackled through a combination of statistics and genetic methods.</p>","PeriodicalId":49002,"journal":{"name":"Plant Genome","volume":" ","pages":"e20488"},"PeriodicalIF":3.9,"publicationDate":"2024-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141861364","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
In this study, the contents of four carotenoids in 244 maize inbred lines were detected and about three million single nucleotide polymorphisms (SNPs) for genome-wide association study to preliminarily analyze the genetic mechanism of maize kernel carotenoids. We identified 826 quantitative trait loci (QTLs) were significantly associated with carotenoids contents, and two key candidate genes Zm00001d029526 (CYP18) and Zm00001d023336 (wrky91) were obtained. In addition, we found a germplasm IL78 with higher carotenoids. The results of this study can provide a theoretical basis for screening genes that guide kernel carotenoids selection breeding.
{"title":"Genome-wide association study of carotenoids in maize kernel.","authors":"Weiwei Chen, Xiangbo Zhang, Chuanli Lu, Hailong Chang, Zaid Chachar, Lina Fan, Yuxing An, Xuhui Li, Yongwen Qi","doi":"10.1002/tpg2.20495","DOIUrl":"10.1002/tpg2.20495","url":null,"abstract":"<p><p>In this study, the contents of four carotenoids in 244 maize inbred lines were detected and about three million single nucleotide polymorphisms (SNPs) for genome-wide association study to preliminarily analyze the genetic mechanism of maize kernel carotenoids. We identified 826 quantitative trait loci (QTLs) were significantly associated with carotenoids contents, and two key candidate genes Zm00001d029526 (CYP18) and Zm00001d023336 (wrky91) were obtained. In addition, we found a germplasm IL78 with higher carotenoids. The results of this study can provide a theoretical basis for screening genes that guide kernel carotenoids selection breeding.</p>","PeriodicalId":49002,"journal":{"name":"Plant Genome","volume":" ","pages":"e20495"},"PeriodicalIF":3.9,"publicationDate":"2024-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141917867","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-09-01Epub Date: 2024-08-31DOI: 10.1002/tpg2.20502
Yinqiang Zi, Mengjie Zhang, Xiuyao Yang, Ke Zhao, Tuo Yin, Ke Wen, Xulin Li, Xiaozhen Liu, Hanyao Zhang
Salt stress is one of the primary environmental stresses limiting plant growth and production and adversely affecting the growth, development, yield, and fruit quality of Citrus sinensis. bHLH (basic helix-loop-helix) genes are involved in many bioregulatory processes in plants, including growth and development, phytohormone signaling, defense responses, and biosynthesis of specific metabolites. In this study, by bioinformatics methods, 120 CsbHLHgenes were identified, and phylogenetic analysis classified them into 18 subfamilies that were unevenly distributed on nine chromosomes. The cis-acting elements of the CsbHLH genes were mainly hormone-related cis-acting elements. Seventeen CsbHLH genes exhibited significant differences in expression under salt stress. Six CsbHLH genes with significant differences in expression were randomly selected for quantitative real-time polymerase chain reaction (qRT-PCR) validation. The qRT-PCR results showed a strong correlation with the transcriptome data. Phytohormones such as jasmonic acid (JA) are essential for biotic and abiotic stress responses in plants, and CsbHLH55 and CsbHLH87 are considered candidate target genes for sweet orange MYC2 transcription factors involved in the JA signaling pathway. These genes are the main downstream effectors in the JA signaling pathway and can be activated to participate in the JA signaling pathway. Activation of the JA signaling pathway inhibits the production of reactive oxygen species and improves the salt tolerance of sweet orange plants. The CsbHLH55 and CsbHLH87 genes could be candidate genes for breeding new transgenic salt-resistant varieties of sweet orange.
盐胁迫是限制植物生长和产量的主要环境胁迫之一,对柑橘的生长、发育、产量和果实品质都有不利影响。bHLH(基本螺旋-环-螺旋)基因参与植物的许多生物调控过程,包括生长和发育、植物激素信号转导、防御反应和特定代谢产物的生物合成。本研究通过生物信息学方法鉴定了 120 个 CsbHLHgenes,并通过系统进化分析将其分为 18 个亚科,这些亚科不均匀地分布在 9 条染色体上。CsbHLH基因的顺式作用元件主要是与激素相关的顺式作用元件。17个CsbHLH基因在盐胁迫下的表达有显著差异。随机选取了6个表达差异显著的CsbHLH基因进行实时定量聚合酶链反应(qRT-PCR)验证。qRT-PCR 结果与转录组数据有很强的相关性。茉莉酸(JA)等植物激素对植物的生物和非生物胁迫反应至关重要,而 CsbHLH55 和 CsbHLH87 被认为是参与 JA 信号通路的甜橙 MYC2 转录因子的候选靶基因。这些基因是 JA 信号通路的主要下游效应因子,可被激活参与 JA 信号通路。激活 JA 信号通路可抑制活性氧的产生,提高甜橙植株的耐盐性。CsbHLH55和CsbHLH87基因可作为培育甜橙转基因耐盐新品种的候选基因。
{"title":"Identification of the sweet orange (Citrus sinensis) bHLH gene family and the role of CsbHLH55 and CsbHLH87 in regulating salt stress.","authors":"Yinqiang Zi, Mengjie Zhang, Xiuyao Yang, Ke Zhao, Tuo Yin, Ke Wen, Xulin Li, Xiaozhen Liu, Hanyao Zhang","doi":"10.1002/tpg2.20502","DOIUrl":"10.1002/tpg2.20502","url":null,"abstract":"<p><p>Salt stress is one of the primary environmental stresses limiting plant growth and production and adversely affecting the growth, development, yield, and fruit quality of Citrus sinensis. bHLH (basic helix-loop-helix) genes are involved in many bioregulatory processes in plants, including growth and development, phytohormone signaling, defense responses, and biosynthesis of specific metabolites. In this study, by bioinformatics methods, 120 CsbHLHgenes were identified, and phylogenetic analysis classified them into 18 subfamilies that were unevenly distributed on nine chromosomes. The cis-acting elements of the CsbHLH genes were mainly hormone-related cis-acting elements. Seventeen CsbHLH genes exhibited significant differences in expression under salt stress. Six CsbHLH genes with significant differences in expression were randomly selected for quantitative real-time polymerase chain reaction (qRT-PCR) validation. The qRT-PCR results showed a strong correlation with the transcriptome data. Phytohormones such as jasmonic acid (JA) are essential for biotic and abiotic stress responses in plants, and CsbHLH55 and CsbHLH87 are considered candidate target genes for sweet orange MYC2 transcription factors involved in the JA signaling pathway. These genes are the main downstream effectors in the JA signaling pathway and can be activated to participate in the JA signaling pathway. Activation of the JA signaling pathway inhibits the production of reactive oxygen species and improves the salt tolerance of sweet orange plants. The CsbHLH55 and CsbHLH87 genes could be candidate genes for breeding new transgenic salt-resistant varieties of sweet orange.</p>","PeriodicalId":49002,"journal":{"name":"Plant Genome","volume":" ","pages":"e20502"},"PeriodicalIF":3.9,"publicationDate":"2024-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142113851","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-09-01Epub Date: 2024-06-17DOI: 10.1002/tpg2.20484
Jeffrey B Endelman, Moctar Kante, Hannele Lindqvist-Kreuze, Andrzej Kilian, Laura M Shannon, Maria V Caraza-Harter, Brieanne Vaillancourt, Kathrine Mailloux, John P Hamilton, C Robin Buell
Mid-density targeted genotyping-by-sequencing (GBS) combines trait-specific markers with thousands of genomic markers at an attractive price for linkage mapping and genomic selection. A 2.5K targeted GBS assay for potato (Solanum tuberosum L.) was developed using the DArTag technology and later expanded to 4K targets. Genomic markers were selected from the potato Infinium single nucleotide polymorphism (SNP) array to maximize genome coverage and polymorphism rates. The DArTag and SNP array platforms produced equivalent dendrograms in a test set of 298 tetraploid samples, and 83% of the common markers showed good quantitative agreement, with RMSE (root mean squared error) <0.5. DArTag is suited for genomic selection candidates in the clonal evaluation trial, coupled with imputation to a higher density platform for the training population. Using the software polyBreedR, an R package for the manipulation and analysis of polyploid marker data, the RMSE for imputation by linkage analysis was 0.15 in a small half-diallel population (N = 85), which was significantly lower than the RMSE of 0.42 with the random forest method. Regarding high-value traits, the DArTag markers for resistance to potato virus Y, golden cyst nematode, and potato wart appeared to track their targets successfully, as did multi-allelic markers for maturity and tuber shape. In summary, the potato DArTag assay is a transformative and publicly available technology for potato breeding and genetics.
{"title":"Targeted genotyping-by-sequencing of potato and data analysis with R/polyBreedR.","authors":"Jeffrey B Endelman, Moctar Kante, Hannele Lindqvist-Kreuze, Andrzej Kilian, Laura M Shannon, Maria V Caraza-Harter, Brieanne Vaillancourt, Kathrine Mailloux, John P Hamilton, C Robin Buell","doi":"10.1002/tpg2.20484","DOIUrl":"10.1002/tpg2.20484","url":null,"abstract":"<p><p>Mid-density targeted genotyping-by-sequencing (GBS) combines trait-specific markers with thousands of genomic markers at an attractive price for linkage mapping and genomic selection. A 2.5K targeted GBS assay for potato (Solanum tuberosum L.) was developed using the DArTag technology and later expanded to 4K targets. Genomic markers were selected from the potato Infinium single nucleotide polymorphism (SNP) array to maximize genome coverage and polymorphism rates. The DArTag and SNP array platforms produced equivalent dendrograms in a test set of 298 tetraploid samples, and 83% of the common markers showed good quantitative agreement, with RMSE (root mean squared error) <0.5. DArTag is suited for genomic selection candidates in the clonal evaluation trial, coupled with imputation to a higher density platform for the training population. Using the software polyBreedR, an R package for the manipulation and analysis of polyploid marker data, the RMSE for imputation by linkage analysis was 0.15 in a small half-diallel population (N = 85), which was significantly lower than the RMSE of 0.42 with the random forest method. Regarding high-value traits, the DArTag markers for resistance to potato virus Y, golden cyst nematode, and potato wart appeared to track their targets successfully, as did multi-allelic markers for maturity and tuber shape. In summary, the potato DArTag assay is a transformative and publicly available technology for potato breeding and genetics.</p>","PeriodicalId":49002,"journal":{"name":"Plant Genome","volume":" ","pages":"e20484"},"PeriodicalIF":3.9,"publicationDate":"2024-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141421506","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-09-01Epub Date: 2024-08-20DOI: 10.1002/tpg2.20501
Hector Oberti, Juan Gutierrez-Gonzalez, Clara Pritsch
Acca sellowiana [Berg] Burret, a cultivated fruit tree originating from South America, is gaining the attention of the nutraceutical and pharmaceutical industries due to their high content of flavonoids and other phenolic compounds in fruits, leaves, and flowers. Flavonoids are a diverse group of secondary metabolites with antioxidant, anti-inflammatory, and antimicrobial properties. They also play a crucial role in plant immune response. Despite their importance, the lack of research on A. sellowiana genomics and transcriptomics hinders a deeper understanding of the molecular mechanisms behind flavonoid biosynthesis and its regulation. Here, we de novo assembled and benchmarked 11 A. sellowiana transcriptomes from leaves and floral tissues at three developmental stages using high-throughput sequencing. We selected and annotated the best assembly according to commonly used metrics and databases. This reference transcriptome consisted of 221,649 nonredundant transcripts, of which 107,612 were functionally annotated. We then used this reference transcriptome to explore the expression profiling of key secondary metabolite genes. Transcripts from genes involved in the flavonoid and anthocyanin biosynthesis pathways were identified. We also identified 4068 putative transcription factors, with the most abundant families being bHLH, C2H2, NAC, MYB, and MYB-related. Transcript expression profiling revealed distinct patterns of gene expression during flower development. Particularly, we found 71 differentially expressed transcripts representing 14 enzymes of the flavonoid pathway, suggesting major changes in flavonoid accumulation across floral stages. Our findings will contribute to understanding the genetic basis of flavonoids and provide a foundation for further research and exploitation of the economic potential of this species.
{"title":"A first de novo transcriptome assembly of feijoa (Acca sellowiana [Berg] Burret) reveals key genes involved in flavonoid biosynthesis.","authors":"Hector Oberti, Juan Gutierrez-Gonzalez, Clara Pritsch","doi":"10.1002/tpg2.20501","DOIUrl":"10.1002/tpg2.20501","url":null,"abstract":"<p><p>Acca sellowiana [Berg] Burret, a cultivated fruit tree originating from South America, is gaining the attention of the nutraceutical and pharmaceutical industries due to their high content of flavonoids and other phenolic compounds in fruits, leaves, and flowers. Flavonoids are a diverse group of secondary metabolites with antioxidant, anti-inflammatory, and antimicrobial properties. They also play a crucial role in plant immune response. Despite their importance, the lack of research on A. sellowiana genomics and transcriptomics hinders a deeper understanding of the molecular mechanisms behind flavonoid biosynthesis and its regulation. Here, we de novo assembled and benchmarked 11 A. sellowiana transcriptomes from leaves and floral tissues at three developmental stages using high-throughput sequencing. We selected and annotated the best assembly according to commonly used metrics and databases. This reference transcriptome consisted of 221,649 nonredundant transcripts, of which 107,612 were functionally annotated. We then used this reference transcriptome to explore the expression profiling of key secondary metabolite genes. Transcripts from genes involved in the flavonoid and anthocyanin biosynthesis pathways were identified. We also identified 4068 putative transcription factors, with the most abundant families being bHLH, C2H2, NAC, MYB, and MYB-related. Transcript expression profiling revealed distinct patterns of gene expression during flower development. Particularly, we found 71 differentially expressed transcripts representing 14 enzymes of the flavonoid pathway, suggesting major changes in flavonoid accumulation across floral stages. Our findings will contribute to understanding the genetic basis of flavonoids and provide a foundation for further research and exploitation of the economic potential of this species.</p>","PeriodicalId":49002,"journal":{"name":"Plant Genome","volume":" ","pages":"e20501"},"PeriodicalIF":3.9,"publicationDate":"2024-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142005616","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-09-01Epub Date: 2024-08-27DOI: 10.1002/tpg2.20494
Julia Brose, John P Hamilton, Nicholas Schlecht, Dongyan Zhao, Paulina M Mejía-Ponce, Arely Cruz-Pérez, Brieanne Vaillancourt, Joshua C Wood, Patrick P Edger, Salvador Montes-Hernandez, Guillermo Orozco de Rosas, Björn Hamberger, Angélica Cibrian-Jaramillo, C Robin Buell
Salvia hispanica L. (Chia), a member of the Lamiaceae, is an economically important crop in Mesoamerica, with health benefits associated with its seed fatty acid composition. Chia varieties are distinguished based on seed color including mixed white and black (Chia pinta) and black (Chia negra). To facilitate research on Chia and expand on comparative analyses within the Lamiaceae, we generated a chromosome-scale assembly of a Chia pinta accession and performed comparative genome analyses with a previously published Chia negra genome assembly. The Chia pinta and Chia negra genome sequences were highly similar as shown by a limited number of single nucleotide polymorphisms and extensive shared orthologous gene membership. However, there is an enrichment of terpene synthases in the Chia pinta genome relative to the Chia negra genome. We sequenced and analyzed the genomes of 20 Chia accessions with differing seed color and geographic origin revealing population structure within S. hispanica and interspecific introgressions of Salvia species. As the genus Salvia is polyphyletic, its evolutionary history remains unclear. Using large-scale synteny analysis within the Lamiaceae and orthologous group membership, we resolved the phylogeny of Salvia species. This study and its collective resources further our understanding of genomic diversity in this food crop and the extent of interspecies hybridizations in Salvia.
莎草(Salvia hispanica L.,Chia)是拉米亚科植物,是中美洲一种具有重要经济价值的作物,其种子脂肪酸成分对健康有益。Chia 品种根据种子颜色进行区分,包括白黑混色(Chia pinta)和黑色(Chia negra)。为了促进对 Chia 的研究,并扩大对唇形科植物的比较分析,我们对 Chia pinta 进行了染色体组组装,并与之前发表的 Chia negra 基因组组装进行了比较分析。从数量有限的单核苷酸多态性和广泛的共享直向基因成员来看,Chia pinta 和 Chia negra 基因组序列高度相似。不过,与黑茶基因组相比,品丽珠基因组中的萜烯合成酶更为丰富。我们对 20 个具有不同种子颜色和地理起源的 Chia 入选品种的基因组进行了测序和分析,揭示了 S. hispanica 的种群结构以及丹参物种的种间引种。由于丹参属是多态种,其进化历史仍不清楚。我们利用唇形科内的大规模同源分析和直向同源群成员资格,解决了丹参属物种的系统发育问题。这项研究及其集体资源进一步加深了我们对这种食用作物基因组多样性以及丹参种间杂交程度的了解。
{"title":"Chromosome-scale Salvia hispanica L. (Chia) genome assembly reveals rampant Salvia interspecies introgression.","authors":"Julia Brose, John P Hamilton, Nicholas Schlecht, Dongyan Zhao, Paulina M Mejía-Ponce, Arely Cruz-Pérez, Brieanne Vaillancourt, Joshua C Wood, Patrick P Edger, Salvador Montes-Hernandez, Guillermo Orozco de Rosas, Björn Hamberger, Angélica Cibrian-Jaramillo, C Robin Buell","doi":"10.1002/tpg2.20494","DOIUrl":"10.1002/tpg2.20494","url":null,"abstract":"<p><p>Salvia hispanica L. (Chia), a member of the Lamiaceae, is an economically important crop in Mesoamerica, with health benefits associated with its seed fatty acid composition. Chia varieties are distinguished based on seed color including mixed white and black (Chia pinta) and black (Chia negra). To facilitate research on Chia and expand on comparative analyses within the Lamiaceae, we generated a chromosome-scale assembly of a Chia pinta accession and performed comparative genome analyses with a previously published Chia negra genome assembly. The Chia pinta and Chia negra genome sequences were highly similar as shown by a limited number of single nucleotide polymorphisms and extensive shared orthologous gene membership. However, there is an enrichment of terpene synthases in the Chia pinta genome relative to the Chia negra genome. We sequenced and analyzed the genomes of 20 Chia accessions with differing seed color and geographic origin revealing population structure within S. hispanica and interspecific introgressions of Salvia species. As the genus Salvia is polyphyletic, its evolutionary history remains unclear. Using large-scale synteny analysis within the Lamiaceae and orthologous group membership, we resolved the phylogeny of Salvia species. This study and its collective resources further our understanding of genomic diversity in this food crop and the extent of interspecies hybridizations in Salvia.</p>","PeriodicalId":49002,"journal":{"name":"Plant Genome","volume":" ","pages":"e20494"},"PeriodicalIF":3.9,"publicationDate":"2024-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142082333","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Sugarcane (Saccharum spp.) plays a crucial role in global sugar production; however, the efficiency of breeding programs has been hindered by its heterozygous polyploid genomes. Considering non-additive genetic effects is essential in genome prediction (GP) models of crops with highly heterozygous polyploid genomes. This study incorporates non-additive genetic effects and pedigree information using machine learning methods to track sugarcane breeding lines and enhance the prediction by assessing the degree of association between genotypes. This study measured the stalk biomass and sugar content of 297 clones from 87 families within a breeding population used in the Japanese sugarcane breeding program. Subsequently, we conducted analyses based on the marker genotypes of 33,149 single-nucleotide polymorphisms. To validate the accuracy of GP in the population, we first predicted the prediction accuracy of the best linear unbiased prediction (BLUP) based on a genomic relationship matrix. Prediction accuracy was assessed using two different cross-validation methods: repeated 10-fold cross-validation and leave-one-family-out cross-validation. The accuracy of GP of the first and second methods ranged from 0.36 to 0.74 and 0.15 to 0.63, respectively. Next, we compared the prediction accuracy of BLUP and two machine learning methods: random forests and simulation annealing ensemble (SAE), a newly developed machine learning method that explicitly models the interaction between variables. Both pedigree and genomic information were utilized as input in these methods. Through repeated 10-fold cross-validation, we found that the accuracy of the machine learning methods consistently surpassed that of BLUP in most cases. In leave-one-family-out cross-validation, SAE demonstrated the highest accuracy among the methods. These results underscore the effectiveness of GP in Japanese sugarcane breeding and highlight the significant potential of machine learning methods.
甘蔗(Saccharum spp.)在全球蔗糖生产中发挥着至关重要的作用;然而,其杂合多倍体基因组阻碍了育种计划的效率。考虑非加性遗传效应对于具有高度杂合多倍体基因组的作物基因组预测(GP)模型至关重要。本研究利用机器学习方法将非加性遗传效应和血统信息纳入甘蔗育种系的跟踪,并通过评估基因型之间的关联程度来加强预测。本研究测量了日本甘蔗育种计划中一个育种群体中 87 个家系的 297 个克隆的茎秆生物量和含糖量。随后,我们根据 33,149 个单核苷酸多态性的标记基因型进行了分析。为了验证群体中 GP 的准确性,我们首先根据基因组关系矩阵预测了最佳线性无偏预测(BLUP)的预测准确性。预测准确性的评估采用了两种不同的交叉验证方法:重复 10 倍交叉验证和排除一族交叉验证。第一种和第二种方法的 GP 预测准确率分别为 0.36 至 0.74 和 0.15 至 0.63。接下来,我们比较了 BLUP 和两种机器学习方法的预测准确率:随机森林和模拟退火集合(SAE),后者是一种新开发的机器学习方法,可明确模拟变量之间的相互作用。血统和基因组信息都被用作这些方法的输入。通过反复的 10 倍交叉验证,我们发现机器学习方法的准确性在大多数情况下都超过了 BLUP。在一族淘汰交叉验证中,SAE 的准确率是所有方法中最高的。这些结果凸显了 GP 在日本甘蔗育种中的有效性,并彰显了机器学习方法的巨大潜力。
{"title":"Machine learning for genomic and pedigree prediction in sugarcane.","authors":"Minoru Inamori, Tatsuro Kimura, Masaaki Mori, Yusuke Tarumoto, Taiichiro Hattori, Michiko Hayano, Makoto Umeda, Hiroyoshi Iwata","doi":"10.1002/tpg2.20486","DOIUrl":"10.1002/tpg2.20486","url":null,"abstract":"<p><p>Sugarcane (Saccharum spp.) plays a crucial role in global sugar production; however, the efficiency of breeding programs has been hindered by its heterozygous polyploid genomes. Considering non-additive genetic effects is essential in genome prediction (GP) models of crops with highly heterozygous polyploid genomes. This study incorporates non-additive genetic effects and pedigree information using machine learning methods to track sugarcane breeding lines and enhance the prediction by assessing the degree of association between genotypes. This study measured the stalk biomass and sugar content of 297 clones from 87 families within a breeding population used in the Japanese sugarcane breeding program. Subsequently, we conducted analyses based on the marker genotypes of 33,149 single-nucleotide polymorphisms. To validate the accuracy of GP in the population, we first predicted the prediction accuracy of the best linear unbiased prediction (BLUP) based on a genomic relationship matrix. Prediction accuracy was assessed using two different cross-validation methods: repeated 10-fold cross-validation and leave-one-family-out cross-validation. The accuracy of GP of the first and second methods ranged from 0.36 to 0.74 and 0.15 to 0.63, respectively. Next, we compared the prediction accuracy of BLUP and two machine learning methods: random forests and simulation annealing ensemble (SAE), a newly developed machine learning method that explicitly models the interaction between variables. Both pedigree and genomic information were utilized as input in these methods. Through repeated 10-fold cross-validation, we found that the accuracy of the machine learning methods consistently surpassed that of BLUP in most cases. In leave-one-family-out cross-validation, SAE demonstrated the highest accuracy among the methods. These results underscore the effectiveness of GP in Japanese sugarcane breeding and highlight the significant potential of machine learning methods.</p>","PeriodicalId":49002,"journal":{"name":"Plant Genome","volume":" ","pages":"e20486"},"PeriodicalIF":3.9,"publicationDate":"2024-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141460146","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-09-01Epub Date: 2024-06-09DOI: 10.1002/tpg2.20470
Subash Thapa, Harsimardeep S Gill, Jyotirmoy Halder, Anshul Rana, Shaukat Ali, Maitiniyazi Maimaitijiang, Upinder Gill, Amy Bernardo, Paul St Amand, Guihua Bai, Sunish K Sehgal
Fusarium head blight (FHB) remains one of the most destructive diseases of wheat (Triticum aestivum L.), causing considerable losses in yield and end-use quality. Phenotyping of FHB resistance traits, Fusarium-damaged kernels (FDK), and deoxynivalenol (DON), is either prone to human biases or resource expensive, hindering the progress in breeding for FHB-resistant cultivars. Though genomic selection (GS) can be an effective way to select these traits, inaccurate phenotyping remains a hurdle in exploiting this approach. Here, we used an artificial intelligence (AI)-based precise FDK estimation that exhibits high heritability and correlation with DON. Further, GS using AI-based FDK (FDK_QVIS/FDK_QNIR) showed a two-fold increase in predictive ability (PA) compared to GS for traditionally estimated FDK (FDK_V). Next, the AI-based FDK was evaluated along with other traits in multi-trait (MT) GS models to predict DON. The inclusion of FDK_QNIR and FDK_QVIS with days to heading as covariates improved the PA for DON by 58% over the baseline single-trait GS model. We next used hyperspectral imaging of FHB-infected wheat kernels as a novel avenue to improve the MT GS for DON. The PA for DON using selected wavebands derived from hyperspectral imaging in MT GS models surpassed the single-trait GS model by around 40%. Finally, we evaluated phenomic prediction for DON by integrating hyperspectral imaging with deep learning to directly predict DON in FHB-infected wheat kernels and observed an accuracy (R2 = 0.45) comparable to best-performing MT GS models. This study demonstrates the potential application of AI and vision-based platforms to improve PA for FHB-related traits using genomic and phenomic selection.
镰孢菌头孢疫病(FHB)仍然是小麦(Triticum aestivum L.)中破坏性最强的病害之一,对产量和最终使用质量造成了巨大损失。对 FHB 抗性性状、镰刀菌损伤籽粒(FDK)和脱氧雪腐镰刀菌烯醇(DON)的表型分析,要么容易出现人为偏差,要么资源昂贵,阻碍了抗 FHB 栽培品种的育种进展。虽然基因组选择(GS)是选择这些性状的有效方法,但表型不准确仍是利用这种方法的障碍。在这里,我们使用了一种基于人工智能(AI)的精确 FDK 估算方法,该方法表现出较高的遗传率以及与 DON 的相关性。此外,使用基于人工智能的 FDK(FDK_QVIS/FDK_QNIR)的 GS 与使用传统估计的 FDK(FDK_V)的 GS 相比,预测能力(PA)提高了两倍。接下来,对基于人工智能的 FDK 和多性状(MT)GS 模型中的其他性状进行了评估,以预测 DON。将 FDK_QNIR 和 FDK_QVIS 以及茎秆生长天数作为协变量,与基线单一性状 GS 模型相比,DON 的 PA 提高了 58%。接下来,我们利用受 FHB 感染的小麦籽粒的高光谱成像技术作为改进 DON 的 MT GS 的新途径。在 MT GS 模型中使用高光谱成像得出的选定波段对 DON 的 PA 值超过了单一性状 GS 模型约 40%。最后,我们评估了通过将高光谱成像与深度学习相结合来直接预测受 FHB 感染的小麦籽粒中 DON 的表观预测结果,观察到其准确率(R2 = 0.45)与表现最佳的 MT GS 模型相当。这项研究展示了人工智能和基于视觉的平台在利用基因组和表型组选择改善 FHB 相关性状的 PA 方面的潜在应用。
{"title":"Integrating genomics, phenomics, and deep learning improves the predictive ability for Fusarium head blight-related traits in winter wheat.","authors":"Subash Thapa, Harsimardeep S Gill, Jyotirmoy Halder, Anshul Rana, Shaukat Ali, Maitiniyazi Maimaitijiang, Upinder Gill, Amy Bernardo, Paul St Amand, Guihua Bai, Sunish K Sehgal","doi":"10.1002/tpg2.20470","DOIUrl":"10.1002/tpg2.20470","url":null,"abstract":"<p><p>Fusarium head blight (FHB) remains one of the most destructive diseases of wheat (Triticum aestivum L.), causing considerable losses in yield and end-use quality. Phenotyping of FHB resistance traits, Fusarium-damaged kernels (FDK), and deoxynivalenol (DON), is either prone to human biases or resource expensive, hindering the progress in breeding for FHB-resistant cultivars. Though genomic selection (GS) can be an effective way to select these traits, inaccurate phenotyping remains a hurdle in exploiting this approach. Here, we used an artificial intelligence (AI)-based precise FDK estimation that exhibits high heritability and correlation with DON. Further, GS using AI-based FDK (FDK_QVIS/FDK_QNIR) showed a two-fold increase in predictive ability (PA) compared to GS for traditionally estimated FDK (FDK_V). Next, the AI-based FDK was evaluated along with other traits in multi-trait (MT) GS models to predict DON. The inclusion of FDK_QNIR and FDK_QVIS with days to heading as covariates improved the PA for DON by 58% over the baseline single-trait GS model. We next used hyperspectral imaging of FHB-infected wheat kernels as a novel avenue to improve the MT GS for DON. The PA for DON using selected wavebands derived from hyperspectral imaging in MT GS models surpassed the single-trait GS model by around 40%. Finally, we evaluated phenomic prediction for DON by integrating hyperspectral imaging with deep learning to directly predict DON in FHB-infected wheat kernels and observed an accuracy (R<sup>2</sup> = 0.45) comparable to best-performing MT GS models. This study demonstrates the potential application of AI and vision-based platforms to improve PA for FHB-related traits using genomic and phenomic selection.</p>","PeriodicalId":49002,"journal":{"name":"Plant Genome","volume":" ","pages":"e20470"},"PeriodicalIF":3.9,"publicationDate":"2024-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141297021","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-09-01Epub Date: 2024-07-12DOI: 10.1002/tpg2.20487
Gwonjin Lee, Charlotte N DiBiase, Beibei Liu, Tong Li, Austin G McCoy, Martin I Chilvers, Lianjun Sun, Dechun Wang, Feng Lin, Meixia Zhao
Phytophthora root rot, caused by oomycete pathogens in the Phytophthora genus, poses a significant threat to soybean productivity. While resistance mechanisms against Phytophthora sojae have been extensively studied in soybean, the molecular basis underlying immune responses to Phytophthora sansomeana remains unclear. In this study, we investigated transcriptomic and epigenetic responses of two resistant (Colfax and NE2701) and two susceptible (Williams 82 and Senaki) soybean lines at four time points (2, 4, 8, and 16 h post inoculation [hpi]) after P. sansomeana inoculation. Comparative transcriptomic analyses revealed a greater number of differentially expressed genes (DEGs) upon pathogen inoculation in resistant lines, particularly at 8 and 16 hpi. These DEGs were predominantly associated with defense response, ethylene, and reactive oxygen species-mediated defense pathways. Moreover, DE transposons were predominantly upregulated after inoculation, and more of them were enriched near genes in Colfax than other soybean lines. Notably, we identified a long non-coding RNA (lncRNA) within the mapped region of the resistance gene that exhibited exclusive upregulation in the resistant lines after inoculation, potentially regulating two flanking LURP-one-related genes. Furthermore, DNA methylation analysis revealed increased CHH (where H = A, T, or C) methylation levels in lncRNAs after inoculation, with delayed responses in Colfax compared to Williams 82. Overall, our results provide comprehensive insights into soybean responses to P. sansomeana, highlighting potential roles of lncRNAs and epigenetic regulation in plant defense.
{"title":"Transcriptomic and epigenetic responses shed light on soybean resistance to Phytophthora sansomeana.","authors":"Gwonjin Lee, Charlotte N DiBiase, Beibei Liu, Tong Li, Austin G McCoy, Martin I Chilvers, Lianjun Sun, Dechun Wang, Feng Lin, Meixia Zhao","doi":"10.1002/tpg2.20487","DOIUrl":"10.1002/tpg2.20487","url":null,"abstract":"<p><p>Phytophthora root rot, caused by oomycete pathogens in the Phytophthora genus, poses a significant threat to soybean productivity. While resistance mechanisms against Phytophthora sojae have been extensively studied in soybean, the molecular basis underlying immune responses to Phytophthora sansomeana remains unclear. In this study, we investigated transcriptomic and epigenetic responses of two resistant (Colfax and NE2701) and two susceptible (Williams 82 and Senaki) soybean lines at four time points (2, 4, 8, and 16 h post inoculation [hpi]) after P. sansomeana inoculation. Comparative transcriptomic analyses revealed a greater number of differentially expressed genes (DEGs) upon pathogen inoculation in resistant lines, particularly at 8 and 16 hpi. These DEGs were predominantly associated with defense response, ethylene, and reactive oxygen species-mediated defense pathways. Moreover, DE transposons were predominantly upregulated after inoculation, and more of them were enriched near genes in Colfax than other soybean lines. Notably, we identified a long non-coding RNA (lncRNA) within the mapped region of the resistance gene that exhibited exclusive upregulation in the resistant lines after inoculation, potentially regulating two flanking LURP-one-related genes. Furthermore, DNA methylation analysis revealed increased CHH (where H = A, T, or C) methylation levels in lncRNAs after inoculation, with delayed responses in Colfax compared to Williams 82. Overall, our results provide comprehensive insights into soybean responses to P. sansomeana, highlighting potential roles of lncRNAs and epigenetic regulation in plant defense.</p>","PeriodicalId":49002,"journal":{"name":"Plant Genome","volume":" ","pages":"e20487"},"PeriodicalIF":3.9,"publicationDate":"2024-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141602026","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-09-01Epub Date: 2024-07-29DOI: 10.1002/tpg2.20499
Carl VanGessel, Brian Rice, Terry J Felderhoff, Jean Rigaud Charles, Gael Pressoir, Vamsi Nalam, Geoffrey P Morris
{"title":"Erratum to: Globally deployed sorghum aphid resistance gene RMES1 is vulnerable to biotype shifts but is bolstered by RMES2.","authors":"Carl VanGessel, Brian Rice, Terry J Felderhoff, Jean Rigaud Charles, Gael Pressoir, Vamsi Nalam, Geoffrey P Morris","doi":"10.1002/tpg2.20499","DOIUrl":"10.1002/tpg2.20499","url":null,"abstract":"","PeriodicalId":49002,"journal":{"name":"Plant Genome","volume":" ","pages":"e20499"},"PeriodicalIF":3.9,"publicationDate":"2024-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141793853","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}