Dian Chao, Hao Wang, Fengqiang Wan, Shen Yan, Wei Fang, Yang Yang
{"title":"MtCro: multi-task deep learning framework improves multi-trait genomic prediction of crops.","authors":"Dian Chao, Hao Wang, Fengqiang Wan, Shen Yan, Wei Fang, Yang Yang","doi":"10.1186/s13007-024-01321-0","DOIUrl":null,"url":null,"abstract":"<p><p>Genomic Selection (GS) predicts traits using genome-wide markers, speeding up genetic progress and enhancing breeding efficiency. Recent emphasis has been placed on deep learning models to enhance prediction accuracy. However, current deep learning models focus on learning specific phenotypes for the given task, overlooking the inter-correlations among different phenotypes. In response, we introduce MtCro, a multi-task learning approach that simultaneously captures diverse plant phenotypes within a shared parameter space. Extensive experiments reveal that MtCro outperforms mainstream models, including DNNGP and SoyDNGP, with performance gains of 1-9% on the Wheat2000 dataset, 1-8% on Wheat599, and 1-3% on Maize8652. Furthermore, comparative analysis shows a consistent 2-3% improvement in multi-phenotype predictions, emphasizing the impact of inter-phenotype correlations on accuracy. By leveraging multi-task learning, MtCro efficiently captures diverse plant phenotypes, enhancing both model training efficiency and prediction accuracy, ultimately accelerating the progress of plant genetic breeding. Our code is available on https://github.com/chaodian12/mtcro .</p>","PeriodicalId":20100,"journal":{"name":"Plant Methods","volume":"21 1","pages":"12"},"PeriodicalIF":4.7000,"publicationDate":"2025-02-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11796200/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Plant Methods","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1186/s13007-024-01321-0","RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BIOCHEMICAL RESEARCH METHODS","Score":null,"Total":0}
引用次数: 0
Abstract
Genomic Selection (GS) predicts traits using genome-wide markers, speeding up genetic progress and enhancing breeding efficiency. Recent emphasis has been placed on deep learning models to enhance prediction accuracy. However, current deep learning models focus on learning specific phenotypes for the given task, overlooking the inter-correlations among different phenotypes. In response, we introduce MtCro, a multi-task learning approach that simultaneously captures diverse plant phenotypes within a shared parameter space. Extensive experiments reveal that MtCro outperforms mainstream models, including DNNGP and SoyDNGP, with performance gains of 1-9% on the Wheat2000 dataset, 1-8% on Wheat599, and 1-3% on Maize8652. Furthermore, comparative analysis shows a consistent 2-3% improvement in multi-phenotype predictions, emphasizing the impact of inter-phenotype correlations on accuracy. By leveraging multi-task learning, MtCro efficiently captures diverse plant phenotypes, enhancing both model training efficiency and prediction accuracy, ultimately accelerating the progress of plant genetic breeding. Our code is available on https://github.com/chaodian12/mtcro .
期刊介绍:
Plant Methods is an open access, peer-reviewed, online journal for the plant research community that encompasses all aspects of technological innovation in the plant sciences.
There is no doubt that we have entered an exciting new era in plant biology. The completion of the Arabidopsis genome sequence, and the rapid progress being made in other plant genomics projects are providing unparalleled opportunities for progress in all areas of plant science. Nevertheless, enormous challenges lie ahead if we are to understand the function of every gene in the genome, and how the individual parts work together to make the whole organism. Achieving these goals will require an unprecedented collaborative effort, combining high-throughput, system-wide technologies with more focused approaches that integrate traditional disciplines such as cell biology, biochemistry and molecular genetics.
Technological innovation is probably the most important catalyst for progress in any scientific discipline. Plant Methods’ goal is to stimulate the development and adoption of new and improved techniques and research tools and, where appropriate, to promote consistency of methodologies for better integration of data from different laboratories.