{"title":"Machine learning-enhanced multi-trait genomic prediction for optimizing cannabinoid profiles in cannabis.","authors":"Mohsen Yoosefzadeh Najafabadi, Davoud Torkamaneh","doi":"10.1111/tpj.17164","DOIUrl":null,"url":null,"abstract":"<p><p>Cannabis sativa L., known for its medicinal and psychoactive properties, has recently experienced rapid market expansion but remains understudied in terms of its fundamental biology due to historical prohibitions. This pioneering study implements GS and ML to optimize cannabinoid profiles in cannabis breeding. We analyzed a representative population of drug-type cannabis accessions, quantifying major cannabinoids and utilizing high-density genotyping with 250K SNPs for GS. Our evaluations of various models-including ML algorithms, statistical methods, and Bayesian approaches-highlighted Random Forest's superior predictive accuracy for single and multi-trait genomic predictions, particularly for THC, CBD, and their precursors. Multi-trait analyses elucidated complex genetic interdependencies and identified key loci crucial to cannabinoid biosynthesis. These results demonstrate the efficacy of integrating GS and ML in developing cannabis varieties with tailored cannabinoid profiles.</p>","PeriodicalId":233,"journal":{"name":"The Plant Journal","volume":" ","pages":"e17164"},"PeriodicalIF":6.2000,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11711876/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"The Plant Journal","FirstCategoryId":"2","ListUrlMain":"https://doi.org/10.1111/tpj.17164","RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/11/27 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"PLANT SCIENCES","Score":null,"Total":0}
引用次数: 0
Abstract
Cannabis sativa L., known for its medicinal and psychoactive properties, has recently experienced rapid market expansion but remains understudied in terms of its fundamental biology due to historical prohibitions. This pioneering study implements GS and ML to optimize cannabinoid profiles in cannabis breeding. We analyzed a representative population of drug-type cannabis accessions, quantifying major cannabinoids and utilizing high-density genotyping with 250K SNPs for GS. Our evaluations of various models-including ML algorithms, statistical methods, and Bayesian approaches-highlighted Random Forest's superior predictive accuracy for single and multi-trait genomic predictions, particularly for THC, CBD, and their precursors. Multi-trait analyses elucidated complex genetic interdependencies and identified key loci crucial to cannabinoid biosynthesis. These results demonstrate the efficacy of integrating GS and ML in developing cannabis varieties with tailored cannabinoid profiles.
期刊介绍:
Publishing the best original research papers in all key areas of modern plant biology from the world"s leading laboratories, The Plant Journal provides a dynamic forum for this ever growing international research community.
Plant science research is now at the forefront of research in the biological sciences, with breakthroughs in our understanding of fundamental processes in plants matching those in other organisms. The impact of molecular genetics and the availability of model and crop species can be seen in all aspects of plant biology. For publication in The Plant Journal the research must provide a highly significant new contribution to our understanding of plants and be of general interest to the plant science community.