Raja Rajeswary Thanmalagan, Abhijeet Roy, Aiswarya Jayaprakash, P.T.V. Lakshmi
{"title":"Comprehensive meta-analysis and machine learning approaches identified the role of novel drought specific genes in Oryza sativa","authors":"Raja Rajeswary Thanmalagan, Abhijeet Roy, Aiswarya Jayaprakash, P.T.V. Lakshmi","doi":"10.1016/j.plgene.2022.100382","DOIUrl":null,"url":null,"abstract":"<div><p><span>Rice is a major food crop and provides nutrition for half of the world's population. Rice production is majorly affected by drought at different developmental stages and accounted for annual yield loss depending on the intensity of drought. Hence, the need to study the molecular mechanism in a holistic manner behind drought tolerance is a prerequisite to mitigating this problem. Therefore, in the current study, the drought tolerance mechanism of rice plants was elucidated through a meta-analysis on the publically available </span>transcriptomic<span> datasets by integrating these datasets using a R package to remove the batch effects and applying machine learning approaches for prediction robustness and accuracy. Thus, the classifier model identified 128 essential genes through feature selection algorithms and classification methods on training datasets. The comprehensive study revealed that Naïve Bayes<span> classification and correlation-based feature selection was robust in the prediction of essential genes. The accuracy and performance of the classification model was validated with the independent test dataset and the prediction accuracy of the classifier was 93% with ROC (0.972) and F-measures (0.927). Further, the biological significance of the identified genes in drought tolerance was assessed. The current analysis highlighted the regulatory roles of novel genes such as Os01g0844300, Os06g0246500, Os05g03733900, Os05g0550600 Os08g0442900, Os08g0104400, Os01g0256500, Os02g0259900 and Os05g0572700 in the enhancement of drought tolerance mechanisms. Thus the identified genes might be the potential targets for molecular breeding of drought-tolerant rice cultivars.</span></span></p></div>","PeriodicalId":38041,"journal":{"name":"Plant Gene","volume":"32 ","pages":"Article 100382"},"PeriodicalIF":2.2000,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Plant Gene","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2352407322000324","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"GENETICS & HEREDITY","Score":null,"Total":0}
引用次数: 1
Abstract
Rice is a major food crop and provides nutrition for half of the world's population. Rice production is majorly affected by drought at different developmental stages and accounted for annual yield loss depending on the intensity of drought. Hence, the need to study the molecular mechanism in a holistic manner behind drought tolerance is a prerequisite to mitigating this problem. Therefore, in the current study, the drought tolerance mechanism of rice plants was elucidated through a meta-analysis on the publically available transcriptomic datasets by integrating these datasets using a R package to remove the batch effects and applying machine learning approaches for prediction robustness and accuracy. Thus, the classifier model identified 128 essential genes through feature selection algorithms and classification methods on training datasets. The comprehensive study revealed that Naïve Bayes classification and correlation-based feature selection was robust in the prediction of essential genes. The accuracy and performance of the classification model was validated with the independent test dataset and the prediction accuracy of the classifier was 93% with ROC (0.972) and F-measures (0.927). Further, the biological significance of the identified genes in drought tolerance was assessed. The current analysis highlighted the regulatory roles of novel genes such as Os01g0844300, Os06g0246500, Os05g03733900, Os05g0550600 Os08g0442900, Os08g0104400, Os01g0256500, Os02g0259900 and Os05g0572700 in the enhancement of drought tolerance mechanisms. Thus the identified genes might be the potential targets for molecular breeding of drought-tolerant rice cultivars.
Plant GeneAgricultural and Biological Sciences-Plant Science
CiteScore
4.50
自引率
0.00%
发文量
42
审稿时长
51 days
期刊介绍:
Plant Gene publishes papers that focus on the regulation, expression, function and evolution of genes in plants, algae and other photosynthesizing organisms (e.g., cyanobacteria), and plant-associated microorganisms. Plant Gene strives to be a diverse plant journal and topics in multiple fields will be considered for publication. Although not limited to the following, some general topics include: Gene discovery and characterization, Gene regulation in response to environmental stress (e.g., salinity, drought, etc.), Genetic effects of transposable elements, Genetic control of secondary metabolic pathways and metabolic enzymes. Herbal Medicine - regulation and medicinal properties of plant products, Plant hormonal signaling, Plant evolutionary genetics, molecular evolution, population genetics, and phylogenetics, Profiling of plant gene expression and genetic variation, Plant-microbe interactions (e.g., influence of endophytes on gene expression; horizontal gene transfer studies; etc.), Agricultural genetics - biotechnology and crop improvement.