{"title":"A comparison of physics-based, data-driven, and hybrid modeling approaches for rice phenology prediction","authors":"Jin Yu, Yifan Zhao, Guoqing Lei, Wenzhi Zeng","doi":"10.1002/agj2.70010","DOIUrl":null,"url":null,"abstract":"<p>Accurate prediction of paddy rice (<i>Oryza sativa</i> L.) phenology is necessary for informing field management and improving yield. There exist different ways, including physics-based, data-driven, and hybrid approaches, to make rice phenology prediction. However, few studies have investigated the performance of the above three modeling approaches. This study compared the performance of a physics-based model (ORYZA), a data-driven model (using the distributed random forest [DRF] technique), and a hybrid model (an integration of the ORYZA model and DRF-based rice development rate parameter estimates) for rice panicle initiation and flowering date prediction. The feature importance analysis method was introduced to quantify the relative importance of input variables for rice phenology prediction. The results showed the following: (1) Rice genotypes and cultivation patterns resulted in poor performance of the ORYZA model for phenology prediction, whose root mean square error (RMSE) ranged from 6.01 to 8.12 days, and the coefficient of determination (<i>R</i><sup>2</sup>) ranged from 0.06 to 0.24. (2) The hybrid model, whose RMSE ranged from 3.11 to 3.66 days, improved the ORYZA model but still underperformed the data-driven model, whose RMSE ranged from 2.44 to 2.57 days. The worse performance might be attributed to the poor prediction accuracy of the model parameter, development rate in the juvenile phase, where the mean absolute percentage error was 0.286. (3) Satellite-based vegetation indices, leaf area index, and evapotranspiration played an important role in determining the predictive capacity of the DRF technique for ORYZA model parameters and rice phenology. Overall, we suggested using data-driven models for accurate rice phenology prediction.</p>","PeriodicalId":7522,"journal":{"name":"Agronomy Journal","volume":"117 1","pages":""},"PeriodicalIF":2.0000,"publicationDate":"2025-01-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Agronomy Journal","FirstCategoryId":"97","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/agj2.70010","RegionNum":3,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"AGRONOMY","Score":null,"Total":0}
引用次数: 0
Abstract
Accurate prediction of paddy rice (Oryza sativa L.) phenology is necessary for informing field management and improving yield. There exist different ways, including physics-based, data-driven, and hybrid approaches, to make rice phenology prediction. However, few studies have investigated the performance of the above three modeling approaches. This study compared the performance of a physics-based model (ORYZA), a data-driven model (using the distributed random forest [DRF] technique), and a hybrid model (an integration of the ORYZA model and DRF-based rice development rate parameter estimates) for rice panicle initiation and flowering date prediction. The feature importance analysis method was introduced to quantify the relative importance of input variables for rice phenology prediction. The results showed the following: (1) Rice genotypes and cultivation patterns resulted in poor performance of the ORYZA model for phenology prediction, whose root mean square error (RMSE) ranged from 6.01 to 8.12 days, and the coefficient of determination (R2) ranged from 0.06 to 0.24. (2) The hybrid model, whose RMSE ranged from 3.11 to 3.66 days, improved the ORYZA model but still underperformed the data-driven model, whose RMSE ranged from 2.44 to 2.57 days. The worse performance might be attributed to the poor prediction accuracy of the model parameter, development rate in the juvenile phase, where the mean absolute percentage error was 0.286. (3) Satellite-based vegetation indices, leaf area index, and evapotranspiration played an important role in determining the predictive capacity of the DRF technique for ORYZA model parameters and rice phenology. Overall, we suggested using data-driven models for accurate rice phenology prediction.
期刊介绍:
After critical review and approval by the editorial board, AJ publishes articles reporting research findings in soil–plant relationships; crop science; soil science; biometry; crop, soil, pasture, and range management; crop, forage, and pasture production and utilization; turfgrass; agroclimatology; agronomic models; integrated pest management; integrated agricultural systems; and various aspects of entomology, weed science, animal science, plant pathology, and agricultural economics as applied to production agriculture.
Notes are published about apparatus, observations, and experimental techniques. Observations usually are limited to studies and reports of unrepeatable phenomena or other unique circumstances. Review and interpretation papers are also published, subject to standard review. Contributions to the Forum section deal with current agronomic issues and questions in brief, thought-provoking form. Such papers are reviewed by the editor in consultation with the editorial board.