Andres Molares-Ulloa , Elisabet Rocruz , Daniel Rivero , Xosé A. Padin , Rita Nolasco , Jesús Dubert , Enrique Fernandez-Blanco
{"title":"Towards improved harmful algal bloom forecasts: A comparison of symbolic regression with DoME and stream learning performance","authors":"Andres Molares-Ulloa , Elisabet Rocruz , Daniel Rivero , Xosé A. Padin , Rita Nolasco , Jesús Dubert , Enrique Fernandez-Blanco","doi":"10.1016/j.compag.2025.110112","DOIUrl":null,"url":null,"abstract":"<div><div>Diarrhetic Shellfish Poisoning (DSP) is a global health issue caused by shellfish contaminated with toxins from dinoflagellates, posing significant risks to public health and the shellfish industry. Harmful Algal Blooms (HABs), driven by toxin-producing algae like DSP, require effective monitoring and forecasting systems. Predicting HABs is challenging due to the time-series nature of the problem, influenced by historical seasonal patterns and recent anomalies from meteorological and oceanographic changes. Stream Learning shows promise for handling time-series problems with concept drifts but has yet to be validated for HAB prediction compared to Batch Learning. Limited historical data availability in oceanography highlights the importance of advanced tools like the CROCO ocean hydrodynamic model, which provides high-resolution temporal and spatial data. This study developed a machine learning workflow to predict toxic dinoflagellate (<em>Dinophysis acuminata</em>) cell counts, comparing seven algorithms across two learning paradigms. The CROCO model data addressed historical data gaps. The DoME model, with an average <span><math><msup><mrow><mi>R</mi></mrow><mrow><mn>2</mn></mrow></msup></math></span> of 0.77 for 3-day-ahead predictions, proved the most effective and interpretable, underscoring the value of model explainability and rigorous comparison methodologies.</div></div>","PeriodicalId":50627,"journal":{"name":"Computers and Electronics in Agriculture","volume":"233 ","pages":"Article 110112"},"PeriodicalIF":7.7000,"publicationDate":"2025-02-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computers and Electronics in Agriculture","FirstCategoryId":"97","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0168169925002182","RegionNum":1,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AGRICULTURE, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0
Abstract
Diarrhetic Shellfish Poisoning (DSP) is a global health issue caused by shellfish contaminated with toxins from dinoflagellates, posing significant risks to public health and the shellfish industry. Harmful Algal Blooms (HABs), driven by toxin-producing algae like DSP, require effective monitoring and forecasting systems. Predicting HABs is challenging due to the time-series nature of the problem, influenced by historical seasonal patterns and recent anomalies from meteorological and oceanographic changes. Stream Learning shows promise for handling time-series problems with concept drifts but has yet to be validated for HAB prediction compared to Batch Learning. Limited historical data availability in oceanography highlights the importance of advanced tools like the CROCO ocean hydrodynamic model, which provides high-resolution temporal and spatial data. This study developed a machine learning workflow to predict toxic dinoflagellate (Dinophysis acuminata) cell counts, comparing seven algorithms across two learning paradigms. The CROCO model data addressed historical data gaps. The DoME model, with an average of 0.77 for 3-day-ahead predictions, proved the most effective and interpretable, underscoring the value of model explainability and rigorous comparison methodologies.
期刊介绍:
Computers and Electronics in Agriculture provides international coverage of advancements in computer hardware, software, electronic instrumentation, and control systems applied to agricultural challenges. Encompassing agronomy, horticulture, forestry, aquaculture, and animal farming, the journal publishes original papers, reviews, and applications notes. It explores the use of computers and electronics in plant or animal agricultural production, covering topics like agricultural soils, water, pests, controlled environments, and waste. The scope extends to on-farm post-harvest operations and relevant technologies, including artificial intelligence, sensors, machine vision, robotics, networking, and simulation modeling. Its companion journal, Smart Agricultural Technology, continues the focus on smart applications in production agriculture.