Hakjong Shin, Taehyun Park, Seng-Kyoun Jo, Jae Young Jung
{"title":"加强水产养殖系统监测:缺失数据输入的机器学习算法的比较研究","authors":"Hakjong Shin, Taehyun Park, Seng-Kyoun Jo, Jae Young Jung","doi":"10.1016/j.aquaculture.2025.742303","DOIUrl":null,"url":null,"abstract":"<div><div>This study evaluated flow-through aquaculture systems by assessing the efficiency of various machine learning algorithms for imputing missing water-quality data, including dissolved oxygen, water temperature, pH, and salinity. Artificial missing data were generated based on real-world missing data mechanisms, and a comprehensive statistical analysis of the data characteristics was conducted to identify suitable imputation methods. Results showed that basic imputation methods like linear interpolation, often insufficient for datasets with high variability and non-linear relationships, performed well for certain data distributions, particularly for salinity and pH data with high kurtosis and symmetric distributions. However, advanced machine learning-based imputation techniques, especially TimesNet, consistently outperformed other methods in handling complex and variable patterns in the water-quality data. This study underscores the importance of selecting appropriate imputation methods based on data properties to enhance environmental monitoring systems in aquaculture and improve operational efficiency and sustainability.</div></div>","PeriodicalId":8375,"journal":{"name":"Aquaculture","volume":"601 ","pages":"Article 742303"},"PeriodicalIF":4.4000,"publicationDate":"2025-05-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Enhancing flow-through aquaculture system monitoring: A comparative study of machine learning algorithms for missing-data imputation\",\"authors\":\"Hakjong Shin, Taehyun Park, Seng-Kyoun Jo, Jae Young Jung\",\"doi\":\"10.1016/j.aquaculture.2025.742303\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>This study evaluated flow-through aquaculture systems by assessing the efficiency of various machine learning algorithms for imputing missing water-quality data, including dissolved oxygen, water temperature, pH, and salinity. Artificial missing data were generated based on real-world missing data mechanisms, and a comprehensive statistical analysis of the data characteristics was conducted to identify suitable imputation methods. Results showed that basic imputation methods like linear interpolation, often insufficient for datasets with high variability and non-linear relationships, performed well for certain data distributions, particularly for salinity and pH data with high kurtosis and symmetric distributions. However, advanced machine learning-based imputation techniques, especially TimesNet, consistently outperformed other methods in handling complex and variable patterns in the water-quality data. This study underscores the importance of selecting appropriate imputation methods based on data properties to enhance environmental monitoring systems in aquaculture and improve operational efficiency and sustainability.</div></div>\",\"PeriodicalId\":8375,\"journal\":{\"name\":\"Aquaculture\",\"volume\":\"601 \",\"pages\":\"Article 742303\"},\"PeriodicalIF\":4.4000,\"publicationDate\":\"2025-05-15\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Aquaculture\",\"FirstCategoryId\":\"97\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0044848625001899\",\"RegionNum\":1,\"RegionCategory\":\"农林科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2025/2/15 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"Q1\",\"JCRName\":\"FISHERIES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Aquaculture","FirstCategoryId":"97","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0044848625001899","RegionNum":1,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/2/15 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"FISHERIES","Score":null,"Total":0}
Enhancing flow-through aquaculture system monitoring: A comparative study of machine learning algorithms for missing-data imputation
This study evaluated flow-through aquaculture systems by assessing the efficiency of various machine learning algorithms for imputing missing water-quality data, including dissolved oxygen, water temperature, pH, and salinity. Artificial missing data were generated based on real-world missing data mechanisms, and a comprehensive statistical analysis of the data characteristics was conducted to identify suitable imputation methods. Results showed that basic imputation methods like linear interpolation, often insufficient for datasets with high variability and non-linear relationships, performed well for certain data distributions, particularly for salinity and pH data with high kurtosis and symmetric distributions. However, advanced machine learning-based imputation techniques, especially TimesNet, consistently outperformed other methods in handling complex and variable patterns in the water-quality data. This study underscores the importance of selecting appropriate imputation methods based on data properties to enhance environmental monitoring systems in aquaculture and improve operational efficiency and sustainability.
期刊介绍:
Aquaculture is an international journal for the exploration, improvement and management of all freshwater and marine food resources. It publishes novel and innovative research of world-wide interest on farming of aquatic organisms, which includes finfish, mollusks, crustaceans and aquatic plants for human consumption. Research on ornamentals is not a focus of the Journal. Aquaculture only publishes papers with a clear relevance to improving aquaculture practices or a potential application.