小型水产养殖数据库缺失值的回归处理

Ricky Afiful Maula, A. Gunawan, Bima Sena Bayu Dewantara, M. A. Al Rasyid, Setiawardhana Setiawardhana, Ferry Astika Saputra, Junaedi Ispianto
{"title":"小型水产养殖数据库缺失值的回归处理","authors":"Ricky Afiful Maula, A. Gunawan, Bima Sena Bayu Dewantara, M. A. Al Rasyid, Setiawardhana Setiawardhana, Ferry Astika Saputra, Junaedi Ispianto","doi":"10.17529/jre.v18i3.25903","DOIUrl":null,"url":null,"abstract":"Regression (SVR) 0.739, (KR) Abstract —Shrimp cultivation is strongly influenced by pond water quality conditions. Farmers must know the appropriate action in regulating water quality that is suitable for shrimp survival. The state of water quality can be understood by measuring pond parameters using various sensors. Installing sensors equipped with artificial intelligence modules to inform water quality conditions is the right action. However, the sensor cannot be separated from errors, so it results in not being able to get data or missing data. In this case, the approach of 5 parameters of pond water quality from 13 available parameters is carried out. This paper proposes a technique to obtain lost data caused by sensor error and looks for the best model. A simple approach can be taken, such as the Handling Missing Value (HMV) which is commonly used, namely the mean, with the K-Nearest Neighbors (KNN) classifier optimized using a grid search. However, the accuracy of this technique is still low, reaching 0.739 at 20-fold cross-validation. Calculations were carried out with other methods to further improve the prediction accuracy. It was found that Linear Regression (LR) can increase accuracy up to 0.757, which outperforms different approaches such as the statistical approach to mean 0.739, mode 0.716, median 0.734, and regression approach KNN 0.742, Lasso 0.751, Passive Aggressive Regressor (PAR) 0.737, Support Vector Regression (SVR) 0.739, Kernel Ridge (KR) 0.731, and Stochastic Gradient Descent (SGD) 0.734.","PeriodicalId":30766,"journal":{"name":"Jurnal Rekayasa Elektrika","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2022-09-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Handling Missing Value dengan Pendekatan Regresi pada Dataset Akuakultur Berukuran Kecil\",\"authors\":\"Ricky Afiful Maula, A. Gunawan, Bima Sena Bayu Dewantara, M. A. Al Rasyid, Setiawardhana Setiawardhana, Ferry Astika Saputra, Junaedi Ispianto\",\"doi\":\"10.17529/jre.v18i3.25903\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Regression (SVR) 0.739, (KR) Abstract —Shrimp cultivation is strongly influenced by pond water quality conditions. Farmers must know the appropriate action in regulating water quality that is suitable for shrimp survival. The state of water quality can be understood by measuring pond parameters using various sensors. Installing sensors equipped with artificial intelligence modules to inform water quality conditions is the right action. However, the sensor cannot be separated from errors, so it results in not being able to get data or missing data. In this case, the approach of 5 parameters of pond water quality from 13 available parameters is carried out. This paper proposes a technique to obtain lost data caused by sensor error and looks for the best model. A simple approach can be taken, such as the Handling Missing Value (HMV) which is commonly used, namely the mean, with the K-Nearest Neighbors (KNN) classifier optimized using a grid search. However, the accuracy of this technique is still low, reaching 0.739 at 20-fold cross-validation. Calculations were carried out with other methods to further improve the prediction accuracy. It was found that Linear Regression (LR) can increase accuracy up to 0.757, which outperforms different approaches such as the statistical approach to mean 0.739, mode 0.716, median 0.734, and regression approach KNN 0.742, Lasso 0.751, Passive Aggressive Regressor (PAR) 0.737, Support Vector Regression (SVR) 0.739, Kernel Ridge (KR) 0.731, and Stochastic Gradient Descent (SGD) 0.734.\",\"PeriodicalId\":30766,\"journal\":{\"name\":\"Jurnal Rekayasa Elektrika\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-09-26\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Jurnal Rekayasa Elektrika\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.17529/jre.v18i3.25903\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Jurnal Rekayasa Elektrika","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.17529/jre.v18i3.25903","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

摘要

回归(SVR)0.739,(KR)摘要——池塘水质条件对对虾养殖有很大影响。农民必须知道在调节适合虾生存的水质方面采取的适当行动。水质状态可以通过使用各种传感器测量池塘参数来理解。安装装有人工智能模块的传感器来告知水质状况是正确的做法。然而,传感器无法与错误分离,因此导致无法获取数据或数据丢失。在这种情况下,对13个可用参数中的5个池塘水质参数进行了逼近。本文提出了一种获取传感器误差引起的数据丢失的技术,并寻找最佳模型。可以采取一种简单的方法,例如通常使用的处理缺失值(HMV),即均值,以及使用网格搜索优化的K-最近邻(KNN)分类器。然而,该技术的准确性仍然很低,在20倍交叉验证时达到0.739。采用其他方法进行了计算,进一步提高了预测精度。研究发现,线性回归(LR)可以将精度提高到0.757,这优于不同的方法,如平均值为0.739、模式为0.716、中值为0.734的统计方法,以及KNN 0.742、Lasso 0.751、被动攻击性回归(标准杆数)0.737、支持向量回归(SVR)0.739、核岭(KR)0.731和随机梯度下降(SGD)0.734的回归方法。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Handling Missing Value dengan Pendekatan Regresi pada Dataset Akuakultur Berukuran Kecil
Regression (SVR) 0.739, (KR) Abstract —Shrimp cultivation is strongly influenced by pond water quality conditions. Farmers must know the appropriate action in regulating water quality that is suitable for shrimp survival. The state of water quality can be understood by measuring pond parameters using various sensors. Installing sensors equipped with artificial intelligence modules to inform water quality conditions is the right action. However, the sensor cannot be separated from errors, so it results in not being able to get data or missing data. In this case, the approach of 5 parameters of pond water quality from 13 available parameters is carried out. This paper proposes a technique to obtain lost data caused by sensor error and looks for the best model. A simple approach can be taken, such as the Handling Missing Value (HMV) which is commonly used, namely the mean, with the K-Nearest Neighbors (KNN) classifier optimized using a grid search. However, the accuracy of this technique is still low, reaching 0.739 at 20-fold cross-validation. Calculations were carried out with other methods to further improve the prediction accuracy. It was found that Linear Regression (LR) can increase accuracy up to 0.757, which outperforms different approaches such as the statistical approach to mean 0.739, mode 0.716, median 0.734, and regression approach KNN 0.742, Lasso 0.751, Passive Aggressive Regressor (PAR) 0.737, Support Vector Regression (SVR) 0.739, Kernel Ridge (KR) 0.731, and Stochastic Gradient Descent (SGD) 0.734.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
24
审稿时长
24 weeks
期刊最新文献
Impact of Segmentation and Popularity-based Cache Replacement Policies on Named Data Networking Real-Time Detection of Power Quality Disturbance Using Fast Fourier Transform and Adaptive Neuro-Fuzzy Inference System Wireless Photoplethysmography (PPG) Measurement with Pulse Wave Velocity (PWV) Method for Arterial Stiffness Evaluation Augmentation of Additional Arabic Dataset for Jawi Writing and Classification Using Deep Learning Anomaly Detection for Security in Children's Play Areas Based on Image Using Multiple Lines Detection Method
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1