印度尼西亚降雨预测模型的数据挖掘技术性能比较

M. T. Anwar, W. Hadikurniawati, Edy Winarno, W. Widiyatmoko
{"title":"印度尼西亚降雨预测模型的数据挖掘技术性能比较","authors":"M. T. Anwar, W. Hadikurniawati, Edy Winarno, W. Widiyatmoko","doi":"10.1109/ISRITI51436.2020.9315460","DOIUrl":null,"url":null,"abstract":"Rain prediction is a crucial topic that continues to gain interest across the globe. Rain has a massive impact on various aspects of human life such as in agriculture, health, transportation, etc, and also some natural disasters. Various impacts of rain on human life prompts us to build a model to understand and predict rain to provide early warning for various use cases in various fields. Previous research on rain modeling using Data Mining (DM) techniques had suffered from low accuracy caused by the limited availability of the training data and their meteorological attributes. This research aims to address those issues by building the rain model using a richer and more abundant rain data in Indonesia. Four DM techniques are used and compared in this research i.e. the C4.5/J48, Random Forest (RF), Naïve Bayes (NB), and Multilayer Perceptron (MLP). The experimental results showed that the MLP and J48 algorithm can provide the best accuracy (up to 78,4%), which is better than previous research. Other key findings in this research include: (a) the selection of DM techniques has little effect on the model accuracy; (b) a larger training dataset generally improves model accuracy and a larger test dataset is necessary to get a representative realworld test accuracy, and (c) the two most influential attributes in rain modeling are the relative humidity and the minimum temperature, and we suggest to include cloud condensation nuclei in the next research to complete the model.","PeriodicalId":325920,"journal":{"name":"2020 3rd International Seminar on Research of Information Technology and Intelligent Systems (ISRITI)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-12-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":"{\"title\":\"Performance Comparison of Data Mining Techniques for Rain Prediction Models in Indonesia\",\"authors\":\"M. T. Anwar, W. Hadikurniawati, Edy Winarno, W. Widiyatmoko\",\"doi\":\"10.1109/ISRITI51436.2020.9315460\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Rain prediction is a crucial topic that continues to gain interest across the globe. Rain has a massive impact on various aspects of human life such as in agriculture, health, transportation, etc, and also some natural disasters. Various impacts of rain on human life prompts us to build a model to understand and predict rain to provide early warning for various use cases in various fields. Previous research on rain modeling using Data Mining (DM) techniques had suffered from low accuracy caused by the limited availability of the training data and their meteorological attributes. This research aims to address those issues by building the rain model using a richer and more abundant rain data in Indonesia. Four DM techniques are used and compared in this research i.e. the C4.5/J48, Random Forest (RF), Naïve Bayes (NB), and Multilayer Perceptron (MLP). The experimental results showed that the MLP and J48 algorithm can provide the best accuracy (up to 78,4%), which is better than previous research. Other key findings in this research include: (a) the selection of DM techniques has little effect on the model accuracy; (b) a larger training dataset generally improves model accuracy and a larger test dataset is necessary to get a representative realworld test accuracy, and (c) the two most influential attributes in rain modeling are the relative humidity and the minimum temperature, and we suggest to include cloud condensation nuclei in the next research to complete the model.\",\"PeriodicalId\":325920,\"journal\":{\"name\":\"2020 3rd International Seminar on Research of Information Technology and Intelligent Systems (ISRITI)\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-12-10\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"4\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 3rd International Seminar on Research of Information Technology and Intelligent Systems (ISRITI)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ISRITI51436.2020.9315460\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 3rd International Seminar on Research of Information Technology and Intelligent Systems (ISRITI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISRITI51436.2020.9315460","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4

摘要

降雨预报是一个至关重要的话题,在全球范围内不断引起人们的兴趣。雨水对人类生活的各个方面都有巨大的影响,如农业、健康、交通等,也有一些自然灾害。降雨对人类生活的各种影响促使我们建立一个模型来理解和预测降雨,为不同领域的各种用例提供预警。以往利用数据挖掘技术进行降雨建模的研究,由于训练数据及其气象属性的可用性有限,导致建模精度较低。本研究旨在通过使用印度尼西亚更丰富的降雨数据建立降雨模型来解决这些问题。本研究使用并比较了四种DM技术,即C4.5/J48、随机森林(RF)、Naïve贝叶斯(NB)和多层感知器(MLP)。实验结果表明,MLP和J48算法可以提供最好的准确率(高达78.4%),优于以往的研究。本研究的其他主要发现包括:(a) DM技术的选择对模型精度的影响不大;(b)更大的训练数据集通常会提高模型的精度,而更大的测试数据集才能获得具有代表性的真实世界测试精度;(c)降雨建模中影响最大的两个属性是相对湿度和最低温度,我们建议在下一步的研究中加入云凝结核来完成模型。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Performance Comparison of Data Mining Techniques for Rain Prediction Models in Indonesia
Rain prediction is a crucial topic that continues to gain interest across the globe. Rain has a massive impact on various aspects of human life such as in agriculture, health, transportation, etc, and also some natural disasters. Various impacts of rain on human life prompts us to build a model to understand and predict rain to provide early warning for various use cases in various fields. Previous research on rain modeling using Data Mining (DM) techniques had suffered from low accuracy caused by the limited availability of the training data and their meteorological attributes. This research aims to address those issues by building the rain model using a richer and more abundant rain data in Indonesia. Four DM techniques are used and compared in this research i.e. the C4.5/J48, Random Forest (RF), Naïve Bayes (NB), and Multilayer Perceptron (MLP). The experimental results showed that the MLP and J48 algorithm can provide the best accuracy (up to 78,4%), which is better than previous research. Other key findings in this research include: (a) the selection of DM techniques has little effect on the model accuracy; (b) a larger training dataset generally improves model accuracy and a larger test dataset is necessary to get a representative realworld test accuracy, and (c) the two most influential attributes in rain modeling are the relative humidity and the minimum temperature, and we suggest to include cloud condensation nuclei in the next research to complete the model.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Combined Firefly Algorithm-Random Forest to Classify Autistic Spectrum Disorders Analysis of Indonesia's Internet Topology Borders at the Autonomous System Level Influence Distribution Training Data on Performance Supervised Machine Learning Algorithms Design of Optimal Satellite Constellation for Indonesian Regional Navigation System based on GEO and GSO Satellites Real-time Testing on Improved Data Transmission Security in the Industrial Control System
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1