{"title":"Comparison of Prediction Methods for Air Pollution Data in Malaysia and Singapore","authors":"Merlinda Wibowo, Sarina Sulaiman, S. Shamsuddin","doi":"10.11113/IJIC.V8N3.202","DOIUrl":null,"url":null,"abstract":"The process for analyzing and extracting useful information from a large database that employs one or more machine learning techniques is Data Mining. There are many data mining methods that can be used in a variety of data patterns. One of them is prediction modeling. This study compares several data mining performance methods for prediction such as Naïve Bayes, Random Tree, J48, and Rough Set to get the most powerful classifier to extract the knowledge of air pollution data. The parameters being used for observation in the performance of the prediction methods are correctly and incorrectly classified instances, the time taken, and kappa statistic. The experimental result reveals that Rough Set is extremely good for classifying the Air Pollutant Index (API) data from Malaysia and Singapore. Rough Set has the lowest error and the highest performance compared to other methods with the accuracy more than 97%.","PeriodicalId":50314,"journal":{"name":"International Journal of Innovative Computing Information and Control","volume":"9 1","pages":""},"PeriodicalIF":1.3000,"publicationDate":"2018-11-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Innovative Computing Information and Control","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.11113/IJIC.V8N3.202","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 6
Abstract
The process for analyzing and extracting useful information from a large database that employs one or more machine learning techniques is Data Mining. There are many data mining methods that can be used in a variety of data patterns. One of them is prediction modeling. This study compares several data mining performance methods for prediction such as Naïve Bayes, Random Tree, J48, and Rough Set to get the most powerful classifier to extract the knowledge of air pollution data. The parameters being used for observation in the performance of the prediction methods are correctly and incorrectly classified instances, the time taken, and kappa statistic. The experimental result reveals that Rough Set is extremely good for classifying the Air Pollutant Index (API) data from Malaysia and Singapore. Rough Set has the lowest error and the highest performance compared to other methods with the accuracy more than 97%.
期刊介绍:
The primary aim of the International Journal of Innovative Computing, Information and Control (IJICIC) is to publish high-quality papers of new developments and trends, novel techniques and approaches, innovative methodologies and technologies on the theory and applications of intelligent systems, information and control. The IJICIC is a peer-reviewed English language journal and is published bimonthly