一种有效分类水的集成机器学习框架

Isha Aleem, Attique ur Rehman, Sabeen Javaid, Tahir Muhammad Ali
{"title":"一种有效分类水的集成机器学习框架","authors":"Isha Aleem, Attique ur Rehman, Sabeen Javaid, Tahir Muhammad Ali","doi":"10.1109/ICEPECC57281.2023.10209495","DOIUrl":null,"url":null,"abstract":"Water is an essential need for all living things. The problem people are facing especially in urban areas is the diseases that have been caused by water which is mainly Dengue or malaria because there are many metals such as ammonium aluminum silver and other bacterial or viral things present in water. The first step toward a healthy lifestyle is to drink purified water. In this paper, the methodology that has been used here is to detect whether the water people are using for drinking purposes is safe enough to use or not in the data set and the specific methodology is binomial type as yes or no, 912 for positive states and 7084 negative states which means that 0.8864 for 0 and 0.114 for 1. The ratio for negative is far higher than the positive one. We tested the model in two ways first with simple feature extraction, smote Upsampling and with vote ensemble. In smote Upsampling accuracy with the random forest is 88.21 % (highest). The classification error of random forest is 11.7% and with the highest rate is 94.73% which is recall and specificity is 81.71% which is the lowest in the random forest whereas with vote ensemble the combinations of two algorithms have been used and the highest accuracy is from naive Bayes and KNN and the accuracy from them is 96.00% with the classification error of 6% only its precision rate is higher which is 98.89% and lowest specificity of 75.00%.","PeriodicalId":102289,"journal":{"name":"2023 International Conference on Energy, Power, Environment, Control, and Computing (ICEPECC)","volume":"114 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-03-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"An Integrated Machine Learning Framework for Effective Classification of Water\",\"authors\":\"Isha Aleem, Attique ur Rehman, Sabeen Javaid, Tahir Muhammad Ali\",\"doi\":\"10.1109/ICEPECC57281.2023.10209495\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Water is an essential need for all living things. The problem people are facing especially in urban areas is the diseases that have been caused by water which is mainly Dengue or malaria because there are many metals such as ammonium aluminum silver and other bacterial or viral things present in water. The first step toward a healthy lifestyle is to drink purified water. In this paper, the methodology that has been used here is to detect whether the water people are using for drinking purposes is safe enough to use or not in the data set and the specific methodology is binomial type as yes or no, 912 for positive states and 7084 negative states which means that 0.8864 for 0 and 0.114 for 1. The ratio for negative is far higher than the positive one. We tested the model in two ways first with simple feature extraction, smote Upsampling and with vote ensemble. In smote Upsampling accuracy with the random forest is 88.21 % (highest). The classification error of random forest is 11.7% and with the highest rate is 94.73% which is recall and specificity is 81.71% which is the lowest in the random forest whereas with vote ensemble the combinations of two algorithms have been used and the highest accuracy is from naive Bayes and KNN and the accuracy from them is 96.00% with the classification error of 6% only its precision rate is higher which is 98.89% and lowest specificity of 75.00%.\",\"PeriodicalId\":102289,\"journal\":{\"name\":\"2023 International Conference on Energy, Power, Environment, Control, and Computing (ICEPECC)\",\"volume\":\"114 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-03-08\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2023 International Conference on Energy, Power, Environment, Control, and Computing (ICEPECC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICEPECC57281.2023.10209495\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 International Conference on Energy, Power, Environment, Control, and Computing (ICEPECC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICEPECC57281.2023.10209495","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

水是所有生物的必需品。人们面临的问题,尤其是在城市地区,是由水引起的疾病,主要是登革热或疟疾,因为有许多金属,如铵铝银和其他细菌或病毒的东西存在于水中。健康生活方式的第一步是喝纯净水。在本文中,这里使用的方法是检测数据集中人们用于饮用的水是否足够安全,具体方法是二项型,为yes或no, 912为阳性状态,7084为阴性状态,即0.8864为0,0.114为1。负的比率远远高于正的比率。我们用两种方法测试了该模型,首先是简单的特征提取,smote Upsampling和投票集成。在smote中,随机森林的上采样精度为88.21%(最高)。随机森林的分类误差是11.7%,召回率最高的是94.73%,特异性是81.71%最低的随机森林而投票合奏的组合有两种算法和最高精度从朴素贝叶斯和资讯和分类的精度是96.00%的误差只有6%精确率较高的最低98.89%,特异性75.00%。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
An Integrated Machine Learning Framework for Effective Classification of Water
Water is an essential need for all living things. The problem people are facing especially in urban areas is the diseases that have been caused by water which is mainly Dengue or malaria because there are many metals such as ammonium aluminum silver and other bacterial or viral things present in water. The first step toward a healthy lifestyle is to drink purified water. In this paper, the methodology that has been used here is to detect whether the water people are using for drinking purposes is safe enough to use or not in the data set and the specific methodology is binomial type as yes or no, 912 for positive states and 7084 negative states which means that 0.8864 for 0 and 0.114 for 1. The ratio for negative is far higher than the positive one. We tested the model in two ways first with simple feature extraction, smote Upsampling and with vote ensemble. In smote Upsampling accuracy with the random forest is 88.21 % (highest). The classification error of random forest is 11.7% and with the highest rate is 94.73% which is recall and specificity is 81.71% which is the lowest in the random forest whereas with vote ensemble the combinations of two algorithms have been used and the highest accuracy is from naive Bayes and KNN and the accuracy from them is 96.00% with the classification error of 6% only its precision rate is higher which is 98.89% and lowest specificity of 75.00%.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Generative Adversarial Networks (GANs) Video Framework: A Systematic Literature Review A Novel Decentralized Coordination Control Scheme for the Complex Transactive Energy Prosumers Biometric Electroencephalogram Based Random Number Generator Seasonal Impact on the Storage Capacity Sizing in a Renewable Energy System Under the Condition of Safe Operation Investigation of Optimum Phase Change Material for PV Panels in Malaysian Climatic Conditions
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1