Combination of One-Class and Multi-Class Anomaly Detection Using Under-Sampling and Ensemble Technique in IoT Healthcare Data

S. Subha, J. Sathiaseelan
{"title":"Combination of One-Class and Multi-Class Anomaly Detection Using Under-Sampling and Ensemble Technique in IoT Healthcare Data","authors":"S. Subha, J. Sathiaseelan","doi":"10.17485/ijst/v17i5.1645","DOIUrl":null,"url":null,"abstract":"Objectives: This study addresses the concept drift issue in anomaly detection for IoT systems. The objective is to develop a novel approach that effectively handles the dynamic nature of IoT data. Methods: The proposed COMCADSET (Combination of One-Class and Multi-Class Anomaly Detection Using Under-Sampling and Ensemble Technique) addresses the concept drift challenge. It adapts to evolving data distributions, detects anomalies in IoT healthcare data, mitigates class distribution imbalances through under-sampling, and enhances performance with ensemble techniques. The approach involves four phases: multi-class anomaly spotting, one-class anomaly isolation, concept-drift-free dataset creation, and robust anomaly detection using ensembles. Evaluation utilizes the \"Heart Failure Prediction\" dataset from Kaggle, with comprehensive experiments and three classification algorithms. COMCADSET's innovation merges one-class and multi-class anomaly detection, under-sampling, and ensemble classification. It's compared against gold standards for classification accuracy, concept drift management, and anomaly detection performance. Findings: Conduct comprehensive experiments using a concept drift dataset and three classification algorithms to evaluate the efficacy of the COMCADSET technique. The experimental result shows the proposed COMCADSET technique attains an impressive 98.401% accuracy, decisively enhancing classification accuracy by adeptly addressing concept drift and identifying anomalies in IoT data. Early detection of abnormal behaviour prevents more significant issues and potential security vulnerabilities in IoT systems. Novelty: The novelty of the COMCADSET technique lies in its ability to address the concept drift issue and improve anomaly detection accuracy in IoT systems. By integrating one-class and multi-class anomaly detection, under-sampling, and ensemble techniques, the proposed approach provides a robust solution for handling the dynamic nature of IoT data. Keywords: Anomaly Detection, Concept Drift, Ensemble Classification, Internet of Things, Under­Sampling","PeriodicalId":508200,"journal":{"name":"Indian Journal Of Science And Technology","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2024-01-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Indian Journal Of Science And Technology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.17485/ijst/v17i5.1645","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Objectives: This study addresses the concept drift issue in anomaly detection for IoT systems. The objective is to develop a novel approach that effectively handles the dynamic nature of IoT data. Methods: The proposed COMCADSET (Combination of One-Class and Multi-Class Anomaly Detection Using Under-Sampling and Ensemble Technique) addresses the concept drift challenge. It adapts to evolving data distributions, detects anomalies in IoT healthcare data, mitigates class distribution imbalances through under-sampling, and enhances performance with ensemble techniques. The approach involves four phases: multi-class anomaly spotting, one-class anomaly isolation, concept-drift-free dataset creation, and robust anomaly detection using ensembles. Evaluation utilizes the "Heart Failure Prediction" dataset from Kaggle, with comprehensive experiments and three classification algorithms. COMCADSET's innovation merges one-class and multi-class anomaly detection, under-sampling, and ensemble classification. It's compared against gold standards for classification accuracy, concept drift management, and anomaly detection performance. Findings: Conduct comprehensive experiments using a concept drift dataset and three classification algorithms to evaluate the efficacy of the COMCADSET technique. The experimental result shows the proposed COMCADSET technique attains an impressive 98.401% accuracy, decisively enhancing classification accuracy by adeptly addressing concept drift and identifying anomalies in IoT data. Early detection of abnormal behaviour prevents more significant issues and potential security vulnerabilities in IoT systems. Novelty: The novelty of the COMCADSET technique lies in its ability to address the concept drift issue and improve anomaly detection accuracy in IoT systems. By integrating one-class and multi-class anomaly detection, under-sampling, and ensemble techniques, the proposed approach provides a robust solution for handling the dynamic nature of IoT data. Keywords: Anomaly Detection, Concept Drift, Ensemble Classification, Internet of Things, Under­Sampling
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
在物联网医疗数据中使用欠采样和集合技术将单类和多类异常检测相结合
研究目的本研究旨在解决物联网系统异常检测中的概念漂移问题。目的是开发一种新型方法,有效处理物联网数据的动态特性。方法提出的 COMCADSET(使用欠采样和集合技术的单类和多类异常检测组合)可解决概念漂移难题。它能适应不断变化的数据分布,检测物联网医疗数据中的异常,通过欠采样缓解类分布不平衡,并利用集合技术提高性能。该方法包括四个阶段:多类异常发现、单类异常隔离、无概念漂移数据集创建以及使用集合进行稳健异常检测。评估使用了 Kaggle 的 "心衰预测 "数据集,并进行了综合实验和三种分类算法。COMCADSET 的创新融合了单类和多类异常检测、欠采样和集合分类。它在分类准确性、概念漂移管理和异常检测性能方面与黄金标准进行了比较。研究结果使用概念漂移数据集和三种分类算法进行综合实验,评估 COMCADSET 技术的功效。实验结果表明,所提出的 COMCADSET 技术达到了令人印象深刻的 98.401% 的准确率,通过巧妙地处理概念漂移和识别物联网数据中的异常,决定性地提高了分类准确率。对异常行为的早期检测可防止物联网系统出现更多重大问题和潜在安全漏洞。新颖性:COMCADSET 技术的新颖性在于它能够解决概念漂移问题并提高物联网系统中异常检测的准确性。通过整合单类和多类异常检测、欠采样和集合技术,所提出的方法为处理物联网数据的动态特性提供了一种稳健的解决方案。关键词异常检测 概念漂移 集合分类 物联网 欠采样
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Theoretical and Finite Element Analysis of Pressure Vessel Image Analysis and Classification Using HRSVM-CNN for Land-Cover Classification by Using Remote Sensing Images Lattice Ordered G􀀀Semirings On the Solution of Blasius Boundary Layer Equations of Prandtl-Eyring Fluid Flow Past a Stretching Sheet Optimizing High Data Rate Up to 2.5 Tbps Transmission using 64-Channel DWDM System with DCF and NRZ Modulation
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1