{"title":"Data-centric Cyber-attack Detection in Community Microgrids Using ML Techniques","authors":"R. Trivedi, S. Patra, S. Khadem","doi":"10.1109/GlobConPT57482.2022.9938333","DOIUrl":null,"url":null,"abstract":"This article proposes a data-centric strategy that emphasises data preprocessing, interpretation of machine learning models' performance, improving data quality and modifying models to deal with issues identified during the iterative loop of classification model development. The framework consists of three stages: stage-1 focuses on data collection and pre-processing, followed by data quality improvement and feature extraction in stage-2, and the final stage-3 with model hyper-parameter tuning. The concept of model interpretation is added within the framework that helps to understand the learning behaviour of machine learning (ML) models. This makes the models' performance more explainable and is known as Explainable Artificial Intelligence (XAI). For stage-1, the data is generated from a simulation of cyber-attacks in CIGRE low voltage microgrid network, which is then preprocessed. In stage-2, data is augmented using ensembled Synthetic Minority Over-sampling Technique (SMOTE) and Edited Nearest Neighbour (ENN) methods, followed by feature extraction using the Boruta python package. Finally, the hyper-parameters are tuned through a Tree-structured Parzen Estimator (TPE) algorithm. A time-series transformer model is also presented for cyber-attack detection. The findings from the proposed approach demonstrate that the model's predictive performance increases with subsequent stages.","PeriodicalId":431406,"journal":{"name":"2022 IEEE Global Conference on Computing, Power and Communication Technologies (GlobConPT)","volume":"8 4","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-09-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE Global Conference on Computing, Power and Communication Technologies (GlobConPT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/GlobConPT57482.2022.9938333","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
This article proposes a data-centric strategy that emphasises data preprocessing, interpretation of machine learning models' performance, improving data quality and modifying models to deal with issues identified during the iterative loop of classification model development. The framework consists of three stages: stage-1 focuses on data collection and pre-processing, followed by data quality improvement and feature extraction in stage-2, and the final stage-3 with model hyper-parameter tuning. The concept of model interpretation is added within the framework that helps to understand the learning behaviour of machine learning (ML) models. This makes the models' performance more explainable and is known as Explainable Artificial Intelligence (XAI). For stage-1, the data is generated from a simulation of cyber-attacks in CIGRE low voltage microgrid network, which is then preprocessed. In stage-2, data is augmented using ensembled Synthetic Minority Over-sampling Technique (SMOTE) and Edited Nearest Neighbour (ENN) methods, followed by feature extraction using the Boruta python package. Finally, the hyper-parameters are tuned through a Tree-structured Parzen Estimator (TPE) algorithm. A time-series transformer model is also presented for cyber-attack detection. The findings from the proposed approach demonstrate that the model's predictive performance increases with subsequent stages.