M. Yousaf, Muhammad Ahmad Khan, M. F. Tahir, Chen Zhichu, Fazal Badshah, S. Khalid
{"title":"相位测量单元数据清洗的有限元方法","authors":"M. Yousaf, Muhammad Ahmad Khan, M. F. Tahir, Chen Zhichu, Fazal Badshah, S. Khalid","doi":"10.1109/ETECTE55893.2022.10007132","DOIUrl":null,"url":null,"abstract":"With the rising use of Phase Measurement Units (PMUs) in smart grid applications, it is important for PMUs to function in extreme circumstances, resulting in outliers and missing dataset. Traditional approaches take an inordinate amount of time to clear outliers and fill missing data to assure better accuracy. This study offers a flexible ensemble approach (FEA) to construct a precise, rapid, and sustainable data cleaning procedure with Apache Spark. To discover outliers in the suggested system, an ensemble model based on a soft voting technique employs PCA in combination with the K-means, GMM, and iForest approach. The suggested method fills the data with an improved gradient-boosting decision tree for each obtained PMUs characteristic after outlier detection. The test results demonstrate that the proposed model acquired good accuracy during comparing with LOF and DBSCAN techniques. To evaluate the suggested technique's data-filling outcomes against modern methods such as decision tree and linear regression techniques, the MAE and RMSE criteria are applied.","PeriodicalId":131572,"journal":{"name":"2022 International Conference on Emerging Trends in Electrical, Control, and Telecommunication Engineering (ETECTE)","volume":"316 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-12-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"The FE Approach for Data Cleaning of Phase Measurement Units\",\"authors\":\"M. Yousaf, Muhammad Ahmad Khan, M. F. Tahir, Chen Zhichu, Fazal Badshah, S. Khalid\",\"doi\":\"10.1109/ETECTE55893.2022.10007132\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"With the rising use of Phase Measurement Units (PMUs) in smart grid applications, it is important for PMUs to function in extreme circumstances, resulting in outliers and missing dataset. Traditional approaches take an inordinate amount of time to clear outliers and fill missing data to assure better accuracy. This study offers a flexible ensemble approach (FEA) to construct a precise, rapid, and sustainable data cleaning procedure with Apache Spark. To discover outliers in the suggested system, an ensemble model based on a soft voting technique employs PCA in combination with the K-means, GMM, and iForest approach. The suggested method fills the data with an improved gradient-boosting decision tree for each obtained PMUs characteristic after outlier detection. The test results demonstrate that the proposed model acquired good accuracy during comparing with LOF and DBSCAN techniques. To evaluate the suggested technique's data-filling outcomes against modern methods such as decision tree and linear regression techniques, the MAE and RMSE criteria are applied.\",\"PeriodicalId\":131572,\"journal\":{\"name\":\"2022 International Conference on Emerging Trends in Electrical, Control, and Telecommunication Engineering (ETECTE)\",\"volume\":\"316 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-12-02\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 International Conference on Emerging Trends in Electrical, Control, and Telecommunication Engineering (ETECTE)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ETECTE55893.2022.10007132\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 International Conference on Emerging Trends in Electrical, Control, and Telecommunication Engineering (ETECTE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ETECTE55893.2022.10007132","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
The FE Approach for Data Cleaning of Phase Measurement Units
With the rising use of Phase Measurement Units (PMUs) in smart grid applications, it is important for PMUs to function in extreme circumstances, resulting in outliers and missing dataset. Traditional approaches take an inordinate amount of time to clear outliers and fill missing data to assure better accuracy. This study offers a flexible ensemble approach (FEA) to construct a precise, rapid, and sustainable data cleaning procedure with Apache Spark. To discover outliers in the suggested system, an ensemble model based on a soft voting technique employs PCA in combination with the K-means, GMM, and iForest approach. The suggested method fills the data with an improved gradient-boosting decision tree for each obtained PMUs characteristic after outlier detection. The test results demonstrate that the proposed model acquired good accuracy during comparing with LOF and DBSCAN techniques. To evaluate the suggested technique's data-filling outcomes against modern methods such as decision tree and linear regression techniques, the MAE and RMSE criteria are applied.