Supakan Janthong, Rakkrit Duangsoithong, K. Chalermyanont
{"title":"利用电力档案和物理数据对电力公司中的风险群体和窃电行为进行特征提取分类","authors":"Supakan Janthong, Rakkrit Duangsoithong, K. Chalermyanont","doi":"10.37936/ecti-cit.2024181.252738","DOIUrl":null,"url":null,"abstract":"Non-technical loss (NTL) is one of the problems that has been a major issue in lost revenue for many years. Electricity distributors have attempted to reduce NTL by detecting electricity theft using various methods. Some events are difficult to detect that conventional meters inspection is inadequate. Moreover, many anomaly patterns found are very complex, confusing in identifying or distinguishing what types of electricity customers are at abnormal risk or energy theft that affects NTL. This paper proposes five key feature extraction methods and six classifying electricity customers using supervised learning. The main problem was studied and collected information, including kilowatt meters, electronic meters, TOU meters, and AMR meters, which cover four customer types that were recorded in the Provincial Electricity Authority (PEA) of Thailand. An electrical profile to be extracted for in-depth analysis of the behavior of each type of electricity customer, combined with the information of physical data to help enhance and increase efficiency. All features examined the relationships in each feature using Pearson correlation and handled unbalanced data using random oversampling (ROS). Then, the extracted data has been trained, validated, and tested to classify three classes: normal, risk, and theft, where we evaluate the results with performance metrics. The results show that random forest (RF) outperforms the rest of the classifiers by achieving a precision-recall area under the curve of 90% and a receiver operating characteristic curve of 78%. Significantly, the results were compared to previous studies and benchmark datasets, which revealed that the proposed method gave better results than other techniques.","PeriodicalId":507234,"journal":{"name":"ECTI Transactions on Computer and Information Technology (ECTI-CIT)","volume":"32 4","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-01-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Feature Extraction of Risk Group and Electricity Theft by using Electrical Profiles and Physical Data for Classification in the Power Utilities\",\"authors\":\"Supakan Janthong, Rakkrit Duangsoithong, K. Chalermyanont\",\"doi\":\"10.37936/ecti-cit.2024181.252738\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Non-technical loss (NTL) is one of the problems that has been a major issue in lost revenue for many years. Electricity distributors have attempted to reduce NTL by detecting electricity theft using various methods. Some events are difficult to detect that conventional meters inspection is inadequate. Moreover, many anomaly patterns found are very complex, confusing in identifying or distinguishing what types of electricity customers are at abnormal risk or energy theft that affects NTL. This paper proposes five key feature extraction methods and six classifying electricity customers using supervised learning. The main problem was studied and collected information, including kilowatt meters, electronic meters, TOU meters, and AMR meters, which cover four customer types that were recorded in the Provincial Electricity Authority (PEA) of Thailand. An electrical profile to be extracted for in-depth analysis of the behavior of each type of electricity customer, combined with the information of physical data to help enhance and increase efficiency. All features examined the relationships in each feature using Pearson correlation and handled unbalanced data using random oversampling (ROS). Then, the extracted data has been trained, validated, and tested to classify three classes: normal, risk, and theft, where we evaluate the results with performance metrics. The results show that random forest (RF) outperforms the rest of the classifiers by achieving a precision-recall area under the curve of 90% and a receiver operating characteristic curve of 78%. Significantly, the results were compared to previous studies and benchmark datasets, which revealed that the proposed method gave better results than other techniques.\",\"PeriodicalId\":507234,\"journal\":{\"name\":\"ECTI Transactions on Computer and Information Technology (ECTI-CIT)\",\"volume\":\"32 4\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-01-20\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"ECTI Transactions on Computer and Information Technology (ECTI-CIT)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.37936/ecti-cit.2024181.252738\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"ECTI Transactions on Computer and Information Technology (ECTI-CIT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.37936/ecti-cit.2024181.252738","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
摘要
非技术性损失(NTL)是多年来收入损失的主要问题之一。配电公司试图通过各种方法检测窃电行为来减少 NTL。有些事件很难被发现,传统的电表检查是不够的。此外,发现的许多异常模式非常复杂,在识别或区分哪些类型的电力客户存在异常风险或影响 NTL 的窃电行为时容易混淆。本文提出了五种关键特征提取方法和六种利用监督学习对电力客户进行分类的方法。主要问题是研究和收集信息,包括千瓦表、电子表、TOU 表和 AMR 表,涵盖泰国省电力局(PEA)记录的四种客户类型。将提取的电力概况与物理数据信息相结合,对各类电力客户的行为进行深入分析,以帮助增强和提高效率。所有特征均使用皮尔逊相关性检验每个特征中的关系,并使用随机超采样(ROS)处理不平衡数据。然后,对提取的数据进行训练、验证和测试,将其分为三个等级:正常、风险和盗窃,并用性能指标对结果进行评估。结果表明,随机森林(RF)的精度-召回曲线下面积达到 90%,接收者操作特征曲线达到 78%,优于其他分类器。值得注意的是,将结果与以前的研究和基准数据集进行比较后发现,所提出的方法比其他技术取得了更好的结果。
Feature Extraction of Risk Group and Electricity Theft by using Electrical Profiles and Physical Data for Classification in the Power Utilities
Non-technical loss (NTL) is one of the problems that has been a major issue in lost revenue for many years. Electricity distributors have attempted to reduce NTL by detecting electricity theft using various methods. Some events are difficult to detect that conventional meters inspection is inadequate. Moreover, many anomaly patterns found are very complex, confusing in identifying or distinguishing what types of electricity customers are at abnormal risk or energy theft that affects NTL. This paper proposes five key feature extraction methods and six classifying electricity customers using supervised learning. The main problem was studied and collected information, including kilowatt meters, electronic meters, TOU meters, and AMR meters, which cover four customer types that were recorded in the Provincial Electricity Authority (PEA) of Thailand. An electrical profile to be extracted for in-depth analysis of the behavior of each type of electricity customer, combined with the information of physical data to help enhance and increase efficiency. All features examined the relationships in each feature using Pearson correlation and handled unbalanced data using random oversampling (ROS). Then, the extracted data has been trained, validated, and tested to classify three classes: normal, risk, and theft, where we evaluate the results with performance metrics. The results show that random forest (RF) outperforms the rest of the classifiers by achieving a precision-recall area under the curve of 90% and a receiver operating characteristic curve of 78%. Significantly, the results were compared to previous studies and benchmark datasets, which revealed that the proposed method gave better results than other techniques.