Thomas Lee Young, James Gopsill, Maria Valero, Sindre Eikevåg, Ben Hicks
{"title":"Comparing four machine learning algorithms for household non-intrusive load monitoring","authors":"Thomas Lee Young, James Gopsill, Maria Valero, Sindre Eikevåg, Ben Hicks","doi":"10.1016/j.egyai.2024.100384","DOIUrl":null,"url":null,"abstract":"<div><p>The combination of Machine Learning (ML), smart energy meters, and availability of household appliance energy profile data has opened new opportunities for Non-Intrusive Load Monitoring (NILM). However, the number of options makes it challenging in selecting optimal combinations for different energy applications, which requires studies to examine their trade-offs.</p><p>This paper contributes one such study that investigated four established ML approaches – K Nearest Neighbour (KNN), Support Vector Machine (SVM), eXtreme Gradient Boosting (XGBoost) and Convolutional Neural Network (CNN) – and their performance in classifying appliance events from Alternating Current (AC) and Root Mean Square (RMS) energy data where the sampling frequency and training dataset set size was varied (10 Hz–1 kHz and 50–2000 examples per class, respectively). The computational expense during training, testing and storage was also assessed and evaluated with reference to real-world applications.</p><p>The CNN classifier trained on AC data at 500 Hz and 11,000 examples gave the best F1-score <span><math><mfenced><mrow><mn>0</mn><mo>.</mo><mn>989</mn></mrow></mfenced></math></span> followed by the KNN classifier <span><math><mfenced><mrow><mn>0</mn><mo>.</mo><mn>940</mn></mrow></mfenced></math></span>. The storage size required by the CNN models was 3̃MB, which is very close to fitting on cost-effective embedded system microcontrollers. This would prevent high-rate data needing to be sent to the cloud as analysis could be performed on edge computing Internet-of-Things (IoT) devices.</p></div>","PeriodicalId":34138,"journal":{"name":"Energy and AI","volume":"17 ","pages":"Article 100384"},"PeriodicalIF":9.6000,"publicationDate":"2024-06-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2666546824000508/pdfft?md5=240b289da6cfc06f2620e646326a2a01&pid=1-s2.0-S2666546824000508-main.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Energy and AI","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2666546824000508","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
The combination of Machine Learning (ML), smart energy meters, and availability of household appliance energy profile data has opened new opportunities for Non-Intrusive Load Monitoring (NILM). However, the number of options makes it challenging in selecting optimal combinations for different energy applications, which requires studies to examine their trade-offs.
This paper contributes one such study that investigated four established ML approaches – K Nearest Neighbour (KNN), Support Vector Machine (SVM), eXtreme Gradient Boosting (XGBoost) and Convolutional Neural Network (CNN) – and their performance in classifying appliance events from Alternating Current (AC) and Root Mean Square (RMS) energy data where the sampling frequency and training dataset set size was varied (10 Hz–1 kHz and 50–2000 examples per class, respectively). The computational expense during training, testing and storage was also assessed and evaluated with reference to real-world applications.
The CNN classifier trained on AC data at 500 Hz and 11,000 examples gave the best F1-score followed by the KNN classifier . The storage size required by the CNN models was 3̃MB, which is very close to fitting on cost-effective embedded system microcontrollers. This would prevent high-rate data needing to be sent to the cloud as analysis could be performed on edge computing Internet-of-Things (IoT) devices.