{"title":"Exploiting Breadth in Energy Datasets for Automated Device Identification","authors":"S. Barker, Kyle Morrison, Tucker Williams","doi":"10.1109/SmartGridComm.2019.8909725","DOIUrl":null,"url":null,"abstract":"The recent explosion of interest in smart building energy-efficiency has led to a proliferation of public energy datasets. Most of these datasets focus on depth (i.e., many devices in a few buildings) as opposed to breadth (e.g., a few devices in many buildings), and thus most smart building algorithms are evaluated on depth-oriented datasets. We argue that increasing data breadth conveys important benefits that are not easily achieved by even a large quantity of deep data. As an illustrative case study, we consider the problem of classifying previously unseen appliances using an off-the-shelf classifier trained on known instances of other devices. Our experiments on multiple real-world datasets (both depth- and breadth-oriented) demonstrate significant and sustained benefits from increased data breadth, and point to the importance of incorporating greater breadth into similar techniques that rely on generalized electrical load models.","PeriodicalId":377150,"journal":{"name":"2019 IEEE International Conference on Communications, Control, and Computing Technologies for Smart Grids (SmartGridComm)","volume":"22 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 IEEE International Conference on Communications, Control, and Computing Technologies for Smart Grids (SmartGridComm)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SmartGridComm.2019.8909725","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
The recent explosion of interest in smart building energy-efficiency has led to a proliferation of public energy datasets. Most of these datasets focus on depth (i.e., many devices in a few buildings) as opposed to breadth (e.g., a few devices in many buildings), and thus most smart building algorithms are evaluated on depth-oriented datasets. We argue that increasing data breadth conveys important benefits that are not easily achieved by even a large quantity of deep data. As an illustrative case study, we consider the problem of classifying previously unseen appliances using an off-the-shelf classifier trained on known instances of other devices. Our experiments on multiple real-world datasets (both depth- and breadth-oriented) demonstrate significant and sustained benefits from increased data breadth, and point to the importance of incorporating greater breadth into similar techniques that rely on generalized electrical load models.