{"title":"利用聚类技术突出时间序列数据尺寸缩减的突出特征","authors":"Anupama Jawale, Ganesh M. Magar","doi":"10.1109/IBSSC56953.2022.10037499","DOIUrl":null,"url":null,"abstract":"There exist many techniques for feature selection and reduction to reduce dimensions of the large sensor dataset. For real time data processing, compressed and prominent feature of highest significance is desirable for efficient way of resource optimization and computation cost reduction. The goal of this research study is to highlight most significant feature of the dataset and to generate compressed time series by highlighting it. The highlighted feature of accelerometer sensor dataset is extracted, and a more compressed form of time series is generated using statistical and clustering methods like k-means, Partition around Medoids (PAM), Max-Value, 95% Confidence Interval values and Ceil Function calculations. As a result, around 80 % reduction in dataset with the similar pattern as of original time series is achieved. The original time series is compared with generated output series using Dynamic Time Warping method, where, we have obtained normalized error distance of 0.02. (Accuracy 98%)","PeriodicalId":426897,"journal":{"name":"2022 IEEE Bombay Section Signature Conference (IBSSC)","volume":"3 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-12-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Highlighting Prominent Features for Size Reduction in Time Series Data using Clustering Techniques\",\"authors\":\"Anupama Jawale, Ganesh M. Magar\",\"doi\":\"10.1109/IBSSC56953.2022.10037499\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"There exist many techniques for feature selection and reduction to reduce dimensions of the large sensor dataset. For real time data processing, compressed and prominent feature of highest significance is desirable for efficient way of resource optimization and computation cost reduction. The goal of this research study is to highlight most significant feature of the dataset and to generate compressed time series by highlighting it. The highlighted feature of accelerometer sensor dataset is extracted, and a more compressed form of time series is generated using statistical and clustering methods like k-means, Partition around Medoids (PAM), Max-Value, 95% Confidence Interval values and Ceil Function calculations. As a result, around 80 % reduction in dataset with the similar pattern as of original time series is achieved. The original time series is compared with generated output series using Dynamic Time Warping method, where, we have obtained normalized error distance of 0.02. (Accuracy 98%)\",\"PeriodicalId\":426897,\"journal\":{\"name\":\"2022 IEEE Bombay Section Signature Conference (IBSSC)\",\"volume\":\"3 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-12-08\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 IEEE Bombay Section Signature Conference (IBSSC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/IBSSC56953.2022.10037499\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE Bombay Section Signature Conference (IBSSC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IBSSC56953.2022.10037499","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
摘要
为了对大型传感器数据集进行降维,存在许多特征选择和降维技术。在实时数据处理中,最重要的压缩和突出特征是优化资源和降低计算成本的有效途径。本研究的目的是突出数据集的最重要特征,并通过突出数据集来生成压缩时间序列。提取加速度计传感器数据集的突出特征,并使用k-means, Partition around mediids (PAM), Max-Value, 95%置信区间值和Ceil函数计算等统计和聚类方法生成更压缩的时间序列。结果表明,具有与原始时间序列相似模式的数据集的概率降低了80%左右。使用Dynamic time Warping方法将原始时间序列与生成的输出序列进行比较,得到归一化误差距离为0.02。(精度为98%)
Highlighting Prominent Features for Size Reduction in Time Series Data using Clustering Techniques
There exist many techniques for feature selection and reduction to reduce dimensions of the large sensor dataset. For real time data processing, compressed and prominent feature of highest significance is desirable for efficient way of resource optimization and computation cost reduction. The goal of this research study is to highlight most significant feature of the dataset and to generate compressed time series by highlighting it. The highlighted feature of accelerometer sensor dataset is extracted, and a more compressed form of time series is generated using statistical and clustering methods like k-means, Partition around Medoids (PAM), Max-Value, 95% Confidence Interval values and Ceil Function calculations. As a result, around 80 % reduction in dataset with the similar pattern as of original time series is achieved. The original time series is compared with generated output series using Dynamic Time Warping method, where, we have obtained normalized error distance of 0.02. (Accuracy 98%)