Attila Egri, I. Horváth, Ferenc Kovács, Roland Molontay, K. Varga
{"title":"Cross-correlation based clustering and dimension reduction of multivariate time series","authors":"Attila Egri, I. Horváth, Ferenc Kovács, Roland Molontay, K. Varga","doi":"10.1109/ines.2017.8118563","DOIUrl":null,"url":null,"abstract":"In this paper, we investigate dimension reduction possibilities of multidimensional time series data and we introduce a graph based clustering approach using the cross-correlation between time series. The proposed solution consists of two main steps: introducing a novel similarity measure for measuring cross-correlations and a graph-based clustering technique. These two parts are both compared to existing techniques, including noise tolerance and our solution performs better in a noisy environment. The proposed solution is applied to performance metrics of a specific data processing system in order to identify and efficiently visualize connections among the collected metrics. The introduced method provides a more balanced clustering than classic ones, and it is suitable to reveal dependencies and connections among performance metrics time series data.","PeriodicalId":344933,"journal":{"name":"2017 IEEE 21st International Conference on Intelligent Engineering Systems (INES)","volume":"9 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"8","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 IEEE 21st International Conference on Intelligent Engineering Systems (INES)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ines.2017.8118563","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 8
Abstract
In this paper, we investigate dimension reduction possibilities of multidimensional time series data and we introduce a graph based clustering approach using the cross-correlation between time series. The proposed solution consists of two main steps: introducing a novel similarity measure for measuring cross-correlations and a graph-based clustering technique. These two parts are both compared to existing techniques, including noise tolerance and our solution performs better in a noisy environment. The proposed solution is applied to performance metrics of a specific data processing system in order to identify and efficiently visualize connections among the collected metrics. The introduced method provides a more balanced clustering than classic ones, and it is suitable to reveal dependencies and connections among performance metrics time series data.