{"title":"多维时间序列的时空符号化","authors":"S. Hidaka, Chen Yu","doi":"10.1109/ICDMW.2010.86","DOIUrl":null,"url":null,"abstract":"The present study proposes a new symbolization algorithm for multidimensional time series. We view temporal sequences as observed data generated by a dynamical system, and therefore the goal of symbolization is to estimate symbolic sequences that minimize loss of information, which is called generating partition in nonlinear physics. In order to utilize the theoretical property of symbol dynamics in data mining, our algorithm estimates symbols on multivariate time series by integrating both spatial and temporal information and selecting those dimensions in multidimensional time series containing useful information. Probabilistic symbolic sequences derived from our symbolization method can be used in various supervised and unsupervised data-mining tasks. To demonstrate this, the algorithm is evaluated by applying it to both simulated data and a real-world dataset. In both cases, the new algorithm outperforms its alternative approaches.","PeriodicalId":170201,"journal":{"name":"2010 IEEE International Conference on Data Mining Workshops","volume":"04 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2010-12-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":"{\"title\":\"Spatio-Temporal Symbolization of Multidimensional Time Series\",\"authors\":\"S. Hidaka, Chen Yu\",\"doi\":\"10.1109/ICDMW.2010.86\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The present study proposes a new symbolization algorithm for multidimensional time series. We view temporal sequences as observed data generated by a dynamical system, and therefore the goal of symbolization is to estimate symbolic sequences that minimize loss of information, which is called generating partition in nonlinear physics. In order to utilize the theoretical property of symbol dynamics in data mining, our algorithm estimates symbols on multivariate time series by integrating both spatial and temporal information and selecting those dimensions in multidimensional time series containing useful information. Probabilistic symbolic sequences derived from our symbolization method can be used in various supervised and unsupervised data-mining tasks. To demonstrate this, the algorithm is evaluated by applying it to both simulated data and a real-world dataset. In both cases, the new algorithm outperforms its alternative approaches.\",\"PeriodicalId\":170201,\"journal\":{\"name\":\"2010 IEEE International Conference on Data Mining Workshops\",\"volume\":\"04 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2010-12-13\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"4\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2010 IEEE International Conference on Data Mining Workshops\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICDMW.2010.86\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2010 IEEE International Conference on Data Mining Workshops","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICDMW.2010.86","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Spatio-Temporal Symbolization of Multidimensional Time Series
The present study proposes a new symbolization algorithm for multidimensional time series. We view temporal sequences as observed data generated by a dynamical system, and therefore the goal of symbolization is to estimate symbolic sequences that minimize loss of information, which is called generating partition in nonlinear physics. In order to utilize the theoretical property of symbol dynamics in data mining, our algorithm estimates symbols on multivariate time series by integrating both spatial and temporal information and selecting those dimensions in multidimensional time series containing useful information. Probabilistic symbolic sequences derived from our symbolization method can be used in various supervised and unsupervised data-mining tasks. To demonstrate this, the algorithm is evaluated by applying it to both simulated data and a real-world dataset. In both cases, the new algorithm outperforms its alternative approaches.