多维时间序列的时空符号化

S. Hidaka, Chen Yu
{"title":"多维时间序列的时空符号化","authors":"S. Hidaka, Chen Yu","doi":"10.1109/ICDMW.2010.86","DOIUrl":null,"url":null,"abstract":"The present study proposes a new symbolization algorithm for multidimensional time series. We view temporal sequences as observed data generated by a dynamical system, and therefore the goal of symbolization is to estimate symbolic sequences that minimize loss of information, which is called generating partition in nonlinear physics. In order to utilize the theoretical property of symbol dynamics in data mining, our algorithm estimates symbols on multivariate time series by integrating both spatial and temporal information and selecting those dimensions in multidimensional time series containing useful information. Probabilistic symbolic sequences derived from our symbolization method can be used in various supervised and unsupervised data-mining tasks. To demonstrate this, the algorithm is evaluated by applying it to both simulated data and a real-world dataset. In both cases, the new algorithm outperforms its alternative approaches.","PeriodicalId":170201,"journal":{"name":"2010 IEEE International Conference on Data Mining Workshops","volume":"04 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2010-12-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":"{\"title\":\"Spatio-Temporal Symbolization of Multidimensional Time Series\",\"authors\":\"S. Hidaka, Chen Yu\",\"doi\":\"10.1109/ICDMW.2010.86\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The present study proposes a new symbolization algorithm for multidimensional time series. We view temporal sequences as observed data generated by a dynamical system, and therefore the goal of symbolization is to estimate symbolic sequences that minimize loss of information, which is called generating partition in nonlinear physics. In order to utilize the theoretical property of symbol dynamics in data mining, our algorithm estimates symbols on multivariate time series by integrating both spatial and temporal information and selecting those dimensions in multidimensional time series containing useful information. Probabilistic symbolic sequences derived from our symbolization method can be used in various supervised and unsupervised data-mining tasks. To demonstrate this, the algorithm is evaluated by applying it to both simulated data and a real-world dataset. In both cases, the new algorithm outperforms its alternative approaches.\",\"PeriodicalId\":170201,\"journal\":{\"name\":\"2010 IEEE International Conference on Data Mining Workshops\",\"volume\":\"04 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2010-12-13\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"4\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2010 IEEE International Conference on Data Mining Workshops\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICDMW.2010.86\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2010 IEEE International Conference on Data Mining Workshops","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICDMW.2010.86","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4

摘要

本文提出了一种新的多维时间序列符号化算法。我们将时间序列视为由动态系统产生的观测数据,因此符号化的目标是估计信息损失最小的符号序列,这在非线性物理中称为生成分区。为了利用符号动力学在数据挖掘中的理论特性,我们的算法通过整合空间和时间信息,并在多维时间序列中选择包含有用信息的维度来估计多变量时间序列上的符号。由我们的符号化方法得到的概率符号序列可用于各种监督和无监督数据挖掘任务。为了证明这一点,该算法通过将其应用于模拟数据和现实世界数据集来评估。在这两种情况下,新算法都优于其替代方法。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Spatio-Temporal Symbolization of Multidimensional Time Series
The present study proposes a new symbolization algorithm for multidimensional time series. We view temporal sequences as observed data generated by a dynamical system, and therefore the goal of symbolization is to estimate symbolic sequences that minimize loss of information, which is called generating partition in nonlinear physics. In order to utilize the theoretical property of symbol dynamics in data mining, our algorithm estimates symbols on multivariate time series by integrating both spatial and temporal information and selecting those dimensions in multidimensional time series containing useful information. Probabilistic symbolic sequences derived from our symbolization method can be used in various supervised and unsupervised data-mining tasks. To demonstrate this, the algorithm is evaluated by applying it to both simulated data and a real-world dataset. In both cases, the new algorithm outperforms its alternative approaches.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Quantum Path Integral Inspired Query Sequence Suggestion for User Search Task Simplification PTCR-Miner: Progressive Temporal Class Rule Mining for Multivariate Temporal Data Classification Bridging Folksonomies and Domain Ontologies: Getting Out Non-taxonomic Relations SIMPLE: Interactive Analytics on Patent Data Parallel EM-Clustering: Fast Convergence by Asynchronous Model Updates
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1