多元时间序列中因果关系的识别

Sujoy Roy Chowdhury, Serene Banerjee, Ranjani H. G., Chaitanya Kapoor
{"title":"多元时间序列中因果关系的识别","authors":"Sujoy Roy Chowdhury, Serene Banerjee, Ranjani H. G., Chaitanya Kapoor","doi":"10.1145/3564121.3564810","DOIUrl":null,"url":null,"abstract":"Telecommunications networks operate on enormous amount of time-series data, and often exhibit anomalous trends in their behaviour. This is caused due to increased latency and reduced throughput in the network which inevitably leads to poor customer experience [17]. One of the common problems in machine learning in the telecom domain is to predict anomalous behaviour ahead of time. Whilst this is a well-researched problem, there is far less work done in identifying causal structures from the temporal patterns of various Key Performance Indicators (KPI) in the telecom network. The ability to identify causal structures from anomalous behaviours would allow more effective intervention and generalisation of different environments and networks. The tutorial is focused on discussing existing frameworks for establishing causal discovery for time-series data sets. In this hands-on tutorial, we will be covering at least 3 state-of-the-art (SOTA) methods on causal time series analysis including Granger causality[8],convergent cross-mapping [4, 10, 15], Peter-Clark Momentary Conditional Independence (PC-MCI) [6, 14] and Temporal Causal discovery framework (TCDF)[11]. The need for a causation analysis[7], beyond correlation will also be explained using publicly available datasets, such as, double pendulum dataset [1]. The state-of-art methods are chosen to cover various aspects of the causal time series analysis, such as modelling the non-linearity (non-linear Granger Causality), attempting the problem from chaos and dynamic systems (CCM), information-theoretic approaches (PC-MCI, or having a data-driven approach (TCDF). State-of-the-art survey papers [2, 12] show that none of the methods can be said to be ideal for all the possible time series and there are relative advantages and shortcomings for each of these methods.","PeriodicalId":166150,"journal":{"name":"Proceedings of the Second International Conference on AI-ML Systems","volume":"410 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-10-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Identification of Causal Dependencies in Multivariate Time Series\",\"authors\":\"Sujoy Roy Chowdhury, Serene Banerjee, Ranjani H. G., Chaitanya Kapoor\",\"doi\":\"10.1145/3564121.3564810\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Telecommunications networks operate on enormous amount of time-series data, and often exhibit anomalous trends in their behaviour. This is caused due to increased latency and reduced throughput in the network which inevitably leads to poor customer experience [17]. One of the common problems in machine learning in the telecom domain is to predict anomalous behaviour ahead of time. Whilst this is a well-researched problem, there is far less work done in identifying causal structures from the temporal patterns of various Key Performance Indicators (KPI) in the telecom network. The ability to identify causal structures from anomalous behaviours would allow more effective intervention and generalisation of different environments and networks. The tutorial is focused on discussing existing frameworks for establishing causal discovery for time-series data sets. In this hands-on tutorial, we will be covering at least 3 state-of-the-art (SOTA) methods on causal time series analysis including Granger causality[8],convergent cross-mapping [4, 10, 15], Peter-Clark Momentary Conditional Independence (PC-MCI) [6, 14] and Temporal Causal discovery framework (TCDF)[11]. The need for a causation analysis[7], beyond correlation will also be explained using publicly available datasets, such as, double pendulum dataset [1]. The state-of-art methods are chosen to cover various aspects of the causal time series analysis, such as modelling the non-linearity (non-linear Granger Causality), attempting the problem from chaos and dynamic systems (CCM), information-theoretic approaches (PC-MCI, or having a data-driven approach (TCDF). State-of-the-art survey papers [2, 12] show that none of the methods can be said to be ideal for all the possible time series and there are relative advantages and shortcomings for each of these methods.\",\"PeriodicalId\":166150,\"journal\":{\"name\":\"Proceedings of the Second International Conference on AI-ML Systems\",\"volume\":\"410 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-10-12\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the Second International Conference on AI-ML Systems\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3564121.3564810\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the Second International Conference on AI-ML Systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3564121.3564810","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

电信网络在大量的时间序列数据上运行,其行为经常表现出反常的趋势。这是由于网络中的延迟增加和吞吐量降低而导致的,这不可避免地会导致糟糕的客户体验[17]。电信领域机器学习的常见问题之一是提前预测异常行为。虽然这是一个研究得很好的问题,但在从电信网络中各种关键绩效指标(KPI)的时间模式中识别因果结构方面所做的工作要少得多。从异常行为中识别因果结构的能力将允许对不同的环境和网络进行更有效的干预和概括。本教程的重点是讨论为时间序列数据集建立因果发现的现有框架。在本实践教程中,我们将涵盖至少3种最先进的(SOTA)因果时间序列分析方法,包括格兰杰因果关系[8],收敛交叉映射[4,10,15],彼得-克拉克瞬间条件独立(PC-MCI)[6,14]和时间因果发现框架(TCDF)[11]。除了相关性之外,对因果分析[7]的需求也将使用公开可用的数据集进行解释,例如,双摆数据集[1]。选择最先进的方法来涵盖因果时间序列分析的各个方面,例如非线性建模(非线性格兰杰因果关系),尝试从混沌和动态系统(CCM),信息理论方法(PC-MCI)或数据驱动方法(TCDF)中解决问题。最新的调查论文[2,12]表明,没有一种方法可以说对所有可能的时间序列都是理想的,每种方法都有相对的优点和缺点。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Identification of Causal Dependencies in Multivariate Time Series
Telecommunications networks operate on enormous amount of time-series data, and often exhibit anomalous trends in their behaviour. This is caused due to increased latency and reduced throughput in the network which inevitably leads to poor customer experience [17]. One of the common problems in machine learning in the telecom domain is to predict anomalous behaviour ahead of time. Whilst this is a well-researched problem, there is far less work done in identifying causal structures from the temporal patterns of various Key Performance Indicators (KPI) in the telecom network. The ability to identify causal structures from anomalous behaviours would allow more effective intervention and generalisation of different environments and networks. The tutorial is focused on discussing existing frameworks for establishing causal discovery for time-series data sets. In this hands-on tutorial, we will be covering at least 3 state-of-the-art (SOTA) methods on causal time series analysis including Granger causality[8],convergent cross-mapping [4, 10, 15], Peter-Clark Momentary Conditional Independence (PC-MCI) [6, 14] and Temporal Causal discovery framework (TCDF)[11]. The need for a causation analysis[7], beyond correlation will also be explained using publicly available datasets, such as, double pendulum dataset [1]. The state-of-art methods are chosen to cover various aspects of the causal time series analysis, such as modelling the non-linearity (non-linear Granger Causality), attempting the problem from chaos and dynamic systems (CCM), information-theoretic approaches (PC-MCI, or having a data-driven approach (TCDF). State-of-the-art survey papers [2, 12] show that none of the methods can be said to be ideal for all the possible time series and there are relative advantages and shortcomings for each of these methods.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
A Hybrid Planning System for Smart Charging of Electric Fleets CluSpa: Computation Reduction in CNN Inference by exploiting Clustering and Sparsity Acceleration-aware, Retraining-free Evolutionary Pruning for Automated Fitment of Deep Learning Models on Edge Devices Patch-wise Features for Blur Image Classification Identification of Causal Dependencies in Multivariate Time Series
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1