Ratai: recurrent autoencoder with imputation units and temporal attention for multivariate time series imputation

IF 13.9 2区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Artificial Intelligence Review Pub Date : 2024-12-20 DOI:10.1007/s10462-024-11039-z

Xiaochen Lai, Yachen Yao, Jichong Mu, Wei Lu, Liyong Zhang

{"title":"Ratai: recurrent autoencoder with imputation units and temporal attention for multivariate time series imputation","authors":"Xiaochen Lai, Yachen Yao, Jichong Mu, Wei Lu, Liyong Zhang","doi":"10.1007/s10462-024-11039-z","DOIUrl":null,"url":null,"abstract":"<div><p>Multivariate time series is ubiquitous in real-world applications, yet it often suffers from missing values that impede downstream analytical tasks. In this paper, we introduce the Long Short-Term Memory Network based Recurrent Autoencoder with Imputation Units and Temporal Attention Imputation Model (RATAI), tailored for multivariate time series. RATAI is designed to address certain limitations of traditional RNN-based imputation methods, which often focus on predictive modeling to estimate missing values, sometimes neglecting the contextual impact of observed data at and beyond the target time step. Drawing inspiration from Kalman smoothing, which effectively integrates past and future information to refine state estimations, RATAI aims to extract feature representations from time series data and use them to reconstruct a complete time series, thus overcoming the shortcomings of existing approaches. It employs a dual-stage imputation process: the encoder utilizes temporal information and attribute correlations to predict and impute missing values, and extract feature representation of imputed time series. Subsequently, the decoder reconstructs the series from the feature representation, and the reconstructed values are used as the final imputation values. Additionally, RATAI incorporates a temporal attention mechanism, allowing the decoder to focus on highly relevant inputs during reconstruction. This model can be trained directly using data that contains missing values, avoiding the misleading effects on model training that can arise from setting initial values for missing values. Our experiments demonstrate that RATAI outperforms benchmark models in multivariate time series imputation.</p></div>","PeriodicalId":8449,"journal":{"name":"Artificial Intelligence Review","volume":"58 2","pages":""},"PeriodicalIF":13.9000,"publicationDate":"2024-12-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10462-024-11039-z.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Artificial Intelligence Review","FirstCategoryId":"94","ListUrlMain":"https://link.springer.com/article/10.1007/s10462-024-11039-z","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

Abstract

Multivariate time series is ubiquitous in real-world applications, yet it often suffers from missing values that impede downstream analytical tasks. In this paper, we introduce the Long Short-Term Memory Network based Recurrent Autoencoder with Imputation Units and Temporal Attention Imputation Model (RATAI), tailored for multivariate time series. RATAI is designed to address certain limitations of traditional RNN-based imputation methods, which often focus on predictive modeling to estimate missing values, sometimes neglecting the contextual impact of observed data at and beyond the target time step. Drawing inspiration from Kalman smoothing, which effectively integrates past and future information to refine state estimations, RATAI aims to extract feature representations from time series data and use them to reconstruct a complete time series, thus overcoming the shortcomings of existing approaches. It employs a dual-stage imputation process: the encoder utilizes temporal information and attribute correlations to predict and impute missing values, and extract feature representation of imputed time series. Subsequently, the decoder reconstructs the series from the feature representation, and the reconstructed values are used as the final imputation values. Additionally, RATAI incorporates a temporal attention mechanism, allowing the decoder to focus on highly relevant inputs during reconstruction. This model can be trained directly using data that contains missing values, avoiding the misleading effects on model training that can arise from setting initial values for missing values. Our experiments demonstrate that RATAI outperforms benchmark models in multivariate time series imputation.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Ratai：用于多变量时间序列估算的带有估算单元和时间注意力的递归自动编码器

多变量时间序列在现实世界的应用中无处不在，但它经常受到缺失值的影响，从而妨碍下游分析任务。在本文中，我们介绍了专为多变量时间序列量身定制的基于长短期记忆网络的递归自动编码器（Recurrent Autoencoder）与输入单元和时间注意力输入模型（RATAI）。RATAI 的设计目的是解决基于 RNN 的传统估算方法的某些局限性，这些方法通常侧重于预测建模来估算缺失值，有时会忽略目标时间步及目标时间步以外的观测数据的上下文影响。卡尔曼平滑法能有效整合过去和未来的信息以完善状态估计，RATAI 从卡尔曼平滑法中汲取灵感，旨在从时间序列数据中提取特征表征，并利用它们重建完整的时间序列，从而克服现有方法的不足。它采用了双阶段估算过程：编码器利用时间信息和属性相关性来预测和估算缺失值，并提取估算时间序列的特征表示。随后，解码器根据特征表示重建序列，并将重建值用作最终估算值。此外，RATAI 还包含一个时间关注机制，允许解码器在重建过程中关注高度相关的输入。该模型可以直接使用包含缺失值的数据进行训练，避免了为缺失值设置初始值可能对模型训练产生的误导作用。我们的实验证明，RATAI 在多变量时间序列估算方面优于基准模型。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Artificial Intelligence Review 工程技术-计算机：人工智能

CiteScore

22.00

自引率

3.30%

发文量

194

审稿时长

5.3 months

期刊介绍： Artificial Intelligence Review, a fully open access journal, publishes cutting-edge research in artificial intelligence and cognitive science. It features critical evaluations of applications, techniques, and algorithms, providing a platform for both researchers and application developers. The journal includes refereed survey and tutorial articles, along with reviews and commentary on significant developments in the field.