A robust approach for outlier imputation: Singular spectrum decomposition

Maryam Movahedifar, Hossein Hassani, M. Yarmohammadi, M. Kalantari, Rangan Gupta
{"title":"A robust approach for outlier imputation: Singular spectrum decomposition","authors":"Maryam Movahedifar, Hossein Hassani, M. Yarmohammadi, M. Kalantari, Rangan Gupta","doi":"10.1080/23737484.2021.2017810","DOIUrl":null,"url":null,"abstract":"Abstract Singular spectrum analysis (SSA) is a nonparametric method for separating time series data into a sum of small numbers of interpretable components (signal + noise). One of the steps of the SSA method, which is referenced to Embedding, is extremely sensitive to contamination of outliers which are often founded in time series analysis. To reduce the effect of outliers, SSA based on Singular Spectrum Decomposition (SSD) method is proposed. In this article, the ability of SSA based on SSD and basic SSA are compared in time series reconstruction in the presence of outliers. It is noteworthy that the matrix norm used in Basic SSA is the Frobenius norm or L 2-norm. There is a newer version of SSA that is based on L 1-norm and called L 1-SSA. It was confirmed that L 1-SSA is robust against outliers. In this regard, this research is also introduced a new version of SSD based on L 1-norm which is called L 1-SSD. A wide empirical study on both simulated and real data verifies the efficiency of basic SSA based on SSD and L 1-norm in reconstructing the time series where polluted by outliers.","PeriodicalId":36561,"journal":{"name":"Communications in Statistics Case Studies Data Analysis and Applications","volume":"34 1","pages":"234 - 250"},"PeriodicalIF":0.0000,"publicationDate":"2021-12-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Communications in Statistics Case Studies Data Analysis and Applications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1080/23737484.2021.2017810","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"Mathematics","Score":null,"Total":0}
引用次数: 1

Abstract

Abstract Singular spectrum analysis (SSA) is a nonparametric method for separating time series data into a sum of small numbers of interpretable components (signal + noise). One of the steps of the SSA method, which is referenced to Embedding, is extremely sensitive to contamination of outliers which are often founded in time series analysis. To reduce the effect of outliers, SSA based on Singular Spectrum Decomposition (SSD) method is proposed. In this article, the ability of SSA based on SSD and basic SSA are compared in time series reconstruction in the presence of outliers. It is noteworthy that the matrix norm used in Basic SSA is the Frobenius norm or L 2-norm. There is a newer version of SSA that is based on L 1-norm and called L 1-SSA. It was confirmed that L 1-SSA is robust against outliers. In this regard, this research is also introduced a new version of SSD based on L 1-norm which is called L 1-SSD. A wide empirical study on both simulated and real data verifies the efficiency of basic SSA based on SSD and L 1-norm in reconstructing the time series where polluted by outliers.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
一种鲁棒的离群值归算方法:奇异谱分解
奇异谱分析(SSA)是一种将时间序列数据分离为少量可解释分量(信号+噪声)之和的非参数方法。SSA方法的其中一个步骤,即嵌入,对时间序列分析中经常出现的异常点污染非常敏感。为了降低异常值的影响,提出了基于奇异谱分解(SSD)的SSA方法。本文比较了基于SSD的SSA和基本SSA在存在异常点的情况下重建时间序列的能力。值得注意的是,在Basic SSA中使用的矩阵范数是Frobenius范数或l2范数。有一个更新版本的SSA是基于l1规范的,称为l1 -SSA。结果表明,l1 - ssa对异常值具有鲁棒性。对此,本研究还介绍了一种基于l1规范的新型SSD,称为l1 -SSD。通过对模拟数据和实际数据的大量实证研究,验证了基于SSD和L -范数的基本SSA在被异常值污染的时间序列重构中的有效性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
CiteScore
1.00
自引率
0.00%
发文量
29
期刊最新文献
The reciprocal elastic net Detection of influential observations in high-dimensional survival data Small area estimation of trends in household living standards in Uganda using a GMANOVA-MANOVA model and repeated surveys Applications of a new loss and cost-based process capability index to electronic industries A methodological framework for imputing missing spatial data at an aggregate level and guaranteeing data privacy: the AFFINITY method; implementation in the context of the official spatial Greek census data
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1