Kalman filter with impulse noised outliers: a robust sequential algorithm to filter data with a large number of outliers

IF 1 4区 数学 Q4 MATHEMATICAL & COMPUTATIONAL BIOLOGY International Journal of Biostatistics Pub Date : 2024-04-16 DOI:10.1515/ijb-2023-0065
Bertrand Cloez, Bénédicte Fontez, Eliel González-García, Isabelle Sanchez
{"title":"Kalman filter with impulse noised outliers: a robust sequential algorithm to filter data with a large number of outliers","authors":"Bertrand Cloez, Bénédicte Fontez, Eliel González-García, Isabelle Sanchez","doi":"10.1515/ijb-2023-0065","DOIUrl":null,"url":null,"abstract":"Impulse noised outliers are data points that differ significantly from other observations. They are generally removed from the data set through local regression or the Kalman filter algorithm. However, these methods, or their generalizations, are not well suited when the number of outliers is of the same order as the number of low-noise data (often called <jats:italic>nominal measurement</jats:italic>). In this article, we propose a new model for impulsed noise outliers. It is based on a hierarchical model and a simple linear Gaussian process as with the Kalman Filter. We present a fast forward-backward algorithm to filter and smooth sequential data and which also detects these outliers. We compare the robustness and efficiency of this algorithm with classical methods. Finally, we apply this method on a real data set from a Walk Over Weighing system admitting around 60 % of outliers. For this application, we further develop an (explicit) EM algorithm to calibrate some algorithm parameters.","PeriodicalId":49058,"journal":{"name":"International Journal of Biostatistics","volume":"4 1","pages":""},"PeriodicalIF":1.0000,"publicationDate":"2024-04-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Biostatistics","FirstCategoryId":"100","ListUrlMain":"https://doi.org/10.1515/ijb-2023-0065","RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"MATHEMATICAL & COMPUTATIONAL BIOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

Impulse noised outliers are data points that differ significantly from other observations. They are generally removed from the data set through local regression or the Kalman filter algorithm. However, these methods, or their generalizations, are not well suited when the number of outliers is of the same order as the number of low-noise data (often called nominal measurement). In this article, we propose a new model for impulsed noise outliers. It is based on a hierarchical model and a simple linear Gaussian process as with the Kalman Filter. We present a fast forward-backward algorithm to filter and smooth sequential data and which also detects these outliers. We compare the robustness and efficiency of this algorithm with classical methods. Finally, we apply this method on a real data set from a Walk Over Weighing system admitting around 60 % of outliers. For this application, we further develop an (explicit) EM algorithm to calibrate some algorithm parameters.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
具有脉冲噪声离群值的卡尔曼滤波器:过滤大量离群值数据的稳健顺序算法
脉冲噪声离群值是指与其他观测值有显著差异的数据点。通常通过局部回归或卡尔曼滤波算法将其从数据集中剔除。然而,当离群值的数量与低噪声数据(通常称为标称测量)的数量同阶时,这些方法或其广义方法就不太适用了。在本文中,我们提出了一种针对脉冲噪声离群值的新模型。它与卡尔曼滤波器一样,基于分层模型和简单的线性高斯过程。我们提出了一种快速的前向后向算法,用于过滤和平滑连续数据,并检测这些离群值。我们将该算法的鲁棒性和效率与经典方法进行了比较。最后,我们将该方法应用于一个来自步行称重系统的真实数据集,该数据集含有约 60% 的异常值。针对这一应用,我们进一步开发了一种(显式)EM 算法来校准一些算法参数。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
International Journal of Biostatistics
International Journal of Biostatistics MATHEMATICAL & COMPUTATIONAL BIOLOGY-STATISTICS & PROBABILITY
CiteScore
2.10
自引率
8.30%
发文量
28
审稿时长
>12 weeks
期刊介绍: The International Journal of Biostatistics (IJB) seeks to publish new biostatistical models and methods, new statistical theory, as well as original applications of statistical methods, for important practical problems arising from the biological, medical, public health, and agricultural sciences with an emphasis on semiparametric methods. Given many alternatives to publish exist within biostatistics, IJB offers a place to publish for research in biostatistics focusing on modern methods, often based on machine-learning and other data-adaptive methodologies, as well as providing a unique reading experience that compels the author to be explicit about the statistical inference problem addressed by the paper. IJB is intended that the journal cover the entire range of biostatistics, from theoretical advances to relevant and sensible translations of a practical problem into a statistical framework. Electronic publication also allows for data and software code to be appended, and opens the door for reproducible research allowing readers to easily replicate analyses described in a paper. Both original research and review articles will be warmly received, as will articles applying sound statistical methods to practical problems.
期刊最新文献
Detecting differentially expressed genes from RNA-seq data using fuzzy clustering Comments on “sensitivity of estimands in clinical trials with imperfect compliance” by Chen and Heitjan Random forests for survival data: which methods work best and under what conditions? Kalman filter with impulse noised outliers: a robust sequential algorithm to filter data with a large number of outliers Ensemble learning methods of inference for spatially stratified infectious disease systems
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1