Modeling serially correlated heavy-tailed data with some missing response values using stochastic EM algorithm

U. Nduka, I. Iwueze, C. Nwaigwe
{"title":"Modeling serially correlated heavy-tailed data with some missing response values using stochastic EM algorithm","authors":"U. Nduka, I. Iwueze, C. Nwaigwe","doi":"10.1080/23737484.2021.2017808","DOIUrl":null,"url":null,"abstract":"Abstract The linear regression model is a popular tool used by almost all in different areas of research. The model relies mainly on the assumption of uncorrelated errors from a Gaussian distribution. However, many datasets in practice violate this basic assumption, making inference in such cases invalid. Therefore, the linear regression model with structured errors driven by heavy-tailed innovations are preferred in practice. Another issue that occur frequently with real-life data is missing values, due to some reasons such as system breakdown and labor unrest. Despite the challenge these two issues pose to practitioners, there is scarcity of literature where they have jointly been studied. Hence, this article considers these two issues jointly, for the first time, and develops an efficient parameter estimation procedure for Student’s-t autoregressive regression model for time series with missing values of the response variable. The procedure is based on a stochastic approximation expectation–maximization algorithm coupled with a Markov chain Monte Carlo technique. The procedure gives efficient closed-form expressions for the parameters of the model, which are very easy to compute. Simulations and real-life data analysis show that the method is efficient for use with incomplete time series data.","PeriodicalId":36561,"journal":{"name":"Communications in Statistics Case Studies Data Analysis and Applications","volume":"58 1","pages":"81 - 104"},"PeriodicalIF":0.0000,"publicationDate":"2021-12-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Communications in Statistics Case Studies Data Analysis and Applications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1080/23737484.2021.2017808","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"Mathematics","Score":null,"Total":0}
引用次数: 0

Abstract

Abstract The linear regression model is a popular tool used by almost all in different areas of research. The model relies mainly on the assumption of uncorrelated errors from a Gaussian distribution. However, many datasets in practice violate this basic assumption, making inference in such cases invalid. Therefore, the linear regression model with structured errors driven by heavy-tailed innovations are preferred in practice. Another issue that occur frequently with real-life data is missing values, due to some reasons such as system breakdown and labor unrest. Despite the challenge these two issues pose to practitioners, there is scarcity of literature where they have jointly been studied. Hence, this article considers these two issues jointly, for the first time, and develops an efficient parameter estimation procedure for Student’s-t autoregressive regression model for time series with missing values of the response variable. The procedure is based on a stochastic approximation expectation–maximization algorithm coupled with a Markov chain Monte Carlo technique. The procedure gives efficient closed-form expressions for the parameters of the model, which are very easy to compute. Simulations and real-life data analysis show that the method is efficient for use with incomplete time series data.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
利用随机电磁算法对响应值缺失的重尾序列数据进行建模
线性回归模型是几乎所有研究领域都在使用的一种流行工具。该模型主要依赖于高斯分布中不相关误差的假设。然而,在实践中,许多数据集违背了这一基本假设,使得在这种情况下的推理无效。因此,在实践中,由重尾创新驱动的具有结构误差的线性回归模型是首选的。另一个在现实数据中经常发生的问题是由于系统故障和劳工骚乱等原因导致的值丢失。尽管这两个问题给实践者带来了挑战,但它们共同研究的文献却很少。因此,本文首次将这两个问题结合起来考虑,并对响应变量缺失的时间序列的Student -t自回归模型提出了一种有效的参数估计方法。该程序是基于随机逼近期望最大化算法与马尔可夫链蒙特卡罗技术相结合。该程序给出了模型参数的有效的封闭表达式,易于计算。仿真和实际数据分析表明,该方法对于不完全时间序列数据是有效的。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
CiteScore
1.00
自引率
0.00%
发文量
29
期刊最新文献
The reciprocal elastic net Detection of influential observations in high-dimensional survival data Small area estimation of trends in household living standards in Uganda using a GMANOVA-MANOVA model and repeated surveys Applications of a new loss and cost-based process capability index to electronic industries A methodological framework for imputing missing spatial data at an aggregate level and guaranteeing data privacy: the AFFINITY method; implementation in the context of the official spatial Greek census data
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1