{"title":"Bayesian Approach for Comparing Parameter Estimation of Regression Model for Outlier Data","authors":"Autcha Araveeporn, K. Kumnungkit","doi":"10.1145/3545839.3545841","DOIUrl":null,"url":null,"abstract":"This research compares and contrasts the simple regression model's parameter estimation methods, which consisted of a dependent variable and one independent variable. Parameter estimation uses the ordinary least square method, Bayesian method, Markov Chain Monte Carlo (MCMC) method, and local weight Markov Chain Monte Carlo (LWMCMC) method. The standard method is the ordinary least square method, which uses the concept of minimum sum square error to estimate parameters for fitting the linear regression model. However, for a set of the parameter relating to the Bayesian approach, the use of prior and posterior distributions may affect the approximation of the Bayesian, MCMC, LWMCMC methods. This paper considers the ordinal least square method and Bayesian approach by estimating the parameter for outlier data while some data points are far from other observations. The independent variable is simulated from the contaminated normal distribution, and the error is simulated from the normal distribution that made the outlier data on dependent and independent variables for the several sample sizes as 20, 50, 100, and 200. The criterion of the best efficiency is considered by the minimum of the average mean square errors. Through simulation data, the Bayesian method presents the minimum of average mean square errors at the sample sizes 20 and 50. However, when the sample size value increases, the MCMC and LWMCMC method are the best efficiency method at the sample sizes 100 and 200, respectively.","PeriodicalId":249161,"journal":{"name":"Proceedings of the 2022 5th International Conference on Mathematics and Statistics","volume":"18 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-06-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2022 5th International Conference on Mathematics and Statistics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3545839.3545841","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
This research compares and contrasts the simple regression model's parameter estimation methods, which consisted of a dependent variable and one independent variable. Parameter estimation uses the ordinary least square method, Bayesian method, Markov Chain Monte Carlo (MCMC) method, and local weight Markov Chain Monte Carlo (LWMCMC) method. The standard method is the ordinary least square method, which uses the concept of minimum sum square error to estimate parameters for fitting the linear regression model. However, for a set of the parameter relating to the Bayesian approach, the use of prior and posterior distributions may affect the approximation of the Bayesian, MCMC, LWMCMC methods. This paper considers the ordinal least square method and Bayesian approach by estimating the parameter for outlier data while some data points are far from other observations. The independent variable is simulated from the contaminated normal distribution, and the error is simulated from the normal distribution that made the outlier data on dependent and independent variables for the several sample sizes as 20, 50, 100, and 200. The criterion of the best efficiency is considered by the minimum of the average mean square errors. Through simulation data, the Bayesian method presents the minimum of average mean square errors at the sample sizes 20 and 50. However, when the sample size value increases, the MCMC and LWMCMC method are the best efficiency method at the sample sizes 100 and 200, respectively.