Sampling methods for the concentration parameter and discrete baseline of the Dirichlet Process

Q4 Mathematics Statistics in Transition Pub Date : 2022-12-01 DOI:10.2478/stattrans-2022-0040
Yang Liu, B. Nandram
{"title":"Sampling methods for the concentration parameter and discrete baseline of the Dirichlet Process","authors":"Yang Liu, B. Nandram","doi":"10.2478/stattrans-2022-0040","DOIUrl":null,"url":null,"abstract":"Abstract There are many models in the current statistical literature for making inferences based on samples selected from a finite population. Parametric models may be problematic because statistical inference is sensitive to parametric assumptions. The Dirichlet process (DP) prior is very flexible and determines the complexity of the model. It is indexed by two hyper-parameters: the baseline distribution and concentration parameter. We address two distinct problems in the article. Firstly, we review the current sampling methods for the concentration parameter, which use the continuous baseline distribution. We compare three different methods: the adaptive rejection method, the mixture of Gammas method and the grid method. We also propose a new method based on the ratio of uniforms. Secondly, in practice, some survey responses are known to be discrete. If a continuous distribution is adopted as the baseline distribution, the model is misspecified and standard inference may be invalid. We propose a discrete baseline approach to the DP prior and sample the unobserved responses from the finite population both using a Polya urn scheme and a Multinomial distribution. We applied our discrete baseline approach to a Phytophthora data set.","PeriodicalId":37985,"journal":{"name":"Statistics in Transition","volume":"23 1","pages":"21 - 36"},"PeriodicalIF":0.0000,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Statistics in Transition","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.2478/stattrans-2022-0040","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"Mathematics","Score":null,"Total":0}
引用次数: 1

Abstract

Abstract There are many models in the current statistical literature for making inferences based on samples selected from a finite population. Parametric models may be problematic because statistical inference is sensitive to parametric assumptions. The Dirichlet process (DP) prior is very flexible and determines the complexity of the model. It is indexed by two hyper-parameters: the baseline distribution and concentration parameter. We address two distinct problems in the article. Firstly, we review the current sampling methods for the concentration parameter, which use the continuous baseline distribution. We compare three different methods: the adaptive rejection method, the mixture of Gammas method and the grid method. We also propose a new method based on the ratio of uniforms. Secondly, in practice, some survey responses are known to be discrete. If a continuous distribution is adopted as the baseline distribution, the model is misspecified and standard inference may be invalid. We propose a discrete baseline approach to the DP prior and sample the unobserved responses from the finite population both using a Polya urn scheme and a Multinomial distribution. We applied our discrete baseline approach to a Phytophthora data set.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
狄利克雷过程中浓度参数和离散基线的采样方法
摘要当前的统计文献中有许多模型用于根据从有限总体中选择的样本进行推断。参数模型可能有问题,因为统计推断对参数假设很敏感。狄利克雷过程(DP)先验是非常灵活的,并决定了模型的复杂性。它由两个超参数索引:基线分布和浓度参数。我们在文章中讨论了两个不同的问题。首先,我们回顾了目前使用连续基线分布的浓度参数采样方法。我们比较了三种不同的方法:自适应抑制方法、伽玛混合方法和网格方法。我们还提出了一种基于制服比例的新方法。其次,在实践中,一些调查答复是离散的。如果采用连续分布作为基线分布,则模型指定错误,标准推断可能无效。我们提出了一种DP先验的离散基线方法,并使用Polya-urn方案和多项式分布对有限总体中未观察到的响应进行采样。我们将离散基线方法应用于疫霉菌数据集。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Statistics in Transition
Statistics in Transition Decision Sciences-Statistics, Probability and Uncertainty
CiteScore
1.00
自引率
0.00%
发文量
0
审稿时长
9 weeks
期刊介绍: Statistics in Transition (SiT) is an international journal published jointly by the Polish Statistical Association (PTS) and the Central Statistical Office of Poland (CSO/GUS), which sponsors this publication. Launched in 1993, it was issued twice a year until 2006; since then it appears - under a slightly changed title, Statistics in Transition new series - three times a year; and after 2013 as a regular quarterly journal." The journal provides a forum for exchange of ideas and experience amongst members of international community of statisticians, data producers and users, including researchers, teachers, policy makers and the general public. Its initially dominating focus on statistical issues pertinent to transition from centrally planned to a market-oriented economy has gradually been extended to embracing statistical problems related to development and modernization of the system of public (official) statistics, in general.
期刊最新文献
Estimating the probability of leaving unemployment for older people in Poland using survival models with censored data Does economic freedom promote financial development? Evidence from EU countries Rotation schemes and Chebyshev polynomials A nonparametric analysis of discrete time competing risks data: a comparison of the cause-specific-hazards approach and the vertical approach Comments on „Probability vs. Nonprobability Sampling: From the Birth of Survey Sampling to the Present Day” by Graham Kalton
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1