Investigating the association of a sensitive attribute with a random variable using the Christofides generalised randomised response design and Bayesian methods
Shen-Ming Lee, Truong-Nhat Le, Phuoc-Loc Tran, Chin-Shang Li
{"title":"Investigating the association of a sensitive attribute with a random variable using the Christofides generalised randomised response design and Bayesian methods","authors":"Shen-Ming Lee, Truong-Nhat Le, Phuoc-Loc Tran, Chin-Shang Li","doi":"10.1111/rssc.12585","DOIUrl":null,"url":null,"abstract":"<p>In empirical studies involving sensitive topics, in addition to the problem of estimating the population proportion with a sensitive characteristic, a question arises as to whether or not there is heterogeneity in the distribution of an auxiliary random variable representing the information of subjects collected from a sensitive group and a non-sensitive group. That is, it is of interest to investigate the influence of sensitive attribute on the auxiliary random variable of interest. Finite mixture models are utilised to evaluate the association. A proposed Bayesian method through data augmentation and Markov chain Monte Carlo is applied to estimate unknown parameters of interest. Deviance information criterion and marginal likelihood are employed to select a suitable model to describe the association of the sensitive characteristic with the auxiliary random variable. Simulation and real data studies are conducted to assess the performance of and illustrate applications of the proposed methodology.</p>","PeriodicalId":49981,"journal":{"name":"Journal of the Royal Statistical Society Series C-Applied Statistics","volume":"71 5","pages":"1471-1502"},"PeriodicalIF":1.0000,"publicationDate":"2022-08-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of the Royal Statistical Society Series C-Applied Statistics","FirstCategoryId":"100","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1111/rssc.12585","RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"STATISTICS & PROBABILITY","Score":null,"Total":0}
引用次数: 1
Abstract
In empirical studies involving sensitive topics, in addition to the problem of estimating the population proportion with a sensitive characteristic, a question arises as to whether or not there is heterogeneity in the distribution of an auxiliary random variable representing the information of subjects collected from a sensitive group and a non-sensitive group. That is, it is of interest to investigate the influence of sensitive attribute on the auxiliary random variable of interest. Finite mixture models are utilised to evaluate the association. A proposed Bayesian method through data augmentation and Markov chain Monte Carlo is applied to estimate unknown parameters of interest. Deviance information criterion and marginal likelihood are employed to select a suitable model to describe the association of the sensitive characteristic with the auxiliary random variable. Simulation and real data studies are conducted to assess the performance of and illustrate applications of the proposed methodology.
期刊介绍:
The Journal of the Royal Statistical Society, Series C (Applied Statistics) is a journal of international repute for statisticians both inside and outside the academic world. The journal is concerned with papers which deal with novel solutions to real life statistical problems by adapting or developing methodology, or by demonstrating the proper application of new or existing statistical methods to them. At their heart therefore the papers in the journal are motivated by examples and statistical data of all kinds. The subject-matter covers the whole range of inter-disciplinary fields, e.g. applications in agriculture, genetics, industry, medicine and the physical sciences, and papers on design issues (e.g. in relation to experiments, surveys or observational studies).
A deep understanding of statistical methodology is not necessary to appreciate the content. Although papers describing developments in statistical computing driven by practical examples are within its scope, the journal is not concerned with simply numerical illustrations or simulation studies. The emphasis of Series C is on case-studies of statistical analyses in practice.