首页 > 最新文献

Journal of Applied Statistics最新文献

英文 中文
Multivariate meta-analysis with a robustified diagonal likelihood function. 采用稳健对角似然函数的多元元分析。
IF 1.1 4区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2025-04-11 eCollection Date: 2025-01-01 DOI: 10.1080/02664763.2025.2487912
Zongliang Hu, Qianyu Zhou, Guanfu Liu

Multivariate meta-analysis is an efficient tool to analyze multivariate outcomes from independent studies, with the advantage of accounting for correlations between these outcomes. However, existing methods are sensitive to outliers in the data. In this paper, we propose new robust estimation methods for multivariate meta-analysis. In practice, within-study correlations are frequently not reported in studies, conventional robust multivariate methods using modified estimation equations may not be applicable. To address this challenge, we utilize robust functions to create new log-likelihood functions, by only using the diagonal components of the full covariance matrices. This approach bypasses the need for within-study correlations and also avoids the singularity problem of covariance matrices in the computation. Furthermore, the asymptotic distributions can automatically account for the missing correlations between multiple outcomes, enabling valid confidence intervals on functions of parameter estimates. Simulation studies and two real-data analyses are also carried out to demonstrate the advantages of our new robust estimation methods. Our primary focus is on bivariate meta-analysis, although the approaches can be applied more generally.

多变量荟萃分析是一种分析独立研究多变量结果的有效工具,具有考虑这些结果之间相关性的优势。然而,现有的方法对数据中的异常值很敏感。在本文中,我们提出了新的鲁棒估计方法用于多元元分析。在实践中,研究中经常没有报告研究内相关性,使用修正估计方程的传统稳健多元方法可能不适用。为了解决这一挑战,我们利用鲁棒函数来创建新的对数似然函数,只使用完整协方差矩阵的对角分量。这种方法绕过了研究内相关性的需要,也避免了计算中协方差矩阵的奇异性问题。此外,渐近分布可以自动解释多个结果之间缺失的相关性,从而在参数估计函数上实现有效的置信区间。仿真研究和两个实际数据分析也证明了我们的新鲁棒估计方法的优越性。我们的主要重点是双变量元分析,尽管这些方法可以更广泛地应用。
{"title":"Multivariate meta-analysis with a robustified diagonal likelihood function.","authors":"Zongliang Hu, Qianyu Zhou, Guanfu Liu","doi":"10.1080/02664763.2025.2487912","DOIUrl":"https://doi.org/10.1080/02664763.2025.2487912","url":null,"abstract":"<p><p>Multivariate meta-analysis is an efficient tool to analyze multivariate outcomes from independent studies, with the advantage of accounting for correlations between these outcomes. However, existing methods are sensitive to outliers in the data. In this paper, we propose new robust estimation methods for multivariate meta-analysis. In practice, within-study correlations are frequently not reported in studies, conventional robust multivariate methods using modified estimation equations may not be applicable. To address this challenge, we utilize robust functions to create new log-likelihood functions, by only using the diagonal components of the full covariance matrices. This approach bypasses the need for within-study correlations and also avoids the singularity problem of covariance matrices in the computation. Furthermore, the asymptotic distributions can automatically account for the missing correlations between multiple outcomes, enabling valid confidence intervals on functions of parameter estimates. Simulation studies and two real-data analyses are also carried out to demonstrate the advantages of our new robust estimation methods. Our primary focus is on bivariate meta-analysis, although the approaches can be applied more generally.</p>","PeriodicalId":15239,"journal":{"name":"Journal of Applied Statistics","volume":"52 15","pages":"2836-2872"},"PeriodicalIF":1.1,"publicationDate":"2025-04-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12671434/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145668782","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Generalised random tessellation stratified sampling over auxiliary spaces. 辅助空间上的广义随机镶嵌分层抽样。
IF 1.1 4区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2025-04-10 eCollection Date: 2025-01-01 DOI: 10.1080/02664763.2025.2490967
B L Robertson, C J Price, M Reale, J A Brown

Generalised Random Tessellation Stratified (GRTS) is a popular spatially balanced sampling design. GRTS can draw spatially balanced probability samples in two dimensions but has not been used to sample higher-dimensional auxiliary spaces. This article considers applying dimensionality reduction techniques to multidimensional auxiliary spaces to allow GRTS to be used to sample them. The aim is to improve the precision of GRTS-based estimators of population characteristics by incorporating auxiliary information into the GRTS sample. We numerically evaluate two dimensionality reduction techniques for equal and unequal probability samples on two spatial populations. Multipurpose surveys are also considered. Results show that GRTS samples from these two-dimensional spaces can improve the precision of GRTS over spatial coordinates.

广义随机镶嵌分层(GRTS)是一种流行的空间平衡抽样设计。GRTS可以在二维空间中绘制空间平衡概率样本,但尚未用于对高维辅助空间进行采样。本文考虑将降维技术应用于多维辅助空间,以便使用GRTS对其进行采样。目的是通过在GRTS样本中加入辅助信息来提高基于GRTS的总体特征估计的精度。我们数值评估了两个空间总体上的等概率和不等概率样本的二维降维技术。还考虑了多用途调查。结果表明,从这些二维空间中提取的GRTS样本可以提高GRTS在空间坐标上的精度。
{"title":"Generalised random tessellation stratified sampling over auxiliary spaces.","authors":"B L Robertson, C J Price, M Reale, J A Brown","doi":"10.1080/02664763.2025.2490967","DOIUrl":"10.1080/02664763.2025.2490967","url":null,"abstract":"<p><p>Generalised Random Tessellation Stratified (GRTS) is a popular spatially balanced sampling design. GRTS can draw spatially balanced probability samples in two dimensions but has not been used to sample higher-dimensional auxiliary spaces. This article considers applying dimensionality reduction techniques to multidimensional auxiliary spaces to allow GRTS to be used to sample them. The aim is to improve the precision of GRTS-based estimators of population characteristics by incorporating auxiliary information into the GRTS sample. We numerically evaluate two dimensionality reduction techniques for equal and unequal probability samples on two spatial populations. Multipurpose surveys are also considered. Results show that GRTS samples from these two-dimensional spaces can improve the precision of GRTS over spatial coordinates.</p>","PeriodicalId":15239,"journal":{"name":"Journal of Applied Statistics","volume":"52 15","pages":"2972-2983"},"PeriodicalIF":1.1,"publicationDate":"2025-04-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12673988/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145677855","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
LPRE estimation for functional multiplicative model and optimal subsampling. 函数乘法模型的LPRE估计与最优子抽样。
IF 1.1 4区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2025-04-09 eCollection Date: 2025-01-01 DOI: 10.1080/02664763.2025.2487922
Qian Yan, Hanyu Li

In this paper, we study the functional linear multiplicative model based on the least product relative error criterion. Under some regularization conditions, we establish the consistency and asymptotic normality of the estimator. Further, we investigate the optimal subsampling for this model with massive data. Both the consistency and asymptotic distribution of the subsampling estimator are first derived. Then, we obtain the optimal subsampling probabilities based on the A-optimality criterion. Moreover, the useful alternative subsampling probabilities without computing the inverse of the Hessian matrix are also proposed, which are easier to implement in practise. Finally, numerical studies and real data analysis are carried out to evaluate the performance of the proposed approaches.

本文研究了基于最小积相对误差准则的泛函线性乘法模型。在一些正则化条件下,我们建立了估计量的相合性和渐近正态性。进一步,我们研究了该模型的最优子采样。首先导出了子抽样估计量的相合性和渐近分布。然后,我们根据a -最优准则得到最优子抽样概率。此外,还提出了一种不需要计算Hessian矩阵逆的备选子抽样概率,在实际应用中更容易实现。最后,进行了数值研究和实际数据分析,以评估所提出方法的性能。
{"title":"LPRE estimation for functional multiplicative model and optimal subsampling.","authors":"Qian Yan, Hanyu Li","doi":"10.1080/02664763.2025.2487922","DOIUrl":"https://doi.org/10.1080/02664763.2025.2487922","url":null,"abstract":"<p><p>In this paper, we study the functional linear multiplicative model based on the least product relative error criterion. Under some regularization conditions, we establish the consistency and asymptotic normality of the estimator. Further, we investigate the optimal subsampling for this model with massive data. Both the consistency and asymptotic distribution of the subsampling estimator are first derived. Then, we obtain the optimal subsampling probabilities based on the A-optimality criterion. Moreover, the useful alternative subsampling probabilities without computing the inverse of the Hessian matrix are also proposed, which are easier to implement in practise. Finally, numerical studies and real data analysis are carried out to evaluate the performance of the proposed approaches.</p>","PeriodicalId":15239,"journal":{"name":"Journal of Applied Statistics","volume":"52 15","pages":"2894-2923"},"PeriodicalIF":1.1,"publicationDate":"2025-04-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12671435/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145668793","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Lasso Monte Carlo, a variation on multi fidelity methods for high-dimensional uncertainty quantification. Lasso Monte Carlo,一种对高维不确定度量化的多保真度方法的改进。
IF 1.1 4区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2025-04-08 eCollection Date: 2025-01-01 DOI: 10.1080/02664763.2025.2487505
Arnau Albà, Romana Boiger, Dimitri Rochman, Andreas Adelmann

Uncertainty quantification (UQ) is an active area of research, and an essential technique used in all fields of science and engineering. The most common methods for UQ are Monte Carlo and surrogate-modelling. The former method is dimensionality independent but has slow convergence, while the latter method has been shown to yield large computational speedups with respect to Monte Carlo. However, surrogate models suffer from the so-called curse of dimensionality, and become costly to train for high-dimensional problems, where UQ might become computationally prohibitive. In this paper we present a new technique, Lasso Monte Carlo (LMC), which combines a Lasso surrogate model with the multifidelity Monte Carlo technique, in order to perform UQ in high-dimensional settings, at a reduced computational cost. We provide mathematical guarantees for the unbiasedness of the method, and show that LMC can be more accurate than simple Monte Carlo. The theory is numerically tested with benchmarks on toy problems, as well as on a real example of UQ from the field of nuclear engineering. In all presented examples LMC is more accurate than simple Monte Carlo and other multifidelity methods. Thanks to LMC, computational costs are reduced by more than a factor of 5 with respect to simple MC, in relevant cases.

不确定度量化是一个活跃的研究领域,是应用于所有科学和工程领域的一项重要技术。UQ最常用的方法是蒙特卡罗和代理建模。前一种方法是维数无关的,但收敛速度较慢,而后一种方法已被证明相对于蒙特卡罗具有较大的计算速度。然而,代理模型遭受所谓的维度诅咒,并且在高维问题的训练上变得昂贵,其中UQ可能在计算上变得令人望而却步。在本文中,我们提出了一种新的技术,Lasso蒙特卡罗(LMC),它结合了Lasso代理模型和多保真蒙特卡罗技术,以减少计算成本在高维环境中执行UQ。我们为该方法的无偏性提供了数学保证,并表明LMC比简单的蒙特卡罗更准确。该理论在玩具问题的基准上进行了数值测试,并在核工程领域的UQ实际例子上进行了测试。在所有给出的例子中,LMC比简单的蒙特卡罗和其他多保真度方法更准确。由于LMC,在相关情况下,与简单MC相比,计算成本降低了五分之一以上。
{"title":"Lasso Monte Carlo, a variation on multi fidelity methods for high-dimensional uncertainty quantification.","authors":"Arnau Albà, Romana Boiger, Dimitri Rochman, Andreas Adelmann","doi":"10.1080/02664763.2025.2487505","DOIUrl":"10.1080/02664763.2025.2487505","url":null,"abstract":"<p><p>Uncertainty quantification (UQ) is an active area of research, and an essential technique used in all fields of science and engineering. The most common methods for UQ are Monte Carlo and surrogate-modelling. The former method is dimensionality independent but has slow convergence, while the latter method has been shown to yield large computational speedups with respect to Monte Carlo. However, surrogate models suffer from the so-called <i>curse of dimensionality</i>, and become costly to train for high-dimensional problems, where UQ might become computationally prohibitive. In this paper we present a new technique, Lasso Monte Carlo (LMC), which combines a Lasso surrogate model with the multifidelity Monte Carlo technique, in order to perform UQ in high-dimensional settings, at a reduced computational cost. We provide mathematical guarantees for the unbiasedness of the method, and show that LMC can be more accurate than simple Monte Carlo. The theory is numerically tested with benchmarks on toy problems, as well as on a real example of UQ from the field of nuclear engineering. In all presented examples LMC is more accurate than simple Monte Carlo and other multifidelity methods. Thanks to LMC, computational costs are reduced by more than a factor of 5 with respect to simple MC, in relevant cases.</p>","PeriodicalId":15239,"journal":{"name":"Journal of Applied Statistics","volume":"52 15","pages":"2799-2835"},"PeriodicalIF":1.1,"publicationDate":"2025-04-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12671436/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145668720","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Forecasting a time series of Lorenz curves: one-way functional analysis of variance. 预测洛伦兹曲线的时间序列:方差的单向泛函分析。
IF 1.1 4区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2025-04-08 eCollection Date: 2025-01-01 DOI: 10.1080/02664763.2025.2490093
Han Lin Shang

The Lorenz curve is a fundamental tool for analysing income and wealth distribution and inequality at national and regional levels. We utilise a one-way functional analysis of variance to decompose a time series of Lorenz curves and develop a method for producing one-step-ahead point and interval forecasts. The one-way functional analysis of variance is easily interpretable by decomposing an array into a functional grand effect, a functional row effect and residual functions. We evaluate and compare the forecast accuracy between the functional analysis of variance and three non-functional methods using the Italian household income and wealth data.

洛伦兹曲线是分析国家和地区收入和财富分配以及不平等的基本工具。我们利用方差的单向函数分析来分解洛伦兹曲线的时间序列,并开发了一种方法来产生一步前的点和区间预测。方差的单向泛函分析很容易通过将数组分解为泛函大效应、泛函行效应和残差函数来解释。我们利用意大利家庭收入和财富数据,对方差的功能分析和三种非功能方法的预测准确性进行了评估和比较。
{"title":"Forecasting a time series of Lorenz curves: one-way functional analysis of variance.","authors":"Han Lin Shang","doi":"10.1080/02664763.2025.2490093","DOIUrl":"https://doi.org/10.1080/02664763.2025.2490093","url":null,"abstract":"<p><p>The Lorenz curve is a fundamental tool for analysing income and wealth distribution and inequality at national and regional levels. We utilise a one-way functional analysis of variance to decompose a time series of Lorenz curves and develop a method for producing one-step-ahead point and interval forecasts. The one-way functional analysis of variance is easily interpretable by decomposing an array into a functional grand effect, a functional row effect and residual functions. We evaluate and compare the forecast accuracy between the functional analysis of variance and three non-functional methods using the Italian household income and wealth data.</p>","PeriodicalId":15239,"journal":{"name":"Journal of Applied Statistics","volume":"52 15","pages":"2924-2940"},"PeriodicalIF":1.1,"publicationDate":"2025-04-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12671420/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145668625","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
New strategies for detecting atypical observations based on the information matrix equality. 基于信息矩阵等式的非典型观测值检测新策略。
IF 1.1 4区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2025-04-04 eCollection Date: 2025-01-01 DOI: 10.1080/02664763.2025.2487914
Francisco Cribari-Neto, Klaus L P Vasconcellos, José J Santana-E-Silva

Diagnostic analyses in regression modeling are usually based on residuals or local influence measures and are used for detecting atypical observations. We develop a new approach for identifying such observations when the parameters of the model are estimated by maximum likelihood. The proposed approach is based on the information matrix equality, which holds when the model is correctly specified. We introduce a new definition of an atypical observation: one that disproportionately affects the degree of adequate specification of the model as measured using the sample counterparts of the matrices that comprise the information matrix equality. We consider various measures of distance between two symmetric matrices and apply them such that a zero distance indicates correct model specification. These measures quantify the degree of model adequacy and help identify atypical cases that significantly impact the model's adequacy. We also introduce a modified generalized Cook distance and a new criterion that uses the two generalized Cook's distances (modified and unmodified). Empirical applications involving Gaussian and beta regression models are presented and discussed.

回归模型中的诊断分析通常基于残差或局部影响度量,用于检测非典型观测值。我们开发了一种新的方法来识别这种观测时,模型的参数是由最大似然估计。该方法基于信息矩阵等式,当模型被正确指定时,信息矩阵等式成立。我们引入了一个非典型观察的新定义:一个不成比例地影响模型充分规范的程度,使用包含信息矩阵等式的矩阵的样本对应物来测量。我们考虑了两个对称矩阵之间的各种距离度量,并应用它们,使得零距离表示正确的模型规范。这些措施量化了模型充分性的程度,并帮助识别显著影响模型充分性的非典型案例。我们还引入了一个改进的广义Cook距离和一个使用两个广义Cook距离(修改和未修改)的新准则。提出并讨论了涉及高斯和β回归模型的经验应用。
{"title":"New strategies for detecting atypical observations based on the information matrix equality.","authors":"Francisco Cribari-Neto, Klaus L P Vasconcellos, José J Santana-E-Silva","doi":"10.1080/02664763.2025.2487914","DOIUrl":"https://doi.org/10.1080/02664763.2025.2487914","url":null,"abstract":"<p><p>Diagnostic analyses in regression modeling are usually based on residuals or local influence measures and are used for detecting atypical observations. We develop a new approach for identifying such observations when the parameters of the model are estimated by maximum likelihood. The proposed approach is based on the information matrix equality, which holds when the model is correctly specified. We introduce a new definition of an atypical observation: one that disproportionately affects the degree of adequate specification of the model as measured using the sample counterparts of the matrices that comprise the information matrix equality. We consider various measures of distance between two symmetric matrices and apply them such that a zero distance indicates correct model specification. These measures quantify the degree of model adequacy and help identify atypical cases that significantly impact the model's adequacy. We also introduce a modified generalized Cook distance and a new criterion that uses the two generalized Cook's distances (modified and unmodified). Empirical applications involving Gaussian and beta regression models are presented and discussed.</p>","PeriodicalId":15239,"journal":{"name":"Journal of Applied Statistics","volume":"52 15","pages":"2873-2893"},"PeriodicalIF":1.1,"publicationDate":"2025-04-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12671423/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145668709","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A multivariate randomized response model for mixed-type data. 混合类型数据的多变量随机响应模型。
IF 1.1 4区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2025-04-02 eCollection Date: 2025-01-01 DOI: 10.1080/02664763.2025.2480865
Amanda M Y Chu, Yasuhiro Omori, Hing-Yu So, Mike K P So

It is not uncommon for surveys in the social sciences to ask sensitive questions. Asking sensitive questions indirectly enables collecting of the desirable sensitive information while at the same time protecting respondents' data privacy. The randomized response technique, which uses a randomization scheme to collect sensitive responses, is one common approach used to achieve this. In this paper, we propose a multivariate ordered probit model to jointly analyze binary and ordinal sensitive response variables. We also develop Bayesian methods to estimate the probit model and perform posterior inference. The proposed probit model is applied to a large-scale drug administration survey to understand the work practice and experience of staff in three hospitals in Hong Kong. Randomized response technique was adopted in this drug administration survey to maintain the anonymity of staff whose work practice may deviate from official hospital guidelines. Empirical results using the drug administration data illustrate that we can understand the experience and practice of staff members in giving medication through probit modeling. Knowing the staff's practice on giving medication can indicate what drug administration procedures the staff may not follow properly and what areas to focus on for the enhancing of drug administration.

社会科学领域的调查提出敏感问题并不罕见。间接询问敏感问题可以收集所需的敏感信息,同时保护受访者的数据隐私。随机响应技术是实现这一目标的一种常用方法,它使用随机化方案来收集敏感响应。本文提出了一种多元有序概率模型,用于二元和有序敏感响应变量的联合分析。我们还开发了贝叶斯方法来估计概率模型和执行后验推理。本文将提出的probit模型应用于一项大规模的药物管理调查,以了解香港三家医院工作人员的工作实践和经验。在本次药物管理调查中采用随机反应技术,以保持工作实践可能偏离医院官方指南的工作人员的匿名性。使用给药数据的实证结果表明,我们可以通过probit模型来了解工作人员给药的经验和做法。了解员工给药实践,可以指出员工可能没有正确遵循哪些给药程序,以及加强给药工作的重点领域。
{"title":"A multivariate randomized response model for mixed-type data.","authors":"Amanda M Y Chu, Yasuhiro Omori, Hing-Yu So, Mike K P So","doi":"10.1080/02664763.2025.2480865","DOIUrl":"https://doi.org/10.1080/02664763.2025.2480865","url":null,"abstract":"<p><p>It is not uncommon for surveys in the social sciences to ask sensitive questions. Asking sensitive questions indirectly enables collecting of the desirable sensitive information while at the same time protecting respondents' data privacy. The randomized response technique, which uses a randomization scheme to collect sensitive responses, is one common approach used to achieve this. In this paper, we propose a multivariate ordered probit model to jointly analyze binary and ordinal sensitive response variables. We also develop Bayesian methods to estimate the probit model and perform posterior inference. The proposed probit model is applied to a large-scale drug administration survey to understand the work practice and experience of staff in three hospitals in Hong Kong. Randomized response technique was adopted in this drug administration survey to maintain the anonymity of staff whose work practice may deviate from official hospital guidelines. Empirical results using the drug administration data illustrate that we can understand the experience and practice of staff members in giving medication through probit modeling. Knowing the staff's practice on giving medication can indicate what drug administration procedures the staff may not follow properly and what areas to focus on for the enhancing of drug administration.</p>","PeriodicalId":15239,"journal":{"name":"Journal of Applied Statistics","volume":"52 14","pages":"2597-2635"},"PeriodicalIF":1.1,"publicationDate":"2025-04-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12581783/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145444872","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Calibrating a simulated exposure distribution using measurement error models. 使用测量误差模型校准模拟的曝光分布。
IF 1.1 4区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2025-03-29 eCollection Date: 2025-01-01 DOI: 10.1080/02664763.2025.2481460
Jiwoong Yu, Xueyan Zheng, Kwan-Young Bak, Kiyoung Lee, Woojoo Lee

Indirect exposure assessment based on average environmental concentrations in microenvironments and time spent in each environment has been considered an important way of assessing personal exposure to air pollutants. Using this indirect approach, the exposure simulator generates personal exposure values or the distribution of personal exposure for air pollutants. To match the simulator with the actual exposure measurements well, some calibration is necessary. However, unlike simulators generating personal exposure values, research evaluating the validity of the second type of simulator is rare. This study aims to develop a method for calibrating a simulator that generates an exposure distribution. To describe the relationship between the actual exposure measurements and the simulator, we introduce measurement error models (MEMs) and explain how the coefficients in the models can be used for calibrating the exposure distribution. We illustrate the proposed method using a Korea Simulation Exposure Model for fine particulate matter (KoSEM-PMII).

基于微环境中的平均环境浓度和在每个环境中花费的时间的间接暴露评估被认为是评估个人暴露于空气污染物的重要方法。使用这种间接方法,暴露模拟器产生个人暴露值或个人暴露于空气污染物的分布。为了使模拟器与实际曝光测量值很好地匹配,需要进行一些校准。然而,与产生个人暴露值的模拟器不同,评估第二类模拟器有效性的研究很少。本研究旨在开发一种方法来校准产生曝光分布的模拟器。为了描述实际曝光测量与模拟器之间的关系,我们引入了测量误差模型(MEMs),并解释了如何使用模型中的系数来校准曝光分布。我们使用韩国细颗粒物模拟暴露模型(KoSEM-PMII)来说明所提出的方法。
{"title":"Calibrating a simulated exposure distribution using measurement error models.","authors":"Jiwoong Yu, Xueyan Zheng, Kwan-Young Bak, Kiyoung Lee, Woojoo Lee","doi":"10.1080/02664763.2025.2481460","DOIUrl":"https://doi.org/10.1080/02664763.2025.2481460","url":null,"abstract":"<p><p>Indirect exposure assessment based on average environmental concentrations in microenvironments and time spent in each environment has been considered an important way of assessing personal exposure to air pollutants. Using this indirect approach, the exposure simulator generates personal exposure values or the distribution of personal exposure for air pollutants. To match the simulator with the actual exposure measurements well, some calibration is necessary. However, unlike simulators generating personal exposure values, research evaluating the validity of the second type of simulator is rare. This study aims to develop a method for calibrating a simulator that generates an exposure distribution. To describe the relationship between the actual exposure measurements and the simulator, we introduce measurement error models (MEMs) and explain how the coefficients in the models can be used for calibrating the exposure distribution. We illustrate the proposed method using a Korea Simulation Exposure Model for fine particulate matter (KoSEM-PMII).</p>","PeriodicalId":15239,"journal":{"name":"Journal of Applied Statistics","volume":"52 14","pages":"2707-2719"},"PeriodicalIF":1.1,"publicationDate":"2025-03-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12581748/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145444811","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Identifying outlying groups through residual analysis and its application to healthcare expenditure. 残差分析识别外围群体及其在医疗支出中的应用。
IF 1.1 4区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2025-03-28 eCollection Date: 2025-01-01 DOI: 10.1080/02664763.2025.2484599
Hyukdong Kwon, Jihnhee Yu, Mingliang Li

Traditional regression analysis primarily aims to describe the overall relationship between variables, often overlooking unexplainable aspects by design. Our focus is on these unexplained aspects, leveraging them to identify disparity groups with outlying behavior that deviate from the established model. We introduce a data-driven method for identifying such groups using group studentized residuals, which we term the mean squared of external studentized residuals. We apply this method to investigate disparities within healthcare markets, examining healthcare purchasing behavior and identifying the characteristics of disparity groups.

传统的回归分析主要旨在描述变量之间的整体关系,往往忽略了设计中不可解释的方面。我们的重点是这些无法解释的方面,利用它们来识别具有偏离既定模型的极端行为的差异群体。我们引入了一种数据驱动的方法来识别这样的群体,使用群体学生化残差,我们称之为外部学生化残差的均方。我们运用这种方法来调查医疗保健市场中的差异,检查医疗保健购买行为并确定差异群体的特征。
{"title":"Identifying outlying groups through residual analysis and its application to healthcare expenditure.","authors":"Hyukdong Kwon, Jihnhee Yu, Mingliang Li","doi":"10.1080/02664763.2025.2484599","DOIUrl":"https://doi.org/10.1080/02664763.2025.2484599","url":null,"abstract":"<p><p>Traditional regression analysis primarily aims to describe the overall relationship between variables, often overlooking unexplainable aspects by design. Our focus is on these unexplained aspects, leveraging them to identify disparity groups with outlying behavior that deviate from the established model. We introduce a data-driven method for identifying such groups using group studentized residuals, which we term the mean squared of external studentized residuals. We apply this method to investigate disparities within healthcare markets, examining healthcare purchasing behavior and identifying the characteristics of disparity groups.</p>","PeriodicalId":15239,"journal":{"name":"Journal of Applied Statistics","volume":"52 15","pages":"2777-2798"},"PeriodicalIF":1.1,"publicationDate":"2025-03-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12671417/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145668622","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Order selection in GARMA models for count time series: a Bayesian perspective. 计数时间序列的GARMA模型中的顺序选择:贝叶斯视角。
IF 1.1 4区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2025-03-26 eCollection Date: 2025-01-01 DOI: 10.1080/02664763.2025.2483309
Katerine Zuniga Lastra, Guilherme Pumi, Taiane Schaedler Prass

Estimation in GARMA models has traditionally been carried out under the frequentist approach. To date, Bayesian approaches for such estimation have been relatively limited. In the context of GARMA models for count time series, Bayesian estimation achieves satisfactory results in terms of point estimation. Model selection in this context often relies on the use of information criteria. Despite its prominence in the literature, the use of information criteria for model selection in GARMA models for count time series have been shown to present poor performance in simulations, especially in terms of their ability to correctly identify models, even under large sample sizes. In this work, we study the problem of order selection in GARMA models for count time series, adopting a Bayesian perspective considering the Reversible Jump Markov Chain Monte Carlo approach. Monte Carlo simulation studies are conducted to assess the finite sample performance of the developed ideas, including point and interval inference, sensitivity analysis, effects of burn-in and thinning, as well as the choice of related priors and hyperparameters. Two real-data applications are presented, one considering automobile production in Brazil and the other considering bus exportation in Brazil before and after the COVID-19 pandemic, showcasing the method's capabilities and further exploring its flexibility.

GARMA模型中的估计传统上是在频率方法下进行的。迄今为止,用于此类估计的贝叶斯方法相对有限。在计数时间序列的GARMA模型中,贝叶斯估计在点估计方面取得了令人满意的结果。在这种情况下,模型选择通常依赖于信息标准的使用。尽管它在文献中占有突出地位,但在GARMA模型中对计数时间序列的模型选择使用信息标准已被证明在模拟中表现不佳,特别是在正确识别模型的能力方面,即使在大样本量下。在这项工作中,我们研究了计数时间序列GARMA模型的顺序选择问题,采用贝叶斯视角考虑可逆跳跃马尔可夫链蒙特卡罗方法。通过蒙特卡罗模拟研究来评估所开发思想的有限样本性能,包括点和区间推理,灵敏度分析,老化和细化的影响,以及相关先验和超参数的选择。给出了两个实际数据应用,一个考虑巴西的汽车生产,另一个考虑巴西在COVID-19大流行前后的公共汽车出口,展示了该方法的能力,并进一步探索了其灵活性。
{"title":"Order selection in GARMA models for count time series: a Bayesian perspective.","authors":"Katerine Zuniga Lastra, Guilherme Pumi, Taiane Schaedler Prass","doi":"10.1080/02664763.2025.2483309","DOIUrl":"10.1080/02664763.2025.2483309","url":null,"abstract":"<p><p>Estimation in GARMA models has traditionally been carried out under the frequentist approach. To date, Bayesian approaches for such estimation have been relatively limited. In the context of GARMA models for count time series, Bayesian estimation achieves satisfactory results in terms of point estimation. Model selection in this context often relies on the use of information criteria. Despite its prominence in the literature, the use of information criteria for model selection in GARMA models for count time series have been shown to present poor performance in simulations, especially in terms of their ability to correctly identify models, even under large sample sizes. In this work, we study the problem of order selection in GARMA models for count time series, adopting a Bayesian perspective considering the Reversible Jump Markov Chain Monte Carlo approach. Monte Carlo simulation studies are conducted to assess the finite sample performance of the developed ideas, including point and interval inference, sensitivity analysis, effects of burn-in and thinning, as well as the choice of related priors and hyperparameters. Two real-data applications are presented, one considering automobile production in Brazil and the other considering bus exportation in Brazil before and after the COVID-19 pandemic, showcasing the method's capabilities and further exploring its flexibility.</p>","PeriodicalId":15239,"journal":{"name":"Journal of Applied Statistics","volume":"52 14","pages":"2720-2744"},"PeriodicalIF":1.1,"publicationDate":"2025-03-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12599011/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145495089","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Journal of Applied Statistics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1