首页 > 最新文献

Applied Stochastic Models in Business and Industry最新文献

英文 中文
Explainable Fairness and Propensity Score Matching 可解释公平与倾向得分匹配
IF 1.5 4区 数学 Q3 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2025-10-28 DOI: 10.1002/asmb.70047
Paolo Giudici, Golnoosh Babaei

Fairness is a key requirement for artificial intelligence applications. The assessment of fairness is typically based on group-based measures, such as statistical parity, which compares the machine learning output for the different population groups of a protected variable. Although intuitive and simple, statistical parity may be affected by the presence of control variables, correlated with the protected variable. To remove this effect, we propose to employ Shapley values, which measure the additional difference in output specifically due to the protected variable. To remove the possible impact of correlations on Shapley values, we compare them across different subgroups of the most correlated control variables, checking for the presence of Simpson's paradox, for which a fair model may become unfair when conditioning on a control variable. We also show how to mitigate unfairness by means of a propensity score matching that can improve statistical parity, building a training sample that matches similar individuals in different protected groups. We apply our proposal to a real-world database containing 157,269 personal lending decisions and show that both logistic regression and random forest models are fair when all loan applications are considered, but become unfair for high loan amounts requested. We show how propensity score matching can mitigate this bias.

公平性是人工智能应用的关键要求。对公平性的评估通常基于基于群体的度量,例如统计平价,它比较受保护变量的不同人口群体的机器学习输出。虽然直观和简单,但统计奇偶性可能会受到与受保护变量相关的控制变量的影响。为了消除这种影响,我们建议使用Shapley值,它测量由于受保护变量而导致的输出的额外差异。为了消除相关性对Shapley值的可能影响,我们在最相关的控制变量的不同子组中比较它们,检查辛普森悖论的存在,其中公平模型可能在控制变量的条件作用下变得不公平。我们还展示了如何通过倾向得分匹配来减轻不公平,这可以提高统计平价,建立一个训练样本来匹配不同受保护群体中的相似个体。我们将我们的建议应用于包含157,269个个人贷款决策的真实数据库,并表明在考虑所有贷款申请时,逻辑回归和随机森林模型都是公平的,但对于要求的高贷款金额就变得不公平了。我们展示了倾向得分匹配如何减轻这种偏见。
{"title":"Explainable Fairness and Propensity Score Matching","authors":"Paolo Giudici,&nbsp;Golnoosh Babaei","doi":"10.1002/asmb.70047","DOIUrl":"https://doi.org/10.1002/asmb.70047","url":null,"abstract":"<p>Fairness is a key requirement for artificial intelligence applications. The assessment of fairness is typically based on group-based measures, such as statistical parity, which compares the machine learning output for the different population groups of a protected variable. Although intuitive and simple, statistical parity may be affected by the presence of control variables, correlated with the protected variable. To remove this effect, we propose to employ Shapley values, which measure the additional difference in output specifically due to the protected variable. To remove the possible impact of correlations on Shapley values, we compare them across different subgroups of the most correlated control variables, checking for the presence of Simpson's paradox, for which a fair model may become unfair when conditioning on a control variable. We also show how to mitigate unfairness by means of a propensity score matching that can improve statistical parity, building a training sample that matches similar individuals in different protected groups. We apply our proposal to a real-world database containing 157,269 personal lending decisions and show that both logistic regression and random forest models are fair when all loan applications are considered, but become unfair for high loan amounts requested. We show how propensity score matching can mitigate this bias.</p>","PeriodicalId":55495,"journal":{"name":"Applied Stochastic Models in Business and Industry","volume":"41 6","pages":""},"PeriodicalIF":1.5,"publicationDate":"2025-10-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/asmb.70047","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145406708","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Inference on Common Intraday Periodicity at High Frequencies With Jumps 有跳跃的高频共日内周期的推论
IF 1.5 4区 数学 Q3 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2025-10-27 DOI: 10.1002/asmb.70050
Fan Wu

In this paper, we investigate the presence of common intraday periodicity of assets and model the commonalities using functional data analysis when the price processes contain jumps. We implement the information criterion to select the number of common intraday periodic factors, and model the volatility part using nonparametric threshold method. Consistency of the estimated number of factors and uniform convergence of the spot volatility curve are established. Simulation and real data analysis justify that our estimation is accurate and can provide better forecasts of the spot volatility curve.

在本文中,我们研究了资产的共同日内周期性的存在,并使用函数数据分析对价格过程中包含跳跃的共性进行了建模。我们实现了信息准则来选择常见的日内周期因子的数量,并使用非参数阈值方法对波动部分进行建模。建立了估计因子数的一致性和现货波动率曲线的均匀收敛性。仿真和实际数据分析表明,本文的估计是准确的,可以较好地预测现货波动率曲线。
{"title":"Inference on Common Intraday Periodicity at High Frequencies With Jumps","authors":"Fan Wu","doi":"10.1002/asmb.70050","DOIUrl":"https://doi.org/10.1002/asmb.70050","url":null,"abstract":"<div>\u0000 \u0000 <p>In this paper, we investigate the presence of common intraday periodicity of assets and model the commonalities using functional data analysis when the price processes contain jumps. We implement the information criterion to select the number of common intraday periodic factors, and model the volatility part using nonparametric threshold method. Consistency of the estimated number of factors and uniform convergence of the spot volatility curve are established. Simulation and real data analysis justify that our estimation is accurate and can provide better forecasts of the spot volatility curve.</p>\u0000 </div>","PeriodicalId":55495,"journal":{"name":"Applied Stochastic Models in Business and Industry","volume":"41 6","pages":""},"PeriodicalIF":1.5,"publicationDate":"2025-10-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145406810","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Assessing the Impact on Sales of Competing Nearby Supermarkets 评估对附近竞争超市销售的影响
IF 1.5 4区 数学 Q3 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2025-10-22 DOI: 10.1002/asmb.70048
Daniel González Ibáñez, Xavier Puig

The supermarket industry faces fierce competition in many geographic areas, with various brands competing for a market share that represents a significant portion of a country's GDP. For these businesses, understanding the factors that influence their sales and how they do so is crucial: location, pricing, quality, customer loyalty, and more. While much has been written about how these elements can affect revenue, there is a lack of existing models quantifying the monetary impact of having competitor stores within an own establishment area of influence. This article introduces a hierarchical Bayesian model that quantifies this impact for each product category within the brand's catalog. The model requires careful parameterization to adapt to the specific context, and therefore both its application and design represent a methodological novelty in this area. The main result is a matrix of coefficients that indicate the monetary effect of the presence of every competitor for each product category. The findings revealed by the model show that the influence of the proximity of each competitor does indeed vary depending on these categories. This information is valuable for the company when making decisions, such as choosing new store locations or determining the product assortment to offer in each establishment. The use of the results derived from this model provides a competitive advantage over others and helps to better understand the market by identifying which supermarket brands are the perceived leaders in each product category.

超市行业在许多地理区域面临着激烈的竞争,各种品牌争夺市场份额,这代表了一个国家GDP的很大一部分。对于这些企业来说,了解影响其销售的因素以及如何影响销售至关重要:地点、价格、质量、客户忠诚度等等。虽然已经有很多关于这些因素如何影响收入的文章,但缺乏现有的模型来量化在自己的影响力范围内拥有竞争对手商店的货币影响。本文介绍了一个层次贝叶斯模型,该模型量化了品牌目录中每个产品类别的影响。该模型需要仔细的参数化以适应特定的环境,因此它的应用和设计都代表了该领域方法论的新颖性。主要结果是一个系数矩阵,它表明每个产品类别的每个竞争者的存在对货币的影响。该模型揭示的结果表明,每个竞争者的邻近程度的影响确实因这些类别而异。当公司做出决策时,这些信息是有价值的,例如选择新的商店位置或确定在每个机构中提供的产品分类。使用这个模型得出的结果提供了竞争优势,并有助于更好地了解市场,通过确定哪些超市品牌是每个产品类别的领导者。
{"title":"Assessing the Impact on Sales of Competing Nearby Supermarkets","authors":"Daniel González Ibáñez,&nbsp;Xavier Puig","doi":"10.1002/asmb.70048","DOIUrl":"https://doi.org/10.1002/asmb.70048","url":null,"abstract":"<p>The supermarket industry faces fierce competition in many geographic areas, with various brands competing for a market share that represents a significant portion of a country's GDP. For these businesses, understanding the factors that influence their sales and how they do so is crucial: location, pricing, quality, customer loyalty, and more. While much has been written about how these elements can affect revenue, there is a lack of existing models quantifying the monetary impact of having competitor stores within an own establishment area of influence. This article introduces a hierarchical Bayesian model that quantifies this impact for each product category within the brand's catalog. The model requires careful parameterization to adapt to the specific context, and therefore both its application and design represent a methodological novelty in this area. The main result is a matrix of coefficients that indicate the monetary effect of the presence of every competitor for each product category. The findings revealed by the model show that the influence of the proximity of each competitor does indeed vary depending on these categories. This information is valuable for the company when making decisions, such as choosing new store locations or determining the product assortment to offer in each establishment. The use of the results derived from this model provides a competitive advantage over others and helps to better understand the market by identifying which supermarket brands are the perceived leaders in each product category.</p>","PeriodicalId":55495,"journal":{"name":"Applied Stochastic Models in Business and Industry","volume":"41 6","pages":""},"PeriodicalIF":1.5,"publicationDate":"2025-10-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/asmb.70048","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145366657","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Scalable Inference via Averaged Robbins-Monro Bootstrap 基于平均罗宾斯-门罗Bootstrap的可扩展推理
IF 1.5 4区 数学 Q3 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2025-10-14 DOI: 10.1002/asmb.70046
Giuseppe Alfonzetti, Ruggero Bellio

Bootstrap procedures represent a straightforward approach to assessing the uncertainty around estimates of interest in statistical models. However, with the rising prevalence of massive datasets in statistical problems, the computational cost of bootstrap methods can quickly become prohibitive in many settings. To this end, this paper proposes the Averaged Robbins-Monro Bootstrap (ARM-B), a scalable tool for estimating parameter variability via multiple chains of Robbins-Monro updates. The method is illustrated in large-scale Poisson regression and logistic regression settings and compared with the alternative scalable method given by the bag of little bootstraps (BLB). Some simulation experiments and an illustrative analysis on a large-scale dataset show that ARM-B has comparable accuracy with ordinary bootstrap, but, at the same time, it is significantly less computationally demanding and quite competitive with BLB.

Bootstrap程序代表了一种直接的方法来评估统计模型中兴趣估计的不确定性。然而,随着统计问题中海量数据集的日益流行,在许多情况下,自举方法的计算成本很快就会变得令人望而却步。为此,本文提出了平均罗宾斯-门罗Bootstrap (ARM-B),这是一种可扩展的工具,用于通过多个罗宾斯-门罗更新链估计参数可变性。该方法在大规模泊松回归和逻辑回归设置下进行了说明,并与小自举袋(BLB)给出的替代可扩展方法进行了比较。一些仿真实验和对大规模数据集的说明性分析表明,ARM-B具有与普通自举相当的精度,但同时,它的计算需求明显低于BLB,具有很强的竞争力。
{"title":"Scalable Inference via Averaged Robbins-Monro Bootstrap","authors":"Giuseppe Alfonzetti,&nbsp;Ruggero Bellio","doi":"10.1002/asmb.70046","DOIUrl":"https://doi.org/10.1002/asmb.70046","url":null,"abstract":"<p>Bootstrap procedures represent a straightforward approach to assessing the uncertainty around estimates of interest in statistical models. However, with the rising prevalence of massive datasets in statistical problems, the computational cost of bootstrap methods can quickly become prohibitive in many settings. To this end, this paper proposes the Averaged Robbins-Monro Bootstrap (ARM-B), a scalable tool for estimating parameter variability via multiple chains of Robbins-Monro updates. The method is illustrated in large-scale Poisson regression and logistic regression settings and compared with the alternative scalable method given by the bag of little bootstraps (BLB). Some simulation experiments and an illustrative analysis on a large-scale dataset show that ARM-B has comparable accuracy with ordinary bootstrap, but, at the same time, it is significantly less computationally demanding and quite competitive with BLB.</p>","PeriodicalId":55495,"journal":{"name":"Applied Stochastic Models in Business and Industry","volume":"41 6","pages":""},"PeriodicalIF":1.5,"publicationDate":"2025-10-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/asmb.70046","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145297487","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Conformal Prediction Inference in Regularized Insurance Models 正则化保险模型的保形预测推理
IF 1.5 4区 数学 Q3 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2025-10-12 DOI: 10.1002/asmb.70045
Alokesh Manna, Aditya Vikram Sett, Dipak K. Dey, Yuwen Gu, Elizabeth D. Schifano, Jichao He

Prediction uncertainty quantification has become a key research topic in recent years, with applications in both scientific and business problems. In the insurance industry, assessing the range of possible claim costs for individual drivers improves premium pricing accuracy. It also enables insurers to manage risk more effectively by accounting for uncertainty in accident likelihood and severity. In the presence of covariates, a variety of regression-type models are often used for modeling insurance claims, ranging from relatively simple generalized linear models (GLMs) to regularized GLMs to gradient boosting models (GBMs). Conformal predictive inference has arisen as a popular distribution-free approach for quantifying predictive uncertainty under relatively weak assumptions of exchangeability, and has been well studied under the classic linear regression setting. In this work, we leverage GLMs and GBMs to define meaningful non-conformity measures, which are then used within the conformal prediction framework to provide reliable uncertainty quantification for these types of regression problems. Using regularized Tweedie GLM regression and LightGBM with Tweedie loss, we demonstrate conformal prediction performance with these non-conformity measures in insurance claims data. Our simulation results favor the use of locally weighted Pearson residuals for LightGBM over other methods considered, as the resulting intervals maintained the nominal coverage with the smallest average width.

预测不确定性量化已成为近年来的一个重要研究课题,在科学和商业问题中都有应用。在保险行业,评估个别司机可能的索赔成本范围可以提高保费定价的准确性。它还使保险公司能够通过考虑事故可能性和严重程度的不确定性,更有效地管理风险。在协变量存在的情况下,各种回归型模型经常用于保险索赔建模,从相对简单的广义线性模型(GLMs)到正则化的广义线性模型(GLMs)再到梯度增强模型(GBMs)。保形预测推理作为一种在相对较弱的可交换性假设下量化预测不确定性的无分布方法而兴起,并在经典线性回归设置下得到了很好的研究。在这项工作中,我们利用glm和gbm来定义有意义的不一致性度量,然后在适形预测框架内使用,为这些类型的回归问题提供可靠的不确定性量化。利用正则化Tweedie GLM回归和带Tweedie loss的LightGBM,我们证明了这些不符合度量在保险索赔数据中的符合性预测性能。与其他方法相比,我们的模拟结果更倾向于使用LightGBM的局部加权Pearson残差,因为得到的区间保持了最小平均宽度的名义覆盖范围。
{"title":"Conformal Prediction Inference in Regularized Insurance Models","authors":"Alokesh Manna,&nbsp;Aditya Vikram Sett,&nbsp;Dipak K. Dey,&nbsp;Yuwen Gu,&nbsp;Elizabeth D. Schifano,&nbsp;Jichao He","doi":"10.1002/asmb.70045","DOIUrl":"https://doi.org/10.1002/asmb.70045","url":null,"abstract":"<div>\u0000 \u0000 <p>Prediction uncertainty quantification has become a key research topic in recent years, with applications in both scientific and business problems. In the insurance industry, assessing the range of possible claim costs for individual drivers improves premium pricing accuracy. It also enables insurers to manage risk more effectively by accounting for uncertainty in accident likelihood and severity. In the presence of covariates, a variety of regression-type models are often used for modeling insurance claims, ranging from relatively simple generalized linear models (GLMs) to regularized GLMs to gradient boosting models (GBMs). Conformal predictive inference has arisen as a popular distribution-free approach for quantifying predictive uncertainty under relatively weak assumptions of exchangeability, and has been well studied under the classic linear regression setting. In this work, we leverage GLMs and GBMs to define meaningful non-conformity measures, which are then used within the conformal prediction framework to provide reliable uncertainty quantification for these types of regression problems. Using regularized Tweedie GLM regression and LightGBM with Tweedie loss, we demonstrate conformal prediction performance with these non-conformity measures in insurance claims data. Our simulation results favor the use of locally weighted Pearson residuals for LightGBM over other methods considered, as the resulting intervals maintained the nominal coverage with the smallest average width.</p>\u0000 </div>","PeriodicalId":55495,"journal":{"name":"Applied Stochastic Models in Business and Industry","volume":"41 5","pages":""},"PeriodicalIF":1.5,"publicationDate":"2025-10-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145316806","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Evaluating Uncertainties in Health Economic Models: A Review and Guide 评估卫生经济模型中的不确定性:综述与指南
IF 1.5 4区 数学 Q3 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2025-10-02 DOI: 10.1002/asmb.70044
Mohammad A. Chaudhary, Haitao Chu, Joseph C. Cappelleri

In health economics, decision-makers rely on models to assess the cost-effectiveness of healthcare interventions and guide resource allocation. Health Technology Assessment (HTA) agencies employ cost-effectiveness models to determine the approval and market access of new therapies within their respective jurisdictions. Health economists use quantitative techniques to synthesize clinical, epidemiological, and economic data to model the costs and effectiveness of a new drug compared to the current standard of care over the lifetime of the patients. These models frequently integrate a wide range of assumptions and data inputs from various sources, which renders them vulnerable to a significant level of uncertainty. Economic models commonly confront multiple forms of uncertainty, such as stochastic uncertainty (first-order), which differs from parameter uncertainty (second-order), as well as the presence of heterogeneity within patient populations. Additionally, structural uncertainty related to the model itself adds another layer of complexity. Uncertainty assessment is essential in model-based health economic evaluations that inform regulatory and reimbursement decisions. Understanding these sources of uncertainty, taking steps to minimize their impact, and analyzing, quantifying, and reporting these inherent uncertainties are crucial for ensuring that health economic models provide robust and reliable insights for effective decision-making. This article examines different types of uncertainty in health economic models and methods to analyze and quantify them, offering practical guidelines with examples from recent literature.

在卫生经济学中,决策者依靠模型来评估卫生保健干预措施的成本效益并指导资源分配。卫生技术评估(HTA)机构采用成本效益模型来确定新疗法在各自管辖范围内的批准和市场准入。卫生经济学家使用定量技术来综合临床、流行病学和经济数据,以模拟一种新药的成本和有效性,并将其与患者一生中目前的护理标准进行比较。这些模型经常集成来自各种来源的广泛假设和数据输入,这使得它们容易受到很大程度的不确定性的影响。经济模型通常面临多种形式的不确定性,如随机不确定性(一阶),它不同于参数不确定性(二阶),以及患者群体中异质性的存在。此外,与模型本身相关的结构不确定性增加了另一层复杂性。不确定性评估在为监管和报销决策提供信息的基于模型的卫生经济评估中至关重要。了解这些不确定性的来源,采取措施尽量减少其影响,并分析、量化和报告这些固有的不确定性,对于确保卫生经济模型为有效决策提供有力和可靠的见解至关重要。本文考察了卫生经济模型中不同类型的不确定性以及分析和量化这些不确定性的方法,并从最近的文献中提供了实用的指导方针。
{"title":"Evaluating Uncertainties in Health Economic Models: A Review and Guide","authors":"Mohammad A. Chaudhary,&nbsp;Haitao Chu,&nbsp;Joseph C. Cappelleri","doi":"10.1002/asmb.70044","DOIUrl":"https://doi.org/10.1002/asmb.70044","url":null,"abstract":"<div>\u0000 \u0000 <p>In health economics, decision-makers rely on models to assess the cost-effectiveness of healthcare interventions and guide resource allocation. Health Technology Assessment (HTA) agencies employ cost-effectiveness models to determine the approval and market access of new therapies within their respective jurisdictions. Health economists use quantitative techniques to synthesize clinical, epidemiological, and economic data to model the costs and effectiveness of a new drug compared to the current standard of care over the lifetime of the patients. These models frequently integrate a wide range of assumptions and data inputs from various sources, which renders them vulnerable to a significant level of uncertainty. Economic models commonly confront multiple forms of uncertainty, such as stochastic uncertainty (first-order), which differs from parameter uncertainty (second-order), as well as the presence of heterogeneity within patient populations. Additionally, structural uncertainty related to the model itself adds another layer of complexity. Uncertainty assessment is essential in model-based health economic evaluations that inform regulatory and reimbursement decisions. Understanding these sources of uncertainty, taking steps to minimize their impact, and analyzing, quantifying, and reporting these inherent uncertainties are crucial for ensuring that health economic models provide robust and reliable insights for effective decision-making. This article examines different types of uncertainty in health economic models and methods to analyze and quantify them, offering practical guidelines with examples from recent literature.</p>\u0000 </div>","PeriodicalId":55495,"journal":{"name":"Applied Stochastic Models in Business and Industry","volume":"41 5","pages":""},"PeriodicalIF":1.5,"publicationDate":"2025-10-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145224113","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Beyond Randomization: Design and Analysis of Discrete Choice Experiments in the Presence of Profile Order Effects Within Choice Sets 超越随机化:在选择集中存在剖面顺序效应的离散选择实验的设计与分析
IF 1.5 4区 数学 Q3 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2025-09-18 DOI: 10.1002/asmb.70043
Yicheng Mao, Roselinde Kessels, Robert Mee

Discrete Choice Experiments (DCEs) investigate the attributes that affect individual choices among different options and are widely applied across numerous fields. Past DCEs provide clear evidence that the presentation order of the profiles within a choice set can impact the respondents' choices. Ignoring such order effects can produce severely biased estimates, as we illustrate using a product packaging DCE performed for Procter & Gamble in Mexico. Currently, the most common approach to address profile order effects is to randomize the profile order. While this method is relatively easy to implement in online surveys, it can be considerably cumbersome in offline experimental settings. To address this, we suggest incorporating an order covariate in the model to measure the effect of profile order, and propose a Bayesian optimal Balanced Profile Order Design (BPOD) that accounts for this order effect. Our simulation experiments reveal that our Bayesian optimal BPOD achieves accurate parameter estimates comparable to those obtained through randomization in both the multinomial logit model and the panel mixed logit model. Beyond DCEs, this design strategy contributes to broader efforts in experimental design by providing a generalizable framework for addressing structural sources of bias in applied statistical research.

离散选择实验(dce)研究影响个体在不同选择中的选择的属性,并被广泛应用于许多领域。过去的dce提供了明确的证据,表明选择集中概要文件的呈现顺序会影响受访者的选择。忽略这样的顺序效应会产生严重的有偏差的估计,正如我们使用为墨西哥宝洁公司执行的产品包装DCE所说明的那样。目前,解决概要文件顺序效应的最常用方法是随机化概要文件顺序。虽然这种方法在在线调查中相对容易实施,但在离线实验环境中可能相当麻烦。为了解决这个问题,我们建议在模型中加入一个顺序协变量来衡量轮廓顺序的影响,并提出一个贝叶斯最优平衡轮廓顺序设计(BPOD)来解释这种顺序效应。我们的仿真实验表明,我们的贝叶斯最优BPOD在多项logit模型和面板混合logit模型中都获得了与随机化相当的精确参数估计。除了dce之外,这种设计策略通过提供一个可推广的框架来解决应用统计研究中的结构性偏差来源,从而有助于在实验设计中做出更广泛的努力。
{"title":"Beyond Randomization: Design and Analysis of Discrete Choice Experiments in the Presence of Profile Order Effects Within Choice Sets","authors":"Yicheng Mao,&nbsp;Roselinde Kessels,&nbsp;Robert Mee","doi":"10.1002/asmb.70043","DOIUrl":"https://doi.org/10.1002/asmb.70043","url":null,"abstract":"<p>Discrete Choice Experiments (DCEs) investigate the attributes that affect individual choices among different options and are widely applied across numerous fields. Past DCEs provide clear evidence that the presentation order of the profiles within a choice set can impact the respondents' choices. Ignoring such order effects can produce severely biased estimates, as we illustrate using a product packaging DCE performed for Procter &amp; Gamble in Mexico. Currently, the most common approach to address profile order effects is to randomize the profile order. While this method is relatively easy to implement in online surveys, it can be considerably cumbersome in offline experimental settings. To address this, we suggest incorporating an order covariate in the model to measure the effect of profile order, and propose a Bayesian optimal Balanced Profile Order Design (BPOD) that accounts for this order effect. Our simulation experiments reveal that our Bayesian optimal BPOD achieves accurate parameter estimates comparable to those obtained through randomization in both the multinomial logit model and the panel mixed logit model. Beyond DCEs, this design strategy contributes to broader efforts in experimental design by providing a generalizable framework for addressing structural sources of bias in applied statistical research.</p>","PeriodicalId":55495,"journal":{"name":"Applied Stochastic Models in Business and Industry","volume":"41 5","pages":""},"PeriodicalIF":1.5,"publicationDate":"2025-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/asmb.70043","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145101878","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Forecasting Gold Returns Volatility Over 1258–2023: The Role of Moments 预测1258-2023年黄金收益波动:时刻的作用
IF 1.5 4区 数学 Q3 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2025-09-15 DOI: 10.1002/asmb.70042
Thanoj K. Muddana, Komal S. R. Bhimireddy, Anandamayee Majumdar, Rangan Gupta

We analyze the role of leverage, lower and upper tail risks, skewness, and kurtosis of real gold returns in forecasting its volatility over the annual data sample from 1258 to 2023. To conduct our forecasting experiment, we first fit Bayesian time-varying parameters quantile regressions to real gold returns, under six alternative prior settings, to obtain the estimates of volatility (as inter-quantile range), lower and upper tail risks, skewness, and kurtosis. Second, we forecast the derived estimates of conditional volatility using the information contained in leverage of gold returns, tail risks, skewness, and kurtosis using recursively estimated linear predictive regressions over the out-of-sample periods. We find strong statistical evidence of the role of the moments-based predictors in forecasting gold returns volatility over the short to medium term, i.e., till 1–5-year ahead, when compared to the autoregressive benchmark. Robustness of our main result is also validated based on a shorter sample involving higher-frequency data. Our results have important implications for investors and policymakers.

本文分析了杠杆、上下尾风险、偏度和峰度对1258 - 2023年实物黄金收益率波动率预测的影响。为了进行预测实验,我们首先将贝叶斯时变参数分位数回归拟合到真实黄金收益中,在六种不同的先验设置下,获得波动性(作为分位数间范围)、上下尾风险、偏度和峰度的估计。其次,我们使用样本外周期递归估计的线性预测回归,利用黄金收益杠杆、尾部风险、偏度和峰度中包含的信息预测条件波动的推导估计。与自回归基准相比,我们发现了强有力的统计证据,证明基于时刻的预测因子在预测黄金短期至中期(即未来1 - 5年)回报波动方面的作用。我们的主要结果的鲁棒性也验证了基于更短的样本涉及高频数据。我们的研究结果对投资者和政策制定者具有重要意义。
{"title":"Forecasting Gold Returns Volatility Over 1258–2023: The Role of Moments","authors":"Thanoj K. Muddana,&nbsp;Komal S. R. Bhimireddy,&nbsp;Anandamayee Majumdar,&nbsp;Rangan Gupta","doi":"10.1002/asmb.70042","DOIUrl":"https://doi.org/10.1002/asmb.70042","url":null,"abstract":"<div>\u0000 \u0000 <p>We analyze the role of leverage, lower and upper tail risks, skewness, and kurtosis of real gold returns in forecasting its volatility over the annual data sample from 1258 to 2023. To conduct our forecasting experiment, we first fit Bayesian time-varying parameters quantile regressions to real gold returns, under six alternative prior settings, to obtain the estimates of volatility (as inter-quantile range), lower and upper tail risks, skewness, and kurtosis. Second, we forecast the derived estimates of conditional volatility using the information contained in leverage of gold returns, tail risks, skewness, and kurtosis using recursively estimated linear predictive regressions over the out-of-sample periods. We find strong statistical evidence of the role of the moments-based predictors in forecasting gold returns volatility over the short to medium term, i.e., till 1–5-year ahead, when compared to the autoregressive benchmark. Robustness of our main result is also validated based on a shorter sample involving higher-frequency data. Our results have important implications for investors and policymakers.</p>\u0000 </div>","PeriodicalId":55495,"journal":{"name":"Applied Stochastic Models in Business and Industry","volume":"41 5","pages":""},"PeriodicalIF":1.5,"publicationDate":"2025-09-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145101132","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Backdoor Attacks on DNN and GBDT: A Case Study From the Insurance Domain 对DNN和GBDT的后门攻击:来自保险领域的案例研究
IF 1.5 4区 数学 Q3 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2025-09-07 DOI: 10.1002/asmb.70029
Robin Kühlem, Daniel Otten, Daniel Ludwig, Anselm Hudde, Alexander Rosenbaum, Andreas Mauthe

Machine learning (ML) will most likely play a large role in many processes in the future, also in the insurance industry. However, ML models are at risk of being attacked and manipulated. A model compromised by a backdoor attack loses its integrity and can no longer be deemed trustworthy. Ensuring the trustworthiness of ML models is crucial, as compromised models can lead to significant financial and reputational damage for insurance companies. In this work the robustness of Gradient Boosted Decision Tree (GBDT) models and Deep Neural Networks (DNN) within an insurance context is evaluated. Therefore, two GBDT models and two DNNs are trained on two different tabular datasets from an insurance context. Past research in this domain mainly used homogeneous data and there are comparably little insights regarding heterogeneous tabular data. The ML tasks performed on the datasets are claim prediction (regression) and fraud detection (binary classification). For the backdoor attacks different samples containing a specific pattern were crafted and added to the training data. It is shown, that this type of attack can be highly successful, even with a few added samples. The backdoor attacks worked well on the models trained on one dataset but poorly on the models trained on the other. In real-world scenarios the attacker will have to face several obstacles but as attacks can work with very few added samples this risk should be evaluated. Therefore, understanding and mitigating these risks is essential for the reliable deployment of ML in critical applications.

机器学习(ML)很可能在未来的许多流程中发挥重要作用,在保险业也是如此。然而,ML模型面临被攻击和操纵的风险。被后门攻击破坏的模型失去了完整性,不再被认为是值得信赖的。确保机器学习模型的可信度至关重要,因为受损的模型可能会给保险公司带来重大的财务和声誉损失。在这项工作中,评估了梯度增强决策树(GBDT)模型和深度神经网络(DNN)在保险环境中的鲁棒性。因此,两个GBDT模型和两个dnn在来自保险上下文的两个不同的表格数据集上进行训练。过去在该领域的研究主要使用同构数据,对于异构表格数据的见解相对较少。在数据集上执行的ML任务是索赔预测(回归)和欺诈检测(二元分类)。对于后门攻击,包含特定模式的不同样本被精心制作并添加到训练数据中。结果表明,即使添加少量样本,这种类型的攻击也可以非常成功。后门攻击在一个数据集上训练的模型上效果很好,但在另一个数据集上训练的模型上效果很差。在现实场景中,攻击者将不得不面对一些障碍,但由于攻击可以使用很少的添加样本,因此应该评估这种风险。因此,理解和减轻这些风险对于在关键应用程序中可靠地部署ML至关重要。
{"title":"Backdoor Attacks on DNN and GBDT: A Case Study From the Insurance Domain","authors":"Robin Kühlem,&nbsp;Daniel Otten,&nbsp;Daniel Ludwig,&nbsp;Anselm Hudde,&nbsp;Alexander Rosenbaum,&nbsp;Andreas Mauthe","doi":"10.1002/asmb.70029","DOIUrl":"https://doi.org/10.1002/asmb.70029","url":null,"abstract":"<p>Machine learning (ML) will most likely play a large role in many processes in the future, also in the insurance industry. However, ML models are at risk of being attacked and manipulated. A model compromised by a backdoor attack loses its integrity and can no longer be deemed trustworthy. Ensuring the trustworthiness of ML models is crucial, as compromised models can lead to significant financial and reputational damage for insurance companies. In this work the robustness of Gradient Boosted Decision Tree (GBDT) models and Deep Neural Networks (DNN) within an insurance context is evaluated. Therefore, two GBDT models and two DNNs are trained on two different tabular datasets from an insurance context. Past research in this domain mainly used homogeneous data and there are comparably little insights regarding heterogeneous tabular data. The ML tasks performed on the datasets are claim prediction (regression) and fraud detection (binary classification). For the backdoor attacks different samples containing a specific pattern were crafted and added to the training data. It is shown, that this type of attack can be highly successful, even with a few added samples. The backdoor attacks worked well on the models trained on one dataset but poorly on the models trained on the other. In real-world scenarios the attacker will have to face several obstacles but as attacks can work with very few added samples this risk should be evaluated. Therefore, understanding and mitigating these risks is essential for the reliable deployment of ML in critical applications.</p>","PeriodicalId":55495,"journal":{"name":"Applied Stochastic Models in Business and Industry","volume":"41 5","pages":""},"PeriodicalIF":1.5,"publicationDate":"2025-09-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/asmb.70029","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145007920","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Pricing Basket Spread Option Under the Correlated Skew Brownian Motions 相关偏布朗运动下的一篮子价差期权定价
IF 1.5 4区 数学 Q3 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2025-08-31 DOI: 10.1002/asmb.70040
Qifeng Zhong, Xingye Yue, Jing Yao

In this article, we propose synthetic analytical approximate pricing formulas for basket and basket spread options within a multidimensional correlated skew Brownian motion framework. For basket options, we derive two analytical approximate pricing formulas by utilizing the partial exact approximation, the moment matching method and convex bounds approximation together and achieve accurate and analytical approximations. For basket spread options, we derive a lower bound approximation using the Bjerksund–Stensland-type approach. Numerical examples demonstrate superior performance with consistent robustness and high precision of our formulas, remarkably maintaining excellent performance for high-dimensional options. We also note that these approximate pricing formulas can serve as powerful control variates for the variance reduction of Monte Carlo simulations.

在本文中,我们提出了多维相关偏布朗运动框架下篮子和篮子价差期权的综合解析近似定价公式。对于一篮子期权,我们将部分精确近似、矩匹配法和凸界近似结合起来,推导出两个解析近似定价公式,实现了精确的解析近似。对于一篮子价差期权,我们使用bjerksun - stensland型方法导出了下界近似。数值算例表明,我们的公式具有稳定的鲁棒性和高精度,显著地保持了高维选项的优异性能。我们还注意到,这些近似定价公式可以作为蒙特卡罗模拟方差减小的强大控制变量。
{"title":"Pricing Basket Spread Option Under the Correlated Skew Brownian Motions","authors":"Qifeng Zhong,&nbsp;Xingye Yue,&nbsp;Jing Yao","doi":"10.1002/asmb.70040","DOIUrl":"https://doi.org/10.1002/asmb.70040","url":null,"abstract":"<div>\u0000 \u0000 <p>In this article, we propose synthetic analytical approximate pricing formulas for basket and basket spread options within a multidimensional correlated skew Brownian motion framework. For basket options, we derive two analytical approximate pricing formulas by utilizing the partial exact approximation, the moment matching method and convex bounds approximation together and achieve accurate and analytical approximations. For basket spread options, we derive a lower bound approximation using the Bjerksund–Stensland-type approach. Numerical examples demonstrate superior performance with consistent robustness and high precision of our formulas, remarkably maintaining excellent performance for high-dimensional options. We also note that these approximate pricing formulas can serve as powerful control variates for the variance reduction of Monte Carlo simulations.</p>\u0000 </div>","PeriodicalId":55495,"journal":{"name":"Applied Stochastic Models in Business and Industry","volume":"41 5","pages":""},"PeriodicalIF":1.5,"publicationDate":"2025-08-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144923830","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Applied Stochastic Models in Business and Industry
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1