首页 > 最新文献

Applied Stochastic Models in Business and Industry最新文献

英文 中文
Scalable Inference via Averaged Robbins-Monro Bootstrap 基于平均罗宾斯-门罗Bootstrap的可扩展推理
IF 1.5 4区 数学 Q3 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2025-10-14 DOI: 10.1002/asmb.70046
Giuseppe Alfonzetti, Ruggero Bellio

Bootstrap procedures represent a straightforward approach to assessing the uncertainty around estimates of interest in statistical models. However, with the rising prevalence of massive datasets in statistical problems, the computational cost of bootstrap methods can quickly become prohibitive in many settings. To this end, this paper proposes the Averaged Robbins-Monro Bootstrap (ARM-B), a scalable tool for estimating parameter variability via multiple chains of Robbins-Monro updates. The method is illustrated in large-scale Poisson regression and logistic regression settings and compared with the alternative scalable method given by the bag of little bootstraps (BLB). Some simulation experiments and an illustrative analysis on a large-scale dataset show that ARM-B has comparable accuracy with ordinary bootstrap, but, at the same time, it is significantly less computationally demanding and quite competitive with BLB.

Bootstrap程序代表了一种直接的方法来评估统计模型中兴趣估计的不确定性。然而,随着统计问题中海量数据集的日益流行,在许多情况下,自举方法的计算成本很快就会变得令人望而却步。为此,本文提出了平均罗宾斯-门罗Bootstrap (ARM-B),这是一种可扩展的工具,用于通过多个罗宾斯-门罗更新链估计参数可变性。该方法在大规模泊松回归和逻辑回归设置下进行了说明,并与小自举袋(BLB)给出的替代可扩展方法进行了比较。一些仿真实验和对大规模数据集的说明性分析表明,ARM-B具有与普通自举相当的精度,但同时,它的计算需求明显低于BLB,具有很强的竞争力。
{"title":"Scalable Inference via Averaged Robbins-Monro Bootstrap","authors":"Giuseppe Alfonzetti,&nbsp;Ruggero Bellio","doi":"10.1002/asmb.70046","DOIUrl":"https://doi.org/10.1002/asmb.70046","url":null,"abstract":"<p>Bootstrap procedures represent a straightforward approach to assessing the uncertainty around estimates of interest in statistical models. However, with the rising prevalence of massive datasets in statistical problems, the computational cost of bootstrap methods can quickly become prohibitive in many settings. To this end, this paper proposes the Averaged Robbins-Monro Bootstrap (ARM-B), a scalable tool for estimating parameter variability via multiple chains of Robbins-Monro updates. The method is illustrated in large-scale Poisson regression and logistic regression settings and compared with the alternative scalable method given by the bag of little bootstraps (BLB). Some simulation experiments and an illustrative analysis on a large-scale dataset show that ARM-B has comparable accuracy with ordinary bootstrap, but, at the same time, it is significantly less computationally demanding and quite competitive with BLB.</p>","PeriodicalId":55495,"journal":{"name":"Applied Stochastic Models in Business and Industry","volume":"41 6","pages":""},"PeriodicalIF":1.5,"publicationDate":"2025-10-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/asmb.70046","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145297487","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Conformal Prediction Inference in Regularized Insurance Models 正则化保险模型的保形预测推理
IF 1.5 4区 数学 Q3 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2025-10-12 DOI: 10.1002/asmb.70045
Alokesh Manna, Aditya Vikram Sett, Dipak K. Dey, Yuwen Gu, Elizabeth D. Schifano, Jichao He

Prediction uncertainty quantification has become a key research topic in recent years, with applications in both scientific and business problems. In the insurance industry, assessing the range of possible claim costs for individual drivers improves premium pricing accuracy. It also enables insurers to manage risk more effectively by accounting for uncertainty in accident likelihood and severity. In the presence of covariates, a variety of regression-type models are often used for modeling insurance claims, ranging from relatively simple generalized linear models (GLMs) to regularized GLMs to gradient boosting models (GBMs). Conformal predictive inference has arisen as a popular distribution-free approach for quantifying predictive uncertainty under relatively weak assumptions of exchangeability, and has been well studied under the classic linear regression setting. In this work, we leverage GLMs and GBMs to define meaningful non-conformity measures, which are then used within the conformal prediction framework to provide reliable uncertainty quantification for these types of regression problems. Using regularized Tweedie GLM regression and LightGBM with Tweedie loss, we demonstrate conformal prediction performance with these non-conformity measures in insurance claims data. Our simulation results favor the use of locally weighted Pearson residuals for LightGBM over other methods considered, as the resulting intervals maintained the nominal coverage with the smallest average width.

预测不确定性量化已成为近年来的一个重要研究课题,在科学和商业问题中都有应用。在保险行业,评估个别司机可能的索赔成本范围可以提高保费定价的准确性。它还使保险公司能够通过考虑事故可能性和严重程度的不确定性,更有效地管理风险。在协变量存在的情况下,各种回归型模型经常用于保险索赔建模,从相对简单的广义线性模型(GLMs)到正则化的广义线性模型(GLMs)再到梯度增强模型(GBMs)。保形预测推理作为一种在相对较弱的可交换性假设下量化预测不确定性的无分布方法而兴起,并在经典线性回归设置下得到了很好的研究。在这项工作中,我们利用glm和gbm来定义有意义的不一致性度量,然后在适形预测框架内使用,为这些类型的回归问题提供可靠的不确定性量化。利用正则化Tweedie GLM回归和带Tweedie loss的LightGBM,我们证明了这些不符合度量在保险索赔数据中的符合性预测性能。与其他方法相比,我们的模拟结果更倾向于使用LightGBM的局部加权Pearson残差,因为得到的区间保持了最小平均宽度的名义覆盖范围。
{"title":"Conformal Prediction Inference in Regularized Insurance Models","authors":"Alokesh Manna,&nbsp;Aditya Vikram Sett,&nbsp;Dipak K. Dey,&nbsp;Yuwen Gu,&nbsp;Elizabeth D. Schifano,&nbsp;Jichao He","doi":"10.1002/asmb.70045","DOIUrl":"https://doi.org/10.1002/asmb.70045","url":null,"abstract":"<div>\u0000 \u0000 <p>Prediction uncertainty quantification has become a key research topic in recent years, with applications in both scientific and business problems. In the insurance industry, assessing the range of possible claim costs for individual drivers improves premium pricing accuracy. It also enables insurers to manage risk more effectively by accounting for uncertainty in accident likelihood and severity. In the presence of covariates, a variety of regression-type models are often used for modeling insurance claims, ranging from relatively simple generalized linear models (GLMs) to regularized GLMs to gradient boosting models (GBMs). Conformal predictive inference has arisen as a popular distribution-free approach for quantifying predictive uncertainty under relatively weak assumptions of exchangeability, and has been well studied under the classic linear regression setting. In this work, we leverage GLMs and GBMs to define meaningful non-conformity measures, which are then used within the conformal prediction framework to provide reliable uncertainty quantification for these types of regression problems. Using regularized Tweedie GLM regression and LightGBM with Tweedie loss, we demonstrate conformal prediction performance with these non-conformity measures in insurance claims data. Our simulation results favor the use of locally weighted Pearson residuals for LightGBM over other methods considered, as the resulting intervals maintained the nominal coverage with the smallest average width.</p>\u0000 </div>","PeriodicalId":55495,"journal":{"name":"Applied Stochastic Models in Business and Industry","volume":"41 5","pages":""},"PeriodicalIF":1.5,"publicationDate":"2025-10-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145316806","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Evaluating Uncertainties in Health Economic Models: A Review and Guide 评估卫生经济模型中的不确定性:综述与指南
IF 1.5 4区 数学 Q3 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2025-10-02 DOI: 10.1002/asmb.70044
Mohammad A. Chaudhary, Haitao Chu, Joseph C. Cappelleri

In health economics, decision-makers rely on models to assess the cost-effectiveness of healthcare interventions and guide resource allocation. Health Technology Assessment (HTA) agencies employ cost-effectiveness models to determine the approval and market access of new therapies within their respective jurisdictions. Health economists use quantitative techniques to synthesize clinical, epidemiological, and economic data to model the costs and effectiveness of a new drug compared to the current standard of care over the lifetime of the patients. These models frequently integrate a wide range of assumptions and data inputs from various sources, which renders them vulnerable to a significant level of uncertainty. Economic models commonly confront multiple forms of uncertainty, such as stochastic uncertainty (first-order), which differs from parameter uncertainty (second-order), as well as the presence of heterogeneity within patient populations. Additionally, structural uncertainty related to the model itself adds another layer of complexity. Uncertainty assessment is essential in model-based health economic evaluations that inform regulatory and reimbursement decisions. Understanding these sources of uncertainty, taking steps to minimize their impact, and analyzing, quantifying, and reporting these inherent uncertainties are crucial for ensuring that health economic models provide robust and reliable insights for effective decision-making. This article examines different types of uncertainty in health economic models and methods to analyze and quantify them, offering practical guidelines with examples from recent literature.

在卫生经济学中,决策者依靠模型来评估卫生保健干预措施的成本效益并指导资源分配。卫生技术评估(HTA)机构采用成本效益模型来确定新疗法在各自管辖范围内的批准和市场准入。卫生经济学家使用定量技术来综合临床、流行病学和经济数据,以模拟一种新药的成本和有效性,并将其与患者一生中目前的护理标准进行比较。这些模型经常集成来自各种来源的广泛假设和数据输入,这使得它们容易受到很大程度的不确定性的影响。经济模型通常面临多种形式的不确定性,如随机不确定性(一阶),它不同于参数不确定性(二阶),以及患者群体中异质性的存在。此外,与模型本身相关的结构不确定性增加了另一层复杂性。不确定性评估在为监管和报销决策提供信息的基于模型的卫生经济评估中至关重要。了解这些不确定性的来源,采取措施尽量减少其影响,并分析、量化和报告这些固有的不确定性,对于确保卫生经济模型为有效决策提供有力和可靠的见解至关重要。本文考察了卫生经济模型中不同类型的不确定性以及分析和量化这些不确定性的方法,并从最近的文献中提供了实用的指导方针。
{"title":"Evaluating Uncertainties in Health Economic Models: A Review and Guide","authors":"Mohammad A. Chaudhary,&nbsp;Haitao Chu,&nbsp;Joseph C. Cappelleri","doi":"10.1002/asmb.70044","DOIUrl":"https://doi.org/10.1002/asmb.70044","url":null,"abstract":"<div>\u0000 \u0000 <p>In health economics, decision-makers rely on models to assess the cost-effectiveness of healthcare interventions and guide resource allocation. Health Technology Assessment (HTA) agencies employ cost-effectiveness models to determine the approval and market access of new therapies within their respective jurisdictions. Health economists use quantitative techniques to synthesize clinical, epidemiological, and economic data to model the costs and effectiveness of a new drug compared to the current standard of care over the lifetime of the patients. These models frequently integrate a wide range of assumptions and data inputs from various sources, which renders them vulnerable to a significant level of uncertainty. Economic models commonly confront multiple forms of uncertainty, such as stochastic uncertainty (first-order), which differs from parameter uncertainty (second-order), as well as the presence of heterogeneity within patient populations. Additionally, structural uncertainty related to the model itself adds another layer of complexity. Uncertainty assessment is essential in model-based health economic evaluations that inform regulatory and reimbursement decisions. Understanding these sources of uncertainty, taking steps to minimize their impact, and analyzing, quantifying, and reporting these inherent uncertainties are crucial for ensuring that health economic models provide robust and reliable insights for effective decision-making. This article examines different types of uncertainty in health economic models and methods to analyze and quantify them, offering practical guidelines with examples from recent literature.</p>\u0000 </div>","PeriodicalId":55495,"journal":{"name":"Applied Stochastic Models in Business and Industry","volume":"41 5","pages":""},"PeriodicalIF":1.5,"publicationDate":"2025-10-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145224113","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Beyond Randomization: Design and Analysis of Discrete Choice Experiments in the Presence of Profile Order Effects Within Choice Sets 超越随机化:在选择集中存在剖面顺序效应的离散选择实验的设计与分析
IF 1.5 4区 数学 Q3 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2025-09-18 DOI: 10.1002/asmb.70043
Yicheng Mao, Roselinde Kessels, Robert Mee

Discrete Choice Experiments (DCEs) investigate the attributes that affect individual choices among different options and are widely applied across numerous fields. Past DCEs provide clear evidence that the presentation order of the profiles within a choice set can impact the respondents' choices. Ignoring such order effects can produce severely biased estimates, as we illustrate using a product packaging DCE performed for Procter & Gamble in Mexico. Currently, the most common approach to address profile order effects is to randomize the profile order. While this method is relatively easy to implement in online surveys, it can be considerably cumbersome in offline experimental settings. To address this, we suggest incorporating an order covariate in the model to measure the effect of profile order, and propose a Bayesian optimal Balanced Profile Order Design (BPOD) that accounts for this order effect. Our simulation experiments reveal that our Bayesian optimal BPOD achieves accurate parameter estimates comparable to those obtained through randomization in both the multinomial logit model and the panel mixed logit model. Beyond DCEs, this design strategy contributes to broader efforts in experimental design by providing a generalizable framework for addressing structural sources of bias in applied statistical research.

离散选择实验(dce)研究影响个体在不同选择中的选择的属性,并被广泛应用于许多领域。过去的dce提供了明确的证据,表明选择集中概要文件的呈现顺序会影响受访者的选择。忽略这样的顺序效应会产生严重的有偏差的估计,正如我们使用为墨西哥宝洁公司执行的产品包装DCE所说明的那样。目前,解决概要文件顺序效应的最常用方法是随机化概要文件顺序。虽然这种方法在在线调查中相对容易实施,但在离线实验环境中可能相当麻烦。为了解决这个问题,我们建议在模型中加入一个顺序协变量来衡量轮廓顺序的影响,并提出一个贝叶斯最优平衡轮廓顺序设计(BPOD)来解释这种顺序效应。我们的仿真实验表明,我们的贝叶斯最优BPOD在多项logit模型和面板混合logit模型中都获得了与随机化相当的精确参数估计。除了dce之外,这种设计策略通过提供一个可推广的框架来解决应用统计研究中的结构性偏差来源,从而有助于在实验设计中做出更广泛的努力。
{"title":"Beyond Randomization: Design and Analysis of Discrete Choice Experiments in the Presence of Profile Order Effects Within Choice Sets","authors":"Yicheng Mao,&nbsp;Roselinde Kessels,&nbsp;Robert Mee","doi":"10.1002/asmb.70043","DOIUrl":"https://doi.org/10.1002/asmb.70043","url":null,"abstract":"<p>Discrete Choice Experiments (DCEs) investigate the attributes that affect individual choices among different options and are widely applied across numerous fields. Past DCEs provide clear evidence that the presentation order of the profiles within a choice set can impact the respondents' choices. Ignoring such order effects can produce severely biased estimates, as we illustrate using a product packaging DCE performed for Procter &amp; Gamble in Mexico. Currently, the most common approach to address profile order effects is to randomize the profile order. While this method is relatively easy to implement in online surveys, it can be considerably cumbersome in offline experimental settings. To address this, we suggest incorporating an order covariate in the model to measure the effect of profile order, and propose a Bayesian optimal Balanced Profile Order Design (BPOD) that accounts for this order effect. Our simulation experiments reveal that our Bayesian optimal BPOD achieves accurate parameter estimates comparable to those obtained through randomization in both the multinomial logit model and the panel mixed logit model. Beyond DCEs, this design strategy contributes to broader efforts in experimental design by providing a generalizable framework for addressing structural sources of bias in applied statistical research.</p>","PeriodicalId":55495,"journal":{"name":"Applied Stochastic Models in Business and Industry","volume":"41 5","pages":""},"PeriodicalIF":1.5,"publicationDate":"2025-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/asmb.70043","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145101878","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Forecasting Gold Returns Volatility Over 1258–2023: The Role of Moments 预测1258-2023年黄金收益波动:时刻的作用
IF 1.5 4区 数学 Q3 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2025-09-15 DOI: 10.1002/asmb.70042
Thanoj K. Muddana, Komal S. R. Bhimireddy, Anandamayee Majumdar, Rangan Gupta

We analyze the role of leverage, lower and upper tail risks, skewness, and kurtosis of real gold returns in forecasting its volatility over the annual data sample from 1258 to 2023. To conduct our forecasting experiment, we first fit Bayesian time-varying parameters quantile regressions to real gold returns, under six alternative prior settings, to obtain the estimates of volatility (as inter-quantile range), lower and upper tail risks, skewness, and kurtosis. Second, we forecast the derived estimates of conditional volatility using the information contained in leverage of gold returns, tail risks, skewness, and kurtosis using recursively estimated linear predictive regressions over the out-of-sample periods. We find strong statistical evidence of the role of the moments-based predictors in forecasting gold returns volatility over the short to medium term, i.e., till 1–5-year ahead, when compared to the autoregressive benchmark. Robustness of our main result is also validated based on a shorter sample involving higher-frequency data. Our results have important implications for investors and policymakers.

本文分析了杠杆、上下尾风险、偏度和峰度对1258 - 2023年实物黄金收益率波动率预测的影响。为了进行预测实验,我们首先将贝叶斯时变参数分位数回归拟合到真实黄金收益中,在六种不同的先验设置下,获得波动性(作为分位数间范围)、上下尾风险、偏度和峰度的估计。其次,我们使用样本外周期递归估计的线性预测回归,利用黄金收益杠杆、尾部风险、偏度和峰度中包含的信息预测条件波动的推导估计。与自回归基准相比,我们发现了强有力的统计证据,证明基于时刻的预测因子在预测黄金短期至中期(即未来1 - 5年)回报波动方面的作用。我们的主要结果的鲁棒性也验证了基于更短的样本涉及高频数据。我们的研究结果对投资者和政策制定者具有重要意义。
{"title":"Forecasting Gold Returns Volatility Over 1258–2023: The Role of Moments","authors":"Thanoj K. Muddana,&nbsp;Komal S. R. Bhimireddy,&nbsp;Anandamayee Majumdar,&nbsp;Rangan Gupta","doi":"10.1002/asmb.70042","DOIUrl":"https://doi.org/10.1002/asmb.70042","url":null,"abstract":"<div>\u0000 \u0000 <p>We analyze the role of leverage, lower and upper tail risks, skewness, and kurtosis of real gold returns in forecasting its volatility over the annual data sample from 1258 to 2023. To conduct our forecasting experiment, we first fit Bayesian time-varying parameters quantile regressions to real gold returns, under six alternative prior settings, to obtain the estimates of volatility (as inter-quantile range), lower and upper tail risks, skewness, and kurtosis. Second, we forecast the derived estimates of conditional volatility using the information contained in leverage of gold returns, tail risks, skewness, and kurtosis using recursively estimated linear predictive regressions over the out-of-sample periods. We find strong statistical evidence of the role of the moments-based predictors in forecasting gold returns volatility over the short to medium term, i.e., till 1–5-year ahead, when compared to the autoregressive benchmark. Robustness of our main result is also validated based on a shorter sample involving higher-frequency data. Our results have important implications for investors and policymakers.</p>\u0000 </div>","PeriodicalId":55495,"journal":{"name":"Applied Stochastic Models in Business and Industry","volume":"41 5","pages":""},"PeriodicalIF":1.5,"publicationDate":"2025-09-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145101132","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Backdoor Attacks on DNN and GBDT: A Case Study From the Insurance Domain 对DNN和GBDT的后门攻击:来自保险领域的案例研究
IF 1.5 4区 数学 Q3 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2025-09-07 DOI: 10.1002/asmb.70029
Robin Kühlem, Daniel Otten, Daniel Ludwig, Anselm Hudde, Alexander Rosenbaum, Andreas Mauthe

Machine learning (ML) will most likely play a large role in many processes in the future, also in the insurance industry. However, ML models are at risk of being attacked and manipulated. A model compromised by a backdoor attack loses its integrity and can no longer be deemed trustworthy. Ensuring the trustworthiness of ML models is crucial, as compromised models can lead to significant financial and reputational damage for insurance companies. In this work the robustness of Gradient Boosted Decision Tree (GBDT) models and Deep Neural Networks (DNN) within an insurance context is evaluated. Therefore, two GBDT models and two DNNs are trained on two different tabular datasets from an insurance context. Past research in this domain mainly used homogeneous data and there are comparably little insights regarding heterogeneous tabular data. The ML tasks performed on the datasets are claim prediction (regression) and fraud detection (binary classification). For the backdoor attacks different samples containing a specific pattern were crafted and added to the training data. It is shown, that this type of attack can be highly successful, even with a few added samples. The backdoor attacks worked well on the models trained on one dataset but poorly on the models trained on the other. In real-world scenarios the attacker will have to face several obstacles but as attacks can work with very few added samples this risk should be evaluated. Therefore, understanding and mitigating these risks is essential for the reliable deployment of ML in critical applications.

机器学习(ML)很可能在未来的许多流程中发挥重要作用,在保险业也是如此。然而,ML模型面临被攻击和操纵的风险。被后门攻击破坏的模型失去了完整性,不再被认为是值得信赖的。确保机器学习模型的可信度至关重要,因为受损的模型可能会给保险公司带来重大的财务和声誉损失。在这项工作中,评估了梯度增强决策树(GBDT)模型和深度神经网络(DNN)在保险环境中的鲁棒性。因此,两个GBDT模型和两个dnn在来自保险上下文的两个不同的表格数据集上进行训练。过去在该领域的研究主要使用同构数据,对于异构表格数据的见解相对较少。在数据集上执行的ML任务是索赔预测(回归)和欺诈检测(二元分类)。对于后门攻击,包含特定模式的不同样本被精心制作并添加到训练数据中。结果表明,即使添加少量样本,这种类型的攻击也可以非常成功。后门攻击在一个数据集上训练的模型上效果很好,但在另一个数据集上训练的模型上效果很差。在现实场景中,攻击者将不得不面对一些障碍,但由于攻击可以使用很少的添加样本,因此应该评估这种风险。因此,理解和减轻这些风险对于在关键应用程序中可靠地部署ML至关重要。
{"title":"Backdoor Attacks on DNN and GBDT: A Case Study From the Insurance Domain","authors":"Robin Kühlem,&nbsp;Daniel Otten,&nbsp;Daniel Ludwig,&nbsp;Anselm Hudde,&nbsp;Alexander Rosenbaum,&nbsp;Andreas Mauthe","doi":"10.1002/asmb.70029","DOIUrl":"https://doi.org/10.1002/asmb.70029","url":null,"abstract":"<p>Machine learning (ML) will most likely play a large role in many processes in the future, also in the insurance industry. However, ML models are at risk of being attacked and manipulated. A model compromised by a backdoor attack loses its integrity and can no longer be deemed trustworthy. Ensuring the trustworthiness of ML models is crucial, as compromised models can lead to significant financial and reputational damage for insurance companies. In this work the robustness of Gradient Boosted Decision Tree (GBDT) models and Deep Neural Networks (DNN) within an insurance context is evaluated. Therefore, two GBDT models and two DNNs are trained on two different tabular datasets from an insurance context. Past research in this domain mainly used homogeneous data and there are comparably little insights regarding heterogeneous tabular data. The ML tasks performed on the datasets are claim prediction (regression) and fraud detection (binary classification). For the backdoor attacks different samples containing a specific pattern were crafted and added to the training data. It is shown, that this type of attack can be highly successful, even with a few added samples. The backdoor attacks worked well on the models trained on one dataset but poorly on the models trained on the other. In real-world scenarios the attacker will have to face several obstacles but as attacks can work with very few added samples this risk should be evaluated. Therefore, understanding and mitigating these risks is essential for the reliable deployment of ML in critical applications.</p>","PeriodicalId":55495,"journal":{"name":"Applied Stochastic Models in Business and Industry","volume":"41 5","pages":""},"PeriodicalIF":1.5,"publicationDate":"2025-09-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/asmb.70029","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145007920","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Pricing Basket Spread Option Under the Correlated Skew Brownian Motions 相关偏布朗运动下的一篮子价差期权定价
IF 1.5 4区 数学 Q3 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2025-08-31 DOI: 10.1002/asmb.70040
Qifeng Zhong, Xingye Yue, Jing Yao

In this article, we propose synthetic analytical approximate pricing formulas for basket and basket spread options within a multidimensional correlated skew Brownian motion framework. For basket options, we derive two analytical approximate pricing formulas by utilizing the partial exact approximation, the moment matching method and convex bounds approximation together and achieve accurate and analytical approximations. For basket spread options, we derive a lower bound approximation using the Bjerksund–Stensland-type approach. Numerical examples demonstrate superior performance with consistent robustness and high precision of our formulas, remarkably maintaining excellent performance for high-dimensional options. We also note that these approximate pricing formulas can serve as powerful control variates for the variance reduction of Monte Carlo simulations.

在本文中,我们提出了多维相关偏布朗运动框架下篮子和篮子价差期权的综合解析近似定价公式。对于一篮子期权,我们将部分精确近似、矩匹配法和凸界近似结合起来,推导出两个解析近似定价公式,实现了精确的解析近似。对于一篮子价差期权,我们使用bjerksun - stensland型方法导出了下界近似。数值算例表明,我们的公式具有稳定的鲁棒性和高精度,显著地保持了高维选项的优异性能。我们还注意到,这些近似定价公式可以作为蒙特卡罗模拟方差减小的强大控制变量。
{"title":"Pricing Basket Spread Option Under the Correlated Skew Brownian Motions","authors":"Qifeng Zhong,&nbsp;Xingye Yue,&nbsp;Jing Yao","doi":"10.1002/asmb.70040","DOIUrl":"https://doi.org/10.1002/asmb.70040","url":null,"abstract":"<div>\u0000 \u0000 <p>In this article, we propose synthetic analytical approximate pricing formulas for basket and basket spread options within a multidimensional correlated skew Brownian motion framework. For basket options, we derive two analytical approximate pricing formulas by utilizing the partial exact approximation, the moment matching method and convex bounds approximation together and achieve accurate and analytical approximations. For basket spread options, we derive a lower bound approximation using the Bjerksund–Stensland-type approach. Numerical examples demonstrate superior performance with consistent robustness and high precision of our formulas, remarkably maintaining excellent performance for high-dimensional options. We also note that these approximate pricing formulas can serve as powerful control variates for the variance reduction of Monte Carlo simulations.</p>\u0000 </div>","PeriodicalId":55495,"journal":{"name":"Applied Stochastic Models in Business and Industry","volume":"41 5","pages":""},"PeriodicalIF":1.5,"publicationDate":"2025-08-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144923830","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Directional False Discovery Rate Control in Large-Scale Multiple Testing Under Data Dependence 数据依赖下大规模多重测试的定向错误发现率控制
IF 1.5 4区 数学 Q3 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2025-08-31 DOI: 10.1002/asmb.70041
Wendong Li, Jianqing Shi, Yi Wang, Dongdong Xiang

Detecting directional signals in multiple testing is crucial to take targeted and effective measures. In this article, we consider the directional multiple testing under the dependence problem within a three-group model. Given the assumption that the observed data are generated according to an underlying three-state hidden Markov model, we develop oracle and data-driven procedures to maximize the expected number of true discoveries (ETD) while controlling the false discovery rates (FDRs) of both alternative states at their nominal levels. It is shown theoretically that the proposed directional multiple testing procedures are valid and have certain optimality properties for directional FDR-control. An extensive numerical study shows that our procedures are significantly more powerful than their competitors since the former can accommodate the dependence structure among hypotheses. The proposed procedures also exhibit high flexibility by allowing different nominal levels for the two alternative states, which is appealing in cases when the false discoveries of different alternative states are not equally important. As a demonstration, the proposed data-driven procedure is applied to learn the transcriptomic characteristics of bronchoalveolar lavage fluid in COVID-19 patients.

在多次测试中检测方向信号,采取有针对性的有效措施至关重要。在本文中,我们考虑了三组模型中依赖问题下的定向多重检验。假设观察到的数据是根据底层的三状态隐马尔可夫模型生成的,我们开发了oracle和数据驱动的过程,以最大化真实发现(ETD)的预期数量,同时将两种可选状态的错误发现率(fdr)控制在其名义水平上。从理论上证明了所提出的定向多重测试方法对定向fdr控制是有效的,并具有一定的最优性。一项广泛的数值研究表明,由于前者可以适应假设之间的依赖结构,我们的程序明显比他们的竞争对手更强大。所提出的程序还表现出高度的灵活性,允许两种可选状态的不同名义水平,这在不同可选状态的错误发现并不同等重要的情况下很有吸引力。作为演示,应用所提出的数据驱动程序来了解COVID-19患者支气管肺泡灌洗液的转录组学特征。
{"title":"Directional False Discovery Rate Control in Large-Scale Multiple Testing Under Data Dependence","authors":"Wendong Li,&nbsp;Jianqing Shi,&nbsp;Yi Wang,&nbsp;Dongdong Xiang","doi":"10.1002/asmb.70041","DOIUrl":"https://doi.org/10.1002/asmb.70041","url":null,"abstract":"<div>\u0000 \u0000 <p>Detecting directional signals in multiple testing is crucial to take targeted and effective measures. In this article, we consider the directional multiple testing under the dependence problem within a three-group model. Given the assumption that the observed data are generated according to an underlying three-state hidden Markov model, we develop oracle and data-driven procedures to maximize the expected number of true discoveries (ETD) while controlling the false discovery rates (FDRs) of both alternative states at their nominal levels. It is shown theoretically that the proposed directional multiple testing procedures are valid and have certain optimality properties for directional FDR-control. An extensive numerical study shows that our procedures are significantly more powerful than their competitors since the former can accommodate the dependence structure among hypotheses. The proposed procedures also exhibit high flexibility by allowing different nominal levels for the two alternative states, which is appealing in cases when the false discoveries of different alternative states are not equally important. As a demonstration, the proposed data-driven procedure is applied to learn the transcriptomic characteristics of bronchoalveolar lavage fluid in COVID-19 patients.</p>\u0000 </div>","PeriodicalId":55495,"journal":{"name":"Applied Stochastic Models in Business and Industry","volume":"41 5","pages":""},"PeriodicalIF":1.5,"publicationDate":"2025-08-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144923789","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Bayesian Analysis of Shared Frailty Models for Repairable Systems Subject to Imperfect Repair 不完全修复下可修复系统共享脆弱性模型的贝叶斯分析
IF 1.5 4区 数学 Q3 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2025-08-28 DOI: 10.1002/asmb.70039
Éder S. Brito, Vera L. D. Tomazella, Paulo H. Ferreira, Francisco Louzada

Repairable systems, crucial in reliability studies, are characterized by recurrent failure times modeled as counting processes with intensity functions. This paper explores models for these failure times incorporating imperfect repairs, addressing unobserved heterogeneity via shared frailty models. In this context, our approach involves scenarios with general imperfect repairs, which offer a more realistic perspective compared to the minimal or perfect repair assumptions commonly employed in the reliability literature. We propose hierarchical Bayesian methods to estimate parameters, leveraging the Power-Law Process for initial intensities and gamma distributions for frailty terms. Bayesian methods are highly flexible and can accommodate complex shared frailty models that include random effects and dependencies between units. Applying Bayesian inference with gamma and beta distribution priors, coupled with Monte Carlo simulations, provides a robust methodology for estimating unknown parameters and deriving posterior distributions. This flexibility is crucial for capturing the underlying structure of the data in repairable systems with imperfect repairs. Our hierarchical Bayesian framework accommodates multiple systems, providing insights into failure processes and supporting enhanced maintenance strategies. We demonstrate our approach using a real failure times dataset and evaluate its performance through simulation studies, showcasing its applicability and relevance in practical settings.

可修复系统在可靠性研究中至关重要,其特点是反复出现的故障时间被建模为具有强度函数的计数过程。本文探讨了包含不完美修复的这些故障时间的模型,通过共享脆弱性模型解决了未观察到的异质性。在这种情况下,我们的方法涉及一般不完美修复的场景,与可靠性文献中通常采用的最小或完美修复假设相比,它提供了更现实的视角。我们提出了分层贝叶斯方法来估计参数,利用幂律过程的初始强度和脆弱项的伽马分布。贝叶斯方法是高度灵活的,可以适应复杂的共享脆弱性模型,包括随机效应和单位之间的依赖关系。将贝叶斯推理与gamma和beta分布先验相结合,结合蒙特卡罗模拟,为估计未知参数和推导后验分布提供了一种稳健的方法。这种灵活性对于在修复不完善的可修复系统中捕获数据的底层结构至关重要。我们的分层贝叶斯框架可容纳多个系统,提供对故障过程的洞察,并支持增强的维护策略。我们使用真实的故障时间数据集演示了我们的方法,并通过模拟研究评估了其性能,展示了其在实际环境中的适用性和相关性。
{"title":"Bayesian Analysis of Shared Frailty Models for Repairable Systems Subject to Imperfect Repair","authors":"Éder S. Brito,&nbsp;Vera L. D. Tomazella,&nbsp;Paulo H. Ferreira,&nbsp;Francisco Louzada","doi":"10.1002/asmb.70039","DOIUrl":"https://doi.org/10.1002/asmb.70039","url":null,"abstract":"<div>\u0000 \u0000 <p>Repairable systems, crucial in reliability studies, are characterized by recurrent failure times modeled as counting processes with intensity functions. This paper explores models for these failure times incorporating imperfect repairs, addressing unobserved heterogeneity via shared frailty models. In this context, our approach involves scenarios with general imperfect repairs, which offer a more realistic perspective compared to the minimal or perfect repair assumptions commonly employed in the reliability literature. We propose hierarchical Bayesian methods to estimate parameters, leveraging the Power-Law Process for initial intensities and gamma distributions for frailty terms. Bayesian methods are highly flexible and can accommodate complex shared frailty models that include random effects and dependencies between units. Applying Bayesian inference with gamma and beta distribution priors, coupled with Monte Carlo simulations, provides a robust methodology for estimating unknown parameters and deriving posterior distributions. This flexibility is crucial for capturing the underlying structure of the data in repairable systems with imperfect repairs. Our hierarchical Bayesian framework accommodates multiple systems, providing insights into failure processes and supporting enhanced maintenance strategies. We demonstrate our approach using a real failure times dataset and evaluate its performance through simulation studies, showcasing its applicability and relevance in practical settings.</p>\u0000 </div>","PeriodicalId":55495,"journal":{"name":"Applied Stochastic Models in Business and Industry","volume":"41 5","pages":""},"PeriodicalIF":1.5,"publicationDate":"2025-08-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144915101","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Inference for Simple Step Stress Accelerated Life Test Model Under Progressively Censored Gompertz Data 渐进式截尾Gompertz数据下简单阶跃应力加速寿命试验模型的推断
IF 1.5 4区 数学 Q3 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2025-08-25 DOI: 10.1002/asmb.70037
Rajat Das, Yogesh Mani Tripathi, Liang Wang, Shuo-Jye Wu

In this article analysis of a simple step-stress accelerated life test is considered under progressive type-II censoring. A cumulative exposure model is considered when the latent lifetimes of test units follow the Gompertz distribution with different shape parameters and a common scale parameter. We explore the study by estimating all unknown parameters using classical and Bayesian techniques. The model parameters are estimated using maximum likelihood and Bayesian methods. Subsequently, interval estimates are derived based on the observed Fisher information matrix. Bayesian estimates are obtained using squared error and linear exponential loss functions. Subsequently highest posterior density intervals are also constructed. We examine the efficiency of all estimators through simulation studies. Finally, we provide a real-life example in support of the considered model.

本文考虑了渐进式ii型截割下的简单阶跃应力加速寿命试验分析。当试验单元的潜在寿命服从不同形状参数和相同尺度参数的Gompertz分布时,考虑累积暴露模型。我们通过使用经典和贝叶斯技术估计所有未知参数来探索研究。利用极大似然和贝叶斯方法对模型参数进行估计。然后,根据观察到的Fisher信息矩阵推导出区间估计。贝叶斯估计是利用平方误差和线性指数损失函数得到的。随后也构造了最高后验密度区间。我们通过模拟研究检验了所有估计器的效率。最后,我们提供了一个现实生活中的例子来支持所考虑的模型。
{"title":"Inference for Simple Step Stress Accelerated Life Test Model Under Progressively Censored Gompertz Data","authors":"Rajat Das,&nbsp;Yogesh Mani Tripathi,&nbsp;Liang Wang,&nbsp;Shuo-Jye Wu","doi":"10.1002/asmb.70037","DOIUrl":"https://doi.org/10.1002/asmb.70037","url":null,"abstract":"<div>\u0000 \u0000 <p>In this article analysis of a simple step-stress accelerated life test is considered under progressive type-II censoring. A cumulative exposure model is considered when the latent lifetimes of test units follow the Gompertz distribution with different shape parameters and a common scale parameter. We explore the study by estimating all unknown parameters using classical and Bayesian techniques. The model parameters are estimated using maximum likelihood and Bayesian methods. Subsequently, interval estimates are derived based on the observed Fisher information matrix. Bayesian estimates are obtained using squared error and linear exponential loss functions. Subsequently highest posterior density intervals are also constructed. We examine the efficiency of all estimators through simulation studies. Finally, we provide a real-life example in support of the considered model.</p>\u0000 </div>","PeriodicalId":55495,"journal":{"name":"Applied Stochastic Models in Business and Industry","volume":"41 5","pages":""},"PeriodicalIF":1.5,"publicationDate":"2025-08-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144897464","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Applied Stochastic Models in Business and Industry
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1