Asta-Advances in Statistical Analysis最新文献

英文中文

A note on dynamic spatiotemporal ARCH models: small- and large-sample results 动态时空ARCH模型注释：小样本和大样本结果

IF 1.4 4区数学 Q2 STATISTICS & PROBABILITY

Asta-Advances in Statistical Analysis

Pub Date : 2025-12-19 DOI: 10.1007/s10182-025-00552-3

Philipp Otto, Osman Doğan, Süleyman Taşpınar

This short paper explores the estimation of a dynamic spatiotemporal autoregressive conditional heteroscedasticity (ARCH) model. The log-volatility term in this model can depend on (i) the spatial lag of the log-squared outcome variable, (ii) the time-lag of the log-squared outcome variable, (iii) the spatiotemporal lag of the log-squared outcome variable, (iv) exogenous variables, and (v) the unobserved heterogeneity across regions and time, i.e., the regional and time fixed effects. We examine the small- and large-sample properties of two quasi-maximum likelihood estimators and a generalised method of moments estimator for this model. We first summarize the theoretical properties of these estimators and then compare their finite sample properties through Monte Carlo simulations.

本文探讨了动态时空自回归条件异方差（ARCH）模型的估计。该模型中的对数波动项取决于(i)对数平方结果变量的空间滞后，（ii）对数平方结果变量的时间滞后，（iii）对数平方结果变量的时空滞后，（iv）外生变量，以及(v)跨区域和时间的未观察到的异质性，即区域和时间固定效应。研究了两个拟极大似然估计量的小样本和大样本性质，并给出了该模型的广义矩估计方法。我们首先总结了这些估计器的理论性质，然后通过蒙特卡罗模拟比较了它们的有限样本性质。

引用次数: 0

Advances in spatial econometrics and geostatistics: methods, theory, and applications 空间计量经济学与地质统计学进展：方法、理论与应用

IF 1.4 4区数学 Q2 STATISTICS & PROBABILITY

Asta-Advances in Statistical Analysis

Pub Date : 2025-12-12 DOI: 10.1007/s10182-025-00551-4

Philipp Otto, Janine Illian

引用次数: 0

Fuzzy C-modes clustering with spatial regularization and noise cluster 基于空间正则化和噪声聚类的模糊c模聚类

IF 1.4 4区数学 Q2 STATISTICS & PROBABILITY

Asta-Advances in Statistical Analysis

Pub Date : 2025-11-21 DOI: 10.1007/s10182-025-00547-0

Pierpaolo D’Urso, Livia De Giovanni, Lorenzo Federico, Vincenzina Vitale

Clustering categorical data presents unique challenges that traditional techniques do not adequately address. This paper proposes an extension of the fuzzy C-modes algorithm. By incorporating a noise cluster and integrating spatial contiguity relationships among units, the algorithm’s robustness is significantly enhanced. Performance evaluations using synthetic data demonstrate the efficacy of the proposed algorithm in handling both global and local outliers. Furthermore, the paper discusses the application of the algorithm to real-world data on sustainable urban mobility in the Italian provincial capitals during 2021, highlighting its practical relevance and potential impact in real-world scenarios.

聚类分类数据提出了传统技术无法充分解决的独特挑战。本文提出了模糊c模算法的一种扩展。通过引入噪声聚类并整合单元间的空间连续关系，显著增强了算法的鲁棒性。使用综合数据的性能评估证明了该算法在处理全局和局部异常值方面的有效性。此外，本文还讨论了该算法在2021年意大利省会城市可持续城市交通的实际数据中的应用，强调了其在现实场景中的实际相关性和潜在影响。

引用次数: 0

Machine Learning Approach for Analyzing Mixed Case Interval Censored Data with a Cured Subgroup. 带有治愈子群的混合案例区间截尾数据分析的机器学习方法。

IF 1.4 4区数学 Q2 STATISTICS & PROBABILITY

Asta-Advances in Statistical Analysis

Pub Date : 2025-10-06 DOI: 10.1007/s10182-025-00544-3

Wisdom Aselisewine, Suvra Pal

We introduce a novel two-component framework for analyzing mixed case interval censored (MCIC) data featuring a cured subgroup. In such data, the time-to-event is known only within certain intervals determined by multiple random examination time points. Moreover, a portion of the subjects will never experience the event. The first component of our model focuses on estimating the likelihood of being cured (incidence), departing from the conventional generalized linear model to adopt a more adaptable support vector machine (SVM) approach capable of accommodating complex or non-linear covariate effects. The second component addresses the survival distribution of the uncured individuals (latency) and employs a Cox proportional hazards structure to maintain the straightforward interpretation of covariate effects. We develop an expectation maximization algorithm, incorporating the Platt scaling method, to estimate the probability of being cured. Our simulation study demonstrates that our model outperforms both logit-based and spline-based models in capturing complex classification boundaries, leading to more accurate estimates of cured/uncured probabilities and enhanced predictive accuracy for cure. We emphasize that enhancing the estimation accuracy regarding incidence subsequently improves the estimation outcomes concerning latency. Finally, we illustrate the efficacy of our methodology by applying it to the NASA's Hypobaric Decompression Sickness Data.

我们引入了一种新的双分量框架来分析具有治愈子群的混合病例间隔截除（MCIC）数据。在这些数据中，事件发生的时间仅在由多个随机检查时间点确定的一定间隔内已知。此外，一部分受试者将永远不会经历该事件。我们模型的第一个组成部分侧重于估计治愈的可能性（发病率），与传统的广义线性模型不同，采用适应性更强的支持向量机（SVM）方法，能够适应复杂或非线性协变量效应。第二个组成部分涉及未治愈个体的生存分布（潜伏期），并采用Cox比例风险结构来保持对协变量效应的直接解释。我们开发了一种期望最大化算法，结合Platt缩放法来估计治愈的概率。我们的模拟研究表明，我们的模型在捕获复杂分类边界方面优于基于对数和基于样条的模型，从而更准确地估计治愈/未治愈的概率，并提高了治愈的预测精度。我们强调，提高关于发生率的估计精度随后会改善关于延迟的估计结果。最后，我们通过将其应用于NASA的低压减压病数据来说明我们的方法的有效性。

{"title":"Machine Learning Approach for Analyzing Mixed Case Interval Censored Data with a Cured Subgroup.","authors":"Wisdom Aselisewine, Suvra Pal","doi":"10.1007/s10182-025-00544-3","DOIUrl":"10.1007/s10182-025-00544-3","url":null,"abstract":"<p><p>We introduce a novel two-component framework for analyzing mixed case interval censored (MCIC) data featuring a cured subgroup. In such data, the time-to-event is known only within certain intervals determined by multiple random examination time points. Moreover, a portion of the subjects will never experience the event. The first component of our model focuses on estimating the likelihood of being cured (incidence), departing from the conventional generalized linear model to adopt a more adaptable support vector machine (SVM) approach capable of accommodating complex or non-linear covariate effects. The second component addresses the survival distribution of the uncured individuals (latency) and employs a Cox proportional hazards structure to maintain the straightforward interpretation of covariate effects. We develop an expectation maximization algorithm, incorporating the Platt scaling method, to estimate the probability of being cured. Our simulation study demonstrates that our model outperforms both logit-based and spline-based models in capturing complex classification boundaries, leading to more accurate estimates of cured/uncured probabilities and enhanced predictive accuracy for cure. We emphasize that enhancing the estimation accuracy regarding incidence subsequently improves the estimation outcomes concerning latency. Finally, we illustrate the efficacy of our methodology by applying it to the NASA's Hypobaric Decompression Sickness Data.</p>","PeriodicalId":55446,"journal":{"name":"Asta-Advances in Statistical Analysis","volume":" ","pages":""},"PeriodicalIF":1.4,"publicationDate":"2025-10-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12514071/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145281887","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Testing for causal effect for binary data when propensity scores are estimated through Bayesian Networks 通过贝叶斯网络估计倾向分数时，二元数据的因果效应检验

IF 1.4 4区数学 Q2 STATISTICS & PROBABILITY

Asta-Advances in Statistical Analysis

Pub Date : 2025-08-28 DOI: 10.1007/s10182-025-00535-4

Paola Vicard, Paola Maria Vittoria Rancoita, Federica Cugnata, Alberto Briganti, Fulvia Mecatti, Clelia Di Serio, Pier Luigi Conti

This paper proposes a new statistical approach for assessing treatment effect using Bayesian Networks (BNs). The goal is to draw causal inferences from observational data with a binary outcome and discrete covariates. The BNs are here used to estimate the propensity score, which enables flexible modeling and ensures maximum likelihood properties. When the propensity score is estimated by BNs, two point estimators are considered—Hájek and Horvitz–Thompson—based on inverse probability weighting, and their main distributional properties are derived for constructing confidence intervals and testing hypotheses about the absence of the treatment effect. Empirical evidence is presented to show the good behavior of the proposed methodology through a simulation study mimicking the characteristics of a real dataset of prostate cancer patients from Milan San Raffaele Hospital.

本文提出了一种利用贝叶斯网络（BNs）评价治疗效果的统计方法。目的是从具有二元结果和离散协变量的观测数据中得出因果推论。bn在这里用于估计倾向得分，这使得建模灵活，并确保最大似然属性。当使用bp估计倾向得分时，两个点估计是considered-Hájek和基于逆概率加权的horvitz - thompson，并推导了它们的主要分布性质，用于构建治疗效果缺失的置信区间和检验假设。通过模拟米兰圣拉斐尔医院前列腺癌患者真实数据集的特征，提出了经验证据，以显示所提出的方法的良好行为。

引用次数: 0

Basketball players performance measurement with algorithmic survival data analysis 基于算法的篮球运动员生存数据分析

IF 1.4 4区数学 Q2 STATISTICS & PROBABILITY

Asta-Advances in Statistical Analysis

Pub Date : 2025-08-28 DOI: 10.1007/s10182-025-00533-6

Ambra Macis

Performance measurement is of paramount importance in the context of sports analytics. A great variety of data analysis methods has been exploited to this aim. All these proposals almost never include resorting to survival analysis techniques, although time-to-event data are suitable for addressing this issue. This work aims to identify the main achievements of a National Basketball Association player that affect the time it takes for him to exceed a given threshold of points. In order to identify nonlinear effects and possible interactions among the predictors, the analysis is carried out with machine learning methods, specifically survival trees and random survival forests.

在体育分析的背景下，绩效衡量是至关重要的。各种各样的数据分析方法已经被用来达到这个目的。所有这些建议几乎都不包括诉诸生存分析技术，尽管时间到事件数据适合解决这个问题。这项工作旨在确定nba球员的主要成就，这些成就会影响他超过给定分数门槛所需的时间。为了识别非线性效应和预测因子之间可能的相互作用，使用机器学习方法进行分析，特别是生存树和随机生存森林。

引用次数: 0

Forecasting time series by long-memory models for count data with an application to price jumps 用长记忆模型预测计数数据的时间序列，并应用于价格跳跃

IF 1.4 4区数学 Q2 STATISTICS & PROBABILITY

Asta-Advances in Statistical Analysis

Pub Date : 2025-08-13 DOI: 10.1007/s10182-025-00538-1

Luisa Bisaglia, Massimiliano Caporin, Matteo Grigoletto

We discuss the estimation and forecast of long-memory models for count data time series. We first demonstrate by Monte Carlo simulations that the Whittle estimator is the most appropriate for recovering the memory degree of a count data time series. In the following, we introduce the possibility of forecasting count data by exploiting the infinite autoregressive representation of the model. We complete our analysis with an empirical example in which we verify the predictability of the price jump numbers.

讨论了计数数据时间序列的长记忆模型的估计和预测。我们首先通过蒙特卡罗模拟证明了Whittle估计器最适合于恢复计数数据时间序列的内存程度。在下文中，我们介绍了利用模型的无限自回归表示来预测计数数据的可能性。我们用一个经验例子来完成我们的分析，在这个例子中，我们验证了价格跳跃数字的可预测性。

引用次数: 0

Correction: Fuzzy group fixed-effects estimation with spatial clustering 修正：空间聚类模糊群固定效应估计

IF 1.4 4区数学 Q2 STATISTICS & PROBABILITY

Asta-Advances in Statistical Analysis

Pub Date : 2025-07-07 DOI: 10.1007/s10182-025-00532-7

Roy Cerqueti, Pierpaolo D’Urso, Raffaele Mattera

引用次数: 0

Functional data analysis for wearable sensor data: a systematic review 可穿戴传感器数据的功能数据分析：系统综述

IF 1.4 4区数学 Q2 STATISTICS & PROBABILITY

Asta-Advances in Statistical Analysis

Pub Date : 2025-06-18 DOI: 10.1007/s10182-025-00531-8

Nihan Acar-Denizli, Pedro Delicado

Wearable devices and sensors have recently become a popular way to collect data, especially in the health sciences. The use of sensors allows patients to be monitored over a period of time with a high observation frequency. Due to the continuous-on-time structure of the data, novel statistical methods are recommended for the analysis of sensor data. One of the popular approaches in the analysis of wearable sensor data is functional data analysis. The main objective of this paper is to review functional data analysis methods applied to wearable device data according to the type of sensor. In addition, we introduce several freely available software packages and open databases of wearable device data to facilitate access to sensor data in different fields.

可穿戴设备和传感器最近已经成为一种流行的数据收集方式，特别是在健康科学领域。传感器的使用使患者能够在一段时间内以高观察频率进行监测。由于数据的连续准时结构，建议采用新的统计方法对传感器数据进行分析。分析可穿戴传感器数据的常用方法之一是功能数据分析。本文的主要目的是根据传感器的类型，综述应用于可穿戴设备数据的功能数据分析方法。此外，我们引入了几个免费的软件包和开放的可穿戴设备数据数据库，以方便访问不同领域的传感器数据。

引用次数: 0

Anisotropic local covariance matrices for spatial blind source separation 空间盲源分离的各向异性局部协方差矩阵

IF 1.4 4区数学 Q2 STATISTICS & PROBABILITY

Asta-Advances in Statistical Analysis

Pub Date : 2025-06-10 DOI: 10.1007/s10182-025-00529-2

Christoph Muehlmann, Claudia Cappello, Sandra De Iaco, Klaus Nordhausen

This paper aims to introduce a novel approach to spatial blind source separation (SBSS) that addresses the limitations of existing methods. Current SBSS techniques rely on the joint diagonalization of multiple local covariance functions, all of which assume isotropy. To overcome this constraint, anisotropic local covariance matrices that relax the isotropy assumption are proposed. A simulation study and an application on real-world data demonstrate the performance improvement obtained by incorporating these anisotropic covariance matrices into the SBSS framework and highlight the potential of this new approach for more accurate and flexible source separation in spatial data analysis.

本文旨在介绍一种新的空间盲源分离方法，以解决现有方法的局限性。当前的SBSS技术依赖于多个局部协方差函数的联合对角化，这些协方差函数都假设各向同性。为了克服这一限制，提出了放宽各向异性假设的各向异性局部协方差矩阵。仿真研究和在实际数据中的应用表明，将这些各向异性协方差矩阵纳入SBSS框架可以提高性能，并突出了这种新方法在空间数据分析中更加准确和灵活的源分离的潜力。

引用次数: 0

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

Asta-Advances in Statistical Analysis

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀