Statistics Surveys: Latest Articles

White noise testing for functional time series

IF 3.3 Q1 STATISTICS & PROBABILITY

Statistics Surveys

Pub Date : 2023-01-01 DOI: 10.1214/23-ss143

Mihyun Kim, P. Kokoszka, Gregory Rice

Citations: 0

Spline local basis methods for nonparametric density estimation

Q1 STATISTICS & PROBABILITY

Statistics Surveys

Pub Date : 2023-01-01 DOI: 10.1214/23-ss142

J. Lars Kirkby, Álvaro Leitao, Duy Nguyen

This work reviews the literature on spline local basis methods for non-parametric density estimation. Particular attention is paid to B-spline density estimators which have experienced recent advances in both theory and methodology. These estimators occupy a very interesting space in statistics, which lies aptly at the cross-section of numerous statistical frameworks. New insights, experiments, and analyses are presented to cast the various estimation concepts in a unified context, while parallels and contrasts are drawn to the more familiar contexts of kernel density estimation. Unlike kernel density estimation, the study of local basis estimation is not yet fully mature, and this work also aims to highlight the gaps in existing literature which merit further investigation.

Citations: 1

Core-periphery structure in networks: A statistical exposition

IF 3.3 Q1 STATISTICS & PROBABILITY

Statistics Surveys

Pub Date : 2022-02-09 DOI: 10.1214/23-ss141

Eric Yanchenko, Srijan Sengupta

Many real-world networks are theorized to have core-periphery structure consisting of a densely-connected core and a loosely-connected periphery. While this phenomenon has been extensively studied in a range of scientific disciplines, it has not received sufficient attention in the statistics community. In this expository article, our goal is to raise awareness about this topic and encourage statisticians to address the many open inference problems in this area. To this end, we first summarize the current research landscape by reviewing the metrics and models that have been used for quantitative studies on core-periphery structure. Next, we formulate and explore various inferential problems in this context, such as estimation, hypothesis testing, and Bayesian inference, and discuss related computational techniques. We also outline the multidisciplinary scientific impact of core-periphery structure in a number of real-world networks. Throughout the article, we provide our own interpretation of the literature from a statistical perspective, with the goal of prioritizing open problems where contribution from the statistics community will be most effective and important.

Citations: 7

Central subspaces review: methods and applications

IF 3.3 Q1 STATISTICS & PROBABILITY

Statistics Surveys

Pub Date : 2022-01-01 DOI: 10.1214/22-ss138

Sabrina A. Rodrigues, Richard Huggins, B. Liquet

Citations: 0

A brief and understandable guide to pseudo-random number generators and specific models for security

IF 3.3 Q1 STATISTICS & PROBABILITY

Statistics Surveys

Pub Date : 2022-01-01 DOI: 10.1214/22-ss136

Elena Almaraz Luengo

The generation of random sequences is the basis of simulation and can be used in many different areas such as Statistics, Computer Science, Systems Management and Control, Biology, Particle Physics, Cryptography or Cyber-Security, among others. It is crucial that the numbers generated are random or, at least, behave as such. The fundamental statistical properties required for such sequences are randomness and independence and, from a cryptographic perspective, unpredictability. There is a variety of methods to generate these sequences. The main ones are physical and arithmetic methods. In this work, a detailed study of the main arithmetic methods is carried out. In addition, the necessity of secure sequence generation is analyzed, and new lines of ongoing research focusing on applications in the Internet of Things and new generator designs are described.
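
As a rough illustration of what an arithmetic method looks like, the sketch below implements a linear congruential generator, one of the simplest generators of this kind. It is not taken from the survey: the function names and the constants `A`, `C`, `M` are assumptions chosen for illustration (a commonly quoted 32-bit parameter set). The last function shows why such a generator falls short of the unpredictability requirement mentioned in the abstract.

```python
from itertools import islice

# Assumed illustrative constants (a commonly quoted 32-bit parameter set).
A, C, M = 1664525, 1013904223, 2**32


def lcg(seed):
    """Yield successive integer states x_{n+1} = (A * x_n + C) mod M."""
    state = seed % M
    while True:
        state = (A * state + C) % M
        yield state


def take_uniform(seed, n):
    """Return n pseudo-random floats in [0, 1) produced by the generator."""
    return [x / M for x in islice(lcg(seed), n)]


def predict_next(observed_state):
    """Compute the state that follows an observed one.

    Anyone who knows the constants and sees a single state can do this,
    so the sequence is reproducible but not cryptographically secure.
    """
    return (A * observed_state + C) % M
```

Calling `take_uniform(42, 5)` twice returns the same five numbers, which is convenient for reproducible simulation; the same determinism is what `predict_next` exploits.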

Citations: 1

Post-model-selection inference in linear regression models: An integrated review

IF 3.3 Q1 STATISTICS & PROBABILITY

Statistics Surveys

Pub Date : 2022-01-01 DOI: 10.1214/22-ss135

Dongliang Zhang, Abbas Khalili, M. Asgharian

The research on statistical inference after data-driven model selection can be traced as far back as Koopmans (1949). The intensive research on modern model selection methods for high-dimensional data over the past three decades revived the interest in statistical inference after model selection. In recent years, there has been a surge of articles on statistical inference after model selection and now a rather vast literature exists on this topic. Our manuscript aims at presenting a holistic review of post-model-selection inference in linear regression models, while also incorporating perspectives from high-dimensional inference in these models. We first give a simulated example motivating the necessity for valid statistical inference after model selection. We then provide theoretical insights explaining the phenomena observed in the example. This is done through a literature survey on the post-selection sampling distribution of regression parameter estimators and properties of coverage probabilities of naïve confidence intervals. Categorized according to two types of estimation targets, namely the population- and projection-based regression coefficients, we present a review of recent uncertainty assessment methods. We also discuss possible pros and cons for the confidence intervals constructed by different methods. MSC2020 subject classifications: Primary 62F25; secondary 62J07.

Citations: 12

Kronecker-structured covariance models for multiway data

IF 3.3 Q1 STATISTICS & PROBABILITY

Statistics Surveys

Pub Date : 2022-01-01 DOI: 10.1214/22-ss139

Yu Wang, Zeyu Sun, Dogyoon Song, A. Hero

Many applications produce multiway data of exceedingly high dimension. Modeling such multiway data is important in multichannel signal and video processing, where sensors produce multi-indexed data, e.g. over spatial, frequency, and temporal dimensions. We will address the challenges of covariance representation of multiway data and review some of the progress in statistical modeling of multiway covariance over the past two decades, focusing on tensor-valued covariance models and their inference. We will illustrate through a space weather application: predicting the evolution of solar active regions over time.
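
To make the Kronecker structure named in the title concrete, here is a minimal sketch for the two-way (matrix-valued) case, assuming the covariance of the vectorized data factors into a Kronecker product of two small per-mode covariance matrices. The helper names `kron` and `n_free`, and the example matrices, are hypothetical and not taken from the survey. The parameter counts hint at why this structure helps in high dimensions.

```python
def kron(a, b):
    """Kronecker product of two matrices given as lists of lists."""
    return [
        [a_ij * b_kl for a_ij in row_a for b_kl in row_b]
        for row_a in a
        for row_b in b
    ]


def n_free(p):
    """Number of free entries in a symmetric p x p matrix."""
    return p * (p + 1) // 2


# Assumed toy covariances for two modes of size 2 each.
cov_mode_1 = [[2, 1], [1, 2]]
cov_mode_2 = [[1.0, 0.5], [0.5, 1.0]]

# Covariance of the vectorized 2 x 2 data under the Kronecker assumption.
sigma = kron(cov_mode_1, cov_mode_2)

# Parameter counts for modes of size 10 and 20: unstructured versus
# Kronecker-structured (the latter counted up to a shared scale factor).
full_params = n_free(10 * 20)
kron_params = n_free(10) + n_free(20)
```

With modes of size 10 and 20, the unstructured covariance has 20100 free entries, while the two Kronecker factors together have 265.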

Citations: 3

General-purpose imputation of planned missing data in social surveys: Different strategies and their effect on correlations

IF 3.3 Q1 STATISTICS & PROBABILITY

Statistics Surveys

Pub Date : 2022-01-01 DOI: 10.1214/22-ss137

Julian B. Axenfeld, Christiane Bruch, C. Wolf

Citations: 3

Nested sampling methods

IF 3.3 Q1 STATISTICS & PROBABILITY

Statistics Surveys

Pub Date : 2021-01-24 DOI: 10.1214/23-SS144

J. Buchner

Nested sampling (NS) computes parameter posterior distributions and makes Bayesian model comparison computationally feasible. Its strengths are the unsupervised navigation of complex, potentially multi-modal posteriors until a well-defined termination point. A systematic literature review of nested sampling algorithms and variants is presented. We focus on complete algorithms, including solutions to likelihood-restricted prior sampling, parallelisation, termination and diagnostics. The relation between number of live points, dimensionality and computational cost is studied for two complete algorithms. A new formulation of NS is presented, which casts the parameter space exploration as a search on a tree data structure. Previously published ways of obtaining robust error estimates and dynamic variations of the number of live points are presented as special cases of this formulation. A new online diagnostic test is presented based on previous insertion rank order work. The survey of nested sampling methods concludes with outlooks for future research.
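
The basic loop behind nested sampling can be sketched on a toy problem. The example below is an illustration under stated assumptions, not one of the algorithms reviewed in the survey: it assumes a uniform prior on [-1, 1], a Gaussian-shaped likelihood of width `SIGMA`, plain rejection sampling for the likelihood-restricted prior draw, the standard expected volume shrinkage exp(-i / n_live), and a fixed iteration count in place of a proper termination criterion. All names are hypothetical.

```python
import math
import random

SIGMA = 0.1  # assumed width of the toy likelihood


def log_like(theta):
    """Toy log-likelihood, peaked at theta = 0."""
    return -0.5 * (theta / SIGMA) ** 2


def true_log_z():
    """Analytic log-evidence for the toy problem, used only as a check."""
    z = 0.5 * SIGMA * math.sqrt(2 * math.pi) * math.erf(1 / (SIGMA * math.sqrt(2)))
    return math.log(z)


def nested_sampling(n_live=200, n_iter=1200, seed=0):
    """Return an estimate of log Z for the toy problem."""
    rng = random.Random(seed)
    live = [rng.uniform(-1.0, 1.0) for _ in range(n_live)]
    z = 0.0       # running evidence estimate
    x_prev = 1.0  # prior volume enclosed by the current likelihood contour
    for i in range(1, n_iter + 1):
        # Remove the live point with the lowest likelihood.
        worst = min(range(n_live), key=lambda k: log_like(live[k]))
        l_star = log_like(live[worst])
        x_i = math.exp(-i / n_live)  # expected remaining prior volume
        z += math.exp(l_star) * (x_prev - x_i)
        x_prev = x_i
        # Likelihood-restricted prior sampling by rejection:
        # redraw from the prior until the likelihood exceeds the threshold.
        while True:
            candidate = rng.uniform(-1.0, 1.0)
            if log_like(candidate) > l_star:
                live[worst] = candidate
                break
    # The remaining live points share the leftover prior volume.
    z += x_prev * sum(math.exp(log_like(t)) for t in live) / n_live
    return math.log(z)
```

Rejection sampling from the full prior becomes very inefficient as the enclosed volume shrinks, which is one reason the restricted-sampling step receives so much attention; the estimate can be compared against `true_log_z()` for this toy case.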

Citations: 33

A review of uncertainty quantification for density estimation

IF 3.3 Q1 STATISTICS & PROBABILITY

Statistics Surveys

Pub Date : 2021-01-01 DOI: 10.1214/21-SS130

Shaun McDonald, D. Campbell

It is often useful to conduct inference for probability densities by constructing “plausible” sets in which the unknown density of given data may lie. Examples of such sets include pointwise intervals, simultaneous bands, or balls in a function space, and they may be frequentist or Bayesian in interpretation. For almost any density estimator, there are multiple approaches to inference available in the literature. Here we review such literature, providing a thorough overview of existing methods for density uncertainty quantification. The literature considered here comprises a spectrum from theoretical to practical ideas, and for some methods there is little commonality between these two extremes. After detailing some of the key concepts of nonparametric inference – the different types of “plausible” sets, and their interpretation and behaviour – we list the most prominent density estimators and the corresponding uncertainty quantification methods for each.

Citations: 2
