首页 > 最新文献

International Statistical Review最新文献

英文 中文
Wavelets from a Statistical Perspective Maarten Jansen Chapman and Hall/CRC, 2022, xix + 325 pages, $96 (hardcover) ISBN: 978-1-032-20067-5 (hardcover) Maarten Jansen Chapman and Hall/CRC, 2022, 19 + 325页,96美元(精装)ISBN: 978-1-032-20067-5(精装)
IF 2 3区 数学 Q1 Mathematics Pub Date : 2022-07-18 DOI: 10.1111/insr.12515
Krzysztof Podgórski
{"title":"Wavelets from a Statistical Perspective Maarten Jansen Chapman and Hall/CRC, 2022, xix + 325 pages, $96 (hardcover) ISBN: 978-1-032-20067-5 (hardcover)","authors":"Krzysztof Podgórski","doi":"10.1111/insr.12515","DOIUrl":"10.1111/insr.12515","url":null,"abstract":"","PeriodicalId":14479,"journal":{"name":"International Statistical Review","volume":null,"pages":null},"PeriodicalIF":2.0,"publicationDate":"2022-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"44357863","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Are You All Normal? It Depends! 你一切正常吗?视情况而定!
IF 2 3区 数学 Q1 Mathematics Pub Date : 2022-07-07 DOI: 10.1111/insr.12512
Wanfang Chen, Marc G. Genton

The assumption of normality has underlain much of the development of statistics, including spatial statistics, and many tests have been proposed. In this work, we focus on the multivariate setting and first review the recent advances in multivariate normality tests for i.i.d. data, with emphasis on the skewness and kurtosis approaches. We show through simulation studies that some of these tests cannot be used directly for testing normality of spatial data. We further review briefly the few existing univariate tests under dependence (time or space), and then propose a new multivariate normality test for spatial data by accounting for the spatial dependence. The new test utilises the union-intersection principle to decompose the null hypothesis into intersections of univariate normality hypotheses for projection data, and it rejects the multivariate normality if any individual hypothesis is rejected. The individual hypotheses for univariate normality are conducted using a Jarque–Bera type test statistic that accounts for the spatial dependence in the data. We also show in simulation studies that the new test has a good control of the type I error and a high empirical power, especially for large sample sizes. We further illustrate our test on bivariate wind data over the Arabian Peninsula.

正态性假设是包括空间统计在内的许多统计学发展的基础,并提出了许多检验。在这项工作中,我们关注多变量设置,并首先回顾了i.i.d数据的多变量正态性检验的最新进展,重点是偏度和峰度方法。我们通过模拟研究表明,其中一些测试不能直接用于测试空间数据的正态性。在此基础上,我们进一步回顾了现有的几种基于时间或空间相关性的单变量检验方法,并提出了一种考虑空间相关性的空间数据多元正态性检验方法。新的检验利用并交原理将零假设分解为投影数据的单变量正态性假设的交叉点,如果任何单个假设被拒绝,它会拒绝多元正态性。单变量正态性的个体假设使用Jarque-Bera型检验统计量进行,该统计量考虑了数据中的空间依赖性。我们还在仿真研究中表明,新的测试具有良好的I型误差控制和高经验功率,特别是对于大样本量。我们进一步说明了我们对阿拉伯半岛二元风数据的测试。
{"title":"Are You All Normal? It Depends!","authors":"Wanfang Chen,&nbsp;Marc G. Genton","doi":"10.1111/insr.12512","DOIUrl":"10.1111/insr.12512","url":null,"abstract":"<div>\u0000 \u0000 <p>The assumption of normality has underlain much of the development of statistics, including spatial statistics, and many tests have been proposed. In this work, we focus on the multivariate setting and first review the recent advances in multivariate normality tests for i.i.d. data, with emphasis on the skewness and kurtosis approaches. We show through simulation studies that some of these tests cannot be used directly for testing normality of spatial data. We further review briefly the few existing univariate tests under dependence (time or space), and then propose a new multivariate normality test for spatial data by accounting for the spatial dependence. The new test utilises the union-intersection principle to decompose the null hypothesis into intersections of univariate normality hypotheses for projection data, and it rejects the multivariate normality if any individual hypothesis is rejected. The individual hypotheses for univariate normality are conducted using a Jarque–Bera type test statistic that accounts for the spatial dependence in the data. We also show in simulation studies that the new test has a good control of the type I error and a high empirical power, especially for large sample sizes. We further illustrate our test on bivariate wind data over the Arabian Peninsula.</p>\u0000 </div>","PeriodicalId":14479,"journal":{"name":"International Statistical Review","volume":null,"pages":null},"PeriodicalIF":2.0,"publicationDate":"2022-07-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"48771273","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Survival Modelling for Data From Combined Cohorts: Opening the Door to Meta Survival Analyses and Survival Analysis Using Electronic Health Records 合并队列数据的生存建模:打开Meta生存分析和使用电子健康记录进行生存分析的大门
IF 2 3区 数学 Q1 Mathematics Pub Date : 2022-06-16 DOI: 10.1111/insr.12510
James H. McVittie, Ana F. Best, David B. Wolfson, David A. Stephens, Julian Wolfson, David L. Buckeridge, Shahinaz M. Gadalla

Non-parametric estimation of the survival function using observed failure time data depends on the underlying data generating mechanism, including the ways in which the data may be censored and/or truncated. For data arising from a single source or collected from a single cohort, a wide range of estimators have been proposed and compared in the literature. Often, however, it may be possible, and indeed advantageous, to combine and then analyse survival data that have been collected under different study designs. We review non-parametric survival analysis for data obtained by combining the most common types of cohort. We have two main goals: (i) to clarify the differences in the model assumptions and (ii) to provide a single lens through which some of the proposed estimators may be viewed. Our discussion is relevant to the meta-analysis of survival data obtained from different types of study, and to the modern era of electronic health records.

使用观察到的故障时间数据对生存函数进行非参数估计取决于底层数据生成机制,包括数据可能被删节和/或截断的方式。对于来自单一来源或从单一队列收集的数据,文献中已经提出并比较了各种估计量。然而,通常情况下,将不同研究设计下收集到的生存数据进行合并和分析是可能的,而且确实是有利的。我们回顾了通过结合最常见的队列类型获得的数据的非参数生存分析。我们有两个主要目标:(i)澄清模型假设中的差异,(ii)提供一个单一的视角,通过这个视角可以查看一些建议的估计器。我们的讨论与从不同类型的研究中获得的生存数据的荟萃分析以及电子健康记录的现代时代有关。
{"title":"Survival Modelling for Data From Combined Cohorts: Opening the Door to Meta Survival Analyses and Survival Analysis Using Electronic Health Records","authors":"James H. McVittie,&nbsp;Ana F. Best,&nbsp;David B. Wolfson,&nbsp;David A. Stephens,&nbsp;Julian Wolfson,&nbsp;David L. Buckeridge,&nbsp;Shahinaz M. Gadalla","doi":"10.1111/insr.12510","DOIUrl":"10.1111/insr.12510","url":null,"abstract":"<div>\u0000 \u0000 <p>Non-parametric estimation of the survival function using observed failure time data depends on the underlying data generating mechanism, including the ways in which the data may be censored and/or truncated. For data arising from a single source or collected from a single cohort, a wide range of estimators have been proposed and compared in the literature. Often, however, it may be possible, and indeed advantageous, to combine and then analyse survival data that have been collected under different study designs. We review non-parametric survival analysis for data obtained by combining the most common types of cohort. We have two main goals: (i) to clarify the differences in the model assumptions and (ii) to provide a single lens through which some of the proposed estimators may be viewed. Our discussion is relevant to the meta-analysis of survival data obtained from different types of study, and to the modern era of electronic health records.</p>\u0000 </div>","PeriodicalId":14479,"journal":{"name":"International Statistical Review","volume":null,"pages":null},"PeriodicalIF":2.0,"publicationDate":"2022-06-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1111/insr.12510","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9490735","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Scalable Bayesian Multiple Changepoint Detection via Auxiliary Uniformisation 通过辅助均匀化的可扩展贝叶斯多变更点检测
IF 2 3区 数学 Q1 Mathematics Pub Date : 2022-06-15 DOI: 10.1111/insr.12511
Lu Shaochuan

In this paper, we perform a sparse filtering recursion for efficient changepoint detection for discrete-time observations. We attach auxiliary event times to the chronologically ordered observations and formulate multiple changepoint problems of discrete-time observations into continuous-time observations. Ideally, both the computational and memory costs of the proposed auxiliary uniformisation forward-filtering backward-sampling algorithm can be quadratically scaled down to the number of changepoints instead of the number of observations, which would otherwise be prohibitive for a long sequence of observations. To avoid model bias, a time-varying changepoint recurrence rate across different segments is assumed to characterise diverse scales of run lengths of the changepoints. We demonstrate the methods through simulation studies and real data analysis.

在本文中,我们执行稀疏滤波递归来有效地检测离散时间观测的变化点。我们将辅助事件时间附加到按时间顺序排序的观测中,并将离散时间观测的多个变点问题表述为连续时间观测。理想情况下,所提出的辅助均匀化前向滤波后向采样算法的计算和内存成本都可以二次缩小到变化点的数量,而不是观测值的数量,否则对于长序列的观测值来说,这将是令人难以接受的。为了避免模型偏差,我们假设不同时间段的时变变点复发率表征不同尺度的变点运行长度。通过仿真研究和实际数据分析,对该方法进行了验证。
{"title":"Scalable Bayesian Multiple Changepoint Detection via Auxiliary Uniformisation","authors":"Lu Shaochuan","doi":"10.1111/insr.12511","DOIUrl":"10.1111/insr.12511","url":null,"abstract":"<div>\u0000 \u0000 <p>In this paper, we perform a sparse filtering recursion for efficient changepoint detection for discrete-time observations. We attach auxiliary event times to the chronologically ordered observations and formulate multiple changepoint problems of discrete-time observations into continuous-time observations. Ideally, both the computational and memory costs of the proposed auxiliary uniformisation forward-filtering backward-sampling algorithm can be quadratically scaled down to the number of changepoints instead of the number of observations, which would otherwise be prohibitive for a long sequence of observations. To avoid model bias, a time-varying changepoint recurrence rate across different segments is assumed to characterise diverse scales of run lengths of the changepoints. We demonstrate the methods through simulation studies and real data analysis.</p>\u0000 </div>","PeriodicalId":14479,"journal":{"name":"International Statistical Review","volume":null,"pages":null},"PeriodicalIF":2.0,"publicationDate":"2022-06-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"46806947","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Diagnostic Tests for the Necessity of Weight in Regression With Survey Data 调查数据回归中权重必要性的诊断检验
IF 2 3区 数学 Q1 Mathematics Pub Date : 2022-06-09 DOI: 10.1111/insr.12509
Feng Wang, HaiYing Wang, Jun Yan

To weight or not to weight in regression analyses with survey data has been debated in the literature. The problem is essentially a tradeoff between the bias and the variance of the regression coefficient estimator. An array of diagnostic tests for informative weights have been developed. Nonetheless, studies comparing the performance of the tests, especially for finite samples, are scarce, and the theoretical equivalence of some tests has not been investigated. Focusing on the linear regression setting, we review a collection of such tests and propose enhanced versions of some of them that require an auxiliary regression model for the weight. Further, the equivalence of two popular tests is established which has not been reported before. In contrast to existing reviews with no empirical comparison, we compare the sizes and powers of the tests in simulation studies. The reviewed tests are applied to a regression analysis of the family expenditure using the data from the China Family Panel Study.

文献中对调查数据回归分析中的权重与否进行了争论。这个问题本质上是回归系数估计器的偏差和方差之间的权衡。已经开发了一系列用于信息权重的诊断测试。尽管如此,对测试性能的比较研究,特别是对有限样本的比较研究很少,而且一些测试的理论等效性也没有得到研究。专注于线性回归设置,我们回顾了一组这样的测试,并提出了其中一些测试的增强版本,这些测试需要权重的辅助回归模型。此外,建立了两种流行测试的等效性,这在以前是没有报道过的。与没有实证比较的现有综述相比,我们比较了模拟研究中测试的规模和能力。利用中国家庭小组研究的数据,将回顾性检验应用于家庭支出的回归分析。
{"title":"Diagnostic Tests for the Necessity of Weight in Regression With Survey Data","authors":"Feng Wang,&nbsp;HaiYing Wang,&nbsp;Jun Yan","doi":"10.1111/insr.12509","DOIUrl":"10.1111/insr.12509","url":null,"abstract":"<div>\u0000 \u0000 <p>To weight or not to weight in regression analyses with survey data has been debated in the literature. The problem is essentially a tradeoff between the bias and the variance of the regression coefficient estimator. An array of diagnostic tests for informative weights have been developed. Nonetheless, studies comparing the performance of the tests, especially for finite samples, are scarce, and the theoretical equivalence of some tests has not been investigated. Focusing on the linear regression setting, we review a collection of such tests and propose enhanced versions of some of them that require an auxiliary regression model for the weight. Further, the equivalence of two popular tests is established which has not been reported before. In contrast to existing reviews with no empirical comparison, we compare the sizes and powers of the tests in simulation studies. The reviewed tests are applied to a regression analysis of the family expenditure using the data from the China Family Panel Study.</p>\u0000 </div>","PeriodicalId":14479,"journal":{"name":"International Statistical Review","volume":null,"pages":null},"PeriodicalIF":2.0,"publicationDate":"2022-06-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"42527094","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
From Pareto to Weibull – A Constructive Review of Distributions on ℝ+ 从Pareto到Weibull——关于ℝ+
IF 2 3区 数学 Q1 Mathematics Pub Date : 2022-06-06 DOI: 10.1111/insr.12508
Corinne Sinner, Yves Dominicy, Julien Trufin, Wout Waterschoot, Patrick Weber, Christophe Ley

Power laws and power laws with exponential cut-off are two distinct families of distributions on the positive real half-line. In the present paper, we propose a unified treatment of both families by building a family of distributions that interpolates between them, which we call Interpolating Family (IF) of distributions. Our original construction, which relies on techniques from statistical physics, provides a connection for hitherto unrelated distributions like the Pareto and Weibull distributions, and sheds new light on them. The IF also contains several distributions that are neither of power law nor of power law with exponential cut-off type. We calculate quantile-based properties, moments and modes for the IF. This allows us to review known properties of famous distributions on + and to provide in a single sweep these characteristics for various less known (and new) special cases of our Interpolating Family.

幂律和具有指数截止的幂律是正实半线上的两个不同的分布族。在本文中,我们提出了一种统一的处理这两个族的方法,通过建立一个在它们之间插值的分布族,我们称之为插值分布族(IF)。我们最初的构建依赖于统计物理学的技术,为迄今为止不相关的分布(如帕累托分布和威布尔分布)提供了联系,并为它们提供了新的线索。IF还包含几个既不是幂律也不是幂律的指数截断型分布。我们计算IF的基于分位数的性质、矩和模式。这使我们能够回顾上著名分布的已知性质ℝ+ 并在一次扫描中为我们的插值族的各种鲜为人知(和新的)特殊情况提供这些特征。
{"title":"From Pareto to Weibull – A Constructive Review of Distributions on ℝ+","authors":"Corinne Sinner,&nbsp;Yves Dominicy,&nbsp;Julien Trufin,&nbsp;Wout Waterschoot,&nbsp;Patrick Weber,&nbsp;Christophe Ley","doi":"10.1111/insr.12508","DOIUrl":"10.1111/insr.12508","url":null,"abstract":"<div>\u0000 \u0000 <p>Power laws and power laws with exponential cut-off are two distinct families of distributions on the positive real half-line. In the present paper, we propose a unified treatment of both families by building a family of distributions that interpolates between them, which we call Interpolating Family (IF) of distributions. Our original construction, which relies on techniques from statistical physics, provides a connection for hitherto unrelated distributions like the Pareto and Weibull distributions, and sheds new light on them. The IF also contains several distributions that are neither of power law nor of power law with exponential cut-off type. We calculate quantile-based properties, moments and modes for the IF. This allows us to review known properties of famous distributions on \u0000<math>\u0000 <msup>\u0000 <mrow>\u0000 <mi>ℝ</mi>\u0000 </mrow>\u0000 <mrow>\u0000 <mo>+</mo>\u0000 </mrow>\u0000 </msup></math> and to provide in a single sweep these characteristics for various less known (and new) special cases of our Interpolating Family.</p>\u0000 </div>","PeriodicalId":14479,"journal":{"name":"International Statistical Review","volume":null,"pages":null},"PeriodicalIF":2.0,"publicationDate":"2022-06-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"46009017","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Using Survey Sampling Algorithms For Exact Inference in Logistic Regression 在Logistic回归中使用调查抽样算法进行精确推理
IF 2 3区 数学 Q1 Mathematics Pub Date : 2022-05-31 DOI: 10.1111/insr.12507
Louis-Paul Rivest, Serigne Abib Gaye

Several exact inference procedures for logistic regression require the simulation of a 0-1 dependent vector according to its conditional distribution, given the sufficient statistics for some nuisance parameters. This is viewed, in this work, as a sampling problem involving a population of n units, unequal selection probabilities and balancing constraints. The basis for this reformulation of exact inference is a proposition deriving the limit, as n goes to infinity, of the conditional distribution of the dependent vector given the logistic regression sufficient statistics. It is proposed to sample from this distribution using the cube sampling algorithm. The interest of this approach to exact inference is illustrated by tackling new problems. First it allows to carry out exact inference with continuous covariates. It is also useful for the investigation of a partial correlation between several 0-1 vectors. This is illustrated in an example dealing with presence-absence data in ecology.

逻辑回归的几个精确推理程序需要根据条件分布模拟0‐1相关向量,给定一些干扰参数的足够统计。在这项工作中,这被视为一个抽样问题,涉及n个单位的种群、不相等的选择概率和平衡约束。精确推理的这种重新表述的基础是一个命题,当n变为无穷大时,在给定逻辑回归充分统计量的情况下,推导出依赖向量的条件分布的极限。建议使用立方体采样算法对此分布进行采样。这种方法对精确推理的兴趣通过解决新问题来说明。首先,它允许用连续协变量进行精确推理。它也有助于研究几个0-1矢量之间的部分相关性。这在一个处理生态学中存在-不存在数据的例子中得到了说明。
{"title":"Using Survey Sampling Algorithms For Exact Inference in Logistic Regression","authors":"Louis-Paul Rivest,&nbsp;Serigne Abib Gaye","doi":"10.1111/insr.12507","DOIUrl":"10.1111/insr.12507","url":null,"abstract":"<div>\u0000 \u0000 <p>Several exact inference procedures for logistic regression require the simulation of a 0-1 dependent vector according to its conditional distribution, given the sufficient statistics for some nuisance parameters. This is viewed, in this work, as a sampling problem involving a population of \u0000<math>\u0000 <mi>n</mi></math> units, unequal selection probabilities and balancing constraints. The basis for this reformulation of exact inference is a proposition deriving the limit, as \u0000<math>\u0000 <mi>n</mi></math> goes to infinity, of the conditional distribution of the dependent vector given the logistic regression sufficient statistics. It is proposed to sample from this distribution using the cube sampling algorithm. The interest of this approach to exact inference is illustrated by tackling new problems. First it allows to carry out exact inference with continuous covariates. It is also useful for the investigation of a partial correlation between several 0-1 vectors. This is illustrated in an example dealing with presence-absence data in ecology.</p>\u0000 </div>","PeriodicalId":14479,"journal":{"name":"International Statistical Review","volume":null,"pages":null},"PeriodicalIF":2.0,"publicationDate":"2022-05-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"42562594","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Some Solutions Inspired by Survey Sampling Theory to Build Effective Clinical Trials 调查抽样理论对构建有效临床试验的启示
IF 2 3区 数学 Q1 Mathematics Pub Date : 2022-04-10 DOI: 10.1111/insr.12498
Yves Tillé
The organisation of a design of experiments, for example, for the realisation of a clinical trial, is crucial. It is often desirable to balance designs so that the means of the covariates are approximately the same in the test and control groups. In survey sampling theory, balanced sampling and calibration are two techniques that improve the precision of estimates. In this paper, we show the links between the two areas. We begin by assessing the gain in precision between a balanced design and a simple random sampling for the least squares estimators and the estimator by differences. We compare rerandomisation techniques and the cube method in order to balance the design. We propose a new method, particularly efficient, which combines the cube method with multivariate matching. A set of simulations is carried out in order to evaluate the different methods. The interest of the calibration is shown even if the design is almost balanced. It is thus shown that tools used by survey statisticians can be useful for experimental designs and clinical trials.
例如,为了实现临床试验,组织实验设计至关重要。通常需要平衡设计,以便在测试组和对照组中协变的平均值大致相同。在调查抽样理论中,平衡抽样和校准是提高估计精度的两种技术。在本文中,我们展示了这两个领域之间的联系。我们首先评估最小二乘估计量和差分估计量的平衡设计和简单随机抽样之间的精度增益。为了平衡设计,我们比较了重新绘制技术和立方体方法。我们提出了一种新的方法,特别有效,它将立方体方法与多元匹配相结合。为了评估不同的方法,进行了一组模拟。即使设计几乎是平衡的,也会显示出校准的兴趣。因此,调查统计学家使用的工具可以用于实验设计和临床试验。
{"title":"Some Solutions Inspired by Survey Sampling Theory to Build Effective Clinical Trials","authors":"Yves Tillé","doi":"10.1111/insr.12498","DOIUrl":"https://doi.org/10.1111/insr.12498","url":null,"abstract":"The organisation of a design of experiments, for example, for the realisation of a clinical trial, is crucial. It is often desirable to balance designs so that the means of the covariates are approximately the same in the test and control groups. In survey sampling theory, balanced sampling and calibration are two techniques that improve the precision of estimates. In this paper, we show the links between the two areas. We begin by assessing the gain in precision between a balanced design and a simple random sampling for the least squares estimators and the estimator by differences. We compare rerandomisation techniques and the cube method in order to balance the design. We propose a new method, particularly efficient, which combines the cube method with multivariate matching. A set of simulations is carried out in order to evaluate the different methods. The interest of the calibration is shown even if the design is almost balanced. It is thus shown that tools used by survey statisticians can be useful for experimental designs and clinical trials.","PeriodicalId":14479,"journal":{"name":"International Statistical Review","volume":null,"pages":null},"PeriodicalIF":2.0,"publicationDate":"2022-04-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"42827426","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Bias, Fairness and Accountability with Artificial Intelligence and Machine Learning Algorithms 偏见,公平和问责与人工智能和机器学习算法
IF 2 3区 数学 Q1 Mathematics Pub Date : 2022-04-10 DOI: 10.1111/insr.12492
Nengfeng Zhou, Zach Zhang, Vijayan N. Nair, Harsh Singhal, Jie Chen

The advent of artificial intelligence (AI) and machine learning algorithms has led to opportunities as well as challenges in their use. In this overview paper, we begin with a discussion of bias and fairness issues that arise with the use of AI techniques, with a focus on supervised machine learning algorithms. We then describe the types and sources of data bias and discuss the nature of algorithmic unfairness. In addition, we provide a review of fairness metrics in the literature, discuss their limitations, and describe de-biasing (or mitigation) techniques in the model life cycle.

人工智能(AI)和机器学习算法的出现给它们的使用带来了机遇和挑战。在这篇概述文章中,我们首先讨论了使用人工智能技术时出现的偏见和公平问题,重点是监督机器学习算法。然后我们描述了数据偏差的类型和来源,并讨论了算法不公平的本质。此外,我们对文献中的公平性指标进行了回顾,讨论了它们的局限性,并描述了模型生命周期中的去偏(或缓解)技术。
{"title":"Bias, Fairness and Accountability with Artificial Intelligence and Machine Learning Algorithms","authors":"Nengfeng Zhou,&nbsp;Zach Zhang,&nbsp;Vijayan N. Nair,&nbsp;Harsh Singhal,&nbsp;Jie Chen","doi":"10.1111/insr.12492","DOIUrl":"10.1111/insr.12492","url":null,"abstract":"<div>\u0000 \u0000 <p>The advent of artificial intelligence (AI) and machine learning algorithms has led to opportunities as well as challenges in their use. In this overview paper, we begin with a discussion of bias and fairness issues that arise with the use of AI techniques, with a focus on supervised machine learning algorithms. We then describe the types and sources of data bias and discuss the nature of algorithmic unfairness. In addition, we provide a review of fairness metrics in the literature, discuss their limitations, and describe de-biasing (or mitigation) techniques in the model life cycle.</p>\u0000 </div>","PeriodicalId":14479,"journal":{"name":"International Statistical Review","volume":null,"pages":null},"PeriodicalIF":2.0,"publicationDate":"2022-04-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"45085727","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
Communicating with Data: The Art of Writing for Data Science Deborah Nolan and Sara Stoudt Oxford University Press, 2021, vii + 331 pages, $45.95, paperback ISBN: 978-0-1988-6275-8 与数据沟通:数据科学写作的艺术Deborah Nolan and Sara Stoudt牛津大学出版社,2021,vii + 331页,45.95美元,平装ISBN: 978-0-1988-6275-8
IF 2 3区 数学 Q1 Mathematics Pub Date : 2022-03-15 DOI: 10.1111/insr.12496
Kelly McConville
{"title":"Communicating with Data: The Art of Writing for Data Science Deborah Nolan and Sara Stoudt Oxford University Press, 2021, vii + 331 pages, $45.95, paperback ISBN: 978-0-1988-6275-8","authors":"Kelly McConville","doi":"10.1111/insr.12496","DOIUrl":"10.1111/insr.12496","url":null,"abstract":"","PeriodicalId":14479,"journal":{"name":"International Statistical Review","volume":null,"pages":null},"PeriodicalIF":2.0,"publicationDate":"2022-03-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"44294305","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
期刊
International Statistical Review
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1