{"title":"Adaptations on the Use of p-Values for Statistical Inference: An Interpretation of Messages from Recent Public Discussions","authors":"E. Verykouki, Chris Nakas","doi":"10.3390/stats6020035","DOIUrl":"https://doi.org/10.3390/stats6020035","url":null,"abstract":"P-values have played a central role in the advancement of research in virtually all scientific fields; however, there has been significant controversy over their use. “The ASA president’s task force statement on statistical significance and replicability” has provided a solid basis for resolving the quarrel, but although the significance part is clearly dealt with, the replicability part raises further discussions. Given the clear statement regarding significance, in this article, we consider the validity of p-value use for statistical inference as de facto. We briefly review the bibliography regarding the relevant controversy in recent years and illustrate how already proposed approaches, or slight adaptations thereof, can be readily implemented to address both significance and reproducibility, adding credibility to empirical study findings. The definitions used for the notions of replicability and reproducibility are also clearly described. We argue that any p-value must be reported along with its corresponding s-value followed by (1−α)% confidence intervals and the rejection replication index.","PeriodicalId":93142,"journal":{"name":"Stats","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-04-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"44422807","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
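The reporting recipe argued for in the abstract above — every p-value accompanied by its s-value and a (1−α)% confidence interval — is easy to operationalize. A minimal sketch, assuming a one-sample z-setting (the function names and toy numbers are illustrative, not from the paper); the s-value is the base-2 surprisal −log2(p):

```python
import math
from statistics import NormalDist

def s_value(p):
    """Shannon information (surprisal) of a p-value, in bits: -log2(p)."""
    return -math.log2(p)

def z_test_report(xbar, mu0, sigma, n, alpha=0.05):
    """One-sample z-test: two-sided p-value, s-value, and (1-alpha) CI.

    Illustrative helper, assuming a known-sigma z-setting.
    """
    nd = NormalDist()
    se = sigma / math.sqrt(n)
    z = (xbar - mu0) / se
    p = 2 * (1 - nd.cdf(abs(z)))          # two-sided p-value
    zcrit = nd.inv_cdf(1 - alpha / 2)     # normal quantile for the CI
    return p, s_value(p), (xbar - zcrit * se, xbar + zcrit * se)

# p = 0.05 carries about 4.3 bits of information against the null
# (s-values make "borderline" p-values look as weak as they are)
```

A p-value of 0.05 maps to s ≈ 4.32 bits, i.e. no more surprising than four heads in a row from a fair coin — the intuition the s-value recommendation trades on.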
{"title":"Recurring Errors in Studies of Gender Differences in Variability","authors":"T. Hill, Rosalind Arden","doi":"10.3390/stats6020033","DOIUrl":"https://doi.org/10.3390/stats6020033","url":null,"abstract":"The past quarter century has seen a resurgence of research on the controversial topic of gender differences in variability, in part because of its potential implications for the issue of under- and over-representation of various subpopulations of our society, with respect to different traits. Unfortunately, several basic statistical, inferential, and logical errors are being propagated in studies on this highly publicized topic. These errors include conflicting interpretations of the numerical significance of actual variance ratio values; a mistaken claim about variance ratios in mixtures of distributions; incorrect inferences from variance ratio values regarding the relative roles of sociocultural and biological factors; and faulty experimental designs. Most importantly, without knowledge of the underlying distributions, the standard variance ratio test statistic is shown to have no implications for tail ratios. The main aim of this note is to correct the scientific record and to illuminate several of these key errors in order to reduce their further propagation. For concreteness, the arguments will focus on one highly influential paper.","PeriodicalId":93142,"journal":{"name":"Stats","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-04-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"47022726","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
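The note's central technical claim — that a variance ratio by itself implies nothing about tail ratios without knowledge of the underlying distributions — can be illustrated with a toy counterexample (hypothetical distributions, not taken from the paper): a standard normal and a Laplace distribution with the same mean and variance, so the variance ratio is exactly 1, still differ in how much mass lies beyond two standard deviations.

```python
import math
from statistics import NormalDist

# Two symmetric distributions, both with mean 0 and variance 1:
# a standard normal, and a Laplace with scale b = 1/sqrt(2) (Var = 2*b**2 = 1).
b = 1 / math.sqrt(2)

def normal_tail(c):
    """P(X > c) for a standard normal."""
    return 1 - NormalDist().cdf(c)

def laplace_tail(c):
    """P(X > c) for a zero-mean Laplace with scale b."""
    return 0.5 * math.exp(-c / b)

variance_ratio = 1.0          # identical variances by construction
tail_ratio = laplace_tail(2) / normal_tail(2)
# variance_ratio == 1, yet tail_ratio is about 1.3:
# equal variances, unequal tails beyond two standard deviations
```

The same variance ratio is compatible with materially different tail ratios, which is exactly why inferring tail behavior from a variance ratio test statistic alone is an error.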
{"title":"Detecting Regional Differences in Italian Health Services during Five COVID-19 Waves","authors":"Lucio Palazzo, Riccardo Ievoli","doi":"10.3390/stats6020032","DOIUrl":"https://doi.org/10.3390/stats6020032","url":null,"abstract":"During the waves of the COVID-19 pandemic, both national and/or territorial healthcare systems have been severely stressed in many countries. The availability (and complexity) of data requires proper comparisons for understanding differences in the performance of health services. With this aim, we propose a methodological approach to compare the performance of the Italian healthcare system at the territorial level, i.e., considering NUTS 2 regions. Our approach consists of three steps: the choice of a distance measure between available time series, the application of weighted multidimensional scaling (wMDS) based on this distance, and, finally, a cluster analysis on the MDS coordinates. We separately consider daily time series regarding the deceased, intensive care units, and ordinary hospitalizations of patients affected by COVID-19. The proposed procedure identifies four clusters apart from two outlier regions. Changes between the waves at a regional level emerge from the main results, allowing the pressure on territorial health services to be mapped between 2020 and 2022.","PeriodicalId":93142,"journal":{"name":"Stats","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-04-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"42503535","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
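The three-step pipeline described — a distance between time series, MDS on the distances, clustering on the resulting coordinates — can be sketched as follows. This sketch substitutes classical (unweighted) MDS for the paper's weighted variant, and the tiny distance matrix is hypothetical:

```python
import numpy as np

def classical_mds(D, k=2):
    """Classical MDS: embed points in R^k given a pairwise distance matrix D."""
    n = D.shape[0]
    J = np.eye(n) - np.ones((n, n)) / n      # centering matrix
    B = -0.5 * J @ (D ** 2) @ J              # double-centered Gram matrix
    eigval, eigvec = np.linalg.eigh(B)
    idx = np.argsort(eigval)[::-1][:k]       # top-k eigenpairs
    scale = np.sqrt(np.clip(eigval[idx], 0, None))
    return eigvec[:, idx] * scale            # n x k coordinates

# toy distance matrix between 4 "regional" series (hypothetical numbers):
# series 0-1 resemble each other, series 2-3 resemble each other
D = np.array([[0, 1, 4, 4],
              [1, 0, 4, 4],
              [4, 4, 0, 1],
              [4, 4, 1, 0]], dtype=float)
X = classical_mds(D, k=2)
# a clustering step (e.g. k-means) on X would then recover the two groups
```

In the paper's setting, D would come from a distance between the regional COVID-19 time series and the MDS step would carry weights; the embedding-then-cluster logic is the same.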
{"title":"Model Selection with Missing Data Embedded in Missing-at-Random Data","authors":"Keiji Takai, Kenichi Hayashi","doi":"10.3390/stats6020031","DOIUrl":"https://doi.org/10.3390/stats6020031","url":null,"abstract":"When models are built with missing data, an information criterion is needed to select the best model among the various candidates. Using a conventional information criterion for missing data may lead to the selection of the wrong model when data are not missing at random. Conventional information criteria implicitly assume that any subset of missing-at-random data is also missing at random, and thus the maximum likelihood estimator is assumed to be consistent; that is, it is assumed that the estimator will converge to the true value. However, this assumption may not be practical. In this paper, we develop an information criterion that works even for not-missing-at-random data, so long as the largest missing data set is missing at random. Simulations are performed to show the superiority of the proposed information criterion over conventional criteria.","PeriodicalId":93142,"journal":{"name":"Stats","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-04-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"43087422","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
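To make concrete what "conventional information criterion" selection looks like as the baseline the paper improves on, here is a minimal complete-data AIC sketch (toy data; this is not the proposed criterion, which is built for data with a not-missing-at-random part):

```python
import numpy as np

def aic(y, yhat, k):
    """Gaussian AIC up to additive constants: n*log(RSS/n) + 2k."""
    n = len(y)
    rss = np.sum((y - yhat) ** 2)
    return n * np.log(rss / n) + 2 * k

rng = np.random.default_rng(0)
x = np.linspace(0, 1, 50)
y = 1.0 + 2.0 * x + rng.normal(0, 0.1, 50)    # truth: a linear trend

scores = {}
for degree in (0, 1):                          # constant vs linear candidate
    coef = np.polyfit(x, y, degree)
    scores[degree] = aic(y, np.polyval(coef, x), degree + 1)
best = min(scores, key=scores.get)             # model minimizing AIC
```

With missing data, the likelihood inside such a criterion is the observed-data likelihood, and the paper's point is that its consistency (and hence the criterion's validity) silently assumes every missing subset is missing at random.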
{"title":"The Network Bass Model with Behavioral Compartments","authors":"G. Modanese","doi":"10.3390/stats6020030","DOIUrl":"https://doi.org/10.3390/stats6020030","url":null,"abstract":"A Bass diffusion model is defined on an arbitrary network, with the additional introduction of behavioral compartments, such that nodes can have different probabilities of receiving the information/innovation from the source and transmitting it to other nodes. The dynamics are described by a large system of non-linear ordinary differential equations, whose numerical solutions can be analyzed in dependence on diffusion parameters, network parameters, and relations between the compartments. For example, in a simple case with two compartments (Enthusiasts and Sceptics about the innovation), we consider cases in which the “publicity” and imitation terms act differently on the compartments, and individuals from one compartment do not imitate those of the other, thus increasing the polarization of the system and creating sectors of the population where adoption becomes very slow. For some categories of scale-free networks, we also investigate the dependence on the features of the networks of the diffusion peak time and of the time at which adoptions reach 90% of the population.","PeriodicalId":93142,"journal":{"name":"Stats","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-03-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"43460600","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
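The compartmental dynamics can be caricatured in a mean-field (fully mixed) version with two compartments; the parameter values below are hypothetical, and the full model in the paper is a large ODE system over an arbitrary network:

```python
import numpy as np

# Mean-field Bass dynamics with two behavioral compartments
# (index 0 = Enthusiasts, index 1 = Sceptics); hypothetical parameters.
p = np.array([0.03, 0.005])   # "publicity" (innovation) coefficients
q = np.array([0.4, 0.1])      # imitation coefficients
w = np.array([0.5, 0.5])      # population share of each compartment

F = np.zeros(2)               # adopter fraction in each compartment
dt = 0.01
for _ in range(int(60 / dt)): # forward-Euler integration to t = 60
    # enthusiasts imitate everyone; sceptics imitate only other sceptics
    imitation = np.array([w @ F, w[1] * F[1]])
    F = F + dt * (1 - F) * (p + q * imitation)
# enthusiasts saturate while sceptic adoption lags far behind,
# reproducing the polarization effect described in the abstract
```

Restricting whom sceptics imitate is what slows adoption in their sector; on a network, the same asymmetry acts edge by edge rather than through the mean field.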
{"title":"Efficient Two-Stage Analysis for Complex Trait Association with Arbitrary Depth Sequencing Data","authors":"Zheng Xu, Song Yan, Shuai Yuan, Cong Wu, Sixia Chen, Zifang Guo, Yun Li","doi":"10.3390/stats6010029","DOIUrl":"https://doi.org/10.3390/stats6010029","url":null,"abstract":"Sequencing-based genetic association analysis is typically performed by first generating genotype calls from sequence data and then performing association tests on the called genotypes. Standard approaches require accurate genotype calling (GC), which can be achieved either with high sequencing depth (typically available in a small number of individuals) or via computationally intensive multi-sample linkage disequilibrium (LD)-aware methods. We propose a computationally efficient two-stage combination approach for association analysis, in which single-nucleotide polymorphisms (SNPs) are screened in the first stage via a rapid maximum likelihood (ML)-based method on sequence data directly (without first calling genotypes), and then the selected SNPs are evaluated in the second stage by performing association tests on genotypes from multi-sample LD-aware calling. Extensive simulation- and real data-based studies show that the proposed two-stage approaches can save 80% of the computational costs and still obtain more than 90% of the power of the classical method to genotype all markers at various depths d≥2.","PeriodicalId":93142,"journal":{"name":"Stats","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-03-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"42241283","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
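The two-stage idea — a cheap screen on raw dosages, followed by careful tests on the survivors — can be sketched on simulated data (all sizes, noise levels, and statistics below are illustrative placeholders, not the paper's ML screen or its LD-aware caller):

```python
import numpy as np

rng = np.random.default_rng(1)
n, m = 500, 2000                                       # individuals, SNPs
G = rng.binomial(2, 0.3, size=(n, m)).astype(float)    # true genotypes
causal = 7
y = 0.5 * G[:, causal] + rng.normal(0, 1, n)           # phenotype

# Stage 1: cheap marginal screen on noisy dosages (proxy for low-depth
# sequence data, used without calling genotypes first)
dosage = G + rng.normal(0, 0.5, size=(n, m))
score = np.abs(np.corrcoef(dosage.T, y)[:m, m])        # |corr(SNP_j, y)|
keep = np.argsort(score)[::-1][: m // 20]              # top 5% go to stage 2

# Stage 2: association test on accurately called genotypes, screened SNPs only
def t_stat(g, y):
    """Marginal t-statistic for one SNP (df = n - 2)."""
    r = np.corrcoef(g, y)[0, 1]
    return r * np.sqrt((len(y) - 2) / (1 - r ** 2))

stage2 = {j: t_stat(G[:, j], y) for j in keep}
# only 5% of markers need the expensive stage-2 treatment,
# which is where the computational savings come from
```

The savings scale with the screening fraction: stage 2, the expensive part, runs on a small subset, while stage 1 touches all markers only with a fast statistic.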
{"title":"A Phylogenetic Regression Model for Studying Trait Evolution on Network","authors":"Dwueng-Chwuan Jhwueng","doi":"10.3390/stats6010028","DOIUrl":"https://doi.org/10.3390/stats6010028","url":null,"abstract":"A phylogenetic regression model that incorporates the network structure allowing the reticulation event to study trait evolution is proposed. The parameter estimation is achieved through the maximum likelihood approach, where an algorithm is developed by taking a phylogenetic network in eNewick format as the input to build up the variance–covariance matrix. The model is applied to study the common sunflower, Helianthus annuus, by investigating its traits used to respond to drought conditions. Results show that our model provides acceptable estimates of the parameters, where most of the traits analyzed were found to have a significant correlation with drought tolerance.","PeriodicalId":93142,"journal":{"name":"Stats","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-03-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"42084304","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
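At its core, a phylogenetic regression is generalized least squares with a variance–covariance matrix derived from the tree or network. A minimal numerical sketch with a hand-made 4-taxon matrix (hypothetical values; the paper builds this matrix from a phylogenetic network in eNewick format):

```python
import numpy as np

# Phylogenetic GLS: beta_hat = (X' V^{-1} X)^{-1} X' V^{-1} y,
# where V encodes shared evolutionary history between taxa.
# Toy V for 4 taxa forming two related pairs (hypothetical branch lengths).
V = np.array([[1.0, 0.7, 0.2, 0.2],
              [0.7, 1.0, 0.2, 0.2],
              [0.2, 0.2, 1.0, 0.6],
              [0.2, 0.2, 0.6, 1.0]])
X = np.column_stack([np.ones(4), [0.1, 0.3, 0.5, 0.9]])  # intercept + predictor trait
y = np.array([1.1, 1.4, 1.8, 2.5])                       # response trait

Vinv = np.linalg.inv(V)
beta = np.linalg.solve(X.T @ Vinv @ X, X.T @ Vinv @ y)   # GLS estimate
# beta[1] is the evolutionary association between the two traits,
# corrected for the non-independence of related taxa
```

A reticulation event changes only how V is assembled (a hybrid taxon inherits covariance from both parents); the estimation step itself is unchanged.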
{"title":"Consecutive-k1 and k2-out-of-n: F Structures with a Single Change Point","authors":"I. Triantafyllou, M. Chalikias","doi":"10.3390/stats6010027","DOIUrl":"https://doi.org/10.3390/stats6010027","url":null,"abstract":"In the present paper, we establish a new consecutive-type reliability model with a single change point. The proposed structure has two common failure criteria and consists of two different types of components. The general framework for constructing the so-called consecutive-k1 and k2-out-of-n: F system with a single change point is launched. In addition, the number of path sets of the proposed structure is determined with the aid of a combinatorial approach. Moreover, two crucial performance characteristics of the proposed model are studied. The numerical investigation carried out reveals that the behavior of the new structure is outperforming against its competitors.","PeriodicalId":93142,"journal":{"name":"Stats","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-03-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"46095423","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
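For the classic single-criterion consecutive-k-out-of-n:F system (a simpler relative of the two-criteria change-point structure studied here), the number of path sets can be counted by brute force, which is useful for checking combinatorial formulas at small n:

```python
from itertools import product

def path_sets(n, k):
    """Count component-state vectors in which a consecutive-k-out-of-n:F
    system works, i.e. vectors with no run of k or more failed (0) components.
    Brute force over all 2**n states; small n only."""
    count = 0
    for state in product((0, 1), repeat=n):
        run, works = 0, True
        for s in state:
            run = run + 1 if s == 0 else 0   # current run of failures
            if run >= k:
                works = False
                break
        if works:
            count += 1
    return count

# e.g. path_sets(4, 2) counts binary strings of length 4 avoiding "00"
```

The same enumeration idea extends to two failure criteria and a change point by checking both conditions on the two component types, at the cost of the exponential state space the paper's combinatorial approach avoids.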
{"title":"Renaissance of Creative Accounting Due to the Pandemic: New Patterns Explored by Correspondence Analysis","authors":"R. Blazek, P. Durana, Jakub Michulek","doi":"10.3390/stats6010025","DOIUrl":"https://doi.org/10.3390/stats6010025","url":null,"abstract":"The COVID-19 outbreak has rapidly affected global economies and the parties involved. There was a need to ensure the sustainability of corporate finance and avoid bankruptcy. The reactions of individuals were not routine, but covered a wide range of approaches to surviving the crisis. A creative way of accounting was also adopted. This study is primarily concerned with the behavior of businesses in the Visegrad Four countries between 2019 and 2021. The pandemic era was the driving force behind the renaissance of manipulation. Thus, the purpose of the article is to explore how the behavior of enterprises changed during the ongoing pandemic. The Beneish model was applied to reveal creative manipulation in the analyzed samples. Its M-score was calculated for 6113 Slovak, 153 Czech, 585 Polish, and 155 Hungarian enterprises. Increasing numbers of handling enterprises were confirmed in the V4 region. The dependency between the size of the enterprise and the occurrence of creative accounting was also proven. However, the structure of manipulators has been changing. Correspondence analysis specifically showed behavioral changes over time. Correspondence maps demonstrate which enterprises already used creative accounting before the pandemic in 2019. Then, it was noted that enterprises were influenced to modify their patterns in 2020 and 2021. The coronavirus pandemic had a significant potency on the use of creative accounting, not only for individual units, but for businesses of all sizes. In addition, the methodology may be applied for the investigation of individual sectors post-COVID.","PeriodicalId":93142,"journal":{"name":"Stats","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-03-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"47562439","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
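The Beneish model named in the abstract is an eight-variable score. The coefficients below are the commonly published 1999 values, and −1.78 is a commonly cited manipulation cutoff; both are stated here as general background, not taken from this paper:

```python
def beneish_m_score(dsri, gmi, aqi, sgi, depi, sgai, tata, lvgi):
    """Eight-variable Beneish M-score (coefficients as commonly published).

    Inputs are the eight financial-ratio indices (e.g. DSRI = days' sales in
    receivables index, TATA = total accruals to total assets). Scores above
    roughly -1.78 are commonly read as suggesting earnings manipulation.
    """
    return (-4.84 + 0.920 * dsri + 0.528 * gmi + 0.404 * aqi
            + 0.892 * sgi + 0.115 * depi - 0.172 * sgai
            + 4.679 * tata - 0.327 * lvgi)

# a "neutral" firm: all indices equal to 1, zero accruals
neutral = beneish_m_score(1, 1, 1, 1, 1, 1, 0, 1)
# neutral is about -2.48, comfortably below the -1.78 cutoff
```

Computing this score firm by firm and tabulating who crosses the cutoff, by country, size, and year, is the kind of input the paper's correspondence analysis then maps over time.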
{"title":"Analytic Error Function and Numeric Inverse Obtained by Geometric Means","authors":"D. Martila, S. Groote","doi":"10.3390/stats6010026","DOIUrl":"https://doi.org/10.3390/stats6010026","url":null,"abstract":"Using geometric considerations, we provided a clear derivation of the integral representation for the error function, known as the Craig formula. We calculated the corresponding power series expansion and proved the convergence. The same geometric means finally assisted in systematically deriving useful formulas that approximated the inverse error function. Our approach could be used for applications in high-speed Monte Carlo simulations, where this function is used extensively.","PeriodicalId":93142,"journal":{"name":"Stats","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-02-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"45796659","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
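The Craig-type integral representation discussed can be checked numerically against the library erfc. Writing Craig's formula for the Gaussian Q-function in erfc form gives erfc(x) = (2/π)·∫₀^{π/2} exp(−x²/sin²t) dt for x ≥ 0, which a simple midpoint rule reproduces to high accuracy:

```python
import math

def erfc_craig(x, steps=2000):
    """Complementary error function via the Craig-type integral (x >= 0):
    erfc(x) = (2/pi) * integral_0^{pi/2} exp(-x**2 / sin(t)**2) dt.
    Midpoint rule; the integrand vanishes smoothly as t -> 0, so there is
    no singularity to handle."""
    h = (math.pi / 2) / steps
    total = 0.0
    for i in range(steps):
        t = (i + 0.5) * h                  # midpoint of each subinterval
        total += math.exp(-x * x / math.sin(t) ** 2)
    return (2 / math.pi) * total * h

# agrees with math.erfc to roughly single-float precision for moderate x
```

The finite integration range and bounded integrand are what make this representation attractive in Monte Carlo settings, as the abstract notes.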