Fisher’s measure of variability in repeated samples

IF 1.7 2区 数学 Q2 STATISTICS & PROBABILITY Bernoulli Pub Date : 2023-05-01 DOI:10.3150/22-bej1494
Poly H. da Silva, Arash Jamshidpey, P. McCullagh, S. Tavaré
{"title":"Fisher’s measure of variability in repeated samples","authors":"Poly H. da Silva, Arash Jamshidpey, P. McCullagh, S. Tavaré","doi":"10.3150/22-bej1494","DOIUrl":null,"url":null,"abstract":"Fisher (1943) claimed that the expected value of the sample variance of the number of species found in large samples, each of n specimens taken from the same population, is asymptotically θ log2. This is at odds with the value θ log n obtained directly from the Ewens Sampling Formula (ESF), where θ specifies the rate at which new species are found. To resolve this apparent contradiction, we assume the species frequency spectrum in the population is determined by the ESF and that the samples are disjoint subsets drawn sequentially from this single population. We find an explicit formula for the required expected value for p samples of arbitrary size; in the limit of large equally-sized samples, it indeed has the value θ log2. We obtain limit theorems for the sample variance of p samples of size n under various limiting regimes as p , n or both tend to ∞ . We discuss further the behavior of the number of species present in all samples, and revisit Fisher’s log-series distribution as the limiting distribution of the number of specimens observed in typical species in a future, large sample.","PeriodicalId":55387,"journal":{"name":"Bernoulli","volume":" ","pages":""},"PeriodicalIF":1.7000,"publicationDate":"2023-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Bernoulli","FirstCategoryId":"100","ListUrlMain":"https://doi.org/10.3150/22-bej1494","RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"STATISTICS & PROBABILITY","Score":null,"Total":0}
引用次数: 3

Abstract

Fisher (1943) claimed that the expected value of the sample variance of the number of species found in large samples, each of n specimens taken from the same population, is asymptotically θ log2. This is at odds with the value θ log n obtained directly from the Ewens Sampling Formula (ESF), where θ specifies the rate at which new species are found. To resolve this apparent contradiction, we assume the species frequency spectrum in the population is determined by the ESF and that the samples are disjoint subsets drawn sequentially from this single population. We find an explicit formula for the required expected value for p samples of arbitrary size; in the limit of large equally-sized samples, it indeed has the value θ log2. We obtain limit theorems for the sample variance of p samples of size n under various limiting regimes as p , n or both tend to ∞ . We discuss further the behavior of the number of species present in all samples, and revisit Fisher’s log-series distribution as the limiting distribution of the number of specimens observed in typical species in a future, large sample.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
重复样本变异性的Fisher测度
Fisher(1943)声称,在大样本中发现的物种数量的样本方差的期望值是渐近的θ log2,每n个样本取自同一种群。这与直接从埃文斯抽样公式(ESF)得到的θ log n值不一致,在ESF中,θ表示发现新物种的速率。为了解决这个明显的矛盾,我们假设种群中的物种频谱是由ESF决定的,并且样本是从这个单一种群中顺序抽取的不相交的子集。我们找到了任意大小的p个样本所需期望值的显式公式;在大小相等的大样本的极限下,它的值确实是θ log2。当p、n或两者都趋于∞时,我们得到了大小为n的p个样本在各种极限情况下的样本方差的极限定理。我们进一步讨论了所有样本中存在的物种数量的行为,并重新审视Fisher的对数序列分布,作为未来大样本中典型物种中观察到的标本数量的极限分布。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Bernoulli
Bernoulli 数学-统计学与概率论
CiteScore
3.40
自引率
0.00%
发文量
116
审稿时长
6-12 weeks
期刊介绍: BERNOULLI is the journal of the Bernoulli Society for Mathematical Statistics and Probability, issued four times per year. The journal provides a comprehensive account of important developments in the fields of statistics and probability, offering an international forum for both theoretical and applied work. BERNOULLI will publish: Papers containing original and significant research contributions: with background, mathematical derivation and discussion of the results in suitable detail and, where appropriate, with discussion of interesting applications in relation to the methodology proposed. Papers of the following two types will also be considered for publication, provided they are judged to enhance the dissemination of research: Review papers which provide an integrated critical survey of some area of probability and statistics and discuss important recent developments. Scholarly written papers on some historical significant aspect of statistics and probability.
期刊最新文献
Semiparametric regression of panel count data with informative terminal event. Bootstrap inference in functional linear regression models with scalar response Cramér type moderate deviations for the Grenander estimator near the boundaries of the support Joint density of the stable process and its supremum: Regularity and upper bounds On the mean perimeter density of inhomogeneous random closed sets
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1