{"title":"关于连胜长度为 1 的热门统计偏差的其他证明","authors":"Maximilian Janisch","doi":"arxiv-2407.10577","DOIUrl":null,"url":null,"abstract":"For a sequence of $n$ random variables taking values $0$ or $1$, the hot hand\nstatistic of streak length $k$ counts what fraction of the streaks of length\n$k$, that is, $k$ consecutive variables taking the value $1$, among the $n$\nvariables are followed by another $1$. Since this statistic does not use the\nexpected value of how many streaks of length $k$ are observed, but instead uses\nthe realization of the number of streaks present in the data, it may be a\nbiased estimator of the conditional probability of a fixed random variable\ntaking value $1$ if it is preceded by a streak of length $k$, as was first\nstudied and observed explicitly in [Miller and Sanjurjo, 2018]. In this short\nnote, we suggest an alternative proof for an explicit formula of the\nexpectation of the hot hand statistic for the case of streak length one. This\nformula was obtained through a different argument in [Miller and Sanjurjo,\n2018] and [Rinott and Bar-Hillel, 2015].","PeriodicalId":501323,"journal":{"name":"arXiv - STAT - Other Statistics","volume":"32 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-07-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Alternative proof for the bias of the hot hand statistic of streak length one\",\"authors\":\"Maximilian Janisch\",\"doi\":\"arxiv-2407.10577\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"For a sequence of $n$ random variables taking values $0$ or $1$, the hot hand\\nstatistic of streak length $k$ counts what fraction of the streaks of length\\n$k$, that is, $k$ consecutive variables taking the value $1$, among the $n$\\nvariables are followed by another $1$. Since this statistic does not use the\\nexpected value of how many streaks of length $k$ are observed, but instead uses\\nthe realization of the number of streaks present in the data, it may be a\\nbiased estimator of the conditional probability of a fixed random variable\\ntaking value $1$ if it is preceded by a streak of length $k$, as was first\\nstudied and observed explicitly in [Miller and Sanjurjo, 2018]. In this short\\nnote, we suggest an alternative proof for an explicit formula of the\\nexpectation of the hot hand statistic for the case of streak length one. This\\nformula was obtained through a different argument in [Miller and Sanjurjo,\\n2018] and [Rinott and Bar-Hillel, 2015].\",\"PeriodicalId\":501323,\"journal\":{\"name\":\"arXiv - STAT - Other Statistics\",\"volume\":\"32 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-07-15\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"arXiv - STAT - Other Statistics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/arxiv-2407.10577\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - STAT - Other Statistics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2407.10577","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
摘要
对于取值为 $0$ 或 $1$的 $n$ 随机变量序列,长度为 $k$ 的条纹长度热手统计量(hot handstatistic of streak length $k$)计算的是在 $n$ 变量中,长度为 $k$ 的条纹(即取值为 $1$的 $k$ 连续变量)中,有多少个是在另一个 $1$ 变量之后出现的。由于该统计量并不使用观察到的长度为$k$的条纹数量的预期值,而是使用数据中存在的条纹数量的实现值,因此它可能是固定随机变量取值$1$的条件概率的无偏估计值,如果它前面有长度为$k$的条纹,这在[Miller and Sanjurjo, 2018]中得到了首次研究和明确观察。在本短文中,我们提出了另一种证明方法,即在条纹长度为 1 的情况下,热手统计量期望值的明确公式。这个公式在 [Miller and Sanjurjo, 2018] 和 [Rinott and Bar-Hillel, 2015] 中通过不同的论证得到。
Alternative proof for the bias of the hot hand statistic of streak length one
For a sequence of $n$ random variables taking values $0$ or $1$, the hot hand
statistic of streak length $k$ counts what fraction of the streaks of length
$k$, that is, $k$ consecutive variables taking the value $1$, among the $n$
variables are followed by another $1$. Since this statistic does not use the
expected value of how many streaks of length $k$ are observed, but instead uses
the realization of the number of streaks present in the data, it may be a
biased estimator of the conditional probability of a fixed random variable
taking value $1$ if it is preceded by a streak of length $k$, as was first
studied and observed explicitly in [Miller and Sanjurjo, 2018]. In this short
note, we suggest an alternative proof for an explicit formula of the
expectation of the hot hand statistic for the case of streak length one. This
formula was obtained through a different argument in [Miller and Sanjurjo,
2018] and [Rinott and Bar-Hillel, 2015].