Preparing a rejoinder is a typically rewarding, sometimes depressing, and occasionally frustrating experience. The rewarding part is self-evident, and the depression sets in when a discussant has much deeper and crisper insights about the authors’ thesis than authors themselves. Frustrations arise when the authors thought they made some points crystal clear, but the reflections from the discussants show a very different picture. We are deeply grateful to the editors of Statistical Science and the discussants for providing us an opportunity to maximize the first, sample the second, and minimize the third.
{"title":"Rejoinder: Let’s Be Imprecise in Order to Be Precise (About What We Don’t Know)","authors":"Ruobin Gong, X. Meng","doi":"10.1214/21-STS765REJ","DOIUrl":"https://doi.org/10.1214/21-STS765REJ","url":null,"abstract":"Preparing a rejoinder is a typically rewarding, sometimes depressing, and occasionally frustrating experience. The rewarding part is self-evident, and the depression sets in when a discussant has much deeper and crisper insights about the authors’ thesis than authors themselves. Frustrations arise when the authors thought they made some points crystal clear, but the reflections from the discussants show a very different picture. We are deeply grateful to the editors of Statistical Science and the discussants for providing us an opportunity to maximize the first, sample the second, and minimize the third.","PeriodicalId":51172,"journal":{"name":"Statistical Science","volume":" ","pages":""},"PeriodicalIF":5.7,"publicationDate":"2021-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49551625","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
E. Bura, Bing Li, Lexin Li, C. Nachtsheim, D. Peña, C. Setodji, R. Weiss
Dennis Cook is a Full Professor, School of Statistics, at the University of Minnesota. He received his BS degree in Mathematics from Northern Montana College, and MS and PhD degrees in Statistics from Kansas State University. He has served as Chair of the Department of Applied Statistics, Director of the Statistical Center and Director of the School of Statistics, all at the University of Minnesota. His research areas include dimension reduction, linear and nonlinear regression, experimental design, statistical diagnostics, statistical graphics and population genetics. He has authored over 200 research articles and is author or co-author of two textbooks— An Introduction to Regression Graphics and Applied Regression Including Computing and Graphics—and three research monographs, Influence and Residuals in Regression, Regression Graphics: Ideas for Studying Regressions through Graphics and An Introduction to Envelopes: Dimension Reduction for Efficient Estimation in Multivariate Statistics. He has served as Associate Editor of the Journal of the American Statistical Association, The Journal of Quality Technology, Biometrika, Journal of the Royal Statistical Society and Statistica Sinica. He is a four-time recipient of the Jack Youden Prize for Best Expository Paper in Technometrics as well as the Frank Wilcoxon Award for Best Technical Paper. He received the 2005 COPSS Fisher Lecture and Award, and he is a Fellow of the American Statistical Association and the Institute of Mathematical Statistics. The following conversation took place on March 22, 2019, following the banquet at the conference, “Cook’s Distance and Beyond: A Conference Celebrating the Contributions of R. Dennis Cook.” The interviewers were, Efstathia Bura (Effie), Bing Li, Lexin Li, Christopher Nachtsheim (Chris), Daniel Pena, Claude Messan Setodji and Robert Weiss (Rob).
{"title":"A Conversation with Dennis Cook","authors":"E. Bura, Bing Li, Lexin Li, C. Nachtsheim, D. Peña, C. Setodji, R. Weiss","doi":"10.1214/20-STS801","DOIUrl":"https://doi.org/10.1214/20-STS801","url":null,"abstract":"Dennis Cook is a Full Professor, School of Statistics, at the University of Minnesota. He received his BS degree in Mathematics from Northern Montana College, and MS and PhD degrees in Statistics from Kansas State University. He has served as Chair of the Department of Applied Statistics, Director of the Statistical Center and Director of the School of Statistics, all at the University of Minnesota.\u0000His research areas include dimension reduction, linear and nonlinear regression, experimental design, statistical diagnostics, statistical graphics and population genetics. He has authored over 200 research articles and is author or co-author of two textbooks— An Introduction to Regression Graphics and Applied Regression Including Computing and Graphics—and three research monographs, Influence and Residuals in Regression, Regression Graphics: Ideas for Studying Regressions through Graphics and An Introduction to Envelopes: Dimension Reduction for Efficient Estimation in Multivariate Statistics.\u0000He has served as Associate Editor of the Journal of the American Statistical Association, The Journal of Quality Technology, Biometrika, Journal of the Royal Statistical Society and Statistica Sinica. He is a four-time recipient of the Jack Youden Prize for Best Expository Paper in Technometrics as well as the Frank Wilcoxon Award for Best Technical Paper. He received the 2005 COPSS Fisher Lecture and Award, and he is a Fellow of the American Statistical Association and the Institute of Mathematical Statistics. The following conversation took place on March 22, 2019, following the banquet at the conference, “Cook’s Distance and Beyond: A Conference Celebrating the Contributions of R. Dennis Cook.” The interviewers were, Efstathia Bura (Effie), Bing Li, Lexin Li, Christopher Nachtsheim (Chris), Daniel Pena, Claude Messan Setodji and Robert Weiss (Rob).","PeriodicalId":51172,"journal":{"name":"Statistical Science","volume":"36 1","pages":"328-337"},"PeriodicalIF":5.7,"publicationDate":"2021-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"46070211","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Y X Rachel Wang, Lexin Li, Jingyi Jessica Li, Haiyan Huang
The rise of network data in many different domains has offered researchers new insight into the problem of modeling complex systems and propelled the development of numerous innovative statistical methodologies and computational tools. In this paper, we primarily focus on two types of biological networks, gene networks and brain networks, where statistical network modeling has found both fruitful and challenging applications. Unlike other network examples such as social networks where network edges can be directly observed, both gene and brain networks require careful estimation of edges using covariates as a first step. We provide a discussion on existing statistical and computational methods for edge esitimation and subsequent statistical inference problems in these two types of biological networks.
{"title":"Network Modeling in Biology: Statistical Methods for Gene and Brain Networks.","authors":"Y X Rachel Wang, Lexin Li, Jingyi Jessica Li, Haiyan Huang","doi":"10.1214/20-sts792","DOIUrl":"10.1214/20-sts792","url":null,"abstract":"<p><p>The rise of network data in many different domains has offered researchers new insight into the problem of modeling complex systems and propelled the development of numerous innovative statistical methodologies and computational tools. In this paper, we primarily focus on two types of biological networks, gene networks and brain networks, where statistical network modeling has found both fruitful and challenging applications. Unlike other network examples such as social networks where network edges can be directly observed, both gene and brain networks require careful estimation of edges using covariates as a first step. We provide a discussion on existing statistical and computational methods for edge esitimation and subsequent statistical inference problems in these two types of biological networks.</p>","PeriodicalId":51172,"journal":{"name":"Statistical Science","volume":"36 1","pages":"89-108"},"PeriodicalIF":3.9,"publicationDate":"2021-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8296984/pdf/nihms-1636819.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"39219268","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Introduction to the Special Section","authors":"Yihong Wu, Harrison H. Zhou","doi":"10.1214/20-sts361ed","DOIUrl":"https://doi.org/10.1214/20-sts361ed","url":null,"abstract":"","PeriodicalId":51172,"journal":{"name":"Statistical Science","volume":" ","pages":""},"PeriodicalIF":5.7,"publicationDate":"2021-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"42468701","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2021-02-01Epub Date: 2020-12-21DOI: 10.1214/19-sts749
Corwin M Zigler, Georgia Papadogeorgou
Statistical methods to evaluate the effectiveness of interventions are increasingly challenged by the inherent interconnectedness of units. Specifically, a recent flurry of methods research has addressed the problem of interference between observations, which arises when one observational unit's outcome depends not only on its treatment but also the treatment assigned to other units. We introduce the setting of bipartite causal inference with interference, which arises when 1) treatments are defined on observational units that are distinct from those at which outcomes are measured and 2) there is interference between units in the sense that outcomes for some units depend on the treatments assigned to many other units. The focus of this work is to formulate definitions and several possible causal estimands for this setting, highlighting similarities and differences with more commonly considered settings of causal inference with interference. Towards an empirical illustration, an inverse probability of treatment weighted estimator is adapted from existing literature to estimate a subset of simplified, but interesting, estimands. The estimators are deployed to evaluate how interventions to reduce air pollution from 473 power plants in the U.S. causally affect cardiovascular hospitalization among Medicare beneficiaries residing at 18,807 zip code locations.
{"title":"Bipartite Causal Inference with Interference.","authors":"Corwin M Zigler, Georgia Papadogeorgou","doi":"10.1214/19-sts749","DOIUrl":"https://doi.org/10.1214/19-sts749","url":null,"abstract":"<p><p>Statistical methods to evaluate the effectiveness of interventions are increasingly challenged by the inherent interconnectedness of units. Specifically, a recent flurry of methods research has addressed the problem of <i>interference</i> between observations, which arises when one observational unit's outcome depends not only on its treatment but also the treatment assigned to other units. We introduce the setting of <i>bipartite causal inference with interference,</i> which arises when 1) treatments are defined on observational units that are distinct from those at which outcomes are measured and 2) there is <i>interference</i> between units in the sense that outcomes for some units depend on the treatments assigned to many other units. The focus of this work is to formulate definitions and several possible causal estimands for this setting, highlighting similarities and differences with more commonly considered settings of causal inference with interference. Towards an empirical illustration, an inverse probability of treatment weighted estimator is adapted from existing literature to estimate a subset of simplified, but interesting, estimands. The estimators are deployed to evaluate how interventions to reduce air pollution from 473 power plants in the U.S. causally affect cardiovascular hospitalization among Medicare beneficiaries residing at 18,807 zip code locations.</p>","PeriodicalId":51172,"journal":{"name":"Statistical Science","volume":"36 1","pages":"109-123"},"PeriodicalIF":5.7,"publicationDate":"2021-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8048152/pdf/nihms-1056137.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"38804958","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
This conversation began in June 2015 in the Department of Statistics at Columbia University during Lai’s visit to his alma mater where he celebrated his seventieth birthday. It continued in the subsequent years at Columbia and Stanford. Lai was born on June 28, 1945, in Hong Kong, where he grew up and attended The University of Hong Kong, receiving his B.A. degree (First Class Honors) in Mathematics in 1967. He went to Columbia University in 1968 for graduate study in statistics and received his Ph.D. degree in 1971. He stayed on the faculty at Columbia and was appointed Higgins Professor of Mathematical Statistics in 1986. A year later he moved to Stanford, where he is currently Ray Lyman Wilbur Professor of Statistics, and by courtesy, also of Biomedical Data Science and Computational and Mathematical Engineering. He is a fellow of the Institute of Mathematical Statistics, the American Statistical Association and an elected member of Academia Sinica in Taiwan. He was the third recipient of the COPSS Award which he won in 1983. He has been married to Letitia Chow since 1975, and they have two sons and two grandchildren.
{"title":"A Conversation with Tze Leung Lai","authors":"Ying Lu, Dylan S. Small, Z. Ying","doi":"10.1214/20-sts775","DOIUrl":"https://doi.org/10.1214/20-sts775","url":null,"abstract":"This conversation began in June 2015 in the Department of Statistics at Columbia University during Lai’s visit to his alma mater where he celebrated his seventieth birthday. It continued in the subsequent years at Columbia and Stanford. Lai was born on June 28, 1945, in Hong Kong, where he grew up and attended The University of Hong Kong, receiving his B.A. degree (First Class Honors) in Mathematics in 1967. He went to Columbia University in 1968 for graduate study in statistics and received his Ph.D. degree in 1971. He stayed on the faculty at Columbia and was appointed Higgins Professor of Mathematical Statistics in 1986. A year later he moved to Stanford, where he is currently Ray Lyman Wilbur Professor of Statistics, and by courtesy, also of Biomedical Data Science and Computational and Mathematical Engineering. He is a fellow of the Institute of Mathematical Statistics, the American Statistical Association and an elected member of Academia Sinica in Taiwan. He was the third recipient of the COPSS Award which he won in 1983. He has been married to Letitia Chow since 1975, and they have two sons and two grandchildren.","PeriodicalId":51172,"journal":{"name":"Statistical Science","volume":" ","pages":""},"PeriodicalIF":5.7,"publicationDate":"2021-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"42132267","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Under the random design of longitudinal data, observation times are irregular, and there are mainly two frameworks for analyzing such kind of longitudinal data. One is the clustered data framework and the other is the counting process framework. In this paper, we give a thorough comparison of these two frameworks in terms of data structure, model assumptions and estimation procedures. We find that modeling the observation times in the counting process framework will not gain any efficiency when the observation times are correlated with covariates but independent of the longitudinal response given covariates. Some simulation studies are conducted to compare the finite sample behaviors of the related estimators, and a real data analysis of the Alzheimer’s disease study is implemented for further comparison.
{"title":"Comparison of Two Frameworks for Analyzing Longitudinal Data","authors":"Jie Zhou, Xiao Zhou, Liuquan Sun","doi":"10.1214/20-sts813","DOIUrl":"https://doi.org/10.1214/20-sts813","url":null,"abstract":"Under the random design of longitudinal data, observation times are irregular, and there are mainly two frameworks for analyzing such kind of longitudinal data. One is the clustered data framework and the other is the counting process framework. In this paper, we give a thorough comparison of these two frameworks in terms of data structure, model assumptions and estimation procedures. We find that modeling the observation times in the counting process framework will not gain any efficiency when the observation times are correlated with covariates but independent of the longitudinal response given covariates. Some simulation studies are conducted to compare the finite sample behaviors of the related estimators, and a real data analysis of the Alzheimer’s disease study is implemented for further comparison.","PeriodicalId":51172,"journal":{"name":"Statistical Science","volume":"1 1","pages":""},"PeriodicalIF":5.7,"publicationDate":"2021-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"66085640","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Confidence and likelihood are fundamental statistical concepts with distinct technical interpretation and usage. Confidence is a meaningful concept of uncertainty within the context of confidence-interval procedure, while likelihood has been used predominantly as a tool for statistical modelling and inference given observed data. Here we show that confidence is in fact an extended likelihood, thus giving a much closer correspondence between the two concepts. This result gives the confidence concept an external meaning outside the confidence-interval context, and vice versa, it gives the confidence interpretation to the likelihood. In addition to the obvious interpretation purposes, this connection suggests two-way transfers of technical information. For example, the extended likelihood theory gives a clear way to update or combine confidence information. On the other hand, the confidence connection gives the extended likelihood direct access to the frequentist probability, an objective certification not directly available to the classical likelihood. This implies that intervals derived from the extended likelihood have the same logical status as confidence intervals, thus simplifying the terminology in the inference of random parameters.
{"title":"Confidence as Likelihood","authors":"Y. Pawitan, Youngjo Lee","doi":"10.1214/20-sts811","DOIUrl":"https://doi.org/10.1214/20-sts811","url":null,"abstract":"Confidence and likelihood are fundamental statistical concepts with distinct technical interpretation and usage. Confidence is a meaningful concept of uncertainty within the context of confidence-interval procedure, while likelihood has been used predominantly as a tool for statistical modelling and inference given observed data. Here we show that confidence is in fact an extended likelihood, thus giving a much closer correspondence between the two concepts. This result gives the confidence concept an external meaning outside the confidence-interval context, and vice versa, it gives the confidence interpretation to the likelihood. In addition to the obvious interpretation purposes, this connection suggests two-way transfers of technical information. For example, the extended likelihood theory gives a clear way to update or combine confidence information. On the other hand, the confidence connection gives the extended likelihood direct access to the frequentist probability, an objective certification not directly available to the classical likelihood. This implies that intervals derived from the extended likelihood have the same logical status as confidence intervals, thus simplifying the terminology in the inference of random parameters.","PeriodicalId":51172,"journal":{"name":"Statistical Science","volume":"1 1","pages":""},"PeriodicalIF":5.7,"publicationDate":"2021-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"66085566","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Consider gambler's ruin with three players, 1, 2, and 3, having initial capitals $A$, $B$, and $C$. At each round a pair of players is chosen (uniformly at random) and a fair coin flip is made resulting in the transfer of one unit between these two players. Eventually, one of the players is eliminated and the game continues with the remaining two. Let $sigmain S_3$ be the elimination order (e.g., $sigma=132$ means player 1 is eliminated first, player 3 is eliminated second, and player 2 is left with $A+B+C$). We seek approximations (and exact formulas) for the probabilities $P_{A,B,C}(sigma)$. One frequently used approximation, the independent chip model (ICM), is shown to be inadequate. A regression adjustment is proposed, which seems to give good approximations to the players' elimination order probabilities.
{"title":"Gambler’s Ruin and the ICM","authors":"P. Diaconis, S. Ethier","doi":"10.1214/21-sts826","DOIUrl":"https://doi.org/10.1214/21-sts826","url":null,"abstract":"Consider gambler's ruin with three players, 1, 2, and 3, having initial capitals $A$, $B$, and $C$. At each round a pair of players is chosen (uniformly at random) and a fair coin flip is made resulting in the transfer of one unit between these two players. Eventually, one of the players is eliminated and the game continues with the remaining two. Let $sigmain S_3$ be the elimination order (e.g., $sigma=132$ means player 1 is eliminated first, player 3 is eliminated second, and player 2 is left with $A+B+C$). \u0000We seek approximations (and exact formulas) for the probabilities $P_{A,B,C}(sigma)$. One frequently used approximation, the independent chip model (ICM), is shown to be inadequate. A regression adjustment is proposed, which seems to give good approximations to the players' elimination order probabilities.","PeriodicalId":51172,"journal":{"name":"Statistical Science","volume":" ","pages":""},"PeriodicalIF":5.7,"publicationDate":"2020-11-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"48579330","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}