首页 > 最新文献

Observational studies最新文献

英文 中文
Application of Propensity Scores to a Continuous Exposure: Effect of Lead Exposure in Early Childhood on Reading and Mathematics Scores 倾向性得分在持续暴露中的应用:儿童早期铅暴露对阅读和数学成绩的影响
Pub Date : 2021-06-04 DOI: 10.1353/obs.2015.0002
M. Elliott, Nanhua Zhang, Dylan S. Small
Abstract:The estimation of causal effects in observational studies is usually limited by the lack of randomization, which can result in different treatment or exposure groups differing systematically with respect to characteristics that influence outcomes. To remove such systematic differences, methods to ’’balance” subjects on observed covariates across treatment or exposure levels have been developed over the past three decades. These methods have been primarily developed in settings with binary treatment or exposures. However, in many observational studies, the exposures are continuous instead of being binary or discrete, and are usually considered as doses of treatment. In this manuscript we consider estimating the causal effect of early childhood lead exposure on youth academic achievement, where the exposure variable blood lead concentration can take any values that are greater than or equal to 0, using three balancing methods: propensity score analysis, non-bipartite matching, and Bayesian regression trees. We find some evidence that the standard logistic regression analysis controlling for age and socioeconomic confounders used in previous analyses (Zhang et al. (2013)) overstates the effect of lead exposure on performance on standardized mathematics and reading examinations; however, significant declines remain, including at doses currently below the recommended exposure levels.
摘要:观察性研究中因果效应的估计通常受到缺乏随机化的限制,这可能导致不同的治疗或暴露组在影响结果的特征方面存在系统性差异。为了消除这种系统性差异,在过去三十年中,已经开发出了在不同治疗或暴露水平的观察到的协变量上“平衡”受试者的方法。这些方法主要是在二元治疗或暴露的环境中开发的。然而,在许多观察性研究中,暴露是连续的,而不是二元或离散的,通常被视为治疗剂量。在这篇手稿中,我们考虑使用三种平衡方法来估计儿童早期铅暴露对青少年学业成绩的因果影响,其中暴露变量血铅浓度可以取大于或等于0的任何值:倾向得分分析、非二分匹配和贝叶斯回归树。我们发现一些证据表明,先前分析中使用的控制年龄和社会经济混杂因素的标准逻辑回归分析(Zhang et al.(2013))夸大了铅暴露对标准化数学和阅读考试成绩的影响;然而,仍有显著下降,包括目前低于建议暴露水平的剂量。
{"title":"Application of Propensity Scores to a Continuous Exposure: Effect of Lead Exposure in Early Childhood on Reading and Mathematics Scores","authors":"M. Elliott, Nanhua Zhang, Dylan S. Small","doi":"10.1353/obs.2015.0002","DOIUrl":"https://doi.org/10.1353/obs.2015.0002","url":null,"abstract":"Abstract:The estimation of causal effects in observational studies is usually limited by the lack of randomization, which can result in different treatment or exposure groups differing systematically with respect to characteristics that influence outcomes. To remove such systematic differences, methods to ’’balance” subjects on observed covariates across treatment or exposure levels have been developed over the past three decades. These methods have been primarily developed in settings with binary treatment or exposures. However, in many observational studies, the exposures are continuous instead of being binary or discrete, and are usually considered as doses of treatment. In this manuscript we consider estimating the causal effect of early childhood lead exposure on youth academic achievement, where the exposure variable blood lead concentration can take any values that are greater than or equal to 0, using three balancing methods: propensity score analysis, non-bipartite matching, and Bayesian regression trees. We find some evidence that the standard logistic regression analysis controlling for age and socioeconomic confounders used in previous analyses (Zhang et al. (2013)) overstates the effect of lead exposure on performance on standardized mathematics and reading examinations; however, significant declines remain, including at doses currently below the recommended exposure levels.","PeriodicalId":74335,"journal":{"name":"Observational studies","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2021-06-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1353/obs.2015.0002","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"43901491","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Book review of “Observation and Experiment: An Introduction to Causal Inference” by Paul R. Rosenbaum 保罗·罗森鲍姆《观察与实验:因果推理导论》书评
Pub Date : 2021-06-04 DOI: 10.1353/obs.2017.0008
Dylan S. Small
The economist Paul Samuelson said, “My belief is that nothing that can be expressed by mathematics cannot be expressed by careful use of literary words.” Paul Rosenbaum brings this perspective to causal inference in his new book Observation and Experiment: An Introduction to Causal Inference (Harvard University Press, 2017). The book is a luminous presentation of concepts and strategies for causal inference with a minimum of technical material. An example of how Rosenbaum explains causal inference in a literary way is his use of a passage from Robert Frost’s poem “The Road Not Taken” to illuminate how causal questions involve comparing potential outcomes under two or more treatments where we can only see one potential outcome:
经济学家保罗·萨缪尔森(Paul Samuelson)说:“我的信念是,任何能用数学表达的东西,都不能通过仔细使用文学词汇来表达。”保罗·罗森鲍姆(Paul Rosenbaum)在他的新书《观察与实验:因果推理导论》(哈佛大学出版社,2017年)中将这一观点带入因果推理。这本书是一个发光的概念和策略的因果推理与最低限度的技术材料。罗森鲍姆如何用文学的方式解释因果推理的一个例子是,他使用了罗伯特·弗罗斯特的诗《未选择的路》中的一段来说明因果问题是如何涉及在两种或多种处理下比较潜在结果的,而我们只能看到一种潜在结果:
{"title":"Book review of “Observation and Experiment: An Introduction to Causal Inference” by Paul R. Rosenbaum","authors":"Dylan S. Small","doi":"10.1353/obs.2017.0008","DOIUrl":"https://doi.org/10.1353/obs.2017.0008","url":null,"abstract":"The economist Paul Samuelson said, “My belief is that nothing that can be expressed by mathematics cannot be expressed by careful use of literary words.” Paul Rosenbaum brings this perspective to causal inference in his new book Observation and Experiment: An Introduction to Causal Inference (Harvard University Press, 2017). The book is a luminous presentation of concepts and strategies for causal inference with a minimum of technical material. An example of how Rosenbaum explains causal inference in a literary way is his use of a passage from Robert Frost’s poem “The Road Not Taken” to illuminate how causal questions involve comparing potential outcomes under two or more treatments where we can only see one potential outcome:","PeriodicalId":74335,"journal":{"name":"Observational studies","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2021-06-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1353/obs.2017.0008","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"41412544","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Larry Brown: Remembrance and Connections of His Work to Observational Studies 拉里·布朗:纪念及其与观察性研究的联系
Pub Date : 2021-06-04 DOI: 10.1353/obs.2018.0011
Dylan S. Small
Statistics lost one of our giants when Larry Brown passed away in February, 2018 at the age of 77. Many lost a friend, a mentor and a teacher. I had the good fortune to be Larry’s colleague for the past 16 years. While I wasn’t as close to him as many others, Larry was my friend and I looked up to him. Larry loved thinking. When Larry got interested in something and scratched his head, I could see his enjoyment. Steam seemed to come out as Larry was thinking and then he happily shared his thoughts. Larry’s classes and talks were full of insights. After joining the faculty at Wharton, I attended Larry’s linear models first year PhD course even though I had seen the material before. I was glad I took the time to do so as I learned a lot, and seeing how Larry thought deeply through things from different perspectives (e.g., he often presented both a geometric and a statistical perspective) was memorable and inspiring. Larry often mentioned questions he had about methods or results, and directions of research he thought could be expanded upon, which I think motivated students to see statistics as a field full of open questions and research opportunities rather than a dead field. Larry was generous with his time. Whenever I had a student for whom it was unclear which other faculty members had the expertise to serve on their dissertation committee, I suggested asking Larry because I knew he would be willing to spend time talking with the student, read the dissertation seriously and have something thoughtful to say. A few days before Larry’s passing, when he knew his time was short, he was writing recommendation letters for students. Larry spent much time on public service, and he encouraged me about its value to society even though one may not get recognition for it. Larry worked hard. He was active in research, teaching and mentoring students and public service until his passing. The large number of Larry’s former students who traveled to his funeral from places far away at short notice (Hong Kong even!) was a testament to Larry’s impact on their lives. Larry also made good time for family and friends. Besides the much time spent together with his wife Linda and their family, Larry made the time for trips over the summer alone with his sons. Larry treated people with respect and decency regardless of their status. At a time when I was the postdoctoral coordinator for our department, a PhD student from a little known university in India contacted Larry about a post doc opening in our department and
2018年2月,77岁的拉里·布朗去世了,统计学失去了一位巨人。许多人失去了朋友、导师和老师。在过去的16年里,我有幸成为拉里的同事。虽然我和他不像其他人那么亲近,但拉里是我的朋友,我很尊敬他。拉里喜欢思考。当拉里对某件事感兴趣并挠头时,我能看出他很享受。当拉里思考时,蒸汽似乎冒了出来,然后他高兴地分享了他的想法。拉里的课堂和演讲充满了真知灼见。在加入沃顿商学院后,我参加了拉里的线性模型第一年的博士课程,尽管我之前看过这些材料。我很高兴我花了时间这样做,因为我学到了很多东西,看到拉里如何从不同的角度深入思考问题(例如,他经常从几何和统计的角度来展示)是令人难忘和鼓舞人心的。拉里经常提到他对方法或结果的疑问,以及他认为可以扩展的研究方向,我认为这促使学生们将统计学视为一个充满开放问题和研究机会的领域,而不是一个死寂的领域。拉里对他的时间很慷慨。每当我遇到一个学生,不知道还有哪些老师有资格担任他们的论文委员会成员时,我就建议去问拉里,因为我知道他会愿意花时间和学生交谈,认真阅读论文,并有一些有思想的想法要说。拉里去世前几天,当他知道自己的时间不多了,他还在给学生写推荐信。拉里花了很多时间在公共服务上,他鼓励我相信公共服务对社会的价值,尽管一个人可能没有得到认可。拉里工作很努力。他一直活跃在研究、教学和指导学生以及公共服务领域,直到他去世。Larry以前的很多学生都在短时间内从很远的地方赶来参加他的葬礼(甚至是香港!),这证明了Larry对他们生活的影响。拉里也为家人和朋友创造了美好的时光。除了与妻子琳达和家人一起度过的大部分时间外,拉里还在夏天抽出时间和儿子们一起旅行。拉里不管别人的地位如何,都尊重和体面地对待他们。在我担任我们系博士后协调员的时候,一位来自印度一所不知名大学的博士生联系拉里,说我们系有一个博士后空缺
{"title":"Larry Brown: Remembrance and Connections of His Work to Observational Studies","authors":"Dylan S. Small","doi":"10.1353/obs.2018.0011","DOIUrl":"https://doi.org/10.1353/obs.2018.0011","url":null,"abstract":"Statistics lost one of our giants when Larry Brown passed away in February, 2018 at the age of 77. Many lost a friend, a mentor and a teacher. I had the good fortune to be Larry’s colleague for the past 16 years. While I wasn’t as close to him as many others, Larry was my friend and I looked up to him. Larry loved thinking. When Larry got interested in something and scratched his head, I could see his enjoyment. Steam seemed to come out as Larry was thinking and then he happily shared his thoughts. Larry’s classes and talks were full of insights. After joining the faculty at Wharton, I attended Larry’s linear models first year PhD course even though I had seen the material before. I was glad I took the time to do so as I learned a lot, and seeing how Larry thought deeply through things from different perspectives (e.g., he often presented both a geometric and a statistical perspective) was memorable and inspiring. Larry often mentioned questions he had about methods or results, and directions of research he thought could be expanded upon, which I think motivated students to see statistics as a field full of open questions and research opportunities rather than a dead field. Larry was generous with his time. Whenever I had a student for whom it was unclear which other faculty members had the expertise to serve on their dissertation committee, I suggested asking Larry because I knew he would be willing to spend time talking with the student, read the dissertation seriously and have something thoughtful to say. A few days before Larry’s passing, when he knew his time was short, he was writing recommendation letters for students. Larry spent much time on public service, and he encouraged me about its value to society even though one may not get recognition for it. Larry worked hard. He was active in research, teaching and mentoring students and public service until his passing. The large number of Larry’s former students who traveled to his funeral from places far away at short notice (Hong Kong even!) was a testament to Larry’s impact on their lives. Larry also made good time for family and friends. Besides the much time spent together with his wife Linda and their family, Larry made the time for trips over the summer alone with his sons. Larry treated people with respect and decency regardless of their status. At a time when I was the postdoctoral coordinator for our department, a PhD student from a little known university in India contacted Larry about a post doc opening in our department and","PeriodicalId":74335,"journal":{"name":"Observational studies","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2021-06-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1353/obs.2018.0011","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"42764860","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Selective Inference for Effect Modification: An Empirical Investigation 效果修正的选择性推理:一项实证研究
Pub Date : 2021-06-04 DOI: 10.1353/obs.2019.0007
Qingyuan Zhao, Snigdha Panigrahi
Abstract:We demonstrate a selective inferential approach for discovering and making confident conclusions about treatment effect heterogeneity. Our method consists of two stages. First, we use Robinson’s transformation to eliminate confounding in the observational study. Next we select a simple model for effect modification using lasso-regularized regression and then use recently developed tools in selective inference to make valid statistical inference for the discovered effect modifiers. We analyze the Mindset Study data-set provided by the workshop organizers and compare our approach with other benchmark methods.
摘要:我们展示了一种选择性推理方法来发现和得出关于治疗效果异质性的可靠结论。我们的方法包括两个阶段。首先,我们使用罗宾逊变换来消除观察研究中的混淆。接下来,我们使用套索正则化回归选择一个简单的效果修正模型,然后使用最近开发的选择性推理工具对发现的效果修正器进行有效的统计推断。我们分析了研讨会组织者提供的心态研究数据集,并将我们的方法与其他基准方法进行了比较。
{"title":"Selective Inference for Effect Modification: An Empirical Investigation","authors":"Qingyuan Zhao, Snigdha Panigrahi","doi":"10.1353/obs.2019.0007","DOIUrl":"https://doi.org/10.1353/obs.2019.0007","url":null,"abstract":"Abstract:We demonstrate a selective inferential approach for discovering and making confident conclusions about treatment effect heterogeneity. Our method consists of two stages. First, we use Robinson’s transformation to eliminate confounding in the observational study. Next we select a simple model for effect modification using lasso-regularized regression and then use recently developed tools in selective inference to make valid statistical inference for the discovered effect modifiers. We analyze the Mindset Study data-set provided by the workshop organizers and compare our approach with other benchmark methods.","PeriodicalId":74335,"journal":{"name":"Observational studies","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2021-06-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1353/obs.2019.0007","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"48461945","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
Standing on the shoulders of Austin Bradford Hill: The refinement of “specificity” as a consideration in causal inference 站在奥斯汀·布拉德福德·希尔的肩膀上:作为因果推理考虑的“特异性”的细化
Pub Date : 2021-06-04 DOI: 10.1353/OBS.2020.0009
N. Weiss
{"title":"Standing on the shoulders of Austin Bradford Hill: The refinement of “specificity” as a consideration in causal inference","authors":"N. Weiss","doi":"10.1353/OBS.2020.0009","DOIUrl":"https://doi.org/10.1353/OBS.2020.0009","url":null,"abstract":"","PeriodicalId":74335,"journal":{"name":"Observational studies","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2021-06-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1353/OBS.2020.0009","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49002003","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The Inheritance bequeathed to William G. Cochran that he willed forward and left for others to will forward again: The Limits of Observational Studies that seek to Mimic Randomized Experiments 《留给威廉·g·科克伦的遗产:试图模仿随机实验的观察性研究的局限性》是他的遗愿,留给其他人的遗愿
Pub Date : 2021-06-04 DOI: 10.1353/OBS.2015.0012
Thomas D. Cook
{"title":"The Inheritance bequeathed to William G. Cochran that he willed forward and left for others to will forward again: The Limits of Observational Studies that seek to Mimic Randomized Experiments","authors":"Thomas D. Cook","doi":"10.1353/OBS.2015.0012","DOIUrl":"https://doi.org/10.1353/OBS.2015.0012","url":null,"abstract":"","PeriodicalId":74335,"journal":{"name":"Observational studies","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2021-06-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1353/OBS.2015.0012","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"44922735","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
The Validity and Efficiency of Hypothesis Testing in Observational Studies with Time-Varying Exposures 假设检验在时变暴露观察性研究中的有效性和有效性
Pub Date : 2021-06-04 DOI: 10.1353/obs.2018.0010
Harlan Campbell, P. Gustafson
Abstract:The fundamental obstacle of observational studies is that of unmeasured confounding. If all potential confounders are measured within the data, and treatment occurs at but a single time-point, conventional regression adjustment methods provide consistent estimates and allow for valid hypothesis testing in a relatively straightforward manner. In situations for which treatment occurs at several successive timepoints, as in many longitudinal studies, another type of confounding is also problematic: even if all confounders are known and measured in the data, time-dependent confounding may bias estimates and invalidate testing due to collider-stratification. While “causal inference methods” can adequately adjust for time-dependent confounding, these methods require strong and unverifiable assumptions. Alternatively, instrumental variable analysis can be used. By means of a simple illustrative scenario and simulation studies, this paper sheds light on the issues involved when considering the relative merits of these two approaches for the purpose of hypothesis testing in the presence of time-dependent confounding.
摘要:观察性研究的根本障碍是不可测量的混杂。如果在数据中测量了所有潜在的混杂因素,并且治疗只发生在一个时间点,传统的回归调整方法提供一致的估计,并允许以相对直接的方式进行有效的假设检验。在治疗发生在几个连续时间点的情况下,如在许多纵向研究中,另一种类型的混淆也是有问题的:即使所有的混杂因素都是已知的,并且在数据中测量到,时间相关的混淆可能会使估计偏倚,并由于碰撞分层而使测试无效。虽然“因果推理方法”可以充分调整时间相关的混淆,但这些方法需要强大且无法验证的假设。或者,可以使用工具变量分析。通过一个简单的说明性场景和模拟研究,本文揭示了当考虑这两种方法的相对优点时所涉及的问题,以便在存在时间相关混淆的情况下进行假设检验。
{"title":"The Validity and Efficiency of Hypothesis Testing in Observational Studies with Time-Varying Exposures","authors":"Harlan Campbell, P. Gustafson","doi":"10.1353/obs.2018.0010","DOIUrl":"https://doi.org/10.1353/obs.2018.0010","url":null,"abstract":"Abstract:The fundamental obstacle of observational studies is that of unmeasured confounding. If all potential confounders are measured within the data, and treatment occurs at but a single time-point, conventional regression adjustment methods provide consistent estimates and allow for valid hypothesis testing in a relatively straightforward manner. In situations for which treatment occurs at several successive timepoints, as in many longitudinal studies, another type of confounding is also problematic: even if all confounders are known and measured in the data, time-dependent confounding may bias estimates and invalidate testing due to collider-stratification. While “causal inference methods” can adequately adjust for time-dependent confounding, these methods require strong and unverifiable assumptions. Alternatively, instrumental variable analysis can be used. By means of a simple illustrative scenario and simulation studies, this paper sheds light on the issues involved when considering the relative merits of these two approaches for the purpose of hypothesis testing in the presence of time-dependent confounding.","PeriodicalId":74335,"journal":{"name":"Observational studies","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2021-06-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1353/obs.2018.0010","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"48954253","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
A Contemporary Conceptual Framework for Initial Data Analysis 初始数据分析的当代概念框架
Pub Date : 2021-06-04 DOI: 10.1353/obs.2018.0014
M. Huebner, S. le Cessie, C. O. Schmidt, W. Vach
Abstract:Initial data analyses (IDA) are often performed as part of studies with primary-data collection, where data are obtained to address a predefined set of research questions, and with a clear plan of the intended statistical analyses. An informal or unstructured approach may have a large and non-transparent impact on results and conclusions presented in publications. Key principles for IDA are to avoid analyses that are part of the research question, and full documentation and transparency.We develop a framework for IDA from the perspective of a study with primary-data collection and define and discuss six steps of IDA: (1) Metadata setup to properly conduct all following IDA steps, (2) Data cleaning to identify and correct data errors, (3) Data screening that consists of understanding the properties of the data, (4) Initial data reporting that informs all potential collaborators working with the data about insights, (5) Refining and updating the analysis plan to incorporate the relevant findings, (6) Reporting of IDA in research papers to document steps that impact the interpretation of results. We describe basic principles to be applied in each step and illustrate them by example.Initial data analysis needs to be recognized as an important part and independent element of the research process. Lack of resources or organizational barriers can be obstacles to IDA. Further methodological developments are needed for IDA dealing with multi-purpose studies or increasingly complex data sets.
摘要:初始数据分析(IDA)通常是作为研究的一部分进行的,主要数据收集是为了解决预先定义的一组研究问题,并有明确的统计分析计划。非正式或非结构化的方法可能对出版物中提出的结果和结论产生巨大而不透明的影响。IDA的主要原则是避免作为研究问题一部分的分析,以及完整的文件和透明度。我们从初级数据收集研究的角度为IDA开发了一个框架,并定义和讨论了IDA的六个步骤:(1)元数据设置,以正确执行以下IDA步骤;(2)数据清理,以识别和纠正数据错误,(4)初始数据报告,告知所有使用数据的潜在合作者有关见解,(5)完善和更新分析计划,以纳入相关发现,(6)在研究论文中报告IDA,以记录影响结果解释的步骤。我们描述了在每一步中应用的基本原则,并通过实例加以说明。初始数据分析需要被视为研究过程的重要组成部分和独立元素。缺乏资源或组织障碍可能是IDA的障碍。IDA需要进一步发展方法,以处理多用途研究或日益复杂的数据集。
{"title":"A Contemporary Conceptual Framework for Initial Data Analysis","authors":"M. Huebner, S. le Cessie, C. O. Schmidt, W. Vach","doi":"10.1353/obs.2018.0014","DOIUrl":"https://doi.org/10.1353/obs.2018.0014","url":null,"abstract":"Abstract:Initial data analyses (IDA) are often performed as part of studies with primary-data collection, where data are obtained to address a predefined set of research questions, and with a clear plan of the intended statistical analyses. An informal or unstructured approach may have a large and non-transparent impact on results and conclusions presented in publications. Key principles for IDA are to avoid analyses that are part of the research question, and full documentation and transparency.We develop a framework for IDA from the perspective of a study with primary-data collection and define and discuss six steps of IDA: (1) Metadata setup to properly conduct all following IDA steps, (2) Data cleaning to identify and correct data errors, (3) Data screening that consists of understanding the properties of the data, (4) Initial data reporting that informs all potential collaborators working with the data about insights, (5) Refining and updating the analysis plan to incorporate the relevant findings, (6) Reporting of IDA in research papers to document steps that impact the interpretation of results. We describe basic principles to be applied in each step and illustrate them by example.Initial data analysis needs to be recognized as an important part and independent element of the research process. Lack of resources or organizational barriers can be obstacles to IDA. Further methodological developments are needed for IDA dealing with multi-purpose studies or increasingly complex data sets.","PeriodicalId":74335,"journal":{"name":"Observational studies","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2021-06-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1353/obs.2018.0014","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"43137526","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 31
The Regression Discontinuity Design and the Social Corruption of Quantitative Indicators 回归不连续性设计与定量指标的社会腐败
Pub Date : 2021-06-04 DOI: 10.1353/obs.2017.0006
Vivian C. Wong, Coady Wing
Abstract:Thistlethwaite and Campbell (1960) (TC) introduced the Regression Discontinuity Design (RDD) as a strategy for learning about the causal effects of interventions in 1960. Their introduction highlights the most important strengths and weaknesses of the RDD. The main points of the original paper have held up well to more formal scrutiny. However, TC did not address “manipulation of assignment scores” as an important validity threat to the design. The insight that manipulation is a central validity threat is the most important conceptual advance in the methodological literature since its introduction. Although most modern RDD analyses include density tests for assessing manipulation, results are most convincing when diagnostic probes are used to address specific, plausible threats to validity. In this paper, we examine validity threats to two common RD designs used to evaluate the effects of No Child Left Behind and state pre-kindergarten programs.
摘要:Thistlethwaite和Campbell(1960)(TC)在1960年引入了回归不连续性设计(RDD),作为了解干预措施因果效应的一种策略。他们的介绍突出了RDD最重要的优势和劣势。原始论文的要点经过了更正式的审查。然而,TC并没有将“作业分数的操纵”视为对设计有效性的重要威胁。操纵是有效性威胁的核心,这是自引入以来方法论文献中最重要的概念进步。尽管大多数现代RDD分析包括用于评估操作的密度测试,但当诊断探针用于解决对有效性的特定、合理威胁时,结果最具说服力。在本文中,我们检验了两种常见的RD设计的有效性威胁,这两种设计用于评估“不让一个孩子掉队”和州立学前教育项目的效果。
{"title":"The Regression Discontinuity Design and the Social Corruption of Quantitative Indicators","authors":"Vivian C. Wong, Coady Wing","doi":"10.1353/obs.2017.0006","DOIUrl":"https://doi.org/10.1353/obs.2017.0006","url":null,"abstract":"Abstract:Thistlethwaite and Campbell (1960) (TC) introduced the Regression Discontinuity Design (RDD) as a strategy for learning about the causal effects of interventions in 1960. Their introduction highlights the most important strengths and weaknesses of the RDD. The main points of the original paper have held up well to more formal scrutiny. However, TC did not address “manipulation of assignment scores” as an important validity threat to the design. The insight that manipulation is a central validity threat is the most important conceptual advance in the methodological literature since its introduction. Although most modern RDD analyses include density tests for assessing manipulation, results are most convincing when diagnostic probes are used to address specific, plausible threats to validity. In this paper, we examine validity threats to two common RD designs used to evaluate the effects of No Child Left Behind and state pre-kindergarten programs.","PeriodicalId":74335,"journal":{"name":"Observational studies","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2021-06-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1353/obs.2017.0006","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"48521501","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Causal Thinking in the Twilight Zone 模糊地带的因果思维
Pub Date : 2021-06-04 DOI: 10.1353/obs.2015.0020
J. Pearl
To students of causality, the writings of William Cochran provide an excellent and intriguing vantage point for studying how statistics, lacking the necessary mathematical tools, managed nevertheless to cope with increasing demands for policy evaluation from observational studies. Cochran met this challenge in the years 1955-1980, when statistics was preparing for a profound, albeit tortuous transition from a science of data, to a science of data generating processes. The former, governed by Fisher’s dictum (Fisher, 1922) “the object of statistical methods is the reduction of data” was served well by the traditional language of probability theory. The latter, on the other hand, seeking causal effects and policy recommendations, required an extension of probability theory to facilitate mathematical representations of generating processes. No such representation was allowed into respectable statistical circles in the 1950-60s, when Cochran started looking into the social effects of public housing in Baltimore. While data showed improvement in health and well-being of families that moved from slums to public housing, it soon became obvious that the estimated improvement was strongly biased; Cochran reasoned that in order to become eligible for public housing the parent of a family may have to possess both initiative and some determination in dealing with the bureaucracy, thus making their families more likely to obtain better healthcare than non-eligible families. 1 This led him to suggest “adjustment for covariates” for the explicit purpose of reducing this causal effect bias. While there were others before Cochran who applied adjustment for various purposes, Cochran is credited for introducing this technique to statistics (Salsburg, 2002) primarily because he popularized the method and taxonomized it by purpose of usage. Unlike most of his contemporaries, who considered cause-effect relationships “ill-defined” outside the confines of Fisherian experiments, Cochran had no qualm admitting that he sought such relationships in observational studies. He in fact went as far as dening the objective of an observational study: “to elucidate causal-and-effect relationships” in situations where controlled experiments are infeasible (Cochran, 1965). Indeed, in the paper before us, the word “cause” is used fairly freely, and other causal terms such as “effect,” “influence,” and “explanation” are almost as frequent as “regression” or “variance.” Still, Cochran was well aware that he was dealing with unchartered extra-statistical territory and cautioned us: “Claim of proof of cause and effect must carry with it an explanation of the mechanism by which this effect is produced.”
对于因果关系的学生来说,William Cochran的著作为研究缺乏必要数学工具的统计学如何应对观察性研究对政策评估日益增长的需求提供了一个极好而有趣的视角。Cochran在1955-1980年遇到了这一挑战,当时统计学正在为从数据科学向数据生成过程科学的深刻而曲折的转变做准备。前者由Fisher的格言(Fisher,1922)“统计方法的目标是数据的减少”所支配,传统的概率论语言很好地服务于前者。另一方面,后者寻求因果效应和政策建议,需要扩展概率论,以促进生成过程的数学表示。在1950-60年代,当科克伦开始研究巴尔的摩公共住房的社会影响时,没有这样的代表被允许进入受人尊敬的统计界。虽然数据显示,从贫民窟搬到公共住房的家庭的健康和福祉有所改善,但很快就很明显,估计的改善有很大的偏差;Cochran认为,为了有资格获得公共住房,一个家庭的父母可能必须在应对官僚机构时既有主动性,又有一定的决心,从而使他们的家庭比不符合条件的家庭更有可能获得更好的医疗保健。1这导致他建议“对协变量进行调整”,以明确减少这种因果效应偏差。虽然在Cochran之前还有其他人将调整应用于各种目的,但Cochran将这一技术引入统计学(Salsburg,2002),主要是因为他推广了这一方法,并根据使用目的对其进行了分类。与他同时代的大多数人不同,他们认为因果关系在菲舍尔实验的范围之外“定义不清”,科克伦毫不犹豫地承认,他在观察性研究中寻求这种关系。事实上,他甚至否认了一项观察性研究的目标:在控制实验不可行的情况下“阐明因果关系”(Cochran,1965)。事实上,在我们面前的论文中,“原因”一词的使用相当自由,其他因果术语,如“效果”、“影响”和“解释”,几乎与“回归”或“方差”一样频繁,Cochran很清楚,他正在处理未知的额外统计领域,并提醒我们:“因果证明的声明必须附带对产生这种影响的机制的解释。”
{"title":"Causal Thinking in the Twilight Zone","authors":"J. Pearl","doi":"10.1353/obs.2015.0020","DOIUrl":"https://doi.org/10.1353/obs.2015.0020","url":null,"abstract":"To students of causality, the writings of William Cochran provide an excellent and intriguing vantage point for studying how statistics, lacking the necessary mathematical tools, managed nevertheless to cope with increasing demands for policy evaluation from observational studies. Cochran met this challenge in the years 1955-1980, when statistics was preparing for a profound, albeit tortuous transition from a science of data, to a science of data generating processes. The former, governed by Fisher’s dictum (Fisher, 1922) “the object of statistical methods is the reduction of data” was served well by the traditional language of probability theory. The latter, on the other hand, seeking causal effects and policy recommendations, required an extension of probability theory to facilitate mathematical representations of generating processes. No such representation was allowed into respectable statistical circles in the 1950-60s, when Cochran started looking into the social effects of public housing in Baltimore. While data showed improvement in health and well-being of families that moved from slums to public housing, it soon became obvious that the estimated improvement was strongly biased; Cochran reasoned that in order to become eligible for public housing the parent of a family may have to possess both initiative and some determination in dealing with the bureaucracy, thus making their families more likely to obtain better healthcare than non-eligible families. 1 This led him to suggest “adjustment for covariates” for the explicit purpose of reducing this causal effect bias. While there were others before Cochran who applied adjustment for various purposes, Cochran is credited for introducing this technique to statistics (Salsburg, 2002) primarily because he popularized the method and taxonomized it by purpose of usage. Unlike most of his contemporaries, who considered cause-effect relationships “ill-defined” outside the confines of Fisherian experiments, Cochran had no qualm admitting that he sought such relationships in observational studies. He in fact went as far as dening the objective of an observational study: “to elucidate causal-and-effect relationships” in situations where controlled experiments are infeasible (Cochran, 1965). Indeed, in the paper before us, the word “cause” is used fairly freely, and other causal terms such as “effect,” “influence,” and “explanation” are almost as frequent as “regression” or “variance.” Still, Cochran was well aware that he was dealing with unchartered extra-statistical territory and cautioned us: “Claim of proof of cause and effect must carry with it an explanation of the mechanism by which this effect is produced.”","PeriodicalId":74335,"journal":{"name":"Observational studies","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2021-06-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1353/obs.2015.0020","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"48533165","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
期刊
Observational studies
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1