首页 > 最新文献

Chance (New York, N.Y.)最新文献

英文 中文
Lessons from Applying the Community Rapid Assessment Method to COVID-19 Protective Measures in Three Countries 社区快速评估方法应用于三国COVID-19防护措施的经验教训
Pub Date : 2021-07-03 DOI: 10.1080/09332480.2021.1979806
C. Andersen, U. Huynh, Andrés Ochoa Toasa, C. Wells, M. Wong
Abstract Previous major infection outbreaks have shown the importance of timely information about local conditions in guiding support and health care interventions. During the COVID-19 pandemic, UNICEF developed the Community Rapid Assessment (CRA) method to address this need. The CRA uses cell phone technology and a questionnaire based on an advanced behavioral model. The success of this kind of instrument depends on a variety of statistical issues such as whether the samples are representative of the population, the detailed design of questions, the quality of responses, and the choice of methods for inferential analysis. The purpose of this article is to describe the CRA method and lessons learned from a preliminary inferential analysis.
以往的重大感染暴发表明,及时了解当地情况对指导支持和卫生保健干预的重要性。在2019冠状病毒病大流行期间,联合国儿童基金会制定了社区快速评估(CRA)方法来满足这一需求。CRA使用手机技术和基于先进行为模型的问卷调查。这种工具的成功取决于各种统计问题,如样本是否能代表总体、问题的详细设计、回答的质量以及推理分析方法的选择。本文的目的是描述CRA方法和从初步推论分析中吸取的经验教训。
{"title":"Lessons from Applying the Community Rapid Assessment Method to COVID-19 Protective Measures in Three Countries","authors":"C. Andersen, U. Huynh, Andrés Ochoa Toasa, C. Wells, M. Wong","doi":"10.1080/09332480.2021.1979806","DOIUrl":"https://doi.org/10.1080/09332480.2021.1979806","url":null,"abstract":"Abstract Previous major infection outbreaks have shown the importance of timely information about local conditions in guiding support and health care interventions. During the COVID-19 pandemic, UNICEF developed the Community Rapid Assessment (CRA) method to address this need. The CRA uses cell phone technology and a questionnaire based on an advanced behavioral model. The success of this kind of instrument depends on a variety of statistical issues such as whether the samples are representative of the population, the detailed design of questions, the quality of responses, and the choice of methods for inferential analysis. The purpose of this article is to describe the CRA method and lessons learned from a preliminary inferential analysis.","PeriodicalId":88226,"journal":{"name":"Chance (New York, N.Y.)","volume":"2 1","pages":"6 - 12"},"PeriodicalIF":0.0,"publicationDate":"2021-07-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"80811001","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
COVID Monitoring Framework for Indian Cities 印度城市COVID监测框架
Pub Date : 2021-07-03 DOI: 10.1080/09332480.2021.1981052
Preetam Debasish Saha Roy, Sangeeta Jayadevan
In this article, we discuss the attempt to synthesize disparate sources of information Non-profit Organizations India Excellence Forum (IEF) and Statistics without Borders (SWB) collaborated to develop a platform that would aid in decision making for different stakeholders. The goal was to leverage pre-existing infectious disease models and COVID-19 related open data to provide relevant monitoring metrics at different granular levels such as States, Districts, City and Wards.
在本文中,我们讨论了综合不同信息来源的尝试,非营利组织印度卓越论坛(IEF)和无国界统计组织(SWB)合作开发了一个平台,可以帮助不同利益相关者做出决策。目标是利用已有的传染病模型和与COVID-19相关的开放数据,提供州、区、市和病房等不同粒度级别的相关监测指标。
{"title":"COVID Monitoring Framework for Indian Cities","authors":"Preetam Debasish Saha Roy, Sangeeta Jayadevan","doi":"10.1080/09332480.2021.1981052","DOIUrl":"https://doi.org/10.1080/09332480.2021.1981052","url":null,"abstract":"In this article, we discuss the attempt to synthesize disparate sources of information Non-profit Organizations India Excellence Forum (IEF) and Statistics without Borders (SWB) collaborated to develop a platform that would aid in decision making for different stakeholders. The goal was to leverage pre-existing infectious disease models and COVID-19 related open data to provide relevant monitoring metrics at different granular levels such as States, Districts, City and Wards.","PeriodicalId":88226,"journal":{"name":"Chance (New York, N.Y.)","volume":"18 1","pages":"W73 - W81"},"PeriodicalIF":0.0,"publicationDate":"2021-07-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"85359947","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Machine Learning Approach to Helping Small Businesses Find Pandemic Economic-Impact Relief 帮助小企业找到流行病经济影响救济的机器学习方法
Pub Date : 2021-07-03 DOI: 10.1080/09332480.2021.1979820
M. Czapski, S. Godfrey, Joshua Derenski, Isaac Khader
While the Global Health Organization was able to officially declare the spread of COVID-19 as a global pandemic late in Q1 2020, the most effective responses from both governmental and private organizations were by no means clear. Very little was known about what was then frequently referred to as the novel coronavirus, and medical professionals had few recommendations specific to this disease. Still, what was abundantly clear was stay-at-home and lockdown orders were needed to bend the curve or slow transmission. As customers sheltered in place and businesses closed their doors, the impact on small businesses was expected to be devastating. With so many sources for potential aid from U.S. governments, and private and philanthropic entities available C2CB, aided by SWB, focused on helping small businesses identify relevant aid resources. SWB, consulting with C2CB, built a multistage data pipeline using machine learning techniques to automatically curate a national list of small-business aid programs, presenting users with results to efficiently research and find relevant aid programs. While this project curates business relief grants, it is a proof-of-concept for a no-cost data pipeline using machine learning techniques with automated website relevancy classification.
虽然全球卫生组织(who)在2020年第一季度末才正式宣布新冠肺炎(COVID-19)为全球大流行,但政府和民间组织的最有效应对措施并不明确。人们对当时经常被称为新型冠状病毒的东西知之甚少,医疗专业人员也几乎没有针对这种疾病的建议。不过,非常清楚的是,需要居家和封锁令来扭转形势或减缓传播。由于顾客躲在原地,企业关门,预计对小企业的影响是毁灭性的。由于美国政府、私人和慈善机构提供的潜在援助来源如此之多,C2CB在SWB的协助下,专注于帮助小企业确定相关的援助资源。SWB咨询了C2CB,利用机器学习技术建立了一个多级数据管道,自动管理全国小企业援助项目清单,向用户展示结果,以便有效地研究和找到相关的援助项目。虽然这个项目管理商业救济拨款,但它是一个使用机器学习技术和自动网站相关性分类的免费数据管道的概念验证。
{"title":"A Machine Learning Approach to Helping Small Businesses Find Pandemic Economic-Impact Relief","authors":"M. Czapski, S. Godfrey, Joshua Derenski, Isaac Khader","doi":"10.1080/09332480.2021.1979820","DOIUrl":"https://doi.org/10.1080/09332480.2021.1979820","url":null,"abstract":"While the Global Health Organization was able to officially declare the spread of COVID-19 as a global pandemic late in Q1 2020, the most effective responses from both governmental and private organizations were by no means clear. Very little was known about what was then frequently referred to as the novel coronavirus, and medical professionals had few recommendations specific to this disease. Still, what was abundantly clear was stay-at-home and lockdown orders were needed to bend the curve or slow transmission. As customers sheltered in place and businesses closed their doors, the impact on small businesses was expected to be devastating. With so many sources for potential aid from U.S. governments, and private and philanthropic entities available C2CB, aided by SWB, focused on helping small businesses identify relevant aid resources. SWB, consulting with C2CB, built a multistage data pipeline using machine learning techniques to automatically curate a national list of small-business aid programs, presenting users with results to efficiently research and find relevant aid programs. While this project curates business relief grants, it is a proof-of-concept for a no-cost data pipeline using machine learning techniques with automated website relevancy classification.","PeriodicalId":88226,"journal":{"name":"Chance (New York, N.Y.)","volume":"215 1","pages":"61 - 68"},"PeriodicalIF":0.0,"publicationDate":"2021-07-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"74163958","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Editor’s Letter 编辑的信
Pub Date : 2021-07-03 DOI: 10.1080/09332480.2021.1979804
Amanda Peterson-Plunkett
{"title":"Editor’s Letter","authors":"Amanda Peterson-Plunkett","doi":"10.1080/09332480.2021.1979804","DOIUrl":"https://doi.org/10.1080/09332480.2021.1979804","url":null,"abstract":"","PeriodicalId":88226,"journal":{"name":"Chance (New York, N.Y.)","volume":"68 1","pages":"3 - 3"},"PeriodicalIF":0.0,"publicationDate":"2021-07-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"87062572","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Special Issue on Statistics and Data Science for Good 统计和数据科学专刊
Pub Date : 2021-07-03 DOI: 10.1080/09332480.2021.1979805
C. Augustin, M. Brems, Davina Durgana
{"title":"Special Issue on Statistics and Data Science for Good","authors":"C. Augustin, M. Brems, Davina Durgana","doi":"10.1080/09332480.2021.1979805","DOIUrl":"https://doi.org/10.1080/09332480.2021.1979805","url":null,"abstract":"","PeriodicalId":88226,"journal":{"name":"Chance (New York, N.Y.)","volume":"14 1","pages":"4 - 5"},"PeriodicalIF":0.0,"publicationDate":"2021-07-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"79366086","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Teaching Courses Focused on Social Good 以社会公益为重点的教学课程
Pub Date : 2021-07-03 DOI: 10.1080/09332480.2021.1979821
Maria Tackett, Kendra S. Burbank, Judith E. Canner, Mine Çetinkaya-Rundel
In this column we describe two courses that focus on the role of statistics in understanding social issues. The first is an introductory statistics course developed by Dr. Kendra Burbank at the University of Chicago where students learn statistical methods as they explore different aspects of the water crisis in Flint, Michigan. Then, we describe an intermediate-level service learning course taught by Dr. Judith Canner and Dr. Alana Unfried at California State University Monterey Bay University where students use statistics to consult with local nonprofits and learn how to take a data-driven approach to affect change. We conclude with resources for instructors interested in incorporating social issues in their courses.
在本专栏中,我们将介绍两门课程,重点关注统计学在理解社会问题中的作用。第一个是由芝加哥大学的肯德拉·伯班克博士开设的统计学入门课程,学生在探索密歇根州弗林特市水危机的不同方面时,可以学习统计学方法。然后,我们描述了一个中级服务学习课程,由加州州立大学蒙特利湾大学的Judith Canner博士和Alana Unfried博士教授,学生使用统计数据向当地非营利组织咨询,并学习如何采用数据驱动的方法来影响变革。最后,我们为有兴趣将社会问题纳入课程的教师提供了资源。
{"title":"Teaching Courses Focused on Social Good","authors":"Maria Tackett, Kendra S. Burbank, Judith E. Canner, Mine Çetinkaya-Rundel","doi":"10.1080/09332480.2021.1979821","DOIUrl":"https://doi.org/10.1080/09332480.2021.1979821","url":null,"abstract":"In this column we describe two courses that focus on the role of statistics in understanding social issues. The first is an introductory statistics course developed by Dr. Kendra Burbank at the University of Chicago where students learn statistical methods as they explore different aspects of the water crisis in Flint, Michigan. Then, we describe an intermediate-level service learning course taught by Dr. Judith Canner and Dr. Alana Unfried at California State University Monterey Bay University where students use statistics to consult with local nonprofits and learn how to take a data-driven approach to affect change. We conclude with resources for instructors interested in incorporating social issues in their courses.","PeriodicalId":88226,"journal":{"name":"Chance (New York, N.Y.)","volume":"11 1","pages":"69 - 72"},"PeriodicalIF":0.0,"publicationDate":"2021-07-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"89564317","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Data Science for Social Good Volunteer Motivations and Limitations: An Exploratory Survey 数据科学对社会公益志愿者的动机和限制:一项探索性调查
Pub Date : 2021-07-03 DOI: 10.1080/09332480.2021.1981055
Benjamin Kinsella
Broadly understood as the Data for Good (D4G) movement, coordinated efforts between technologists, domain experts, and mission driven organizations are addressing some of the world’s most pressing challenges using data science and AI applications to. One prominent mode of D4G engagement is skills-based volunteering, such as the work conducted by DataKind, a global nonprofit that pairs pro bono technologists with social sector organizations. This article examines D4G volunteering, reporting on findings from DataKind’s global volunteer survey that explores the community’s characteristics, motivations, and even the limitations that hinder long-term project engagement. As a global data collection and assessment effort of a self-identified D4G community, this study informs the broader community of practice and collaboration opportunities, which seek to advance a more equitable and ethical D4G ecosystem.
被广泛理解为数据造福(D4G)运动,技术专家、领域专家和使命驱动型组织之间的协调努力正在利用数据科学和人工智能应用来解决一些世界上最紧迫的挑战。D4G参与的一个突出模式是基于技能的志愿服务,比如DataKind开展的工作。DataKind是一家全球性非营利组织,将公益性技术专家与社会部门组织配对。本文研究了D4G志愿者,报告了DataKind全球志愿者调查的结果,该调查探讨了社区的特征、动机,甚至是阻碍长期项目参与的限制。作为一个自我认定的D4G社区的全球数据收集和评估工作,本研究为更广泛的社区提供了实践和合作机会,旨在推动一个更加公平和道德的D4G生态系统。
{"title":"Data Science for Social Good Volunteer Motivations and Limitations: An Exploratory Survey","authors":"Benjamin Kinsella","doi":"10.1080/09332480.2021.1981055","DOIUrl":"https://doi.org/10.1080/09332480.2021.1981055","url":null,"abstract":"Broadly understood as the Data for Good (D4G) movement, coordinated efforts between technologists, domain experts, and mission driven organizations are addressing some of the world’s most pressing challenges using data science and AI applications to. One prominent mode of D4G engagement is skills-based volunteering, such as the work conducted by DataKind, a global nonprofit that pairs pro bono technologists with social sector organizations. This article examines D4G volunteering, reporting on findings from DataKind’s global volunteer survey that explores the community’s characteristics, motivations, and even the limitations that hinder long-term project engagement. As a global data collection and assessment effort of a self-identified D4G community, this study informs the broader community of practice and collaboration opportunities, which seek to advance a more equitable and ethical D4G ecosystem.","PeriodicalId":88226,"journal":{"name":"Chance (New York, N.Y.)","volume":"4 1","pages":"W86 - W95"},"PeriodicalIF":0.0,"publicationDate":"2021-07-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"72984372","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Building Statistics and Data Science Capacity for Development 建设统计和数据科学促进发展的能力
Pub Date : 2021-07-03 DOI: 10.1080/09332480.2021.1979810
Eric A. Vance, Kim Love
Data-driven decision making for sustainable development requires domain expertise to ask the right questions; high-quality, relevant data; appropriate, nuanced statistical analyses; and the power to make and implement a decision. Statistics enables and accelerates all of these aspects. We propose a new model for building statistics and data science capacity to engage in data-driven development. Statisticians and data scientists must be able to understand the data and projects they are working with on both a deep and broad level and be able to communicate the results of statistical methods and analytical work in ways that provide actionable evidence to those who can use it to positively impact society. Our model for building statistics and data science capacity is to create statistics and data science collaboration laboratories (“stat labs”) that work in the intersections of data-driven development by collaborating with data producers and data decision makers to transform evidence into action. We present lessons learned from the LISA 2020 Network, which has leveraged the collective experiences of more than 30 newly created stat labs in developing countries to build such statistics and data science capacity by focusing on the intersections of data-driven development.
数据驱动的可持续发展决策需要领域专业知识来提出正确的问题;高质量、相关的数据;适当、细致的统计分析;以及制定和执行决策的权力。统计数据支持并加速了所有这些方面。我们提出了建立统计和数据科学能力的新模式,以参与数据驱动的发展。统计学家和数据科学家必须能够在深度和广泛的层面上理解他们正在处理的数据和项目,并能够以提供可操作证据的方式向那些可以使用它对社会产生积极影响的人传达统计方法和分析工作的结果。我们建立统计和数据科学能力的模式是创建统计和数据科学合作实验室(“统计实验室”),通过与数据生产者和数据决策者合作,将证据转化为行动,在数据驱动发展的交叉点工作。我们介绍了从LISA 2020网络中吸取的经验教训,该网络利用了发展中国家30多个新成立的统计实验室的集体经验,通过关注数据驱动发展的交叉点来建设此类统计和数据科学能力。
{"title":"Building Statistics and Data Science Capacity for Development","authors":"Eric A. Vance, Kim Love","doi":"10.1080/09332480.2021.1979810","DOIUrl":"https://doi.org/10.1080/09332480.2021.1979810","url":null,"abstract":"Data-driven decision making for sustainable development requires domain expertise to ask the right questions; high-quality, relevant data; appropriate, nuanced statistical analyses; and the power to make and implement a decision. Statistics enables and accelerates all of these aspects. We propose a new model for building statistics and data science capacity to engage in data-driven development. Statisticians and data scientists must be able to understand the data and projects they are working with on both a deep and broad level and be able to communicate the results of statistical methods and analytical work in ways that provide actionable evidence to those who can use it to positively impact society. Our model for building statistics and data science capacity is to create statistics and data science collaboration laboratories (“stat labs”) that work in the intersections of data-driven development by collaborating with data producers and data decision makers to transform evidence into action. We present lessons learned from the LISA 2020 Network, which has leveraged the collective experiences of more than 30 newly created stat labs in developing countries to build such statistics and data science capacity by focusing on the intersections of data-driven development.","PeriodicalId":88226,"journal":{"name":"Chance (New York, N.Y.)","volume":"13 1","pages":"38 - 46"},"PeriodicalIF":0.0,"publicationDate":"2021-07-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"83484221","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
A History of Data Visualization and Graphic Communication 数据可视化和图形通信的历史
Pub Date : 2021-06-03 DOI: 10.4159/9780674259034
Leland Wilkinson
38 practical, and more effectively promoted, than their Bayesian counterparts. I am also not certain about the extent to which the replication crisis is due to the use of the p-value. Although it is certainly a contributing factor, I personally believe the root cause is that the data analyst is usually fully invested in the outcome, and has every incentive imaginable to obtain the most flattering result. All in all, I believe a book as belligerent and instantly controversial as Bernoulli’s Fallacy deserves an appendix in which some of the claims are debated with other statisticians (cf. Berger & Wolpert, 1988). My final misgiving is that the author occasionally gives Ed Jaynes a little too much credit. For instance, the notion that all probability statements are conditional on prior knowledge is found in Keynes (1921), and both Jeffreys and Lindley consistently conditioned on “H.” (for “history”) or “K” (for “knowledge”) before Jaynes. Similarly, the idea that Bayesian inference is a logic of partial beliefs predates Jaynes—it goes back at least to De Morgan (1847/2003), Ramsey (1926), and de Finetti (1974). Despite these minor misgivings, this book comes highly recommended. Bernoulli’s Fallacy elegantly connects the past to the present in an attempt to dismantle the reigning statistical orthodoxy. Buy this book, and give it to your students so they may learn about Bayesian inference and the history of statistics; give it to your colleagues working in the empirical sciences so they will understand that the frequentist emperor is scantily dressed; give it to your frequentist friends as a provocation. Or read it yourself, so you will be prompted to think more deeply about the foundations of statistical inference.
38比贝叶斯理论更实用,更有效地推广。我也不确定复制危机在多大程度上是由于使用p值造成的。虽然这当然是一个促成因素,但我个人认为根本原因是数据分析师通常完全投入到结果中,并且有一切可以想象的动机来获得最令人满意的结果。总而言之,我认为一本像伯努利谬误这样好战且立即引起争议的书值得在附录中与其他统计学家讨论其中的一些主张(参见Berger & Wolpert, 1988)。我最后的疑虑是,作者偶尔会给Ed Jaynes太多的信任。例如,凯恩斯(1921)认为,所有概率陈述都以先验知识为条件,杰弗里斯和林德利也始终以“H”为条件。(代表“历史”)或“K”(代表“知识”)。同样,贝叶斯推理是部分信念逻辑的观点早于詹尼斯——它至少可以追溯到德·摩根(1847/2003)、拉姆齐(1926)和德·菲内蒂(1974)。尽管有这些小小的疑虑,这本书还是被强烈推荐。伯努利的谬论巧妙地将过去与现在联系起来,试图瓦解占统治地位的统计正统。买这本书,把它给你的学生,这样他们就可以学习贝叶斯推理和统计学的历史;把它送给你在实证科学领域工作的同事,这样他们就会明白,这位频繁出入的皇帝衣着暴露;把它送给你的频繁使用的朋友,作为挑衅。或者自己读一下,这样你就会对统计推断的基础有更深入的思考。
{"title":"A History of Data Visualization and Graphic Communication","authors":"Leland Wilkinson","doi":"10.4159/9780674259034","DOIUrl":"https://doi.org/10.4159/9780674259034","url":null,"abstract":"38 practical, and more effectively promoted, than their Bayesian counterparts. I am also not certain about the extent to which the replication crisis is due to the use of the p-value. Although it is certainly a contributing factor, I personally believe the root cause is that the data analyst is usually fully invested in the outcome, and has every incentive imaginable to obtain the most flattering result. All in all, I believe a book as belligerent and instantly controversial as Bernoulli’s Fallacy deserves an appendix in which some of the claims are debated with other statisticians (cf. Berger & Wolpert, 1988). My final misgiving is that the author occasionally gives Ed Jaynes a little too much credit. For instance, the notion that all probability statements are conditional on prior knowledge is found in Keynes (1921), and both Jeffreys and Lindley consistently conditioned on “H.” (for “history”) or “K” (for “knowledge”) before Jaynes. Similarly, the idea that Bayesian inference is a logic of partial beliefs predates Jaynes—it goes back at least to De Morgan (1847/2003), Ramsey (1926), and de Finetti (1974). Despite these minor misgivings, this book comes highly recommended. Bernoulli’s Fallacy elegantly connects the past to the present in an attempt to dismantle the reigning statistical orthodoxy. Buy this book, and give it to your students so they may learn about Bayesian inference and the history of statistics; give it to your colleagues working in the empirical sciences so they will understand that the frequentist emperor is scantily dressed; give it to your frequentist friends as a provocation. Or read it yourself, so you will be prompted to think more deeply about the foundations of statistical inference.","PeriodicalId":88226,"journal":{"name":"Chance (New York, N.Y.)","volume":"4 1","pages":"38 - 40"},"PeriodicalIF":0.0,"publicationDate":"2021-06-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"87267411","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 10
Who is Accountable for Data Bias? 谁对数据偏差负责?
Pub Date : 2021-04-03 DOI: 10.1080/09332480.2021.1915032
C. Parkey
39 Accountability for misuse of data is a big question in using data science and machine learning (ML) to advance society. Are the data collectors, model builders, or users ultimately accountable? The benefits of data sharing are widely recognized by the scientific community, but headlines can also be seen in the news about models that are released with known bias or without any impact monitoring and reporting in place. Examples include “Florida scientist says she was fired for not manipulating COVID-19 Data” and “Google Researcher Says She Was Fired Over Paper Highlighting Bias in A.I.” after a paper by Timnit Gebru that highlighted the risk of large language models was accepted. Organizations such as the World Health Organization (WHO) have pages of policies qualifying how data were collected, the limitations, and restrictions on use. At the same time, whistleblowers and researchers alike are pushing back, attempting to hold companies and states accountable for their misuse of data. While there is no clear answer, the question of accountability at multiple levels can be explored, as well as how to begin implementing systems of accountability now instead of waiting for regulations to provide guidance.
39在使用数据科学和机器学习(ML)推动社会进步的过程中,对数据滥用的问责是一个大问题。数据收集者、模型构建者或用户是否负有最终责任?数据共享的好处得到了科学界的广泛认可,但也可以在新闻中看到一些模型的头条新闻,这些模型发布时带有已知的偏见,或者没有进行任何影响监测和报告。例如,在Timnit Gebru的一篇强调大型语言模型风险的论文被接受后,“佛罗里达州的科学家说她因为没有操纵COVID-19数据而被解雇”和“谷歌研究人员说她因为强调人工智能偏见的论文而被解雇”。世界卫生组织(世卫组织)等组织有详细说明数据收集方式、限制和使用限制的政策页面。与此同时,举报人和研究人员都在反击,试图让公司和国家为滥用数据负责。虽然没有明确的答案,但可以探讨多层次问责制的问题,以及如何现在就开始实施问责制,而不是等待法规提供指导。
{"title":"Who is Accountable for Data Bias?","authors":"C. Parkey","doi":"10.1080/09332480.2021.1915032","DOIUrl":"https://doi.org/10.1080/09332480.2021.1915032","url":null,"abstract":"39 Accountability for misuse of data is a big question in using data science and machine learning (ML) to advance society. Are the data collectors, model builders, or users ultimately accountable? The benefits of data sharing are widely recognized by the scientific community, but headlines can also be seen in the news about models that are released with known bias or without any impact monitoring and reporting in place. Examples include “Florida scientist says she was fired for not manipulating COVID-19 Data” and “Google Researcher Says She Was Fired Over Paper Highlighting Bias in A.I.” after a paper by Timnit Gebru that highlighted the risk of large language models was accepted. Organizations such as the World Health Organization (WHO) have pages of policies qualifying how data were collected, the limitations, and restrictions on use. At the same time, whistleblowers and researchers alike are pushing back, attempting to hold companies and states accountable for their misuse of data. While there is no clear answer, the question of accountability at multiple levels can be explored, as well as how to begin implementing systems of accountability now instead of waiting for regulations to provide guidance.","PeriodicalId":88226,"journal":{"name":"Chance (New York, N.Y.)","volume":"29 1","pages":"39 - 43"},"PeriodicalIF":0.0,"publicationDate":"2021-04-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"73948438","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Chance (New York, N.Y.)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1