Approachable Case Studies Support Learning and Reproducibility in Data Science: An Example from Evolutionary Biology

IF 1.5 Q2 EDUCATION, SCIENTIFIC DISCIPLINES Journal of Statistics and Data Science Education Pub Date : 2022-07-13 DOI:10.1080/26939169.2022.2099487
Luna L. Sánchez Reyes, E. J. McTavish
{"title":"Approachable Case Studies Support Learning and Reproducibility in Data Science: An Example from Evolutionary Biology","authors":"Luna L. Sánchez Reyes, E. J. McTavish","doi":"10.1080/26939169.2022.2099487","DOIUrl":null,"url":null,"abstract":"ABSTRACT Research reproducibility is essential for scientific development. Yet, rates of reproducibility are low. As increasingly more research relies on computers and software, efforts for improving reproducibility rates have focused on making research products digitally available, such as publishing analysis workflows as computer code, and raw and processed data in computer readable form. However, research products that are digitally available are not necessarily friendly for learners and interested parties with little to no experience in the field. This renders research products unapproachable, counteracts their availability, and hinders scientific reproducibility. To improve both short- and long-term adoption of reproducible scientific practices, research products need to be made approachable for learners, the researchers of the future. Using a case study within evolutionary biology, we identify aspects of research workflows that make them unapproachable to the general audience: use of highly specialized language; unclear goals and high cognitive load; and lack of trouble-shooting examples. We propose principles to improve the unapproachable aspects of research workflows and illustrate their application using an online teaching resource. We elaborate on the general application of these principles for documenting research products and teaching materials, to provide present learners and future researchers with tools for successful scientific reproducibility. Supplementary materials for this article are available online.","PeriodicalId":34851,"journal":{"name":"Journal of Statistics and Data Science Education","volume":"30 1","pages":"304 - 310"},"PeriodicalIF":1.5000,"publicationDate":"2022-07-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Statistics and Data Science Education","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1080/26939169.2022.2099487","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"EDUCATION, SCIENTIFIC DISCIPLINES","Score":null,"Total":0}
引用次数: 2

Abstract

ABSTRACT Research reproducibility is essential for scientific development. Yet, rates of reproducibility are low. As increasingly more research relies on computers and software, efforts for improving reproducibility rates have focused on making research products digitally available, such as publishing analysis workflows as computer code, and raw and processed data in computer readable form. However, research products that are digitally available are not necessarily friendly for learners and interested parties with little to no experience in the field. This renders research products unapproachable, counteracts their availability, and hinders scientific reproducibility. To improve both short- and long-term adoption of reproducible scientific practices, research products need to be made approachable for learners, the researchers of the future. Using a case study within evolutionary biology, we identify aspects of research workflows that make them unapproachable to the general audience: use of highly specialized language; unclear goals and high cognitive load; and lack of trouble-shooting examples. We propose principles to improve the unapproachable aspects of research workflows and illustrate their application using an online teaching resource. We elaborate on the general application of these principles for documenting research products and teaching materials, to provide present learners and future researchers with tools for successful scientific reproducibility. Supplementary materials for this article are available online.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
可接近的案例研究支持数据科学中的学习和再现性:一个来自进化生物学的例子
摘要研究的再现性对科学发展至关重要。然而,再现率很低。随着越来越多的研究依赖于计算机和软件,提高再现率的努力集中在使研究产品数字化,例如以计算机代码的形式发布分析工作流程,以及以计算机可读形式发布原始和处理数据。然而,数字化的研究产品对在该领域几乎没有经验的学习者和感兴趣的各方来说并不一定友好。这使得研究产品无法接近,抵消了它们的可用性,并阻碍了科学的再现性。为了提高可重复科学实践的短期和长期采用率,研究产品需要让学习者和未来的研究人员能够接近。通过进化生物学中的一个案例研究,我们确定了研究工作流程中使其无法为普通受众所接受的方面:使用高度专业化的语言;目标不明确,认知负荷高;以及缺乏故障排除示例。我们提出了改进研究工作流程中不可接近的方面的原则,并使用在线教学资源说明了它们的应用。我们详细阐述了这些原则在记录研究产品和教材方面的一般应用,为现在的学习者和未来的研究人员提供成功的科学再现性工具。本文的补充材料可在线获取。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Journal of Statistics and Data Science Education
Journal of Statistics and Data Science Education EDUCATION, SCIENTIFIC DISCIPLINES-
CiteScore
3.90
自引率
35.30%
发文量
52
审稿时长
12 weeks
期刊最新文献
Investigating Sensitive Issues in Class Through Randomized Response Polling Teaching Students to Read COVID-19 Journal Articles in Statistics Courses Journal of Statistics and Data Science Education 2023 Associate Editors Interviews of Notable Statistics and Data Science Educators Coding Code: Qualitative Methods for Investigating Data Science Skills
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1