Benjamin S. Baumer, Mine Çetinkaya-Rundel, Andrew Bray, Linda Loi, N. Horton
{"title":"R Markdown: Integrating A Reproducible Analysis Tool into Introductory Statistics","authors":"Benjamin S. Baumer, Mine Çetinkaya-Rundel, Andrew Bray, Linda Loi, N. Horton","doi":"10.5070/T581020118","DOIUrl":null,"url":null,"abstract":"Nolan and Temple Lang argue that \"the ability to express statistical computations is an essential skill.\" A key related capacity is the ability to conduct and present data analysis in a way that another person can understand and replicate. The copy-and-paste workflow that is an artifact of antiquated user-interface design makes reproducibility of statistical analysis more difficult, especially as data become increasingly complex and statistical methods become increasingly sophisticated. R Markdown is a new technology that makes creating fully-reproducible statistical analysis simple and painless. It provides a solution suitable not only for cutting edge research, but also for use in an introductory statistics course. We present evidence that R Markdown can be used effectively in introductory statistics courses, and discuss its role in the rapidly-changing world of statistical computation.","PeriodicalId":413623,"journal":{"name":"arXiv: Other Statistics","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-02-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"118","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv: Other Statistics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.5070/T581020118","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 118
Abstract
Nolan and Temple Lang argue that "the ability to express statistical computations is an essential skill." A key related capacity is the ability to conduct and present data analysis in a way that another person can understand and replicate. The copy-and-paste workflow that is an artifact of antiquated user-interface design makes reproducibility of statistical analysis more difficult, especially as data become increasingly complex and statistical methods become increasingly sophisticated. R Markdown is a new technology that makes creating fully-reproducible statistical analysis simple and painless. It provides a solution suitable not only for cutting edge research, but also for use in an introductory statistics course. We present evidence that R Markdown can be used effectively in introductory statistics courses, and discuss its role in the rapidly-changing world of statistical computation.