Statistical Analysis of Numerical Preclinical Radiobiological Data

Helene Z. Hill, J. Pitt
{"title":"Statistical Analysis of Numerical Preclinical Radiobiological Data","authors":"Helene Z. Hill, J. Pitt","doi":"10.14293/S2199-1006.1.SOR-STAT.AFHTWC.V1","DOIUrl":null,"url":null,"abstract":"Background: Scientific fraud is an increasingly vexing problem.  Many current programs for fraud detection focus on image manipulation, while techniques for detection based on anomalous patterns that may be discoverable in the underlying numerical data  get much less attention, even though these techniques are often easy to apply.  Methods: We applied statistical techniques in considering and comparing data sets from ten researchers in one laboratory and three outside investigators to determine whether anomalous patterns in data from a research teaching assistant (RTS) were likely to have occurred by chance. Rightmost digits of values in RTS data sets were not, as expected, uniform; equal pairs of terminal digits occurred at higher than expected frequency (> 10%); and, an unexpectedly large number of data triples commonly produced in such research included values near their means as an element. We applied standard statistical tests (chi-squared goodness of fit, binomial probabilities) to determine the likelihood of the first two anomalous patterns, and developed a new statistical model to test the third.  Results: Application of the three tests to various data sets reported by RTS resulted in repeated rejection of the hypotheses (often at p-levels well below 0.001) that anomalous patterns in those data may have occurred by chance.  Similar application to data sets from other investigators were entirely consistent with chance occurrence. Conclusions: This analysis emphasizes the importance of access to raw data that form the bases of publications, reports and grant applications in order to evaluate the correctness of the conclusions, and the importance of applying statistical methods to detect anomalous, especially potentially fabricated, numerical results.","PeriodicalId":91169,"journal":{"name":"ScienceOpen research","volume":"41 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2016-01-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"ScienceOpen research","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.14293/S2199-1006.1.SOR-STAT.AFHTWC.V1","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

Background: Scientific fraud is an increasingly vexing problem.  Many current programs for fraud detection focus on image manipulation, while techniques for detection based on anomalous patterns that may be discoverable in the underlying numerical data  get much less attention, even though these techniques are often easy to apply.  Methods: We applied statistical techniques in considering and comparing data sets from ten researchers in one laboratory and three outside investigators to determine whether anomalous patterns in data from a research teaching assistant (RTS) were likely to have occurred by chance. Rightmost digits of values in RTS data sets were not, as expected, uniform; equal pairs of terminal digits occurred at higher than expected frequency (> 10%); and, an unexpectedly large number of data triples commonly produced in such research included values near their means as an element. We applied standard statistical tests (chi-squared goodness of fit, binomial probabilities) to determine the likelihood of the first two anomalous patterns, and developed a new statistical model to test the third.  Results: Application of the three tests to various data sets reported by RTS resulted in repeated rejection of the hypotheses (often at p-levels well below 0.001) that anomalous patterns in those data may have occurred by chance.  Similar application to data sets from other investigators were entirely consistent with chance occurrence. Conclusions: This analysis emphasizes the importance of access to raw data that form the bases of publications, reports and grant applications in order to evaluate the correctness of the conclusions, and the importance of applying statistical methods to detect anomalous, especially potentially fabricated, numerical results.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
临床前放射生物学数值数据的统计分析
背景:科学欺诈是一个日益令人烦恼的问题。许多当前的欺诈检测程序侧重于图像处理,而基于可能在底层数字数据中发现的异常模式的检测技术得到的关注要少得多,尽管这些技术通常很容易应用。方法:我们应用统计技术考虑和比较来自一个实验室的10名研究人员和3名外部调查人员的数据集,以确定来自研究教学助理(RTS)的数据中的异常模式是否可能是偶然发生的。RTS数据集中最右边的数值并不像预期的那样是一致的;等对终端数字出现的频率高于预期(> 10%);而且,在此类研究中通常产生的大量数据三元组将接近其均值的值作为元素。我们应用标准统计检验(卡方拟合优度、二项概率)来确定前两种异常模式的可能性,并开发了一种新的统计模型来检验第三种异常模式。结果:将这三种检验应用于RTS报告的各种数据集导致反复拒绝假设(通常在p水平远低于0.001),即这些数据中的异常模式可能是偶然发生的。类似的应用也适用于其他研究者的数据集,完全符合偶然发生的情况。结论:该分析强调了获取原始数据的重要性,这些数据构成了出版物、报告和拨款申请的基础,以便评估结论的正确性,以及应用统计方法检测异常,特别是可能捏造的数值结果的重要性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
审稿时长
1 weeks
期刊最新文献
A review: CRISPR/Cas12-mediated genome editing in fungal cells: advancements, mechanisms, and future directions in plant-fungal pathology Psychosocial risks in the working environment – approaches to formative risk assessment Technological, legal, and sociological summary of biometric technology usage Policy learning from influenza and the preparedness of the public health sector: 2006/2007 influenza season in Latvia Mpemba Effect- the Effect of Time
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1