{"title":"从统计显著性可以得出什么结论?严格测试作为我们标准工具包的一个有吸引力的扩展","authors":"Christopher Milde","doi":"10.2139/ssrn.3413808","DOIUrl":null,"url":null,"abstract":"Assessments of statistical significance are ubiquitous in damage quantification practice. Little, however, can be concluded from them on the magnitude of the true effect: statistical significance (against zero) allows the conclusion that the true effect is not zero, but nothing else; and lack of statistical significance does not allow the conclusion that the true effect is zero. Thus, what can be learned? In this note I describe an extension to significance testing, SEVERE TESTING, which does allow valid conclusions on effect sizes after significance testing. It does so on an epistemically appealing, yet technically familiar (p-value) basis. It also makes a difference: loosely speaking, severe testing shifts the evidential weight from the centre of the confidence interval, as often assumed in prevailing practice, to its lower or upper edges.","PeriodicalId":425229,"journal":{"name":"ERN: Hypothesis Testing (Topic)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-07-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"What Can Be Concluded from Statistical Significance? Severe Testing as an Appealing Extension to Our Standard Toolkit\",\"authors\":\"Christopher Milde\",\"doi\":\"10.2139/ssrn.3413808\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Assessments of statistical significance are ubiquitous in damage quantification practice. Little, however, can be concluded from them on the magnitude of the true effect: statistical significance (against zero) allows the conclusion that the true effect is not zero, but nothing else; and lack of statistical significance does not allow the conclusion that the true effect is zero. Thus, what can be learned? In this note I describe an extension to significance testing, SEVERE TESTING, which does allow valid conclusions on effect sizes after significance testing. It does so on an epistemically appealing, yet technically familiar (p-value) basis. It also makes a difference: loosely speaking, severe testing shifts the evidential weight from the centre of the confidence interval, as often assumed in prevailing practice, to its lower or upper edges.\",\"PeriodicalId\":425229,\"journal\":{\"name\":\"ERN: Hypothesis Testing (Topic)\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-07-02\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"ERN: Hypothesis Testing (Topic)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.2139/ssrn.3413808\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"ERN: Hypothesis Testing (Topic)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.2139/ssrn.3413808","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
What Can Be Concluded from Statistical Significance? Severe Testing as an Appealing Extension to Our Standard Toolkit
Assessments of statistical significance are ubiquitous in damage quantification practice. Little, however, can be concluded from them on the magnitude of the true effect: statistical significance (against zero) allows the conclusion that the true effect is not zero, but nothing else; and lack of statistical significance does not allow the conclusion that the true effect is zero. Thus, what can be learned? In this note I describe an extension to significance testing, SEVERE TESTING, which does allow valid conclusions on effect sizes after significance testing. It does so on an epistemically appealing, yet technically familiar (p-value) basis. It also makes a difference: loosely speaking, severe testing shifts the evidential weight from the centre of the confidence interval, as often assumed in prevailing practice, to its lower or upper edges.