{"title":"What Can Be Concluded from Statistical Significance? Severe Testing as an Appealing Extension to Our Standard Toolkit","authors":"Christopher Milde","doi":"10.2139/ssrn.3413808","DOIUrl":null,"url":null,"abstract":"Assessments of statistical significance are ubiquitous in damage quantification practice. Little, however, can be concluded from them on the magnitude of the true effect: statistical significance (against zero) allows the conclusion that the true effect is not zero, but nothing else; and lack of statistical significance does not allow the conclusion that the true effect is zero. Thus, what can be learned? In this note I describe an extension to significance testing, SEVERE TESTING, which does allow valid conclusions on effect sizes after significance testing. It does so on an epistemically appealing, yet technically familiar (p-value) basis. It also makes a difference: loosely speaking, severe testing shifts the evidential weight from the centre of the confidence interval, as often assumed in prevailing practice, to its lower or upper edges.","PeriodicalId":425229,"journal":{"name":"ERN: Hypothesis Testing (Topic)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-07-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"ERN: Hypothesis Testing (Topic)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.2139/ssrn.3413808","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Assessments of statistical significance are ubiquitous in damage quantification practice. Little, however, can be concluded from them on the magnitude of the true effect: statistical significance (against zero) allows the conclusion that the true effect is not zero, but nothing else; and lack of statistical significance does not allow the conclusion that the true effect is zero. Thus, what can be learned? In this note I describe an extension to significance testing, SEVERE TESTING, which does allow valid conclusions on effect sizes after significance testing. It does so on an epistemically appealing, yet technically familiar (p-value) basis. It also makes a difference: loosely speaking, severe testing shifts the evidential weight from the centre of the confidence interval, as often assumed in prevailing practice, to its lower or upper edges.