On the optimism correction of the area under the receiver operating characteristic curve in logistic prediction models

IF 0.7 · Region 4 (Mathematics) · Q4 OPERATIONS RESEARCH & MANAGEMENT SCIENCE · Sort-Statistics and Operations Research Transactions · Pub Date: 2019-06-11 · DOI: 10.2436/20.8080.02.82
Amaia Iparragirre, Irantzu Barrio, M. Rodríguez-Álvarez
Citations: 2

Abstract

When the same data are used both to fit a model and to estimate its predictive performance, the estimate may be optimistic and needs to be corrected. The aim of this work is to compare the behaviour of different methods proposed in the literature for correcting the optimism of the estimated area under the receiver operating characteristic curve (AUC) in logistic regression models. A simulation study (where the theoretical model is known) is conducted considering different numbers of covariates, sample sizes, prevalences and correlations among covariates. The results suggest the use of k-fold cross-validation with replication and of the bootstrap.
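To make the two recommended approaches concrete, the following is a minimal sketch of an optimism-corrected AUC, not the authors' implementation. It assumes a Harrell-style bootstrap correction (apparent AUC minus the average optimism over bootstrap refits) and a repeated k-fold scheme that pools out-of-fold predictions within each replication; the paper may define either differently. All function names, the gradient-ascent logistic fit, and the simulated data (one informative covariate plus noise covariates) are hypothetical and chosen only so that the example runs with the Python standard library.

```python
import math
import random

def sigmoid(z):
    z = max(-30.0, min(30.0, z))  # clip to avoid overflow in exp
    return 1.0 / (1.0 + math.exp(-z))

def predict(w, X):
    # w[0] is the intercept, w[1:] are the slopes
    return [sigmoid(w[0] + sum(wj * xj for wj, xj in zip(w[1:], xi))) for xi in X]

def fit_logistic(X, y, iters=100, lr=0.5):
    # Plain gradient ascent on the log-likelihood (an approximate fit, for illustration)
    p, n = len(X[0]), len(X)
    w = [0.0] * (p + 1)
    for _ in range(iters):
        resid = [yi - pi for yi, pi in zip(y, predict(w, X))]
        grad = [sum(resid)] + [sum(r * xi[j] for r, xi in zip(resid, X)) for j in range(p)]
        w = [wj + lr * g / n for wj, g in zip(w, grad)]
    return w

def auc(scores, y):
    # Mann-Whitney form: P(score of a case > score of a control), ties count 1/2
    pos = [s for s, yi in zip(scores, y) if yi == 1]
    neg = [s for s, yi in zip(scores, y) if yi == 0]
    wins = sum(1.0 if a > b else 0.5 if a == b else 0.0 for a in pos for b in neg)
    return wins / (len(pos) * len(neg))

def bootstrap_corrected_auc(X, y, B=20, rng=None):
    # Assumed scheme: apparent AUC minus mean (bootstrap AUC - AUC on original data)
    rng = rng or random.Random(0)
    n = len(y)
    apparent = auc(predict(fit_logistic(X, y), X), y)
    optimism = 0.0
    for _ in range(B):
        while True:  # redraw if the resample contains a single class
            idx = [rng.randrange(n) for _ in range(n)]
            yb = [y[i] for i in idx]
            if 0 < sum(yb) < n:
                break
        Xb = [X[i] for i in idx]
        wb = fit_logistic(Xb, yb)
        optimism += auc(predict(wb, Xb), yb) - auc(predict(wb, X), y)
    return apparent - optimism / B

def repeated_kfold_auc(X, y, k=5, reps=2, rng=None):
    # Assumed scheme: pool out-of-fold predictions per replication, then average
    rng = rng or random.Random(0)
    n = len(y)
    total = 0.0
    for _ in range(reps):
        idx = list(range(n))
        rng.shuffle(idx)
        oof = [0.0] * n
        for f in range(k):
            test = idx[f::k]
            held = set(test)
            train = [i for i in idx if i not in held]
            w = fit_logistic([X[i] for i in train], [y[i] for i in train])
            for i, s in zip(test, predict(w, [X[i] for i in test])):
                oof[i] = s
        total += auc(oof, y)
    return total / reps

def simulate(n, p, rng):
    # Hypothetical data: only the first of p standard-normal covariates is informative
    X = [[rng.gauss(0, 1) for _ in range(p)] for _ in range(n)]
    y = [1 if rng.random() < sigmoid(0.8 * xi[0]) else 0 for xi in X]
    return X, y

if __name__ == "__main__":
    X, y = simulate(80, 4, random.Random(42))
    print("apparent AUC:       ", auc(predict(fit_logistic(X, y), X), y))
    print("bootstrap-corrected:", bootstrap_corrected_auc(X, y))
    print("repeated 5-fold CV: ", repeated_kfold_auc(X, y))
```

Because the AUC depends only on the ranking of the predicted probabilities, the roughly converged gradient-ascent fit is adequate for illustration; in practice a standard logistic regression routine would replace `fit_logistic`.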
Source journal
Sort-Statistics and Operations Research Transactions (Management Science - Statistics & Probability)
CiteScore: 3.10
Self-citation rate: 0.00%
Articles published: 0
Review time: >12 weeks
Journal description: SORT (Statistics and Operations Research Transactions), formerly Qüestiió, is an international journal launched in 2003. It is published twice-yearly, in English, by the Statistical Institute of Catalonia (Idescat). The journal is co-edited by the Universitat Politècnica de Catalunya, Universitat de Barcelona, Universitat Autònoma de Barcelona, Universitat de Girona, Universitat Pompeu Fabra and Universitat de Lleida, with the co-operation of the Spanish Section of the International Biometric Society and the Catalan Statistical Society. SORT promotes the publication of original articles of a methodological or applied nature, or motivated by an applied problem, in statistics, operations research, official statistics or biometrics, as well as book reviews. We encourage authors to include an example of a real data set in their manuscripts.