Multi-Class Classification using Mixtures of Univariate and Multivariate ROC Curves

Siva Gajjalavari, V. Rudravaram
{"title":"Multi-Class Classification using Mixtures of Univariate and Multivariate ROC Curves","authors":"Siva Gajjalavari, V. Rudravaram","doi":"10.18502/jbe.v8i2.10418","DOIUrl":null,"url":null,"abstract":"Introduction: Receiver Operating Characteristic (ROC) curve is one of the widely used supervised classification technique to allocate/classify the individuals and also instrumental in comparing diagnostic tests. Generally, to deal with classification problems we need to have knowledge on class labels. In most of the medical scenarios, most of data sets exhibit multi-model patterns in class labels which leads to multi-class classification problems. The main aim of this study is to address on the issue of constructing ROC models when there exists multimodel patterns in the class labels further, to classify the individuals for better diagnosis and also to reduce the complexity of graphical representation of ROC curves in such classification problems. \nMethods: A new version of univariate and multivariate ROC models are proposed in the framework of Finite Mixtures, due to the flexibility of identifying and modelling the subcomponents in the heterogeneous populations. \nResults: Oral Glucose Tolerance Test and Disk Hernia datasets are used and simulation studies are also performed. Results show that the proposed models possess better accuracy when compared with Bi-Normal and MROC models with reasonable low 1-Specificity and higher Sensitivity. The ROC curves are depicted in a 2D space rather than higher dimension for multi-class classification problem. \nConclusion: It is suggested that before one proceeds to model ROC curves, it is better to take a look at the density patterns of the study variable(s), which in turn help in explaining the true information between the classes and also provides good amount of “true” accuracy.","PeriodicalId":34310,"journal":{"name":"Journal of Biostatistics and Epidemiology","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2022-08-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Biostatistics and Epidemiology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.18502/jbe.v8i2.10418","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"Medicine","Score":null,"Total":0}
引用次数: 0

Abstract

Introduction: Receiver Operating Characteristic (ROC) curve is one of the widely used supervised classification technique to allocate/classify the individuals and also instrumental in comparing diagnostic tests. Generally, to deal with classification problems we need to have knowledge on class labels. In most of the medical scenarios, most of data sets exhibit multi-model patterns in class labels which leads to multi-class classification problems. The main aim of this study is to address on the issue of constructing ROC models when there exists multimodel patterns in the class labels further, to classify the individuals for better diagnosis and also to reduce the complexity of graphical representation of ROC curves in such classification problems. Methods: A new version of univariate and multivariate ROC models are proposed in the framework of Finite Mixtures, due to the flexibility of identifying and modelling the subcomponents in the heterogeneous populations. Results: Oral Glucose Tolerance Test and Disk Hernia datasets are used and simulation studies are also performed. Results show that the proposed models possess better accuracy when compared with Bi-Normal and MROC models with reasonable low 1-Specificity and higher Sensitivity. The ROC curves are depicted in a 2D space rather than higher dimension for multi-class classification problem. Conclusion: It is suggested that before one proceeds to model ROC curves, it is better to take a look at the density patterns of the study variable(s), which in turn help in explaining the true information between the classes and also provides good amount of “true” accuracy.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
使用单变量和多变量ROC曲线的混合的多类分类
引言:受试者操作特征(ROC)曲线是一种广泛使用的监督分类技术,用于分配/分类个体,也有助于比较诊断测试。一般来说,为了处理分类问题,我们需要了解类标签。在大多数医疗场景中,大多数数据集在类标签中表现出多模型模式,这导致了多类分类问题。本研究的主要目的是进一步解决当类标签中存在多模型模式时构建ROC模型的问题,对个体进行分类以进行更好的诊断,并降低此类分类问题中ROC曲线图形表示的复杂性。方法:由于在异质群体中识别和建模子成分的灵活性,在有限混合的框架下提出了新版本的单变量和多变量ROC模型。结果:使用了口服葡萄糖耐量试验和椎间盘突出症数据集,并进行了模拟研究。结果表明,与双正态和MROC模型相比,所提出的模型具有更好的精度,具有合理的低1-特异性和更高的灵敏度。对于多类分类问题,ROC曲线被描述在2D空间中,而不是更高维度。结论:建议在对ROC曲线进行建模之前,最好先看看研究变量的密度模式,这反过来有助于解释类之间的真实信息,并提供良好的“真实”准确性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
CiteScore
0.80
自引率
0.00%
发文量
26
审稿时长
12 weeks
期刊最新文献
Analysis of Copula Frailty defective models in presence of Cure Fraction The Pattern of Motorcyclists' Death Due to Accidents and a Three-year Forecast in East Azerbaijan Province, Iran: A Time Series Study Factors Affecting Loneliness in Older Adults: Evidence from Ardakan Cohort Study on Aging (ACSA) Understanding Knowledge and Behaviors Related To the Covid-19 Epidemic in Medical Students in Morocco Survival Prognostic Factors of Male Breast Cancer Using Appropriate Survival Analysis for Small Sample Size: Three Center Experience
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1