Precision-Recall-Optimization in Learning Vector Quantization Classifiers for Improved Medical Classification Systems

2014 IEEE Symposium on Computational Intelligence and Data Mining (CIDM) Pub Date : 2014-12-01 DOI:10.1109/CIDM.2014.7008150

T. Villmann, M. Kaden, M. Lange, P. Sturmer, W. Hermann

{"title":"Precision-Recall-Optimization in Learning Vector Quantization Classifiers for Improved Medical Classification Systems","authors":"T. Villmann, M. Kaden, M. Lange, P. Sturmer, W. Hermann","doi":"10.1109/CIDM.2014.7008150","DOIUrl":null,"url":null,"abstract":"Classification and decision systems in data analysis are mostly based on accuracy optimization. This criterion is only a conditional informative value if the data are imbalanced or false positive/negative decisions cause different costs. Therefore more sophisticated statistical quality measures are favored in medicine, like precision, recall etc.. Otherwise, most classification approaches in machine learning are designed for accuracy optimization. In this paper we consider variants of learning vector quantizers (LVQs) explicitly optimizing those advanced statistical quality measures while keeping the basic intuitive ingredients of these classifiers, which are the prototype based principle and the Hebbian learning. In particular we focus in this contribution particularly to precision and recall as important measures for use in medical applications. We investigate these problems in terms of precision-recall curves as well as receiver-operating characteristic (ROC) curves well-known in statistical classification and test analysis. With the underlying more general framework, we provide a principled alternatives traditional classifiers, such that a closer connection to statistical classification analysis can be drawn.","PeriodicalId":117542,"journal":{"name":"2014 IEEE Symposium on Computational Intelligence and Data Mining (CIDM)","volume":"54 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"14","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2014 IEEE Symposium on Computational Intelligence and Data Mining (CIDM)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CIDM.2014.7008150","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 14

Abstract

Classification and decision systems in data analysis are mostly based on accuracy optimization. This criterion is only a conditional informative value if the data are imbalanced or false positive/negative decisions cause different costs. Therefore more sophisticated statistical quality measures are favored in medicine, like precision, recall etc.. Otherwise, most classification approaches in machine learning are designed for accuracy optimization. In this paper we consider variants of learning vector quantizers (LVQs) explicitly optimizing those advanced statistical quality measures while keeping the basic intuitive ingredients of these classifiers, which are the prototype based principle and the Hebbian learning. In particular we focus in this contribution particularly to precision and recall as important measures for use in medical applications. We investigate these problems in terms of precision-recall curves as well as receiver-operating characteristic (ROC) curves well-known in statistical classification and test analysis. With the underlying more general framework, we provide a principled alternatives traditional classifiers, such that a closer connection to statistical classification analysis can be drawn.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

改进医学分类系统中学习向量量化分类器的准确率-召回率优化

数据分析中的分类和决策系统大多基于精度优化。如果数据不平衡或假阳性/阴性决策导致不同的成本，则该标准仅是有条件的信息值。因此，更复杂的统计质量指标在医学上更受青睐，比如精确度、召回率等。否则，机器学习中的大多数分类方法都是为了精度优化而设计的。在本文中，我们考虑了学习向量量化器(LVQs)的变体，在保持这些分类器的基本直观成分，即基于原型的原理和Hebbian学习的同时，显式地优化了这些高级统计质量度量。我们特别关注这方面的贡献，特别是作为医疗应用中使用的重要措施的准确性和召回率。我们根据统计分类和检验分析中众所周知的准确率-召回率曲线以及接收者-工作特征(ROC)曲线来研究这些问题。有了基础的更通用的框架，我们提供了一个有原则的替代传统分类器，这样就可以与统计分类分析建立更紧密的联系。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

2014 IEEE Symposium on Computational Intelligence and Data Mining (CIDM)

自引率

0.00%

发文量