From Convex to Nonconvex: A Loss Function Analysis for Binary Classification

Lei Zhao, M. Mammadov, J. Yearwood
{"title":"From Convex to Nonconvex: A Loss Function Analysis for Binary Classification","authors":"Lei Zhao, M. Mammadov, J. Yearwood","doi":"10.1109/ICDMW.2010.57","DOIUrl":null,"url":null,"abstract":"Problems of data classification can be studied in the framework of regularization theory as ill-posed problems. In this framework, loss functions play an important role in the application of regularization theory to classification. In this paper, we review some important convex loss functions, including hinge loss, square loss, modified square loss, exponential loss, logistic regression loss, as well as some non-convex loss functions, such as sigmoid loss, $\\phi$-loss, ramp loss, normalized sigmoid loss, and the loss function of 2 layer neural network. Based on the analysis of these loss functions, we propose a new differentiable non-convex loss function, called smoothed 0-1 loss function, which is a natural approximation of the 0-1 loss function. To compare the performance of different loss functions, we propose two binary classification algorithms for binary classification, one for convex loss functions, the other for non-convex loss functions. A set of experiments are launched on several binary data sets from the UCI repository. The results show that the proposed smoothed 0-1 loss function is robust, especially for those noisy data sets with many outliers.","PeriodicalId":170201,"journal":{"name":"2010 IEEE International Conference on Data Mining Workshops","volume":"21 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2010-12-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"42","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2010 IEEE International Conference on Data Mining Workshops","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICDMW.2010.57","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 42

Abstract

Problems of data classification can be studied in the framework of regularization theory as ill-posed problems. In this framework, loss functions play an important role in the application of regularization theory to classification. In this paper, we review some important convex loss functions, including the hinge loss, square loss, modified square loss, exponential loss, and logistic regression loss, as well as some non-convex loss functions, such as the sigmoid loss, $\phi$-loss, ramp loss, normalized sigmoid loss, and the loss function of a two-layer neural network. Based on the analysis of these loss functions, we propose a new differentiable non-convex loss function, called the smoothed 0-1 loss function, which is a natural approximation of the 0-1 loss function. To compare the performance of different loss functions, we propose two binary classification algorithms, one for convex loss functions and the other for non-convex loss functions. Experiments are conducted on several binary data sets from the UCI repository. The results show that the proposed smoothed 0-1 loss function is robust, especially on noisy data sets containing many outliers.
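The abstract does not give the closed form of the paper's smoothed 0-1 loss, so the sketch below is only an illustration under stated assumptions: it implements the standard margin-based forms of the hinge, square, exponential, and logistic losses, and uses a shifted sigmoid as one commonly used differentiable approximation of the 0-1 loss. The function names, the sigmoid form, and the sharpness parameter `k` are assumptions for illustration, not taken from the paper.

```python
import numpy as np

# Margin-based losses take m = y * f(x), with labels y in {-1, +1}.
# A correct, confident prediction yields a large positive margin.

def hinge_loss(m):
    """Convex: max(0, 1 - m), used by SVMs."""
    return np.maximum(0.0, 1.0 - m)

def square_loss(m):
    """Convex: (1 - m)^2; also penalizes overly large positive margins."""
    return (1.0 - m) ** 2

def exponential_loss(m):
    """Convex: exp(-m), used by AdaBoost; grows fast on outliers."""
    return np.exp(-m)

def logistic_loss(m):
    """Convex: log(1 + exp(-m)), used by logistic regression."""
    return np.log1p(np.exp(-m))

def zero_one_loss(m):
    """Non-convex, non-differentiable: 1 if misclassified, else 0."""
    return (m <= 0).astype(float)

def smoothed_zero_one_loss(m, k=5.0):
    """Non-convex but differentiable sigmoid approximation of the 0-1
    loss; k controls how sharply it approaches the step function.
    NOTE: an assumed, commonly used form -- the paper's exact
    formulation may differ."""
    return 1.0 / (1.0 + np.exp(k * m))

# Example: losses at a few margins, including a badly misclassified
# outlier at m = -4. The convex losses blow up there; the smoothed
# 0-1 loss saturates near 1.
margins = np.array([-4.0, -0.5, 0.5, 2.0])
for name, fn in [("hinge", hinge_loss), ("square", square_loss),
                 ("exponential", exponential_loss),
                 ("logistic", logistic_loss),
                 ("0-1", zero_one_loss),
                 ("smoothed 0-1", smoothed_zero_one_loss)]:
    print(f"{name:>12}: {np.round(fn(margins), 3)}")
```

Because the smoothed 0-1 loss is bounded above by 1, a single badly misclassified outlier contributes at most 1 to the empirical risk, whereas, for example, the exponential loss at a margin of -4 contributes e^4 ≈ 54.6. This boundedness is the intuition behind the robustness claim in the abstract.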