最优混合整数线性优化训练的多元分类树

arXiv - CS - Discrete Mathematics Pub Date : 2024-08-02 DOI:arxiv-2408.01297

Brandon Alston, Illya V. Hicks

{"title":"最优混合整数线性优化训练的多元分类树","authors":"Brandon Alston, Illya V. Hicks","doi":"arxiv-2408.01297","DOIUrl":null,"url":null,"abstract":"Multivariate decision trees are powerful machine learning tools for\nclassification and regression that attract many researchers and industry\nprofessionals. An optimal binary tree has two types of vertices, (i) branching\nvertices which have exactly two children and where datapoints are assessed on a\nset of discrete features and (ii) leaf vertices at which datapoints are given a\nprediction, and can be obtained by solving a biobjective optimization problem\nthat seeks to (i) maximize the number of correctly classified datapoints and\n(ii) minimize the number of branching vertices. Branching vertices are linear\ncombinations of training features and therefore can be thought of as\nhyperplanes. In this paper, we propose two cut-based mixed integer linear\noptimization (MILO) formulations for designing optimal binary classification\ntrees (leaf vertices assign discrete classes). Our models leverage on-the-fly\nidentification of minimal infeasible subsystems (MISs) from which we derive\ncutting planes that hold the form of packing constraints. We show theoretical\nimprovements on the strongest flow-based MILO formulation currently in the\nliterature and conduct experiments on publicly available datasets to show our\nmodels' ability to scale, strength against traditional branch and bound\napproaches, and robustness in out-of-sample test performance. Our code and data\nare available on GitHub.","PeriodicalId":501216,"journal":{"name":"arXiv - CS - Discrete Mathematics","volume":"47 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-08-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Optimal Mixed Integer Linear Optimization Trained Multivariate Classification Trees\",\"authors\":\"Brandon Alston, Illya V. Hicks\",\"doi\":\"arxiv-2408.01297\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Multivariate decision trees are powerful machine learning tools for\\nclassification and regression that attract many researchers and industry\\nprofessionals. An optimal binary tree has two types of vertices, (i) branching\\nvertices which have exactly two children and where datapoints are assessed on a\\nset of discrete features and (ii) leaf vertices at which datapoints are given a\\nprediction, and can be obtained by solving a biobjective optimization problem\\nthat seeks to (i) maximize the number of correctly classified datapoints and\\n(ii) minimize the number of branching vertices. Branching vertices are linear\\ncombinations of training features and therefore can be thought of as\\nhyperplanes. In this paper, we propose two cut-based mixed integer linear\\noptimization (MILO) formulations for designing optimal binary classification\\ntrees (leaf vertices assign discrete classes). Our models leverage on-the-fly\\nidentification of minimal infeasible subsystems (MISs) from which we derive\\ncutting planes that hold the form of packing constraints. We show theoretical\\nimprovements on the strongest flow-based MILO formulation currently in the\\nliterature and conduct experiments on publicly available datasets to show our\\nmodels' ability to scale, strength against traditional branch and bound\\napproaches, and robustness in out-of-sample test performance. Our code and data\\nare available on GitHub.\",\"PeriodicalId\":501216,\"journal\":{\"name\":\"arXiv - CS - Discrete Mathematics\",\"volume\":\"47 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-08-02\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"arXiv - CS - Discrete Mathematics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/arxiv-2408.01297\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Discrete Mathematics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2408.01297","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

多变量决策树是用于分类和回归的强大机器学习工具，吸引了众多研究人员和行业专家。一棵最优二叉树有两种顶点类型：(i) 分支顶点，这些顶点正好有两个子顶点，数据点会根据离散特征集进行评估；(ii) 叶子顶点，数据点会在叶子顶点得到预测，可以通过求解生物目标优化问题来获得，该问题的目的是：(i) 使正确分类的数据点数量最大化；(ii) 使分支顶点数量最小化。分支顶点是训练特征的线性组合，因此可以看作是超平面。在本文中，我们提出了两种基于切分的混合整数线性优化（MILO）公式，用于设计最优二元分类树（叶顶点分配离散类别）。我们的模型利用了对最小不可行子系统（MIS）的即时识别，并由此推导出了具有打包约束形式的切割平面。我们展示了对目前文献中最强的基于流的 MILO 表述的理论改进，并在公开可用的数据集上进行了实验，以展示我们的模型的扩展能力、与传统分支和边界方法相比的优势以及样本外测试性能的稳健性。我们的代码和数据可在 GitHub 上获取。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Optimal Mixed Integer Linear Optimization Trained Multivariate Classification Trees

Multivariate decision trees are powerful machine learning tools for classification and regression that attract many researchers and industry professionals. An optimal binary tree has two types of vertices, (i) branching vertices which have exactly two children and where datapoints are assessed on a set of discrete features and (ii) leaf vertices at which datapoints are given a prediction, and can be obtained by solving a biobjective optimization problem that seeks to (i) maximize the number of correctly classified datapoints and (ii) minimize the number of branching vertices. Branching vertices are linear combinations of training features and therefore can be thought of as hyperplanes. In this paper, we propose two cut-based mixed integer linear optimization (MILO) formulations for designing optimal binary classification trees (leaf vertices assign discrete classes). Our models leverage on-the-fly identification of minimal infeasible subsystems (MISs) from which we derive cutting planes that hold the form of packing constraints. We show theoretical improvements on the strongest flow-based MILO formulation currently in the literature and conduct experiments on publicly available datasets to show our models' ability to scale, strength against traditional branch and bound approaches, and robustness in out-of-sample test performance. Our code and data are available on GitHub.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助