An Efficient and Versatile Variational Method for High-Dimensional Data Classification

IF 3.3 2区 数学 Q1 MATHEMATICS, APPLIED Journal of Scientific Computing Pub Date : 2024-08-01 DOI:10.1007/s10915-024-02644-9
Xiaohao Cai, Raymond H. Chan, Xiaoyu Xie, Tieyong Zeng
{"title":"An Efficient and Versatile Variational Method for High-Dimensional Data Classification","authors":"Xiaohao Cai, Raymond H. Chan, Xiaoyu Xie, Tieyong Zeng","doi":"10.1007/s10915-024-02644-9","DOIUrl":null,"url":null,"abstract":"<p>High-dimensional data classification is a fundamental task in machine learning and imaging science. In this paper, we propose an efficient and versatile multi-class semi-supervised classification method for classifying high-dimensional data and unstructured point clouds. To begin with, a warm initialization is generated by using a fuzzy classification method such as the standard support vector machine or random labeling. Then an unconstraint convex variational model is proposed to purify and smooth the initialization, followed by a step which is to project the smoothed partition obtained previously to a binary partition. These steps can be repeated, with the latest result as a new initialization, to keep improving the classification quality. We show that the convex model of the smoothing step has a unique solution and can be solved by a specifically designed primal–dual algorithm whose convergence is guaranteed. We test our method and compare it with the state-of-the-art methods on several benchmark data sets. Thorough experimental results demonstrate that our method is superior in both the classification accuracy and computation speed for high-dimensional data and point clouds.</p>","PeriodicalId":50055,"journal":{"name":"Journal of Scientific Computing","volume":"57 1","pages":""},"PeriodicalIF":3.3000,"publicationDate":"2024-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Scientific Computing","FirstCategoryId":"100","ListUrlMain":"https://doi.org/10.1007/s10915-024-02644-9","RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MATHEMATICS, APPLIED","Score":null,"Total":0}
引用次数: 0

Abstract

High-dimensional data classification is a fundamental task in machine learning and imaging science. In this paper, we propose an efficient and versatile multi-class semi-supervised classification method for classifying high-dimensional data and unstructured point clouds. To begin with, a warm initialization is generated by using a fuzzy classification method such as the standard support vector machine or random labeling. Then an unconstraint convex variational model is proposed to purify and smooth the initialization, followed by a step which is to project the smoothed partition obtained previously to a binary partition. These steps can be repeated, with the latest result as a new initialization, to keep improving the classification quality. We show that the convex model of the smoothing step has a unique solution and can be solved by a specifically designed primal–dual algorithm whose convergence is guaranteed. We test our method and compare it with the state-of-the-art methods on several benchmark data sets. Thorough experimental results demonstrate that our method is superior in both the classification accuracy and computation speed for high-dimensional data and point clouds.

Abstract Image

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
用于高维数据分类的高效多变方法
高维数据分类是机器学习和成像科学中的一项基本任务。在本文中,我们提出了一种高效、通用的多类半监督分类方法,用于对高维数据和非结构化点云进行分类。首先,使用标准支持向量机或随机标记等模糊分类方法生成一个温暖的初始化。然后,提出一个无约束凸变模型来净化和平滑初始化,接下来的步骤是将之前获得的平滑分区投影到二进制分区。这些步骤可以重复进行,并将最新结果作为新的初始化,以不断提高分类质量。我们证明,平滑步骤的凸模型有一个唯一的解,可以用专门设计的初等-二元算法来解决,其收敛性是有保证的。我们在多个基准数据集上测试了我们的方法,并将其与最先进的方法进行了比较。详尽的实验结果表明,对于高维数据和点云,我们的方法在分类精度和计算速度上都更胜一筹。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Journal of Scientific Computing
Journal of Scientific Computing 数学-应用数学
CiteScore
4.00
自引率
12.00%
发文量
302
审稿时长
4-8 weeks
期刊介绍: Journal of Scientific Computing is an international interdisciplinary forum for the publication of papers on state-of-the-art developments in scientific computing and its applications in science and engineering. The journal publishes high-quality, peer-reviewed original papers, review papers and short communications on scientific computing.
期刊最新文献
Stochastic Conformal Integrators for Linearly Damped Stochastic Poisson Systems. Inf-sup stable space-time Local Discontinuous Galerkin method for the heat equation. Fast Numerical Solvers for Parameter Identification Problems in Mathematical Biology. Automatic Differentiation is Essential in Training Neural Networks for Solving Differential Equations. Homotopy Relaxation Training Algorithms for Infinite-Width Two-Layer ReLU Neural Networks.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1