Contrastive Learning Dimensionality Reduction Method Based on Manifold Learning

Jinghao Situ
{"title":"Contrastive Learning Dimensionality Reduction Method Based on Manifold Learning","authors":"Jinghao Situ","doi":"10.56028/aetr.9.1.522.2024","DOIUrl":null,"url":null,"abstract":" With the development of the times, more and more high-dimensional datasets come into people's view. In order to reduce the time complexity and space complexity of downstream tasks, data dimensionality reduction becomes the primary choice. Classical dimensionality reduction algorithms are mainly divided into linear dimensionality reduction algorithms and nonlinear dimensionality reduction algorithms. Some of the traditional dimensionality reduction methods have the problems of not considering the nonlinear structure of the original dataset and the existence of weak generalisation, which makes the dimensionality reduction effect not good or model need to be recalculated because of the addition of new samples. In order to solve these problems, the research in this paper is a comparative learning dimensionality reduction method based on manifold learning. The idea of manifold learning using geodesic distance can fully consider the nonlinear structure of the original dataset. In this paper, comparative learning is the main framework. When the neural network completes the training, it only need to take the new data as input to calculate, the result can be obtained, no need to reconstruct the model which means the generality is high. Starting from the related work, this paper briefly introduces manifold learning, comparative learning and neural network algorithms. Subsequently, an innovative model is proposed, including three modules, Isomap to extract nonlinear structure, expanding neighbourhood to make pseudo-labels, and comparative learning training. Detailed analyses are carried out through experiments, comparing with PCA and LLE algorithms with the geodetic distance retention rate as an indicator, which proves that the data dimensionality reduction method of this model is more effective and ubiquitous.","PeriodicalId":355471,"journal":{"name":"Advances in Engineering Technology Research","volume":"24 5","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-01-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Advances in Engineering Technology Research","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.56028/aetr.9.1.522.2024","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

 With the development of the times, more and more high-dimensional datasets come into people's view. In order to reduce the time complexity and space complexity of downstream tasks, data dimensionality reduction becomes the primary choice. Classical dimensionality reduction algorithms are mainly divided into linear dimensionality reduction algorithms and nonlinear dimensionality reduction algorithms. Some of the traditional dimensionality reduction methods have the problems of not considering the nonlinear structure of the original dataset and the existence of weak generalisation, which makes the dimensionality reduction effect not good or model need to be recalculated because of the addition of new samples. In order to solve these problems, the research in this paper is a comparative learning dimensionality reduction method based on manifold learning. The idea of manifold learning using geodesic distance can fully consider the nonlinear structure of the original dataset. In this paper, comparative learning is the main framework. When the neural network completes the training, it only need to take the new data as input to calculate, the result can be obtained, no need to reconstruct the model which means the generality is high. Starting from the related work, this paper briefly introduces manifold learning, comparative learning and neural network algorithms. Subsequently, an innovative model is proposed, including three modules, Isomap to extract nonlinear structure, expanding neighbourhood to make pseudo-labels, and comparative learning training. Detailed analyses are carried out through experiments, comparing with PCA and LLE algorithms with the geodetic distance retention rate as an indicator, which proves that the data dimensionality reduction method of this model is more effective and ubiquitous.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于 Manifold Learning 的对比学习降维方法
随着时代的发展,越来越多的高维数据集进入人们的视野。为了降低下游任务的时间复杂度和空间复杂度,数据降维成为首要选择。经典的降维算法主要分为线性降维算法和非线性降维算法。一些传统的降维方法存在不考虑原始数据集的非线性结构、存在弱泛化等问题,使得降维效果不佳,或因增加新样本而需要重新计算模型。为了解决这些问题,本文研究了一种基于流形学习的比较学习降维方法。流形学习利用大地距离的思想可以充分考虑原始数据集的非线性结构。本文以比较学习为主要框架。当神经网络完成训练后,只需将新数据作为输入进行计算,即可得到结果,无需重构模型,通用性高。本文从相关工作出发,简要介绍了流形学习、比较学习和神经网络算法。随后,本文提出了一个创新模型,包括三个模块:提取非线性结构的 Isomap 模块、扩展邻域制作伪标签模块和比较学习训练模块。通过实验进行了详细分析,以大地测量距离保留率为指标,与 PCA 算法和 LLE 算法进行了比较,证明该模型的数据降维方法更有效、更普遍。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Research on Response Characteristics of GPR Method for Detecting Phreatic Lines in Embankments Research on the composition of glass relics based on CART model Research on component content model of ancient glass products based on statistical analysis The Significance of Big Data to the Design and Transformation of Rural Art Space Coupled Vibration Analysis of a Beam-Arch Composite Continuous Rigid Structure with Parallel Traffic Flow
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1