S. Nanga, A. T. Bawah, Ben Acquaye, Mac-Issaka Billa, Francisco Baeta, N. Odai, Samuel Kwaku Obeng, Ampem Darko Nsiah
{"title":"Review of Dimension Reduction Methods","authors":"S. Nanga, A. T. Bawah, Ben Acquaye, Mac-Issaka Billa, Francisco Baeta, N. Odai, Samuel Kwaku Obeng, Ampem Darko Nsiah","doi":"10.4236/jdaip.2021.93013","DOIUrl":null,"url":null,"abstract":"Purpose: This study sought to review the characteristics, strengths, weaknesses \nvariants, applications areas and data types applied on the various Dimension Reduction techniques. Methodology: The \nmost commonly used databases employed to search for the papers were ScienceDirect, \nScopus, Google Scholar, IEEE Xplore and Mendeley. An integrative review was \nused for the study where 341 papers were reviewed. Results: The linear \ntechniques considered were Principal Component Analysis (PCA), Linear Discriminant \nAnalysis (LDA), Singular Value Decomposition (SVD), Latent Semantic Analysis \n(LSA), Locality Preserving Projections (LPP), Independent Component Analysis \n(ICA) and Project Pursuit (PP). The non-linear techniques which were developed \nto work with applications that have complex non-linear structures considered were Kernel Principal Component \nAnalysis (KPCA), Multi-dimensional \nScaling (MDS), Isomap, Locally Linear Embedding (LLE), Self-Organizing Map \n(SOM), Latent Vector Quantization (LVQ), t-Stochastic neighbor embedding (t-SNE) and Uniform Manifold Approximation and \nProjection (UMAP). DR techniques can further be categorized into supervised, \nunsupervised and more recently semi-supervised learning methods. The supervised \nversions are the LDA and LVQ. All the other techniques are unsupervised. \nSupervised variants of PCA, LPP, KPCA and MDS have been developed. \nSupervised and semi-supervised variants of PP and t-SNE have also been \ndeveloped and a semi supervised version of the LDA has been developed. Conclusion: The various application areas, strengths, weaknesses and variants of the DR \ntechniques were explored. The different data types that have been applied on \nthe various DR techniques were also explored.","PeriodicalId":71434,"journal":{"name":"数据分析和信息处理(英文)","volume":" ","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2021-07-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"19","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"数据分析和信息处理(英文)","FirstCategoryId":"1093","ListUrlMain":"https://doi.org/10.4236/jdaip.2021.93013","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 19
Abstract
Purpose: This study sought to review the characteristics, strengths, weaknesses
variants, applications areas and data types applied on the various Dimension Reduction techniques. Methodology: The
most commonly used databases employed to search for the papers were ScienceDirect,
Scopus, Google Scholar, IEEE Xplore and Mendeley. An integrative review was
used for the study where 341 papers were reviewed. Results: The linear
techniques considered were Principal Component Analysis (PCA), Linear Discriminant
Analysis (LDA), Singular Value Decomposition (SVD), Latent Semantic Analysis
(LSA), Locality Preserving Projections (LPP), Independent Component Analysis
(ICA) and Project Pursuit (PP). The non-linear techniques which were developed
to work with applications that have complex non-linear structures considered were Kernel Principal Component
Analysis (KPCA), Multi-dimensional
Scaling (MDS), Isomap, Locally Linear Embedding (LLE), Self-Organizing Map
(SOM), Latent Vector Quantization (LVQ), t-Stochastic neighbor embedding (t-SNE) and Uniform Manifold Approximation and
Projection (UMAP). DR techniques can further be categorized into supervised,
unsupervised and more recently semi-supervised learning methods. The supervised
versions are the LDA and LVQ. All the other techniques are unsupervised.
Supervised variants of PCA, LPP, KPCA and MDS have been developed.
Supervised and semi-supervised variants of PP and t-SNE have also been
developed and a semi supervised version of the LDA has been developed. Conclusion: The various application areas, strengths, weaknesses and variants of the DR
techniques were explored. The different data types that have been applied on
the various DR techniques were also explored.