Data should be made as simple as possible but not simpler: The method chosen for dimensionality reduction and its parameters can affect the clustering of runners based on their kinematics.
Adrian R Rivadulla, Xi Chen, Dario Cazzola, Grant Trewartha, Ezio Preatoni
{"title":"Data should be made as simple as possible but not simpler: The method chosen for dimensionality reduction and its parameters can affect the clustering of runners based on their kinematics.","authors":"Adrian R Rivadulla, Xi Chen, Dario Cazzola, Grant Trewartha, Ezio Preatoni","doi":"10.1016/j.jbiomech.2024.112433","DOIUrl":null,"url":null,"abstract":"<p><p>Dimensionality reduction is a critical step for the efficacy and efficiency of clustering analysis. Despite the multiple available methods, biomechanists have often defaulted to Principal Component Analysis (PCA). We evaluated two PCA- and one autoencoder-based dimensionality reduction methods for their data compression and reconstruction capability, assessed their effect on the output of clustering runners' based on kinematics, and discussed their implications for the biomechanical assessment of running technique. Eighty-four participants completed a 4-minute run at 12 km/h while trunk and lower-limb kinematics were collected. Data reconstruction quality was assessed for Direct PCA (PCA directly on original variables) and Fourier PCA (modelling time series as Fourier series and then applying PCA) using popular variance explained criteria; and a feedforward autoencoder (AE). Agglomerative hierarchical clustering was then applied and the agreement between the resulting partitions was assessed. Meaningful errors in the reconstructed signals were found when applying popular variance explained criteria, suggesting reconstruction error should be assessed to make a more informed decision about how many components to retain for further analysis. Direct PCA, Fourier PCA and AE yielded different clusters, warranting caution when comparing outcomes from studies that use different dimensionality reduction techniques: each method may be sensitive to different data features. Direct PCA retaining 99 % of the original variance emerged as the best compromise of data compression, reconstruction quality and cluster separability in our dataset. We encourage biomechanists to experiment with diverse dimensionality reduction methods to optimise clustering outcomes and enhance the real-world applicability of their findings.</p>","PeriodicalId":15168,"journal":{"name":"Journal of biomechanics","volume":"177 ","pages":"112433"},"PeriodicalIF":2.4000,"publicationDate":"2024-11-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of biomechanics","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.1016/j.jbiomech.2024.112433","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"BIOPHYSICS","Score":null,"Total":0}
引用次数: 0
Abstract
Dimensionality reduction is a critical step for the efficacy and efficiency of clustering analysis. Despite the multiple available methods, biomechanists have often defaulted to Principal Component Analysis (PCA). We evaluated two PCA- and one autoencoder-based dimensionality reduction methods for their data compression and reconstruction capability, assessed their effect on the output of clustering runners' based on kinematics, and discussed their implications for the biomechanical assessment of running technique. Eighty-four participants completed a 4-minute run at 12 km/h while trunk and lower-limb kinematics were collected. Data reconstruction quality was assessed for Direct PCA (PCA directly on original variables) and Fourier PCA (modelling time series as Fourier series and then applying PCA) using popular variance explained criteria; and a feedforward autoencoder (AE). Agglomerative hierarchical clustering was then applied and the agreement between the resulting partitions was assessed. Meaningful errors in the reconstructed signals were found when applying popular variance explained criteria, suggesting reconstruction error should be assessed to make a more informed decision about how many components to retain for further analysis. Direct PCA, Fourier PCA and AE yielded different clusters, warranting caution when comparing outcomes from studies that use different dimensionality reduction techniques: each method may be sensitive to different data features. Direct PCA retaining 99 % of the original variance emerged as the best compromise of data compression, reconstruction quality and cluster separability in our dataset. We encourage biomechanists to experiment with diverse dimensionality reduction methods to optimise clustering outcomes and enhance the real-world applicability of their findings.
期刊介绍:
The Journal of Biomechanics publishes reports of original and substantial findings using the principles of mechanics to explore biological problems. Analytical, as well as experimental papers may be submitted, and the journal accepts original articles, surveys and perspective articles (usually by Editorial invitation only), book reviews and letters to the Editor. The criteria for acceptance of manuscripts include excellence, novelty, significance, clarity, conciseness and interest to the readership.
Papers published in the journal may cover a wide range of topics in biomechanics, including, but not limited to:
-Fundamental Topics - Biomechanics of the musculoskeletal, cardiovascular, and respiratory systems, mechanics of hard and soft tissues, biofluid mechanics, mechanics of prostheses and implant-tissue interfaces, mechanics of cells.
-Cardiovascular and Respiratory Biomechanics - Mechanics of blood-flow, air-flow, mechanics of the soft tissues, flow-tissue or flow-prosthesis interactions.
-Cell Biomechanics - Biomechanic analyses of cells, membranes and sub-cellular structures; the relationship of the mechanical environment to cell and tissue response.
-Dental Biomechanics - Design and analysis of dental tissues and prostheses, mechanics of chewing.
-Functional Tissue Engineering - The role of biomechanical factors in engineered tissue replacements and regenerative medicine.
-Injury Biomechanics - Mechanics of impact and trauma, dynamics of man-machine interaction.
-Molecular Biomechanics - Mechanical analyses of biomolecules.
-Orthopedic Biomechanics - Mechanics of fracture and fracture fixation, mechanics of implants and implant fixation, mechanics of bones and joints, wear of natural and artificial joints.
-Rehabilitation Biomechanics - Analyses of gait, mechanics of prosthetics and orthotics.
-Sports Biomechanics - Mechanical analyses of sports performance.