Ling-Jie Fan, Feng-Yi Wang, Jun-Han Zhao, Jun-Jie Zhang, Yang-An Li, Jia Tang, Tao Lin, Quan Wei
{"title":"From physical activity patterns to cognitive status: development and validation of novel digital biomarkers for cognitive assessment in older adults.","authors":"Ling-Jie Fan, Feng-Yi Wang, Jun-Han Zhao, Jun-Jie Zhang, Yang-An Li, Jia Tang, Tao Lin, Quan Wei","doi":"10.1186/s12966-025-01706-x","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>This study aims to investigate the associations between signal-level physical activity (PA) features derived from wrist accelerometry data and cognitive status in older adults, and to evaluate their potential predictive value when combined with demographics.</p><p><strong>Methods: </strong>We analyzed PA data from 3,363 older adults (NHATS: n = 747; NHANES: n = 2,616), with each participant contributing a complete 3-day continuous activity sequence. We extracted the most relevant PA features associated with cognitive function using feature engineering and recursive feature elimination. Demographic characteristics were also incorporated, and multimodal data fusion was achieved through canonical correlation analysis. We then developed explainable machine learning models, primarily random forest, optimized with hyperparameters, to predict individual cognitive function status.</p><p><strong>Results: </strong>Using recursive feature elimination, we identified the top 20 PA features from each dataset and combined them with demographic features for modeling. The models achieved AUCs of 0.84 and 0.80 for NHATS and NHANES. Change quantiles and FFT coefficients emerged as the consistently top-ranked PA features across datasets, ranking 1st and 2nd respectively in their predictive importance for cognitive function. Models based on the top 10 PA features common to both datasets, along with demographic features, achieved AUCs above 0.8.</p><p><strong>Conclusions: </strong>This study identifies novel time-frequency domain features of physical activity that show robust associations with cognitive status across two independent cohorts. These features, particularly those capturing activity variability and rhythmicity, provide complementary information beyond traditional cumulative PA measures. Based on these findings, we developed a proof-of-concept application that demonstrates the feasibility of translating these PA analytics into practical monitoring tools in real-world settings.</p>","PeriodicalId":50336,"journal":{"name":"International Journal of Behavioral Nutrition and Physical Activity","volume":"22 1","pages":"11"},"PeriodicalIF":5.6000,"publicationDate":"2025-01-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11748278/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Behavioral Nutrition and Physical Activity","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1186/s12966-025-01706-x","RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"NUTRITION & DIETETICS","Score":null,"Total":0}
引用次数: 0
Abstract
Background: This study aims to investigate the associations between signal-level physical activity (PA) features derived from wrist accelerometry data and cognitive status in older adults, and to evaluate their potential predictive value when combined with demographics.
Methods: We analyzed PA data from 3,363 older adults (NHATS: n = 747; NHANES: n = 2,616), with each participant contributing a complete 3-day continuous activity sequence. We extracted the most relevant PA features associated with cognitive function using feature engineering and recursive feature elimination. Demographic characteristics were also incorporated, and multimodal data fusion was achieved through canonical correlation analysis. We then developed explainable machine learning models, primarily random forest, optimized with hyperparameters, to predict individual cognitive function status.
Results: Using recursive feature elimination, we identified the top 20 PA features from each dataset and combined them with demographic features for modeling. The models achieved AUCs of 0.84 and 0.80 for NHATS and NHANES. Change quantiles and FFT coefficients emerged as the consistently top-ranked PA features across datasets, ranking 1st and 2nd respectively in their predictive importance for cognitive function. Models based on the top 10 PA features common to both datasets, along with demographic features, achieved AUCs above 0.8.
Conclusions: This study identifies novel time-frequency domain features of physical activity that show robust associations with cognitive status across two independent cohorts. These features, particularly those capturing activity variability and rhythmicity, provide complementary information beyond traditional cumulative PA measures. Based on these findings, we developed a proof-of-concept application that demonstrates the feasibility of translating these PA analytics into practical monitoring tools in real-world settings.
背景:本研究旨在探讨来自腕部加速度计数据的信号水平体力活动(PA)特征与老年人认知状态之间的关系,并评估其与人口统计学相结合时的潜在预测价值。方法:我们分析了3363名老年人的PA数据(NHATS: n = 747;NHANES: n = 2,616),每个参与者贡献一个完整的3天连续活动序列。我们使用特征工程和递归特征消去提取与认知功能相关的最相关的PA特征。结合人口统计学特征,通过典型相关分析实现多模态数据融合。然后,我们开发了可解释的机器学习模型,主要是随机森林,用超参数优化,以预测个人认知功能状态。结果:通过递归特征消去,我们从每个数据集中识别出了前20个PA特征,并将它们与人口统计学特征结合起来进行建模。NHATS和NHANES模型的auc分别为0.84和0.80。变化分位数和FFT系数在数据集中始终是排名最高的PA特征,它们对认知功能的预测重要性分别排名第一和第二。基于两个数据集共有的前10个PA特征以及人口统计特征的模型的auc均高于0.8。结论:本研究在两个独立的队列中确定了新的体力活动的时频域特征,这些特征显示出与认知状态的强大关联。这些特征,特别是那些捕获活动变异性和节律性的特征,提供了传统累积PA测量之外的补充信息。基于这些发现,我们开发了一个概念验证应用程序,该应用程序演示了将这些PA分析转换为现实环境中实际监控工具的可行性。
期刊介绍:
International Journal of Behavioral Nutrition and Physical Activity (IJBNPA) is an open access, peer-reviewed journal offering high quality articles, rapid publication and wide diffusion in the public domain.
IJBNPA is devoted to furthering the understanding of the behavioral aspects of diet and physical activity and is unique in its inclusion of multiple levels of analysis, including populations, groups and individuals and its inclusion of epidemiology, and behavioral, theoretical and measurement research areas.