多状态聚类分析(MSCA),一种无监督的方法,用于多个时间到事件的电子健康记录,应用于与心肌梗死相关的多种疾病。

IF 3.4 3区 医学 Q1 HEALTH CARE SCIENCES & SERVICES BMC Medical Research Methodology Pub Date : 2025-02-04 DOI:10.1186/s12874-025-02476-7
Marc Delord, Abdel Douiri
{"title":"多状态聚类分析(MSCA),一种无监督的方法,用于多个时间到事件的电子健康记录,应用于与心肌梗死相关的多种疾病。","authors":"Marc Delord, Abdel Douiri","doi":"10.1186/s12874-025-02476-7","DOIUrl":null,"url":null,"abstract":"<p><p>Multimorbidity is characterized by the accrual of two or more long-term conditions (LTCs) in an individual. This state of health is increasingly prevalent and poses public health challenges. Adapting approaches to effectively analyse electronic health records is needed to better understand multimorbidity. We propose a novel unsupervised clustering approach to multiple time-to-event health records denoted as multiple state clustering analysis (MSCA). In MSCA, patients' pairwise dissimilarities are computed using patients' state matrices which are composed of multiple censored time-to-event indicators reflecting patients' health history. The use of state matrices enables the analysis of an arbitrary number of LTCs without reducing patients' health trajectories to a particular sequence of events. MSCA was applied to analyse multimorbidity associated with myocardial infarction using electronic health records of 26 LTCs, including conventional cardiovascular risk factors (CVRFs) such as diabetes and hypertension, collected from south London general practices between 2005 and 2021 in 5087 patients using the MSCA R library. We identified a typology of 11 clusters, characterised by age at onset of myocardial infarction, sequences of conventional CVRFs and non-conventional risk factors including physical and mental health conditions. Interestingly, multivariate analysis revealed that clusters were also associated with various combinations of socio-demographic characteristics including gender and ethnicity. By identifying meaningful sequences of LTCs associated with myocardial infarction and distinct socio-demographic characteristics, MSCA proves to be an effective approach to the analysis of electronic health records, with the potential to enhance our understanding of multimorbidity for improved prevention and management.</p>","PeriodicalId":9114,"journal":{"name":"BMC Medical Research Methodology","volume":"25 1","pages":"32"},"PeriodicalIF":3.4000,"publicationDate":"2025-02-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11792209/pdf/","citationCount":"0","resultStr":"{\"title\":\"Multiple states clustering analysis (MSCA), an unsupervised approach to multiple time-to-event electronic health records applied to multimorbidity associated with myocardial infarction.\",\"authors\":\"Marc Delord, Abdel Douiri\",\"doi\":\"10.1186/s12874-025-02476-7\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Multimorbidity is characterized by the accrual of two or more long-term conditions (LTCs) in an individual. This state of health is increasingly prevalent and poses public health challenges. Adapting approaches to effectively analyse electronic health records is needed to better understand multimorbidity. We propose a novel unsupervised clustering approach to multiple time-to-event health records denoted as multiple state clustering analysis (MSCA). In MSCA, patients' pairwise dissimilarities are computed using patients' state matrices which are composed of multiple censored time-to-event indicators reflecting patients' health history. The use of state matrices enables the analysis of an arbitrary number of LTCs without reducing patients' health trajectories to a particular sequence of events. MSCA was applied to analyse multimorbidity associated with myocardial infarction using electronic health records of 26 LTCs, including conventional cardiovascular risk factors (CVRFs) such as diabetes and hypertension, collected from south London general practices between 2005 and 2021 in 5087 patients using the MSCA R library. We identified a typology of 11 clusters, characterised by age at onset of myocardial infarction, sequences of conventional CVRFs and non-conventional risk factors including physical and mental health conditions. Interestingly, multivariate analysis revealed that clusters were also associated with various combinations of socio-demographic characteristics including gender and ethnicity. By identifying meaningful sequences of LTCs associated with myocardial infarction and distinct socio-demographic characteristics, MSCA proves to be an effective approach to the analysis of electronic health records, with the potential to enhance our understanding of multimorbidity for improved prevention and management.</p>\",\"PeriodicalId\":9114,\"journal\":{\"name\":\"BMC Medical Research Methodology\",\"volume\":\"25 1\",\"pages\":\"32\"},\"PeriodicalIF\":3.4000,\"publicationDate\":\"2025-02-04\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11792209/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"BMC Medical Research Methodology\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1186/s12874-025-02476-7\",\"RegionNum\":3,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"HEALTH CARE SCIENCES & SERVICES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"BMC Medical Research Methodology","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1186/s12874-025-02476-7","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"HEALTH CARE SCIENCES & SERVICES","Score":null,"Total":0}
引用次数: 0

摘要

多病的特点是个体中出现两种或两种以上的长期疾病(LTCs)。这种健康状况日益普遍,对公共卫生构成挑战。需要调整方法来有效地分析电子健康记录,以便更好地了解多重疾病。我们提出了一种新的无监督聚类方法来处理多个时间到事件的健康记录,称为多状态聚类分析(MSCA)。在MSCA中,使用患者状态矩阵计算患者的两两差异,该矩阵由反映患者健康史的多个截短时间到事件指标组成。使用状态矩阵可以分析任意数量的LTCs,而不会将患者的健康轨迹减少到特定的事件序列。MSCA应用于分析与心肌梗死相关的多病,使用26个LTCs的电子健康记录,包括传统的心血管危险因素(cvrf),如糖尿病和高血压,收集自2005年至2021年间使用MSCA R库的南伦敦全科诊所的5087名患者。我们确定了11个集群的类型,其特征是心肌梗死发病年龄、常规cvrf序列和包括身体和精神健康状况在内的非常规风险因素。有趣的是,多变量分析显示,聚类还与社会人口特征的各种组合有关,包括性别和种族。通过识别与心肌梗死相关的有意义的LTCs序列和独特的社会人口统计学特征,MSCA被证明是一种有效的电子健康记录分析方法,有可能增强我们对多病的理解,从而改进预防和管理。
本文章由计算机程序翻译,如有差异,请以英文原文为准。

摘要图片

摘要图片

摘要图片

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Multiple states clustering analysis (MSCA), an unsupervised approach to multiple time-to-event electronic health records applied to multimorbidity associated with myocardial infarction.

Multimorbidity is characterized by the accrual of two or more long-term conditions (LTCs) in an individual. This state of health is increasingly prevalent and poses public health challenges. Adapting approaches to effectively analyse electronic health records is needed to better understand multimorbidity. We propose a novel unsupervised clustering approach to multiple time-to-event health records denoted as multiple state clustering analysis (MSCA). In MSCA, patients' pairwise dissimilarities are computed using patients' state matrices which are composed of multiple censored time-to-event indicators reflecting patients' health history. The use of state matrices enables the analysis of an arbitrary number of LTCs without reducing patients' health trajectories to a particular sequence of events. MSCA was applied to analyse multimorbidity associated with myocardial infarction using electronic health records of 26 LTCs, including conventional cardiovascular risk factors (CVRFs) such as diabetes and hypertension, collected from south London general practices between 2005 and 2021 in 5087 patients using the MSCA R library. We identified a typology of 11 clusters, characterised by age at onset of myocardial infarction, sequences of conventional CVRFs and non-conventional risk factors including physical and mental health conditions. Interestingly, multivariate analysis revealed that clusters were also associated with various combinations of socio-demographic characteristics including gender and ethnicity. By identifying meaningful sequences of LTCs associated with myocardial infarction and distinct socio-demographic characteristics, MSCA proves to be an effective approach to the analysis of electronic health records, with the potential to enhance our understanding of multimorbidity for improved prevention and management.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
BMC Medical Research Methodology
BMC Medical Research Methodology 医学-卫生保健
CiteScore
6.50
自引率
2.50%
发文量
298
审稿时长
3-8 weeks
期刊介绍: BMC Medical Research Methodology is an open access journal publishing original peer-reviewed research articles in methodological approaches to healthcare research. Articles on the methodology of epidemiological research, clinical trials and meta-analysis/systematic review are particularly encouraged, as are empirical studies of the associations between choice of methodology and study outcomes. BMC Medical Research Methodology does not aim to publish articles describing scientific methods or techniques: these should be directed to the BMC journal covering the relevant biomedical subject area.
期刊最新文献
Mitigating bias in the analysis and inferences from using longitudinal EHR data in disease outcomes research. Experiences from conducting systematic reviews of systematic reviews. Predictive averaging and Rubin's rule-based model pooling to predict survival risk with imputations in the presence of missing patient data: methodology and verification using two case studies and simulations. Enhancing recruitment methodologies: leveraging the Tailored Design Method to survey populations with varied engagement in healthcare. Clustering longitudinal multivariate trajectories using an ensemble of principal component trees.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1