Clustering and machine learning framework for medical time series classification

IF 5.3 2区 医学 Q1 ENGINEERING, BIOMEDICAL Biocybernetics and Biomedical Engineering Pub Date : 2024-07-01 DOI:10.1016/j.bbe.2024.07.005
Samuel Ruipérez-Campillo , Michael Reiss , Elisa Ramírez , Antonio Cebrián , José Millet , Francisco Castells
{"title":"Clustering and machine learning framework for medical time series classification","authors":"Samuel Ruipérez-Campillo ,&nbsp;Michael Reiss ,&nbsp;Elisa Ramírez ,&nbsp;Antonio Cebrián ,&nbsp;José Millet ,&nbsp;Francisco Castells","doi":"10.1016/j.bbe.2024.07.005","DOIUrl":null,"url":null,"abstract":"<div><h3>Background and motivation:</h3><p>The application of artificial intelligence in medical research, particularly unsupervised learning techniques, has shown promising potential. Medical time series data poses a unique challenge for analysis due to its complexity. Existing unsupervised learning methods often fail to effectively classify these variations, highlighting a gap in current approaches. We introduce a methodological clustering classification framework designed to accurately handle such data, aiming for improved classification tasks in biomedical signals.</p></div><div><h3>Methods:</h3><p>To address these challenges, we introduce a novel approach for the analysis and classification of medical time series data. Our method integrates agglomerative hierarchical clustering with Hilbert vector space representations of medical signals and biological sequences. We rigorously define the mathematical principles and conduct evaluations using simulations of cardiac signals, real-world neural signal datasets, open-source protein sequences, and the MNIST dataset for illustrative purposes.</p></div><div><h3>Results:</h3><p>The proposed method exhibited a 96% success rate in classifying protein sequences by function and effectively identifying families within a large protein set. In cardiac signal analysis, it retained 0.996 variance in a condensed 6-dimensional space, accurately classifying 87.4% of simulated atrial flutter groups and 99.91% of main groups when excluding conduction direction. For neural signals, it demonstrated near-perfect tracking accuracy of neural activity in mouse brain recordings, as confirmed by expert evaluations.</p></div><div><h3>Conclusion:</h3><p>Our proposed method offers a novel, translational approach for the treatment and classification of medical and biological time series, addressing some of the prevalent challenges in the field and paving the way for more reliable and effective biomedical signal analysis.</p></div>","PeriodicalId":55381,"journal":{"name":"Biocybernetics and Biomedical Engineering","volume":"44 3","pages":"Pages 521-533"},"PeriodicalIF":5.3000,"publicationDate":"2024-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Biocybernetics and Biomedical Engineering","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0208521624000524","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, BIOMEDICAL","Score":null,"Total":0}
引用次数: 0

Abstract

Background and motivation:

The application of artificial intelligence in medical research, particularly unsupervised learning techniques, has shown promising potential. Medical time series data poses a unique challenge for analysis due to its complexity. Existing unsupervised learning methods often fail to effectively classify these variations, highlighting a gap in current approaches. We introduce a methodological clustering classification framework designed to accurately handle such data, aiming for improved classification tasks in biomedical signals.

Methods:

To address these challenges, we introduce a novel approach for the analysis and classification of medical time series data. Our method integrates agglomerative hierarchical clustering with Hilbert vector space representations of medical signals and biological sequences. We rigorously define the mathematical principles and conduct evaluations using simulations of cardiac signals, real-world neural signal datasets, open-source protein sequences, and the MNIST dataset for illustrative purposes.

Results:

The proposed method exhibited a 96% success rate in classifying protein sequences by function and effectively identifying families within a large protein set. In cardiac signal analysis, it retained 0.996 variance in a condensed 6-dimensional space, accurately classifying 87.4% of simulated atrial flutter groups and 99.91% of main groups when excluding conduction direction. For neural signals, it demonstrated near-perfect tracking accuracy of neural activity in mouse brain recordings, as confirmed by expert evaluations.

Conclusion:

Our proposed method offers a novel, translational approach for the treatment and classification of medical and biological time series, addressing some of the prevalent challenges in the field and paving the way for more reliable and effective biomedical signal analysis.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
用于医学时间序列分类的聚类和机器学习框架
背景与动机:人工智能在医学研究中的应用,尤其是无监督学习技术,已显示出巨大的潜力。医学时间序列数据的复杂性给分析带来了独特的挑战。现有的无监督学习方法往往无法对这些变化进行有效分类,这凸显了当前方法的不足。方法:为了应对这些挑战,我们引入了一种新的方法来分析和分类医疗时间序列数据。我们的方法将聚类分层聚类与医学信号和生物序列的希尔伯特矢量空间表示整合在一起。我们严格定义了数学原理,并使用模拟心脏信号、真实世界神经信号数据集、开源蛋白质序列和 MNIST 数据集进行了评估。在心脏信号分析中,该方法在浓缩的 6 维空间中保留了 0.996 个方差,准确划分了 87.4% 的模拟心房扑动组,在排除传导方向的情况下,准确划分了 99.91% 的主要组。结论:我们提出的方法为医学和生物时间序列的处理和分类提供了一种新颖的转化方法,解决了该领域的一些普遍难题,为更可靠、更有效的生物医学信号分析铺平了道路。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
CiteScore
16.50
自引率
6.20%
发文量
77
审稿时长
38 days
期刊介绍: Biocybernetics and Biomedical Engineering is a quarterly journal, founded in 1981, devoted to publishing the results of original, innovative and creative research investigations in the field of Biocybernetics and biomedical engineering, which bridges mathematical, physical, chemical and engineering methods and technology to analyse physiological processes in living organisms as well as to develop methods, devices and systems used in biology and medicine, mainly in medical diagnosis, monitoring systems and therapy. The Journal''s mission is to advance scientific discovery into new or improved standards of care, and promotion a wide-ranging exchange between science and its application to humans.
期刊最新文献
Automating synaptic plasticity analysis: A deep learning approach to segmenting hippocampal field potential signal Probabilistic and explainable modeling of Phase–Phase Cross-Frequency Coupling patterns in EEG. Application to dyslexia diagnosis Skin cancer diagnosis using NIR spectroscopy data of skin lesions in vivo using machine learning algorithms Profiled delivery of bicarbonate during weekly cycle of hemodialysis Lightweight beat score map method for electrocardiogram-based arrhythmia classification
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1