从心电图无创检测糖尿病和糖尿病前期的机器学习算法

IF 1.4 Q3 HEALTH CARE SCIENCES & SERVICES BMJ Innovations Pub Date : 2022-08-09 DOI:10.1136/bmjinnov-2021-000759
A. Kulkarni, Ashwini A Patel, Kanchan V Pipal, Sujeet G Jaiswal, Manisha T Jaisinghani, Vidya Thulkar, Lumbini Gajbhiye, Preeti Gondane, Archana B Patel, M. Mamtani, H. Kulkarni
{"title":"从心电图无创检测糖尿病和糖尿病前期的机器学习算法","authors":"A. Kulkarni, Ashwini A Patel, Kanchan V Pipal, Sujeet G Jaiswal, Manisha T Jaisinghani, Vidya Thulkar, Lumbini Gajbhiye, Preeti Gondane, Archana B Patel, M. Mamtani, H. Kulkarni","doi":"10.1136/bmjinnov-2021-000759","DOIUrl":null,"url":null,"abstract":"Objectives Early detection is of crucial importance for prevention of type 2 diabetes and pre-diabetes. Diagnosis of these conditions relies on the oral glucose tolerance test and haemoglobin A1c estimation which are invasive and challenging for large-scale screening. We aimed to combine the non-invasive nature of ECG with the power of machine learning to detect diabetes and pre-diabetes. Methods Data for this study come from Diabetes in Sindhi Families in Nagpur study of ethnically endogenous Sindhi population from central India. Final dataset included clinical data from 1262 individuals and 10 461 time-aligned heartbeats recorded digitally. The dataset was split into a training set, a validation set and independent test set (8892, 523 and 1046 beats, respectively). The ECG recordings were processed with median filtering, band-pass filtering and standard scaling. Minority oversampling was undertaken to balance the training dataset before initiation of training. Extreme gradient boosting (XGBoost) was used to train the classifier that used the signal-processed ECG as input and predicted the membership to ‘no diabetes’, pre-diabetes or type 2 diabetes classes (defined according to American Diabetes Association criteria). Results Prevalence of type 2 diabetes and pre-diabetes was ~30% and ~14%, respectively. Training was smooth and quick (convergence achieved within 40 epochs). In the independent test set, the DiaBeats algorithm predicted the classes with 97.1% precision, 96.2% recall, 96.8% accuracy and 96.6% F1 score. The calibrated model had a low calibration error (0.06). The feature importance maps indicated that leads III, augmented Vector Left (aVL), V4, V5 and V6 were most contributory to the classification performance. The predictions matched the clinical expectations based on the biological mechanisms of cardiac involvement in diabetes. Conclusions Machine-learning-based DiaBeats algorithm using ECG signal data accurately predicted diabetes-related classes. This algorithm can help in early detection of diabetes and pre-diabetes after robust validation in external datasets.","PeriodicalId":53454,"journal":{"name":"BMJ Innovations","volume":"59 1","pages":"32 - 42"},"PeriodicalIF":1.4000,"publicationDate":"2022-08-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"8","resultStr":"{\"title\":\"Machine-learning algorithm to non-invasively detect diabetes and pre-diabetes from electrocardiogram\",\"authors\":\"A. Kulkarni, Ashwini A Patel, Kanchan V Pipal, Sujeet G Jaiswal, Manisha T Jaisinghani, Vidya Thulkar, Lumbini Gajbhiye, Preeti Gondane, Archana B Patel, M. Mamtani, H. Kulkarni\",\"doi\":\"10.1136/bmjinnov-2021-000759\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Objectives Early detection is of crucial importance for prevention of type 2 diabetes and pre-diabetes. Diagnosis of these conditions relies on the oral glucose tolerance test and haemoglobin A1c estimation which are invasive and challenging for large-scale screening. We aimed to combine the non-invasive nature of ECG with the power of machine learning to detect diabetes and pre-diabetes. Methods Data for this study come from Diabetes in Sindhi Families in Nagpur study of ethnically endogenous Sindhi population from central India. Final dataset included clinical data from 1262 individuals and 10 461 time-aligned heartbeats recorded digitally. The dataset was split into a training set, a validation set and independent test set (8892, 523 and 1046 beats, respectively). The ECG recordings were processed with median filtering, band-pass filtering and standard scaling. Minority oversampling was undertaken to balance the training dataset before initiation of training. Extreme gradient boosting (XGBoost) was used to train the classifier that used the signal-processed ECG as input and predicted the membership to ‘no diabetes’, pre-diabetes or type 2 diabetes classes (defined according to American Diabetes Association criteria). Results Prevalence of type 2 diabetes and pre-diabetes was ~30% and ~14%, respectively. Training was smooth and quick (convergence achieved within 40 epochs). In the independent test set, the DiaBeats algorithm predicted the classes with 97.1% precision, 96.2% recall, 96.8% accuracy and 96.6% F1 score. The calibrated model had a low calibration error (0.06). The feature importance maps indicated that leads III, augmented Vector Left (aVL), V4, V5 and V6 were most contributory to the classification performance. The predictions matched the clinical expectations based on the biological mechanisms of cardiac involvement in diabetes. Conclusions Machine-learning-based DiaBeats algorithm using ECG signal data accurately predicted diabetes-related classes. This algorithm can help in early detection of diabetes and pre-diabetes after robust validation in external datasets.\",\"PeriodicalId\":53454,\"journal\":{\"name\":\"BMJ Innovations\",\"volume\":\"59 1\",\"pages\":\"32 - 42\"},\"PeriodicalIF\":1.4000,\"publicationDate\":\"2022-08-09\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"8\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"BMJ Innovations\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1136/bmjinnov-2021-000759\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"HEALTH CARE SCIENCES & SERVICES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"BMJ Innovations","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1136/bmjinnov-2021-000759","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"HEALTH CARE SCIENCES & SERVICES","Score":null,"Total":0}
引用次数: 8

摘要

目的早期发现对预防2型糖尿病及前驱糖尿病至关重要。这些疾病的诊断依赖于口服葡萄糖耐量试验和血红蛋白A1c的估计,这对于大规模筛查是侵入性的和具有挑战性的。我们的目标是将心电图的非侵入性与机器学习的力量结合起来,以检测糖尿病和糖尿病前期。方法本研究的数据来自那格浦尔信德家族糖尿病患者研究,研究对象是来自印度中部的内源性信德人。最终数据集包括1262个人的临床数据和10461次按时间排列的心跳数字记录。数据集分为训练集、验证集和独立测试集(分别为8892次、523次和1046次)。对心电记录进行中值滤波、带通滤波和标准缩放处理。在训练开始前进行少数过采样以平衡训练数据集。使用极端梯度增强(XGBoost)来训练分类器,该分类器使用信号处理后的ECG作为输入,并预测“无糖尿病”、糖尿病前期或2型糖尿病类别(根据美国糖尿病协会标准定义)的隶属度。结果2型糖尿病和糖尿病前期患病率分别为~30%和~14%。训练是顺利和快速的(在40个时期内达到收敛)。在独立测试集中,DiaBeats算法预测类别的准确率为97.1%,召回率为96.2%,准确率为96.8%,F1得分为96.6%。校正后的模型校正误差较低(0.06)。特征重要性图显示导联III、增强向量左(augmented Vector Left, aVL)、V4、V5和V6对分类效果的贡献最大。这些预测与基于糖尿病中心脏参与的生物学机制的临床预期相符。结论基于机器学习的DiaBeats算法利用心电信号数据准确预测糖尿病相关类别。该算法在外部数据集上经过鲁棒性验证后,可以帮助早期发现糖尿病和糖尿病前期。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Machine-learning algorithm to non-invasively detect diabetes and pre-diabetes from electrocardiogram
Objectives Early detection is of crucial importance for prevention of type 2 diabetes and pre-diabetes. Diagnosis of these conditions relies on the oral glucose tolerance test and haemoglobin A1c estimation which are invasive and challenging for large-scale screening. We aimed to combine the non-invasive nature of ECG with the power of machine learning to detect diabetes and pre-diabetes. Methods Data for this study come from Diabetes in Sindhi Families in Nagpur study of ethnically endogenous Sindhi population from central India. Final dataset included clinical data from 1262 individuals and 10 461 time-aligned heartbeats recorded digitally. The dataset was split into a training set, a validation set and independent test set (8892, 523 and 1046 beats, respectively). The ECG recordings were processed with median filtering, band-pass filtering and standard scaling. Minority oversampling was undertaken to balance the training dataset before initiation of training. Extreme gradient boosting (XGBoost) was used to train the classifier that used the signal-processed ECG as input and predicted the membership to ‘no diabetes’, pre-diabetes or type 2 diabetes classes (defined according to American Diabetes Association criteria). Results Prevalence of type 2 diabetes and pre-diabetes was ~30% and ~14%, respectively. Training was smooth and quick (convergence achieved within 40 epochs). In the independent test set, the DiaBeats algorithm predicted the classes with 97.1% precision, 96.2% recall, 96.8% accuracy and 96.6% F1 score. The calibrated model had a low calibration error (0.06). The feature importance maps indicated that leads III, augmented Vector Left (aVL), V4, V5 and V6 were most contributory to the classification performance. The predictions matched the clinical expectations based on the biological mechanisms of cardiac involvement in diabetes. Conclusions Machine-learning-based DiaBeats algorithm using ECG signal data accurately predicted diabetes-related classes. This algorithm can help in early detection of diabetes and pre-diabetes after robust validation in external datasets.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
BMJ Innovations
BMJ Innovations Medicine-Medicine (all)
CiteScore
4.20
自引率
0.00%
发文量
63
期刊介绍: Healthcare is undergoing a revolution and novel medical technologies are being developed to treat patients in better and faster ways. Mobile revolution has put a handheld computer in pockets of billions and we are ushering in an era of mHealth. In developed and developing world alike healthcare costs are a concern and frugal innovations are being promoted for bringing down the costs of healthcare. BMJ Innovations aims to promote innovative research which creates new, cost-effective medical devices, technologies, processes and systems that improve patient care, with particular focus on the needs of patients, physicians, and the health care industry as a whole and act as a platform to catalyse and seed more innovations. Submissions to BMJ Innovations will be considered from all clinical areas of medicine along with business and process innovations that make healthcare accessible and affordable. Submissions from groups of investigators engaged in international collaborations are especially encouraged. The broad areas of innovations that this journal aims to chronicle include but are not limited to: Medical devices, mHealth and wearable health technologies, Assistive technologies, Diagnostics, Health IT, systems and process innovation.
期刊最新文献
Biomechanical analysis of delivering chest compressions in virtual reality simulation: preliminary insights into fidelity and validity of virtual reality as a means of delivering life support training iCount: a human-factors engineered solution to vaginal swab retention – an early-stage innovation report Spreading and scaling innovation and improvement: understanding why the differences matter Role of twinnings in scaling up innovative solutions across Europe Wings of Healing: regulatory landscape, case studies and the Indian prologue of drones in healthcare
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1