机器学习模型在毛细管血清蛋白电泳分析中的性能和效率。

IF 4.1 3区 医学 Q2 MEDICAL LABORATORY TECHNOLOGY Clinica Chimica Acta Pub Date : 2025-03-01 Epub Date: 2025-01-26 DOI:10.1016/j.cca.2025.120165
Xia Wang , Mei Zhang , Chuan Li , Chengyao Jia , Xijie Yu , He He
{"title":"机器学习模型在毛细管血清蛋白电泳分析中的性能和效率。","authors":"Xia Wang ,&nbsp;Mei Zhang ,&nbsp;Chuan Li ,&nbsp;Chengyao Jia ,&nbsp;Xijie Yu ,&nbsp;He He","doi":"10.1016/j.cca.2025.120165","DOIUrl":null,"url":null,"abstract":"<div><h3>Background and Objective</h3><div>Serum protein electrophoresis (SPEP) plays a critical role in diagnosing diseases associated with M−proteins. However, its clinical application is limited by a heavy reliance on experienced experts.</div></div><div><h3>Methods</h3><div>A dataset comprising 85,026 SPEP outcomes was utilized to develop artificial intelligence diagnostic models for the classification and localization of M−proteins. These models were trained and validated using three data features, and their performance was evaluated using comprehensive metrics, including sensitivity, positive predictive value (PPV), specificity, negative predictive value (NPV), F1 score, accuracy, area under the receiver operating characteristic curve (AUC), Matthews correlation coefficient (MCC), and Intersection over Union (IoU). The best-performing machine learning (ML) and deep learning (DL) models were further tested on a separate dataset of 1,079 samples. The localization ability of the DL model was compared against three clinical experts.</div></div><div><h3>Results</h3><div>Among the four ML models, the extreme gradient boosting (XGB) model achieved the best performance, with MCC, AUC, F1 score, sensitivity, specificity, accuracy, PPV, and NPV of 0.847, 0.903, 0.875, 0.822, 0.985, 0.951, 0.934, and 0.955, respectively. Different feature extraction methods significantly influenced model performance. The DL models outperformed the ML models in comprehensive performance. The U-Net combined with Transformer model demonstrated localization ability comparable to that of clinical experts, achieving sensitivity, specificity, accuracy, PPV, NPV, F1 score, AUC, MCC, and IoU of 0.947, 0.984, 0.976, 0.938, 0.986, 0.942, 0.966, 0.927, and 0.877, respectively.</div></div><div><h3>Conclusion</h3><div>The U-Net combined with the Transformer model demonstrated expert-level performance in M−protein classification and localization, achieving an accuracy of 0.976 and an IoU of 0.877. This exceptional performance highlights the potential of this combined model for automating clinical SPEP workflows.</div></div>","PeriodicalId":10205,"journal":{"name":"Clinica Chimica Acta","volume":"569 ","pages":"Article 120165"},"PeriodicalIF":4.1000,"publicationDate":"2025-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Performance and efficiency of machine learning models in analyzing capillary serum protein electrophoresis\",\"authors\":\"Xia Wang ,&nbsp;Mei Zhang ,&nbsp;Chuan Li ,&nbsp;Chengyao Jia ,&nbsp;Xijie Yu ,&nbsp;He He\",\"doi\":\"10.1016/j.cca.2025.120165\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><h3>Background and Objective</h3><div>Serum protein electrophoresis (SPEP) plays a critical role in diagnosing diseases associated with M−proteins. However, its clinical application is limited by a heavy reliance on experienced experts.</div></div><div><h3>Methods</h3><div>A dataset comprising 85,026 SPEP outcomes was utilized to develop artificial intelligence diagnostic models for the classification and localization of M−proteins. These models were trained and validated using three data features, and their performance was evaluated using comprehensive metrics, including sensitivity, positive predictive value (PPV), specificity, negative predictive value (NPV), F1 score, accuracy, area under the receiver operating characteristic curve (AUC), Matthews correlation coefficient (MCC), and Intersection over Union (IoU). The best-performing machine learning (ML) and deep learning (DL) models were further tested on a separate dataset of 1,079 samples. The localization ability of the DL model was compared against three clinical experts.</div></div><div><h3>Results</h3><div>Among the four ML models, the extreme gradient boosting (XGB) model achieved the best performance, with MCC, AUC, F1 score, sensitivity, specificity, accuracy, PPV, and NPV of 0.847, 0.903, 0.875, 0.822, 0.985, 0.951, 0.934, and 0.955, respectively. Different feature extraction methods significantly influenced model performance. The DL models outperformed the ML models in comprehensive performance. The U-Net combined with Transformer model demonstrated localization ability comparable to that of clinical experts, achieving sensitivity, specificity, accuracy, PPV, NPV, F1 score, AUC, MCC, and IoU of 0.947, 0.984, 0.976, 0.938, 0.986, 0.942, 0.966, 0.927, and 0.877, respectively.</div></div><div><h3>Conclusion</h3><div>The U-Net combined with the Transformer model demonstrated expert-level performance in M−protein classification and localization, achieving an accuracy of 0.976 and an IoU of 0.877. This exceptional performance highlights the potential of this combined model for automating clinical SPEP workflows.</div></div>\",\"PeriodicalId\":10205,\"journal\":{\"name\":\"Clinica Chimica Acta\",\"volume\":\"569 \",\"pages\":\"Article 120165\"},\"PeriodicalIF\":4.1000,\"publicationDate\":\"2025-03-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Clinica Chimica Acta\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0009898125000440\",\"RegionNum\":3,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2025/1/26 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"Q2\",\"JCRName\":\"MEDICAL LABORATORY TECHNOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Clinica Chimica Acta","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0009898125000440","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/26 0:00:00","PubModel":"Epub","JCR":"Q2","JCRName":"MEDICAL LABORATORY TECHNOLOGY","Score":null,"Total":0}
引用次数: 0

摘要

背景与目的:血清蛋白电泳(SPEP)在m蛋白相关疾病的诊断中起着至关重要的作用。然而,它的临床应用受到严重依赖经验丰富的专家的限制。方法:利用包含85,026个SPEP结果的数据集开发用于m蛋白分类和定位的人工智能诊断模型。这些模型使用三个数据特征进行训练和验证,并使用综合指标评估其性能,包括敏感性、阳性预测值(PPV)、特异性、阴性预测值(NPV)、F1评分、准确性、受试者工作特征曲线下面积(AUC)、马修斯相关系数(MCC)和交汇(IoU)。表现最好的机器学习(ML)和深度学习(DL)模型在1079个样本的单独数据集上进一步测试。将DL模型的定位能力与三位临床专家进行比较。结果:4种ML模型中,极限梯度增强(XGB)模型表现最佳,MCC、AUC、F1评分、灵敏度、特异度、准确度、PPV、NPV分别为0.847、0.903、0.875、0.822、0.985、0.951、0.934、0.955。不同的特征提取方法对模型性能有显著影响。DL模型在综合性能上优于ML模型。U-Net联合Transformer模型的定位能力与临床专家相当,灵敏度、特异性、准确度、PPV、NPV、F1评分、AUC、MCC、IoU分别为0.947、0.984、0.976、0.938、0.986、0.942、0.966、0.927、0.877。结论:U-Net结合Transformer模型在m蛋白分类和定位方面具有专家水平的性能,准确率为0.976,IoU为0.877。这种卓越的性能突出了这种组合模型在临床SPEP工作流程自动化方面的潜力。
本文章由计算机程序翻译,如有差异,请以英文原文为准。

摘要图片

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Performance and efficiency of machine learning models in analyzing capillary serum protein electrophoresis

Background and Objective

Serum protein electrophoresis (SPEP) plays a critical role in diagnosing diseases associated with M−proteins. However, its clinical application is limited by a heavy reliance on experienced experts.

Methods

A dataset comprising 85,026 SPEP outcomes was utilized to develop artificial intelligence diagnostic models for the classification and localization of M−proteins. These models were trained and validated using three data features, and their performance was evaluated using comprehensive metrics, including sensitivity, positive predictive value (PPV), specificity, negative predictive value (NPV), F1 score, accuracy, area under the receiver operating characteristic curve (AUC), Matthews correlation coefficient (MCC), and Intersection over Union (IoU). The best-performing machine learning (ML) and deep learning (DL) models were further tested on a separate dataset of 1,079 samples. The localization ability of the DL model was compared against three clinical experts.

Results

Among the four ML models, the extreme gradient boosting (XGB) model achieved the best performance, with MCC, AUC, F1 score, sensitivity, specificity, accuracy, PPV, and NPV of 0.847, 0.903, 0.875, 0.822, 0.985, 0.951, 0.934, and 0.955, respectively. Different feature extraction methods significantly influenced model performance. The DL models outperformed the ML models in comprehensive performance. The U-Net combined with Transformer model demonstrated localization ability comparable to that of clinical experts, achieving sensitivity, specificity, accuracy, PPV, NPV, F1 score, AUC, MCC, and IoU of 0.947, 0.984, 0.976, 0.938, 0.986, 0.942, 0.966, 0.927, and 0.877, respectively.

Conclusion

The U-Net combined with the Transformer model demonstrated expert-level performance in M−protein classification and localization, achieving an accuracy of 0.976 and an IoU of 0.877. This exceptional performance highlights the potential of this combined model for automating clinical SPEP workflows.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Clinica Chimica Acta
Clinica Chimica Acta 医学-医学实验技术
CiteScore
10.10
自引率
2.00%
发文量
1268
审稿时长
23 days
期刊介绍: The Official Journal of the International Federation of Clinical Chemistry and Laboratory Medicine (IFCC) Clinica Chimica Acta is a high-quality journal which publishes original Research Communications in the field of clinical chemistry and laboratory medicine, defined as the diagnostic application of chemistry, biochemistry, immunochemistry, biochemical aspects of hematology, toxicology, and molecular biology to the study of human disease in body fluids and cells. The objective of the journal is to publish novel information leading to a better understanding of biological mechanisms of human diseases, their prevention, diagnosis, and patient management. Reports of an applied clinical character are also welcome. Papers concerned with normal metabolic processes or with constituents of normal cells or body fluids, such as reports of experimental or clinical studies in animals, are only considered when they are clearly and directly relevant to human disease. Evaluation of commercial products have a low priority for publication, unless they are novel or represent a technological breakthrough. Studies dealing with effects of drugs and natural products and studies dealing with the redox status in various diseases are not within the journal''s scope. Development and evaluation of novel analytical methodologies where applicable to diagnostic clinical chemistry and laboratory medicine, including point-of-care testing, and topics on laboratory management and informatics will also be considered. Studies focused on emerging diagnostic technologies and (big) data analysis procedures including digitalization, mobile Health, and artificial Intelligence applied to Laboratory Medicine are also of interest.
期刊最新文献
Evaluation of key analytical performance metrics for reliable lymphocyte subset enumeration using full-spectrum flow cytometry Advances in laboratory diagnostic studies of pheochromocytoma and paraganglioma 5’-tRFGly in serum is a specific tRNA-derived fragment biomarker for gastric cancer diagnosis Apolipoprotein B causes overestimation of serum albumin concentration measured by bromocresol green method in pediatric nephrotic syndrome patients Mitochondrial metabolism and succinylation-related gene signatures for predicting prognosis and drug resistance in colon adenocarcinoma
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1