机器学习模型在毛细管血清蛋白电泳分析中的性能和效率。

IF 4.1 3区医学 Q2 MEDICAL LABORATORY TECHNOLOGY Clinica Chimica Acta Pub Date : 2025-03-01 Epub Date: 2025-01-26 DOI:10.1016/j.cca.2025.120165

Xia Wang , Mei Zhang , Chuan Li , Chengyao Jia , Xijie Yu , He He

{"title":"机器学习模型在毛细管血清蛋白电泳分析中的性能和效率。","authors":"Xia Wang , Mei Zhang , Chuan Li , Chengyao Jia , Xijie Yu , He He","doi":"10.1016/j.cca.2025.120165","DOIUrl":null,"url":null,"abstract":"<div><h3>Background and Objective</h3><div>Serum protein electrophoresis (SPEP) plays a critical role in diagnosing diseases associated with M−proteins. However, its clinical application is limited by a heavy reliance on experienced experts.</div></div><div><h3>Methods</h3><div>A dataset comprising 85,026 SPEP outcomes was utilized to develop artificial intelligence diagnostic models for the classification and localization of M−proteins. These models were trained and validated using three data features, and their performance was evaluated using comprehensive metrics, including sensitivity, positive predictive value (PPV), specificity, negative predictive value (NPV), F1 score, accuracy, area under the receiver operating characteristic curve (AUC), Matthews correlation coefficient (MCC), and Intersection over Union (IoU). The best-performing machine learning (ML) and deep learning (DL) models were further tested on a separate dataset of 1,079 samples. The localization ability of the DL model was compared against three clinical experts.</div></div><div><h3>Results</h3><div>Among the four ML models, the extreme gradient boosting (XGB) model achieved the best performance, with MCC, AUC, F1 score, sensitivity, specificity, accuracy, PPV, and NPV of 0.847, 0.903, 0.875, 0.822, 0.985, 0.951, 0.934, and 0.955, respectively. Different feature extraction methods significantly influenced model performance. The DL models outperformed the ML models in comprehensive performance. The U-Net combined with Transformer model demonstrated localization ability comparable to that of clinical experts, achieving sensitivity, specificity, accuracy, PPV, NPV, F1 score, AUC, MCC, and IoU of 0.947, 0.984, 0.976, 0.938, 0.986, 0.942, 0.966, 0.927, and 0.877, respectively.</div></div><div><h3>Conclusion</h3><div>The U-Net combined with the Transformer model demonstrated expert-level performance in M−protein classification and localization, achieving an accuracy of 0.976 and an IoU of 0.877. This exceptional performance highlights the potential of this combined model for automating clinical SPEP workflows.</div></div>","PeriodicalId":10205,"journal":{"name":"Clinica Chimica Acta","volume":"569 ","pages":"Article 120165"},"PeriodicalIF":4.1000,"publicationDate":"2025-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Performance and efficiency of machine learning models in analyzing capillary serum protein electrophoresis\",\"authors\":\"Xia Wang , Mei Zhang , Chuan Li , Chengyao Jia , Xijie Yu , He He\",\"doi\":\"10.1016/j.cca.2025.120165\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><h3>Background and Objective</h3><div>Serum protein electrophoresis (SPEP) plays a critical role in diagnosing diseases associated with M−proteins. However, its clinical application is limited by a heavy reliance on experienced experts.</div></div><div><h3>Methods</h3><div>A dataset comprising 85,026 SPEP outcomes was utilized to develop artificial intelligence diagnostic models for the classification and localization of M−proteins. These models were trained and validated using three data features, and their performance was evaluated using comprehensive metrics, including sensitivity, positive predictive value (PPV), specificity, negative predictive value (NPV), F1 score, accuracy, area under the receiver operating characteristic curve (AUC), Matthews correlation coefficient (MCC), and Intersection over Union (IoU). The best-performing machine learning (ML) and deep learning (DL) models were further tested on a separate dataset of 1,079 samples. The localization ability of the DL model was compared against three clinical experts.</div></div><div><h3>Results</h3><div>Among the four ML models, the extreme gradient boosting (XGB) model achieved the best performance, with MCC, AUC, F1 score, sensitivity, specificity, accuracy, PPV, and NPV of 0.847, 0.903, 0.875, 0.822, 0.985, 0.951, 0.934, and 0.955, respectively. Different feature extraction methods significantly influenced model performance. The DL models outperformed the ML models in comprehensive performance. The U-Net combined with Transformer model demonstrated localization ability comparable to that of clinical experts, achieving sensitivity, specificity, accuracy, PPV, NPV, F1 score, AUC, MCC, and IoU of 0.947, 0.984, 0.976, 0.938, 0.986, 0.942, 0.966, 0.927, and 0.877, respectively.</div></div><div><h3>Conclusion</h3><div>The U-Net combined with the Transformer model demonstrated expert-level performance in M−protein classification and localization, achieving an accuracy of 0.976 and an IoU of 0.877. This exceptional performance highlights the potential of this combined model for automating clinical SPEP workflows.</div></div>\",\"PeriodicalId\":10205,\"journal\":{\"name\":\"Clinica Chimica Acta\",\"volume\":\"569 \",\"pages\":\"Article 120165\"},\"PeriodicalIF\":4.1000,\"publicationDate\":\"2025-03-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Clinica Chimica Acta\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0009898125000440\",\"RegionNum\":3,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2025/1/26 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"Q2\",\"JCRName\":\"MEDICAL LABORATORY TECHNOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Clinica Chimica Acta","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0009898125000440","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/26 0:00:00","PubModel":"Epub","JCR":"Q2","JCRName":"MEDICAL LABORATORY TECHNOLOGY","Score":null,"Total":0}

引用次数: 0

摘要

背景与目的：血清蛋白电泳（SPEP）在m蛋白相关疾病的诊断中起着至关重要的作用。然而，它的临床应用受到严重依赖经验丰富的专家的限制。方法：利用包含85,026个SPEP结果的数据集开发用于m蛋白分类和定位的人工智能诊断模型。这些模型使用三个数据特征进行训练和验证，并使用综合指标评估其性能，包括敏感性、阳性预测值（PPV）、特异性、阴性预测值（NPV）、F1评分、准确性、受试者工作特征曲线下面积（AUC）、马修斯相关系数（MCC）和交汇（IoU）。表现最好的机器学习（ML）和深度学习（DL）模型在1079个样本的单独数据集上进一步测试。将DL模型的定位能力与三位临床专家进行比较。结果：4种ML模型中，极限梯度增强（XGB）模型表现最佳，MCC、AUC、F1评分、灵敏度、特异度、准确度、PPV、NPV分别为0.847、0.903、0.875、0.822、0.985、0.951、0.934、0.955。不同的特征提取方法对模型性能有显著影响。DL模型在综合性能上优于ML模型。U-Net联合Transformer模型的定位能力与临床专家相当，灵敏度、特异性、准确度、PPV、NPV、F1评分、AUC、MCC、IoU分别为0.947、0.984、0.976、0.938、0.986、0.942、0.966、0.927、0.877。结论：U-Net结合Transformer模型在m蛋白分类和定位方面具有专家水平的性能，准确率为0.976，IoU为0.877。这种卓越的性能突出了这种组合模型在临床SPEP工作流程自动化方面的潜力。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

摘要图片

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Performance and efficiency of machine learning models in analyzing capillary serum protein electrophoresis

Background and Objective

Serum protein electrophoresis (SPEP) plays a critical role in diagnosing diseases associated with M−proteins. However, its clinical application is limited by a heavy reliance on experienced experts.

Methods

A dataset comprising 85,026 SPEP outcomes was utilized to develop artificial intelligence diagnostic models for the classification and localization of M−proteins. These models were trained and validated using three data features, and their performance was evaluated using comprehensive metrics, including sensitivity, positive predictive value (PPV), specificity, negative predictive value (NPV), F1 score, accuracy, area under the receiver operating characteristic curve (AUC), Matthews correlation coefficient (MCC), and Intersection over Union (IoU). The best-performing machine learning (ML) and deep learning (DL) models were further tested on a separate dataset of 1,079 samples. The localization ability of the DL model was compared against three clinical experts.

Results

Among the four ML models, the extreme gradient boosting (XGB) model achieved the best performance, with MCC, AUC, F1 score, sensitivity, specificity, accuracy, PPV, and NPV of 0.847, 0.903, 0.875, 0.822, 0.985, 0.951, 0.934, and 0.955, respectively. Different feature extraction methods significantly influenced model performance. The DL models outperformed the ML models in comprehensive performance. The U-Net combined with Transformer model demonstrated localization ability comparable to that of clinical experts, achieving sensitivity, specificity, accuracy, PPV, NPV, F1 score, AUC, MCC, and IoU of 0.947, 0.984, 0.976, 0.938, 0.986, 0.942, 0.966, 0.927, and 0.877, respectively.

Conclusion

The U-Net combined with the Transformer model demonstrated expert-level performance in M−protein classification and localization, achieving an accuracy of 0.976 and an IoU of 0.877. This exceptional performance highlights the potential of this combined model for automating clinical SPEP workflows.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Clinica Chimica Acta 医学-医学实验技术

CiteScore

10.10

自引率

2.00%

发文量

1268

审稿时长

23 days

期刊介绍： The Official Journal of the International Federation of Clinical Chemistry and Laboratory Medicine (IFCC) Clinica Chimica Acta is a high-quality journal which publishes original Research Communications in the field of clinical chemistry and laboratory medicine, defined as the diagnostic application of chemistry, biochemistry, immunochemistry, biochemical aspects of hematology, toxicology, and molecular biology to the study of human disease in body fluids and cells. The objective of the journal is to publish novel information leading to a better understanding of biological mechanisms of human diseases, their prevention, diagnosis, and patient management. Reports of an applied clinical character are also welcome. Papers concerned with normal metabolic processes or with constituents of normal cells or body fluids, such as reports of experimental or clinical studies in animals, are only considered when they are clearly and directly relevant to human disease. Evaluation of commercial products have a low priority for publication, unless they are novel or represent a technological breakthrough. Studies dealing with effects of drugs and natural products and studies dealing with the redox status in various diseases are not within the journal''s scope. Development and evaluation of novel analytical methodologies where applicable to diagnostic clinical chemistry and laboratory medicine, including point-of-care testing, and topics on laboratory management and informatics will also be considered. Studies focused on emerging diagnostic technologies and (big) data analysis procedures including digitalization, mobile Health, and artificial Intelligence applied to Laboratory Medicine are also of interest.