Expert-level detection of M-proteins in serum protein electrophoresis using machine learning.

IF 3.7 2区医学 Q1 MEDICAL LABORATORY TECHNOLOGY Clinical chemistry and laboratory medicine Pub Date : 2024-06-17 Print Date: 2024-11-26 DOI:10.1515/cclm-2024-0222

Eike Elfert, Wolfgang E Kaminski, Christian Matek, Gregor Hoermann, Eyvind W Axelsen, Carsten Marr, Armin P Piehler

{"title":"Expert-level detection of M-proteins in serum protein electrophoresis using machine learning.","authors":"Eike Elfert, Wolfgang E Kaminski, Christian Matek, Gregor Hoermann, Eyvind W Axelsen, Carsten Marr, Armin P Piehler","doi":"10.1515/cclm-2024-0222","DOIUrl":null,"url":null,"abstract":"Objectives: Serum protein electrophoresis (SPE) in combination with immunotyping (IMT) is the diagnostic standard for detecting monoclonal proteins (M-proteins). However, interpretation of SPE and IMT is weakly standardized, time consuming and investigator dependent. Here, we present five machine learning (ML) approaches for automated detection of M-proteins on SPE on an unprecedented large and well-curated data set and compare the performance with that of laboratory experts.Methods: SPE and IMT were performed in serum samples from 69,722 individuals from Norway. IMT results were used to label the samples as M-protein present (positive, n=4,273) or absent (negative n=65,449). Four feature-based ML algorithms and one convolutional neural network (CNN) were trained on 68,722 randomly selected SPE patterns to detect M-proteins. Algorithm performance was compared to that of an expert group of clinical pathologists and laboratory technicians (n=10) on a test set of 1,000 samples.Results: The random forest classifier showed the best performance (F1-Score 93.2 %, accuracy 99.1 %, sensitivity 89.9 %, specificity 99.8 %, positive predictive value 96.9 %, negative predictive value 99.3 %) and outperformed the experts (F1-Score 61.2 ± 16.0 %, accuracy 89.2 ± 10.2 %, sensitivity 94.3 ± 2.8 %, specificity 88.9 ± 10.9 %, positive predictive value 47.3 ± 16.2 %, negative predictive value 99.5 ± 0.2 %) on the test set. Interestingly the performance of the RFC saturated, the CNN performance increased steadily within our training set (n=68,722).Conclusions: Feature-based ML systems are capable of automated detection of M-proteins on SPE beyond expert-level and show potential for use in the clinical laboratory.","PeriodicalId":10390,"journal":{"name":"Clinical chemistry and laboratory medicine","volume":" ","pages":"2498-2506"},"PeriodicalIF":3.7000,"publicationDate":"2024-06-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11470231/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Clinical chemistry and laboratory medicine","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1515/cclm-2024-0222","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/11/26 0:00:00","PubModel":"Print","JCR":"Q1","JCRName":"MEDICAL LABORATORY TECHNOLOGY","Score":null,"Total":0}

引用次数: 0

Abstract

Objectives: Serum protein electrophoresis (SPE) in combination with immunotyping (IMT) is the diagnostic standard for detecting monoclonal proteins (M-proteins). However, interpretation of SPE and IMT is weakly standardized, time consuming and investigator dependent. Here, we present five machine learning (ML) approaches for automated detection of M-proteins on SPE on an unprecedented large and well-curated data set and compare the performance with that of laboratory experts.

Methods: SPE and IMT were performed in serum samples from 69,722 individuals from Norway. IMT results were used to label the samples as M-protein present (positive, n=4,273) or absent (negative n=65,449). Four feature-based ML algorithms and one convolutional neural network (CNN) were trained on 68,722 randomly selected SPE patterns to detect M-proteins. Algorithm performance was compared to that of an expert group of clinical pathologists and laboratory technicians (n=10) on a test set of 1,000 samples.

Results: The random forest classifier showed the best performance (F1-Score 93.2 %, accuracy 99.1 %, sensitivity 89.9 %, specificity 99.8 %, positive predictive value 96.9 %, negative predictive value 99.3 %) and outperformed the experts (F1-Score 61.2 ± 16.0 %, accuracy 89.2 ± 10.2 %, sensitivity 94.3 ± 2.8 %, specificity 88.9 ± 10.9 %, positive predictive value 47.3 ± 16.2 %, negative predictive value 99.5 ± 0.2 %) on the test set. Interestingly the performance of the RFC saturated, the CNN performance increased steadily within our training set (n=68,722).

Conclusions: Feature-based ML systems are capable of automated detection of M-proteins on SPE beyond expert-level and show potential for use in the clinical laboratory.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

利用机器学习对血清蛋白电泳中的 M 蛋白进行专家级检测。

目的：血清蛋白电泳（SPE）结合免疫分型（IMT）是检测单克隆蛋白（M 蛋白）的诊断标准。然而，SPE 和 IMT 的判读标准化程度低、耗时长且依赖于研究人员。在此，我们介绍了五种机器学习（ML）方法，用于在史无前例的大型精选数据集上自动检测 SPE 中的 M 蛋白，并将其性能与实验室专家的性能进行比较：对来自挪威 69,722 人的血清样本进行了 SPE 和 IMT 检测。IMT 结果用于将样本标记为存在 M 蛋白（阳性，n=4,273）或不存在 M 蛋白（阴性，n=65,449）。四种基于特征的 ML 算法和一种卷积神经网络 (CNN) 在 68,722 个随机选择的 SPE 模式上进行了训练，以检测 M 蛋白。在 1,000 个样本的测试集上，将算法性能与临床病理学家和实验室技术人员专家组（10 人）的性能进行了比较：结果：随机森林分类器表现最佳（F1-分数 93.2 %，准确率 99.1 %，灵敏度 89.9 %，特异性 99.8 %，阳性预测值 96.9 %，阴性预测值 99.3 %），优于临床病理学家和实验室技术人员（人数=10）。3 %），在测试集上的表现优于专家（F1-分数 61.2 ± 16.0 %，准确率 89.2 ± 10.2 %，灵敏度 94.3 ± 2.8 %，特异性 88.9 ± 10.9 %，阳性预测值 47.3 ± 16.2 %，阴性预测值 99.5 ± 0.2 %）。有趣的是，在我们的训练集（n=68,722）中，RFC 的性能趋于饱和，而 CNN 的性能则稳步上升：结论：基于特征的 ML 系统能够自动检测 SPE 上的 M 蛋白，其能力已超过专家水平，并显示出在临床实验室中使用的潜力。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Clinical chemistry and laboratory medicine 医学-医学实验技术

CiteScore

11.30

自引率

16.20%

发文量

306

审稿时长

3 months

期刊介绍： Clinical Chemistry and Laboratory Medicine (CCLM) publishes articles on novel teaching and training methods applicable to laboratory medicine. CCLM welcomes contributions on the progress in fundamental and applied research and cutting-edge clinical laboratory medicine. It is one of the leading journals in the field, with an impact factor over 3. CCLM is issued monthly, and it is published in print and electronically. CCLM is the official journal of the European Federation of Clinical Chemistry and Laboratory Medicine (EFLM) and publishes regularly EFLM recommendations and news. CCLM is the official journal of the National Societies from Austria (ÖGLMKC); Belgium (RBSLM); Germany (DGKL); Hungary (MLDT); Ireland (ACBI); Italy (SIBioC); Portugal (SPML); and Slovenia (SZKK); and it is affiliated to AACB (Australia) and SFBC (France). Topics: - clinical biochemistry - clinical genomics and molecular biology - clinical haematology and coagulation - clinical immunology and autoimmunity - clinical microbiology - drug monitoring and analysis - evaluation of diagnostic biomarkers - disease-oriented topics (cardiovascular disease, cancer diagnostics, diabetes) - new reagents, instrumentation and technologies - new methodologies - reference materials and methods - reference values and decision limits - quality and safety in laboratory medicine - translational laboratory medicine - clinical metrology Follow @cclm_degruyter on Twitter!