A novel method for screening malignant hematological diseases by constructing an optimal machine learning model based on blood cell parameters.

IF 3.8 3区 医学 Q2 MEDICAL INFORMATICS BMC Medical Informatics and Decision Making Pub Date : 2025-02-11 DOI:10.1186/s12911-025-02892-1
Dehua Sun, Wei Chen, Jun He, Yongjian He, Haoqin Jiang, Hong Jiang, Dandan Liu, Lu Li, Min Liu, Zhigang Mao, Chenxue Qu, Linlin Qu, Ziyong Sun, Jianbiao Wang, Wenjing Wu, Xuefeng Wang, Wei Xu, Ying Xing, Chi Zhang, Jingxian Zhang, Lei Zheng, Shihong Zhang, Bo Ye, Ming Guan
{"title":"A novel method for screening malignant hematological diseases by constructing an optimal machine learning model based on blood cell parameters.","authors":"Dehua Sun, Wei Chen, Jun He, Yongjian He, Haoqin Jiang, Hong Jiang, Dandan Liu, Lu Li, Min Liu, Zhigang Mao, Chenxue Qu, Linlin Qu, Ziyong Sun, Jianbiao Wang, Wenjing Wu, Xuefeng Wang, Wei Xu, Ying Xing, Chi Zhang, Jingxian Zhang, Lei Zheng, Shihong Zhang, Bo Ye, Ming Guan","doi":"10.1186/s12911-025-02892-1","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Screening of malignant hematological diseases is of great importance for their diagnosis and subsequent treatment. This study constructed an optimal screening model for malignant hematological diseases based on routine blood cell parameters.</p><p><strong>Methods: </strong>The venous blood samples of 1751 patients collected from 10 tertiary hospitals in China were divided into a training set (1223 cases) and a validation set (528 cases). In addition to the clinical diagnostic information of the samples in the training set, 26 blood cell parameters including morphological parameters were selected using manual screening and filtering to construct eight machine learning models. These models were used to identify hematological malignancies among the validation set.</p><p><strong>Results: </strong>Comparison of the discrimination, calibration and clinical detection performance of the eight machine learning models revealed that the artificial neural network (ANN) model performed the optimal in identifying malignant haematological diseases in the validation set (528 cases), with an area under the receiver operating characteristic curve (AUC), accuracy, sensitivity and specificity of 0.906, 0.857, 0.832 and 0.884, respectively.</p><p><strong>Conclusion: </strong>The ANN model constructed can be used for screening of malignant hematological diseases, especially in primary hospitals that lack comprehensive diagnosis, and this ANN model will help patients to get diagnosis and treatment of malignant hematological diseases as early as possible.</p>","PeriodicalId":9340,"journal":{"name":"BMC Medical Informatics and Decision Making","volume":"25 1","pages":"72"},"PeriodicalIF":3.8000,"publicationDate":"2025-02-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11816569/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"BMC Medical Informatics and Decision Making","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1186/s12911-025-02892-1","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MEDICAL INFORMATICS","Score":null,"Total":0}
引用次数: 0

Abstract

Background: Screening of malignant hematological diseases is of great importance for their diagnosis and subsequent treatment. This study constructed an optimal screening model for malignant hematological diseases based on routine blood cell parameters.

Methods: The venous blood samples of 1751 patients collected from 10 tertiary hospitals in China were divided into a training set (1223 cases) and a validation set (528 cases). In addition to the clinical diagnostic information of the samples in the training set, 26 blood cell parameters including morphological parameters were selected using manual screening and filtering to construct eight machine learning models. These models were used to identify hematological malignancies among the validation set.

Results: Comparison of the discrimination, calibration and clinical detection performance of the eight machine learning models revealed that the artificial neural network (ANN) model performed the optimal in identifying malignant haematological diseases in the validation set (528 cases), with an area under the receiver operating characteristic curve (AUC), accuracy, sensitivity and specificity of 0.906, 0.857, 0.832 and 0.884, respectively.

Conclusion: The ANN model constructed can be used for screening of malignant hematological diseases, especially in primary hospitals that lack comprehensive diagnosis, and this ANN model will help patients to get diagnosis and treatment of malignant hematological diseases as early as possible.

Abstract Image

Abstract Image

Abstract Image

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
通过构建基于血细胞参数的最佳机器学习模型筛查恶性血液病的新方法。
背景:恶性血液病的筛查对其诊断和后续治疗具有重要意义。本研究构建了基于常规血细胞参数的恶性血液病最佳筛选模型。方法:将全国10家三级医院1751例患者静脉血样本分为训练集(1223例)和验证集(528例)。除了训练集中样本的临床诊断信息外,通过人工筛选和过滤,选择包括形态学参数在内的26个血细胞参数,构建8个机器学习模型。这些模型用于在验证集中识别血液学恶性肿瘤。结果:对比8种机器学习模型的鉴别、校准和临床检测性能,人工神经网络(ANN)模型在528例验证集中对恶性血液病的识别效果最佳,其受试者工作特征曲线下面积(AUC)、准确率、灵敏度和特异性分别为0.906、0.857、0.832和0.884。结论:所构建的神经网络模型可用于恶性血液病的筛查,特别是在缺乏全面诊断的基层医院,该神经网络模型有助于患者尽早得到恶性血液病的诊断和治疗。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
CiteScore
7.20
自引率
5.70%
发文量
297
审稿时长
1 months
期刊介绍: BMC Medical Informatics and Decision Making is an open access journal publishing original peer-reviewed research articles in relation to the design, development, implementation, use, and evaluation of health information technologies and decision-making for human health.
期刊最新文献
Classification of pulmonary diseases using machine learning and deep learning models on GLI-2012 standardized spirometry features. Predicting the risk of activities of daily living dysfunction in middle-aged and older adults with comorbid hypertension and diabetes: a national population-based survey analysis. Explainable extratreeclassifier model for early detection of type 2 diabetes: evidence from the PERSIAN Dena Cohort. Harmonizing self-reported and free text medication data: a reproducible pipeline for gerontological research. Osmolarity trajectories and outcomes in patients with acute pancreatitis in the intensive care unit: group-based trajectory modeling.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1