首页 > 最新文献

BMC Medical Informatics and Decision Making最新文献

英文 中文
Explainable predictions of a machine learning model to forecast the postoperative length of stay for severe patients: machine learning model development and evaluation. 预测重症患者术后住院时间的机器学习模型的可解释预测:机器学习模型的开发与评估。
IF 3.3 3区 医学 Q2 MEDICAL INFORMATICS Pub Date : 2024-11-20 DOI: 10.1186/s12911-024-02755-1
Ha Na Cho, Imjin Ahn, Hansle Gwon, Hee Jun Kang, Yunha Kim, Hyeram Seo, Heejung Choi, Minkyoung Kim, Jiye Han, Gaeun Kee, Seohyun Park, Tae Joon Jun, Young-Hak Kim

Background: Predicting the length of stay in advance will not only benefit the hospitals both clinically and financially but enable healthcare providers to better decision-making for improved quality of care. More importantly, understanding the length of stay of severe patients who require general anesthesia is key to enhancing health outcomes.

Objective: Here, we aim to discover how machine learning can support resource allocation management and decision-making resulting from the length of stay prediction.

Methods: A retrospective cohort study was conducted from January 2018 to October 2020. A total cohort of 240,000 patients' medical records was collected. The data were collected exclusively for preoperative variables to accurately analyze the predictive factors impacting the postoperative length of stay. The main outcome of this study is an analysis of the length of stay (in days) after surgery until discharge. The prediction was performed with ridge regression, random forest, XGBoost, and multi-layer perceptron neural network models.

Results: The XGBoost resulted in the best performance with an average error within 3 days. Moreover, we explain each feature's contribution over the XGBoost model and further display distinct predictors affecting the overall prediction outcome at the patient level. The risk factors that most importantly contributed to the stay after surgery were as follows: a direct bilirubin laboratory test, department change, calcium chloride medication, gender, and diagnosis with the removal of other organs. Our results suggest that healthcare providers take into account the risk factors such as the laboratory blood test, distributing patients, and the medication prescribed prior to the surgery.

Conclusion: We successfully predicted the length of stay after surgery and provide explainable models with supporting analyses. In summary, we demonstrate the interpretation with the XGBoost model presenting insights on preoperative features and defining higher risk predictors to the length of stay outcome. Our development in explainable models supports the current in-depth knowledge for the future length of stay prediction on electronic medical records that aids the decision-making and facilitation of the operation department.

背景:提前预测住院时间不仅能使医院在临床和经济上受益,还能使医疗服务提供者更好地做出决策,从而提高医疗质量。更重要的是,了解需要全身麻醉的重症患者的住院时间是提高医疗效果的关键。目的:在此,我们旨在探索机器学习如何支持住院时间预测所带来的资源分配管理和决策:从 2018 年 1 月至 2020 年 10 月进行了一项回顾性队列研究。共收集了 24 万份患者病历。为准确分析影响术后住院时间的预测因素,专门收集了术前变量数据。本研究的主要结果是分析手术后到出院前的住院时间(天数)。预测采用了脊回归、随机森林、XGBoost 和多层感知器神经网络模型:结果:XGBoost 的效果最好,平均误差在 3 天以内。此外,我们还解释了每个特征对 XGBoost 模型的贡献,并进一步显示了影响患者层面整体预测结果的不同预测因素。对术后住院时间影响最大的风险因素如下:直接胆红素实验室检测、科室变更、氯化钙药物、性别和切除其他器官的诊断。我们的研究结果表明,医疗服务提供者应将实验室血液检测、患者分布和术前用药等风险因素考虑在内:我们成功地预测了手术后的住院时间,并提供了可解释的模型和辅助分析。总之,我们通过 XGBoost 模型展示了对术前特征的解释,并定义了住院时间结果的高风险预测因素。我们开发的可解释模型支持当前对电子病历中未来住院时间预测的深入了解,有助于手术部门的决策和便利性。
{"title":"Explainable predictions of a machine learning model to forecast the postoperative length of stay for severe patients: machine learning model development and evaluation.","authors":"Ha Na Cho, Imjin Ahn, Hansle Gwon, Hee Jun Kang, Yunha Kim, Hyeram Seo, Heejung Choi, Minkyoung Kim, Jiye Han, Gaeun Kee, Seohyun Park, Tae Joon Jun, Young-Hak Kim","doi":"10.1186/s12911-024-02755-1","DOIUrl":"https://doi.org/10.1186/s12911-024-02755-1","url":null,"abstract":"<p><strong>Background: </strong>Predicting the length of stay in advance will not only benefit the hospitals both clinically and financially but enable healthcare providers to better decision-making for improved quality of care. More importantly, understanding the length of stay of severe patients who require general anesthesia is key to enhancing health outcomes.</p><p><strong>Objective: </strong>Here, we aim to discover how machine learning can support resource allocation management and decision-making resulting from the length of stay prediction.</p><p><strong>Methods: </strong>A retrospective cohort study was conducted from January 2018 to October 2020. A total cohort of 240,000 patients' medical records was collected. The data were collected exclusively for preoperative variables to accurately analyze the predictive factors impacting the postoperative length of stay. The main outcome of this study is an analysis of the length of stay (in days) after surgery until discharge. The prediction was performed with ridge regression, random forest, XGBoost, and multi-layer perceptron neural network models.</p><p><strong>Results: </strong>The XGBoost resulted in the best performance with an average error within 3 days. Moreover, we explain each feature's contribution over the XGBoost model and further display distinct predictors affecting the overall prediction outcome at the patient level. The risk factors that most importantly contributed to the stay after surgery were as follows: a direct bilirubin laboratory test, department change, calcium chloride medication, gender, and diagnosis with the removal of other organs. Our results suggest that healthcare providers take into account the risk factors such as the laboratory blood test, distributing patients, and the medication prescribed prior to the surgery.</p><p><strong>Conclusion: </strong>We successfully predicted the length of stay after surgery and provide explainable models with supporting analyses. In summary, we demonstrate the interpretation with the XGBoost model presenting insights on preoperative features and defining higher risk predictors to the length of stay outcome. Our development in explainable models supports the current in-depth knowledge for the future length of stay prediction on electronic medical records that aids the decision-making and facilitation of the operation department.</p>","PeriodicalId":9340,"journal":{"name":"BMC Medical Informatics and Decision Making","volume":"24 1","pages":"350"},"PeriodicalIF":3.3,"publicationDate":"2024-11-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142675306","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Modified multiscale Renyi distribution entropy for short-term heart rate variability analysis. 用于短期心率变异性分析的修正多尺度仁义分布熵。
IF 3.3 3区 医学 Q2 MEDICAL INFORMATICS Pub Date : 2024-11-19 DOI: 10.1186/s12911-024-02763-1
Manhong Shi, Yinuo Shi, Yuxin Lin, Xue Qi

Background: Multiscale sample entropy (MSE) is a prevalent complexity metric to characterize a time series and has been extensively applied to the physiological signal analysis. However, for a short-term time series, the likelihood of identifying comparable subsequences decreases, leading to higher variability in the Sample Entropy (SampEn) calculation. Additionally, as the scale factor increases in the MSE calculation, the coarse-graining process further shortens the time series. Consequently, each newly generated time series at a larger scale consists of fewer data points, potentially resulting in unreliable or undefined entropy values, particularly at higher scales. To overcome the shortcoming, a modified multiscale Renyi distribution entropy (MMRDis) was proposed in our present work.

Methods: The MMRDis method uses a moving-averaging procedure to acquire a family of time series, each of which quantify the dynamic behaviors of the short-term time series over the multiple temporal scales. Then, MMRDis is constructed for the original and the coarse-grained time series.

Results: The MMRDis method demonstrated superior computational stability on simulated Gaussian white and 1/f noise time series, effectively avoiding undefined measurements in short-term time series. Analysis of short-term heart rate variability (HRV) signals from healthy elderly individuals, healthy young people, and subjects with congestive heart failure and atrial fibrillation revealed that MMRDis complexity measurement values decreased with aging and disease. Additionally, MMRDis exhibited better distinction capability for short-term HRV physiological/pathological signals compared to several recently proposed complexity metrics.

Conclusions: MMRDis was a promising measurement for screening cardiovascular condition within a short time.

背景:多尺度样本熵(MSE)是表征时间序列复杂性的常用指标,已被广泛应用于生理信号分析。然而,对于短期时间序列来说,识别可比子序列的可能性会降低,从而导致样本熵(SampEn)计算的变异性增大。此外,随着 MSE 计算中比例因子的增加,粗粒化过程会进一步缩短时间序列。因此,在更大尺度下新生成的每个时间序列包含的数据点更少,可能导致熵值不可靠或不确定,尤其是在更高的尺度下。为了克服这一缺陷,我们在本研究中提出了一种改进的多尺度仁义分布熵(MMRDis)方法:方法:MMRDis 方法使用移动平均程序获取时间序列系列,每个系列量化短期时间序列在多个时间尺度上的动态行为。然后,为原始时间序列和粗粒度时间序列构建 MMRDis:结果:MMRDis 方法在模拟高斯白噪声和 1/f 噪声时间序列上表现出卓越的计算稳定性,有效避免了短期时间序列中的未定义测量。对健康老年人、健康年轻人以及充血性心力衰竭和心房颤动受试者的短期心率变异性(HRV)信号进行分析后发现,MMRDis 的复杂度测量值随着年龄的增长和疾病的发生而降低。此外,与最近提出的几种复杂度指标相比,MMRDis 对短期心率变异生理/病理信号的区分能力更强:结论:MMRDis 是一种在短时间内筛查心血管状况的有前途的测量方法。
{"title":"Modified multiscale Renyi distribution entropy for short-term heart rate variability analysis.","authors":"Manhong Shi, Yinuo Shi, Yuxin Lin, Xue Qi","doi":"10.1186/s12911-024-02763-1","DOIUrl":"https://doi.org/10.1186/s12911-024-02763-1","url":null,"abstract":"<p><strong>Background: </strong>Multiscale sample entropy (MSE) is a prevalent complexity metric to characterize a time series and has been extensively applied to the physiological signal analysis. However, for a short-term time series, the likelihood of identifying comparable subsequences decreases, leading to higher variability in the Sample Entropy (SampEn) calculation. Additionally, as the scale factor increases in the MSE calculation, the coarse-graining process further shortens the time series. Consequently, each newly generated time series at a larger scale consists of fewer data points, potentially resulting in unreliable or undefined entropy values, particularly at higher scales. To overcome the shortcoming, a modified multiscale Renyi distribution entropy (MMRDis) was proposed in our present work.</p><p><strong>Methods: </strong>The MMRDis method uses a moving-averaging procedure to acquire a family of time series, each of which quantify the dynamic behaviors of the short-term time series over the multiple temporal scales. Then, MMRDis is constructed for the original and the coarse-grained time series.</p><p><strong>Results: </strong>The MMRDis method demonstrated superior computational stability on simulated Gaussian white and 1/f noise time series, effectively avoiding undefined measurements in short-term time series. Analysis of short-term heart rate variability (HRV) signals from healthy elderly individuals, healthy young people, and subjects with congestive heart failure and atrial fibrillation revealed that MMRDis complexity measurement values decreased with aging and disease. Additionally, MMRDis exhibited better distinction capability for short-term HRV physiological/pathological signals compared to several recently proposed complexity metrics.</p><p><strong>Conclusions: </strong>MMRDis was a promising measurement for screening cardiovascular condition within a short time.</p>","PeriodicalId":9340,"journal":{"name":"BMC Medical Informatics and Decision Making","volume":"24 1","pages":"346"},"PeriodicalIF":3.3,"publicationDate":"2024-11-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142675307","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Anomaly-based threat detection in smart health using machine learning. 利用机器学习在智能健康领域进行基于异常的威胁检测。
IF 3.3 3区 医学 Q2 MEDICAL INFORMATICS Pub Date : 2024-11-19 DOI: 10.1186/s12911-024-02760-4
Muntaha Tabassum, Saba Mahmood, Amal Bukhari, Bader Alshemaimri, Ali Daud, Fatima Khalique

Background: Anomaly detection is crucial in healthcare data due to challenges associated with the integration of smart technologies and healthcare. Anomaly in electronic health record can be associated with an insider trying to access and manipulate the data. This article focuses around the anomalies under different contexts.

Methodology: This research has proposed methodology to secure Electronic Health Records (EHRs) within a complex environment. We have employed a systematic approach encompassing data preprocessing, labeling, modeling, and evaluation. Anomalies are not labelled thus a mechanism is required that predicts them with greater accuracy and less false positive results. This research utilized unsupervised machine learning algorithms that includes Isolation Forest and Local Outlier Factor clustering algorithms. By calculating anomaly scores and validating clustering through metrics like the Silhouette Score and Dunn Score, we enhanced the capacity to secure sensitive healthcare data evolving digital threats. Three variations of Isolation Forest (IForest)models (SVM, Decision Tree, and Random Forest) and three variations of Local Outlier Factor (LOF) models (SVM, Decision Tree, and Random Forest) are evaluated based on accuracy, sensitivity, specificity, and F1 Score.

Results: Isolation Forest SVM achieves the highest accuracy of 99.21%, high sensitivity (99.75%) and specificity (99.32%), and a commendable F1 Score of 98.72%. The Isolation Forest Decision Tree also performs well with an accuracy of 98.92% and an F1 Score of 99.35%. However, the Isolation Forest Random Forest exhibits lower specificity (72.84%) than the other models.

Conclusion: The experimental results reveal that Isolation Forest SVM emerges as the top performer showcasing the effectiveness of these models in anomaly detection tasks. The proposed methodology utilizing isolation forest and SVM produced better results by detecting anomalies with less false positives in this specific EHR of a hospital in North England. Furthermore the proposal is also able to identify new contextual anomalies that were not identified in the baseline methodology.

背景:由于智能技术与医疗保健的融合带来的挑战,异常检测对医疗保健数据至关重要。电子健康记录中的异常可能与内部人员试图访问和操纵数据有关。本文主要围绕不同背景下的异常情况展开:本研究提出了在复杂环境中确保电子健康记录(EHR)安全的方法。我们采用的系统方法包括数据预处理、标记、建模和评估。异常情况没有标签,因此需要一种机制来预测异常情况,以提高准确率,减少误报。这项研究采用了无监督机器学习算法,包括隔离林和局部离群因子聚类算法。通过计算异常分数,并通过 Silhouette Score 和 Dunn Score 等指标验证聚类,我们提高了保护敏感医疗保健数据的能力,使数字威胁不断演变。我们根据准确性、灵敏度、特异性和 F1 分数对三种不同的隔离森林(IForest)模型(SVM、决策树和随机森林)和三种不同的局部离群因子(LOF)模型(SVM、决策树和随机森林)进行了评估:隔离森林 SVM 的准确率最高,达到 99.21%,灵敏度(99.75%)和特异度(99.32%)都很高,F1 分数也达到了 98.72%。隔离森林决策树也表现出色,准确率为 98.92%,F1 得分为 99.35%。然而,Isolation Forest 随机森林的特异性(72.84%)低于其他模型:实验结果表明,Isolation Forest SVM 在异常检测任务中表现最出色,展示了这些模型的有效性。利用隔离林和 SVM 提出的方法在英格兰北部一家医院的特定电子病历中以较少的误报率检测出异常,从而取得了更好的效果。此外,该建议还能识别出基线方法未识别出的新的上下文异常。
{"title":"Anomaly-based threat detection in smart health using machine learning.","authors":"Muntaha Tabassum, Saba Mahmood, Amal Bukhari, Bader Alshemaimri, Ali Daud, Fatima Khalique","doi":"10.1186/s12911-024-02760-4","DOIUrl":"https://doi.org/10.1186/s12911-024-02760-4","url":null,"abstract":"<p><strong>Background: </strong>Anomaly detection is crucial in healthcare data due to challenges associated with the integration of smart technologies and healthcare. Anomaly in electronic health record can be associated with an insider trying to access and manipulate the data. This article focuses around the anomalies under different contexts.</p><p><strong>Methodology: </strong>This research has proposed methodology to secure Electronic Health Records (EHRs) within a complex environment. We have employed a systematic approach encompassing data preprocessing, labeling, modeling, and evaluation. Anomalies are not labelled thus a mechanism is required that predicts them with greater accuracy and less false positive results. This research utilized unsupervised machine learning algorithms that includes Isolation Forest and Local Outlier Factor clustering algorithms. By calculating anomaly scores and validating clustering through metrics like the Silhouette Score and Dunn Score, we enhanced the capacity to secure sensitive healthcare data evolving digital threats. Three variations of Isolation Forest (IForest)models (SVM, Decision Tree, and Random Forest) and three variations of Local Outlier Factor (LOF) models (SVM, Decision Tree, and Random Forest) are evaluated based on accuracy, sensitivity, specificity, and F1 Score.</p><p><strong>Results: </strong>Isolation Forest SVM achieves the highest accuracy of 99.21%, high sensitivity (99.75%) and specificity (99.32%), and a commendable F1 Score of 98.72%. The Isolation Forest Decision Tree also performs well with an accuracy of 98.92% and an F1 Score of 99.35%. However, the Isolation Forest Random Forest exhibits lower specificity (72.84%) than the other models.</p><p><strong>Conclusion: </strong>The experimental results reveal that Isolation Forest SVM emerges as the top performer showcasing the effectiveness of these models in anomaly detection tasks. The proposed methodology utilizing isolation forest and SVM produced better results by detecting anomalies with less false positives in this specific EHR of a hospital in North England. Furthermore the proposal is also able to identify new contextual anomalies that were not identified in the baseline methodology.</p>","PeriodicalId":9340,"journal":{"name":"BMC Medical Informatics and Decision Making","volume":"24 1","pages":"347"},"PeriodicalIF":3.3,"publicationDate":"2024-11-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142675271","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
DAPNet: multi-view graph contrastive network incorporating disease clinical and molecular associations for disease progression prediction. DAPNet:结合疾病临床和分子关联的多视图对比网络,用于疾病进展预测。
IF 3.3 3区 医学 Q2 MEDICAL INFORMATICS Pub Date : 2024-11-19 DOI: 10.1186/s12911-024-02756-0
Haoyu Tian, Xiong He, Kuo Yang, Xinyu Dai, Yiming Liu, Fengjin Zhang, Zixin Shu, Qiguang Zheng, Shihua Wang, Jianan Xia, Tiancai Wen, Baoyan Liu, Jian Yu, Xuezhong Zhou

Background: Timely and accurate prediction of disease progress is crucial for facilitating early intervention and treatment for various chronic diseases. However, due to the complicated and longitudinal nature of disease progression, the capacity and completeness of clinical data required for training deep learning models remains a significant challenge. This study aims to explore a new method that reduces data dependency and achieves predictive performance comparable to existing research.

Methods: This study proposed DAPNet, a deep learning-based disease progression prediction model that solely utilizes the comorbidity duration (without relying on multi-modal data or comprehensive medical records) and disease associations from biomedical knowledge graphs to deliver high-performance prediction. DAPNet is the first to apply multi-view graph contrastive learning to disease progression prediction tasks. Compared with other studies on comorbidities, DAPNet innovatively integrates molecular-level disease association information, combines disease co-occurrence and ICD10, and fully explores the associations between diseases; RESULTS: This study validated DAPNet using a de-identified clinical dataset derived from medical claims, which includes 2,714 patients and 10,856 visits. Meanwhile, a kidney dataset (606 patients) based on MIMIC-IV has also been constructed to fully validate its performance. The results showed that DAPNet achieved state-of-the-art performance on the severe pneumonia dataset (F1=0.84, with an improvement of 8.7%), and outperformed the six baseline models on the kidney disease dataset (F1=0.80, with an improvement of 21.3%). Through case analysis, we elucidated the clinical and molecular associations identified by the DAPNet model, which facilitated a better understanding and explanation of potential disease association, thereby providing interpretability for the model.

Conclusions: The proposed DAPNet, for the first time, utilizes comorbidity duration and disease associations network, enabling more accurate disease progression prediction based on a multi-view graph contrastive learning, which provides valuable insights for early diagnosis and treatment of patients. Based on disease association networks, our research has enhanced the interpretability of disease progression predictions.

背景:及时准确地预测疾病进展对于促进各种慢性疾病的早期干预和治疗至关重要。然而,由于疾病进展的复杂性和纵向性,训练深度学习模型所需的临床数据的容量和完整性仍然是一个重大挑战。本研究旨在探索一种新方法,既能降低数据依赖性,又能达到与现有研究相当的预测性能:本研究提出了基于深度学习的疾病进展预测模型 DAPNet,该模型仅利用合并症持续时间(不依赖多模态数据或全面的医疗记录)和生物医学知识图谱中的疾病关联来提供高性能预测。DAPNet 首次将多视图对比学习应用于疾病进展预测任务。与其他有关合并症的研究相比,DAPNet 创新性地整合了分子水平的疾病关联信息,结合了疾病共现和 ICD10,充分挖掘了疾病之间的关联;结果:本研究使用了一个去标识化的临床数据集对 DAPNet 进行了验证,该数据集来自医疗索赔,包括 2714 名患者和 10856 次就诊。同时,还构建了基于 MIMIC-IV 的肾脏数据集(606 名患者),以充分验证其性能。结果表明,DAPNet 在重症肺炎数据集上取得了最先进的性能(F1=0.84,提高了 8.7%),在肾病数据集上的表现优于六个基线模型(F1=0.80,提高了 21.3%)。通过病例分析,我们阐明了DAPNet模型确定的临床和分子关联,这有助于更好地理解和解释潜在的疾病关联,从而为模型提供了可解释性:所提出的 DAPNet 首次利用了合并症持续时间和疾病关联网络,在多视图对比学习的基础上实现了更准确的疾病进展预测,为患者的早期诊断和治疗提供了有价值的见解。基于疾病关联网络,我们的研究增强了疾病进展预测的可解释性。
{"title":"DAPNet: multi-view graph contrastive network incorporating disease clinical and molecular associations for disease progression prediction.","authors":"Haoyu Tian, Xiong He, Kuo Yang, Xinyu Dai, Yiming Liu, Fengjin Zhang, Zixin Shu, Qiguang Zheng, Shihua Wang, Jianan Xia, Tiancai Wen, Baoyan Liu, Jian Yu, Xuezhong Zhou","doi":"10.1186/s12911-024-02756-0","DOIUrl":"https://doi.org/10.1186/s12911-024-02756-0","url":null,"abstract":"<p><strong>Background: </strong>Timely and accurate prediction of disease progress is crucial for facilitating early intervention and treatment for various chronic diseases. However, due to the complicated and longitudinal nature of disease progression, the capacity and completeness of clinical data required for training deep learning models remains a significant challenge. This study aims to explore a new method that reduces data dependency and achieves predictive performance comparable to existing research.</p><p><strong>Methods: </strong>This study proposed DAPNet, a deep learning-based disease progression prediction model that solely utilizes the comorbidity duration (without relying on multi-modal data or comprehensive medical records) and disease associations from biomedical knowledge graphs to deliver high-performance prediction. DAPNet is the first to apply multi-view graph contrastive learning to disease progression prediction tasks. Compared with other studies on comorbidities, DAPNet innovatively integrates molecular-level disease association information, combines disease co-occurrence and ICD10, and fully explores the associations between diseases; RESULTS: This study validated DAPNet using a de-identified clinical dataset derived from medical claims, which includes 2,714 patients and 10,856 visits. Meanwhile, a kidney dataset (606 patients) based on MIMIC-IV has also been constructed to fully validate its performance. The results showed that DAPNet achieved state-of-the-art performance on the severe pneumonia dataset (F1=0.84, with an improvement of 8.7%), and outperformed the six baseline models on the kidney disease dataset (F1=0.80, with an improvement of 21.3%). Through case analysis, we elucidated the clinical and molecular associations identified by the DAPNet model, which facilitated a better understanding and explanation of potential disease association, thereby providing interpretability for the model.</p><p><strong>Conclusions: </strong>The proposed DAPNet, for the first time, utilizes comorbidity duration and disease associations network, enabling more accurate disease progression prediction based on a multi-view graph contrastive learning, which provides valuable insights for early diagnosis and treatment of patients. Based on disease association networks, our research has enhanced the interpretability of disease progression predictions.</p>","PeriodicalId":9340,"journal":{"name":"BMC Medical Informatics and Decision Making","volume":"24 1","pages":"345"},"PeriodicalIF":3.3,"publicationDate":"2024-11-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142675282","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Risk factors and prediction model for acute ischemic stroke after off-pump coronary artery bypass grafting based on Bayesian network. 基于贝叶斯网络的冠状动脉旁路移植术后急性缺血性中风的风险因素和预测模型。
IF 3.3 3区 医学 Q2 MEDICAL INFORMATICS Pub Date : 2024-11-19 DOI: 10.1186/s12911-024-02762-2
Wenlong Zou, Haipeng Zhao, Ming Ren, Chaoxiong Cui, Guobin Yuan, Boyi Yuan, Zeyu Ji, Chao Wu, Bin Cai, Tingting Yang, Jinjun Zou, Guangzhi Liu

Background: This study aimed to identify the risk factors of acute ischemic stroke (AIS) occurring during hospitalization in patients following off-pump coronary artery bypass grafting (OPCABG) and utilize Bayesian network (BN) methods to establish predictive models for this disease.

Methods: Data were collected from the electronic health records of adult patients who underwent OPCABG at Beijing Anzhen Hospital from January 2018 to December 2022. Patients were allocated to the training and test sets in an 8:2 ratio according to the principle of randomness. Subsequently, a BN model was established using the training dataset and validated against the testing dataset. The BN model was developed using a tabu search algorithm. Finally, receiver operating characteristic (ROC) and calibration curves were plotted to assess the extent of disparity in predictive performance between the BN and logistic models.

Results: A total of 10,184 patients (mean (SD) age, 62.45 (8.7) years; 2524 (24.7%) females) were enrolled, including 151 (1.5%) with AIS and 10,033 (98.5%) without AIS. Female sex, history of ischemic stroke, severe carotid artery stenosis, high glycated albumin (GA) levels, high D-dimer levels, high erythrocyte distribution width (RDW), and high blood urea nitrogen (BUN) levels were strongly associated with AIS. Type 2 diabetes mellitus (T2DM) was indirectly linked to AIS through GA and BUN. The BN models exhibited superior performance to logistic regression in both the training and testing sets, achieving accuracies of 72.64% and 71.48%, area under the curve (AUC) of 0.899 (95% confidence interval (CI), 0.876-0.921) and 0.852 (95% CI, 0.769-0.935), sensitivities of 91.87% and 89.29%, and specificities of 72.35% and 71.24% (using the optimal cut-off), respectively.

Conclusion: Female gender, IS history, carotid stenosis (> 70%), RDW-CV, GA, D-dimer, BUN, and T2DM are potential predictors of IS in our Chinese cohort. The BN model demonstrated greater efficiency than the logistic regression model. Hence, employing BN models could be conducive to the early diagnosis and prevention of AIS after OPCABG.

背景:本研究旨在确定非体外循环冠状动脉搭桥术(OPCABG)患者住院期间发生急性缺血性卒中(AIS)的风险因素,并利用贝叶斯网络(BN)方法建立该疾病的预测模型:从2018年1月至2022年12月期间在北京安贞医院接受OPCABG手术的成年患者的电子健康档案中收集数据。根据随机性原则,按照 8:2 的比例将患者分配到训练集和测试集。随后,利用训练数据集建立 BN 模型,并根据测试数据集进行验证。BN 模型是使用塔布搜索算法建立的。最后,绘制了接收者操作特征曲线(ROC)和校准曲线,以评估 BN 模型和逻辑模型在预测性能上的差异程度:共有 10184 名患者(平均(标清)年龄为 62.45(8.7)岁;2524 名女性(24.7%))入组,其中包括 151 名 AIS 患者(1.5%)和 10033 名无 AIS 患者(98.5%)。女性性别、缺血性中风病史、颈动脉严重狭窄、糖化白蛋白(GA)水平高、D-二聚体水平高、红细胞分布宽度(RDW)高和血尿素氮(BUN)水平高与 AIS 密切相关。2 型糖尿病(T2DM)通过 GA 和 BUN 与 AIS 间接相关。在训练集和测试集中,BN 模型的表现均优于逻辑回归,准确率分别为 72.64% 和 71.48%,曲线下面积 (AUC) 分别为 0.899(95% 置信区间 (CI),0.876-0.921)和 0.852(95% 置信区间 (CI),0.769-0.935),灵敏度分别为 91.87% 和 89.29%,特异性分别为 72.35% 和 71.24%(使用最佳临界值):结论:女性性别、IS病史、颈动脉狭窄(> 70%)、RDW-CV、GA、D-二聚体、BUN和T2DM是中国队列中IS的潜在预测因素。BN 模型比逻辑回归模型更有效。因此,采用 BN 模型有助于早期诊断和预防 OPCABG 后的 AIS。
{"title":"Risk factors and prediction model for acute ischemic stroke after off-pump coronary artery bypass grafting based on Bayesian network.","authors":"Wenlong Zou, Haipeng Zhao, Ming Ren, Chaoxiong Cui, Guobin Yuan, Boyi Yuan, Zeyu Ji, Chao Wu, Bin Cai, Tingting Yang, Jinjun Zou, Guangzhi Liu","doi":"10.1186/s12911-024-02762-2","DOIUrl":"https://doi.org/10.1186/s12911-024-02762-2","url":null,"abstract":"<p><strong>Background: </strong>This study aimed to identify the risk factors of acute ischemic stroke (AIS) occurring during hospitalization in patients following off-pump coronary artery bypass grafting (OPCABG) and utilize Bayesian network (BN) methods to establish predictive models for this disease.</p><p><strong>Methods: </strong>Data were collected from the electronic health records of adult patients who underwent OPCABG at Beijing Anzhen Hospital from January 2018 to December 2022. Patients were allocated to the training and test sets in an 8:2 ratio according to the principle of randomness. Subsequently, a BN model was established using the training dataset and validated against the testing dataset. The BN model was developed using a tabu search algorithm. Finally, receiver operating characteristic (ROC) and calibration curves were plotted to assess the extent of disparity in predictive performance between the BN and logistic models.</p><p><strong>Results: </strong>A total of 10,184 patients (mean (SD) age, 62.45 (8.7) years; 2524 (24.7%) females) were enrolled, including 151 (1.5%) with AIS and 10,033 (98.5%) without AIS. Female sex, history of ischemic stroke, severe carotid artery stenosis, high glycated albumin (GA) levels, high D-dimer levels, high erythrocyte distribution width (RDW), and high blood urea nitrogen (BUN) levels were strongly associated with AIS. Type 2 diabetes mellitus (T2DM) was indirectly linked to AIS through GA and BUN. The BN models exhibited superior performance to logistic regression in both the training and testing sets, achieving accuracies of 72.64% and 71.48%, area under the curve (AUC) of 0.899 (95% confidence interval (CI), 0.876-0.921) and 0.852 (95% CI, 0.769-0.935), sensitivities of 91.87% and 89.29%, and specificities of 72.35% and 71.24% (using the optimal cut-off), respectively.</p><p><strong>Conclusion: </strong>Female gender, IS history, carotid stenosis (> 70%), RDW-CV, GA, D-dimer, BUN, and T2DM are potential predictors of IS in our Chinese cohort. The BN model demonstrated greater efficiency than the logistic regression model. Hence, employing BN models could be conducive to the early diagnosis and prevention of AIS after OPCABG.</p>","PeriodicalId":9340,"journal":{"name":"BMC Medical Informatics and Decision Making","volume":"24 1","pages":"349"},"PeriodicalIF":3.3,"publicationDate":"2024-11-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142674788","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Paramedic perceptions of decision-making when managing mental health-related presentations: a qualitative study. 辅助医务人员在处理与精神健康有关的病例时对决策的看法:一项定性研究。
IF 3.3 3区 医学 Q2 MEDICAL INFORMATICS Pub Date : 2024-11-19 DOI: 10.1186/s12911-024-02768-w
Kate Emond, George Mnatzaganian, Michael Savic, Dan I Lubman, Melanie Bish

Background: Mental health presentations account for a considerable proportion of paramedic workload; however, the decision-making involved in managing these cases is poorly understood. This study aimed to explore how paramedics perceive their clinical decision-making when managing mental health presentations.

Methods: A qualitative descriptive study design was employed. Overall, 73 paramedics participated in semi structured interviews, and data were analyzed from transcribed interviews in NVivo.

Results: Four themes emerged that reflected participants' perceptions: the assessment process, experience, the use of documents and standard procedures, and consultation with other healthcare providers. There were conflicting perceptions about the clinical decision-making process, with perception of role having a potential impact. The dual process theory of clinical decision-making, which includes both analytical and intuitive approaches, was evident in the decision-making process.

Conclusion: Incorporating dual process theory into education and training, which highlights the strengths and weaknesses of analytical and intuitive decision-making, may reduce clinical errors made by cognitive bias. To further support clinical decision-making, additional education and training are warranted to promote critical thinking and clarify the scope of practice and roles when attending to mental health-related presentations.

背景:精神疾病在辅助医务人员的工作量中占有相当大的比例;然而,人们对管理这些病例所涉及的决策却知之甚少。本研究旨在探讨辅助医务人员在管理精神疾病患者时如何看待他们的临床决策:方法:采用定性描述研究设计。共有 73 名护理人员参加了半结构式访谈,并使用 NVivo 对访谈记录进行了数据分析:结果:出现了四个反映参与者看法的主题:评估过程、经验、文件和标准程序的使用以及与其他医疗服务提供者的协商。对临床决策过程的认识存在冲突,其中对角色的认识可能会产生影响。临床决策的双重过程理论包括分析和直觉两种方法,在决策过程中表现明显:结论:在教育和培训中纳入双重过程理论,强调分析和直觉决策的优缺点,可减少因认知偏差造成的临床错误。为了进一步支持临床决策,有必要开展更多的教育和培训,以促进批判性思维,并明确在处理与精神健康相关的病例时的实践范围和角色。
{"title":"Paramedic perceptions of decision-making when managing mental health-related presentations: a qualitative study.","authors":"Kate Emond, George Mnatzaganian, Michael Savic, Dan I Lubman, Melanie Bish","doi":"10.1186/s12911-024-02768-w","DOIUrl":"https://doi.org/10.1186/s12911-024-02768-w","url":null,"abstract":"<p><strong>Background: </strong>Mental health presentations account for a considerable proportion of paramedic workload; however, the decision-making involved in managing these cases is poorly understood. This study aimed to explore how paramedics perceive their clinical decision-making when managing mental health presentations.</p><p><strong>Methods: </strong>A qualitative descriptive study design was employed. Overall, 73 paramedics participated in semi structured interviews, and data were analyzed from transcribed interviews in NVivo.</p><p><strong>Results: </strong>Four themes emerged that reflected participants' perceptions: the assessment process, experience, the use of documents and standard procedures, and consultation with other healthcare providers. There were conflicting perceptions about the clinical decision-making process, with perception of role having a potential impact. The dual process theory of clinical decision-making, which includes both analytical and intuitive approaches, was evident in the decision-making process.</p><p><strong>Conclusion: </strong>Incorporating dual process theory into education and training, which highlights the strengths and weaknesses of analytical and intuitive decision-making, may reduce clinical errors made by cognitive bias. To further support clinical decision-making, additional education and training are warranted to promote critical thinking and clarify the scope of practice and roles when attending to mental health-related presentations.</p>","PeriodicalId":9340,"journal":{"name":"BMC Medical Informatics and Decision Making","volume":"24 1","pages":"348"},"PeriodicalIF":3.3,"publicationDate":"2024-11-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142675310","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A novel prediction model for the prognosis of non-small cell lung cancer with clinical routine laboratory indicators: a machine learning approach. 利用临床常规实验室指标的非小细胞肺癌预后新型预测模型:一种机器学习方法。
IF 3.3 3区 医学 Q2 MEDICAL INFORMATICS Pub Date : 2024-11-18 DOI: 10.1186/s12911-024-02753-3
Yuli Wang, Na Mei, Ziyi Zhou, Yuan Fang, Jiacheng Lin, Fanchen Zhao, Zhihong Fang, Yan Li

Background: Lung cancer is characterized by high morbidity and mortality due to the lack of practical early diagnostic and prognostic tools. The present study uses machine learning algorithms to construct a clinical predictive model for non-small cell lung cancer (NSCLC) patients.

Methods: Laboratory indices of the NSCLC patients at their initial visit were collected for quality control and exploratory analysis. By comparing the levels of the above indices between the survival and death groups, the statistically significant indices were selected for subsequent machine learning modeling. Ten machine learning algorithms were then employed to develop the predictive models with survival and recurrence as outcomes, respectively. Moreover, regression models were constructed using the random survival forest algorithm by incorporating the survival time dimension. Finally, critical variables in the optimal model were screened based on the interpretable algorithms to build a decision tree to facilitate clinical application.

Results: 682 patients were enrolled according to the inclusion and exclusion criteria. The preliminary comparison results revealed that except for fast blood glucose, CD3+T cell proportion, NK cell proportion, and CA72-4, there were significant statistical differences in other tumor markers, inflammation, metabolism, and immune-related indices between the survival and death groups (p < 0.01). Subsequently, indices with statistical differences were incorporated into machine learning modeling and evaluation. The results showed that among the ten prognostic models constructed using survival status as the outcome, the neural network model obtained the best predictive performance, with accuracy, sensitivity, specificity, AUC, and precision values of 0.993, 0.987, 1.000, 0.994, and 1.000, respectively. The corresponding SHAP16 algorithm revealed that the top five variables in terms of importance were interleukin6 (IL-6), soluble interleukin2 receptor (sIL-2R), cholesterol, CEA, and Cy211, respectively. The random survival forest model also confirmed the critical role of CEA, sIL-2R, and IL-6 in predicting the prognosis of NSCLC patients. A decision tree model with seven cut-off points based on the above three indices was eventually built for clinical application.

Conclusion: The neural network model exhibited ideal predictive performance in the survival status of NSCLC patients, and the decision tree model constructed based on selected important variables was conducive to rapid bedside prognosis assessment and decision-making.

背景:由于缺乏实用的早期诊断和预后工具,肺癌具有发病率和死亡率高的特点。本研究利用机器学习算法构建了非小细胞肺癌(NSCLC)患者的临床预测模型:方法:收集非小细胞肺癌患者初诊时的实验室指标,用于质量控制和探索性分析。通过比较生存组和死亡组之间的上述指标水平,选出具有统计学意义的指标用于随后的机器学习建模。随后,十种机器学习算法分别以存活和复发为结果建立预测模型。此外,利用随机生存森林算法,结合生存时间维度,构建了回归模型。最后,根据可解释算法筛选出最优模型中的关键变量,建立决策树,以方便临床应用:根据纳入和排除标准,共纳入 682 例患者。初步比较结果显示,除空腹血糖、CD3+T细胞比例、NK细胞比例和CA72-4外,生存组和死亡组在其他肿瘤标志物、炎症、代谢和免疫相关指标方面均有显著统计学差异(P 结论:神经网络模型具有理想的预测效果:神经网络模型对 NSCLC 患者的生存状况具有理想的预测性能,而基于所选重要变量构建的决策树模型有利于床旁快速评估预后和做出决策。
{"title":"A novel prediction model for the prognosis of non-small cell lung cancer with clinical routine laboratory indicators: a machine learning approach.","authors":"Yuli Wang, Na Mei, Ziyi Zhou, Yuan Fang, Jiacheng Lin, Fanchen Zhao, Zhihong Fang, Yan Li","doi":"10.1186/s12911-024-02753-3","DOIUrl":"10.1186/s12911-024-02753-3","url":null,"abstract":"<p><strong>Background: </strong>Lung cancer is characterized by high morbidity and mortality due to the lack of practical early diagnostic and prognostic tools. The present study uses machine learning algorithms to construct a clinical predictive model for non-small cell lung cancer (NSCLC) patients.</p><p><strong>Methods: </strong>Laboratory indices of the NSCLC patients at their initial visit were collected for quality control and exploratory analysis. By comparing the levels of the above indices between the survival and death groups, the statistically significant indices were selected for subsequent machine learning modeling. Ten machine learning algorithms were then employed to develop the predictive models with survival and recurrence as outcomes, respectively. Moreover, regression models were constructed using the random survival forest algorithm by incorporating the survival time dimension. Finally, critical variables in the optimal model were screened based on the interpretable algorithms to build a decision tree to facilitate clinical application.</p><p><strong>Results: </strong>682 patients were enrolled according to the inclusion and exclusion criteria. The preliminary comparison results revealed that except for fast blood glucose, CD<sub>3</sub><sup>+</sup>T cell proportion, NK cell proportion, and CA72-4, there were significant statistical differences in other tumor markers, inflammation, metabolism, and immune-related indices between the survival and death groups (p < 0.01). Subsequently, indices with statistical differences were incorporated into machine learning modeling and evaluation. The results showed that among the ten prognostic models constructed using survival status as the outcome, the neural network model obtained the best predictive performance, with accuracy, sensitivity, specificity, AUC, and precision values of 0.993, 0.987, 1.000, 0.994, and 1.000, respectively. The corresponding SHAP16 algorithm revealed that the top five variables in terms of importance were interleukin6 (IL-6), soluble interleukin2 receptor (sIL-2R), cholesterol, CEA, and Cy211, respectively. The random survival forest model also confirmed the critical role of CEA, sIL-2R, and IL-6 in predicting the prognosis of NSCLC patients. A decision tree model with seven cut-off points based on the above three indices was eventually built for clinical application.</p><p><strong>Conclusion: </strong>The neural network model exhibited ideal predictive performance in the survival status of NSCLC patients, and the decision tree model constructed based on selected important variables was conducive to rapid bedside prognosis assessment and decision-making.</p>","PeriodicalId":9340,"journal":{"name":"BMC Medical Informatics and Decision Making","volume":"24 1","pages":"344"},"PeriodicalIF":3.3,"publicationDate":"2024-11-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142667366","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Classification of lumbar spine disorders using large language models and MRI segmentation. 利用大型语言模型和磁共振成像分割对腰椎疾病进行分类。
IF 3.3 3区 医学 Q2 MEDICAL INFORMATICS Pub Date : 2024-11-18 DOI: 10.1186/s12911-024-02740-8
Rongpeng Dong, Xueliang Cheng, Mingyang Kang, Yang Qu

Background: MRI is critical for diagnosing lumbar spine disorders but its complexity challenges diagnostic accuracy. This study proposes a BERT-based large language model (LLM) to enhance precision in classifying lumbar spine disorders through the integration of MRI data, textual reports, and numerical measurements.

Methods: The segmentation quality of MRI data is evaluated using dice coefficients (cut-off: 0.92) and intersection over union (IoU) metrics (cut-off: 0.88) to ensure precise anatomical feature extraction. The CNN extracts key lumbar spine features, such as lumbar lordotic angle (LLA) and disc heights, which are tokenized as direct scalar values representing positional relationships. A data source of 28,065 patients with various disorders, including degenerative disc disease, spinal stenosis, and spondylolisthesis, is used to establish diagnostic standards. These standards are refined through post-CNN processing of MRI texture features. The BERT-based spinal LLM model integrates these CNN-extracted MRI features and numerical values through early fusion layers.

Results: Segmentation analysis illustrate various lumbar spine disorders and their anatomical changes. The model achieved high performance, with all key metrics nearing 0.9, demonstrating its effectiveness in classifying conditions like spondylolisthesis, herniated disc, and spinal stenosis. External validation further confirmed the model's generalizability across different populations. External validation on 514 expert-validated MRI cases further confirms the model's clinical relevance and generalizability. The BERT-based model classifies 61 combinations of lumbar spine disorders.

Conclusions: The BERT-based spinal LLM significantly improves the precision of lumbar spine disorder classification, supporting accurate diagnosis and treatment planning.

背景:磁共振成像是诊断腰椎疾病的关键,但其复杂性对诊断的准确性提出了挑战。本研究提出了一种基于 BERT 的大型语言模型(LLM),通过整合核磁共振成像数据、文本报告和数值测量来提高腰椎疾病分类的准确性:方法:使用骰子系数(临界值:0.92)和交集大于联合(IoU)指标(临界值:0.88)评估核磁共振成像数据的分割质量,以确保精确的解剖特征提取。CNN 可提取关键的腰椎特征,如腰椎前凸角 (LLA) 和椎间盘高度,这些特征被标记为代表位置关系的直接标量值。28,065 名患有各种疾病(包括椎间盘退行性病变、椎管狭窄和脊椎滑脱症)的患者的数据源被用来建立诊断标准。通过对核磁共振成像纹理特征的后 CNN 处理,这些标准得到了完善。基于 BERT 的脊柱 LLM 模型通过早期融合层整合了这些 CNN 提取的 MRI 特征和数值:结果:分割分析显示了各种腰椎疾病及其解剖变化。该模型取得了很高的性能,所有关键指标均接近 0.9,证明其在对脊柱滑脱症、椎间盘突出症和椎管狭窄症等病症进行分类时非常有效。外部验证进一步证实了该模型在不同人群中的通用性。对 514 个经专家验证的核磁共振成像病例进行的外部验证进一步证实了该模型的临床相关性和通用性。基于 BERT 的模型可对 61 种腰椎疾病组合进行分类:基于 BERT 的脊柱 LLM 显著提高了腰椎疾病分类的精确度,为准确诊断和治疗计划提供了支持。
{"title":"Classification of lumbar spine disorders using large language models and MRI segmentation.","authors":"Rongpeng Dong, Xueliang Cheng, Mingyang Kang, Yang Qu","doi":"10.1186/s12911-024-02740-8","DOIUrl":"10.1186/s12911-024-02740-8","url":null,"abstract":"<p><strong>Background: </strong>MRI is critical for diagnosing lumbar spine disorders but its complexity challenges diagnostic accuracy. This study proposes a BERT-based large language model (LLM) to enhance precision in classifying lumbar spine disorders through the integration of MRI data, textual reports, and numerical measurements.</p><p><strong>Methods: </strong>The segmentation quality of MRI data is evaluated using dice coefficients (cut-off: 0.92) and intersection over union (IoU) metrics (cut-off: 0.88) to ensure precise anatomical feature extraction. The CNN extracts key lumbar spine features, such as lumbar lordotic angle (LLA) and disc heights, which are tokenized as direct scalar values representing positional relationships. A data source of 28,065 patients with various disorders, including degenerative disc disease, spinal stenosis, and spondylolisthesis, is used to establish diagnostic standards. These standards are refined through post-CNN processing of MRI texture features. The BERT-based spinal LLM model integrates these CNN-extracted MRI features and numerical values through early fusion layers.</p><p><strong>Results: </strong>Segmentation analysis illustrate various lumbar spine disorders and their anatomical changes. The model achieved high performance, with all key metrics nearing 0.9, demonstrating its effectiveness in classifying conditions like spondylolisthesis, herniated disc, and spinal stenosis. External validation further confirmed the model's generalizability across different populations. External validation on 514 expert-validated MRI cases further confirms the model's clinical relevance and generalizability. The BERT-based model classifies 61 combinations of lumbar spine disorders.</p><p><strong>Conclusions: </strong>The BERT-based spinal LLM significantly improves the precision of lumbar spine disorder classification, supporting accurate diagnosis and treatment planning.</p>","PeriodicalId":9340,"journal":{"name":"BMC Medical Informatics and Decision Making","volume":"24 1","pages":"343"},"PeriodicalIF":3.3,"publicationDate":"2024-11-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142667367","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Construction and SHAP interpretability analysis of a risk prediction model for feeding intolerance in preterm newborns based on machine learning. 基于机器学习的早产新生儿喂养不耐受风险预测模型的构建和 SHAP 可解释性分析。
IF 3.3 3区 医学 Q2 MEDICAL INFORMATICS Pub Date : 2024-11-18 DOI: 10.1186/s12911-024-02751-5
Hui Xu, Xingwang Peng, Ziyu Peng, Rui Wang, Rui Zhou, Lianguo Fu

Objective: To construct a highly accurate and interpretable feeding intolerance (FI) risk prediction model for preterm newborns based on machine learning (ML) to assist medical staff in clinical diagnosis.

Methods: In this study, a sample of 350 hospitalized preterm newborns were retrospectively analysed. First, dual feature selection was conducted to identify important feature variables for model construction. Second, ML models were constructed based on the logistic regression (LR), decision tree (DT), support vector machine (SVM) and eXtreme Gradient Boosting (XGBoost) algorithms, after which random sampling and tenfold cross-validation were separately used to evaluate and compare these models and identify the optimal model. Finally, we apply the SHapley Additive exPlanation (SHAP) interpretable framework to analyse the decision-making principles of the optimal model and expound upon the important factors affecting FI in preterm newborns and their modes of action.

Results: The accuracy of XGBoost was 87.62%, and the area under the curve (AUC) was 92.2%. After the application of tenfold cross-validation, the accuracy was 83.43%, and the AUC was 89.45%, which was significantly better than those of the other models. Analysis of the XGBoost model with the SHAP interpretable framework showed that a history of resuscitation, use of probiotics, milk opening time, interval between two stools and gestational age were the main factors affecting the occurrence of FI in preterm newborns, yielding importance scores of 0.632, 0.407, 0.313, 0.313, and 0.258, respectively. A history of resuscitation, first milk opening time ≥ 24 h and interval between stools ≥ 3 days were risk factors for FI, while the use of probiotics and gestational age ≥ 34 weeks were protective factors against FI in preterm newborns.

Conclusions: In practice, we should improve perinatal care and obstetrics with the aim of reducing the occurrence of hypoxia and preterm delivery. When feeding, early milk opening, the use of probiotics, the stimulation of defecation and other measures should be implemented with the aim of reducing the occurrence of FI.

目的基于机器学习(ML)构建一个高度准确且可解释的早产新生儿喂养不耐受(FI)风险预测模型,以协助医务人员进行临床诊断:本研究对 350 例住院早产新生儿进行了回顾性分析。首先,进行了双重特征选择,以确定用于构建模型的重要特征变量。其次,根据逻辑回归(LR)、决策树(DT)、支持向量机(SVM)和极梯度提升(XGBoost)算法构建 ML 模型,然后分别使用随机抽样和十倍交叉验证对这些模型进行评估和比较,并确定最佳模型。最后,我们应用SHAPLE Additive exPlanation(SHAP)可解释框架分析了最优模型的决策原理,并阐述了影响早产新生儿FI的重要因素及其作用模式:XGBoost的准确率为87.62%,曲线下面积(AUC)为92.2%。应用十倍交叉验证后,准确率为 83.43%,AUC 为 89.45%,明显优于其他模型。利用 SHAP 可解释框架对 XGBoost 模型进行的分析表明,复苏史、益生菌的使用、开奶时间、两次大便间隔时间和胎龄是影响早产新生儿 FI 发生的主要因素,其重要性得分分别为 0.632、0.407、0.313、0.313 和 0.258。复苏史、首次开奶时间≥24 h和大便间隔≥3天是早产新生儿FI的风险因素,而使用益生菌和胎龄≥34周是早产新生儿FI的保护因素:在实践中,我们应改善围产期护理和产科,以减少缺氧和早产的发生。在喂养时,应采取早期开奶、使用益生菌、刺激排便等措施,以减少 FI 的发生。
{"title":"Construction and SHAP interpretability analysis of a risk prediction model for feeding intolerance in preterm newborns based on machine learning.","authors":"Hui Xu, Xingwang Peng, Ziyu Peng, Rui Wang, Rui Zhou, Lianguo Fu","doi":"10.1186/s12911-024-02751-5","DOIUrl":"10.1186/s12911-024-02751-5","url":null,"abstract":"<p><strong>Objective: </strong>To construct a highly accurate and interpretable feeding intolerance (FI) risk prediction model for preterm newborns based on machine learning (ML) to assist medical staff in clinical diagnosis.</p><p><strong>Methods: </strong>In this study, a sample of 350 hospitalized preterm newborns were retrospectively analysed. First, dual feature selection was conducted to identify important feature variables for model construction. Second, ML models were constructed based on the logistic regression (LR), decision tree (DT), support vector machine (SVM) and eXtreme Gradient Boosting (XGBoost) algorithms, after which random sampling and tenfold cross-validation were separately used to evaluate and compare these models and identify the optimal model. Finally, we apply the SHapley Additive exPlanation (SHAP) interpretable framework to analyse the decision-making principles of the optimal model and expound upon the important factors affecting FI in preterm newborns and their modes of action.</p><p><strong>Results: </strong>The accuracy of XGBoost was 87.62%, and the area under the curve (AUC) was 92.2%. After the application of tenfold cross-validation, the accuracy was 83.43%, and the AUC was 89.45%, which was significantly better than those of the other models. Analysis of the XGBoost model with the SHAP interpretable framework showed that a history of resuscitation, use of probiotics, milk opening time, interval between two stools and gestational age were the main factors affecting the occurrence of FI in preterm newborns, yielding importance scores of 0.632, 0.407, 0.313, 0.313, and 0.258, respectively. A history of resuscitation, first milk opening time ≥ 24 h and interval between stools ≥ 3 days were risk factors for FI, while the use of probiotics and gestational age ≥ 34 weeks were protective factors against FI in preterm newborns.</p><p><strong>Conclusions: </strong>In practice, we should improve perinatal care and obstetrics with the aim of reducing the occurrence of hypoxia and preterm delivery. When feeding, early milk opening, the use of probiotics, the stimulation of defecation and other measures should be implemented with the aim of reducing the occurrence of FI.</p>","PeriodicalId":9340,"journal":{"name":"BMC Medical Informatics and Decision Making","volume":"24 1","pages":"342"},"PeriodicalIF":3.3,"publicationDate":"2024-11-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142667368","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Evaluating the usability of Iran's national comprehensive health information system: a think-aloud study to uncover usability problems in the recording of childcare data. 评估伊朗国家综合卫生信息系统的可用性:通过思考-朗读研究发现儿童保育数据记录中的可用性问题。
IF 3.3 3区 医学 Q2 MEDICAL INFORMATICS Pub Date : 2024-11-16 DOI: 10.1186/s12911-024-02746-2
Razieh Farrahi, Ehsan Nabovati, Reyhane Bigham, Fateme Rangraz Jeddi

Introduction: Health information systems play a crucial role in the delivery of efficient and effective healthcare. Poor usability is one of the reasons for their lack of acceptance and low usage by users. The aim of this study was to identify the usability problems of a national comprehensive health information system using the concurrent think-aloud method in the recording of childcare data.

Methods: A descriptive cross-sectional study was conducted in the health centers of Kashan University of Medical Sciences, Iran, in 2020. Ten healthcare providers as system's users were purposively selected to evaluate the system. To identify problems, a concurrent think-aloud evaluation was conducted. Two administrators of the system designed scenarios for ten childcare data recording tasks. By analysing the recorded files, usability problems were identified. The severity of the problems was then determined with the help of the users and problems were assigned to usability attributes based on their impact on the user.

Results: A total of 68 unique problems were identified in the system, of which 47.1% were rated as catastrophic problems. The participants assigned 47 problems (69%) to the user satisfaction attribute and 45 problems (66%) to the efficiency attribute; they also did not assign any problems to the effectiveness attribute.

Conclusion: The problems identified in the national comprehensive health information system using the think-aloud method were rated as major and catastrophic, which indicates poor usability of this system. Therefore, resolving the system problems will help increase user satisfaction and system efficiency, allowing more time to be spent on patient care and parent's education as well as improving overall quality of care.

导言:医疗信息系统在提供高效和有效的医疗保健服务方面发挥着至关重要的作用。可用性差是其不被用户接受和使用率低的原因之一。本研究的目的是在记录儿童保健数据的过程中,使用 "同时思考-大声朗读 "的方法,找出国家综合保健信息系统的可用性问题:方法:2020 年在伊朗卡尚医科大学的医疗中心进行了一项描述性横断面研究。有目的性地选择了 10 名医疗保健提供者作为系统用户,对系统进行评估。为发现问题,同时进行了思考-朗读评估。系统的两名管理员为十项儿童护理数据记录任务设计了情景。通过分析记录的文件,发现了可用性问题。然后,在用户的帮助下确定了问题的严重程度,并根据问题对用户的影响将其分配到可用性属性中:结果:在系统中总共发现了 68 个独特的问题,其中 47.1% 被评为灾难性问题。参与者将 47 个问题(69%)分配给了用户满意度属性,将 45 个问题(66%)分配给了效率属性;他们也没有将任何问题分配给有效性属性:结论:使用 "大声想一想 "的方法在国家综合卫生信息系统中发现的问题被评为重大和灾难性问题,这表明该系统的可用性很差。因此,解决系统问题将有助于提高用户满意度和系统效率,从而将更多的时间用于病人护理和家长教育,并提高整体护理质量。
{"title":"Evaluating the usability of Iran's national comprehensive health information system: a think-aloud study to uncover usability problems in the recording of childcare data.","authors":"Razieh Farrahi, Ehsan Nabovati, Reyhane Bigham, Fateme Rangraz Jeddi","doi":"10.1186/s12911-024-02746-2","DOIUrl":"10.1186/s12911-024-02746-2","url":null,"abstract":"<p><strong>Introduction: </strong>Health information systems play a crucial role in the delivery of efficient and effective healthcare. Poor usability is one of the reasons for their lack of acceptance and low usage by users. The aim of this study was to identify the usability problems of a national comprehensive health information system using the concurrent think-aloud method in the recording of childcare data.</p><p><strong>Methods: </strong>A descriptive cross-sectional study was conducted in the health centers of Kashan University of Medical Sciences, Iran, in 2020. Ten healthcare providers as system's users were purposively selected to evaluate the system. To identify problems, a concurrent think-aloud evaluation was conducted. Two administrators of the system designed scenarios for ten childcare data recording tasks. By analysing the recorded files, usability problems were identified. The severity of the problems was then determined with the help of the users and problems were assigned to usability attributes based on their impact on the user.</p><p><strong>Results: </strong>A total of 68 unique problems were identified in the system, of which 47.1% were rated as catastrophic problems. The participants assigned 47 problems (69%) to the user satisfaction attribute and 45 problems (66%) to the efficiency attribute; they also did not assign any problems to the effectiveness attribute.</p><p><strong>Conclusion: </strong>The problems identified in the national comprehensive health information system using the think-aloud method were rated as major and catastrophic, which indicates poor usability of this system. Therefore, resolving the system problems will help increase user satisfaction and system efficiency, allowing more time to be spent on patient care and parent's education as well as improving overall quality of care.</p>","PeriodicalId":9340,"journal":{"name":"BMC Medical Informatics and Decision Making","volume":"24 1","pages":"341"},"PeriodicalIF":3.3,"publicationDate":"2024-11-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11568614/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142643435","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
BMC Medical Informatics and Decision Making
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1