首页 > 最新文献

BMC Medical Informatics and Decision Making最新文献

英文 中文
Med-MGF: multi-level graph-based framework for handling medical data imbalance and representation. Med-MGF:基于多层次图的医疗数据不平衡和代表性处理框架。
IF 3.3 3区 医学 Q2 MEDICAL INFORMATICS Pub Date : 2024-09-02 DOI: 10.1186/s12911-024-02649-2
Tuong Minh Nguyen, Kim Leng Poh, Shu-Ling Chong, Jan Hau Lee

Background: Modeling patient data, particularly electronic health records (EHR), is one of the major focuses of machine learning studies in healthcare, as these records provide clinicians with valuable information that can potentially assist them in disease diagnosis and decision-making.

Methods: In this study, we present a multi-level graph-based framework called MedMGF, which models both patient medical profiles extracted from EHR data and their relationship network of health profiles in a single architecture. The medical profiles consist of several layers of data embedding derived from interval records obtained during hospitalization, and the patient-patient network is created by measuring the similarities between these profiles. We also propose a modification to the Focal Loss (FL) function to improve classification performance in imbalanced datasets without the need to imputate the data. MedMGF's performance was evaluated against several Graphical Convolutional Network (GCN) baseline models implemented with Binary Cross Entropy (BCE), FL, class balancing parameter α , and Synthetic Minority Oversampling Technique (SMOTE).

Results: Our proposed framework achieved high classification performance (AUC: 0.8098, ACC: 0.7503, SEN: 0.8750, SPE: 0.7445, NPV: 0.9923, PPV: 0.1367) on an extreme imbalanced pediatric sepsis dataset (n=3,014, imbalance ratio of 0.047). It yielded a classification improvement of 3.81% for AUC, 15% for SEN compared to the baseline GCN+ α FL (AUC: 0.7717, ACC: 0.8144, SEN: 0.7250, SPE: 0.8185, PPV: 0.1559, NPV: 0.9847), and an improvement of 5.88% in AUC and 22.5% compared to GCN+FL+SMOTE (AUC: 0.7510, ACC: 0.8431, SEN: 0.6500, SPE: 0.8520, PPV: 0.1688, NPV: 0.9814). It also showed a classification improvement of 3.86% for AUC, 15% for SEN compared to the baseline GCN+ α BCE (AUC: 0.7712, ACC: 0.8133, SEN: 0.7250, SPE: 0.8173, PPV: 0.1551, NPV: 0.9847), and an improvement of 14.33% in AUC and 27.5% in comparison to GCN+BCE+SMOTE (AUC: 0.6665, ACC: 0.7271, SEN: 0.6000, SPE: 0.7329, PPV: 0.0941, NPV: 0.9754).

Conclusion: When compared to all baseline models, MedMGF achieved the highest SEN and AUC results, demonstrating the potential for several healthcare applications.

背景:患者数据建模,尤其是电子健康记录(EHR),是医疗领域机器学习研究的重点之一,因为这些记录为临床医生提供了宝贵的信息,有可能帮助他们进行疾病诊断和决策:在本研究中,我们提出了一个基于多层次图的框架,称为 MedMGF,该框架在单一架构中对从电子病历数据中提取的患者医疗档案及其健康档案关系网络进行建模。医疗档案由多层数据嵌入组成,这些数据嵌入来自住院期间获得的间隔记录,而患者-患者网络则是通过测量这些档案之间的相似性创建的。我们还提出了对焦点损失(FL)函数的修改,以提高不平衡数据集的分类性能,而无需对数据进行估算。我们利用二元交叉熵(BCE)、FL、类平衡参数α和合成少数群体过度采样技术(SMOTE)对多个图形卷积网络(GCN)基线模型进行了评估:我们提出的框架取得了很高的分类性能(AUC:0.8098, ACC:0.7503,SEN:0.8750,SPE:0.7445,NPV:0.9923,PPV:0.1367)。与基线 GCN+ α FL(AUC:0.7717, ACC:0.8144,SEN:0.7250,SPE:0.8185,PPV:0.1559,NPV:0.9847)相比,AUC提高了5.88%,SEN提高了22.5%(AUC:0.7510,ACC:0.8144,SEN:0.7250,SPE:0.8185,PPV:0.1559,NPV:0.9847):0.7510, ACC:AUC:0.7510,ACC:0.8431,SEN:0.6500,SPE:0.8520,PPV:0.1688,NPV:0.9814)。与基线 GCN+ α BCE 相比,它的 AUC 和 SEN 分别提高了 3.86% 和 15%(AUC:0.7712, ACC:0.8133,SEN:0.7250,SPE:0.8173,PPV:0.1551,NPV:0.9847),与 GCN+BCE+SMOTE 相比,AUC 提高了 14.33%,SEN 提高了 27.5%(AUC:0.6665,ACC:0.8133,SEN:0.7250,SPE:0.8173,PPV:0.1551,NPV:0.9847):0.7271,SEN结论是:与所有基线模型相比,Medicrometo™模型的AUC和NPV分别提高了14.33%和27.5%:与所有基线模型相比,MedMGF 的 SEN 值和 AUC 值最高,显示了其在多种医疗应用中的潜力。
{"title":"Med-MGF: multi-level graph-based framework for handling medical data imbalance and representation.","authors":"Tuong Minh Nguyen, Kim Leng Poh, Shu-Ling Chong, Jan Hau Lee","doi":"10.1186/s12911-024-02649-2","DOIUrl":"10.1186/s12911-024-02649-2","url":null,"abstract":"<p><strong>Background: </strong>Modeling patient data, particularly electronic health records (EHR), is one of the major focuses of machine learning studies in healthcare, as these records provide clinicians with valuable information that can potentially assist them in disease diagnosis and decision-making.</p><p><strong>Methods: </strong>In this study, we present a multi-level graph-based framework called MedMGF, which models both patient medical profiles extracted from EHR data and their relationship network of health profiles in a single architecture. The medical profiles consist of several layers of data embedding derived from interval records obtained during hospitalization, and the patient-patient network is created by measuring the similarities between these profiles. We also propose a modification to the Focal Loss (FL) function to improve classification performance in imbalanced datasets without the need to imputate the data. MedMGF's performance was evaluated against several Graphical Convolutional Network (GCN) baseline models implemented with Binary Cross Entropy (BCE), FL, class balancing parameter <math><mi>α</mi></math> , and Synthetic Minority Oversampling Technique (SMOTE).</p><p><strong>Results: </strong>Our proposed framework achieved high classification performance (AUC: 0.8098, ACC: 0.7503, SEN: 0.8750, SPE: 0.7445, NPV: 0.9923, PPV: 0.1367) on an extreme imbalanced pediatric sepsis dataset (n=3,014, imbalance ratio of 0.047). It yielded a classification improvement of 3.81% for AUC, 15% for SEN compared to the baseline GCN+ <math><mi>α</mi></math> FL (AUC: 0.7717, ACC: 0.8144, SEN: 0.7250, SPE: 0.8185, PPV: 0.1559, NPV: 0.9847), and an improvement of 5.88% in AUC and 22.5% compared to GCN+FL+SMOTE (AUC: 0.7510, ACC: 0.8431, SEN: 0.6500, SPE: 0.8520, PPV: 0.1688, NPV: 0.9814). It also showed a classification improvement of 3.86% for AUC, 15% for SEN compared to the baseline GCN+ <math><mi>α</mi></math> BCE (AUC: 0.7712, ACC: 0.8133, SEN: 0.7250, SPE: 0.8173, PPV: 0.1551, NPV: 0.9847), and an improvement of 14.33% in AUC and 27.5% in comparison to GCN+BCE+SMOTE (AUC: 0.6665, ACC: 0.7271, SEN: 0.6000, SPE: 0.7329, PPV: 0.0941, NPV: 0.9754).</p><p><strong>Conclusion: </strong>When compared to all baseline models, MedMGF achieved the highest SEN and AUC results, demonstrating the potential for several healthcare applications.</p>","PeriodicalId":9340,"journal":{"name":"BMC Medical Informatics and Decision Making","volume":null,"pages":null},"PeriodicalIF":3.3,"publicationDate":"2024-09-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11367907/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142119056","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Clinician perspectives and recommendations regarding design of clinical prediction models for deteriorating patients in acute care. 临床医生对急症护理中病情恶化患者临床预测模型设计的看法和建议。
IF 3.3 3区 医学 Q2 MEDICAL INFORMATICS Pub Date : 2024-09-02 DOI: 10.1186/s12911-024-02647-4
Robin Blythe, Sundresan Naicker, Nicole White, Raelene Donovan, Ian A Scott, Andrew McKelliget, Steven M McPhail

Background: Successful deployment of clinical prediction models for clinical deterioration relates not only to predictive performance but to integration into the decision making process. Models may demonstrate good discrimination and calibration, but fail to match the needs of practising acute care clinicians who receive, interpret, and act upon model outputs or alerts. We sought to understand how prediction models for clinical deterioration, also known as early warning scores (EWS), influence the decision-making of clinicians who regularly use them and elicit their perspectives on model design to guide future deterioration model development and implementation.

Methods: Nurses and doctors who regularly receive or respond to EWS alerts in two digital metropolitan hospitals were interviewed for up to one hour between February 2022 and March 2023 using semi-structured formats. We grouped interview data into sub-themes and then into general themes using reflexive thematic analysis. Themes were then mapped to a model of clinical decision making using deductive framework mapping to develop a set of practical recommendations for future deterioration model development and deployment.

Results: Fifteen nurses (n = 8) and doctors (n = 7) were interviewed for a mean duration of 42 min. Participants emphasised the importance of using predictive tools for supporting rather than supplanting critical thinking, avoiding over-protocolising care, incorporating important contextual information and focusing on how clinicians generate, test, and select diagnostic hypotheses when managing deteriorating patients. These themes were incorporated into a conceptual model which informed recommendations that clinical deterioration prediction models demonstrate transparency and interactivity, generate outputs tailored to the tasks and responsibilities of end-users, avoid priming clinicians with potential diagnoses before patients were physically assessed, and support the process of deciding upon subsequent management.

Conclusions: Prediction models for deteriorating inpatients may be more impactful if they are designed in accordance with the decision-making processes of acute care clinicians. Models should produce actionable outputs that assist with, rather than supplant, critical thinking.

背景:临床恶化预测模型的成功应用不仅与预测性能有关,还与决策过程的整合有关。模型可能表现出良好的辨别力和校准能力,但却无法满足急症护理临床医生的需求,他们需要接收、解释模型输出或警报,并根据模型输出或警报采取行动。我们试图了解临床病情恶化预测模型(也称为早期预警评分(EWS))如何影响经常使用这些模型的临床医生的决策,并征求他们对模型设计的看法,以指导未来病情恶化模型的开发和实施:在 2022 年 2 月至 2023 年 3 月期间,我们采用半结构化形式对两家数字化都市医院中定期接收或响应 EWS 警报的护士和医生进行了长达一小时的访谈。我们采用反思性主题分析法将访谈数据归类为子主题,然后再归类为一般主题。然后使用演绎框架映射法将主题映射到临床决策模型中,从而为未来恶化模型的开发和部署提出一系列实用建议:15 名护士(n = 8)和医生(n = 7)接受了访谈,平均访谈时间为 42 分钟。参与者强调了使用预测工具支持而非取代批判性思维、避免过度协议化护理、纳入重要的背景信息以及关注临床医生在管理病情恶化患者时如何生成、测试和选择诊断假设的重要性。这些主题被纳入到一个概念模型中,该模型提出的建议包括:临床病情恶化预测模型应具有透明度和互动性,根据最终用户的任务和职责生成输出结果,避免在对患者进行身体评估之前向临床医生提供潜在诊断,以及支持决定后续管理的过程:针对病情恶化的住院患者的预测模型如果能根据急诊临床医生的决策过程进行设计,可能会产生更大的影响。模型应产生可操作的输出结果,协助而非取代批判性思维。
{"title":"Clinician perspectives and recommendations regarding design of clinical prediction models for deteriorating patients in acute care.","authors":"Robin Blythe, Sundresan Naicker, Nicole White, Raelene Donovan, Ian A Scott, Andrew McKelliget, Steven M McPhail","doi":"10.1186/s12911-024-02647-4","DOIUrl":"10.1186/s12911-024-02647-4","url":null,"abstract":"<p><strong>Background: </strong>Successful deployment of clinical prediction models for clinical deterioration relates not only to predictive performance but to integration into the decision making process. Models may demonstrate good discrimination and calibration, but fail to match the needs of practising acute care clinicians who receive, interpret, and act upon model outputs or alerts. We sought to understand how prediction models for clinical deterioration, also known as early warning scores (EWS), influence the decision-making of clinicians who regularly use them and elicit their perspectives on model design to guide future deterioration model development and implementation.</p><p><strong>Methods: </strong>Nurses and doctors who regularly receive or respond to EWS alerts in two digital metropolitan hospitals were interviewed for up to one hour between February 2022 and March 2023 using semi-structured formats. We grouped interview data into sub-themes and then into general themes using reflexive thematic analysis. Themes were then mapped to a model of clinical decision making using deductive framework mapping to develop a set of practical recommendations for future deterioration model development and deployment.</p><p><strong>Results: </strong>Fifteen nurses (n = 8) and doctors (n = 7) were interviewed for a mean duration of 42 min. Participants emphasised the importance of using predictive tools for supporting rather than supplanting critical thinking, avoiding over-protocolising care, incorporating important contextual information and focusing on how clinicians generate, test, and select diagnostic hypotheses when managing deteriorating patients. These themes were incorporated into a conceptual model which informed recommendations that clinical deterioration prediction models demonstrate transparency and interactivity, generate outputs tailored to the tasks and responsibilities of end-users, avoid priming clinicians with potential diagnoses before patients were physically assessed, and support the process of deciding upon subsequent management.</p><p><strong>Conclusions: </strong>Prediction models for deteriorating inpatients may be more impactful if they are designed in accordance with the decision-making processes of acute care clinicians. Models should produce actionable outputs that assist with, rather than supplant, critical thinking.</p>","PeriodicalId":9340,"journal":{"name":"BMC Medical Informatics and Decision Making","volume":null,"pages":null},"PeriodicalIF":3.3,"publicationDate":"2024-09-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11367817/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142119053","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Common data quality elements for health information systems: a systematic review. 卫生信息系统的通用数据质量要素:系统回顾。
IF 3.3 3区 医学 Q2 MEDICAL INFORMATICS Pub Date : 2024-09-02 DOI: 10.1186/s12911-024-02644-7
Hossein Ghalavand, Saied Shirshahi, Alireza Rahimi, Zarrin Zarrinabadi, Fatemeh Amani

Background: Data quality in health information systems has a complex structure and consists of several dimensions. This research conducted for identify Common data quality elements for health information systems.

Methods: A literature review was conducted and search strategies run in Web of Knowledge, Science Direct, Emerald, PubMed, Scopus and Google Scholar search engine as an additional source for tracing references. We found 760 papers, excluded 314 duplicates, 339 on abstract review and 167 on full-text review; leaving 58 papers for critical appraisal.

Results: Current review shown that 14 criteria are categorized as the main dimensions for data quality for health information system include: Accuracy, Consistency, Security, Timeliness, Completeness, Reliability, Accessibility, Objectivity, Relevancy, Understandability, Navigation, Reputation, Efficiency and Value- added. Accuracy, Completeness, and Timeliness, were the three most-used dimensions in literature.

Conclusions: At present, there is a lack of uniformity and potential applicability in the dimensions employed to evaluate the data quality of health information system. Typically, different approaches (qualitative, quantitative and mixed methods) were utilized to evaluate data quality for health information system in the publications that were reviewed. Consequently, due to the inconsistency in defining dimensions and assessing methods, it became imperative to categorize the dimensions of data quality into a limited set of primary dimensions.

背景:医疗信息系统中的数据质量结构复杂,由多个维度组成。本研究旨在确定卫生信息系统的常见数据质量要素:我们进行了文献综述,并在 Web of Knowledge、Science Direct、Emerald、PubMed、Scopus 和 Google Scholar 搜索引擎上使用搜索策略,作为追踪参考文献的补充来源。我们找到了 760 篇论文,排除了 314 篇重复论文、339 篇摘要审查论文和 167 篇全文审查论文;剩下 58 篇论文进行了批判性评估:目前的研究表明,14 项标准被归类为卫生信息系统数据质量的主要维度,包括准确性、一致性、安全性、及时性、完整性、可靠性、可访问性、客观性、相关性、可理解性、导航性、声誉、效率和附加值。准确性、完整性和及时性是文献中使用最多的三个维度:目前,用于评估卫生信息系统数据质量的维度缺乏统一性和潜在适用性。在所查阅的文献中,通常采用不同的方法(定性、定量和混合方法)来评价卫生信息系统的数据质量。因此,由于定义维度和评估方法的不一致,必须将数据质量的维度归类为一套有限的主要维度。
{"title":"Common data quality elements for health information systems: a systematic review.","authors":"Hossein Ghalavand, Saied Shirshahi, Alireza Rahimi, Zarrin Zarrinabadi, Fatemeh Amani","doi":"10.1186/s12911-024-02644-7","DOIUrl":"10.1186/s12911-024-02644-7","url":null,"abstract":"<p><strong>Background: </strong>Data quality in health information systems has a complex structure and consists of several dimensions. This research conducted for identify Common data quality elements for health information systems.</p><p><strong>Methods: </strong>A literature review was conducted and search strategies run in Web of Knowledge, Science Direct, Emerald, PubMed, Scopus and Google Scholar search engine as an additional source for tracing references. We found 760 papers, excluded 314 duplicates, 339 on abstract review and 167 on full-text review; leaving 58 papers for critical appraisal.</p><p><strong>Results: </strong>Current review shown that 14 criteria are categorized as the main dimensions for data quality for health information system include: Accuracy, Consistency, Security, Timeliness, Completeness, Reliability, Accessibility, Objectivity, Relevancy, Understandability, Navigation, Reputation, Efficiency and Value- added. Accuracy, Completeness, and Timeliness, were the three most-used dimensions in literature.</p><p><strong>Conclusions: </strong>At present, there is a lack of uniformity and potential applicability in the dimensions employed to evaluate the data quality of health information system. Typically, different approaches (qualitative, quantitative and mixed methods) were utilized to evaluate data quality for health information system in the publications that were reviewed. Consequently, due to the inconsistency in defining dimensions and assessing methods, it became imperative to categorize the dimensions of data quality into a limited set of primary dimensions.</p>","PeriodicalId":9340,"journal":{"name":"BMC Medical Informatics and Decision Making","volume":null,"pages":null},"PeriodicalIF":3.3,"publicationDate":"2024-09-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11367888/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142119054","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Selection of data analytic techniques by using fuzzy AHP TOPSIS from a healthcare perspective. 从医疗保健角度利用模糊 AHP TOPSIS 选择数据分析技术。
IF 3.3 3区 医学 Q2 MEDICAL INFORMATICS Pub Date : 2024-09-02 DOI: 10.1186/s12911-024-02651-8
Abdullah Alharbi, Wael Alosaimi, Hashem Alyami, Bader Alouffi, Ahmed Almulihi, Mohd Nadeem, Mohd Asim Sayeed, Raees Ahmad Khan

The healthcare industry has been put to test the need to manage enormous amounts of data provided by various sources, which are renowned for providing enormous quantities of heterogeneous information. The data are collected and analyzed with different Data Analytic (DA) and machine learning algorithm approaches. Researchers, scientists, and industrialists must manage or select the best approach associated with DA in healthcare. This scientific study is based on decision analysis between the DA factors and alternatives. The information affects the whole system in a rational manner. This information is very important in healthcare sector for appropriate prediction and analysis. The evaluation discusses its benefits and presents an analytic framework. The Fuzzy Analytic Hierarchy Process (Fuzzy AHP) approach is used to address the weight of the factors. The Fuzzy Techniques for Order Preference by Similarity to Ideal Solution (Fuzzy TOPSIS) address the rank of the data analytic alternatives used in healthcare sector. The models used in the article briefly discuss the challenges of DA and approaches to address those challenges. The assorted factors of DA are capture, cleaning, storage, security, stewardship, reporting, visualization, updating, sharing, and querying. The DA alternatives include descriptive, diagnostic, predictive, prescriptive, discovery, regression, cohort and inferential analyses. The most influential factors of the DA and the most suitable approach for the DA are evaluated. The 'cleaning' factor has the highest weight, and 'updating' is achieved at least by the Fuzzy-AHP approach. The regression approach of data analysis had the highest rank, and the diagnostic analysis had the lowest rank. Decision analyses are necessary for data scientists and medical providers to predict diseases appropriately in the healthcare domain. This analysis also revealed the cost benefits to hospitals.

医疗保健行业需要管理各种来源提供的海量数据,这些数据以提供大量异构信息而著称。这些数据是通过不同的数据分析(DA)和机器学习算法方法收集和分析的。研究人员、科学家和工业家必须管理或选择与医疗保健中的数据分析相关的最佳方法。这项科学研究以数据分析因素和替代方案之间的决策分析为基础。这些信息以合理的方式影响着整个系统。这些信息对医疗保健领域的适当预测和分析非常重要。评估讨论了其益处,并提出了一个分析框架。模糊分析层次过程(Fuzzy Analytic Hierarchy Process,FHP)方法用于解决各因素的权重问题。通过与理想解决方案的相似性进行排序的模糊技术(Fuzzy TOPSIS)解决了医疗保健领域使用的数据分析替代方案的排序问题。文章中使用的模型简要讨论了数据分析所面临的挑战以及应对这些挑战的方法。数据分析的各种因素包括捕获、清理、存储、安全、管理、报告、可视化、更新、共享和查询。数据分析的替代方法包括描述性分析、诊断性分析、预测性分析、规范性分析、发现分析、回归分析、队列分析和推理分析。对数据分析的最大影响因素和最适合数据分析的方法进行了评估。清理 "因素的权重最高,"更新 "因素在模糊-AHP 方法中的权重最低。数据分析的回归方法排名最高,诊断分析排名最低。决策分析是数据科学家和医疗服务提供者在医疗保健领域适当预测疾病所必需的。这项分析还揭示了医院的成本效益。
{"title":"Selection of data analytic techniques by using fuzzy AHP TOPSIS from a healthcare perspective.","authors":"Abdullah Alharbi, Wael Alosaimi, Hashem Alyami, Bader Alouffi, Ahmed Almulihi, Mohd Nadeem, Mohd Asim Sayeed, Raees Ahmad Khan","doi":"10.1186/s12911-024-02651-8","DOIUrl":"10.1186/s12911-024-02651-8","url":null,"abstract":"<p><p>The healthcare industry has been put to test the need to manage enormous amounts of data provided by various sources, which are renowned for providing enormous quantities of heterogeneous information. The data are collected and analyzed with different Data Analytic (DA) and machine learning algorithm approaches. Researchers, scientists, and industrialists must manage or select the best approach associated with DA in healthcare. This scientific study is based on decision analysis between the DA factors and alternatives. The information affects the whole system in a rational manner. This information is very important in healthcare sector for appropriate prediction and analysis. The evaluation discusses its benefits and presents an analytic framework. The Fuzzy Analytic Hierarchy Process (Fuzzy AHP) approach is used to address the weight of the factors. The Fuzzy Techniques for Order Preference by Similarity to Ideal Solution (Fuzzy TOPSIS) address the rank of the data analytic alternatives used in healthcare sector. The models used in the article briefly discuss the challenges of DA and approaches to address those challenges. The assorted factors of DA are capture, cleaning, storage, security, stewardship, reporting, visualization, updating, sharing, and querying. The DA alternatives include descriptive, diagnostic, predictive, prescriptive, discovery, regression, cohort and inferential analyses. The most influential factors of the DA and the most suitable approach for the DA are evaluated. The 'cleaning' factor has the highest weight, and 'updating' is achieved at least by the Fuzzy-AHP approach. The regression approach of data analysis had the highest rank, and the diagnostic analysis had the lowest rank. Decision analyses are necessary for data scientists and medical providers to predict diseases appropriately in the healthcare domain. This analysis also revealed the cost benefits to hospitals.</p>","PeriodicalId":9340,"journal":{"name":"BMC Medical Informatics and Decision Making","volume":null,"pages":null},"PeriodicalIF":3.3,"publicationDate":"2024-09-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11368004/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142119057","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Special supplement issue on quality assurance and enrichment of biological and biomedical ontologies and terminologies. 关于生物和生物医学本体和术语的质量保证和丰富的特别增刊。
IF 3.3 3区 医学 Q2 MEDICAL INFORMATICS Pub Date : 2024-08-30 DOI: 10.1186/s12911-024-02654-5
Licong Cui, Ankur Agrawal

Ontologies and terminologies serve as the backbone of knowledge representation in biomedical domains, facilitating data integration, interoperability, and semantic understanding across diverse applications. However, the quality assurance and enrichment of these resources remain an ongoing challenge due to the dynamic nature of biomedical knowledge. In this editorial, we provide an introductory summary of seven articles included in this special supplement issue for quality assurance and enrichment of biological and biomedical ontologies and terminologies. These articles span a spectrum of topics, such as development of automated quality assessment frameworks for Resource Description Framework (RDF) resources, identification of missing concepts in SNOMED CT through logical definitions, and developing a COVID interface terminology to enable automatic annotations of COVID-19 related Electronic Health Records (EHRs). Collectively, these contributions underscore the ongoing efforts to improve the accuracy, consistency, and interoperability of biomedical ontologies and terminologies, thus advancing their pivotal role in healthcare and biomedical research.

本体和术语是生物医学领域知识表征的支柱,可促进数据集成、互操作性和不同应用之间的语义理解。然而,由于生物医学知识的动态性质,这些资源的质量保证和丰富仍然是一个持续的挑战。在这篇社论中,我们对本特刊增刊中收录的七篇文章进行了介绍性总结,这些文章涉及生物和生物医学本体和术语的质量保证和丰富。这些文章涉及多个主题,如为资源描述框架(RDF)资源开发自动质量评估框架、通过逻辑定义识别 SNOMED CT 中缺失的概念,以及开发 COVID 接口术语以实现 COVID-19 相关电子健康记录(EHR)的自动注释。总之,这些贡献强调了为提高生物医学本体和术语的准确性、一致性和互操作性所做的不懈努力,从而推动了它们在医疗保健和生物医学研究中的关键作用。
{"title":"Special supplement issue on quality assurance and enrichment of biological and biomedical ontologies and terminologies.","authors":"Licong Cui, Ankur Agrawal","doi":"10.1186/s12911-024-02654-5","DOIUrl":"10.1186/s12911-024-02654-5","url":null,"abstract":"<p><p>Ontologies and terminologies serve as the backbone of knowledge representation in biomedical domains, facilitating data integration, interoperability, and semantic understanding across diverse applications. However, the quality assurance and enrichment of these resources remain an ongoing challenge due to the dynamic nature of biomedical knowledge. In this editorial, we provide an introductory summary of seven articles included in this special supplement issue for quality assurance and enrichment of biological and biomedical ontologies and terminologies. These articles span a spectrum of topics, such as development of automated quality assessment frameworks for Resource Description Framework (RDF) resources, identification of missing concepts in SNOMED CT through logical definitions, and developing a COVID interface terminology to enable automatic annotations of COVID-19 related Electronic Health Records (EHRs). Collectively, these contributions underscore the ongoing efforts to improve the accuracy, consistency, and interoperability of biomedical ontologies and terminologies, thus advancing their pivotal role in healthcare and biomedical research.</p>","PeriodicalId":9340,"journal":{"name":"BMC Medical Informatics and Decision Making","volume":null,"pages":null},"PeriodicalIF":3.3,"publicationDate":"2024-08-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11363377/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142116990","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A deep convolutional neural network approach using medical image classification. 利用医学图像分类的深度卷积神经网络方法。
IF 3.3 3区 医学 Q2 MEDICAL INFORMATICS Pub Date : 2024-08-29 DOI: 10.1186/s12911-024-02646-5
Mohammad Mousavi, Soodeh Hosseini

The epidemic diseases such as COVID-19 are rapidly spreading all around the world. The diagnosis of epidemic at initial stage is of high importance to provide medical care to and recovery of infected people as well as protecting the uninfected population. In this paper, an automatic COVID-19 detection model using respiratory sound and medical image based on internet of health things (IoHT) is proposed. In this model, primarily to screen those people having suspected Coronavirus disease, the sound of coughing used to detect healthy people and those suffering from COVID-19, which finally obtained an accuracy of 94.999%. This approach not only expedites diagnosis and enhances accuracy but also facilitates swift screening in public places using simple equipment. Then, in the second step, in order to help radiologists to interpret medical images as best as possible, we use three pre-trained convolutional neural network models InceptionResNetV2, InceptionV3 and EfficientNetB4 and two data sets of chest radiology medical images, and CT Scan in a three-class classification. Utilizing transfer learning and pre-existing knowledge in these models leads to notable improvements in disease diagnosis and identification compared to traditional techniques. Finally, the best result obtained for CT-Scan images belonging to InceptionResNetV2 architecture with 99.414% accuracy and for radiology images related to InceptionV3 and EfficientNetB4 architectures with the accuracy is 96.943%. Therefore, the proposed model can help radiology specialists to confirm the initial assessments of the COVID-19 disease.

COVID-19 等流行病正在全球迅速蔓延。疫情初期的诊断对于为感染者提供医疗服务和康复以及保护未感染人群具有重要意义。本文提出了一种基于健康物联网(IoHT)、利用呼吸声音和医学图像的 COVID-19 自动检测模型。该模型主要用于筛查疑似感染冠状病毒的人群,利用咳嗽声检测健康人群和 COVID-19 患者,最终获得了 94.999% 的准确率。这种方法不仅加快了诊断速度,提高了准确性,而且便于在公共场所使用简单的设备进行快速筛查。第二步,为了帮助放射科医生更好地解读医学影像,我们使用了三个预先训练好的卷积神经网络模型 InceptionResNetV2、InceptionV3 和 EfficientNetB4,以及胸部放射医学影像和 CT 扫描三类分类的两个数据集。与传统技术相比,在这些模型中利用迁移学习和已有知识可显著提高疾病诊断和识别能力。最后,属于 InceptionResNetV2 架构的 CT 扫描图像获得了最佳结果,准确率为 99.414%;与 InceptionV3 和 EfficientNetB4 架构相关的放射学图像的准确率为 96.943%。因此,所提出的模型可以帮助放射科专家确认 COVID-19 疾病的初步评估结果。
{"title":"A deep convolutional neural network approach using medical image classification.","authors":"Mohammad Mousavi, Soodeh Hosseini","doi":"10.1186/s12911-024-02646-5","DOIUrl":"https://doi.org/10.1186/s12911-024-02646-5","url":null,"abstract":"<p><p>The epidemic diseases such as COVID-19 are rapidly spreading all around the world. The diagnosis of epidemic at initial stage is of high importance to provide medical care to and recovery of infected people as well as protecting the uninfected population. In this paper, an automatic COVID-19 detection model using respiratory sound and medical image based on internet of health things (IoHT) is proposed. In this model, primarily to screen those people having suspected Coronavirus disease, the sound of coughing used to detect healthy people and those suffering from COVID-19, which finally obtained an accuracy of 94.999%. This approach not only expedites diagnosis and enhances accuracy but also facilitates swift screening in public places using simple equipment. Then, in the second step, in order to help radiologists to interpret medical images as best as possible, we use three pre-trained convolutional neural network models InceptionResNetV2, InceptionV3 and EfficientNetB4 and two data sets of chest radiology medical images, and CT Scan in a three-class classification. Utilizing transfer learning and pre-existing knowledge in these models leads to notable improvements in disease diagnosis and identification compared to traditional techniques. Finally, the best result obtained for CT-Scan images belonging to InceptionResNetV2 architecture with 99.414% accuracy and for radiology images related to InceptionV3 and EfficientNetB4 architectures with the accuracy is 96.943%. Therefore, the proposed model can help radiology specialists to confirm the initial assessments of the COVID-19 disease.</p>","PeriodicalId":9340,"journal":{"name":"BMC Medical Informatics and Decision Making","volume":null,"pages":null},"PeriodicalIF":3.3,"publicationDate":"2024-08-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11360845/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142104626","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
From admission to discharge: a systematic review of clinical natural language processing along the patient journey. 从入院到出院:患者就医过程中临床自然语言处理的系统回顾。
IF 3.3 3区 医学 Q2 MEDICAL INFORMATICS Pub Date : 2024-08-29 DOI: 10.1186/s12911-024-02641-w
Katrin Klug, Katharina Beckh, Dario Antweiler, Nilesh Chakraborty, Giulia Baldini, Katharina Laue, René Hosch, Felix Nensa, Martin Schuler, Sven Giesselbach

Background: Medical text, as part of an electronic health record, is an essential information source in healthcare. Although natural language processing (NLP) techniques for medical text are developing fast, successful transfer into clinical practice has been rare. Especially the hospital domain offers great potential while facing several challenges including many documents per patient, multiple departments and complex interrelated processes.

Methods: In this work, we survey relevant literature to identify and classify approaches which exploit NLP in the clinical context. Our contribution involves a systematic mapping of related research onto a prototypical patient journey in the hospital, along which medical documents are created, processed and consumed by hospital staff and patients themselves. Specifically, we reviewed which dataset types, dataset languages, model architectures and tasks are researched in current clinical NLP research. Additionally, we extract and analyze major obstacles during development and implementation. We discuss options to address them and argue for a focus on bias mitigation and model explainability.

Results: While a patient's hospital journey produces a significant amount of structured and unstructured documents, certain steps and documents receive more research attention than others. Diagnosis, Admission and Discharge are clinical patient steps that are researched often across the surveyed paper. In contrast, our findings reveal significant under-researched areas such as Treatment, Billing, After Care, and Smart Home. Leveraging NLP in these stages can greatly enhance clinical decision-making and patient outcomes. Additionally, clinical NLP models are mostly based on radiology reports, discharge letters and admission notes, even though we have shown that many other documents are produced throughout the patient journey. There is a significant opportunity in analyzing a wider range of medical documents produced throughout the patient journey to improve the applicability and impact of NLP in healthcare.

Conclusions: Our findings suggest that there is a significant opportunity to leverage NLP approaches to advance clinical decision-making systems, as there remains a considerable understudied potential for the analysis of patient journey data.

背景:医学文本作为电子健康记录的一部分,是医疗保健领域的重要信息来源。尽管针对医疗文本的自然语言处理(NLP)技术发展迅速,但成功应用于临床实践的却很少。特别是在医院领域,该技术具有巨大的潜力,但同时也面临着一些挑战,包括每个病人有许多文件、多个部门和复杂的相互关联的流程:在这项工作中,我们对相关文献进行了调查,对在临床环境中利用 NLP 的方法进行了识别和分类。我们的贡献包括将相关研究系统地映射到医院中的病人旅程原型上,在这个旅程中,医院员工和病人自己创建、处理和消费医疗文件。具体来说,我们回顾了当前临床 NLP 研究中的数据集类型、数据集语言、模型架构和任务。此外,我们还提取并分析了开发和实施过程中的主要障碍。我们讨论了解决这些问题的方案,并主张将重点放在减少偏差和模型的可解释性上:虽然患者的住院过程会产生大量的结构化和非结构化文档,但某些步骤和文档比其他步骤和文档受到更多的研究关注。诊断、入院和出院是调查论文中经常研究的临床患者步骤。相比之下,我们的研究结果显示,治疗、账单、后期护理和智能家居等领域的研究明显不足。在这些阶段利用 NLP 可以大大提高临床决策和患者疗效。此外,临床 NLP 模型大多基于放射学报告、出院信和入院记录,尽管我们已经证明,在整个患者治疗过程中还会产生许多其他文档。对患者整个就医过程中产生的更广泛的医疗文件进行分析,是提高 NLP 在医疗保健领域的适用性和影响力的重要机会:我们的研究结果表明,利用 NLP 方法推动临床决策系统的发展大有可为,因为对患者旅程数据进行分析的潜力仍未得到充分挖掘。
{"title":"From admission to discharge: a systematic review of clinical natural language processing along the patient journey.","authors":"Katrin Klug, Katharina Beckh, Dario Antweiler, Nilesh Chakraborty, Giulia Baldini, Katharina Laue, René Hosch, Felix Nensa, Martin Schuler, Sven Giesselbach","doi":"10.1186/s12911-024-02641-w","DOIUrl":"https://doi.org/10.1186/s12911-024-02641-w","url":null,"abstract":"<p><strong>Background: </strong>Medical text, as part of an electronic health record, is an essential information source in healthcare. Although natural language processing (NLP) techniques for medical text are developing fast, successful transfer into clinical practice has been rare. Especially the hospital domain offers great potential while facing several challenges including many documents per patient, multiple departments and complex interrelated processes.</p><p><strong>Methods: </strong>In this work, we survey relevant literature to identify and classify approaches which exploit NLP in the clinical context. Our contribution involves a systematic mapping of related research onto a prototypical patient journey in the hospital, along which medical documents are created, processed and consumed by hospital staff and patients themselves. Specifically, we reviewed which dataset types, dataset languages, model architectures and tasks are researched in current clinical NLP research. Additionally, we extract and analyze major obstacles during development and implementation. We discuss options to address them and argue for a focus on bias mitigation and model explainability.</p><p><strong>Results: </strong>While a patient's hospital journey produces a significant amount of structured and unstructured documents, certain steps and documents receive more research attention than others. Diagnosis, Admission and Discharge are clinical patient steps that are researched often across the surveyed paper. In contrast, our findings reveal significant under-researched areas such as Treatment, Billing, After Care, and Smart Home. Leveraging NLP in these stages can greatly enhance clinical decision-making and patient outcomes. Additionally, clinical NLP models are mostly based on radiology reports, discharge letters and admission notes, even though we have shown that many other documents are produced throughout the patient journey. There is a significant opportunity in analyzing a wider range of medical documents produced throughout the patient journey to improve the applicability and impact of NLP in healthcare.</p><p><strong>Conclusions: </strong>Our findings suggest that there is a significant opportunity to leverage NLP approaches to advance clinical decision-making systems, as there remains a considerable understudied potential for the analysis of patient journey data.</p>","PeriodicalId":9340,"journal":{"name":"BMC Medical Informatics and Decision Making","volume":null,"pages":null},"PeriodicalIF":3.3,"publicationDate":"2024-08-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11360876/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142104627","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
How successful is the CatBoost classifier in diagnosing different dental anomalies in patients via sella turcica and vertebral morphologic alteration? CatBoost 分类器在通过蝶鞍和脊椎形态改变诊断患者的不同牙科异常方面有多成功?
IF 3.3 3区 医学 Q2 MEDICAL INFORMATICS Pub Date : 2024-08-29 DOI: 10.1186/s12911-024-02643-8
Merve Gonca, Busra Beser Gul, Mehmet Fatih Sert

Background: To investigate how successfully the classification of patients with and without dental anomalies was achieved through four experiments involving different dental anomalies.

Methods: Lateral cephalometric radiographs (LCRs) from 526 individuals aged between 14 and 22 years were included. Four experiments involving different dental anomalies were created. Experiment 1 included the total dental anomaly group and control group (CG). Experiment 2 only had dental agenesis and a CG. Experiment 3 consisted of only palatally impacted canines and the CG. Experiment 4 comprised patients with various dental defects (transposition, hypodontia, agenesis-palatally affected canine, peg-shaped laterally, hyperdontia) and the CG. Twelve sella measurements and assessments of the ponticulus posticus and posterior arch deficiency were given as input. The target was to distinguish between anomalies and controls. The CatBoost algorithm was applied to classify patients with and without dental anomalies.

Results: In order from lowest to highest, the predictive accuracies of the experiments were as follows: experiment 4 < experiment 2 < experiment 3 < experiment 1. The sella area (SA) (mm2) was the most important variable in experiment 1. The most significant variable in prediction model of experiment 2 was sella height posterior (SHP) (mm). Sella area (SA) (mm2) was again the most relevant variable in experiment 3. The most important variable in experiment 4 was sella height median (SHM) (mm).

Conclusions: Every prediction model from the four experiments prioritized different variables. These findings may suggest that related research should focus on specific traits from a diagnostic perspective.

背景:通过四次涉及不同牙齿畸形的实验,研究有牙齿畸形和没有牙齿畸形的患者如何成功分类:通过四次涉及不同牙齿畸形的实验,研究如何成功地对有牙齿畸形和没有牙齿畸形的患者进行分类:方法:研究对象包括 526 名年龄在 14 至 22 岁之间的患者的头颅侧位X光片(LCR)。创建了四个涉及不同牙齿畸形的实验。实验 1 包括牙齿全部畸形组和对照组(CG)。实验 2 只包括牙齿缺失组和对照组。实验 3 只包括腭撞击性犬齿和对照组。实验 4 包括有各种牙齿缺陷的患者(转位、牙齿发育不全、腭部受影响的犬齿发育不全、侧面呈桩状、牙齿发育过度)和对照组。作为输入,对 12 个蝶鞍进行了测量,并对后腭和后牙弓缺损进行了评估。目标是区分异常和对照组。应用 CatBoost 算法对牙科异常和非牙科异常患者进行分类:实验结果:实验的预测准确率从低到高依次为:实验 4 结论:四个实验中的每个预测模型都优先考虑了牙列畸形:四个实验中的每个预测模型都优先考虑了不同的变量。这些发现可能表明,相关研究应从诊断的角度关注特定的特征。
{"title":"How successful is the CatBoost classifier in diagnosing different dental anomalies in patients via sella turcica and vertebral morphologic alteration?","authors":"Merve Gonca, Busra Beser Gul, Mehmet Fatih Sert","doi":"10.1186/s12911-024-02643-8","DOIUrl":"https://doi.org/10.1186/s12911-024-02643-8","url":null,"abstract":"<p><strong>Background: </strong>To investigate how successfully the classification of patients with and without dental anomalies was achieved through four experiments involving different dental anomalies.</p><p><strong>Methods: </strong>Lateral cephalometric radiographs (LCRs) from 526 individuals aged between 14 and 22 years were included. Four experiments involving different dental anomalies were created. Experiment 1 included the total dental anomaly group and control group (CG). Experiment 2 only had dental agenesis and a CG. Experiment 3 consisted of only palatally impacted canines and the CG. Experiment 4 comprised patients with various dental defects (transposition, hypodontia, agenesis-palatally affected canine, peg-shaped laterally, hyperdontia) and the CG. Twelve sella measurements and assessments of the ponticulus posticus and posterior arch deficiency were given as input. The target was to distinguish between anomalies and controls. The CatBoost algorithm was applied to classify patients with and without dental anomalies.</p><p><strong>Results: </strong>In order from lowest to highest, the predictive accuracies of the experiments were as follows: experiment 4 < experiment 2 < experiment 3 < experiment 1. The sella area (SA) (mm2) was the most important variable in experiment 1. The most significant variable in prediction model of experiment 2 was sella height posterior (SHP) (mm). Sella area (SA) (mm2) was again the most relevant variable in experiment 3. The most important variable in experiment 4 was sella height median (SHM) (mm).</p><p><strong>Conclusions: </strong>Every prediction model from the four experiments prioritized different variables. These findings may suggest that related research should focus on specific traits from a diagnostic perspective.</p>","PeriodicalId":9340,"journal":{"name":"BMC Medical Informatics and Decision Making","volume":null,"pages":null},"PeriodicalIF":3.3,"publicationDate":"2024-08-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11360316/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142104555","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Development and validation of a machine learning-based model to assess probability of systemic inflammatory response syndrome in patients with severe multiple traumas. 开发并验证基于机器学习的模型,以评估严重多发性创伤患者患全身炎症反应综合征的概率。
IF 3.3 3区 医学 Q2 MEDICAL INFORMATICS Pub Date : 2024-08-27 DOI: 10.1186/s12911-024-02640-x
Alexander Prokazyuk, Aidos Tlemissov, Marat Zhanaspayev, Sabina Aubakirova, Arman Mussabekov

Background: Systemic inflammatory response syndrome (SIRS) is a predictor of serious infectious complications, organ failure, and death in patients with severe polytrauma and is one of the reasons for delaying early total surgical treatment. To determine the risk of SIRS within 24 h after hospitalization, we developed six machine learning models.

Materials and methods: Using retrospective data about the patient, the nature of the injury, the results of general and standard biochemical blood tests, and coagulation tests, six models were developed: decision tree, random forest, logistic regression, support vector and gradient boosting classifiers, logistic regressor, and neural network. The effectiveness of the models was assessed through internal and external validation.

Results: Among the 439 selected patients with severe polytrauma in 230 (52.4%), SIRS was diagnosed within the first 24 h of hospitalization. The SIRS group was more strongly associated with class II bleeding (39.5% vs. 60.5%; OR 1.81 [95% CI: 1.23-2.65]; P = 0.0023), long-term vasopressor use (68.4% vs. 31.6%; OR 5.51 [95% CI: 2.37-5.23]; P < 0.0001), risk of acute coagulopathy (67.8% vs. 32.2%; OR 2.4 [95% CI: 1.55-3.77]; P < 0.0001), and greater risk of pneumonia (59.5% vs. 40.5%; OR 1.74 [95% CI: 1.19-2.54]; P = 0.0042), longer ICU length of stay (5 ± 6.3 vs. 2.7 ± 4.3 days; P < 0.0001) and mortality rate (64.5% vs. 35.5%; OR 10.87 [95% CI: 6.3-19.89]; P = 0.0391). Of all the models, the random forest classifier showed the best predictive ability in the internal (AUROC 0.89; 95% CI: 0.83-0.96) and external validation (AUROC 0.83; 95% CI: 0.75-0.91) datasets.

Conclusions: The developed model made it possible to accurately predict the risk of developing SIRS in the early period after injury, allowing clinical specialists to predict patient management tactics and calculate medication and staffing needs for the patient.

Level of evidence: Level 3.

Trial registration: The study was retrospectively registered in the ClinicalTrials.gov database of the National Library of Medicine (NCT06323096).

背景:全身炎症反应综合征(SIRS)是严重多发性创伤患者出现严重感染并发症、器官衰竭和死亡的预测因素,也是延误早期手术治疗的原因之一。为了确定住院后 24 小时内发生 SIRS 的风险,我们开发了六个机器学习模型:利用患者的回顾性数据、损伤性质、一般和标准生化血液检测结果以及凝血检测结果,我们开发了六种模型:决策树、随机森林、逻辑回归、支持向量和梯度提升分类器、逻辑回归器和神经网络。通过内部和外部验证评估了模型的有效性:在 439 名入选的严重多发性创伤患者中,有 230 人(52.4%)在住院后 24 小时内被诊断出 SIRS。SIRS组与II级出血(39.5% vs. 60.5%;OR 1.81 [95% CI:1.23-2.65];P = 0.0023)、长期使用血管加压剂(68.4% vs. 31.6%;OR 5.51 [95% CI:2.37-5.23];P 结论:所开发的模型能够准确预测多发性创伤患者的SIRS:所开发的模型可以准确预测伤后早期发生 SIRS 的风险,使临床专家可以预测患者管理策略,并计算患者的用药和人员需求:证据等级:3 级:该研究以回顾性方式在美国国家医学图书馆的 ClinicalTrials.gov 数据库中注册(NCT06323096)。
{"title":"Development and validation of a machine learning-based model to assess probability of systemic inflammatory response syndrome in patients with severe multiple traumas.","authors":"Alexander Prokazyuk, Aidos Tlemissov, Marat Zhanaspayev, Sabina Aubakirova, Arman Mussabekov","doi":"10.1186/s12911-024-02640-x","DOIUrl":"10.1186/s12911-024-02640-x","url":null,"abstract":"<p><strong>Background: </strong>Systemic inflammatory response syndrome (SIRS) is a predictor of serious infectious complications, organ failure, and death in patients with severe polytrauma and is one of the reasons for delaying early total surgical treatment. To determine the risk of SIRS within 24 h after hospitalization, we developed six machine learning models.</p><p><strong>Materials and methods: </strong>Using retrospective data about the patient, the nature of the injury, the results of general and standard biochemical blood tests, and coagulation tests, six models were developed: decision tree, random forest, logistic regression, support vector and gradient boosting classifiers, logistic regressor, and neural network. The effectiveness of the models was assessed through internal and external validation.</p><p><strong>Results: </strong>Among the 439 selected patients with severe polytrauma in 230 (52.4%), SIRS was diagnosed within the first 24 h of hospitalization. The SIRS group was more strongly associated with class II bleeding (39.5% vs. 60.5%; OR 1.81 [95% CI: 1.23-2.65]; P = 0.0023), long-term vasopressor use (68.4% vs. 31.6%; OR 5.51 [95% CI: 2.37-5.23]; P < 0.0001), risk of acute coagulopathy (67.8% vs. 32.2%; OR 2.4 [95% CI: 1.55-3.77]; P < 0.0001), and greater risk of pneumonia (59.5% vs. 40.5%; OR 1.74 [95% CI: 1.19-2.54]; P = 0.0042), longer ICU length of stay (5 ± 6.3 vs. 2.7 ± 4.3 days; P < 0.0001) and mortality rate (64.5% vs. 35.5%; OR 10.87 [95% CI: 6.3-19.89]; P = 0.0391). Of all the models, the random forest classifier showed the best predictive ability in the internal (AUROC 0.89; 95% CI: 0.83-0.96) and external validation (AUROC 0.83; 95% CI: 0.75-0.91) datasets.</p><p><strong>Conclusions: </strong>The developed model made it possible to accurately predict the risk of developing SIRS in the early period after injury, allowing clinical specialists to predict patient management tactics and calculate medication and staffing needs for the patient.</p><p><strong>Level of evidence: </strong>Level 3.</p><p><strong>Trial registration: </strong>The study was retrospectively registered in the ClinicalTrials.gov database of the National Library of Medicine (NCT06323096).</p>","PeriodicalId":9340,"journal":{"name":"BMC Medical Informatics and Decision Making","volume":null,"pages":null},"PeriodicalIF":3.3,"publicationDate":"2024-08-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11351256/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142079158","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Optimizing protein sequence classification: integrating deep learning models with Bayesian optimization for enhanced biological analysis. 优化蛋白质序列分类:将深度学习模型与贝叶斯优化相结合,增强生物分析能力。
IF 3.3 3区 医学 Q2 MEDICAL INFORMATICS Pub Date : 2024-08-27 DOI: 10.1186/s12911-024-02631-y
Umesh Kumar Lilhore, Sarita Simiaya, Musaed Alhussein, Neetu Faujdar, Surjeet Dalal, Khursheed Aurangzeb

Efforts to enhance the accuracy of protein sequence classification are of utmost importance in driving forward biological analyses and facilitating significant medical advancements. This study presents a cutting-edge model called ProtICNN-BiLSTM, which combines attention-based Improved Convolutional Neural Networks (ICNN) and Bidirectional Long Short-Term Memory (BiLSTM) units seamlessly. Our main goal is to improve the accuracy of protein sequence classification by carefully optimizing performance through Bayesian Optimisation. ProtICNN-BiLSTM combines the power of CNN and BiLSTM architectures to effectively capture local and global protein sequence dependencies. In the proposed model, the ICNN component uses convolutional operations to identify local patterns. Captures long-range associations by analyzing sequence data forward and backwards. In advanced biological studies, Bayesian Optimisation optimizes model hyperparameters for efficiency and robustness. The model was extensively confirmed with PDB-14,189 and other protein data. We found that ProtICNN-BiLSTM outperforms traditional categorization models. Bayesian Optimization's fine-tuning and seamless integration of local and global sequence information make it effective. The precision of ProtICNN-BiLSTM improves comparative protein sequence categorization. The study improves computational bioinformatics for complex biological analysis. Good results from the ProtICNN-BiLSTM model improve protein sequence categorization. This powerful tool could improve medical and biological research. The breakthrough protein sequence classification model is ProtICNN-BiLSTM. Bayesian optimization, ICNN, and BiLSTM analyze biological data accurately.

努力提高蛋白质序列分类的准确性对于推动生物分析和促进重大医学进步至关重要。本研究提出了一种名为 ProtICNN-BiLSTM 的前沿模型,它将基于注意力的改进卷积神经网络(ICNN)和双向长短期记忆(BiLSTM)单元完美地结合在一起。我们的主要目标是通过贝叶斯优化技术精心优化性能,提高蛋白质序列分类的准确性。ProtICNN-BiLSTM 结合了 CNN 和 BiLSTM 架构的强大功能,能有效捕捉局部和全局的蛋白质序列依赖关系。在提议的模型中,ICNN 组件使用卷积运算来识别局部模式。通过正向和反向分析序列数据,捕捉长程关联。在高级生物学研究中,贝叶斯优化法(Bayesian Optimisation)可优化模型超参数,以提高效率和鲁棒性。我们利用 PDB-14,189 和其他蛋白质数据对该模型进行了广泛验证。我们发现,ProtICNN-BiLSTM 优于传统的分类模型。贝叶斯优化的微调和局部与全局序列信息的无缝整合使其非常有效。ProtICNN-BiLSTM 的精确度提高了蛋白质序列的比较分类。这项研究改进了复杂生物分析中的计算生物信息学。ProtICNN-BiLSTM 模型的良好结果改进了蛋白质序列分类。这一强大的工具可以改善医学和生物学研究。具有突破性的蛋白质序列分类模型是 ProtICNN-BiLSTM。贝叶斯优化、ICNN 和 BiLSTM 可精确分析生物数据。
{"title":"Optimizing protein sequence classification: integrating deep learning models with Bayesian optimization for enhanced biological analysis.","authors":"Umesh Kumar Lilhore, Sarita Simiaya, Musaed Alhussein, Neetu Faujdar, Surjeet Dalal, Khursheed Aurangzeb","doi":"10.1186/s12911-024-02631-y","DOIUrl":"10.1186/s12911-024-02631-y","url":null,"abstract":"<p><p>Efforts to enhance the accuracy of protein sequence classification are of utmost importance in driving forward biological analyses and facilitating significant medical advancements. This study presents a cutting-edge model called ProtICNN-BiLSTM, which combines attention-based Improved Convolutional Neural Networks (ICNN) and Bidirectional Long Short-Term Memory (BiLSTM) units seamlessly. Our main goal is to improve the accuracy of protein sequence classification by carefully optimizing performance through Bayesian Optimisation. ProtICNN-BiLSTM combines the power of CNN and BiLSTM architectures to effectively capture local and global protein sequence dependencies. In the proposed model, the ICNN component uses convolutional operations to identify local patterns. Captures long-range associations by analyzing sequence data forward and backwards. In advanced biological studies, Bayesian Optimisation optimizes model hyperparameters for efficiency and robustness. The model was extensively confirmed with PDB-14,189 and other protein data. We found that ProtICNN-BiLSTM outperforms traditional categorization models. Bayesian Optimization's fine-tuning and seamless integration of local and global sequence information make it effective. The precision of ProtICNN-BiLSTM improves comparative protein sequence categorization. The study improves computational bioinformatics for complex biological analysis. Good results from the ProtICNN-BiLSTM model improve protein sequence categorization. This powerful tool could improve medical and biological research. The breakthrough protein sequence classification model is ProtICNN-BiLSTM. Bayesian optimization, ICNN, and BiLSTM analyze biological data accurately.</p>","PeriodicalId":9340,"journal":{"name":"BMC Medical Informatics and Decision Making","volume":null,"pages":null},"PeriodicalIF":3.3,"publicationDate":"2024-08-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11351277/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142079159","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
BMC Medical Informatics and Decision Making
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1