Intelligence-based medicine最新文献_第5页

A Convolutional Neural Network- Based Deep Learning To Detect Reticulocytes From Human Peripheral Blood 基于卷积神经网络的深度学习检测人体外周血中的网状细胞

Intelligence-based medicine

Pub Date : 2024-01-01 DOI: 10.1016/j.ibmed.2024.100175

Keerthy Reghunandanan , V.S. Lakshmi , Rose Raj , Kasi Viswanath , Christeen Davis , Rajesh Chandramohanadas

Machine learning approaches are rapidly augmenting, and in some cases, replacing the conventional methods in biomedical data analysis; to reduce time, cost, biases, and the need for sophisticated analytical platforms. Hence, significant interest has been compounded in the integration of automated image analysis for various clinical applications, such as the detection of infected or inflamed wounds, bone fractures or for the purpose of disease diagnosis – such as Plasmodium parasites or circulating tumour cells in blood. Here, we report the development of a Convolutional Neural Network (CNN)-based method on CPU to distinguish and count immature human red blood cells known as reticulocytes from blood smears. Reticulocytes represent a heterogeneous and relatively small percentage of cells in peripheral blood, and contain residual RNA in complex with proteins which generates thread-like patterns when stained with New Methylene Blue (NMB) dye. We used more than 200 NMB-stained images from leukocyte-depleted blood to train and optimize the model for immature reticulocytes (stained positive with NMB, intensity and pattern of which depends on the developmental stage of the reticulocyte) and mature RBCs (no staining with NMB). The training performance evaluation metrics demonstrated a mean average precision (mAP50) of 0.88, a precision of 0.83, a recall of 0.88, and an F1 score of 0.87. Our model was able to successfully count reticulocytes with accuracy more than 90% from unknown samples which were subsequently cross-verified through microscopy and counting. Given the importance of reticulocyte maturation and its clinical relevance, the newly developed model will find important, easy to adopt biomedical applications that can be achieved on a simple PC.

机器学习方法正在迅速增强，并在某些情况下取代生物医学数据分析的传统方法，以减少时间、成本、偏差和对复杂分析平台的需求。因此，人们更加关注将自动图像分析整合到各种临床应用中，如检测感染或发炎的伤口、骨折，或用于疾病诊断--如疟原虫寄生虫或血液中的循环肿瘤细胞。在此，我们报告了在中央处理器上开发的基于卷积神经网络（CNN）的方法，用于从血液涂片中区分和计数未成熟的人类红细胞（称为网状红细胞）。网织红细胞在外周血细胞中占相对较小的比例，且具有异质性，含有与蛋白质复合的残留 RNA，在新亚甲蓝（NMB）染料染色时会产生线状图案。我们使用了 200 多张来自去白细胞血液的 NMB 染色图像，对未成熟网织红细胞（NMB 染色呈阳性，染色强度和模式取决于网织红细胞的发育阶段）和成熟网织红细胞（NMB 染色不呈阳性）进行了模型训练和优化。训练性能评估指标显示，平均精度（mAP50）为 0.88，精确度为 0.83，召回率为 0.88，F1 得分为 0.87。我们的模型能够成功地对未知样本中的网状细胞进行计数，准确率超过 90%，随后通过显微镜和计数进行交叉验证。鉴于网织红细胞成熟的重要性及其临床相关性，新开发的模型将在生物医学领域找到重要而易于采用的应用，而且只需一台简单的个人电脑即可实现。

{"title":"A Convolutional Neural Network- Based Deep Learning To Detect Reticulocytes From Human Peripheral Blood","authors":"Keerthy Reghunandanan , V.S. Lakshmi , Rose Raj , Kasi Viswanath , Christeen Davis , Rajesh Chandramohanadas","doi":"10.1016/j.ibmed.2024.100175","DOIUrl":"10.1016/j.ibmed.2024.100175","url":null,"abstract":"<div><div>Machine learning approaches are rapidly augmenting, and in some cases, replacing the conventional methods in biomedical data analysis; to reduce time, cost, biases, and the need for sophisticated analytical platforms. Hence, significant interest has been compounded in the integration of automated image analysis for various clinical applications, such as the detection of infected or inflamed wounds, bone fractures or for the purpose of disease diagnosis – such as <em>Plasmodium</em> parasites or circulating tumour cells in blood. Here, we report the development of a Convolutional Neural Network (CNN)-based method on CPU to distinguish and count immature human red blood cells known as reticulocytes from blood smears. Reticulocytes represent a heterogeneous and relatively small percentage of cells in peripheral blood, and contain residual RNA in complex with proteins which generates thread-like patterns when stained with New Methylene Blue (NMB) dye. We used more than 200 NMB-stained images from leukocyte-depleted blood to train and optimize the model for immature reticulocytes (stained positive with NMB, intensity and pattern of which depends on the developmental stage of the reticulocyte) and mature RBCs (no staining with NMB). The training performance evaluation metrics demonstrated a mean average precision (mAP50) of 0.88, a precision of 0.83, a recall of 0.88, and an F1 score of 0.87. Our model was able to successfully count reticulocytes with accuracy more than 90% from unknown samples which were subsequently cross-verified through microscopy and counting. Given the importance of reticulocyte maturation and its clinical relevance, the newly developed model will find important, easy to adopt biomedical applications that can be achieved on a simple PC.</div></div>","PeriodicalId":73399,"journal":{"name":"Intelligence-based medicine","volume":"10 ","pages":"Article 100175"},"PeriodicalIF":0.0,"publicationDate":"2024-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142578622","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Prognostics for respiratory epidemic dynamics by multivariate gaidai risk assessment methodology 通过多变量外代风险评估方法进行呼吸道流行病动态预报

Intelligence-based medicine

Pub Date : 2024-01-01 DOI: 10.1016/j.ibmed.2024.100173

Oleg Gaidai , Hongchen Li , Yu Cao , Alia Ashraf , Yan Zhu

Introduction

current study introduces an accurate prediction spatiotemporal model for epidemic outbreaks risk assessment.

Methods

utilize state-of-the-art statistical methodology on raw/unfiltered clinical datasets. In order to provide trustworthy long-term forecasts of viral outbreak risks, this research suggests a novel biosystem bio-reliability approach that works particularly well for multi-regional biological, environmental, and public health systems that are monitored over a representative time-lapse.

Results

study made use of daily clinically reported patient counts from COVID-19 (SARS-CoV-2) throughout all impacted Dutch administrative areas. The objective of this research was to establish new benchmark for novel bio-reliability methodology that enables efficient risk analysis, based on recorded raw clinical patient numbers, with accounting for pertinent area mapping.

Remarks and significance

by effectively employing various clinical survey datasets that are now accessible, the proposed technique may be used for contemporary biomedical applications, as well as the general welfare.

导言：本研究介绍了一种用于流行病爆发风险评估的精确预测时空模型。方法是在原始/未过滤的临床数据集上采用最先进的统计方法。为了对病毒爆发风险进行可靠的长期预测，这项研究提出了一种新的生物系统生物可靠性方法，这种方法特别适用于在具有代表性的时间间隔内进行监测的多区域生物、环境和公共卫生系统。这项研究的目的是为新型生物可靠性方法建立新的基准，以便根据记录的原始临床患者人数进行有效的风险分析，同时考虑到相关的区域映射。 Remarks and significance by effectively using various clinical survey datasets that are now accessible, the proposed technique may be used for contemporary biomedical applications, as well as the general welfare.

{"title":"Prognostics for respiratory epidemic dynamics by multivariate gaidai risk assessment methodology","authors":"Oleg Gaidai , Hongchen Li , Yu Cao , Alia Ashraf , Yan Zhu","doi":"10.1016/j.ibmed.2024.100173","DOIUrl":"10.1016/j.ibmed.2024.100173","url":null,"abstract":"<div><h3>Introduction</h3><div>current study introduces an accurate prediction spatiotemporal model for epidemic outbreaks risk assessment.</div></div><div><h3>Methods</h3><div>utilize state-of-the-art statistical methodology on raw/unfiltered clinical datasets. In order to provide trustworthy long-term forecasts of viral outbreak risks, this research suggests a novel biosystem bio-reliability approach that works particularly well for multi-regional biological, environmental, and public health systems that are monitored over a representative time-lapse.</div></div><div><h3>Results</h3><div>study made use of daily clinically reported patient counts from COVID-19 (SARS-CoV-2) throughout all impacted Dutch administrative areas. The objective of this research was to establish new benchmark for novel bio-reliability methodology that enables efficient risk analysis, based on recorded raw clinical patient numbers, with accounting for pertinent area mapping.</div></div><div><h3>Remarks and significance</h3><div>by effectively employing various clinical survey datasets that are now accessible, the proposed technique may be used for contemporary biomedical applications, as well as the general welfare.</div></div>","PeriodicalId":73399,"journal":{"name":"Intelligence-based medicine","volume":"10 ","pages":"Article 100173"},"PeriodicalIF":0.0,"publicationDate":"2024-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142442514","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Exploring the business aspects of digital pathology, deep learning in cancers 探索数字病理学的业务方面，癌症中的深度学习

Intelligence-based medicine

Pub Date : 2024-01-01 DOI: 10.1016/j.ibmed.2024.100172

Arjun Reddy , Darnell K. Adrian Williams , Gillian Graifman , Nowair Hussain , Maytal Amiel , Tran Priscilla , Ali Haider , Bali Kumar Kavitesh , Austin Li , Leael Alishahian , Nichelle Perera , Corey Efros , Myoungmee Babu , Mathew Tharakan , Mill Etienne , Benson A. Babu

Introduction

Cancer remains one of the leading causes of morbidity and mortality worldwide. Deep learning in digital pathology has the potential to improve operational efficiency, costs, and care.

Methods

We searched Web of Science, Arxiv, MedRxiv, Embase, PubMed, DBLP, Google Scholar, IEEE Xplore, and Cochrane databases for whole slide imaging and deep learning articles published between 2019 and 2023. The final six articles were selected from 776 articles identified through an inclusion criterion.

Conclusion

Digital pathology services that utilize deep learning have the potential to enhance clinical workflow efficiencies and can have a positive impact on business activities. We anticipate cost reductions as deep learning technology advances and more companies enter the digital pathology ecosystem. However, the limited availability of business use cases, primarily due to publication bias, poses a challenge in medicine without clear examples to learn from.

导言癌症仍然是全球发病率和死亡率的主要原因之一。方法我们在Web of Science、Arxiv、MedRxiv、Embase、PubMed、DBLP、Google Scholar、IEEE Xplore和Cochrane数据库中搜索了2019年至2023年间发表的全切片成像和深度学习文章。结论利用深度学习的数字化病理服务有可能提高临床工作流程的效率，并对业务活动产生积极影响。随着深度学习技术的发展和更多公司进入数字病理生态系统，我们预计成本将会降低。然而，主要由于出版物的偏见，商业用例的可用性有限，这给没有明确实例可借鉴的医学界带来了挑战。

{"title":"Exploring the business aspects of digital pathology, deep learning in cancers","authors":"Arjun Reddy , Darnell K. Adrian Williams , Gillian Graifman , Nowair Hussain , Maytal Amiel , Tran Priscilla , Ali Haider , Bali Kumar Kavitesh , Austin Li , Leael Alishahian , Nichelle Perera , Corey Efros , Myoungmee Babu , Mathew Tharakan , Mill Etienne , Benson A. Babu","doi":"10.1016/j.ibmed.2024.100172","DOIUrl":"10.1016/j.ibmed.2024.100172","url":null,"abstract":"<div><h3>Introduction</h3><div>Cancer remains one of the leading causes of morbidity and mortality worldwide. Deep learning in digital pathology has the potential to improve operational efficiency, costs, and care.</div></div><div><h3>Methods</h3><div>We searched Web of Science, Arxiv, MedRxiv, Embase, PubMed, DBLP, Google Scholar, IEEE Xplore, and Cochrane databases for whole slide imaging and deep learning articles published between 2019 and 2023. The final six articles were selected from 776 articles identified through an inclusion criterion.</div></div><div><h3>Conclusion</h3><div>Digital pathology services that utilize deep learning have the potential to enhance clinical workflow efficiencies and can have a positive impact on business activities. We anticipate cost reductions as deep learning technology advances and more companies enter the digital pathology ecosystem. However, the limited availability of business use cases, primarily due to publication bias, poses a challenge in medicine without clear examples to learn from.</div></div>","PeriodicalId":73399,"journal":{"name":"Intelligence-based medicine","volume":"10 ","pages":"Article 100172"},"PeriodicalIF":0.0,"publicationDate":"2024-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142586836","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Developing a decision model to early predict ICU admission for COVID-19 patients: A machine learning approach 开发决策模型，及早预测 COVID-19 患者入住重症监护室的情况：机器学习方法

Intelligence-based medicine

Pub Date : 2024-01-01 DOI: 10.1016/j.ibmed.2024.100136

Abdulaziz Ahmed , Ferhat D. Zengul , Sheena Khan , Kristine R. Hearld , Sue S. Feldman , Allyson G. Hall , Gregory N. Orewa , James Willig , Kierstin Kennedy

Emergency department (ED) overcrowding is a significant problem in the US. This paper develops a decision model to mitigate ED overcrowding by helping hospitals proactively plan patient boarding processes. The information obtained after the initial assessment of COVID-19 patients in the ED, including patient demographics and medical history, is utilized to predict ICU admission earlier. The predicted information can be communicated with the inpatient unit to prepare an ICU bed for the patients who need ICU care. As a result, the boarding time when patients wait for an ICU bed to be ready can be reduced. The data used in this study included 100 features and 19,155 COVID-19 patients from an academic medical center located in the Southeast United States. Multiple feature selection methods along with Extreme Gradient Boosting (XGBoost) were utilized to develop the models. The parameters of the XGBoost models are optimized using simulated annealing (SA). Among the proposed models, the best model included ten features and resulted in an area under the curve (AUC) of 89.2%, which is the highest among the models proposed in the literature. The proposed prediction model allows hospital administrators to allocate ICU beds more efficiently, enhance patient flow, and mitigate ED overcrowding.

急诊科（ED）过度拥挤是美国的一个严重问题。本文开发了一个决策模型，通过帮助医院主动规划病人登机流程来缓解急诊室过度拥挤的问题。利用在急诊室对 COVID-19 患者进行初步评估后获得的信息（包括患者的人口统计学特征和病史），可以提前预测 ICU 入院情况。预测信息可与住院部沟通，为需要重症监护室护理的病人准备重症监护室床位。因此，病人等待 ICU 病床准备就绪的登机时间可以缩短。本研究使用的数据包括来自美国东南部一家学术医疗中心的 100 个特征和 19,155 名 COVID-19 患者。模型的开发采用了多种特征选择方法和极梯度提升（XGBoost）技术。XGBoost 模型的参数通过模拟退火（SA）进行优化。在提出的模型中，最佳模型包括十个特征，其曲线下面积（AUC）为 89.2%，是文献中提出的模型中最高的。所提出的预测模型能让医院管理者更有效地分配重症监护室床位，提高病人流量，缓解急诊室过度拥挤的问题。

{"title":"Developing a decision model to early predict ICU admission for COVID-19 patients: A machine learning approach","authors":"Abdulaziz Ahmed , Ferhat D. Zengul , Sheena Khan , Kristine R. Hearld , Sue S. Feldman , Allyson G. Hall , Gregory N. Orewa , James Willig , Kierstin Kennedy","doi":"10.1016/j.ibmed.2024.100136","DOIUrl":"https://doi.org/10.1016/j.ibmed.2024.100136","url":null,"abstract":"<div><p>Emergency department (ED) overcrowding is a significant problem in the US. This paper develops a decision model to mitigate ED overcrowding by helping hospitals proactively plan patient boarding processes. The information obtained after the initial assessment of COVID-19 patients in the ED, including patient demographics and medical history, is utilized to predict ICU admission earlier. The predicted information can be communicated with the inpatient unit to prepare an ICU bed for the patients who need ICU care. As a result, the boarding time when patients wait for an ICU bed to be ready can be reduced. The data used in this study included 100 features and 19,155 COVID-19 patients from an academic medical center located in the Southeast United States. Multiple feature selection methods along with Extreme Gradient Boosting (XGBoost) were utilized to develop the models. The parameters of the XGBoost models are optimized using simulated annealing (SA). Among the proposed models, the best model included ten features and resulted in an area under the curve (AUC) of 89.2%, which is the highest among the models proposed in the literature. The proposed prediction model allows hospital administrators to allocate ICU beds more efficiently, enhance patient flow, and mitigate ED overcrowding<strong>.</strong></p></div>","PeriodicalId":73399,"journal":{"name":"Intelligence-based medicine","volume":"9 ","pages":"Article 100136"},"PeriodicalIF":0.0,"publicationDate":"2024-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2666521224000036/pdfft?md5=fb2565bfa79e72f25f4e18b5603f990e&pid=1-s2.0-S2666521224000036-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140327936","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Clustering polycystic ovary syndrome laboratory results extracted from a large internet forum with machine learning 利用机器学习对从大型互联网论坛中提取的多囊卵巢综合征实验室结果进行聚类

Intelligence-based medicine

Pub Date : 2024-01-01 DOI: 10.1016/j.ibmed.2024.100135

Rebecca H.K. Emanuel , Paul D. Docherty , Helen Lunt , Rua Murray , Rebecca E. Campbell

Background

Polycystic Ovary Syndrome (PCOS) is reported to affect between 4% and 21% of reproductive aged people with ovaries. It is a heterogeneous condition with a lack of established phenotypes that address the range of reproductive and metabolic features present in PCOS. These reproductive and metabolic features may result in patients undergoing a variety of relevant laboratory tests. Previous work has led to the gathering of laboratory test results from a PCOS specific forum, hosted on a website called reddit.

Objectives

In this paper, laboratory results and body mass index (BMI) posted on the PCOS reddit forum were clustered to show the usefulness of the PCOS forum for PCOS research and validate existing PCOS phenotypes or discover other appropriate phenotypes.

Methods and results

Over 1500 sets of PCOS-related reddit laboratory test results and BMIs were clustered using nearest neighbour imputation and K-means clustering. However, only non-imputed data was included in the final clusters. Kernel Density Estimation plots were used to display the distinct clusters. The clustered test results suggested the existence of distinct metabolic and reproductive phenotypes, as well as a group displaying mild features of both types of dysregulations and a group skewed towards normal results. It was also possible to separate the groups further into distinct hypothyroid groups within the mixed dysregulation group and to separate insulin resistant and diabetes-like groups within the metabolic group.

Conclusions

This research further validates the usefulness of exploring alternate data sources in the age of the internet and machine learning. The reddit clusters reinforced the existing notion that people with PCOS can be separated into a primarily metabolic pathology group, a primarily reproductive pathology group and an in between group with pathology in both domains.

背景据报道，多囊卵巢综合症（PCOS）影响到 4% 到 21% 的育龄卵巢患者。多囊卵巢综合征是一种异质性疾病，缺乏针对多囊卵巢综合征一系列生殖和代谢特征的既定表型。这些生殖和代谢特征可能导致患者接受各种相关的实验室检查。本文对 PCOS reddit 论坛上发布的实验室结果和体重指数 (BMI) 进行了聚类，以显示 PCOS 论坛对 PCOS 研究的有用性，并验证现有的 PCOS 表型或发现其他合适的表型。方法和结果使用近邻估算和 K-means 聚类对 1500 多组与 PCOS 相关的 reddit 实验室测试结果和 BMI 进行了聚类。不过，最终的聚类只包括非估算数据。核密度估计图用于显示不同的聚类。聚类测试结果表明，存在不同的代谢和生殖表型，一组显示出两种类型失调的轻微特征，另一组则偏向于正常结果。这项研究进一步验证了在互联网和机器学习时代探索其他数据源的实用性。reddit 聚类加强了现有的概念，即多囊卵巢综合症患者可分为以代谢病理为主的组别、以生殖病理为主的组别以及在两个领域都有病理的介于两者之间的组别。

{"title":"Clustering polycystic ovary syndrome laboratory results extracted from a large internet forum with machine learning","authors":"Rebecca H.K. Emanuel , Paul D. Docherty , Helen Lunt , Rua Murray , Rebecca E. Campbell","doi":"10.1016/j.ibmed.2024.100135","DOIUrl":"https://doi.org/10.1016/j.ibmed.2024.100135","url":null,"abstract":"<div><h3>Background</h3><p>Polycystic Ovary Syndrome (PCOS) is reported to affect between 4% and 21% of reproductive aged people with ovaries. It is a heterogeneous condition with a lack of established phenotypes that address the range of reproductive and metabolic features present in PCOS. These reproductive and metabolic features may result in patients undergoing a variety of relevant laboratory tests. Previous work has led to the gathering of laboratory test results from a PCOS specific forum, hosted on a website called reddit.</p></div><div><h3>Objectives</h3><p>In this paper, laboratory results and body mass index (BMI) posted on the PCOS reddit forum were clustered to show the usefulness of the PCOS forum for PCOS research and validate existing PCOS phenotypes or discover other appropriate phenotypes.</p></div><div><h3>Methods and results</h3><p>Over 1500 sets of PCOS-related reddit laboratory test results and BMIs were clustered using nearest neighbour imputation and K-means clustering. However, only non-imputed data was included in the final clusters. Kernel Density Estimation plots were used to display the distinct clusters. The clustered test results suggested the existence of distinct metabolic and reproductive phenotypes, as well as a group displaying mild features of both types of dysregulations and a group skewed towards normal results. It was also possible to separate the groups further into distinct hypothyroid groups within the mixed dysregulation group and to separate insulin resistant and diabetes-like groups within the metabolic group.</p></div><div><h3>Conclusions</h3><p>This research further validates the usefulness of exploring alternate data sources in the age of the internet and machine learning. The reddit clusters reinforced the existing notion that people with PCOS can be separated into a primarily metabolic pathology group, a primarily reproductive pathology group and an in between group with pathology in both domains.</p></div>","PeriodicalId":73399,"journal":{"name":"Intelligence-based medicine","volume":"9 ","pages":"Article 100135"},"PeriodicalIF":0.0,"publicationDate":"2024-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2666521224000024/pdfft?md5=87b2d688b9b327bd7f8d3d181ee40e71&pid=1-s2.0-S2666521224000024-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140134211","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Machine learning prediction of Dice similarity coefficient for validation of deformable image registration 用于验证可变形图像配准的 Dice 相似性系数的机器学习预测

Intelligence-based medicine

Pub Date : 2024-01-01 DOI: 10.1016/j.ibmed.2024.100163

Yun Ming Wong , Ping Lin Yeap , Ashley Li Kuan Ong , Jeffrey Kit Loong Tuan , Wen Siang Lew , James Cheow Lei Lee , Hong Qi Tan

Introduction

Deformable image registration (DIR) plays a vital role in adaptive radiotherapy (ART). For the clinical implementation of DIR, evaluation of deformation accuracy is a critical step. While contour-based metrics, for example Dice similarity coefficient (DSC), are widely implemented for DIR validation, they require delineation of contours which is time-consuming and would cause hold-ups in an ART workflow. Therefore, this work aims to accomplish the prediction of DSC using various metrics based on deformation vector field (DVF) by applying machine learning (ML), in order to provide an efficient means of DIR validation with minimised human intervention.

Methods

Planning CT image was deformed to the cone-beam CT images for 20 prostate cancer patients. Various DVF-based metrics and DSC were calculated, and the former was used as input features to predict the latter using three ML models, namely linear regression (LR), Nu Support Vector Regression (NuSVR) and Random Forest Regressor (RFR). Four datasets were used for analysis: 1) prostate, 2) bladder, 3) rectum and 4) all the organs combined. Average mean absolute error (MAE) was computed to evaluate the model performance. The classification performance of the best-performing model was further evaluated, and the prediction interval and feature importance were calculated.

Results

Overall, RFR achieved the lowest average MAE, ranging between 0.045 and 0.069 for the four datasets, while LR and NuSVR had slightly poorer performances. Analysis on the results of best-performing model showed that sensitivity and specificity of 0.86 and 0.51, respectively, were obtained when a prediction threshold of 0.85 was used to classify the fourth dataset. Jacobian determinant was found to be a significant contributor to the predictions of all four datasets using this model.

Conclusion

This study demonstrated the potential of several ML models, especially RFR, to be applied for prediction of DSC to speed up the DIR validation process.

导言可变形图像配准（DIR）在自适应放射治疗（ART）中发挥着重要作用。对于 DIR 的临床应用来说，评估变形精度是一个关键步骤。虽然基于轮廓的指标（如 Dice 相似度系数 (DSC)）被广泛用于 DIR 验证，但它们需要划定轮廓，而划定轮廓非常耗时，会导致 ART 工作流程停滞。因此，这项工作旨在通过应用机器学习（ML），使用基于形变矢量场（DVF）的各种指标来完成 DSC 的预测，从而提供一种有效的 DIR 验证方法，最大限度地减少人为干预。计算各种基于 DVF 的指标和 DSC，并使用三种 ML 模型（即线性回归 (LR)、Nu 支持向量回归 (NuSVR) 和随机森林回归器 (RFR)）将前者作为输入特征来预测后者。分析使用了四个数据集：1）前列腺；2）膀胱；3）直肠；4）所有器官组合。计算平均绝对误差（MAE）来评估模型性能。结果总体而言，RFR 的平均绝对误差最小，四个数据集的平均绝对误差在 0.045 到 0.069 之间，而 LR 和 NuSVR 的表现稍差。对表现最佳模型结果的分析表明，当使用 0.85 的预测阈值对第四个数据集进行分类时，灵敏度和特异度分别为 0.86 和 0.51。结论这项研究证明了几种 ML 模型（尤其是 RFR）在预测 DSC 方面的应用潜力，从而加快了 DIR 验证过程。

{"title":"Machine learning prediction of Dice similarity coefficient for validation of deformable image registration","authors":"Yun Ming Wong , Ping Lin Yeap , Ashley Li Kuan Ong , Jeffrey Kit Loong Tuan , Wen Siang Lew , James Cheow Lei Lee , Hong Qi Tan","doi":"10.1016/j.ibmed.2024.100163","DOIUrl":"10.1016/j.ibmed.2024.100163","url":null,"abstract":"<div><h3>Introduction</h3><p>Deformable image registration (DIR) plays a vital role in adaptive radiotherapy (ART). For the clinical implementation of DIR, evaluation of deformation accuracy is a critical step. While contour-based metrics, for example Dice similarity coefficient (DSC), are widely implemented for DIR validation, they require delineation of contours which is time-consuming and would cause hold-ups in an ART workflow. Therefore, this work aims to accomplish the prediction of DSC using various metrics based on deformation vector field (DVF) by applying machine learning (ML), in order to provide an efficient means of DIR validation with minimised human intervention.</p></div><div><h3>Methods</h3><p>Planning CT image was deformed to the cone-beam CT images for 20 prostate cancer patients. Various DVF-based metrics and DSC were calculated, and the former was used as input features to predict the latter using three ML models, namely linear regression (LR), Nu Support Vector Regression (NuSVR) and Random Forest Regressor (RFR). Four datasets were used for analysis: 1) prostate, 2) bladder, 3) rectum and 4) all the organs combined. Average mean absolute error (MAE) was computed to evaluate the model performance. The classification performance of the best-performing model was further evaluated, and the prediction interval and feature importance were calculated.</p></div><div><h3>Results</h3><p>Overall, RFR achieved the lowest average MAE, ranging between 0.045 and 0.069 for the four datasets, while LR and NuSVR had slightly poorer performances. Analysis on the results of best-performing model showed that sensitivity and specificity of 0.86 and 0.51, respectively, were obtained when a prediction threshold of 0.85 was used to classify the fourth dataset. Jacobian determinant was found to be a significant contributor to the predictions of all four datasets using this model.</p></div><div><h3>Conclusion</h3><p>This study demonstrated the potential of several ML models, especially RFR, to be applied for prediction of DSC to speed up the DIR validation process.</p></div>","PeriodicalId":73399,"journal":{"name":"Intelligence-based medicine","volume":"10 ","pages":"Article 100163"},"PeriodicalIF":0.0,"publicationDate":"2024-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2666521224000309/pdfft?md5=92ebfdf38ebfa5ad2817955b2f352129&pid=1-s2.0-S2666521224000309-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141985703","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

The impact of artificial intelligence on large vessel occlusion stroke detection and management: A systematic review meta-analysis 人工智能对大血管闭塞性卒中检测和管理的影响：系统综述荟萃分析

Intelligence-based medicine

Pub Date : 2024-01-01 DOI: 10.1016/j.ibmed.2024.100161

Elan Zebrowitz , Sonali Dadoo , Paige Brabant , Anaz Uddin , Esewi Aifuwa , Danielle Maraia , Mill Etienne , Neriy Yakubov , Myoungmee Babu , Benson Babu

Introduction

Stroke remains the second leading cause of death worldwide, with many survivors facing significant disabilities. In acute stroke care, the timeless adage 'Time is brain' underscores the vital need for quick action. Innovative Artificial Intelligence (AI) technology potentially enables swift detection and management of acute ischemic strokes, revolutionizing acute stroke care towards enhanced automation.

Methods

The study is registered with Prospero under CRD42024496716 and adheres to the Problem, Intervention, Comparison, and Outcomes framework (PICO). The analysis used Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) guidelines. We searched Embase, PubMed, DBLP, Google Scholar, IEEE Xplore, Cochrane database, IEEE, Web of Science, ArXiv, MedRxiv, and Semantic Scholar. The articles included were published between 2019 and 2023. Out of 1528 articles identified, thirty-seven met the inclusion criteria.

Results

We compared AI-augmented Large Vessel Occlusion (LVO) detection and non-AI LVO detection in various patient processing times related to emergent endovascular therapy in acute ischemic strokes. Triage Time, Door-to-Intervention Notification Time (INR), and Door-to -Arterial Puncture Time revealed an odds ratio (OR) of 0.39 (95 % CI: 0.29–0.54, p < 0.001), 0.30 (95 % CI: 0.21–0.42, p < 0.001), and 0.50 (95 % CI: 0detection 0.30–0.82, p = 0.007), respectively -- all of which had negligible heterogeneity (I^2 = 0). CT-to-Puncture-Time and Door-to-CTA-Time yielded an OR of 0.57 (95 % CI: 0.31–1.04, p = 0.065) and 0.77 (95 % CI: 0.37–1.60, p = 0.489), respectively -- both of which had negligible heterogeneity (I^2 = 0). The Last Known Well (LWK) to Time of Arrival resulted in an OR of 1.15 (95 % CI: 0.83–1.59, p = 0.409, I^2 = 0). AI stroke detection sensitivity OR of 0.91 (95 % CI: 0.88–0.95, p < 0.001) should be interpreted with potential heterogeneity in mind (I^2 = 69.3). National Institute of Health score (NIHSS) mean of 16.20 (95 % CI: 14.96–17.45, p = 0.001, I^2 = 0). Patient Transfer-Times between primary and comprehensive stroke centers generated an OR of 0.98 (95 % CI: 0.73–1.32, p = 893, I^2 = 0). Similarly, Door-in-Door-Out Time (DIDO) had an OR of 1.19 (95 % CI: 0.21–6.88, p = 0.848) and low heterogeneity (I^2 = 5.1). The results indicated significant differences across several parameters between the AI augmentation and non-AI groups.

Conclusion

Our findings highlight how AI augments healthcare providers' ability to detect and manage strokes swiftly and accurately within acute care settings. As these technologies progress, healthcare organizations mature, and AI becomes more integrated into healthcare systems, longitudinal studies are critical in evaluating its impact on workflow efficiency, cost-effectiveness, and clinical outcomes.

引言脑卒中仍然是全球第二大死因，许多幸存者面临严重残疾。在急性脑卒中护理中，"时间就是大脑 "这句永恒的格言强调了快速行动的重要性。创新的人工智能（AI）技术有可能实现对急性缺血性脑卒中的快速检测和管理，彻底改变急性脑卒中护理，提高自动化水平。分析采用了系统综述和荟萃分析首选报告项目（PRISMA）指南。我们检索了Embase、PubMed、DBLP、Google Scholar、IEEE Xplore、Cochrane数据库、IEEE、Web of Science、ArXiv、MedRxiv和Semantic Scholar。收录的文章发表于 2019 年至 2023 年之间。结果我们比较了人工智能增强的大血管闭塞（LVO）检测和非人工智能增强的大血管闭塞检测在急性缺血性脑卒中紧急血管内治疗相关的各种患者处理时间。分诊时间、门到介入通知时间（INR）和门到动脉穿刺时间显示的几率比（OR）分别为 0.39（95 % CI：0.29-0.54，p < 0.001）、0.30（95 % CI：0.21-0.42，p <0.001）和 0.50（95 % CI：0detection 0.30-0.82，p = 0.007）--所有这些指标的异质性均可忽略不计（I^2 = 0）。CT-穿刺-时间和门-CTA-时间的OR值分别为0.57（95 % CI：0.31-1.04，p = 0.065）和0.77（95 % CI：0.37-1.60，p = 0.489），两者的异质性均可忽略不计（I^2 = 0）。最后已知井（LWK）到到达时间的 OR 值为 1.15（95 % CI：0.83-1.59，p = 0.409，I^2 = 0）。人工智能卒中检测灵敏度 OR 为 0.91（95 % CI：0.88-0.95，p < 0.001），在解释时应考虑潜在的异质性（I^2 = 69.3）。美国国立卫生研究院评分（NIHSS）平均值为 16.20（95 % CI：14.96-17.45，p = 0.001，I^2 = 0）。初级卒中中心与综合卒中中心之间的患者转运时间 OR 为 0.98（95 % CI：0.73-1.32，p = 893，I^2 = 0）。同样，门内-门外时间（DIDO）的 OR 值为 1.19（95 % CI：0.21-6.88，p = 0.848），异质性较低（I^2 = 5.1）。结果表明，人工智能增强组和非人工智能增强组在多个参数上存在明显差异。随着这些技术的进步、医疗机构的成熟以及人工智能与医疗系统的进一步整合，纵向研究对于评估其对工作流程效率、成本效益和临床结果的影响至关重要。

{"title":"The impact of artificial intelligence on large vessel occlusion stroke detection and management: A systematic review meta-analysis","authors":"Elan Zebrowitz , Sonali Dadoo , Paige Brabant , Anaz Uddin , Esewi Aifuwa , Danielle Maraia , Mill Etienne , Neriy Yakubov , Myoungmee Babu , Benson Babu","doi":"10.1016/j.ibmed.2024.100161","DOIUrl":"10.1016/j.ibmed.2024.100161","url":null,"abstract":"<div><h3>Introduction</h3><p>Stroke remains the second leading cause of death worldwide, with many survivors facing significant disabilities. In acute stroke care, the timeless adage 'Time is brain' underscores the vital need for quick action. Innovative Artificial Intelligence (AI) technology potentially enables swift detection and management of acute ischemic strokes, revolutionizing acute stroke care towards enhanced automation.</p></div><div><h3>Methods</h3><p>The study is registered with Prospero under CRD42024496716 and adheres to the Problem, Intervention, Comparison, and Outcomes framework (PICO). The analysis used Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) guidelines. We searched Embase, PubMed, DBLP, Google Scholar, IEEE Xplore, Cochrane database, IEEE, Web of Science, ArXiv, MedRxiv, and Semantic Scholar. The articles included were published between 2019 and 2023. Out of 1528 articles identified, thirty-seven met the inclusion criteria.</p></div><div><h3>Results</h3><p>We compared AI-augmented Large Vessel Occlusion (LVO) detection and non-AI LVO detection in various patient processing times related to emergent endovascular therapy in acute ischemic strokes. Triage Time, Door-to-Intervention Notification Time (INR), and Door-to -Arterial Puncture Time revealed an odds ratio (OR) of 0.39 (95 % CI: 0.29–0.54, p < 0.001), 0.30 (95 % CI: 0.21–0.42, p < 0.001), and 0.50 (95 % CI: 0detection 0.30–0.82, p = 0.007), respectively -- all of which had negligible heterogeneity (I^2 = 0). CT-to-Puncture-Time and Door-to-CTA-Time yielded an OR of 0.57 (95 % CI: 0.31–1.04, p = 0.065) and 0.77 (95 % CI: 0.37–1.60, p = 0.489), respectively -- both of which had negligible heterogeneity (I^2 = 0). The Last Known Well (LWK) to Time of Arrival resulted in an OR of 1.15 (95 % CI: 0.83–1.59, p = 0.409, I^2 = 0). AI stroke detection sensitivity OR of 0.91 (95 % CI: 0.88–0.95, p < 0.001) should be interpreted with potential heterogeneity in mind (I^2 = 69.3). National Institute of Health score (NIHSS) mean of 16.20 (95 % CI: 14.96–17.45, p = 0.001, I^2 = 0). Patient Transfer-Times between primary and comprehensive stroke centers generated an OR of 0.98 (95 % CI: 0.73–1.32, p = 893, I^2 = 0). Similarly, Door-in-Door-Out Time (DIDO) had an OR of 1.19 (95 % CI: 0.21–6.88, p = 0.848) and low heterogeneity (I^2 = 5.1). The results indicated significant differences across several parameters between the AI augmentation and non-AI groups.</p></div><div><h3>Conclusion</h3><p>Our findings highlight how AI augments healthcare providers' ability to detect and manage strokes swiftly and accurately within acute care settings. As these technologies progress, healthcare organizations mature, and AI becomes more integrated into healthcare systems, longitudinal studies are critical in evaluating its impact on workflow efficiency, cost-effectiveness, and clinical outcomes.</p></div>","PeriodicalId":73399,"journal":{"name":"Intelligence-based medicine","volume":"10 ","pages":"Article 100161"},"PeriodicalIF":0.0,"publicationDate":"2024-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2666521224000280/pdfft?md5=06eaca45e04d34842726f805c16400c5&pid=1-s2.0-S2666521224000280-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142075755","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Continuous-discrete GeoSEIR(D) model for modelling and analysis of geo spread COVID-19 用于地质传播建模和分析的连续-离散 GeoSEIR(D) 模型 COVID-19

Intelligence-based medicine

Pub Date : 2024-01-01 DOI: 10.1016/j.ibmed.2024.100155

Yaroslav Vyklyuk , Denys Nevinskyi , Kateryna Hazdiuk

Humanity faces various types of viral infections, such as COVID-19, annually. In this paper, we propose a Geospatial SEIR(D) model based on a multi-agent approach with continuous-discrete states. This model accounts for key parameters of viral infections, daily human activities, and geodata. Our developed algorithms enable the simulation of statistical parameters such as the number of infected, recovered, deceased, and susceptible individuals, along with the spatial distribution of the pandemic on a geographical map. The model was validated by simulating the COVID-19 spread in Lviv, Ukraine. Several preventive strategies were analyzed: implementing a 50 % reduction in infection probability through mask mandates delayed the peak to 150 days with a 25 % reduction in the maximum number of patients, while a 75 % reduction delayed the peak to 240 days with a 60 % reduction in the maximum number of patients. Prohibiting public transport and public places resulted in the epidemic peaking on day 165 with 2854 patients, significantly reducing the spread rate compared to the base model. Simulating 50 %, 75 %, and 100 % vaccination rates showed a reduction in the peak number of infections by 34 %, 57 %, and 94 %, respectively, also extending the duration of the epidemic. Enforcing weekend quarantine delayed the epidemic onset by one month but had minimal impact on the overall number of infections and duration. Combining mask mandates, transport restrictions, and vaccination led to the most effective mitigation, with the average number of sick agents around 8 and never exceeding 15 over four years. This comprehensive approach highlights the effectiveness of combining various preventive measures to control the spread of viral infections. The proposed model provides a valuable tool for policymakers to evaluate and implement effective strategies against pandemics.

人类每年都会面临各种类型的病毒感染，如 COVID-19。在本文中，我们提出了一个地理空间 SEIR(D) 模型，该模型基于具有连续-离散状态的多代理方法。该模型考虑了病毒感染、人类日常活动和地理数据的关键参数。我们开发的算法可以模拟感染者、康复者、死亡者和易感人群的数量等统计参数，以及大流行病在地理图上的空间分布。该模型通过模拟 COVID-19 在乌克兰利沃夫的传播进行了验证。对几种预防策略进行了分析：通过口罩规定将感染概率降低 50%，可将高峰期推迟到 150 天，最大患者人数减少 25%；将感染概率降低 75%，可将高峰期推迟到 240 天，最大患者人数减少 60%。禁止公共交通和公共场所导致疫情在第 165 天达到峰值，患者人数为 2854 人，与基础模型相比，传播速度明显降低。模拟 50%、75% 和 100% 的疫苗接种率显示，感染高峰人数分别减少了 34%、57% 和 94%，同时也延长了疫情持续时间。实施周末检疫可将疫情爆发时间推迟一个月，但对总体感染人数和持续时间的影响微乎其微。将口罩规定、运输限制和疫苗接种结合起来，可以最有效地缓解疫情，在四年时间里，患病病原体的平均数量约为 8 个，从未超过 15 个。这种综合方法凸显了结合各种预防措施来控制病毒感染传播的有效性。所提出的模型为政策制定者评估和实施有效的大流行病防治战略提供了宝贵的工具。

{"title":"Continuous-discrete GeoSEIR(D) model for modelling and analysis of geo spread COVID-19","authors":"Yaroslav Vyklyuk , Denys Nevinskyi , Kateryna Hazdiuk","doi":"10.1016/j.ibmed.2024.100155","DOIUrl":"https://doi.org/10.1016/j.ibmed.2024.100155","url":null,"abstract":"<div><p>Humanity faces various types of viral infections, such as COVID-19, annually. In this paper, we propose a Geospatial SEIR(D) model based on a multi-agent approach with continuous-discrete states. This model accounts for key parameters of viral infections, daily human activities, and geodata. Our developed algorithms enable the simulation of statistical parameters such as the number of infected, recovered, deceased, and susceptible individuals, along with the spatial distribution of the pandemic on a geographical map. The model was validated by simulating the COVID-19 spread in Lviv, Ukraine. Several preventive strategies were analyzed: implementing a 50 % reduction in infection probability through mask mandates delayed the peak to 150 days with a 25 % reduction in the maximum number of patients, while a 75 % reduction delayed the peak to 240 days with a 60 % reduction in the maximum number of patients. Prohibiting public transport and public places resulted in the epidemic peaking on day 165 with 2854 patients, significantly reducing the spread rate compared to the base model. Simulating 50 %, 75 %, and 100 % vaccination rates showed a reduction in the peak number of infections by 34 %, 57 %, and 94 %, respectively, also extending the duration of the epidemic. Enforcing weekend quarantine delayed the epidemic onset by one month but had minimal impact on the overall number of infections and duration. Combining mask mandates, transport restrictions, and vaccination led to the most effective mitigation, with the average number of sick agents around 8 and never exceeding 15 over four years. This comprehensive approach highlights the effectiveness of combining various preventive measures to control the spread of viral infections. The proposed model provides a valuable tool for policymakers to evaluate and implement effective strategies against pandemics.</p></div>","PeriodicalId":73399,"journal":{"name":"Intelligence-based medicine","volume":"10 ","pages":"Article 100155"},"PeriodicalIF":0.0,"publicationDate":"2024-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S266652122400022X/pdfft?md5=1a4623f90ada391f2f41e81336645d1e&pid=1-s2.0-S266652122400022X-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141582940","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Advancing delirium classification: A clinical notes-based natural language processing-supported machine learning model 推进谵妄分类：基于临床笔记的自然语言处理辅助机器学习模型

Intelligence-based medicine

Pub Date : 2024-01-01 DOI: 10.1016/j.ibmed.2024.100140

Sobia Amjad , Natasha E. Holmes , Kartik Kishore , Marcus Young , James Bailey , Rinaldo Bellomo , Karin Verspoor

Objective

The study of the epidemiology of delirium in hospitalized patients is challenging. We aimed to identify the presence or absence of delirium from clinical text notes using natural language processing (NLP) techniques and machine learning (ML) models.

Materials and methods

We developed a delirium predictive model using 942 clinical notes from hospitalized patients with an ICD-10 delirium hospital discharge code. Moreover, we implemented ML models using a) delirium-suggestive words from an expert-defined dictionary or b) free text in clinical notes. Both strategies considered positive and negative delirium-associated words.

Results

At the note level, for the dictionary method, the logistic regression model achieved an area under the receiver-operating curve (AUROC) of 0.917 for positive words and 0.914 for combined positive and negative words. The areas under the precision-recall curve (AUPR) were 0.893 and 0.897, respectively. For the free-text method, the model achieved an AUROC of 0.826 and 0.830 and AUPR of 0.852 and 0.856, respectively.

Discussion

NLP-based ML models accurately identified the presence of delirium in clinical notes. The dictionary-based method was superior to the free-text method. The use of negative features improved performance in both methods.

Conclusion

Our proposed NLP-based ML model identified delirium in clinical notes. This model could automatically screen millions of notes and facilitate the study of the epidemiology of in-hospital delirium.

目的研究住院患者谵妄的流行病学具有挑战性。我们的目的是利用自然语言处理（NLP）技术和机器学习（ML）模型从临床文本记录中识别是否存在谵妄。此外，我们还使用 a) 专家定义字典中的谵妄提示词或 b) 临床笔记中的自由文本，建立了 ML 模型。结果在笔记层面，对于字典方法，逻辑回归模型的接收者工作曲线下面积（AUROC）为 0.917（阳性词），而对于阳性词和阴性词的组合，接收者工作曲线下面积（AUROC）为 0.914。精确度-召回曲线下面积（AUPR）分别为 0.893 和 0.897。基于 NLP 的 ML 模型能准确识别临床笔记中是否存在谵妄。基于词典的方法优于自由文本方法。结论我们提出的基于 NLP 的 ML 模型可以识别临床笔记中的谵妄。该模型可以自动筛选数百万份病历，有助于研究院内谵妄的流行病学。

{"title":"Advancing delirium classification: A clinical notes-based natural language processing-supported machine learning model","authors":"Sobia Amjad , Natasha E. Holmes , Kartik Kishore , Marcus Young , James Bailey , Rinaldo Bellomo , Karin Verspoor","doi":"10.1016/j.ibmed.2024.100140","DOIUrl":"10.1016/j.ibmed.2024.100140","url":null,"abstract":"<div><h3>Objective</h3><p>The study of the epidemiology of delirium in hospitalized patients is challenging. We aimed to identify the presence or absence of delirium from clinical text notes using natural language processing (NLP) techniques and machine learning (ML) models.</p></div><div><h3>Materials and methods</h3><p>We developed a delirium predictive model using 942 clinical notes from hospitalized patients with an ICD-10 delirium hospital discharge code. Moreover, we implemented ML models using a) delirium-suggestive words from an expert-defined dictionary or b) free text in clinical notes. Both strategies considered positive and negative delirium-associated words.</p></div><div><h3>Results</h3><p>At the note level, for the dictionary method, the logistic regression model achieved an area under the receiver-operating curve (AUROC) of 0.917 for positive words and 0.914 for combined positive and negative words. The areas under the precision-recall curve (AUPR) were 0.893 and 0.897, respectively. For the free-text method, the model achieved an AUROC of 0.826 and 0.830 and AUPR of 0.852 and 0.856, respectively.</p></div><div><h3>Discussion</h3><p>NLP-based ML models accurately identified the presence of delirium in clinical notes. The dictionary-based method was superior to the free-text method. The use of negative features improved performance in both methods.</p></div><div><h3>Conclusion</h3><p>Our proposed NLP-based ML model identified delirium in clinical notes. This model could automatically screen millions of notes and facilitate the study of the epidemiology of in-hospital delirium.</p></div>","PeriodicalId":73399,"journal":{"name":"Intelligence-based medicine","volume":"9 ","pages":"Article 100140"},"PeriodicalIF":0.0,"publicationDate":"2024-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2666521224000073/pdfft?md5=0f2ca58608d82ae6134a46fa069cda02&pid=1-s2.0-S2666521224000073-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141047806","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Skin cancer detection using lightweight model souping and ensembling knowledge distillation for memory-constrained devices 为内存受限设备使用轻量级模型汤和集合知识提炼技术检测皮肤癌

Intelligence-based medicine

Pub Date : 2024-01-01 DOI: 10.1016/j.ibmed.2024.100176

Muhammad Rafsan Kabir, Rashidul Hassan Borshon, Mahiv Khan Wasi, Rafeed Mohammad Sultan, Ahmad Hossain, Riasat Khan

In contemporary times, the escalating prevalence of skin cancer is a significant concern, impacting numerous individuals. This work comprehensively explores advanced artificial intelligence-based deep learning techniques for skin cancer detection, utilizing the HAM10000 dataset. The experimental study fine-tunes two knowledge distillation teacher models, ResNet50 (25.6M) and DenseNet161 (28.7M), achieving remarkable accuracies of 98.32% and 98.80%, respectively. Despite their notable accuracy, the training and deployment of these large models pose significant challenges for implementation on memory-constrained medical devices. To address this issue, we introduce TinyStudent (0.35M), employing knowledge distillation from ResNet50 and DenseNet161, yielding accuracies of 85.45% and 85.00%, respectively. While TinyStudent may not achieve accuracies comparable to the teacher models, it is 82 and 73 times smaller than DenseNet161 and ResNet50, respectively, implying reduced training time and computational resource requirements. This significant reduction in the number of parameters makes it feasible to deploy the model on memory-constrained edge devices. Multi-teacher distillation, incorporating knowledge from both models, results in a competitive student accuracy of 84.10%. Ensembling methods, such as average ensembling and concatenation, further enhance predictive performances, achieving accuracies of 87.74% and 88.00%, respectively, each with approximately 1.05M parameters. Compared to DenseNet161 and ResNet50, these lightweight ensemble models offer shorter inference times, suitable for medical devices. Additionally, our implementation of the Greedy method in Model Soup establishes an accuracy of 85.70%.

在当代，皮肤癌发病率的不断攀升是一个重大问题，影响着无数人。这项研究利用 HAM10000 数据集，全面探索了用于皮肤癌检测的先进人工智能深度学习技术。实验研究对 ResNet50（25.6M）和 DenseNet161（28.7M）这两个知识提炼教师模型进行了微调，分别取得了 98.32% 和 98.80% 的显著准确率。尽管准确率很高，但这些大型模型的训练和部署对在内存受限的医疗设备上实施构成了巨大挑战。为了解决这个问题，我们引入了 TinyStudent (0.35M)，它采用了从 ResNet50 和 DenseNet161 中提炼的知识，准确率分别为 85.45% 和 85.00%。虽然 TinyStudent 的准确率可能无法与教师模型相提并论，但它的体积分别是 DenseNet161 和 ResNet50 的 82 倍和 73 倍，这意味着训练时间和计算资源需求都有所减少。参数数量的大幅减少使得在内存受限的边缘设备上部署该模型变得可行。多教师提炼法结合了两个模型的知识，使学生的准确率达到 84.10%，具有很强的竞争力。平均集合和串联等集合方法进一步提高了预测性能，在使用约 1.05M 个参数的情况下，准确率分别达到 87.74% 和 88.00%。与 DenseNet161 和 ResNet50 相比，这些轻量级集合模型的推理时间更短，适用于医疗设备。此外，我们在 Model Soup 中实施的 Greedy 方法的准确率达到了 85.70%。

{"title":"Skin cancer detection using lightweight model souping and ensembling knowledge distillation for memory-constrained devices","authors":"Muhammad Rafsan Kabir, Rashidul Hassan Borshon, Mahiv Khan Wasi, Rafeed Mohammad Sultan, Ahmad Hossain, Riasat Khan","doi":"10.1016/j.ibmed.2024.100176","DOIUrl":"10.1016/j.ibmed.2024.100176","url":null,"abstract":"<div><div>In contemporary times, the escalating prevalence of skin cancer is a significant concern, impacting numerous individuals. This work comprehensively explores advanced artificial intelligence-based deep learning techniques for skin cancer detection, utilizing the HAM10000 dataset. The experimental study fine-tunes two knowledge distillation teacher models, ResNet50 (25.6M) and DenseNet161 (28.7M), achieving remarkable accuracies of 98.32% and 98.80%, respectively. Despite their notable accuracy, the training and deployment of these large models pose significant challenges for implementation on memory-constrained medical devices. To address this issue, we introduce TinyStudent (0.35M), employing knowledge distillation from ResNet50 and DenseNet161, yielding accuracies of 85.45% and 85.00%, respectively. While TinyStudent may not achieve accuracies comparable to the teacher models, it is 82 and 73 times smaller than DenseNet161 and ResNet50, respectively, implying reduced training time and computational resource requirements. This significant reduction in the number of parameters makes it feasible to deploy the model on memory-constrained edge devices. Multi-teacher distillation, incorporating knowledge from both models, results in a competitive student accuracy of 84.10%. Ensembling methods, such as average ensembling and concatenation, further enhance predictive performances, achieving accuracies of 87.74% and 88.00%, respectively, each with approximately 1.05M parameters. Compared to DenseNet161 and ResNet50, these lightweight ensemble models offer shorter inference times, suitable for medical devices. Additionally, our implementation of the Greedy method in Model Soup establishes an accuracy of 85.70%.</div></div>","PeriodicalId":73399,"journal":{"name":"Intelligence-based medicine","volume":"10 ","pages":"Article 100176"},"PeriodicalIF":0.0,"publicationDate":"2024-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142532612","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0