Pub Date : 2024-02-04DOI: 10.1016/j.health.2024.100307
Snehal Rajput , Rupal Kapdi , Mohendra Roy , Mehul S. Raval
Automated segmentation methods can produce faster segmentation of tumors in medical images, aiding medical professionals in diagnosis and treatment plans. A 3D U-Net method excels in this task but has high computational costs due to large model parameters, which limits their application under resource constraints. This study targets an optimized triplanar (2.5D) model ensemble to generate accurate segmentation with fewer parameters. The proposed triplanar model uses spatial and channel attention mechanisms and information from multiple orthogonal planar views to predict segmentation labels. In particular, we studied the optimum filter size to improve the accuracy without increasing the network complexity. The model generated output is further post-processed to fine-tune the segmentation results. The Dice similarity coefficients (Dice-score) of the Brain Tumor Segmentation (BraTS) 2020 training set for enhancing tumor (ET), whole tumor (WT), and tumor core (TC) are 0.736, 0.896, and 0.841, whereas, for the validation set, they are 0.713, 0.873, and 0.778, respectively. The proposed base model has only parameters, three times less than BraTS 2020’s best-performing model (ET 0.798, WT 0.912, TC 0.857) on the validation set. The proposed ensemble model has parameters, 1.6 times less than the top-ranked model and two times less than the third-ranked model (ET 0.793, WT 0.911, TC 0.853 on validation set) of BraTS2020 challenge.
{"title":"A triplanar ensemble model for brain tumor segmentation with volumetric multiparametric magnetic resonance images","authors":"Snehal Rajput , Rupal Kapdi , Mohendra Roy , Mehul S. Raval","doi":"10.1016/j.health.2024.100307","DOIUrl":"https://doi.org/10.1016/j.health.2024.100307","url":null,"abstract":"<div><p>Automated segmentation methods can produce faster segmentation of tumors in medical images, aiding medical professionals in diagnosis and treatment plans. A 3D U-Net method excels in this task but has high computational costs due to large model parameters, which limits their application under resource constraints. This study targets an optimized triplanar (2.5D) model ensemble to generate accurate segmentation with fewer parameters. The proposed triplanar model uses spatial and channel attention mechanisms and information from multiple orthogonal planar views to predict segmentation labels. In particular, we studied the optimum filter size to improve the accuracy without increasing the network complexity. The model generated output is further post-processed to fine-tune the segmentation results. The Dice similarity coefficients (Dice-score) of the Brain Tumor Segmentation (BraTS) 2020 training set for enhancing tumor (ET), whole tumor (WT), and tumor core (TC) are 0.736, 0.896, and 0.841, whereas, for the validation set, they are 0.713, 0.873, and 0.778, respectively. The proposed base model has only <span><math><mrow><mn>10</mn><mo>.</mo><mn>25</mn><mspace></mspace><mi>M</mi></mrow></math></span> parameters, three times less than BraTS 2020’s best-performing model (ET 0.798, WT 0.912, TC 0.857) on the validation set. The proposed ensemble model has <span><math><mrow><mn>93</mn><mo>.</mo><mn>5</mn><mspace></mspace><mi>M</mi></mrow></math></span> parameters, 1.6 times less than the top-ranked model and two times less than the third-ranked model (ET 0.793, WT 0.911, TC 0.853 on validation set) of BraTS2020 challenge.</p></div>","PeriodicalId":73222,"journal":{"name":"Healthcare analytics (New York, N.Y.)","volume":"5 ","pages":"Article 100307"},"PeriodicalIF":0.0,"publicationDate":"2024-02-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2772442524000091/pdfft?md5=d29fc0533e483abd517c7cab8004bdcb&pid=1-s2.0-S2772442524000091-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139710235","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-02-01DOI: 10.1016/j.health.2024.100305
Fazla Rabbi , Debapriya Banik , Niamat Ullah Ibne Hossain , Alexandr Sokolov
Healthcare professionals must provide their patients with the best possible service and be well-informed and expert at carrying out complex surgical procedures to fulfill this responsibility. The aim of the medical treatments is fewer complications, shorter hospital stays, and a better patient experience. Through continuous learning and training, medical practitioners trained in up-to-date and state-of-the-art surgical techniques and technologies make productive and effective healthcare systems possible. Healthcare systems often report on problems with surgical processes, skipped procedures, unusual activities during operations, and lengthy transition times. This event log data allows implementing process mining methods to deliver medical professionals with simple and understandable findings using Petri nets for process analysis and enhancement. This study identifies the parallels and discrepancies between the pre-and post-stages and their respective frequency on each typical Central Venous Catheter (CVC) installation activity. The Process Mining for Python (PM4Py) frameworks used four major mining algorithms to view the event log (i.e., alpha miner, directly-follows graph (DFG), heuristic miner, and inductive miner). This study's findings indicate that medical residents are more susceptible to error during pre-operative procedures.
医护人员必须为患者提供尽可能最好的服务,并在实施复杂外科手术时掌握充分的信息和专业技能,以履行这一职责。医疗的目的是减少并发症、缩短住院时间和改善患者体验。通过不断的学习和培训,受过最新、最先进外科技术和工艺培训的医疗从业人员使医疗保健系统富有成效成为可能。医疗保健系统经常会报告手术过程中出现的问题、跳过的程序、手术过程中的异常活动以及冗长的过渡时间。利用这些事件日志数据,可以实施流程挖掘方法,使用 Petri 网为医疗专业人员提供简单易懂的结论,用于流程分析和改进。本研究确定了每个典型的中心静脉导管 (CVC) 安装活动的前后阶段之间的相似性和差异及其各自的频率。Python 进程挖掘(PM4Py)框架使用四种主要挖掘算法(即阿尔法挖掘器、直接跟踪图(DFG)、启发式挖掘器和归纳式挖掘器)来查看事件日志。这项研究的结果表明,住院医生在术前程序中更容易出错。
{"title":"Using process mining algorithms for process improvement in healthcare","authors":"Fazla Rabbi , Debapriya Banik , Niamat Ullah Ibne Hossain , Alexandr Sokolov","doi":"10.1016/j.health.2024.100305","DOIUrl":"https://doi.org/10.1016/j.health.2024.100305","url":null,"abstract":"<div><p>Healthcare professionals must provide their patients with the best possible service and be well-informed and expert at carrying out complex surgical procedures to fulfill this responsibility. The aim of the medical treatments is fewer complications, shorter hospital stays, and a better patient experience. Through continuous learning and training, medical practitioners trained in up-to-date and state-of-the-art surgical techniques and technologies make productive and effective healthcare systems possible. Healthcare systems often report on problems with surgical processes, skipped procedures, unusual activities during operations, and lengthy transition times. This event log data allows implementing process mining methods to deliver medical professionals with simple and understandable findings using Petri nets for process analysis and enhancement. This study identifies the parallels and discrepancies between the pre-and post-stages and their respective frequency on each typical Central Venous Catheter (CVC) installation activity. The Process Mining for Python (PM4Py) frameworks used four major mining algorithms to view the event log (i.e., alpha miner, directly-follows graph (DFG), heuristic miner, and inductive miner). This study's findings indicate that medical residents are more susceptible to error during pre-operative procedures.</p></div>","PeriodicalId":73222,"journal":{"name":"Healthcare analytics (New York, N.Y.)","volume":"5 ","pages":"Article 100305"},"PeriodicalIF":0.0,"publicationDate":"2024-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2772442524000078/pdfft?md5=d25e53aa28e307b96560fec95871fd89&pid=1-s2.0-S2772442524000078-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139674635","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-01-31DOI: 10.1016/j.health.2024.100306
Kaushik Dehingia , Yamen Alharbi , Vikas Pandey
This study addresses a tumor–macrophage interaction model to examine the role of the saturated response of M2 macrophages. We find the equilibrium point of the model and analyze local stability at each equilibrium. We show that tumor-free equilibrium is always stable, whereas, under certain conditions, the tumor-dominant and interior equilibrium are asymptotically stable. Moreover, stable and unstable limit cycles and period-doubling bifurcation have been observed at the interior equilibrium point. A remarkable result has been observed: in the presence of a saturated response of M2 macrophages, with a relatively higher activation rate of M2 macrophages due to tumor cells, the disease spreads more quickly in the body. Hence, M1 macrophages cannot stabilize the system, and aperiodic oscillations are observed. Furthermore, we show that a better immune response can reverse that system’s unstable nature. Numerical simulations verify the analytical results.
{"title":"A mathematical tumor growth model for exploring saturated response of M2 macrophages","authors":"Kaushik Dehingia , Yamen Alharbi , Vikas Pandey","doi":"10.1016/j.health.2024.100306","DOIUrl":"https://doi.org/10.1016/j.health.2024.100306","url":null,"abstract":"<div><p>This study addresses a tumor–macrophage interaction model to examine the role of the saturated response of M2 macrophages. We find the equilibrium point of the model and analyze local stability at each equilibrium. We show that tumor-free equilibrium is always stable, whereas, under certain conditions, the tumor-dominant and interior equilibrium are asymptotically stable. Moreover, stable and unstable limit cycles and period-doubling bifurcation have been observed at the interior equilibrium point. A remarkable result has been observed: in the presence of a saturated response of M2 macrophages, with a relatively higher activation rate of M2 macrophages due to tumor cells, the disease spreads more quickly in the body. Hence, M1 macrophages cannot stabilize the system, and aperiodic oscillations are observed. Furthermore, we show that a better immune response can reverse that system’s unstable nature. Numerical simulations verify the analytical results.</p></div>","PeriodicalId":73222,"journal":{"name":"Healthcare analytics (New York, N.Y.)","volume":"5 ","pages":"Article 100306"},"PeriodicalIF":0.0,"publicationDate":"2024-01-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S277244252400008X/pdfft?md5=56bbf5f1b26299586ec2ca78c05789d3&pid=1-s2.0-S277244252400008X-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139653032","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-01-22DOI: 10.1016/j.health.2024.100304
Kavyashree C. , H.S. Vimala , Shreyas J.
Oral cancer is a form of cancer that develops in the tissue of an oral cavity. Detection at an early stage is necessary to prevent the mortality rate in cancer patients. Artificial intelligence (AI) techniques play a significant role in assisting with diagnosing oral cancer. The AI techniques provide better detection accuracy and help automate oral cancer detection. The study shows that AI has a wide range of algorithms and provides outcomes in the most precise manner possible. We provide an overview of different input types and apply an appropriate algorithm to detect oral cancer. We aim to provide an overview of various AI techniques that can be used to automate oral cancer detection and to analyze these techniques to improve the efficiency and accuracy of oral cancer screening. We provide a summary of various methods available for oral cancer detection. We cover different input image formats, their processing, and the need for segmentation and feature extraction. We further include a list of other conventional strategies. We focus on various AI techniques for detecting oral cancer, including deep learning, machine learning, fuzzy computing, data mining, and genetic algorithms, and evaluates their benefits and drawbacks. The larger part of the articles focused on deep learning (37%) methods, followed by machine learning (32%), genetic algorithms (12%), data mining techniques (10%), and fuzzy computing (9%) for oral cancer detection.
{"title":"A systematic review of artificial intelligence techniques for oral cancer detection","authors":"Kavyashree C. , H.S. Vimala , Shreyas J.","doi":"10.1016/j.health.2024.100304","DOIUrl":"10.1016/j.health.2024.100304","url":null,"abstract":"<div><p>Oral cancer is a form of cancer that develops in the tissue of an oral cavity. Detection at an early stage is necessary to prevent the mortality rate in cancer patients. Artificial intelligence (AI) techniques play a significant role in assisting with diagnosing oral cancer. The AI techniques provide better detection accuracy and help automate oral cancer detection. The study shows that AI has a wide range of algorithms and provides outcomes in the most precise manner possible. We provide an overview of different input types and apply an appropriate algorithm to detect oral cancer. We aim to provide an overview of various AI techniques that can be used to automate oral cancer detection and to analyze these techniques to improve the efficiency and accuracy of oral cancer screening. We provide a summary of various methods available for oral cancer detection. We cover different input image formats, their processing, and the need for segmentation and feature extraction. We further include a list of other conventional strategies. We focus on various AI techniques for detecting oral cancer, including deep learning, machine learning, fuzzy computing, data mining, and genetic algorithms, and evaluates their benefits and drawbacks. The larger part of the articles focused on deep learning (37%) methods, followed by machine learning (32%), genetic algorithms (12%), data mining techniques (10%), and fuzzy computing (9%) for oral cancer detection.</p></div>","PeriodicalId":73222,"journal":{"name":"Healthcare analytics (New York, N.Y.)","volume":"5 ","pages":"Article 100304"},"PeriodicalIF":0.0,"publicationDate":"2024-01-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2772442524000066/pdfft?md5=4271ee0a4378ec8144ed336855cbfa61&pid=1-s2.0-S2772442524000066-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139636955","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-01-20DOI: 10.1016/j.health.2024.100303
F M Javed Mehedi Shamrat , Rashiduzzaman Shakil , Sharmin , Nazmul Hoque ovy , Bonna Akter , Md Zunayed Ahmed , Kawsar Ahmed , Francis M. Bui , Mohammad Ali Moni
Diabetic retinopathy (DR) involves retina damage due to diabetes, often leading to blindness. It is diagnosed via color fundus injections, but the manual analysis is cumbersome and error-prone. While computer vision techniques can predict DR stages, they are computationally intensive and struggle with complex data extraction. In this research, our prime objective was to automate the process of DR classification into its various stages using convolutional neural network (CNN) models. We employed the performance of fifteen pre-trained models with our novel proposed diabetic retinopathy network (DRNet13) model. We aimed to discern the most efficient model for accurate diabetic retinopathy (DR) staging based on fundus images from five DR classes. We preprocessed the image using a median filter for noise reduction and Gamma correction for image enhancement. We expanded our dataset from 3662 to 7500 images to create a more generalized training model through various augmentation techniques. We also evaluated multiple evaluation metrics, including accuracy, precision, F1-score, Sensitivity, Specificity, Area under the curve (AUC), Mean Squared Error (MSE), False Positive Rate (FPR), False Negative Rate (FNR), in addition to confusion matrices for an in-depth comparison of the performance of these models. Feature maps were employed to illuminate decision making areas in the DRNet13 model, which achieved a 97 % accuracy rate for DR detection, surpassing other CNN architectures in speed and efficiency. Despite a few misclassifications, the model's capability to identify critical features demonstrates its potential as an impactful diagnostic tool for timely and accurate identification of diabetic retinopathy.
糖尿病视网膜病变(DR)是指糖尿病引起的视网膜损伤,通常会导致失明。糖尿病视网膜病变可通过彩色眼底注射进行诊断,但人工分析既繁琐又容易出错。虽然计算机视觉技术可以预测 DR 的分期,但其计算量大,且难以进行复杂的数据提取。在这项研究中,我们的首要目标是利用卷积神经网络(CNN)模型将 DR 分类过程自动化,并将其分为不同阶段。我们采用了 15 个预先训练好的模型和我们提出的新型糖尿病视网膜病变网络 (DRNet13) 模型。我们的目标是根据五个糖尿病视网膜病变等级的眼底图像,找出最有效的模型,对糖尿病视网膜病变(DR)进行准确分期。我们使用中值滤波器对图像进行预处理以降低噪音,并使用伽马校正对图像进行增强。我们将数据集从 3662 张图像扩展到 7500 张图像,通过各种增强技术创建了更具通用性的训练模型。我们还评估了多个评价指标,包括准确度、精确度、F1 分数、灵敏度、特异度、曲线下面积 (AUC)、平均平方误差 (MSE)、假阳性率 (FPR)、假阴性率 (FNR),以及混淆矩阵,以深入比较这些模型的性能。DRNet13 模型采用了特征图来阐明决策区域,其 DR 检测准确率达到 97%,在速度和效率方面超过了其他 CNN 架构。尽管存在一些错误分类,但该模型识别关键特征的能力证明了其作为一种有影响力的诊断工具的潜力,可及时准确地识别糖尿病视网膜病变。
{"title":"An advanced deep neural network for fundus image analysis and enhancing diabetic retinopathy detection","authors":"F M Javed Mehedi Shamrat , Rashiduzzaman Shakil , Sharmin , Nazmul Hoque ovy , Bonna Akter , Md Zunayed Ahmed , Kawsar Ahmed , Francis M. Bui , Mohammad Ali Moni","doi":"10.1016/j.health.2024.100303","DOIUrl":"https://doi.org/10.1016/j.health.2024.100303","url":null,"abstract":"<div><p>Diabetic retinopathy (DR) involves retina damage due to diabetes, often leading to blindness. It is diagnosed via color fundus injections, but the manual analysis is cumbersome and error-prone. While computer vision techniques can predict DR stages, they are computationally intensive and struggle with complex data extraction. In this research, our prime objective was to automate the process of DR classification into its various stages using convolutional neural network (CNN) models. We employed the performance of fifteen pre-trained models with our novel proposed diabetic retinopathy network (DRNet13) model. We aimed to discern the most efficient model for accurate diabetic retinopathy (DR) staging based on fundus images from five DR classes. We preprocessed the image using a median filter for noise reduction and Gamma correction for image enhancement. We expanded our dataset from 3662 to 7500 images to create a more generalized training model through various augmentation techniques. We also evaluated multiple evaluation metrics, including accuracy, precision, F1-score, Sensitivity, Specificity, Area under the curve (AUC), Mean Squared Error (MSE), False Positive Rate (FPR), False Negative Rate (FNR), in addition to confusion matrices for an in-depth comparison of the performance of these models. Feature maps were employed to illuminate decision making areas in the DRNet13 model, which achieved a 97 % accuracy rate for DR detection, surpassing other CNN architectures in speed and efficiency. Despite a few misclassifications, the model's capability to identify critical features demonstrates its potential as an impactful diagnostic tool for timely and accurate identification of diabetic retinopathy.</p></div>","PeriodicalId":73222,"journal":{"name":"Healthcare analytics (New York, N.Y.)","volume":"5 ","pages":"Article 100303"},"PeriodicalIF":0.0,"publicationDate":"2024-01-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2772442524000054/pdfft?md5=d8486a0b7c2a66d37a79ca700f9d36fd&pid=1-s2.0-S2772442524000054-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139549042","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
This study introduces a novel fractional-order stochastic epidemic model to analyze the spread of conjunctivitis, a prevalent ocular infection, while accounting for the influence of media awareness on disease transmission. The model incorporates fractional derivatives to capture memory effects and non-local interactions inherent in epidemic processes, allowing for a more accurate representation of disease dynamics. The stability analysis of equilibrium points is carried out based on the basic reproduction number and fractional-order . Further, the Hopf bifurcation phenomenon is discussed in this paper. Stochasticity accounts for the randomness in transmission events. The findings of this study provide insights into the complex interrelationship between disease dynamics and media influence, shedding light on the role of public awareness in mitigating or exacerbating conjunctivitis outbreaks. The implications of this work extend to public health policy formulation, highlighting the importance of targeted communication strategies in controlling and preventing the spread of conjunctivitis and similar infectious diseases.
{"title":"A novel fractional-order stochastic epidemic model to analyze the role of media awareness in the spread of conjunctivitis","authors":"Shiv Mangal , Ebenezer Bonyah , Vijay Shankar Sharma , Y. Yuan","doi":"10.1016/j.health.2024.100302","DOIUrl":"https://doi.org/10.1016/j.health.2024.100302","url":null,"abstract":"<div><p>This study introduces a novel fractional-order stochastic epidemic model to analyze the spread of conjunctivitis, a prevalent ocular infection, while accounting for the influence of media awareness on disease transmission. The model incorporates fractional derivatives to capture memory effects and non-local interactions inherent in epidemic processes, allowing for a more accurate representation of disease dynamics. The stability analysis of equilibrium points is carried out based on the basic reproduction number <span><math><msub><mrow><mi>ℛ</mi></mrow><mrow><mn>0</mn></mrow></msub></math></span> and fractional-order <span><math><mi>α</mi></math></span>. Further, the Hopf bifurcation phenomenon is discussed in this paper. Stochasticity accounts for the randomness in transmission events. The findings of this study provide insights into the complex interrelationship between disease dynamics and media influence, shedding light on the role of public awareness in mitigating or exacerbating conjunctivitis outbreaks. The implications of this work extend to public health policy formulation, highlighting the importance of targeted communication strategies in controlling and preventing the spread of conjunctivitis and similar infectious diseases.</p></div>","PeriodicalId":73222,"journal":{"name":"Healthcare analytics (New York, N.Y.)","volume":"5 ","pages":"Article 100302"},"PeriodicalIF":0.0,"publicationDate":"2024-01-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2772442524000042/pdfft?md5=38829598f690a40a705f819fef29eef9&pid=1-s2.0-S2772442524000042-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139487305","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
This study introduces the first-ever self-explanatory interface for diagnosing diabetes patients using machine learning. We propose four classification models (Decision Tree (DT), K-nearest Neighbor (KNN), Support Vector Classification (SVC), and Extreme Gradient Boosting (XGB)) based on the publicly available diabetes dataset. To elucidate the inner workings of these models, we employed the machine learning interpretation method known as Shapley Additive Explanations (SHAP). All the models exhibited commendable accuracy in diagnosing patients with diabetes, with the XGB model showing a slight edge over the others. Utilising SHAP, we delved into the XGB model, providing in-depth insights into the reasoning behind its predictions at a granular level. Subsequently, we integrated the XGB model and SHAP’s local explanations into an interface to predict diabetes in patients. This interface serves a critical role as it diagnoses patients and offers transparent explanations for the decisions made, providing users with a heightened awareness of their current health conditions. Given the high-stakes nature of the medical field, this developed interface can be further enhanced by including more extensive clinical data, ultimately aiding medical professionals in their decision-making processes.
{"title":"A novel machine learning approach for diagnosing diabetes with a self-explainable interface","authors":"Gangani Dharmarathne , Thilini N. Jayasinghe , Madhusha Bogahawaththa , D.P.P. Meddage , Upaka Rathnayake","doi":"10.1016/j.health.2024.100301","DOIUrl":"https://doi.org/10.1016/j.health.2024.100301","url":null,"abstract":"<div><p>This study introduces the first-ever self-explanatory interface for diagnosing diabetes patients using machine learning. We propose four classification models (Decision Tree (DT), K-nearest Neighbor (KNN), Support Vector Classification (SVC), and Extreme Gradient Boosting (XGB)) based on the publicly available diabetes dataset. To elucidate the inner workings of these models, we employed the machine learning interpretation method known as Shapley Additive Explanations (SHAP). All the models exhibited commendable accuracy in diagnosing patients with diabetes, with the XGB model showing a slight edge over the others. Utilising SHAP, we delved into the XGB model, providing in-depth insights into the reasoning behind its predictions at a granular level. Subsequently, we integrated the XGB model and SHAP’s local explanations into an interface to predict diabetes in patients. This interface serves a critical role as it diagnoses patients and offers transparent explanations for the decisions made, providing users with a heightened awareness of their current health conditions. Given the high-stakes nature of the medical field, this developed interface can be further enhanced by including more extensive clinical data, ultimately aiding medical professionals in their decision-making processes.</p></div>","PeriodicalId":73222,"journal":{"name":"Healthcare analytics (New York, N.Y.)","volume":"5 ","pages":"Article 100301"},"PeriodicalIF":0.0,"publicationDate":"2024-01-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2772442524000030/pdfft?md5=494bc571d60d347c01d68d0c317c4288&pid=1-s2.0-S2772442524000030-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139487303","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-01-15DOI: 10.1016/j.health.2024.100300
Naba Kumar Goswami , Samson Olaniyi , Sulaimon F. Abimbade , Furaha M. Chuma
The coronavirus pandemic is a global health crisis creating an unprecedented socio-economic catastrophe. This pandemic is the biggest challenge the world has faced since World War II and is the main turning point in the history of humanity. Media coverage can change citizens’ attention to emerging infectious diseases and consequently change individual behaviors and attitudes. This study proposes and analyzes a seven-compartmental mathematical model to investigate the impact of media coverage on the spread and control of COVID-19. The threshold condition Ro for the initial transmission of infection is achieved by the next-generation approach. Stability analysis of the proposed model on disease-free and endemic equilibria is investigated in terms of basic reproduction numbers locally and globally. The sensitivity analysis of the reproduction number is visualized to distinguish the most sensitive parameters that can be regulated to control the transmission dynamics of coronavirus disease. Moreover, the theoretical results of the deterministic model are compared using numerical simulations. The outcomes of the analysis suggest that the disease prevalence can be terminated by suitable management of quarantine/medical care. We further extend the model to the optimal control framework. It is analyzed using Pontryagin’s maximum principle to characterize preventive control, testing facility, and treatment measures for managing COVID-19 transmission.
{"title":"A mathematical model for investigating the effect of media awareness programs on the spread of COVID-19 with optimal control","authors":"Naba Kumar Goswami , Samson Olaniyi , Sulaimon F. Abimbade , Furaha M. Chuma","doi":"10.1016/j.health.2024.100300","DOIUrl":"https://doi.org/10.1016/j.health.2024.100300","url":null,"abstract":"<div><p>The coronavirus pandemic is a global health crisis creating an unprecedented socio-economic catastrophe. This pandemic is the biggest challenge the world has faced since World War II and is the main turning point in the history of humanity. Media coverage can change citizens’ attention to emerging infectious diseases and consequently change individual behaviors and attitudes. This study proposes and analyzes a seven-compartmental mathematical model to investigate the impact of media coverage on the spread and control of COVID-19. The threshold condition Ro for the initial transmission of infection is achieved by the next-generation approach. Stability analysis of the proposed model on disease-free and endemic equilibria is investigated in terms of basic reproduction numbers locally and globally. The sensitivity analysis of the reproduction number is visualized to distinguish the most sensitive parameters that can be regulated to control the transmission dynamics of coronavirus disease. Moreover, the theoretical results of the deterministic model are compared using numerical simulations. The outcomes of the analysis suggest that the disease prevalence can be terminated by suitable management of quarantine/medical care. We further extend the model to the optimal control framework. It is analyzed using Pontryagin’s maximum principle to characterize preventive control, testing facility, and treatment measures for managing COVID-19 transmission.</p></div>","PeriodicalId":73222,"journal":{"name":"Healthcare analytics (New York, N.Y.)","volume":"5 ","pages":"Article 100300"},"PeriodicalIF":0.0,"publicationDate":"2024-01-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2772442524000029/pdfft?md5=181a72d948017369ae65a88b5750c988&pid=1-s2.0-S2772442524000029-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139487304","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-01-04DOI: 10.1016/j.health.2023.100293
Rownak Ara Rasul , Promy Saha , Diponkor Bala , S.M. Rakib Ul Karim , Md. Ibrahim Abdullah , Bishwajit Saha
Autistic Spectrum Disorder (ASD) is a neurological disease characterized by difficulties with social interaction, communication, and repetitive activities. While its primary origin lies in genetics, early detection is crucial, and leveraging machine learning offers a promising avenue for a faster and more cost-effective diagnosis. This study employs diverse machine learning methods to identify crucial ASD traits, aiming to enhance and automate the diagnostic process. We study eight state-of-the-art classification models to determine their effectiveness in ASD detection. We evaluate the models using accuracy, precision, recall, specificity, F1-score, area under the curve (AUC), kappa, and log loss metrics to find the best classifier for these binary datasets. Among all the classification models, for the children dataset, the SVM and LR models achieve the highest accuracy of 100% and for the adult dataset, the LR model produces the highest accuracy of 97.14%. Our proposed ANN model provides the highest accuracy of 94.24% for the new combined dataset when hyperparameters are precisely tuned for each model. As almost all classification models achieve high accuracy which utilize true labels, we become interested in delving into five popular clustering algorithms to understand model behavior in scenarios without true labels. We calculate Normalized Mutual Information (NMI), Adjusted Rand Index (ARI), and Silhouette Coefficient (SC) metrics to select the best clustering models. Our evaluation finds that spectral clustering outperforms all other benchmarking clustering models in terms of NMI and ARI metrics while demonstrating comparability to the optimal SC achieved by k-means. The implemented code is available at GitHub.
自闭症(ASD)是一种以社交、沟通和重复性活动困难为特征的神经系统疾病。虽然自闭症的主要病因在于遗传,但早期检测至关重要,而利用机器学习为更快、更具成本效益的诊断提供了一条大有可为的途径。本研究采用多种机器学习方法来识别 ASD 的关键特征,旨在提高诊断过程的效率和自动化程度。我们研究了八个最先进的分类模型,以确定它们在 ASD 检测中的有效性。我们使用准确度、精确度、召回率、特异性、F1-分数、曲线下面积(AUC)、卡帕和对数损失指标对模型进行评估,以找到这些二元数据集的最佳分类器。在所有分类模型中,对于儿童数据集,SVM 和 LR 模型的准确率最高,达到 100%;对于成人数据集,LR 模型的准确率最高,达到 97.14%。在对每个模型的超参数进行精确调整后,我们提出的 ANN 模型在新的组合数据集上的准确率最高,达到 94.24%。由于几乎所有使用真实标签的分类模型都能达到很高的准确率,因此我们有兴趣深入研究五种流行的聚类算法,以了解模型在无真实标签情况下的行为。我们计算归一化互信息(NMI)、调整后兰德指数(ARI)和轮廓系数(SC)指标来选择最佳聚类模型。我们的评估发现,就 NMI 和 ARI 指标而言,频谱聚类优于所有其他基准聚类模型,同时与 k-means 实现的最佳 SC 具有可比性。实现代码可在 GitHub 上获取。
{"title":"An evaluation of machine learning approaches for early diagnosis of autism spectrum disorder","authors":"Rownak Ara Rasul , Promy Saha , Diponkor Bala , S.M. Rakib Ul Karim , Md. Ibrahim Abdullah , Bishwajit Saha","doi":"10.1016/j.health.2023.100293","DOIUrl":"https://doi.org/10.1016/j.health.2023.100293","url":null,"abstract":"<div><p>Autistic Spectrum Disorder (ASD) is a neurological disease characterized by difficulties with social interaction, communication, and repetitive activities. While its primary origin lies in genetics, early detection is crucial, and leveraging machine learning offers a promising avenue for a faster and more cost-effective diagnosis. This study employs diverse machine learning methods to identify crucial ASD traits, aiming to enhance and automate the diagnostic process. We study eight state-of-the-art classification models to determine their effectiveness in ASD detection. We evaluate the models using accuracy, precision, recall, specificity, F1-score, area under the curve (AUC), kappa, and log loss metrics to find the best classifier for these binary datasets. Among all the classification models, for the children dataset, the SVM and LR models achieve the highest accuracy of 100% and for the adult dataset, the LR model produces the highest accuracy of 97.14%. Our proposed ANN model provides the highest accuracy of 94.24% for the new combined dataset when hyperparameters are precisely tuned for each model. As almost all classification models achieve high accuracy which utilize true labels, we become interested in delving into five popular clustering algorithms to understand model behavior in scenarios without true labels. We calculate Normalized Mutual Information (NMI), Adjusted Rand Index (ARI), and Silhouette Coefficient (SC) metrics to select the best clustering models. Our evaluation finds that spectral clustering outperforms all other benchmarking clustering models in terms of NMI and ARI metrics while demonstrating comparability to the optimal SC achieved by k-means. The implemented code is available at <span>GitHub</span><svg><path></path></svg>.</p></div>","PeriodicalId":73222,"journal":{"name":"Healthcare analytics (New York, N.Y.)","volume":"5 ","pages":"Article 100293"},"PeriodicalIF":0.0,"publicationDate":"2024-01-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2772442523001600/pdfft?md5=e0fd6cd67baa47c33181f21a1d4a70e4&pid=1-s2.0-S2772442523001600-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139434016","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2023-12-30DOI: 10.1016/j.health.2023.100297
Mohammad Mihrab Chowdhury , Ragib Shahariar Ayon , Md Sakhawat Hossain
Diabetes is a prevalent chronic condition that poses significant challenges to early diagnosis and identifying at-risk individuals. Machine learning plays a crucial role in diabetes detection by leveraging its ability to process large volumes of data and identify complex patterns. However, imbalanced data, where the number of diabetic cases is substantially smaller than non-diabetic cases, complicates the identification of individuals with diabetes using machine learning algorithms. This study focuses on predicting whether a person is at risk of diabetes, considering the individual’s health and socio-economic conditions while mitigating the challenges posed by imbalanced data. We employ several data augmentation techniques, such as oversampling (Synthetic Minority Over Sampling for Nominal Data, i.e.SMOTE-N), undersampling (Edited Nearest Neighbor, i.e. ENN), and hybrid sampling techniques (SMOTE-Tomek and SMOTE-ENN) on training data before applying machine learning algorithms to minimize the impact of imbalanced data. Our study sheds light on the significance of carefully utilizing data augmentation techniques without any data leakage to enhance the effectiveness of machine learning algorithms. Moreover, it offers a complete machine learning structure for healthcare practitioners, from data obtaining to machine learning prediction, enabling them to make informed decisions.
{"title":"An investigation of machine learning algorithms and data augmentation techniques for diabetes diagnosis using class imbalanced BRFSS dataset","authors":"Mohammad Mihrab Chowdhury , Ragib Shahariar Ayon , Md Sakhawat Hossain","doi":"10.1016/j.health.2023.100297","DOIUrl":"https://doi.org/10.1016/j.health.2023.100297","url":null,"abstract":"<div><p>Diabetes is a prevalent chronic condition that poses significant challenges to early diagnosis and identifying at-risk individuals. Machine learning plays a crucial role in diabetes detection by leveraging its ability to process large volumes of data and identify complex patterns. However, imbalanced data, where the number of diabetic cases is substantially smaller than non-diabetic cases, complicates the identification of individuals with diabetes using machine learning algorithms. This study focuses on predicting whether a person is at risk of diabetes, considering the individual’s health and socio-economic conditions while mitigating the challenges posed by imbalanced data. We employ several data augmentation techniques, such as oversampling (Synthetic Minority Over Sampling for Nominal Data, i.e.SMOTE-N), undersampling (Edited Nearest Neighbor, i.e. ENN), and hybrid sampling techniques (SMOTE-Tomek and SMOTE-ENN) on training data before applying machine learning algorithms to minimize the impact of imbalanced data. Our study sheds light on the significance of carefully utilizing data augmentation techniques without any data leakage to enhance the effectiveness of machine learning algorithms. Moreover, it offers a complete machine learning structure for healthcare practitioners, from data obtaining to machine learning prediction, enabling them to make informed decisions.</p></div>","PeriodicalId":73222,"journal":{"name":"Healthcare analytics (New York, N.Y.)","volume":"5 ","pages":"Article 100297"},"PeriodicalIF":0.0,"publicationDate":"2023-12-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2772442523001648/pdfft?md5=cbb15d1b9b72127ef6f0b213ad40bae0&pid=1-s2.0-S2772442523001648-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139108378","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}