首页 > 最新文献

Intelligent medicine最新文献

英文 中文
Impact of data balancing a multiclass dataset before the creation of association rules to study bacterial vaginosis 在创建研究细菌性阴道病的关联规则之前,数据平衡多类数据集的影响
IF 4.4 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2024-08-01 DOI: 10.1016/j.imed.2023.02.001

Background

Bacterial vaginosis is a polymicrobial syndrome in which the homeostasis exerted by the Latobacillus species that protect the vaginal mucosa has been lost. This study explored the data balancing process with the intention of improving the quality of association rules. The article aimed to balance the unbalanced multiclass dataset to improve association rule creation.

Methods

A dataset with 201 observations and 58 variables was analyzed. A preconstructed dataset was used. The authors collected the data between August 2016 and October 2018 in Tabasco, Mexico. The study population comprised sexually active women ages 18 to 50 who underwent gynecological inspection at the infectious and metabolic diseases research laboratory at the Universidad Juarez Autonoma de Tabasco. To determine the best k-value, the random-forest algorithm was used and the balancing was performed with the synthetic minority over-sampling technique (SMOTE), random over-sampling examples (ROSE), and adaptive syntetic sampling approach for imbalanced learning (ADASYN) algorithms. The Apriori algorithm created the rules and to select rules with statistical significance, the is.redundant(), is.significant(), and is.maximal() functions and quality metric Fisher’s exact tes were used. The biological validation was carried out by the expert (bacteriologist).

Results

The ADASYN algorithm at K=9 the out of the bag (OOB) error was zero, this was the best K-values. In the balancing process the ADASYN algorithm show best the performance. From the dataset balanced with ADASYN, the apriori algorithm created the association rules and the selection with the quality metric Fisher’s exact test, and the biological validation reported 13 rules. Gram - bacteria Atopobium vaginae, Gardnerella vaginalis, Megasphaera filotipo 1, Mycoplasma hominis and Ureaplasma parvum were detected by the apriori algorithm from the balanced dataset.

Conclusion

Balancing may improve the creation of association rules to efficiently model the bacteria that cause bacterial vaginosis.

背景细菌性阴道病是一种多微生物综合征,其中保护阴道粘膜的拉托杆菌失去了平衡。本研究探讨了数据平衡过程,旨在提高关联规则的质量。文章旨在平衡不平衡的多类数据集,以改进关联规则的创建。方法分析了一个包含 201 个观测值和 58 个变量的数据集。使用了预先构建的数据集。作者于 2016 年 8 月至 2018 年 10 月期间在墨西哥塔巴斯科收集了数据。研究人群包括在塔巴斯科华雷斯自治大学(Universidad Juarez Autonoma de Tabasco)传染病和代谢病研究实验室接受妇科检查的 18 至 50 岁的性活跃女性。为确定最佳 k 值,使用了随机森林算法,并通过合成少数过度采样技术(SMOTE)、随机过度采样示例(ROSE)和不平衡学习自适应合成采样方法(ADASYN)算法进行了平衡。Apriori 算法创建规则,并使用 is.redundant()、is.significant() 和 is.maximal() 函数和质量指标 Fisher's exact tes 来选择具有统计意义的规则。结果ADASYN 算法在 K=9 时的出包(OOB)误差为零,这是最佳的 K 值。在平衡过程中,ADASYN 算法表现最佳。从使用 ADASYN 算法平衡的数据集中,apriori 算法创建了关联规则,并通过质量指标费雪精确检验进行了选择,生物验证报告了 13 条规则。通过平衡数据集,apriori 算法检测出了革兰氏细菌 Atopobium vaginae、阴道加德纳菌 Gardnerella vaginalis、Megasphaera filotipo 1、人型支原体 Mycoplasma hominis 和副脲原体 Ureaplasma parvum。
{"title":"Impact of data balancing a multiclass dataset before the creation of association rules to study bacterial vaginosis","authors":"","doi":"10.1016/j.imed.2023.02.001","DOIUrl":"10.1016/j.imed.2023.02.001","url":null,"abstract":"<div><h3>Background</h3><p>Bacterial vaginosis is a polymicrobial syndrome in which the homeostasis exerted by the <em>Latobacillus</em> species that protect the vaginal mucosa has been lost. This study explored the data balancing process with the intention of improving the quality of association rules. The article aimed to balance the unbalanced multiclass dataset to improve association rule creation.</p></div><div><h3>Methods</h3><p>A dataset with 201 observations and 58 variables was analyzed. A preconstructed dataset was used. The authors collected the data between August 2016 and October 2018 in Tabasco, Mexico. The study population comprised sexually active women ages 18 to 50 who underwent gynecological inspection at the infectious and metabolic diseases research laboratory at the Universidad Juarez Autonoma de Tabasco. To determine the best <span><math><mi>k</mi></math></span>-value, the random-forest algorithm was used and the balancing was performed with the synthetic minority over-sampling technique (SMOTE), random over-sampling examples (ROSE), and adaptive syntetic sampling approach for imbalanced learning (ADASYN) algorithms. The Apriori algorithm created the rules and to select rules with statistical significance, the <em>is.redundant(), is.significant()</em>, and <em>is.maximal()</em> functions and quality metric Fisher’s exact tes were used. The biological validation was carried out by the expert (bacteriologist).</p></div><div><h3>Results</h3><p>The ADASYN algorithm at <span><math><mrow><mi>K</mi><mo>=</mo><mn>9</mn></mrow></math></span> the out of the bag (OOB) error was zero, this was the best <span><math><mi>K</mi></math></span>-values. In the balancing process the ADASYN algorithm show best the performance. From the dataset balanced with ADASYN, the apriori algorithm created the association rules and the selection with the quality metric Fisher’s exact test, and the biological validation reported 13 rules. Gram - bacteria <em>Atopobium vaginae, Gardnerella vaginalis, Megasphaera filotipo 1<strong>,</strong> Mycoplasma hominis</em> and <em>Ureaplasma parvum</em> were detected by the apriori algorithm from the balanced dataset.</p></div><div><h3>Conclusion</h3><p>Balancing may improve the creation of association rules to efficiently model the bacteria that cause bacterial vaginosis.</p></div>","PeriodicalId":73400,"journal":{"name":"Intelligent medicine","volume":"4 3","pages":"Pages 188-199"},"PeriodicalIF":4.4,"publicationDate":"2024-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2667102623000190/pdfft?md5=8cf3d2c99555a9de09737d0e3a9fc329&pid=1-s2.0-S2667102623000190-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"48136958","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Machine learning predicts long-term mortality after acute myocardial infarction using systolic time intervals and routinely collected clinical data 机器学习利用收缩压时间间隔和常规收集的临床数据预测急性心肌梗死后的长期死亡率
IF 4.4 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2024-08-01 DOI: 10.1016/j.imed.2024.01.001
Bijan Roudini , Boshra Khajehpiri , Hamid Abrishami Moghaddam , Mohamad Forouzanfar

Background

Precise estimation of current and future comorbidities of patients with cardiovascular disease is an important factor in prioritizing continuous physiological monitoring and new therapies. Machine learning (ML) models have shown satisfactory performance in short-term mortality prediction in patients with heart disease, whereas their utility in long-term predictions is limited. This study aimed to investigate the performance of tree-based ML models on long-term mortality prediction and effect of two recently introduced biomarkers on long-term mortality.

Methods

This study used publicly available data from the Collaboration Center of Health Information Application at the Ministry of Health and Welfare, Taiwan, China. The collected data were from patients admitted to the cardiac care unit for acute myocardial infarction (AMI) between November 2003 and September 2004. We collected and analyzed mortality data up to December 2018. Medical records were used to gather demographic and clinical data, including age, gender, body mass index, percutaneous coronary intervention status, and comorbidities such as hypertension, dyslipidemia, ST-segment elevation myocardial infarction, and non-ST-segment elevation myocardial infarction. Using the data, collected from 139 patients with AMI, from medical and demographic records as well as two recently introduced biomarkers, brachial pre-ejection period (bPEP) and brachial ejection time (bET), we investigated the performance of advanced ensemble tree-based ML algorithms (random forest, AdaBoost, and XGBoost) to predict all-cause mortality within 14 years. A nested cross-validation was performed to evaluate and compare the performance of our developed models precisely with that of the conventional logistic regression (LR) as the baseline method.

Results

The developed ML models achieved significantly better performance compared to the baseline LR (C-Statistic, 0.80 for random forest, 0.79 for AdaBoost, and 0.78 for XGBoost, vs. 0.77 for LR) (PRF < 0.001, PAdaBoost < 0.001, and PXGBoost < 0.05). Adding bPEP and bET to our feature set significantly improved the performance of the algorithm, leading to an absolute increase in C-statistic of up to 0.03 (C-statistic, 0.83 for random forest, 0.82 for AdaBoost, and 0.80 for XGBoost, vs. 0.74 for LR) (PRF < 0.001, PAdaBoost < 0.001, PXGBoost < 0.05).

Conclusion

The study indicates that incorporating new biomarkers into advanced ML models may significantly improve long-term mortality prediction in patients with cardiovascular diseases. This advancement may enable better treatment prioritization for high-risk individuals.

背景精确估计心血管疾病患者当前和未来的合并症是优先考虑持续生理监测和新疗法的一个重要因素。机器学习(ML)模型在心脏病患者的短期死亡率预测中表现令人满意,但在长期预测中的作用有限。本研究旨在调查基于树的 ML 模型在长期死亡率预测中的表现,以及最近引入的两种生物标志物对长期死亡率的影响。所收集的数据来自 2003 年 11 月至 2004 年 9 月期间因急性心肌梗死(AMI)入住心脏监护室的患者。我们收集并分析了截至 2018 年 12 月的死亡率数据。病历用于收集人口统计学和临床数据,包括年龄、性别、体重指数、经皮冠状动脉介入治疗情况以及高血压、血脂异常、ST段抬高型心肌梗死和非ST段抬高型心肌梗死等合并症。我们利用从 139 名急性心肌梗死患者的医疗和人口学记录中收集的数据,以及最近推出的两个生物标志物--肱骨射血前时间(bPEP)和肱骨射血时间(bET),研究了基于高级集合树的 ML 算法(随机森林、AdaBoost 和 XGBoost)预测 14 年内全因死亡率的性能。通过嵌套交叉验证来评估和比较我们开发的模型与作为基线方法的传统逻辑回归(LR)的性能。结果与基线逻辑回归相比,所开发的 ML 模型取得了明显更好的性能(C-统计量,随机森林为 0.80,AdaBoost 为 0.79,XGBoost 为 0.78,LR 为 0.77)(PRF < 0.001,PAdaBoost < 0.001,PXGBoost < 0.05)。在特征集中添加 bPEP 和 bET 能显著提高算法的性能,使 C 统计量的绝对值提高了 0.03(随机森林的 C 统计量为 0.83,AdaBoost 为 0.82,XGBoost 为 0.80,而 LR 为 0.74)。74 for LR)(PRF <0.001,PAdaBoost <0.001,PXGBoost <0.05)。结论该研究表明,将新的生物标记物纳入高级 ML 模型可显著改善心血管疾病患者的长期死亡率预测。这种进步可以更好地确定高危人群的治疗优先次序。
{"title":"Machine learning predicts long-term mortality after acute myocardial infarction using systolic time intervals and routinely collected clinical data","authors":"Bijan Roudini ,&nbsp;Boshra Khajehpiri ,&nbsp;Hamid Abrishami Moghaddam ,&nbsp;Mohamad Forouzanfar","doi":"10.1016/j.imed.2024.01.001","DOIUrl":"10.1016/j.imed.2024.01.001","url":null,"abstract":"<div><h3>Background</h3><p>Precise estimation of current and future comorbidities of patients with cardiovascular disease is an important factor in prioritizing continuous physiological monitoring and new therapies. Machine learning (ML) models have shown satisfactory performance in short-term mortality prediction in patients with heart disease, whereas their utility in long-term predictions is limited. This study aimed to investigate the performance of tree-based ML models on long-term mortality prediction and effect of two recently introduced biomarkers on long-term mortality.</p></div><div><h3>Methods</h3><p>This study used publicly available data from the Collaboration Center of Health Information Application at the Ministry of Health and Welfare, Taiwan, China. The collected data were from patients admitted to the cardiac care unit for acute myocardial infarction (AMI) between November 2003 and September 2004. We collected and analyzed mortality data up to December 2018. Medical records were used to gather demographic and clinical data, including age, gender, body mass index, percutaneous coronary intervention status, and comorbidities such as hypertension, dyslipidemia, ST-segment elevation myocardial infarction, and non-ST-segment elevation myocardial infarction. Using the data, collected from 139 patients with AMI, from medical and demographic records as well as two recently introduced biomarkers, brachial pre-ejection period (bPEP) and brachial ejection time (bET), we investigated the performance of advanced ensemble tree-based ML algorithms (random forest, AdaBoost, and XGBoost) to predict all-cause mortality within 14 years. A nested cross-validation was performed to evaluate and compare the performance of our developed models precisely with that of the conventional logistic regression (LR) as the baseline method.</p></div><div><h3>Results</h3><p>The developed ML models achieved significantly better performance compared to the baseline LR (C-Statistic, 0.80 for random forest, 0.79 for AdaBoost, and 0.78 for XGBoost, <em>vs</em>. 0.77 for LR) (<em>P</em><sub>RF</sub> &lt; 0.001, <em>P</em><sub>AdaBoost</sub> &lt; 0.001, and <em>P</em><sub>XGBoost</sub> &lt; 0.05). Adding bPEP and bET to our feature set significantly improved the performance of the algorithm, leading to an absolute increase in C-statistic of up to 0.03 (C-statistic, 0.83 for random forest, 0.82 for AdaBoost, and 0.80 for XGBoost, <em>vs</em>. 0.74 for LR) (<em>P</em><sub>RF</sub> &lt; 0.001, <em>P</em><sub>AdaBoost</sub> &lt; 0.001, <em>P</em><sub>XGBoost</sub> &lt; 0.05).</p></div><div><h3>Conclusion</h3><p>The study indicates that incorporating new biomarkers into advanced ML models may significantly improve long-term mortality prediction in patients with cardiovascular diseases. This advancement may enable better treatment prioritization for high-risk individuals.</p></div>","PeriodicalId":73400,"journal":{"name":"Intelligent medicine","volume":"4 3","pages":"Pages 170-176"},"PeriodicalIF":4.4,"publicationDate":"2024-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2667102624000329/pdfft?md5=039b96bf56f33e4f8342d2c062d97570&pid=1-s2.0-S2667102624000329-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142271265","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Neuropsychological detection and prediction using machine learning algorithms: a comprehensive review 使用机器学习算法进行神经心理学检测和预测:综合评述
IF 4.4 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2024-08-01 DOI: 10.1016/j.imed.2023.04.003

Neuropsychological disorders (e.g., dementia, epilepsy, brain cancer, autism, stroke, and multiple sclerosis) adversely affect the quality of life of patients and their families; moreover, in some instances, they may lead to loss of life. The primary aim was to evaluate and compare the use of machine learning in neuropsychological research in contrast to traditional approaches such as through case studies. This was achieved by referring to earlier studies on this subject. This article presented the use of support vector machines (SVMs) and convolutional neural networks (CNN) for detecting and predicting neuropsychological diseases, such as dementia and Alzheimer's disease. Challenges in using these models include data availability, quality, variability, model interpretability, and validation. Experimental findings have demonstrated the potential of these models in this field. It has been shown that SVM models are robust and efficient in processing and classifying data, particularly in neuroimaging applications, such as magnetic resonance imaging (MRI). CNNs have excelled in handling visual input; thus, they have been used in neuroimaging segregation, recognition, and classification, with applications in brain tumor segmentation, radiation therapy, robotic neurosurgery, and disease prediction. Future research will explore asymmetric differences among left- and right-handed patients, incorporate longitudinal studies, and utilize larger sample sizes. The use of machine learning models has the potential to revolutionize the diagnosis and treatment of neuropsychological diseases, allowing for early detection and intervention. This approach could offer significant advantages to healthcare, such as cost-effective diagnosis and treatment, to help save lives and preserve the quality of life of patients.

神经心理学疾病(如痴呆症、癫痫、脑癌、自闭症、中风和多发性硬化症)对患者及其家人的生活质量造成了不利影响;此外,在某些情况下,这些疾病还可能导致生命丧失。研究的主要目的是评估和比较机器学习在神经心理学研究中的应用,并与传统方法(如案例研究)进行对比。为了实现这一目标,我们参考了此前有关这一主题的研究。这篇文章介绍了支持向量机(SVM)和卷积神经网络(CNN)在检测和预测痴呆症和阿尔茨海默病等神经心理疾病中的应用。使用这些模型所面临的挑战包括数据的可用性、质量、可变性、模型的可解释性和验证。实验结果证明了这些模型在这一领域的潜力。研究表明,SVM 模型在处理和分类数据方面既稳健又高效,特别是在神经成像应用中,如磁共振成像(MRI)。CNN 在处理视觉输入方面表现出色;因此,它们已被用于神经影像的分割、识别和分类,并在脑肿瘤分割、放射治疗、机器人神经外科和疾病预测方面得到了应用。未来的研究将探索左撇子和右撇子患者之间的不对称差异,纳入纵向研究,并利用更大的样本量。机器学习模型的使用有可能彻底改变神经心理疾病的诊断和治疗,实现早期检测和干预。这种方法可以为医疗保健带来巨大优势,例如具有成本效益的诊断和治疗,从而帮助挽救生命并保持患者的生活质量。
{"title":"Neuropsychological detection and prediction using machine learning algorithms: a comprehensive review","authors":"","doi":"10.1016/j.imed.2023.04.003","DOIUrl":"10.1016/j.imed.2023.04.003","url":null,"abstract":"<div><p>Neuropsychological disorders (e.g., dementia, epilepsy, brain cancer, autism, stroke, and multiple sclerosis) adversely affect the quality of life of patients and their families; moreover, in some instances, they may lead to loss of life. The primary aim was to evaluate and compare the use of machine learning in neuropsychological research in contrast to traditional approaches such as through case studies. This was achieved by referring to earlier studies on this subject. This article presented the use of support vector machines (SVMs) and convolutional neural networks (CNN) for detecting and predicting neuropsychological diseases, such as dementia and Alzheimer's disease. Challenges in using these models include data availability, quality, variability, model interpretability, and validation. Experimental findings have demonstrated the potential of these models in this field. It has been shown that SVM models are robust and efficient in processing and classifying data, particularly in neuroimaging applications, such as magnetic resonance imaging (MRI). CNNs have excelled in handling visual input; thus, they have been used in neuroimaging segregation, recognition, and classification, with applications in brain tumor segmentation, radiation therapy, robotic neurosurgery, and disease prediction. Future research will explore asymmetric differences among left- and right-handed patients, incorporate longitudinal studies, and utilize larger sample sizes. The use of machine learning models has the potential to revolutionize the diagnosis and treatment of neuropsychological diseases, allowing for early detection and intervention. This approach could offer significant advantages to healthcare, such as cost-effective diagnosis and treatment, to help save lives and preserve the quality of life of patients.</p></div>","PeriodicalId":73400,"journal":{"name":"Intelligent medicine","volume":"4 3","pages":"Pages 177-187"},"PeriodicalIF":4.4,"publicationDate":"2024-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S266710262300061X/pdfft?md5=d2a2cea8fe75ea1be6a8c4def2946bc6&pid=1-s2.0-S266710262300061X-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135762813","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A clinical decision support system using rough set theory and machine learning for disease prediction 利用粗糙集理论和机器学习进行疾病预测的临床决策支持系统
IF 4.4 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2024-08-01 DOI: 10.1016/j.imed.2023.08.002
Kamakhya Narain Singh, Jibendu Kumar Mantri
<div><h3>Objective</h3><p>Technological advances have led to drastic changes in daily life, and particularly healthcare, while traditional diagnosis methods are being replaced by technology-oriented models and paper-based patient healthcare records with digital files. Using the latest technology and data mining techniques, we aimed to develop an automated clinical decision support system (CDSS), to improve patient prognoses and healthcare delivery. Our proposed approach placed a strong emphasis on improvements that meet patient, parent, and physician expectations. We developed a flexible framework to identify hepatitis, dermatological conditions, hepatic disease, and autism in adults and provide results to patients as recommendations. The novelty of this CDSS lies in its integration of rough set theory (RST) and machine learning (ML) techniques to improve clinical decision-making accuracy and effectiveness.</p></div><div><h3>Methods</h3><p>Data were collected through various web-based resources. Standard preprocessing techniques were applied to encode categorical features, conduct min-max scaling, and remove null and duplicate entries. The most prevalent feature in the class and standard deviation were used to fill missing categorical and continuous feature values, respectively. A rough set approach was applied as feature selection, to remove highly redundant and irrelevant elements. Then, various ML techniques, including K nearest neighbors (KNN), linear support vector machine (LSVM), radial basis function support vector machine (RBF SVM), decision tree (DT), random forest (RF), and Naive Bayes (NB), were employed to analyze four publicly available benchmark medical datasets of different types from the UCI repository and Kaggle. The model was implemented in Python, and various validity metrics, including precision, recall, F1-score, and root mean square error (RMSE), applied to measure its performance.</p></div><div><h3>Results</h3><p>Features were selected using an RST approach and examined by RF analysis and important features of hepatitis, dermatology conditions, hepatic disease, and autism determined by RST and RF exhibited 92.85 %, 90.90 %, 100 %, and 80 % similarity, respectively. Selected features were stored as electronic health records and various ML classifiers, such as KNN, LSVM, RBF SVM, DT, RF, and NB, applied to classify patients with hepatitis, dermatology conditions, hepatic disease, and autism. In the last phase, the performance of proposed classifiers was compared with that of existing state-of-the-art methods, using various validity measures. RF was found to be the best approach for adult screening of: hepatitis with accuracy 88.66 %, precision 74.46 %, recall 75.17 %, F1-score 74.81 %, and RMSE value 0.244; dermatology conditions with accuracy 97.29 %, precision 96.96 %, recall 96.96 %, F1-score 96.96 %, and RMSE value, 0.173; hepatic disease, with accuracy 91.58 %, precision 81.76 %, recall 81.82 %, F1-Score 81.79 %, and RMS
目标技术进步导致日常生活,尤其是医疗保健发生了翻天覆地的变化,传统的诊断方法正在被以技术为导向的模型和以数字文件为基础的纸质病人医疗记录所取代。我们利用最新技术和数据挖掘技术,旨在开发一种自动临床决策支持系统(CDSS),以改善病人预后和医疗服务。我们提出的方法着重强调满足患者、家长和医生期望的改进。我们开发了一个灵活的框架,用于识别成人肝炎、皮肤病、肝病和自闭症,并将结果作为建议提供给患者。该 CDSS 的新颖之处在于整合了粗糙集理论(RST)和机器学习(ML)技术,以提高临床决策的准确性和有效性。数据通过各种网络资源收集,采用标准预处理技术对分类特征进行编码,进行最小-最大缩放,并删除空条目和重复条目。类中最普遍的特征和标准偏差分别用于填补缺失的分类和连续特征值。特征选择采用粗糙集方法,以去除高度冗余和不相关的元素。然后,采用了多种 ML 技术,包括 K 最近邻(KNN)、线性支持向量机(LSVM)、径向基函数支持向量机(RBF SVM)、决策树(DT)、随机森林(RF)和奈夫贝叶斯(NB),对来自 UCI 存储库和 Kaggle 的四个不同类型的公开基准医疗数据集进行分析。该模型用 Python 实现,并采用了包括精确度、召回率、F1-分数和均方根误差 (RMSE) 在内的各种有效性指标来衡量其性能。结果使用 RST 方法选择特征,并通过 RF 分析进行检查,RST 和 RF 确定的肝炎、皮肤病、肝病和自闭症的重要特征分别表现出 92.85 %、90.90 %、100 % 和 80 % 的相似性。将选定的特征存储为电子健康记录,并应用 KNN、LSVM、RBF SVM、DT、RF 和 NB 等多种 ML 分类器对肝炎、皮肤病、肝病和自闭症患者进行分类。在最后阶段,利用各种有效性测量方法,将所提出的分类器的性能与现有的最先进方法进行了比较。结果发现 RF 是成人筛查以下疾病的最佳方法:肝炎,准确率 88.66 %,精确率 74.46 %,召回率 75.17 %,F1-分数 74.81 %,RMSE 值 0.244;皮肤病,准确率 97.29 %,精确率 96.96 %,召回率 96.96 %,F1-分数 96.96 %,RMSE 值 0.173;肝病,准确率 91.58 %,精确率 81.76 %,召回率 81.结论我们提出的框架的总体性能表明,它可以帮助医学专家更准确地识别和诊断肝炎、皮肤病、肝病和自闭症患者。
{"title":"A clinical decision support system using rough set theory and machine learning for disease prediction","authors":"Kamakhya Narain Singh,&nbsp;Jibendu Kumar Mantri","doi":"10.1016/j.imed.2023.08.002","DOIUrl":"10.1016/j.imed.2023.08.002","url":null,"abstract":"&lt;div&gt;&lt;h3&gt;Objective&lt;/h3&gt;&lt;p&gt;Technological advances have led to drastic changes in daily life, and particularly healthcare, while traditional diagnosis methods are being replaced by technology-oriented models and paper-based patient healthcare records with digital files. Using the latest technology and data mining techniques, we aimed to develop an automated clinical decision support system (CDSS), to improve patient prognoses and healthcare delivery. Our proposed approach placed a strong emphasis on improvements that meet patient, parent, and physician expectations. We developed a flexible framework to identify hepatitis, dermatological conditions, hepatic disease, and autism in adults and provide results to patients as recommendations. The novelty of this CDSS lies in its integration of rough set theory (RST) and machine learning (ML) techniques to improve clinical decision-making accuracy and effectiveness.&lt;/p&gt;&lt;/div&gt;&lt;div&gt;&lt;h3&gt;Methods&lt;/h3&gt;&lt;p&gt;Data were collected through various web-based resources. Standard preprocessing techniques were applied to encode categorical features, conduct min-max scaling, and remove null and duplicate entries. The most prevalent feature in the class and standard deviation were used to fill missing categorical and continuous feature values, respectively. A rough set approach was applied as feature selection, to remove highly redundant and irrelevant elements. Then, various ML techniques, including K nearest neighbors (KNN), linear support vector machine (LSVM), radial basis function support vector machine (RBF SVM), decision tree (DT), random forest (RF), and Naive Bayes (NB), were employed to analyze four publicly available benchmark medical datasets of different types from the UCI repository and Kaggle. The model was implemented in Python, and various validity metrics, including precision, recall, F1-score, and root mean square error (RMSE), applied to measure its performance.&lt;/p&gt;&lt;/div&gt;&lt;div&gt;&lt;h3&gt;Results&lt;/h3&gt;&lt;p&gt;Features were selected using an RST approach and examined by RF analysis and important features of hepatitis, dermatology conditions, hepatic disease, and autism determined by RST and RF exhibited 92.85 %, 90.90 %, 100 %, and 80 % similarity, respectively. Selected features were stored as electronic health records and various ML classifiers, such as KNN, LSVM, RBF SVM, DT, RF, and NB, applied to classify patients with hepatitis, dermatology conditions, hepatic disease, and autism. In the last phase, the performance of proposed classifiers was compared with that of existing state-of-the-art methods, using various validity measures. RF was found to be the best approach for adult screening of: hepatitis with accuracy 88.66 %, precision 74.46 %, recall 75.17 %, F1-score 74.81 %, and RMSE value 0.244; dermatology conditions with accuracy 97.29 %, precision 96.96 %, recall 96.96 %, F1-score 96.96 %, and RMSE value, 0.173; hepatic disease, with accuracy 91.58 %, precision 81.76 %, recall 81.82 %, F1-Score 81.79 %, and RMS","PeriodicalId":73400,"journal":{"name":"Intelligent medicine","volume":"4 3","pages":"Pages 200-208"},"PeriodicalIF":4.4,"publicationDate":"2024-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2667102624000196/pdfft?md5=d65ef7a4c0f4fb5b3f70cdc367b1f5ae&pid=1-s2.0-S2667102624000196-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142271195","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Application of graph-curvature features in computer-aided diagnosis for histopathological image identification of gastric cancer 图曲率特征在胃癌组织病理图像识别计算机辅助诊断中的应用
IF 4.4 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2024-08-01 DOI: 10.1016/j.imed.2024.02.001
Ruilin He , Chen Li , Xinyi Yang , Jinzhu Yang , Tao Jiang , Marcin Grzegorzek , Hongzan Sun

Background

Histopathology diagnosis is often regarded as the final diagnostic method for malignant tumors; however, it has some drawbacks. This study explored a computer-aided diagnostic method that can be used to identify benign and malignant gastric cancer using histopathological images.

Methods

The most suitable process was selected through multiple experiments by comparing multiple methods and features for classification. First, the U-net was applied to segment the image. Next, the nucleus was extracted from the segmented image, and the minimum spanning tree (MST) diagram structure that can capture the topological information was drawn. The third step was to extract the graph-curvature features of the histopathological image according to the MST image. Finally, by inputting the graph-curvature features into the classifier, the recognition results for benign or malignant cancer can be obtained.

Results

During the experiment, we used various methods for comparison. In the image segmentation stage, U-net, watershed algorithm, and Otsu threshold segmentation methods were used. We found that the U-net method, combined with multiple indicators, was the most suitable for segmentation of histopathological images. In the feature extraction stage, in addition to extracting graph-edge and graph-curvature features, several basic image features were extracted, including the red, green and blue feature, gray-level co-occurrence matrix feature, histogram of oriented gradient feature, and local binary pattern feature. In the classifier design stage, we experimented with various methods, such as support vector machine (SVM), random forest, artificial neural network, K nearest neighbors, VGG-16, and inception-V3. Through comparison and analysis, it was found that classification results with an accuracy of 98.57% can be obtained by inputting the graph-curvature feature into the SVM classifier.

Conclusion

This study created a unique feature, the graph-curvature feature, based on the MST to represent and analyze histopathological images. This graph-based feature could be used to identify benign and malignant cells in histopathological images and assist pathologists in diagnosis.

背景组织病理学诊断通常被认为是恶性肿瘤的最终诊断方法,但它也有一些缺点。本研究探讨了一种计算机辅助诊断方法,该方法可用于利用组织病理学图像识别良性和恶性胃癌。首先,应用 U-net 对图像进行分割。其次,从分割后的图像中提取细胞核,并绘制能捕捉拓扑信息的最小生成树(MST)图结构。第三步是根据 MST 图像提取组织病理学图像的图曲率特征。最后,将图曲率特征输入分类器,即可得到良性或恶性癌症的识别结果。在图像分割阶段,我们使用了 U-net、分水岭算法和大津阈值分割法。我们发现,结合多种指标的 U-net 方法最适合组织病理学图像的分割。在特征提取阶段,除了提取图边和图曲率特征外,还提取了几个基本的图像特征,包括红绿蓝特征、灰度级共现矩阵特征、定向梯度直方图特征和局部二进制模式特征。在分类器设计阶段,我们尝试了多种方法,如支持向量机(SVM)、随机森林、人工神经网络、K 近邻、VGG-16 和 inception-V3。通过比较和分析,我们发现将图曲率特征输入 SVM 分类器可获得准确率高达 98.57% 的分类结果。 结论 本研究基于 MST 创建了一种独特的特征--图曲率特征,用于表示和分析组织病理学图像。这种基于图的特征可用于识别组织病理学图像中的良性和恶性细胞,帮助病理学家进行诊断。
{"title":"Application of graph-curvature features in computer-aided diagnosis for histopathological image identification of gastric cancer","authors":"Ruilin He ,&nbsp;Chen Li ,&nbsp;Xinyi Yang ,&nbsp;Jinzhu Yang ,&nbsp;Tao Jiang ,&nbsp;Marcin Grzegorzek ,&nbsp;Hongzan Sun","doi":"10.1016/j.imed.2024.02.001","DOIUrl":"10.1016/j.imed.2024.02.001","url":null,"abstract":"<div><h3>Background</h3><p>Histopathology diagnosis is often regarded as the final diagnostic method for malignant tumors; however, it has some drawbacks. This study explored a computer-aided diagnostic method that can be used to identify benign and malignant gastric cancer using histopathological images.</p></div><div><h3>Methods</h3><p>The most suitable process was selected through multiple experiments by comparing multiple methods and features for classification. First, the U-net was applied to segment the image. Next, the nucleus was extracted from the segmented image, and the minimum spanning tree (MST) diagram structure that can capture the topological information was drawn. The third step was to extract the graph-curvature features of the histopathological image according to the MST image. Finally, by inputting the graph-curvature features into the classifier, the recognition results for benign or malignant cancer can be obtained.</p></div><div><h3>Results</h3><p>During the experiment, we used various methods for comparison. In the image segmentation stage, U-net, watershed algorithm, and Otsu threshold segmentation methods were used. We found that the U-net method, combined with multiple indicators, was the most suitable for segmentation of histopathological images. In the feature extraction stage, in addition to extracting graph-edge and graph-curvature features, several basic image features were extracted, including the red, green and blue feature, gray-level co-occurrence matrix feature, histogram of oriented gradient feature, and local binary pattern feature. In the classifier design stage, we experimented with various methods, such as support vector machine (SVM), random forest, artificial neural network, K nearest neighbors, VGG-16, and inception-V3. Through comparison and analysis, it was found that classification results with an accuracy of 98.57% can be obtained by inputting the graph-curvature feature into the SVM classifier.</p></div><div><h3>Conclusion</h3><p>This study created a unique feature, the graph-curvature feature, based on the MST to represent and analyze histopathological images. This graph-based feature could be used to identify benign and malignant cells in histopathological images and assist pathologists in diagnosis.</p></div>","PeriodicalId":73400,"journal":{"name":"Intelligent medicine","volume":"4 3","pages":"Pages 141-152"},"PeriodicalIF":4.4,"publicationDate":"2024-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2667102624000330/pdfft?md5=a0b55da17b64f6b28358122b207863f3&pid=1-s2.0-S2667102624000330-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142271196","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Guide for authors 作者指南
IF 4.4 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2024-08-01 DOI: 10.1016/S2667-1026(24)00049-4
{"title":"Guide for authors","authors":"","doi":"10.1016/S2667-1026(24)00049-4","DOIUrl":"10.1016/S2667-1026(24)00049-4","url":null,"abstract":"","PeriodicalId":73400,"journal":{"name":"Intelligent medicine","volume":"4 3","pages":"Pages 209-214"},"PeriodicalIF":4.4,"publicationDate":"2024-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2667102624000494/pdfft?md5=b714353c6c82e1b1325bcf03469858dc&pid=1-s2.0-S2667102624000494-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142271266","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Development and validation of a nomogram prediction model for the risk of parastomal hernia 造口旁疝风险的nomogram预测模型的建立与验证
IF 4.4 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2024-05-01 DOI: 10.1016/j.imed.2023.05.004
Huasheng Liu , Weiqin Wang , Chen Qin , Hongxia Wang , Wei Qi , Yanhua Wei , Longbo Zheng , Jilin Hu

Background

Parastomal hernia is one of the potential complications after enterostomy. There is currently no early risk assessment tool for parastomal hernia.

Methods

The current investigation was conducted using retrospective studies. A total of 302 cases were used develop and internally to validate a nomogram prediction model, and 67 cases were used for external validation. Independent risk factors for parastomal hernia after permanent sigmoid colostomy were assessed via univariate analysis and binary logistic regression analysis. The nomogram prediction model was established based on independent risk factors.

Results

Body mass index, serum albumin, age, sex, and stoma diameter were independent risk factors for parastomal hernia. The areas under the receiver operating characteristic curves were 0.909 in the development group and 0.801 in the validation group. The Hosmer-Lemeshow test (P > 0.05) and calibration curves indicated good consistency between actual observations and predicted probabilities.

Conclusions

A nomogram prediction model was constructed and validated based on risk factors for parastomal hernia. The nomogram could be generalized to patients undergoing surgery for stoma by specialized surgeons to provide relevant references for stoma patients.

背景 胃旁疝是肠造口术后可能出现的并发症之一。目前还没有针对腹股沟旁疝的早期风险评估工具。共有 302 个病例用于开发和内部验证提名图预测模型,67 个病例用于外部验证。通过单变量分析和二元逻辑回归分析评估了永久性乙状结肠造口术后副乳疝的独立风险因素。结果 体重指数、血清白蛋白、年龄、性别和造口直径是造成造口旁疝的独立危险因素。开发组和验证组的接收器操作特征曲线下面积分别为 0.909 和 0.801。Hosmer-Lemeshow检验(P> 0.05)和校准曲线表明实际观察结果与预测概率之间具有良好的一致性。专业外科医生可将该预测模型推广至接受造口手术的患者,为造口患者提供相关参考。
{"title":"Development and validation of a nomogram prediction model for the risk of parastomal hernia","authors":"Huasheng Liu ,&nbsp;Weiqin Wang ,&nbsp;Chen Qin ,&nbsp;Hongxia Wang ,&nbsp;Wei Qi ,&nbsp;Yanhua Wei ,&nbsp;Longbo Zheng ,&nbsp;Jilin Hu","doi":"10.1016/j.imed.2023.05.004","DOIUrl":"10.1016/j.imed.2023.05.004","url":null,"abstract":"<div><h3>Background</h3><p>Parastomal hernia is one of the potential complications after enterostomy. There is currently no early risk assessment tool for parastomal hernia.</p></div><div><h3>Methods</h3><p>The current investigation was conducted using retrospective studies. A total of 302 cases were used develop and internally to validate a nomogram prediction model, and 67 cases were used for external validation. Independent risk factors for parastomal hernia after permanent sigmoid colostomy were assessed via univariate analysis and binary logistic regression analysis. The nomogram prediction model was established based on independent risk factors.</p></div><div><h3>Results</h3><p>Body mass index, serum albumin, age, sex, and stoma diameter were independent risk factors for parastomal hernia. The areas under the receiver operating characteristic curves were 0.909 in the development group and 0.801 in the validation group. The Hosmer-Lemeshow test (<em>P</em> &gt; 0.05) and calibration curves indicated good consistency between actual observations and predicted probabilities.</p></div><div><h3>Conclusions</h3><p>A nomogram prediction model was constructed and validated based on risk factors for parastomal hernia. The nomogram could be generalized to patients undergoing surgery for stoma by specialized surgeons to provide relevant references for stoma patients.</p></div>","PeriodicalId":73400,"journal":{"name":"Intelligent medicine","volume":"4 2","pages":"Pages 128-133"},"PeriodicalIF":4.4,"publicationDate":"2024-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2667102623000426/pdfft?md5=6ed7eadc71f7e66a0977a46e25561cb2&pid=1-s2.0-S2667102623000426-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"47230846","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Detecting somatisation disorder via speech: introducing the Shenzhen Somatisation Speech Corpus 通过语音检测躯体化障碍:介绍深圳躯体化语音语料库
IF 4.4 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2024-05-01 DOI: 10.1016/j.imed.2023.03.001
Kun Qian , Ruolan Huang , Zhihao Bao , Yang Tan , Zhonghao Zhao , Mengkai Sun , Bin Hu , Björn W. Schuller , Yoshiharu Yamamoto

Objective

Speech recognition technology is widely used as a mature technical approach in many fields. In the study of depression recognition, speech signals are commonly used due to their convenience and ease of acquisition. Though speech recognition is popular in the research field of depression recognition, it has been little studied in somatisation disorder recognition. The reason for this is the lack of a publicly accessible database of relevant speech and benchmark studies. To this end, we introduced our somatisation disorder speech database and gave benchmark results.

Methods

By collecting speech samples of somatisation disorder patients, in cooperation with the Shenzhen University General Hospital, we introduced our somatisation disorder speech database, the Shenzhen Somatisation Speech Corpus (SSSC). Moreover, a benchmark for SSSC using classic acoustic features and a machine learning model was proposed in our work.

Results

To obtain a more scientific benchmark, we compared and analysed the performance of different acoustic features, i. e., the full ComPare feature set, or only Mel frequency cepstral coefficients (MFCCs), fundamental frequency (F0), and frequency and bandwidth of the formants (F1-F3). By comparison, the best result of our benchmark was the 76.0% unweighted average recall achieved by a support vector machine with formants F1–F3.

Conclusion

The proposal of SSSC may bridge a research gap in somatisation disorder, providing researchers with a publicly accessible speech database. In addition, the results of the benchmark could show the scientific validity and feasibility of computer audition for speech recognition in somatization disorders.

目标语音识别技术作为一种成熟的技术方法,在许多领域得到广泛应用。在抑郁症识别研究中,语音信号因其方便易得而被广泛使用。虽然语音识别在抑郁症识别研究领域很受欢迎,但在躯体化障碍识别方面却鲜有研究。究其原因,是缺乏可公开访问的相关语音数据库和基准研究。通过与深圳大学总医院合作收集躯体化障碍患者的语音样本,我们建立了躯体化障碍语音数据库--深圳躯体化语音语料库(SSSC)。结果为了获得更科学的基准,我们比较和分析了不同声学特征的性能,即完整的 ComPare 特征集,或仅有梅尔频率倒频谱系数(MFCC)、基频(F0)和声母的频率和带宽(F1-F3)。相比之下,我们基准测试的最佳结果是支持向量机使用声调 F1-F3 所取得的 76.0% 的非加权平均召回率。此外,基准测试的结果还能证明计算机听力在躯体化障碍语音识别方面的科学性和可行性。
{"title":"Detecting somatisation disorder via speech: introducing the Shenzhen Somatisation Speech Corpus","authors":"Kun Qian ,&nbsp;Ruolan Huang ,&nbsp;Zhihao Bao ,&nbsp;Yang Tan ,&nbsp;Zhonghao Zhao ,&nbsp;Mengkai Sun ,&nbsp;Bin Hu ,&nbsp;Björn W. Schuller ,&nbsp;Yoshiharu Yamamoto","doi":"10.1016/j.imed.2023.03.001","DOIUrl":"10.1016/j.imed.2023.03.001","url":null,"abstract":"<div><h3>Objective</h3><p>Speech recognition technology is widely used as a mature technical approach in many fields. In the study of depression recognition, speech signals are commonly used due to their convenience and ease of acquisition. Though speech recognition is popular in the research field of depression recognition, it has been little studied in somatisation disorder recognition. The reason for this is the lack of a publicly accessible database of relevant speech and benchmark studies. To this end, we introduced our somatisation disorder speech database and gave benchmark results.</p></div><div><h3>Methods</h3><p>By collecting speech samples of somatisation disorder patients, in cooperation with the Shenzhen University General Hospital, we introduced our somatisation disorder speech database, the Shenzhen Somatisation Speech Corpus (SSSC). Moreover, a benchmark for SSSC using classic acoustic features and a machine learning model was proposed in our work.</p></div><div><h3>Results</h3><p>To obtain a more scientific benchmark, we compared and analysed the performance of different acoustic features, i. e., the full ComPare feature set, or only Mel frequency cepstral coefficients (MFCCs), fundamental frequency (F0), and frequency and bandwidth of the formants (F1-F3). By comparison, the best result of our benchmark was the 76.0% unweighted average recall achieved by a support vector machine with formants F1–F3.</p></div><div><h3>Conclusion</h3><p>The proposal of SSSC may bridge a research gap in somatisation disorder, providing researchers with a publicly accessible speech database. In addition, the results of the benchmark could show the scientific validity and feasibility of computer audition for speech recognition in somatization disorders.</p></div>","PeriodicalId":73400,"journal":{"name":"Intelligent medicine","volume":"4 2","pages":"Pages 96-103"},"PeriodicalIF":4.4,"publicationDate":"2024-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2667102623000219/pdfft?md5=9ae4884ac76562266b28f28068f3f5a0&pid=1-s2.0-S2667102623000219-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"46064781","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Atherosclerotic plaque classification in carotid ultrasound images using machine learning and explainable deep learning 使用机器学习和可解释的深度学习对颈动脉超声图像中的动脉粥样硬化斑块进行分类
IF 4.4 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2024-05-01 DOI: 10.1016/j.imed.2023.05.003
Soni Singh , Pankaj K. Jain , Neeraj Sharma , Mausumi Pohit , Sudipta Roy

Objective

The incidence of cardiovascular diseases (CVD) is rising rapidly worldwide. Some forms of CVD, such as stroke and heart attack, are more common among patients with certain conditions. Atherosclerosis development is a major factor underlying cardiovascular events, such as heart attack and stroke, and its early detection may prevent such events. Ultrasound imaging of carotid arteries is a useful method for diagnosis of atherosclerotic plaques; however, an automated method to classify atherosclerotic plaques for evaluation of early-stage CVD is needed. Here, we propose an automated method for classification of high-risk atherosclerotic plaque ultrasound images.

Methods

Five deep learning (DL) models (VGG16, ResNet-50, GoogLeNet, XceptionNet, and SqueezeNet) were used for automated classification and the results compared with those of a machine learning (ML)-based technique, involving extraction of 23 texture features from ultrasound images and classification using a Support Vector Machine classifier. To enhance model interpretability, output gradient-weighted convolutional activation maps (GradCAMs) were generated and overlayed on original images.

Results

A series of indices, including accuracy, sensitivity, specificity, F1-score, Cohen-kappa index, and area under the curve values, were calculated to evaluate model performance. GradCAM output images allowed visualization of the most significant ultrasound image regions. The GoogLeNet model yielded the highest accuracy (98.20%).

Conclusion

ML models may be also suitable for applications requiring low computational resource. Further, DL models could be more completely automated than ML models.

目标心血管疾病(CVD)的发病率在全球范围内迅速上升。某些形式的心血管疾病,如中风和心脏病发作,在患有某些疾病的患者中更为常见。动脉粥样硬化的发展是心脏病发作和中风等心血管事件的主要诱因,及早发现动脉粥样硬化可预防此类事件的发生。颈动脉超声波成像是诊断动脉粥样硬化斑块的有效方法,但需要一种自动方法对动脉粥样硬化斑块进行分类,以评估早期心血管疾病。方法使用五个深度学习(DL)模型(VGG16、ResNet-50、GoogLeNet、XceptionNet 和 SqueezeNet)进行自动分类,并将结果与基于机器学习(ML)技术的结果进行比较,后者涉及从超声图像中提取 23 个纹理特征,并使用支持向量机分类器进行分类。为了提高模型的可解释性,生成了输出梯度加权卷积激活图(GradCAM)并叠加在原始图像上。结果 计算了一系列指标,包括准确率、灵敏度、特异性、F1-分数、Cohen-kappa 指数和曲线下面积值,以评估模型的性能。GradCAM 输出图像可以显示最重要的超声图像区域。GoogLeNet 模型的准确率最高(98.20%)。此外,与 ML 模型相比,DL 模型的自动化程度更高。
{"title":"Atherosclerotic plaque classification in carotid ultrasound images using machine learning and explainable deep learning","authors":"Soni Singh ,&nbsp;Pankaj K. Jain ,&nbsp;Neeraj Sharma ,&nbsp;Mausumi Pohit ,&nbsp;Sudipta Roy","doi":"10.1016/j.imed.2023.05.003","DOIUrl":"10.1016/j.imed.2023.05.003","url":null,"abstract":"<div><h3>Objective</h3><p>The incidence of cardiovascular diseases (CVD) is rising rapidly worldwide. Some forms of CVD, such as stroke and heart attack, are more common among patients with certain conditions. Atherosclerosis development is a major factor underlying cardiovascular events, such as heart attack and stroke, and its early detection may prevent such events. Ultrasound imaging of carotid arteries is a useful method for diagnosis of atherosclerotic plaques; however, an automated method to classify atherosclerotic plaques for evaluation of early-stage CVD is needed. Here, we propose an automated method for classification of high-risk atherosclerotic plaque ultrasound images.</p></div><div><h3>Methods</h3><p>Five deep learning (DL) models (VGG16, ResNet-50, GoogLeNet, XceptionNet, and SqueezeNet) were used for automated classification and the results compared with those of a machine learning (ML)-based technique, involving extraction of 23 texture features from ultrasound images and classification using a Support Vector Machine classifier. To enhance model interpretability, output gradient-weighted convolutional activation maps (GradCAMs) were generated and overlayed on original images.</p></div><div><h3>Results</h3><p>A series of indices, including accuracy, sensitivity, specificity, F1-score, Cohen-kappa index, and area under the curve values, were calculated to evaluate model performance. GradCAM output images allowed visualization of the most significant ultrasound image regions. The GoogLeNet model yielded the highest accuracy (98.20%).</p></div><div><h3>Conclusion</h3><p>ML models may be also suitable for applications requiring low computational resource. Further, DL models could be more completely automated than ML models.</p></div>","PeriodicalId":73400,"journal":{"name":"Intelligent medicine","volume":"4 2","pages":"Pages 83-95"},"PeriodicalIF":4.4,"publicationDate":"2024-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2667102623000414/pdfft?md5=76fb748a98c6820248b23d97b4e68905&pid=1-s2.0-S2667102623000414-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"42888619","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A hybrid system to predict brain stroke using a combined feature selection and classifier 一种使用特征选择和分类器组合预测脑卒中的混合系统
IF 4.4 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2024-05-01 DOI: 10.1016/j.imed.2023.06.002
Priyanka Bathla, Rajneesh Kumar

Background

Brain stroke is a serious health issue that requires timely and accurate prediction for effective treatment and prevention. This study described a hybrid system that used the best feature selection method and classifier to predict brain stroke.

Methods

The Stroke Prediction Dataset from Kaggle was used for this study. Synthetic minority over-sampling technique (SMOTE) analysis was used to accomplish class balancing. Accuracy, sensitivity, specificity, precision, and the F-Measure were the main performance parameters considered for investigation. To determine the best combination for predicting brain stroke, the performance of five classifiers, Naïve Bayes (NB), support vector machine (SVM), random forest (RF), adaptive boosting (Adaboost), and extreme gradient boosting (XGBoost), was compared along with three feature selection techniques, mutual information (MI), Pearson correlation (PC), and feature importance (FI). The performance parameters were assessed using k-fold cross-validation.

Results

The hybrid system proposed in this study identified a reduced set of features that were able to effectively predict brain stroke. FI provided a feature reduction ratio of 36.3%. The most successful hybrid system for predicting brain stroke used FI as the feature selection technique and RF as the classifier, achieving an accuracy of 97.17%.

Conclusion

The proposed system predicted brain stroke with high accuracy. These findings could be used to inform the early detection and prevention of brain stroke, allowing healthcare professionals to provide timely and targeted care to at-risk patients.

背景脑中风是一个严重的健康问题,需要及时准确的预测才能有效治疗和预防。本研究介绍了一种混合系统,该系统使用最佳特征选择方法和分类器来预测脑中风。本研究使用了 Kaggle 中的脑卒中预测数据集,并使用合成少数群体过度采样技术(SMOTE)分析来实现类平衡。准确度、灵敏度、特异性、精确度和 F-Measure 是考察的主要性能参数。为了确定预测脑中风的最佳组合,比较了奈夫贝叶斯(NB)、支持向量机(SVM)、随机森林(RF)、自适应增强(Adaboost)和极梯度增强(XGBoost)这五种分类器的性能,以及互信息(MI)、皮尔逊相关(PC)和特征重要性(FI)这三种特征选择技术。结果本研究提出的混合系统识别出了一组能够有效预测脑中风的精简特征。FI 提供了 36.3% 的特征缩减率。预测脑中风最成功的混合系统使用 FI 作为特征选择技术,RF 作为分类器,准确率达到 97.17%。这些发现可用于脑中风的早期检测和预防,使医护人员能够为高危患者提供及时和有针对性的护理。
{"title":"A hybrid system to predict brain stroke using a combined feature selection and classifier","authors":"Priyanka Bathla,&nbsp;Rajneesh Kumar","doi":"10.1016/j.imed.2023.06.002","DOIUrl":"10.1016/j.imed.2023.06.002","url":null,"abstract":"<div><h3>Background</h3><p>Brain stroke is a serious health issue that requires timely and accurate prediction for effective treatment and prevention. This study described a hybrid system that used the best feature selection method and classifier to predict brain stroke.</p></div><div><h3>Methods</h3><p>The Stroke Prediction Dataset from Kaggle was used for this study. Synthetic minority over-sampling technique (SMOTE) analysis was used to accomplish class balancing. Accuracy, sensitivity, specificity, precision, and the F-Measure were the main performance parameters considered for investigation. To determine the best combination for predicting brain stroke, the performance of five classifiers, Naïve Bayes (NB), support vector machine (SVM), random forest (RF), adaptive boosting (Adaboost), and extreme gradient boosting (XGBoost), was compared along with three feature selection techniques, mutual information (MI), Pearson correlation (PC), and feature importance (FI). The performance parameters were assessed using k-fold cross-validation.</p></div><div><h3>Results</h3><p>The hybrid system proposed in this study identified a reduced set of features that were able to effectively predict brain stroke. FI provided a feature reduction ratio of 36.3%. The most successful hybrid system for predicting brain stroke used FI as the feature selection technique and RF as the classifier, achieving an accuracy of 97.17%.</p></div><div><h3>Conclusion</h3><p>The proposed system predicted brain stroke with high accuracy. These findings could be used to inform the early detection and prevention of brain stroke, allowing healthcare professionals to provide timely and targeted care to at-risk patients.</p></div>","PeriodicalId":73400,"journal":{"name":"Intelligent medicine","volume":"4 2","pages":"Pages 75-82"},"PeriodicalIF":4.4,"publicationDate":"2024-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S266710262300058X/pdfft?md5=0f0ee3d2b045cfcda2c84f99bb898a5e&pid=1-s2.0-S266710262300058X-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49456623","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Intelligent medicine
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1