Empowering Medical Diagnosis: A Machine Learning Approach for Symptom-Based Health Checker

Mobile Networks and Applications Pub Date : 2024-08-01 DOI:10.1007/s11036-024-02369-x

Leila Aissaoui Ferhi, Manel Ben Amar, Fethi Choubani, Ridha Bouallegue

{"title":"Empowering Medical Diagnosis: A Machine Learning Approach for Symptom-Based Health Checker","authors":"Leila Aissaoui Ferhi, Manel Ben Amar, Fethi Choubani, Ridha Bouallegue","doi":"10.1007/s11036-024-02369-x","DOIUrl":null,"url":null,"abstract":"<p>AI-powered health checkers and apps for automated medical diagnosis have a lot of promise for a variety of applications. During pandemics, they can lessen the need for in-person patient-doctor interactions and offer vital medical advice in neglected rural areas. In this work, we demonstrate the creation of an expert system driven by machine learning on the web. This technology helps medical practitioners make better diagnostic decisions by supporting them and by offering accurate health forecasts and suggestions to the general population. Due to the lack of authentic medical datasets focusing on symptoms, we collected information from reputable medical sources. This enabled us to prioritize the medical diagnostic process, resulting in the compilation of a comprehensive list of illnesses and associated symptoms. This dataset played a key role in developing our health checker, which consisted of four primary parts: FrontEnd, Authentication module, BackEnd housing the machine learning module, and the Database. We constructed a dataset encompassing up to 415712 synthetic patients, 75 symptoms and risk factors, and 22 cough-related diagnoses. This dataset enabled the training and testing of supervised machine learning models to identify the most effective algorithm for implementation. The accuracy, performance and generalization ability of the utilized machine learning models were assessed using metrics including accuracy, F1-score and cross validation. Our work not only advances machine learning models but also addresses the pressing need for reliable medical datasets. The outcome of our efforts is a robust health checker, set to bring positive changes to diagnostic processes and healthcare accessibility as well as generalization and real-world applicability of our models. This highlights the critical role of dataset quality, especially with our ‘third dataset’ showcasing unparalleled performance across diverse medical scenarios with an accuracy superior to 99% and F1 score superior to 99% also for all the models. Stratified fivefold cross-validation also demonstrates positive results with an average accuracy and an average F1 score exceeding 99% for all models, thereby enhancing the reliability of our model evaluations and boosting confidence in the obtained metrics. In conclusion, our work propels the advancement of machine learning models, specifically addressing the imperative for reliable medical datasets. The result is a symptom-based health checker that demonstrates resilience, positioned to potentially contribute to advancements in diagnostics and improve accessibility to healthcare services.</p>","PeriodicalId":501103,"journal":{"name":"Mobile Networks and Applications","volume":"8 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Mobile Networks and Applications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1007/s11036-024-02369-x","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

AI-powered health checkers and apps for automated medical diagnosis have a lot of promise for a variety of applications. During pandemics, they can lessen the need for in-person patient-doctor interactions and offer vital medical advice in neglected rural areas. In this work, we demonstrate the creation of an expert system driven by machine learning on the web. This technology helps medical practitioners make better diagnostic decisions by supporting them and by offering accurate health forecasts and suggestions to the general population. Due to the lack of authentic medical datasets focusing on symptoms, we collected information from reputable medical sources. This enabled us to prioritize the medical diagnostic process, resulting in the compilation of a comprehensive list of illnesses and associated symptoms. This dataset played a key role in developing our health checker, which consisted of four primary parts: FrontEnd, Authentication module, BackEnd housing the machine learning module, and the Database. We constructed a dataset encompassing up to 415712 synthetic patients, 75 symptoms and risk factors, and 22 cough-related diagnoses. This dataset enabled the training and testing of supervised machine learning models to identify the most effective algorithm for implementation. The accuracy, performance and generalization ability of the utilized machine learning models were assessed using metrics including accuracy, F1-score and cross validation. Our work not only advances machine learning models but also addresses the pressing need for reliable medical datasets. The outcome of our efforts is a robust health checker, set to bring positive changes to diagnostic processes and healthcare accessibility as well as generalization and real-world applicability of our models. This highlights the critical role of dataset quality, especially with our ‘third dataset’ showcasing unparalleled performance across diverse medical scenarios with an accuracy superior to 99% and F1 score superior to 99% also for all the models. Stratified fivefold cross-validation also demonstrates positive results with an average accuracy and an average F1 score exceeding 99% for all models, thereby enhancing the reliability of our model evaluations and boosting confidence in the obtained metrics. In conclusion, our work propels the advancement of machine learning models, specifically addressing the imperative for reliable medical datasets. The result is a symptom-based health checker that demonstrates resilience, positioned to potentially contribute to advancements in diagnostics and improve accessibility to healthcare services.

Abstract Image

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

增强医疗诊断能力：基于症状的健康检查器的机器学习方法

人工智能驱动的健康检查器和自动医疗诊断应用程序在各种应用中大有可为。在大流行病期间，它们可以减少患者与医生面对面交流的需求，并为被忽视的农村地区提供重要的医疗建议。在这项工作中，我们展示了如何在网络上创建一个由机器学习驱动的专家系统。这项技术通过为医疗从业人员提供支持，帮助他们做出更好的诊断决定，并为普通大众提供准确的健康预测和建议。由于缺乏以症状为重点的真实医疗数据集，我们从著名的医疗来源收集信息。这使我们能够对医疗诊断过程进行优先排序，从而编制出一份全面的疾病和相关症状列表。该数据集在开发我们的健康检查器过程中发挥了关键作用，健康检查器由四个主要部分组成：前端（FrontEnd）、认证模块（Authentication module）、包含机器学习模块的后端（BackEnd）和数据库（Database）。我们构建了一个数据集，其中包括多达 415712 名合成患者、75 种症状和风险因素以及 22 种与咳嗽相关的诊断。通过该数据集，我们对有监督的机器学习模型进行了训练和测试，以确定最有效的实施算法。我们使用准确率、F1 分数和交叉验证等指标评估了所使用的机器学习模型的准确性、性能和泛化能力。我们的工作不仅推动了机器学习模型的发展，还解决了对可靠医疗数据集的迫切需求。我们的努力成果是一个强大的健康检查器，它将为诊断流程和医疗保健的可及性以及模型的通用性和现实世界的适用性带来积极的变化。这凸显了数据集质量的关键作用，尤其是我们的 "第三个数据集 "在各种医疗场景中都表现出了无与伦比的性能，所有模型的准确率都超过了 99%，F1 分数也超过了 99%。分层五重交叉验证也取得了积极成果，所有模型的平均准确率和平均 F1 分数都超过了 99%，从而提高了模型评估的可靠性，增强了对所获指标的信心。总之，我们的工作推动了机器学习模型的进步，特别是解决了可靠医疗数据集的当务之急。我们的成果是一个基于症状的健康检查器，它表现出了强大的适应能力，有望推动诊断技术的进步，改善医疗服务的可及性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Mobile Networks and Applications

自引率

0.00%

发文量