首页 > 最新文献

Computer methods and programs in biomedicine update最新文献

英文 中文
Numerical study on normal lung sounds in bronchial airways under different breathing intensities 不同呼吸强度下支气管正常肺音的数值研究
Pub Date : 2024-01-01 DOI: 10.1016/j.cmpbup.2024.100154
Huiqiang Li , Xiaozhao Li , Juntao Feng

Background

Due to the complexity of airways and the limitation of experiments, the production mechanism of the lung sounds in airways has not been fully understood, which often confuses diagnosis.

Method

A 3D geometrical model of human airways (G5-G8) has been developed based on Weibel's model. Simulation on transient airflow and the noise production during exhalation under different breathing intensities (Q = 15, 30, 45, 60, 75, 90 L/min) has been carried out with Direct Noise Computation (DNC) and Ffowcs Williams-Hawkings (FW-H) method.

Results

(1) The junctions between airways are most likely to produce lung sounds, and the peak value is located in the junction between G7 and G6 at the middle of exhalation (about 0.75 s). (2) With the increase in breathing intensity, the average sound pressure level first increases, reaches the peak value at 70–75 L/min, and then drops. (3) Higher breathing intensity is helpful to produce the feature of wheezing, namely a comparatively higher sound pressure level in the range of 200–500 Hz. Moreover, this feature is prominent with the increase in breathing intensity.

背景由于气道的复杂性和实验的局限性,气道中肺音的产生机制尚未被完全理解,这往往会给诊断带来困惑。方法在 Weibel 模型的基础上建立了人体气道(G5-G8)的三维几何模型。采用直接噪声计算(DNC)和 Ffowcs Williams-Hawkings (FW-H) 方法对不同呼吸强度(Q = 15、30、45、60、75、90 L/min)下的瞬时气流和呼气时产生的噪声进行了模拟。(2)随着呼吸强度的增加,平均声压级先上升,在 70-75 L/min 时达到峰值,然后下降。(3) 较高的呼吸强度有助于产生喘鸣特征,即在 200-500 Hz 范围内声压级相对较高。此外,随着呼吸强度的增加,这一特征也会更加突出。
{"title":"Numerical study on normal lung sounds in bronchial airways under different breathing intensities","authors":"Huiqiang Li ,&nbsp;Xiaozhao Li ,&nbsp;Juntao Feng","doi":"10.1016/j.cmpbup.2024.100154","DOIUrl":"https://doi.org/10.1016/j.cmpbup.2024.100154","url":null,"abstract":"<div><h3>Background</h3><p>Due to the complexity of airways and the limitation of experiments, the production mechanism of the lung sounds in airways has not been fully understood, which often confuses diagnosis.</p></div><div><h3>Method</h3><p>A 3D geometrical model of human airways (G5-G8) has been developed based on Weibel's model. Simulation on transient airflow and the noise production during exhalation under different breathing intensities (<em>Q</em> = 15, 30, 45, 60, 75, 90 L/min) has been carried out with Direct Noise Computation (DNC) and Ffowcs Williams-Hawkings (FW-H) method.</p></div><div><h3>Results</h3><p>(1) The junctions between airways are most likely to produce lung sounds, and the peak value is located in the junction between G7 and G6 at the middle of exhalation (about 0.75 s). (2) With the increase in breathing intensity, the average sound pressure level first increases, reaches the peak value at 70–75 L/min, and then drops. (3) Higher breathing intensity is helpful to produce the feature of wheezing, namely a comparatively higher sound pressure level in the range of 200–500 Hz. Moreover, this feature is prominent with the increase in breathing intensity.</p></div>","PeriodicalId":72670,"journal":{"name":"Computer methods and programs in biomedicine update","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2666990024000211/pdfft?md5=6b1cdf9b1b9d99f91f6def14fe7bffab&pid=1-s2.0-S2666990024000211-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140604780","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
AI in diagnostic imaging: Revolutionising accuracy and efficiency 诊断成像中的人工智能:彻底改变准确性和效率
Pub Date : 2024-01-01 DOI: 10.1016/j.cmpbup.2024.100146
Mohamed Khalifa , Mona Albadawy

Introduction

This review evaluates the role of Artificial Intelligence (AI) in transforming diagnostic imaging in healthcare. AI has the potential to enhance accuracy and efficiency of interpreting medical images like X-rays, MRIs, and CT scans.

Methods

A comprehensive literature search across databases like PubMed, Embase, and Google Scholar was conducted, focusing on articles published in peer-reviewed journals in English language since 2019. Inclusion criteria targeted studies on AI's application in diagnostic imaging, while exclusion criteria filtered out irrelevant or empirically unsupported studies.

Results and discussion

Through 30 included studies, the review identifies four AI domains and eight functions in diagnostic imaging: 1) In the area of Image Analysis and Interpretation, AI capabilities enhanced image analysis, spotting minor discrepancies and anomalies, and by reducing human error, maintaining accuracy and mitigating the impact of fatigue or oversight, 2) The Operational Efficiency is enhanced by AI through efficiency and speed, which accelerates the diagnostic process, and cost-effectiveness, reducing healthcare costs by improving efficiency and accuracy, 3) Predictive and Personalised Healthcare benefit from AI through predictive analytics, leveraging historical data for early diagnosis, and personalised medicine, which employs patient-specific data for tailored diagnostic approaches, 4) Lastly, in Clinical Decision Support, AI assists in complex procedures by providing precise imaging support and integrates with other technologies like electronic health records for enriched health insights, showcasing ai's transformative potential in diagnostic imaging. The review also discusses challenges in AI integration, such as ethical concerns, data privacy, and the need for technology investments and training.

Conclusion

AI is revolutionising diagnostic imaging by improving accuracy, efficiency, and personalised healthcare delivery. Recommendations include continued investment in AI, establishment of ethical guidelines, training for healthcare professionals, and ensuring patient-centred AI development. The review calls for collaborative efforts to integrate AI in clinical practice effectively and address healthcare disparities.

导言本综述评估了人工智能(AI)在改变医疗诊断成像方面的作用。人工智能有可能提高X射线、核磁共振成像和CT扫描等医学影像解读的准确性和效率。方法在PubMed、Embase和谷歌学术等数据库中进行了全面的文献检索,重点关注2019年以来发表在同行评审期刊上的英文文章。纳入标准针对有关人工智能在影像诊断中应用的研究,而排除标准则过滤掉了不相关或无经验支持的研究。结果与讨论通过纳入的 30 篇研究,综述确定了人工智能在影像诊断中的四个领域和八种功能:1)在图像分析和解读领域,人工智能功能增强了图像分析,发现细微差异和异常,并通过减少人为错误,保持准确性,减轻疲劳或疏忽的影响;2)人工智能通过效率和速度提高了操作效率,加快了诊断过程,并提高了成本效益,通过提高效率和准确性降低了医疗成本、3)预测性和个性化医疗受益于人工智能的预测性分析和个性化医疗,前者利用历史数据进行早期诊断,后者则利用患者的特定数据进行量身定制的诊断方法。本综述还讨论了人工智能整合所面临的挑战,如伦理问题、数据隐私以及技术投资和培训需求。建议包括继续投资人工智能、制定伦理准则、培训医疗保健专业人员以及确保以患者为中心的人工智能发展。本综述呼吁各方共同努力,将人工智能有效融入临床实践,并解决医疗差距问题。
{"title":"AI in diagnostic imaging: Revolutionising accuracy and efficiency","authors":"Mohamed Khalifa ,&nbsp;Mona Albadawy","doi":"10.1016/j.cmpbup.2024.100146","DOIUrl":"10.1016/j.cmpbup.2024.100146","url":null,"abstract":"<div><h3>Introduction</h3><p>This review evaluates the role of Artificial Intelligence (AI) in transforming diagnostic imaging in healthcare. AI has the potential to enhance accuracy and efficiency of interpreting medical images like X-rays, MRIs, and CT scans.</p></div><div><h3>Methods</h3><p>A comprehensive literature search across databases like PubMed, Embase, and Google Scholar was conducted, focusing on articles published in peer-reviewed journals in English language since 2019. Inclusion criteria targeted studies on AI's application in diagnostic imaging, while exclusion criteria filtered out irrelevant or empirically unsupported studies.</p></div><div><h3>Results and discussion</h3><p>Through 30 included studies, the review identifies four AI domains and eight functions in diagnostic imaging: 1) In the area of Image Analysis and Interpretation, AI capabilities enhanced image analysis, spotting minor discrepancies and anomalies, and by reducing human error, maintaining accuracy and mitigating the impact of fatigue or oversight, 2) The Operational Efficiency is enhanced by AI through efficiency and speed, which accelerates the diagnostic process, and cost-effectiveness, reducing healthcare costs by improving efficiency and accuracy, 3) Predictive and Personalised Healthcare benefit from AI through predictive analytics, leveraging historical data for early diagnosis, and personalised medicine, which employs patient-specific data for tailored diagnostic approaches, 4) Lastly, in Clinical Decision Support, AI assists in complex procedures by providing precise imaging support and integrates with other technologies like electronic health records for enriched health insights, showcasing ai's transformative potential in diagnostic imaging. The review also discusses challenges in AI integration, such as ethical concerns, data privacy, and the need for technology investments and training.</p></div><div><h3>Conclusion</h3><p>AI is revolutionising diagnostic imaging by improving accuracy, efficiency, and personalised healthcare delivery. Recommendations include continued investment in AI, establishment of ethical guidelines, training for healthcare professionals, and ensuring patient-centred AI development. The review calls for collaborative efforts to integrate AI in clinical practice effectively and address healthcare disparities.</p></div>","PeriodicalId":72670,"journal":{"name":"Computer methods and programs in biomedicine update","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2666990024000132/pdfft?md5=dc2a7d25e2ce178c93e675f9e58901e5&pid=1-s2.0-S2666990024000132-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140084641","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Digital health literacy and information-seeking on the internet in relation to COVID-19 among university students in Greece 希腊大学生中与 COVID-19 相关的数字健康知识和互联网信息搜索情况
Pub Date : 2024-01-01 DOI: 10.1016/j.cmpbup.2024.100139
Evanthia Sakellari , Orkan Okan , Kevin Dadaczynski , Kostantinos Koutentakis , Areti Lagiou

Background

COVID-19 is the first pandemic in history in which technology and social media are being used for people to be informed and be safe. Thus, digital health literacy skills affect the way people will protect and promote their health.

Methods

A cross-sectional web-based study was conducted with a convenience sample among university students (N=604) from one of the Universities located in Attica (Greece) during May - June 2020. The COVID-HL university students survey questionnaire was used for collecting the data.

Results

In regards to information search, 28 % of the university students indicated that they found it very difficult/difficult to find the exact information they were looking for and 20.4 % to make a choice from all the information they found. Additionally, 45.1 % of the participants found it very difficult/difficult to decide whether the information retrieved via online search is reliable or not.

Conclusion

The results indicate a need for the promotion of digital health literacy among university students and therefore, health education interventions need to optimize students’ seeking skills and critical thinking. Health educators should consider the results of this study and involve the university students in any intervention they plan in order to address the students’ specific needs. It is also suggested that these health education interventions should be integrated throughout all academic activities.

背景COVID-19是历史上第一次利用技术和社交媒体让人们了解信息并确保安全的流行病。因此,数字健康素养技能会影响人们保护和促进自身健康的方式。方法 在 2020 年 5 月至 6 月期间,对希腊阿提卡一所大学的大学生(N=604)进行了一项基于网络的横断面研究。结果 在信息搜索方面,28%的大学生表示很难/很难找到他们想要的准确信息,20.4%的大学生表示很难/很难从他们找到的所有信息中做出选择。此外,45.1%的参与者认为很难/很难判断通过在线搜索获得的信息是否可靠。健康教育工作者应考虑本研究的结果,让大学生参与他们计划的任何干预措施,以满足学生的特殊需求。研究还建议,这些健康教育干预措施应贯穿于所有学术活动中。
{"title":"Digital health literacy and information-seeking on the internet in relation to COVID-19 among university students in Greece","authors":"Evanthia Sakellari ,&nbsp;Orkan Okan ,&nbsp;Kevin Dadaczynski ,&nbsp;Kostantinos Koutentakis ,&nbsp;Areti Lagiou","doi":"10.1016/j.cmpbup.2024.100139","DOIUrl":"10.1016/j.cmpbup.2024.100139","url":null,"abstract":"<div><h3>Background</h3><p>COVID-19 is the first pandemic in history in which technology and social media are being used for people to be informed and be safe. Thus, digital health literacy skills affect the way people will protect and promote their health.</p></div><div><h3>Methods</h3><p>A cross-sectional web-based study was conducted with a convenience sample among university students (<em>N</em>=604) from one of the Universities located in Attica (Greece) during May - June 2020. The COVID-HL university students survey questionnaire was used for collecting the data.</p></div><div><h3>Results</h3><p>In regards to information search, 28 % of the university students indicated that they found it very difficult/difficult to find the exact information they were looking for and 20.4 % to make a choice from all the information they found. Additionally, 45.1 % of the participants found it very difficult/difficult to decide whether the information retrieved via online search is reliable or not.</p></div><div><h3>Conclusion</h3><p>The results indicate a need for the promotion of digital health literacy among university students and therefore, health education interventions need to optimize students’ seeking skills and critical thinking. Health educators should consider the results of this study and involve the university students in any intervention they plan in order to address the students’ specific needs. It is also suggested that these health education interventions should be integrated throughout all academic activities.</p></div>","PeriodicalId":72670,"journal":{"name":"Computer methods and programs in biomedicine update","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2666990024000065/pdfft?md5=7424d30d38ac13d3fb171812c2d3fc89&pid=1-s2.0-S2666990024000065-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139638316","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Deciphering the complex links between inflammatory bowel diseases and NAFLD through advanced statistical and machine learning analysis 通过先进的统计和机器学习分析,破译炎症性肠病和非酒精性脂肪肝之间的复杂联系
Pub Date : 2024-01-01 DOI: 10.1016/j.cmpbup.2024.100165

Background and Objective:

Accurate classification of liver disease stages provides crucial insights into patient prognosis, aiding in the prediction of disease outcomes and influencing clinical decision-making. There is an urgent need for non-invasive methods to diagnose various stages of liver dysfunction and uncover hidden pattern based on individual disease characteristics.

Method:

One popular and effective approach is collecting serum biomarker samples. The study was conducted on collected serum biomaker samples of 81 patients with Inflammatory Bowel Disease (IBD) of Changhua Christian Hospital in China, including 36 with Crohn’s disease (CD) and 45 with Ulcerative Colitis (UC) using Latent Semantic Analysis(LSA) and machine learning (ML) techniques.Machine Learning algorithms Random Forest (RF), Logistic Regression (LR), XGBoost (XGB), and Support Vector Classifier (SVC), were utilized to predict liver risk associated with conditions including Hepatitis, Autoimmune Hepatitis (AIH), Alcoholic Liver Disease (ALD), and Non-Alcoholic Fatty Liver Disease (NAFLD). Models’ accuracy was assessed using K-Fold Cross-Validation (CV).Distinct pattern were identified using Latent Semantic Analysis(LSA). Furthermore, SHAP plots were utilized for enhanced interpretability, highlighting essential features for liver dysfunction levels.

Results:

The inflammatory profile, mixed disease profile, and healthy profile were the three distinct clusters were identified with LSA. The RF model achieved high accuracy of 0.94±0.06. Serum Glutamate Pyruvate Transaminase (GPT), Age at Diagnosis (AAD), Erythrocyte Sedimentation Rate (ESR), C-reactive protein (CRP) were found the most key important features in liver disease staging increment.

Conclusion:

The research significantly contributes to the fields of biomedical informatics and clinical decision-making. The developed model offers valuable decision-making tools for clinicians, enabling early and targeted interventions.
背景和目的:对肝脏疾病分期进行准确的分类可为患者预后提供重要的洞察力,有助于预测疾病结果并影响临床决策。目前急需一种非侵入性方法来诊断肝功能异常的各个阶段,并根据个体疾病特征揭示隐藏的模式。本研究采用潜语义分析(LSA)和机器学习(ML)技术,对中国彰化基督教医院收集的81名炎症性肠病(IBD)患者的血清生物标记物样本进行了分析,其中包括36名克罗恩病(CD)患者和45名溃疡性结肠炎(UC)患者。利用机器学习算法随机森林(RF)、逻辑回归(LR)、XGBoost(XGB)和支持向量分类器(SVC)来预测与肝炎、自身免疫性肝炎(AIH)、酒精性肝病(ALD)和非酒精性脂肪肝(NAFLD)等疾病相关的肝脏风险。使用潜语义分析(LSA)确定了不同的模式。此外,还利用 SHAP 图增强了可解释性,突出了肝功能异常水平的基本特征。RF 模型的准确率高达 0.94±0.06。血清谷氨酸丙酮酸转氨酶(GPT)、诊断年龄(AAD)、红细胞沉降率(ESR)、C反应蛋白(CRP)是肝病分期增量中最重要的特征。所开发的模型为临床医生提供了有价值的决策工具,可实现早期和有针对性的干预。
{"title":"Deciphering the complex links between inflammatory bowel diseases and NAFLD through advanced statistical and machine learning analysis","authors":"","doi":"10.1016/j.cmpbup.2024.100165","DOIUrl":"10.1016/j.cmpbup.2024.100165","url":null,"abstract":"<div><h3>Background and Objective:</h3><div>Accurate classification of liver disease stages provides crucial insights into patient prognosis, aiding in the prediction of disease outcomes and influencing clinical decision-making. There is an urgent need for non-invasive methods to diagnose various stages of liver dysfunction and uncover hidden pattern based on individual disease characteristics.</div></div><div><h3>Method:</h3><div>One popular and effective approach is collecting serum biomarker samples. The study was conducted on collected serum biomaker samples of 81 patients with Inflammatory Bowel Disease (IBD) of Changhua Christian Hospital in China, including 36 with Crohn’s disease (CD) and 45 with Ulcerative Colitis (UC) using Latent Semantic Analysis(LSA) and machine learning (ML) techniques.Machine Learning algorithms Random Forest (RF), Logistic Regression (LR), XGBoost (XGB), and Support Vector Classifier (SVC), were utilized to predict liver risk associated with conditions including Hepatitis, Autoimmune Hepatitis (AIH), Alcoholic Liver Disease (ALD), and Non-Alcoholic Fatty Liver Disease (NAFLD). Models’ accuracy was assessed using K-Fold Cross-Validation (CV).Distinct pattern were identified using Latent Semantic Analysis(LSA). Furthermore, SHAP plots were utilized for enhanced interpretability, highlighting essential features for liver dysfunction levels.</div></div><div><h3>Results:</h3><div>The inflammatory profile, mixed disease profile, and healthy profile were the three distinct clusters were identified with LSA. The RF model achieved high accuracy of <span><math><mrow><mn>0</mn><mo>.</mo><mn>94</mn><mo>±</mo><mn>0</mn><mo>.</mo><mn>06</mn></mrow></math></span>. Serum Glutamate Pyruvate Transaminase (GPT), Age at Diagnosis (AAD), Erythrocyte Sedimentation Rate (ESR), C-reactive protein (CRP) were found the most key important features in liver disease staging increment.</div></div><div><h3>Conclusion:</h3><div>The research significantly contributes to the fields of biomedical informatics and clinical decision-making. The developed model offers valuable decision-making tools for clinicians, enabling early and targeted interventions.</div></div>","PeriodicalId":72670,"journal":{"name":"Computer methods and programs in biomedicine update","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142319497","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Using artificial intelligence in academic writing and research: An essential productivity tool 在学术写作和研究中使用人工智能:必不可少的生产力工具
Pub Date : 2024-01-01 DOI: 10.1016/j.cmpbup.2024.100145
Mohamed Khalifa , Mona Albadawy

Background

Academic writing is an essential component of research, characterized by structured expression of ideas, data-driven arguments, and logical reasoning. However, it poses challenges such as handling vast amounts of information and complex ideas. The integration of Artificial Intelligence (AI) into academic writing has become increasingly important, offering solutions to these challenges. This review aims to explore specific domains where AI significantly supports academic writing.

Methods

A systematic review of literature from databases like PubMed, Embase, and Google Scholar, published since 2019, was conducted. Studies were included based on relevance to AI's application in academic writing and research, focusing on writing assistance, grammar improvement, structure optimization, and other related aspects.

Results

The search identified 24 studies through which six core domains were identified where AI helps academic writing and research: 1) facilitating idea generation and research design, 2) improving content and structuring, 3) supporting literature review and synthesis, 4) enhancing data management and analysis, 5) supporting editing, review, and publishing, and 6) assisting in communication, outreach, and ethical compliance. ChatGPT has shown substantial potential in these areas, though challenges like maintaining academic integrity and balancing AI use with human insight remain.

Conclusion and recommendations

AI significantly revolutionises academic writing and research across various domains. Recommendations include broader integration of AI tools in research workflows, emphasizing ethical and transparent use, providing adequate training for researchers, and maintaining a balance between AI utility and human insight. Ongoing research and development are essential to address emerging challenges and ethical considerations in AI's application in academia.

背景学术写作是研究工作的重要组成部分,其特点是有条理地表达观点、数据驱动论证和逻辑推理。然而,它也带来了一些挑战,如处理海量信息和复杂观点。将人工智能(AI)融入学术写作变得越来越重要,为这些挑战提供了解决方案。本综述旨在探讨人工智能在哪些具体领域为学术写作提供了重要支持。方法对PubMed、Embase和Google Scholar等数据库中2019年以来发表的文献进行了系统综述。根据人工智能在学术写作和研究中应用的相关性纳入研究,重点关注写作辅助、语法改进、结构优化和其他相关方面。结果检索发现了24项研究,通过这些研究确定了人工智能有助于学术写作和研究的六个核心领域:1) 促进想法的产生和研究设计;2) 改进内容和结构;3) 支持文献综述和合成;4) 加强数据管理和分析;5) 支持编辑、审查和出版;6) 协助交流、推广和伦理合规。ChatGPT 已在这些领域显示出巨大的潜力,尽管仍面临着保持学术诚信以及平衡人工智能的使用与人类洞察力等挑战。建议包括在研究工作流程中更广泛地整合人工智能工具,强调使用的道德性和透明度,为研究人员提供充分的培训,以及在人工智能的实用性和人类洞察力之间保持平衡。要解决人工智能在学术界应用过程中新出现的挑战和伦理问题,持续的研究和开发至关重要。
{"title":"Using artificial intelligence in academic writing and research: An essential productivity tool","authors":"Mohamed Khalifa ,&nbsp;Mona Albadawy","doi":"10.1016/j.cmpbup.2024.100145","DOIUrl":"https://doi.org/10.1016/j.cmpbup.2024.100145","url":null,"abstract":"<div><h3>Background</h3><p>Academic writing is an essential component of research, characterized by structured expression of ideas, data-driven arguments, and logical reasoning. However, it poses challenges such as handling vast amounts of information and complex ideas. The integration of Artificial Intelligence (AI) into academic writing has become increasingly important, offering solutions to these challenges. This review aims to explore specific domains where AI significantly supports academic writing.</p></div><div><h3>Methods</h3><p>A systematic review of literature from databases like PubMed, Embase, and Google Scholar, published since 2019, was conducted. Studies were included based on relevance to AI's application in academic writing and research, focusing on writing assistance, grammar improvement, structure optimization, and other related aspects.</p></div><div><h3>Results</h3><p>The search identified 24 studies through which six core domains were identified where AI helps academic writing and research: 1) facilitating idea generation and research design, 2) improving content and structuring, 3) supporting literature review and synthesis, 4) enhancing data management and analysis, 5) supporting editing, review, and publishing, and 6) assisting in communication, outreach, and ethical compliance. ChatGPT has shown substantial potential in these areas, though challenges like maintaining academic integrity and balancing AI use with human insight remain.</p></div><div><h3>Conclusion and recommendations</h3><p>AI significantly revolutionises academic writing and research across various domains. Recommendations include broader integration of AI tools in research workflows, emphasizing ethical and transparent use, providing adequate training for researchers, and maintaining a balance between AI utility and human insight. Ongoing research and development are essential to address emerging challenges and ethical considerations in AI's application in academia.</p></div>","PeriodicalId":72670,"journal":{"name":"Computer methods and programs in biomedicine update","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2666990024000120/pdfft?md5=69cd44e1ee12e7efa2147c0319eb0030&pid=1-s2.0-S2666990024000120-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140062690","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Concepts, objectives and analysis of public health surveillance systems 公共卫生监测系统的概念、目标和分析
Pub Date : 2024-01-01 DOI: 10.1016/j.cmpbup.2024.100136
Hurmat Ali Shah, Mowafa Househ

Public health surveillance (PHS) systems are an important tool to map the distribution and burden of disease as well as enable efficient distribution of resources to fight a disease. The surveillance systems are used to detect, report, track a disease as well as assess the response to the disease and people's attitudes. PHS systems are changing with the rapid change in technology and are becoming more real-time responsive with availability of new type of data such as online content and social media data. This review presents the basics of surveillance systems and develop from it to show the evolution of surveillance systems. The different forms of data available, surveillance methods and surveillance types are also reviewed such as social media based, web-based and clinical data based PHS maps. This review provide comprehensive details of the surveillance systems in terms of data types used, source of data and purpose of the surveillance system.

公共卫生监测系统(PHS)是绘制疾病分布和负担图以及有效分配抗病资源的重要工具。监测系统用于检测、报告和跟踪疾病,以及评估对疾病的反应和人们的态度。随着技术的快速发展,公共卫生监测系统也在发生变化,并且随着新型数据(如在线内容和社交媒体数据)的出现,其实时响应性也在不断提高。本综述介绍了监控系统的基本原理,并以此为基础说明监控系统的演变。此外,还回顾了不同形式的可用数据、监控方法和监控类型,如基于社交媒体、基于网络和基于临床数据的 PHS 地图。这篇综述从使用的数据类型、数据来源和监测系统的目的等方面全面详细地介绍了监测系统。
{"title":"Concepts, objectives and analysis of public health surveillance systems","authors":"Hurmat Ali Shah,&nbsp;Mowafa Househ","doi":"10.1016/j.cmpbup.2024.100136","DOIUrl":"10.1016/j.cmpbup.2024.100136","url":null,"abstract":"<div><p>Public health surveillance (PHS) systems are an important tool to map the distribution and burden of disease as well as enable efficient distribution of resources to fight a disease. The surveillance systems are used to detect, report, track a disease as well as assess the response to the disease and people's attitudes. PHS systems are changing with the rapid change in technology and are becoming more real-time responsive with availability of new type of data such as online content and social media data. This review presents the basics of surveillance systems and develop from it to show the evolution of surveillance systems. The different forms of data available, surveillance methods and surveillance types are also reviewed such as social media based, web-based and clinical data based PHS maps. This review provide comprehensive details of the surveillance systems in terms of data types used, source of data and purpose of the surveillance system.</p></div>","PeriodicalId":72670,"journal":{"name":"Computer methods and programs in biomedicine update","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S266699002400003X/pdfft?md5=c7174d19610aa51e76061c94d0b56e24&pid=1-s2.0-S266699002400003X-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139456573","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
ResfEANet: ResNet-fused External Attention Network for Tuberculosis Diagnosis using Chest X-ray Images ResfEANet:利用胸部 X 光图像诊断结核病的 ResNet 融合外部注意力网络
Pub Date : 2024-01-01 DOI: 10.1016/j.cmpbup.2023.100133
Chukwuebuka Joseph Ejiyi , Zhen Qin , Ann O Nnani , Fuhu Deng , Thomas Ugochukwu Ejiyi , Makuachukwu Bennedith Ejiyi , Victor Kwaku Agbesi , Olusola Bamisile

Pulmonary tuberculosis (TB), the most prevalent form of TB, remains a major global public health concern, contributing to more than a million deaths each year. The accurate and timely diagnosis of this disease is of paramount importance for effective control and treatment. Chest X-ray (CXR) images have emerged as a valuable tool for screening lung diseases, including TB, owing to their cost-effectiveness and non-invasiveness. Despite advancements in technology, the challenges associated with interpreting CXR images persist, primarily due to the scarcity of trained radiologists. This underscores the pressing need for an automated and cost-effective computer-aided system capable of diagnosing TB, assisting medical practitioners in distinguishing between TB-positive and negative CXR scans. In response to this need, we introduce an innovative approach called ResNet-fused External Attention Network (ResfEANet). This network excels in accurately classifying TB from CXR images, achieving remarkable levels of accuracy and sensitivity. ResfEANet is built upon ResNet and incorporates an External Attention mechanism, albeit with fewer residual network blocks than ResNet-50 resulting in a relatively shallow network with fewer layers. This approach proves highly effective in feature extraction and yields competitive results in the classification of TB. Our method was employed to train a model that demonstrated an impressive accuracy rate of 97.59% and a remarkable sensitivity of 100% in binary classification tasks with optimal computational cost. These outcomes suggest that our proposed approach has the potential to serve as a valuable secondary tool in clinical decision-making, providing crucial assistance to radiologists and healthcare professionals.

肺结核(TB)是结核病中最常见的一种,它仍然是全球公共卫生领域的一个重大问题,每年导致一百多万人死亡。准确及时地诊断这种疾病对有效控制和治疗至关重要。胸部 X 光(CXR)图像因其成本效益高且无创,已成为筛查肺部疾病(包括结核病)的重要工具。尽管技术不断进步,但与解读 CXR 图像相关的挑战依然存在,这主要是由于缺乏训练有素的放射科医生。因此,我们迫切需要一种能够诊断肺结核、协助医疗从业人员区分肺结核阳性和阴性 CXR 扫描图像的自动化、经济高效的计算机辅助系统。为了满足这一需求,我们引入了一种创新方法,称为 "ResNet-fused External Attention Network"(ResfEANet)。该网络能从 CXR 图像中准确地对结核病进行分类,准确性和灵敏度都达到了很高的水平。ResfEANet 建立在 ResNet 的基础上,并结合了外部注意机制,但与 ResNet-50 相比,ResfEANet 的残余网络块更少,因此网络层次相对较浅。事实证明,这种方法在特征提取方面非常有效,并在肺结核分类方面取得了有竞争力的结果。我们采用这种方法训练的模型在二元分类任务中的准确率达到了令人印象深刻的 97.59%,灵敏度达到了显著的 100%,而且计算成本最优。这些结果表明,我们提出的方法有可能成为临床决策的重要辅助工具,为放射科医生和医疗保健专业人员提供重要帮助。
{"title":"ResfEANet: ResNet-fused External Attention Network for Tuberculosis Diagnosis using Chest X-ray Images","authors":"Chukwuebuka Joseph Ejiyi ,&nbsp;Zhen Qin ,&nbsp;Ann O Nnani ,&nbsp;Fuhu Deng ,&nbsp;Thomas Ugochukwu Ejiyi ,&nbsp;Makuachukwu Bennedith Ejiyi ,&nbsp;Victor Kwaku Agbesi ,&nbsp;Olusola Bamisile","doi":"10.1016/j.cmpbup.2023.100133","DOIUrl":"10.1016/j.cmpbup.2023.100133","url":null,"abstract":"<div><p>Pulmonary tuberculosis (TB), the most prevalent form of TB, remains a major global public health concern, contributing to more than a million deaths each year. The accurate and timely diagnosis of this disease is of paramount importance for effective control and treatment. Chest X-ray (CXR) images have emerged as a valuable tool for screening lung diseases, including TB, owing to their cost-effectiveness and non-invasiveness. Despite advancements in technology, the challenges associated with interpreting CXR images persist, primarily due to the scarcity of trained radiologists. This underscores the pressing need for an automated and cost-effective computer-aided system capable of diagnosing TB, assisting medical practitioners in distinguishing between TB-positive and negative CXR scans. In response to this need, we introduce an innovative approach called ResNet-fused External Attention Network (ResfEANet). This network excels in accurately classifying TB from CXR images, achieving remarkable levels of accuracy and sensitivity. ResfEANet is built upon ResNet and incorporates an External Attention mechanism, albeit with fewer residual network blocks than ResNet-50 resulting in a relatively shallow network with fewer layers. This approach proves highly effective in feature extraction and yields competitive results in the classification of TB. Our method was employed to train a model that demonstrated an impressive accuracy rate of 97.59% and a remarkable sensitivity of 100% in binary classification tasks with optimal computational cost. These outcomes suggest that our proposed approach has the potential to serve as a valuable secondary tool in clinical decision-making, providing crucial assistance to radiologists and healthcare professionals.</p></div>","PeriodicalId":72670,"journal":{"name":"Computer methods and programs in biomedicine update","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2666990023000411/pdfft?md5=81aecaa858595c69800e5427f5591e96&pid=1-s2.0-S2666990023000411-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139193821","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Mathematical modeling of the effects of vector control, treatment and mass awareness on the transmission dynamics of dengue fever 病媒控制、治疗和大众宣传对登革热传播动态影响的数学建模
Pub Date : 2024-01-01 DOI: 10.1016/j.cmpbup.2024.100159
Boniface Zacharia Naaly , Theresia Marijani , Augustino Isdory , Jufren Zakayo Ndendya

Dengue fever is a vital public health concern that affects about 40% of the world’s population. To address the dynamics of dengue disease, a mathematical model was formulated by incorporating three control strategies: vector control, treatment, and mass awareness. A stability analysis of the disease-free equilibrium (DFE) was conducted using the Jacobian matrix. The DFE was found to be locally and globally asymptotically stable when the effective reproductive number was less than one; otherwise, it was unstable. Additionally, an endemic equilibrium point (EEP) was identified. The global stability analysis of the EEP, performed using the Lyapunov method, showed that it is globally asymptotically stable whenever Re>1; otherwise, it is unstable. Bifurcation analysis revealed that the model system exhibits a forward bifurcation. Furthermore, sensitivity analysis of the effective reproduction number revealed that the most sensitive parameters are the biting rate (b) and insecticide efficacy (δ). Therefore, the results suggest that, in order to reduce new dengue cases, intervention strategies that decrease the biting rate, such as mosquito repellents and the use of insecticides to kill mosquitoes, should be implemented. Moreover, simulations were conducted for the extended model with vector control, treatment, and mass awareness. The results showed that the combination of vector control, treatment, and mass awareness has a more positive impact on the control of dengue fever than any single or paired intervention. Thus, for effective control of dengue fever, the three control measures should be implemented simultaneously, especially in endemic areas.

登革热是一个重要的公共卫生问题,影响着全球约 40% 的人口。针对登革热病的动态变化,我们建立了一个数学模型,其中包含三种控制策略:病媒控制、治疗和大众宣传。利用雅各布矩阵对无疾病平衡(DFE)进行了稳定性分析。结果发现,当有效繁殖数小于 1 时,无病平衡点在局部和全局上都趋于稳定;反之,则不稳定。此外,还确定了一个地方性平衡点(EEP)。利用 Lyapunov 方法对 EEP 进行的全局稳定性分析表明,当 Re>1 时,它是全局渐近稳定的;否则,它是不稳定的。分岔分析表明,模型系统呈现正向分岔。此外,对有效繁殖数量的敏感性分析表明,最敏感的参数是叮咬率(b)和杀虫剂效力(δ)。因此,结果表明,为了减少新的登革热病例,应实施降低叮咬率的干预策略,如驱蚊剂和使用杀虫剂杀灭蚊子。此外,还对包含病媒控制、治疗和大众宣传的扩展模型进行了模拟。结果表明,病媒控制、治疗和大众宣传相结合,比任何单一或成对的干预措施对登革热的控制都有更积极的影响。因此,为有效控制登革热,应同时实施这三种控制措施,特别是在登革热流行地区。
{"title":"Mathematical modeling of the effects of vector control, treatment and mass awareness on the transmission dynamics of dengue fever","authors":"Boniface Zacharia Naaly ,&nbsp;Theresia Marijani ,&nbsp;Augustino Isdory ,&nbsp;Jufren Zakayo Ndendya","doi":"10.1016/j.cmpbup.2024.100159","DOIUrl":"https://doi.org/10.1016/j.cmpbup.2024.100159","url":null,"abstract":"<div><p>Dengue fever is a vital public health concern that affects about 40% of the world’s population. To address the dynamics of dengue disease, a mathematical model was formulated by incorporating three control strategies: vector control, treatment, and mass awareness. A stability analysis of the disease-free equilibrium (DFE) was conducted using the Jacobian matrix. The DFE was found to be locally and globally asymptotically stable when the effective reproductive number was less than one; otherwise, it was unstable. Additionally, an endemic equilibrium point (EEP) was identified. The global stability analysis of the EEP, performed using the Lyapunov method, showed that it is globally asymptotically stable whenever <span><math><mrow><msub><mrow><mi>R</mi></mrow><mrow><mi>e</mi></mrow></msub><mo>&gt;</mo><mn>1</mn></mrow></math></span>; otherwise, it is unstable. Bifurcation analysis revealed that the model system exhibits a forward bifurcation. Furthermore, sensitivity analysis of the effective reproduction number revealed that the most sensitive parameters are the biting rate (<span><math><mi>b</mi></math></span>) and insecticide efficacy (<span><math><mi>δ</mi></math></span>). Therefore, the results suggest that, in order to reduce new dengue cases, intervention strategies that decrease the biting rate, such as mosquito repellents and the use of insecticides to kill mosquitoes, should be implemented. Moreover, simulations were conducted for the extended model with vector control, treatment, and mass awareness. The results showed that the combination of vector control, treatment, and mass awareness has a more positive impact on the control of dengue fever than any single or paired intervention. Thus, for effective control of dengue fever, the three control measures should be implemented simultaneously, especially in endemic areas.</p></div>","PeriodicalId":72670,"journal":{"name":"Computer methods and programs in biomedicine update","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2666990024000260/pdfft?md5=58dbf14090021c1ebf275bc2a8944acb&pid=1-s2.0-S2666990024000260-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141480951","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Machine learning approaches for predicting frailty base on multimorbidities in US adults using NHANES data (1999–2018) 利用 NHANES 数据(1999-2018 年)的机器学习方法预测美国成人多病虚弱情况
Pub Date : 2024-01-01 DOI: 10.1016/j.cmpbup.2024.100164

Background

The global increase in an aging population has led to more common age-related health challenges, particularly multimorbidity and frailty, but there is a significant gap.

Methods

This cross-sectional study utilized data from the National Health and Nutrition Examination Survey (1999–2018). The association between age and frailty was assessed using a restricted cubic spline (RCS) model, while weighted adjusted multivariable logistic regression evaluated the effect of diseases to frailty. And in machine learning process, feature selection for the frailty prediction model involved three algorithms. The model's performance was optimized using nested cross-validation and tested with various algorithms including decision tree, Logistic Regression, k-Nearest Neighbor, Random Forest, Recursive Partitioning and Regression Trees, and eXtreme Gradient Boosting (XGBoost). We used areas under the receiver operating characteristic curve (AUC) and area under the precision-recall curve (AU-PRC) to evaluate six algorithms, select the optimal model, and test the discrimination and consistency of the optimal model.

Results

The study included 46,187 participants, with 6,009 cases of frailty. RCS analysis showed a non-linear association between age and frailty, with a turning point at 49 years. Key impacting variables identified are Anemia, Arthritis, Diabetes Mellitus, Coronary Heart Disease, and Hypertension. In the machine learning process, we selected the optimal data set by feature selection, including 13 variables. Through nested cross-validation, a total of 31,900 models were built using 6 algorithms. And the XGBoost model showed the highest performance (AUC = 0.8828 and AU-PRC = 0.624), and clear proficiency in both discrimination and calibration.

Conclusions

We found 49 years maintain the balance of physiological reserve and external aggression. In addition, chronic diseases are trigger factor of frailty, while acute diseases are contributing factor that exacerbates the body's rapid decline. Last, the XGBoost frailty prediction model, with its simplicity, high performance and high clinical value holds potential for clinical application.

背景全球老龄化人口的增加导致了更常见的与年龄相关的健康挑战,尤其是多病症和虚弱,但目前还存在很大差距。使用受限立方样条(RCS)模型评估了年龄与虚弱之间的关联,而加权调整多变量逻辑回归评估了疾病对虚弱的影响。在机器学习过程中,虚弱预测模型的特征选择涉及三种算法。我们使用嵌套交叉验证对模型的性能进行了优化,并使用多种算法进行了测试,包括决策树、逻辑回归、k-近邻、随机森林、递归分区和回归树以及极梯度提升(XGBoost)。我们使用接收者操作特征曲线下面积(AUC)和精确度-召回曲线下面积(AU-PRC)对六种算法进行了评估,选出了最优模型,并测试了最优模型的区分度和一致性。RCS 分析表明,年龄与虚弱之间存在非线性关系,49 岁时出现转折点。主要影响变量包括贫血、关节炎、糖尿病、冠心病和高血压。在机器学习过程中,我们通过特征选择选出了最佳数据集,其中包括 13 个变量。通过嵌套交叉验证,共使用 6 种算法建立了 31900 个模型。而 XGBoost 模型表现出了最高的性能(AUC = 0.8828 和 AU-PRC = 0.624),并且在判别和校准方面都有明显的优势。此外,慢性疾病是体弱的诱发因素,而急性疾病则是加剧身体快速衰退的诱因。最后,XGBoost 虚弱预测模型具有简单、高性能和高临床价值的特点,具有临床应用潜力。
{"title":"Machine learning approaches for predicting frailty base on multimorbidities in US adults using NHANES data (1999–2018)","authors":"","doi":"10.1016/j.cmpbup.2024.100164","DOIUrl":"10.1016/j.cmpbup.2024.100164","url":null,"abstract":"<div><h3>Background</h3><p>The global increase in an aging population has led to more common age-related health challenges, particularly multimorbidity and frailty, but there is a significant gap.</p></div><div><h3>Methods</h3><p>This cross-sectional study utilized data from the National Health and Nutrition Examination Survey (1999–2018). The association between age and frailty was assessed using a restricted cubic spline (RCS) model, while weighted adjusted multivariable logistic regression evaluated the effect of diseases to frailty. And in machine learning process, feature selection for the frailty prediction model involved three algorithms. The model's performance was optimized using nested cross-validation and tested with various algorithms including decision tree, Logistic Regression, k-Nearest Neighbor, Random Forest, Recursive Partitioning and Regression Trees, and eXtreme Gradient Boosting (XGBoost). We used areas under the receiver operating characteristic curve (AUC) and area under the precision-recall curve (AU-PRC) to evaluate six algorithms, select the optimal model, and test the discrimination and consistency of the optimal model.</p></div><div><h3>Results</h3><p>The study included 46,187 participants, with 6,009 cases of frailty. RCS analysis showed a non-linear association between age and frailty, with a turning point at 49 years. Key impacting variables identified are Anemia, Arthritis, Diabetes Mellitus, Coronary Heart Disease, and Hypertension. In the machine learning process, we selected the optimal data set by feature selection, including 13 variables. Through nested cross-validation, a total of 31,900 models were built using 6 algorithms. And the XGBoost model showed the highest performance (AUC = 0.8828 and AU-PRC = 0.624), and clear proficiency in both discrimination and calibration.</p></div><div><h3>Conclusions</h3><p>We found 49 years maintain the balance of physiological reserve and external aggression. In addition, chronic diseases are trigger factor of frailty, while acute diseases are contributing factor that exacerbates the body's rapid decline. Last, the XGBoost frailty prediction model, with its simplicity, high performance and high clinical value holds potential for clinical application.</p></div>","PeriodicalId":72670,"journal":{"name":"Computer methods and programs in biomedicine update","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2666990024000314/pdfft?md5=b2ac2f1faea71ce864789e43929be852&pid=1-s2.0-S2666990024000314-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142239341","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Machine learning from real data: A mental health registry case study 从真实数据中进行机器学习:心理健康登记案例研究
Pub Date : 2024-01-01 DOI: 10.1016/j.cmpbup.2023.100132
Elisabetta Gentili , Giorgia Franchini , Riccardo Zese , Marco Alberti , Maria Ferrara , Ilaria Domenicano , Luigi Grassi

Imbalanced datasets can impair the learning performance of many Machine Learning techniques. Nevertheless, many real-world datasets, especially in the healthcare field, are inherently imbalanced. For instance, in the medical domain, the classes representing a specific disease are typically the minority of the total cases. This challenge justifies the substantial research effort spent in the past decades to tackle data imbalance at the data and algorithm levels. In this paper, we describe the strategies we used to deal with an imbalanced classification task on data extracted from a database generated from the Electronic Health Records of the Mental Health Service of the Ferrara Province, Italy. In particular, we applied balancing techniques to the original data, such as random undersampling and oversampling, and Synthetic Minority Oversampling Technique for Nominal and Continuous (SMOTE-NC). In order to assess the effectiveness of the balancing techniques on the classification task at hand, we applied different Machine Learning algorithms. We employed cost-sensitive learning as well and compared its results with those of the balancing methods. Furthermore, a feature selection analysis was conducted to investigate the relevance of each feature. Results show that balancing can help find the best setting to accomplish classification tasks. Since real-world imbalanced datasets are increasingly becoming the core of scientific research, further studies are needed to improve already existing techniques.

不平衡数据集会损害许多机器学习技术的学习性能。然而,现实世界中的许多数据集,尤其是医疗保健领域的数据集,本身就是不平衡的。例如,在医疗领域,代表特定疾病的类别通常只占病例总数的少数。因此,在过去几十年里,人们花费了大量精力在数据和算法层面解决数据不平衡问题。在本文中,我们介绍了在处理从意大利费拉拉省心理健康服务电子健康记录数据库中提取的数据的不平衡分类任务时所采用的策略。特别是,我们对原始数据采用了平衡技术,如随机欠采样和超采样,以及用于名义和连续的合成少数群体超采样技术(SMOTE-NC)。为了评估平衡技术对当前分类任务的有效性,我们采用了不同的机器学习算法。我们还采用了成本敏感学习,并将其结果与平衡方法的结果进行了比较。此外,我们还进行了特征选择分析,以研究每个特征的相关性。结果表明,平衡有助于找到完成分类任务的最佳设置。由于真实世界的不平衡数据集正日益成为科学研究的核心,因此需要进一步的研究来改进现有的技术。
{"title":"Machine learning from real data: A mental health registry case study","authors":"Elisabetta Gentili ,&nbsp;Giorgia Franchini ,&nbsp;Riccardo Zese ,&nbsp;Marco Alberti ,&nbsp;Maria Ferrara ,&nbsp;Ilaria Domenicano ,&nbsp;Luigi Grassi","doi":"10.1016/j.cmpbup.2023.100132","DOIUrl":"10.1016/j.cmpbup.2023.100132","url":null,"abstract":"<div><p>Imbalanced datasets can impair the learning performance of many Machine Learning techniques. Nevertheless, many real-world datasets, especially in the healthcare field, are inherently imbalanced. For instance, in the medical domain, the classes representing a specific disease are typically the minority of the total cases. This challenge justifies the substantial research effort spent in the past decades to tackle data imbalance at the data and algorithm levels. In this paper, we describe the strategies we used to deal with an imbalanced classification task on data extracted from a database generated from the Electronic Health Records of the Mental Health Service of the Ferrara Province, Italy. In particular, we applied balancing techniques to the original data, such as random undersampling and oversampling, and Synthetic Minority Oversampling Technique for Nominal and Continuous (SMOTE-NC). In order to assess the effectiveness of the balancing techniques on the classification task at hand, we applied different Machine Learning algorithms. We employed cost-sensitive learning as well and compared its results with those of the balancing methods. Furthermore, a feature selection analysis was conducted to investigate the relevance of each feature. Results show that balancing can help find the best setting to accomplish classification tasks. Since real-world imbalanced datasets are increasingly becoming the core of scientific research, further studies are needed to improve already existing techniques.</p></div>","PeriodicalId":72670,"journal":{"name":"Computer methods and programs in biomedicine update","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S266699002300040X/pdfft?md5=fca4123f44f99c83994cc13701771f05&pid=1-s2.0-S266699002300040X-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139190322","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Computer methods and programs in biomedicine update
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1