首页 > 最新文献

Artificial Intelligence in Medicine最新文献

英文 中文
Machine learning applications in preventive healthcare: A systematic literature review on predictive analytics of disease comorbidity from multiple perspectives 预防保健中的机器学习应用:从多个角度对疾病合并症预测分析进行系统性文献综述
IF 6.1 2区 医学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-08-14 DOI: 10.1016/j.artmed.2024.102950
Duo Xu , Zeshui Xu

Artificial intelligence is constantly revolutionizing biomedical research and healthcare management. Disease comorbidity is a major threat to the quality of life for susceptible groups, especially middle-aged and elderly patients. The presence of multiple chronic diseases makes precision diagnosis challenging to realize and imposes a heavy burden on the healthcare system and economy. Given an enormous amount of accumulated health data, machine learning techniques show their capability in handling this puzzle. The present study conducts a review to uncover current research efforts in applying these methods to understanding comorbidity mechanisms and making clinical predictions considering these complex patterns. A descriptive metadata analysis of 791 unique publications aims to capture the overall research progression between January 2012 and June 2023. To delve into comorbidity-focused research, 61 of these scientific papers are systematically assessed. Four predictive analytics of tasks are detected: disease comorbidity data extraction, clustering, network, and risk prediction. It is observed that some machine learning-driven applications address inherent data deficiencies in healthcare datasets and provide a model interpretation that identifies significant risk factors of comorbidity development. Based on insights, both technical and practical, gained from relevant literature, this study intends to guide future interests in comorbidity research and draw conclusions about chronic disease prevention and diagnosis with managerial implications.

人工智能不断给生物医学研究和医疗保健管理带来变革。对于易感人群,尤其是中老年患者来说,疾病合并症是生活质量的一大威胁。多种慢性疾病的存在使精准诊断的实现面临挑战,并给医疗系统和经济带来沉重负担。鉴于积累了大量的健康数据,机器学习技术在处理这一难题方面显示出了自己的能力。本研究通过综述,揭示了当前应用这些方法来理解合并症机制并根据这些复杂模式进行临床预测的研究工作。本研究对 791 篇独特出版物进行了描述性元数据分析,旨在捕捉 2012 年 1 月至 2023 年 6 月期间的整体研究进展。为了深入研究以合并症为重点的研究,我们对其中的 61 篇科学论文进行了系统评估。发现了四种预测分析任务:疾病合并症数据提取、聚类、网络和风险预测。据观察,一些机器学习驱动的应用解决了医疗保健数据集中固有的数据缺陷,并提供了一种模型解释,可识别合并症发展的重要风险因素。基于从相关文献中获得的技术和实践见解,本研究旨在指导未来的合并症研究兴趣,并得出具有管理意义的慢性病预防和诊断结论。
{"title":"Machine learning applications in preventive healthcare: A systematic literature review on predictive analytics of disease comorbidity from multiple perspectives","authors":"Duo Xu ,&nbsp;Zeshui Xu","doi":"10.1016/j.artmed.2024.102950","DOIUrl":"10.1016/j.artmed.2024.102950","url":null,"abstract":"<div><p>Artificial intelligence is constantly revolutionizing biomedical research and healthcare management. Disease comorbidity is a major threat to the quality of life for susceptible groups, especially middle-aged and elderly patients. The presence of multiple chronic diseases makes precision diagnosis challenging to realize and imposes a heavy burden on the healthcare system and economy. Given an enormous amount of accumulated health data, machine learning techniques show their capability in handling this puzzle. The present study conducts a review to uncover current research efforts in applying these methods to understanding comorbidity mechanisms and making clinical predictions considering these complex patterns. A descriptive metadata analysis of 791 unique publications aims to capture the overall research progression between January 2012 and June 2023. To delve into comorbidity-focused research, 61 of these scientific papers are systematically assessed. Four predictive analytics of tasks are detected: disease comorbidity data extraction, clustering, network, and risk prediction. It is observed that some machine learning-driven applications address inherent data deficiencies in healthcare datasets and provide a model interpretation that identifies significant risk factors of comorbidity development. Based on insights, both technical and practical, gained from relevant literature, this study intends to guide future interests in comorbidity research and draw conclusions about chronic disease prevention and diagnosis with managerial implications.</p></div>","PeriodicalId":55458,"journal":{"name":"Artificial Intelligence in Medicine","volume":"156 ","pages":"Article 102950"},"PeriodicalIF":6.1,"publicationDate":"2024-08-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142006363","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Enhancing metagenomic classification with compression-based features 利用基于压缩的特征增强元基因组分类。
IF 6.1 2区 医学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-08-14 DOI: 10.1016/j.artmed.2024.102948
Jorge Miguel Silva, João Rafael Almeida

Metagenomics is a rapidly expanding field that uses next-generation sequencing technology to analyze the genetic makeup of environmental samples. However, accurately identifying the organisms in a metagenomic sample can be complex, and traditional reference-based methods may need to be more effective in some instances.

In this study, we present a novel approach for metagenomic identification, using data compressors as a feature for taxonomic classification. By evaluating a comprehensive set of compressors, including both general-purpose and genomic-specific, we demonstrate the effectiveness of this method in accurately identifying organisms in metagenomic samples. The results indicate that using features from multiple compressors can help identify taxonomy. An overall accuracy of 95% was achieved using this method using an imbalanced dataset with classes with limited samples. The study also showed that the correlation between compression and classification is insignificant, highlighting the need for a multi-faceted approach to metagenomic identification.

This approach offers a significant advancement in the field of metagenomics, providing a reference-less method for taxonomic identification that is both effective and efficient while revealing insights into the statistical and algorithmic nature of genomic data. The code to validate this study is publicly available at https://github.com/ieeta-pt/xgTaxonomy.

元基因组学是一个迅速发展的领域,它利用下一代测序技术分析环境样本的基因构成。然而,准确识别元基因组样本中的生物可能很复杂,在某些情况下,传统的基于参考的方法可能需要更加有效。在本研究中,我们提出了一种新的元基因组鉴定方法,利用数据压缩器作为分类学分类的特征。通过评估一整套压缩器(包括通用压缩器和基因组专用压缩器),我们证明了这种方法在准确鉴定元基因组样本中的生物体方面的有效性。结果表明,使用来自多个压缩器的特征有助于识别生物分类。使用这种方法,在样本有限的不平衡类数据集上,总体准确率达到 95%。研究还表明,压缩与分类之间的相关性并不显著,这突出表明需要一种多方面的元基因组识别方法。这种方法是元基因组学领域的一大进步,它提供了一种无需参考的分类鉴定方法,既有效又高效,同时还揭示了基因组数据的统计和算法本质。验证这项研究的代码可在 https://github.com/ieeta-pt/xgTaxonomy 上公开获取。
{"title":"Enhancing metagenomic classification with compression-based features","authors":"Jorge Miguel Silva,&nbsp;João Rafael Almeida","doi":"10.1016/j.artmed.2024.102948","DOIUrl":"10.1016/j.artmed.2024.102948","url":null,"abstract":"<div><p>Metagenomics is a rapidly expanding field that uses next-generation sequencing technology to analyze the genetic makeup of environmental samples. However, accurately identifying the organisms in a metagenomic sample can be complex, and traditional reference-based methods may need to be more effective in some instances.</p><p>In this study, we present a novel approach for metagenomic identification, using data compressors as a feature for taxonomic classification. By evaluating a comprehensive set of compressors, including both general-purpose and genomic-specific, we demonstrate the effectiveness of this method in accurately identifying organisms in metagenomic samples. The results indicate that using features from multiple compressors can help identify taxonomy. An overall accuracy of 95% was achieved using this method using an imbalanced dataset with classes with limited samples. The study also showed that the correlation between compression and classification is insignificant, highlighting the need for a multi-faceted approach to metagenomic identification.</p><p>This approach offers a significant advancement in the field of metagenomics, providing a reference-less method for taxonomic identification that is both effective and efficient while revealing insights into the statistical and algorithmic nature of genomic data. The code to validate this study is publicly available at <span><span>https://github.com/ieeta-pt/xgTaxonomy</span><svg><path></path></svg></span>.</p></div>","PeriodicalId":55458,"journal":{"name":"Artificial Intelligence in Medicine","volume":"156 ","pages":"Article 102948"},"PeriodicalIF":6.1,"publicationDate":"2024-08-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S0933365724001908/pdfft?md5=303ea4cf8a281f1bfe4eda4a7757ef46&pid=1-s2.0-S0933365724001908-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142037788","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Walking representation and simulation based on multi-source image fusion and multi-agent reinforcement learning for gait rehabilitation 基于多源图像融合和多代理强化学习的步态康复行走表示与模拟
IF 6.1 2区 医学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-08-13 DOI: 10.1016/j.artmed.2024.102945
Yean Zhu , Meirong Xiao , Dan Robbins , Xiaoying Wu , Wei Lu , Wensheng Hou

In the formulation of strategies for walking rehabilitation, achieving precise identification of the current state and making rational predictions about the future state are crucial but often unrealized. To tackle this challenge, our study introduces a unified framework that integrates a novel 3D walking motion capture method using multi-source image fusion and a walking rehabilitation simulation approach based on multi-agent reinforcement learning. We found that, (i) the proposal achieved an accurate 3D walking motion capture and outperforms other advanced methods. Experimental evidence indicates that, compared to similar visual skeleton tracking methods, the proposed approach yields results with higher Pearson correlation (r=0.93), intra-class correlation coefficient (ICC(2,1)=0.91), and narrower confidence intervals ([0.90,0.95] for r, [0.88,0.94] for ICC(2,1)) when compared to standard results. The outcomes of the proposed approach also exhibit commendable correlation and concurrence with those obtained through the IMU-based skeleton tracking method in the assessment of gait parameters ([0.85,0.89] for r, [0.75,0.81] for ICC(2,1)); (ii) multi-agent reinforcement learning has the potential to be used to solve the simulation task of gait rehabilitation. In mimicry experiment, our proposed simulation method for gait rehabilitation not only enables the intelligent agent to converge from the initial state to the target state, but also observes evolutionary patterns similar to those observed in clinical practice through motor state resolution. This study offers valuable contributions to walking rehabilitation, enabling precise assessment and simulation-based interventions, with potential implications for clinical practice and patient outcomes.

在制定行走康复策略时,实现对当前状态的精确识别和对未来状态的合理预测至关重要,但往往无法实现。为了应对这一挑战,我们的研究引入了一个统一的框架,将使用多源图像融合的新型三维行走运动捕捉方法和基于多代理强化学习的行走康复模拟方法整合在一起。我们发现:(i) 该提案实现了精确的三维行走动作捕捉,并优于其他先进方法。实验证据表明,与类似的视觉骨骼跟踪方法相比,建议的方法产生的结果具有更高的皮尔逊相关性(r=0.93)、类内相关系数(ICC(2,1)=0.91),以及更窄的置信区间(r 为 [0.90,0.95],ICC(2,1) 为 [0.88,0.94])。在步态参数评估方面,拟议方法的结果与基于 IMU 的骨架跟踪方法所获得的结果(r 为 [0.85,0.89],ICC(2,1) 为 [0.75,0.81])也表现出值得称道的相关性和一致性;(ii) 多代理强化学习有潜力用于解决步态康复的模拟任务。在模仿实验中,我们提出的步态康复模拟方法不仅能使智能代理从初始状态收敛到目标状态,还能通过运动状态解析观察到与临床实践中类似的进化模式。这项研究为步行康复做出了宝贵贡献,实现了精确评估和基于模拟的干预,对临床实践和患者预后具有潜在影响。
{"title":"Walking representation and simulation based on multi-source image fusion and multi-agent reinforcement learning for gait rehabilitation","authors":"Yean Zhu ,&nbsp;Meirong Xiao ,&nbsp;Dan Robbins ,&nbsp;Xiaoying Wu ,&nbsp;Wei Lu ,&nbsp;Wensheng Hou","doi":"10.1016/j.artmed.2024.102945","DOIUrl":"10.1016/j.artmed.2024.102945","url":null,"abstract":"<div><p>In the formulation of strategies for walking rehabilitation, achieving precise identification of the current state and making rational predictions about the future state are crucial but often unrealized. To tackle this challenge, our study introduces a unified framework that integrates a novel 3D walking motion capture method using multi-source image fusion and a walking rehabilitation simulation approach based on multi-agent reinforcement learning. We found that, (i) the proposal achieved an accurate 3D walking motion capture and outperforms other advanced methods. Experimental evidence indicates that, compared to similar visual skeleton tracking methods, the proposed approach yields results with higher Pearson correlation (<span><math><mrow><mi>r</mi><mo>=</mo><mn>0</mn><mo>.</mo><mn>93</mn></mrow></math></span>), intra-class correlation coefficient (<span><math><mrow><mi>I</mi><mi>C</mi><mi>C</mi><mrow><mo>(</mo><mn>2</mn><mo>,</mo><mn>1</mn><mo>)</mo></mrow><mo>=</mo><mn>0</mn><mo>.</mo><mn>91</mn></mrow></math></span>), and narrower confidence intervals (<span><math><mrow><mo>[</mo><mn>0</mn><mo>.</mo><mn>90</mn><mo>,</mo><mn>0</mn><mo>.</mo><mn>95</mn><mo>]</mo></mrow></math></span> for <span><math><mi>r</mi></math></span>, <span><math><mrow><mo>[</mo><mn>0</mn><mo>.</mo><mn>88</mn><mo>,</mo><mn>0</mn><mo>.</mo><mn>94</mn><mo>]</mo></mrow></math></span> for <span><math><mrow><mi>I</mi><mi>C</mi><mi>C</mi><mrow><mo>(</mo><mn>2</mn><mo>,</mo><mn>1</mn><mo>)</mo></mrow></mrow></math></span>) when compared to standard results. The outcomes of the proposed approach also exhibit commendable correlation and concurrence with those obtained through the IMU-based skeleton tracking method in the assessment of gait parameters (<span><math><mrow><mo>[</mo><mn>0</mn><mo>.</mo><mn>85</mn><mo>,</mo><mn>0</mn><mo>.</mo><mn>89</mn><mo>]</mo></mrow></math></span> for <span><math><mi>r</mi></math></span>, <span><math><mrow><mo>[</mo><mn>0</mn><mo>.</mo><mn>75</mn><mo>,</mo><mn>0</mn><mo>.</mo><mn>81</mn><mo>]</mo></mrow></math></span> for <span><math><mrow><mi>I</mi><mi>C</mi><mi>C</mi><mrow><mo>(</mo><mn>2</mn><mo>,</mo><mn>1</mn><mo>)</mo></mrow></mrow></math></span>); (ii) multi-agent reinforcement learning has the potential to be used to solve the simulation task of gait rehabilitation. In mimicry experiment, our proposed simulation method for gait rehabilitation not only enables the intelligent agent to converge from the initial state to the target state, but also observes evolutionary patterns similar to those observed in clinical practice through motor state resolution. This study offers valuable contributions to walking rehabilitation, enabling precise assessment and simulation-based interventions, with potential implications for clinical practice and patient outcomes.</p></div>","PeriodicalId":55458,"journal":{"name":"Artificial Intelligence in Medicine","volume":"156 ","pages":"Article 102945"},"PeriodicalIF":6.1,"publicationDate":"2024-08-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142040607","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Bayesian network analysis of risk classification strategies in the regulation of cellular products 细胞产品监管风险分类策略的贝叶斯网络分析
IF 6.1 2区 医学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-08-09 DOI: 10.1016/j.artmed.2024.102937
Guoshu Jia , Lixia Fu , Likun Wang , Dongning Yao , Yimin Cui

Cell therapy, a burgeoning therapeutic strategy, necessitates a scientific regulatory framework but faces challenges in risk-based regulation due to the lack of a global consensus on risk classification. This study applies Bayesian network analysis to compare and evaluate the risk classification strategies for cellular products proposed by the Food and Drug Administration (FDA), Ministry of Health, Labour and Welfare (MHLW), and World Health Organization (WHO), using real-world data to validate the models. The appropriateness of key risk factors is assessed within the three regulatory frameworks, along with their implications for clinical safety. The results indicate several directions for refining risk classification approaches. Additionally, a substudy focuses on a specific type of cell and gene therapy (CGT), chimeric antigen receptor (CAR) T cell therapy. It underscores the importance of considering CAR targets, tumor types, and costimulatory domains when assessing the safety risks of CAR T cell products. Overall, there is currently a lack of a regulatory framework based on real-world data for cellular products and a lack of risk-based classification review methods. This study aims to improve the regulatory system for cellular products, emphasizing risk-based classification. Furthermore, the study advocates for leveraging machine learning in regulatory science to enhance the assessment of cellular product safety, illustrating the role of Bayesian networks in aiding regulatory decision-making for the risk classification of cellular products.

细胞疗法作为一种新兴的治疗策略,需要一个科学的监管框架,但由于全球对风险分类缺乏共识,基于风险的监管面临挑战。本研究运用贝叶斯网络分析法比较和评估了美国食品药品管理局(FDA)、日本厚生劳动省(MHLW)和世界卫生组织(WHO)提出的细胞产品风险分类策略,并使用真实世界的数据对模型进行了验证。在三个监管框架内评估了关键风险因素的适当性及其对临床安全性的影响。研究结果为完善风险分类方法指明了几个方向。此外,一项子研究重点关注一种特殊类型的细胞和基因疗法(CGT),即嵌合抗原受体(CAR)T 细胞疗法。它强调了在评估 CAR T 细胞产品的安全风险时考虑 CAR 靶点、肿瘤类型和成本调控域的重要性。总体而言,目前缺乏基于真实世界数据的细胞产品监管框架,也缺乏基于风险的分类审查方法。本研究旨在改进细胞产品的监管体系,强调基于风险的分类。此外,本研究还提倡在监管科学中利用机器学习来加强对细胞产品安全性的评估,并说明了贝叶斯网络在细胞产品风险分类监管决策中的辅助作用。
{"title":"Bayesian network analysis of risk classification strategies in the regulation of cellular products","authors":"Guoshu Jia ,&nbsp;Lixia Fu ,&nbsp;Likun Wang ,&nbsp;Dongning Yao ,&nbsp;Yimin Cui","doi":"10.1016/j.artmed.2024.102937","DOIUrl":"10.1016/j.artmed.2024.102937","url":null,"abstract":"<div><p>Cell therapy, a burgeoning therapeutic strategy, necessitates a scientific regulatory framework but faces challenges in risk-based regulation due to the lack of a global consensus on risk classification. This study applies Bayesian network analysis to compare and evaluate the risk classification strategies for cellular products proposed by the Food and Drug Administration (FDA), Ministry of Health, Labour and Welfare (MHLW), and World Health Organization (WHO), using real-world data to validate the models. The appropriateness of key risk factors is assessed within the three regulatory frameworks, along with their implications for clinical safety. The results indicate several directions for refining risk classification approaches. Additionally, a substudy focuses on a specific type of cell and gene therapy (CGT), chimeric antigen receptor (CAR) T cell therapy. It underscores the importance of considering CAR targets, tumor types, and costimulatory domains when assessing the safety risks of CAR T cell products. Overall, there is currently a lack of a regulatory framework based on real-world data for cellular products and a lack of risk-based classification review methods. This study aims to improve the regulatory system for cellular products, emphasizing risk-based classification. Furthermore, the study advocates for leveraging machine learning in regulatory science to enhance the assessment of cellular product safety, illustrating the role of Bayesian networks in aiding regulatory decision-making for the risk classification of cellular products.</p></div>","PeriodicalId":55458,"journal":{"name":"Artificial Intelligence in Medicine","volume":"155 ","pages":"Article 102937"},"PeriodicalIF":6.1,"publicationDate":"2024-08-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S0933365724001799/pdfft?md5=a9ff234dd953a123436e59d1b68d25ee&pid=1-s2.0-S0933365724001799-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141964579","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Probing perfection: The relentless art of meddling for pulmonary airway segmentation from HRCT via a human-AI collaboration based active learning method 探索完美:通过基于人机协作的主动学习方法,从 HRCT 中获得肺气道分割的无情干预艺术
IF 6.1 2区 医学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-08-01 DOI: 10.1016/j.artmed.2024.102930
Shiyi Wang , Yang Nan , Sheng Zhang , Federico Felder , Xiaodan Xing , Yingying Fang , Javier Del Ser , Simon L.F. Walsh , Guang Yang

In the realm of pulmonary tracheal segmentation, the scarcity of annotated data stands as a prevalent pain point in most medical segmentation endeavors. Concurrently, most Deep Learning (DL) methodologies employed in this domain invariably grapple with other dual challenges: the inherent opacity of ‘black box’ models and the ongoing pursuit of performance enhancement. In response to these intertwined challenges, the core concept of our Human-Computer Interaction (HCI) based learning models (RS_UNet, LC_UNet, UUNet and WD_UNet) hinge on the versatile combination of diverse query strategies and an array of deep learning models. We train four HCI models based on the initial training dataset and sequentially repeat the following steps 1–4: (1) Query Strategy: Our proposed HCI models selects those samples which contribute the most additional representative information when labeled in each iteration of the query strategy (showing the names and sequence numbers of the samples to be annotated). Additionally, in this phase, the model selects the unlabeled samples with the greatest predictive disparity by calculating the Wasserstein Distance, Least Confidence, Entropy Sampling, and Random Sampling. (2) Central line correction: The selected samples in previous stage are then used for domain expert correction of the system-generated tracheal central lines in each training round. (3) Update training dataset: When domain experts are involved in each epoch of the DL model's training iterations, they update the training dataset with greater precision after each epoch, thereby enhancing the trustworthiness of the ‘black box’ DL model and improving the performance of models. (4) Model training: Proposed HCI model is trained using the updated training dataset and an enhanced version of existing UNet.

Experimental results validate the effectiveness of this Human-Computer Interaction-based approaches, demonstrating that our proposed WD-UNet, LC-UNet, UUNet, RS-UNet achieve comparable or even superior performance than the state-of-the-art DL models, such as WD-UNet with only 15 %–35 % of the training data, leading to substantial reductions (65 %–85 % reduction of annotation effort) in physician annotation time.

在肺气管分割领域,标注数据的稀缺是大多数医疗分割工作的普遍痛点。与此同时,该领域采用的大多数深度学习(DL)方法也总是在努力应对其他双重挑战:"黑盒 "模型固有的不透明性和对性能提升的不断追求。为了应对这些交织在一起的挑战,我们基于人机交互(HCI)的学习模型(RS_UNet、LC_UNet、UUNet 和 WD_UNet)的核心理念取决于多样化查询策略和一系列深度学习模型的多功能组合。我们基于初始训练数据集训练四个人机交互模型,并依次重复以下步骤 1-4:(1)查询策略:我们提出的人机交互模型在每次迭代查询策略(显示待注释样本的名称和序列号)时,都会选择那些在标注时贡献了最多额外代表性信息的样本。此外,在这一阶段,模型还通过计算瓦瑟斯坦距离(Wasserstein Distance)、最小置信度(Least Confidence)、熵取样(Entropy Sampling)和随机取样(Random Sampling)来选择预测差异最大的未标注样本。(2) 中心线校正:然后,在每一轮训练中使用前一阶段选定的样本对系统生成的气管中心线进行领域专家校正。(3) 更新训练数据集:当领域专家参与到 DL 模型的每一轮迭代训练中时,他们会在每一轮迭代训练后更精确地更新训练数据集,从而增强 "黑盒 "DL 模型的可信度,提高模型的性能。(4) 模型训练:实验结果验证了这种基于人机交互的方法的有效性,表明我们提出的 WD-UNet、LC-UNet、UUNet、RS-UNet 与最先进的 DL 模型(如 WD-UNet)相比,只需 15 %-35 % 的训练数据就能实现相当甚至更优的性能,从而大幅减少(注释工作量减少 65 %-85%)医生注释时间。
{"title":"Probing perfection: The relentless art of meddling for pulmonary airway segmentation from HRCT via a human-AI collaboration based active learning method","authors":"Shiyi Wang ,&nbsp;Yang Nan ,&nbsp;Sheng Zhang ,&nbsp;Federico Felder ,&nbsp;Xiaodan Xing ,&nbsp;Yingying Fang ,&nbsp;Javier Del Ser ,&nbsp;Simon L.F. Walsh ,&nbsp;Guang Yang","doi":"10.1016/j.artmed.2024.102930","DOIUrl":"10.1016/j.artmed.2024.102930","url":null,"abstract":"<div><p>In the realm of pulmonary tracheal segmentation, the scarcity of annotated data stands as a prevalent pain point in most medical segmentation endeavors. Concurrently, most Deep Learning (DL) methodologies employed in this domain invariably grapple with other dual challenges: the inherent opacity of ‘black box’ models and the ongoing pursuit of performance enhancement. In response to these intertwined challenges, the core concept of our Human-Computer Interaction (HCI) based learning models (RS_UNet, LC_UNet, UUNet and WD_UNet) hinge on the versatile combination of diverse query strategies and an array of deep learning models. We train four HCI models based on the initial training dataset and sequentially repeat the following steps 1–4: (1) Query Strategy: Our proposed HCI models selects those samples which contribute the most additional representative information when labeled in each iteration of the query strategy (showing the names and sequence numbers of the samples to be annotated). Additionally, in this phase, the model selects the unlabeled samples with the greatest predictive disparity by calculating the Wasserstein Distance, Least Confidence, Entropy Sampling, and Random Sampling. (2) Central line correction: The selected samples in previous stage are then used for domain expert correction of the system-generated tracheal central lines in each training round. (3) Update training dataset: When domain experts are involved in each epoch of the DL model's training iterations, they update the training dataset with greater precision after each epoch, thereby enhancing the trustworthiness of the ‘black box’ DL model and improving the performance of models. (4) Model training: Proposed HCI model is trained using the updated training dataset and an enhanced version of existing UNet.</p><p>Experimental results validate the effectiveness of this Human-Computer Interaction-based approaches, demonstrating that our proposed WD-UNet, LC-UNet, UUNet, RS-UNet achieve comparable or even superior performance than the state-of-the-art DL models, such as WD-UNet with only 15 %–35 % of the training data, leading to substantial reductions (65 %–85 % reduction of annotation effort) in physician annotation time.</p></div>","PeriodicalId":55458,"journal":{"name":"Artificial Intelligence in Medicine","volume":"154 ","pages":"Article 102930"},"PeriodicalIF":6.1,"publicationDate":"2024-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S0933365724001726/pdfft?md5=d0d63374bfedea7ba634b9ea22685c76&pid=1-s2.0-S0933365724001726-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141694038","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
MedExpQA: Multilingual benchmarking of Large Language Models for Medical Question Answering MedExpQA:用于医学问题解答的大型语言模型的多语言基准测试。
IF 6.1 2区 医学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-07-31 DOI: 10.1016/j.artmed.2024.102938
Iñigo Alonso, Maite Oronoz, Rodrigo Agerri

Large Language Models (LLMs) have the potential of facilitating the development of Artificial Intelligence technology to assist medical experts for interactive decision support. This potential has been illustrated by the state-of-the-art performance obtained by LLMs in Medical Question Answering, with striking results such as passing marks in licensing medical exams. However, while impressive, the required quality bar for medical applications remains far from being achieved. Currently, LLMs remain challenged by outdated knowledge and by their tendency to generate hallucinated content. Furthermore, most benchmarks to assess medical knowledge lack reference gold explanations which means that it is not possible to evaluate the reasoning of LLMs predictions. Finally, the situation is particularly grim if we consider benchmarking LLMs for languages other than English which remains, as far as we know, a totally neglected topic. In order to address these shortcomings, in this paper we present MedExpQA, the first multilingual benchmark based on medical exams to evaluate LLMs in Medical Question Answering. To the best of our knowledge, MedExpQA includes for the first time reference gold explanations, written by medical doctors, of the correct and incorrect options in the exams. Comprehensive multilingual experimentation using both the gold reference explanations and Retrieval Augmented Generation (RAG) approaches show that performance of LLMs, with best results around 75 accuracy for English, still has large room for improvement, especially for languages other than English, for which accuracy drops 10 points. Therefore, despite using state-of-the-art RAG methods, our results also demonstrate the difficulty of obtaining and integrating readily available medical knowledge that may positively impact results on downstream evaluations for Medical Question Answering. Data, code, and fine-tuned models will be made publicly available.1

大型语言模型(LLMs)具有促进人工智能技术发展的潜力,可协助医学专家进行交互式决策支持。大型语言模型在医学问题解答中取得的一流性能已经证明了这一潜力,并取得了令人瞩目的成绩,如在执业医师资格考试中取得及格分数。然而,尽管令人印象深刻,但医学应用所需的质量标准仍远未达到。目前,法学硕士仍然面临着知识过时和容易产生幻觉内容的挑战。此外,大多数评估医学知识的基准都缺乏参考金解释,这意味着无法评估法学硕士预测的推理能力。最后,如果我们考虑对英语以外的语言进行 LLMs 基准测试,情况将尤为严峻,据我们所知,英语仍然是一个完全被忽视的话题。为了解决这些不足,我们在本文中介绍了 MedExpQA,这是第一个基于医学考试的多语言基准,用于评估医学问题解答中的 LLM。据我们所知,MedExpQA 首次包含了由医生撰写的关于考试中正确和错误选项的金牌参考解释。使用黄金参考解释和检索增强生成(RAG)方法进行的多语言综合实验表明,LLMs 的性能在英语方面的最佳结果为 75% 左右的准确率,但仍有很大的改进空间,尤其是在英语以外的语言方面,准确率下降了 10 个百分点。因此,尽管使用了最先进的 RAG 方法,我们的结果也证明了获取和整合现成医学知识的难度,而这些知识可能会对医学问题解答的下游评估结果产生积极影响。数据、代码和微调模型将公开发布。
{"title":"MedExpQA: Multilingual benchmarking of Large Language Models for Medical Question Answering","authors":"Iñigo Alonso,&nbsp;Maite Oronoz,&nbsp;Rodrigo Agerri","doi":"10.1016/j.artmed.2024.102938","DOIUrl":"10.1016/j.artmed.2024.102938","url":null,"abstract":"<div><p>Large Language Models (LLMs) have the potential of facilitating the development of Artificial Intelligence technology to assist medical experts for interactive decision support. This potential has been illustrated by the state-of-the-art performance obtained by LLMs in Medical Question Answering, with striking results such as passing marks in licensing medical exams. However, while impressive, the required quality bar for medical applications remains far from being achieved. Currently, LLMs remain challenged by outdated knowledge and by their tendency to generate hallucinated content. Furthermore, most benchmarks to assess medical knowledge lack reference gold explanations which means that it is not possible to evaluate the reasoning of LLMs predictions. Finally, the situation is particularly grim if we consider benchmarking LLMs for languages other than English which remains, as far as we know, a totally neglected topic. In order to address these shortcomings, in this paper we present MedExpQA, the first multilingual benchmark based on medical exams to evaluate LLMs in Medical Question Answering. To the best of our knowledge, MedExpQA includes for the first time reference gold explanations, written by medical doctors, of the correct and incorrect options in the exams. Comprehensive multilingual experimentation using both the gold reference explanations and Retrieval Augmented Generation (RAG) approaches show that performance of LLMs, with best results around 75 accuracy for English, still has large room for improvement, especially for languages other than English, for which accuracy drops 10 points. Therefore, despite using state-of-the-art RAG methods, our results also demonstrate the difficulty of obtaining and integrating readily available medical knowledge that may positively impact results on downstream evaluations for Medical Question Answering. Data, code, and fine-tuned models will be made publicly available.<span><span><sup>1</sup></span></span></p></div>","PeriodicalId":55458,"journal":{"name":"Artificial Intelligence in Medicine","volume":"155 ","pages":"Article 102938"},"PeriodicalIF":6.1,"publicationDate":"2024-07-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S0933365724001805/pdfft?md5=4257af50106cf356598f8ea351cad6b8&pid=1-s2.0-S0933365724001805-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141914671","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Deep learning algorithms for melanoma detection using dermoscopic images: A systematic review and meta-analysis 利用皮肤镜成像检测黑色素瘤的深度学习算法:系统回顾和荟萃分析
IF 6.1 2区 医学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-07-25 DOI: 10.1016/j.artmed.2024.102934
Zichen Ye , Daqian Zhang , Yuankai Zhao , Mingyang Chen , Huike Wang , Samuel Seery , Yimin Qu , Peng Xue , Yu Jiang

Background

Melanoma is a serious risk to human health and early identification is vital for treatment success. Deep learning (DL) has the potential to detect cancer using imaging technologies and many studies provide evidence that DL algorithms can achieve high accuracy in melanoma diagnostics.

Objectives

To critically assess different DL performances in diagnosing melanoma using dermatoscopic images and discuss the relationship between dermatologists and DL.

Methods

Ovid-Medline, Embase, IEEE Xplore, and the Cochrane Library were systematically searched from inception until 7th December 2021. Studies that reported diagnostic DL model performances in detecting melanoma using dermatoscopic images were included if they had specific outcomes and histopathologic confirmation. Binary diagnostic accuracy data and contingency tables were extracted to analyze outcomes of interest, which included sensitivity (SEN), specificity (SPE), and area under the curve (AUC). Subgroup analyses were performed according to human-machine comparison and cooperation. The study was registered in PROSPERO, CRD42022367824.

Results

2309 records were initially retrieved, of which 37 studies met our inclusion criteria, and 27 provided sufficient data for meta-analytical synthesis. The pooled SEN was 82 % (range 77–86), SPE was 87 % (range 84–90), with an AUC of 0.92 (range 0.89–0.94). Human-machine comparison had pooled AUCs of 0.87 (0.84–0.90) and 0.83 (0.79–0.86) for DL and dermatologists, respectively. Pooled AUCs were 0.90 (0.87–0.93), 0.80 (0.76–0.83), and 0.88 (0.85–0.91) for DL, and junior and senior dermatologists, respectively. Analyses of human-machine cooperation were 0.88 (0.85–0.91) for DL, 0.76 (0.72–0.79) for unassisted, and 0.87 (0.84–0.90) for DL-assisted dermatologists.

Conclusions

Evidence suggests that DL algorithms are as accurate as senior dermatologists in melanoma diagnostics. Therefore, DL could be used to support dermatologists in diagnostic decision-making. Although, further high-quality, large-scale multicenter studies are required to address the specific challenges associated with medical AI-based diagnostics.

背景黑色素瘤严重危害人类健康,早期识别对治疗成功至关重要。深度学习(DL)具有利用成像技术检测癌症的潜力,许多研究提供证据表明,DL 算法可以在黑色素瘤诊断中达到很高的准确性.Objectives To critically assess different DL performances in diagnosing melanoma using dermatoscopic images and discuss the relationship between dermatologists and DL.MethodsOvid-Medline, Embase, IEEE Xplore, and the Cochrane Library were systematically searched from inception until 7th December 2021.方法系统地检索了从开始到 2021 年 12 月 7 日的研究。如果研究具有特定结果和组织病理学证实,则纳入报告了使用皮肤镜图像检测黑色素瘤的诊断 DL 模型性能的研究。提取二元诊断准确性数据和或然率表来分析相关结果,包括灵敏度(SEN)、特异性(SPE)和曲线下面积(AUC)。根据人机比较和合作情况进行了分组分析。该研究已在 PROSPERO 注册,CRD42022367824.Results最初检索到 2309 条记录,其中 37 项研究符合我们的纳入标准,27 项提供了足够的数据用于荟萃分析综合。汇总的 SEN 为 82%(范围为 77-86),SPE 为 87%(范围为 84-90),AUC 为 0.92(范围为 0.89-0.94)。人机比较中,DL 和皮肤科医生的集合 AUC 分别为 0.87(0.84-0.90)和 0.83(0.79-0.86)。DL以及初级和高级皮肤科医生的集合AUC分别为0.90(0.87-0.93)、0.80(0.76-0.83)和0.88(0.85-0.91)。结论有证据表明,DL 算法在黑色素瘤诊断方面与资深皮肤科医生一样准确。因此,DL 可用于辅助皮肤科医生做出诊断决策。不过,还需要进一步开展高质量、大规模的多中心研究,以应对与基于医疗人工智能的诊断相关的具体挑战。
{"title":"Deep learning algorithms for melanoma detection using dermoscopic images: A systematic review and meta-analysis","authors":"Zichen Ye ,&nbsp;Daqian Zhang ,&nbsp;Yuankai Zhao ,&nbsp;Mingyang Chen ,&nbsp;Huike Wang ,&nbsp;Samuel Seery ,&nbsp;Yimin Qu ,&nbsp;Peng Xue ,&nbsp;Yu Jiang","doi":"10.1016/j.artmed.2024.102934","DOIUrl":"10.1016/j.artmed.2024.102934","url":null,"abstract":"<div><h3>Background</h3><p>Melanoma is a serious risk to human health and early identification is vital for treatment success. Deep learning (DL) has the potential to detect cancer using imaging technologies and many studies provide evidence that DL algorithms can achieve high accuracy in melanoma diagnostics.</p></div><div><h3>Objectives</h3><p>To critically assess different DL performances in diagnosing melanoma using dermatoscopic images and discuss the relationship between dermatologists and DL.</p></div><div><h3>Methods</h3><p>Ovid-Medline, Embase, IEEE Xplore, and the Cochrane Library were systematically searched from inception until 7th December 2021. Studies that reported diagnostic DL model performances in detecting melanoma using dermatoscopic images were included if they had specific outcomes and histopathologic confirmation. Binary diagnostic accuracy data and contingency tables were extracted to analyze outcomes of interest, which included sensitivity (SEN), specificity (SPE), and area under the curve (AUC). Subgroup analyses were performed according to human-machine comparison and cooperation. The study was registered in PROSPERO, CRD42022367824.</p></div><div><h3>Results</h3><p>2309 records were initially retrieved, of which 37 studies met our inclusion criteria, and 27 provided sufficient data for meta-analytical synthesis. The pooled SEN was 82 % (range 77–86), SPE was 87 % (range 84–90), with an AUC of 0.92 (range 0.89–0.94). Human-machine comparison had pooled AUCs of 0.87 (0.84–0.90) and 0.83 (0.79–0.86) for DL and dermatologists, respectively. Pooled AUCs were 0.90 (0.87–0.93), 0.80 (0.76–0.83), and 0.88 (0.85–0.91) for DL, and junior and senior dermatologists, respectively. Analyses of human-machine cooperation were 0.88 (0.85–0.91) for DL, 0.76 (0.72–0.79) for unassisted, and 0.87 (0.84–0.90) for DL-assisted dermatologists.</p></div><div><h3>Conclusions</h3><p>Evidence suggests that DL algorithms are as accurate as senior dermatologists in melanoma diagnostics. Therefore, DL could be used to support dermatologists in diagnostic decision-making. Although, further high-quality, large-scale multicenter studies are required to address the specific challenges associated with medical AI-based diagnostics.</p></div>","PeriodicalId":55458,"journal":{"name":"Artificial Intelligence in Medicine","volume":"155 ","pages":"Article 102934"},"PeriodicalIF":6.1,"publicationDate":"2024-07-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141842381","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Comprehensive review of deep learning in orthopaedics: Applications, challenges, trustworthiness, and fusion 深度学习在矫形外科中的综合评述:应用、挑战、可信度和融合
IF 6.1 2区 医学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-07-25 DOI: 10.1016/j.artmed.2024.102935
Laith Alzubaidi , Khamael AL-Dulaimi , Asma Salhi , Zaenab Alammar , Mohammed A. Fadhel , A.S. Albahri , A.H. Alamoodi , O.S. Albahri , Amjad F. Hasan , Jinshuai Bai , Luke Gilliland , Jing Peng , Marco Branni , Tristan Shuker , Kenneth Cutbush , Jose Santamaría , Catarina Moreira , Chun Ouyang , Ye Duan , Mohamed Manoufali , Yuantong Gu

Deep learning (DL) in orthopaedics has gained significant attention in recent years. Previous studies have shown that DL can be applied to a wide variety of orthopaedic tasks, including fracture detection, bone tumour diagnosis, implant recognition, and evaluation of osteoarthritis severity. The utilisation of DL is expected to increase, owing to its ability to present accurate diagnoses more efficiently than traditional methods in many scenarios. This reduces the time and cost of diagnosis for patients and orthopaedic surgeons. To our knowledge, no exclusive study has comprehensively reviewed all aspects of DL currently used in orthopaedic practice. This review addresses this knowledge gap using articles from Science Direct, Scopus, IEEE Xplore, and Web of Science between 2017 and 2023. The authors begin with the motivation for using DL in orthopaedics, including its ability to enhance diagnosis and treatment planning. The review then covers various applications of DL in orthopaedics, including fracture detection, detection of supraspinatus tears using MRI, osteoarthritis, prediction of types of arthroplasty implants, bone age assessment, and detection of joint-specific soft tissue disease. We also examine the challenges for implementing DL in orthopaedics, including the scarcity of data to train DL and the lack of interpretability, as well as possible solutions to these common pitfalls. Our work highlights the requirements to achieve trustworthiness in the outcomes generated by DL, including the need for accuracy, explainability, and fairness in the DL models. We pay particular attention to fusion techniques as one of the ways to increase trustworthiness, which have also been used to address the common multimodality in orthopaedics. Finally, we have reviewed the approval requirements set forth by the US Food and Drug Administration to enable the use of DL applications. As such, we aim to have this review function as a guide for researchers to develop a reliable DL application for orthopaedic tasks from scratch for use in the market.

近年来,骨科领域的深度学习(DL)获得了极大关注。以往的研究表明,深度学习可应用于各种骨科任务,包括骨折检测、骨肿瘤诊断、植入物识别和骨关节炎严重程度评估。在许多情况下,DL 能够比传统方法更有效地提供准确诊断,因此其使用率预计会越来越高。这为患者和矫形外科医生减少了诊断时间和成本。据我们所知,目前还没有一项独家研究全面回顾了骨科实践中使用的 DL 的各个方面。本综述利用 2017 年至 2023 年间来自 Science Direct、Scopus、IEEE Xplore 和 Web of Science 的文章,填补了这一知识空白。作者首先介绍了在骨科中使用 DL 的动机,包括其增强诊断和治疗规划的能力。然后,综述涵盖了 DL 在骨科中的各种应用,包括骨折检测、使用核磁共振成像检测冈上撕裂、骨关节炎、关节成形术植入物类型预测、骨龄评估以及关节特异性软组织疾病检测。我们还研究了在骨科领域实施 DL 所面临的挑战,包括用于训练 DL 的数据稀缺和缺乏可解释性,以及解决这些常见缺陷的可能方案。我们的工作强调了实现 DL 生成的结果可信性的要求,包括 DL 模型的准确性、可解释性和公平性。我们特别关注融合技术,将其作为提高可信度的方法之一,该技术也被用于解决骨科常见的多模态问题。最后,我们审查了美国食品和药物管理局为启用 DL 应用程序而规定的审批要求。因此,我们希望本综述能为研究人员提供指导,帮助他们从零开始开发出可靠的骨科任务 DL 应用程序,供市场使用。
{"title":"Comprehensive review of deep learning in orthopaedics: Applications, challenges, trustworthiness, and fusion","authors":"Laith Alzubaidi ,&nbsp;Khamael AL-Dulaimi ,&nbsp;Asma Salhi ,&nbsp;Zaenab Alammar ,&nbsp;Mohammed A. Fadhel ,&nbsp;A.S. Albahri ,&nbsp;A.H. Alamoodi ,&nbsp;O.S. Albahri ,&nbsp;Amjad F. Hasan ,&nbsp;Jinshuai Bai ,&nbsp;Luke Gilliland ,&nbsp;Jing Peng ,&nbsp;Marco Branni ,&nbsp;Tristan Shuker ,&nbsp;Kenneth Cutbush ,&nbsp;Jose Santamaría ,&nbsp;Catarina Moreira ,&nbsp;Chun Ouyang ,&nbsp;Ye Duan ,&nbsp;Mohamed Manoufali ,&nbsp;Yuantong Gu","doi":"10.1016/j.artmed.2024.102935","DOIUrl":"10.1016/j.artmed.2024.102935","url":null,"abstract":"<div><p>Deep learning (DL) in orthopaedics has gained significant attention in recent years. Previous studies have shown that DL can be applied to a wide variety of orthopaedic tasks, including fracture detection, bone tumour diagnosis, implant recognition, and evaluation of osteoarthritis severity. The utilisation of DL is expected to increase, owing to its ability to present accurate diagnoses more efficiently than traditional methods in many scenarios. This reduces the time and cost of diagnosis for patients and orthopaedic surgeons. To our knowledge, no exclusive study has comprehensively reviewed all aspects of DL currently used in orthopaedic practice. This review addresses this knowledge gap using articles from Science Direct, Scopus, IEEE Xplore, and Web of Science between 2017 and 2023. The authors begin with the motivation for using DL in orthopaedics, including its ability to enhance diagnosis and treatment planning. The review then covers various applications of DL in orthopaedics, including fracture detection, detection of supraspinatus tears using MRI, osteoarthritis, prediction of types of arthroplasty implants, bone age assessment, and detection of joint-specific soft tissue disease. We also examine the challenges for implementing DL in orthopaedics, including the scarcity of data to train DL and the lack of interpretability, as well as possible solutions to these common pitfalls. Our work highlights the requirements to achieve trustworthiness in the outcomes generated by DL, including the need for accuracy, explainability, and fairness in the DL models. We pay particular attention to fusion techniques as one of the ways to increase trustworthiness, which have also been used to address the common multimodality in orthopaedics. Finally, we have reviewed the approval requirements set forth by the US Food and Drug Administration to enable the use of DL applications. As such, we aim to have this review function as a guide for researchers to develop a reliable DL application for orthopaedic tasks from scratch for use in the market.</p></div>","PeriodicalId":55458,"journal":{"name":"Artificial Intelligence in Medicine","volume":"155 ","pages":"Article 102935"},"PeriodicalIF":6.1,"publicationDate":"2024-07-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S0933365724001775/pdfft?md5=fe495472b9c9c72428f1196589e2e990&pid=1-s2.0-S0933365724001775-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141839273","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Tackling heterogeneity in medical federated learning via aligning vision transformers 通过调整视觉转换器解决医学联合学习中的异质性问题
IF 6.1 2区 医学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-07-25 DOI: 10.1016/j.artmed.2024.102936
Erfan Darzi , Yiqing Shen , Yangming Ou , Nanna M. Sijtsema , P.M.A van Ooijen

Federated learning enables training models on distributed, privacy-sensitive medical imaging data. However, data heterogeneity across participating institutions leads to reduced model performance and fairness issues, especially for underrepresented datasets. To address these challenges, we propose leveraging the multi-head attention mechanism in Vision Transformers to align the representations of heterogeneous data across clients. By focusing on the attention mechanism as the alignment objective, our approach aims to improve both the accuracy and fairness of federated learning models in medical imaging applications. We evaluate our method on the IQ-OTH/NCCD Lung Cancer dataset, simulating various levels of data heterogeneity using Latent Dirichlet Allocation (LDA). Our results demonstrate that our approach achieves competitive performance compared to state-of-the-art federated learning methods across different heterogeneity levels and improves the performance of models for underrepresented clients, promoting fairness in the federated learning setting. These findings highlight the potential of leveraging the multi-head attention mechanism to address the challenges of data heterogeneity in medical federated learning.

联盟学习可以在分布式、隐私敏感的医学影像数据上训练模型。然而,参与机构间的数据异构会导致模型性能下降和公平性问题,尤其是对于代表性不足的数据集。为了应对这些挑战,我们建议利用 Vision Transformers 中的多头注意力机制来调整不同客户端的异构数据表示。通过将注意力机制作为对齐目标,我们的方法旨在提高医学成像应用中联合学习模型的准确性和公平性。我们在 IQ-OTH/NCCD 肺癌数据集上评估了我们的方法,使用 Latent Dirichlet Allocation (LDA) 模拟了不同程度的数据异质性。我们的结果表明,与最先进的联合学习方法相比,我们的方法在不同的异质性水平上都取得了具有竞争力的性能,并提高了代表性不足的客户模型的性能,促进了联合学习环境中的公平性。这些发现凸显了利用多头关注机制解决医疗联合学习中数据异质性挑战的潜力。
{"title":"Tackling heterogeneity in medical federated learning via aligning vision transformers","authors":"Erfan Darzi ,&nbsp;Yiqing Shen ,&nbsp;Yangming Ou ,&nbsp;Nanna M. Sijtsema ,&nbsp;P.M.A van Ooijen","doi":"10.1016/j.artmed.2024.102936","DOIUrl":"10.1016/j.artmed.2024.102936","url":null,"abstract":"<div><p>Federated learning enables training models on distributed, privacy-sensitive medical imaging data. However, data heterogeneity across participating institutions leads to reduced model performance and fairness issues, especially for underrepresented datasets. To address these challenges, we propose leveraging the multi-head attention mechanism in Vision Transformers to align the representations of heterogeneous data across clients. By focusing on the attention mechanism as the alignment objective, our approach aims to improve both the accuracy and fairness of federated learning models in medical imaging applications. We evaluate our method on the IQ-OTH/NCCD Lung Cancer dataset, simulating various levels of data heterogeneity using Latent Dirichlet Allocation (LDA). Our results demonstrate that our approach achieves competitive performance compared to state-of-the-art federated learning methods across different heterogeneity levels and improves the performance of models for underrepresented clients, promoting fairness in the federated learning setting. These findings highlight the potential of leveraging the multi-head attention mechanism to address the challenges of data heterogeneity in medical federated learning.</p></div>","PeriodicalId":55458,"journal":{"name":"Artificial Intelligence in Medicine","volume":"155 ","pages":"Article 102936"},"PeriodicalIF":6.1,"publicationDate":"2024-07-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141846044","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Human–AI interaction paradigm and its application to rhinocytology 人机交互范例及其在鼻细胞学中的应用
IF 6.1 2区 医学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-07-22 DOI: 10.1016/j.artmed.2024.102933
Giuseppe Desolda , Giovanni Dimauro , Andrea Esposito , Rosa Lanzilotti , Maristella Matera , Massimo Zancanaro

This article explores Human-Centered Artificial Intelligence (HCAI) in medical cytology, with a focus on enhancing the interaction with AI. It presents a Human–AI interaction paradigm that emphasizes explainability and user control of AI systems. It is an iterative negotiation process based on three interaction strategies aimed to (i) elaborate the system outcomes through iterative steps (Iterative Exploration), (ii) explain the AI system’s behavior or decisions (Clarification), and (iii) allow non-expert users to trigger simple retraining of the AI model (Reconfiguration). This interaction paradigm is exploited in the redesign of an existing AI-based tool for microscopic analysis of the nasal mucosa. The resulting tool is tested with rhinocytologists. The article discusses the analysis of the results of the conducted evaluation and outlines lessons learned that are relevant for AI in medicine.

本文探讨了医学细胞学中以人为中心的人工智能(HCAI),重点是加强与人工智能的交互。它提出了一种人机交互范式,强调人工智能系统的可解释性和用户控制。这是一个基于三种交互策略的迭代协商过程,旨在:(i) 通过迭代步骤阐述系统结果(迭代探索);(ii) 解释人工智能系统的行为或决策(澄清);(iii) 允许非专业用户触发对人工智能模型的简单再训练(重新配置)。在重新设计现有的基于人工智能的鼻粘膜显微分析工具时,就利用了这种交互范式。鼻腔细胞学专家对该工具进行了测试。文章讨论了对评估结果的分析,并概述了与医学人工智能相关的经验教训。
{"title":"A Human–AI interaction paradigm and its application to rhinocytology","authors":"Giuseppe Desolda ,&nbsp;Giovanni Dimauro ,&nbsp;Andrea Esposito ,&nbsp;Rosa Lanzilotti ,&nbsp;Maristella Matera ,&nbsp;Massimo Zancanaro","doi":"10.1016/j.artmed.2024.102933","DOIUrl":"10.1016/j.artmed.2024.102933","url":null,"abstract":"<div><p>This article explores Human-Centered Artificial Intelligence (HCAI) in medical cytology, with a focus on enhancing the interaction with AI. It presents a Human–AI interaction paradigm that emphasizes explainability and user control of AI systems. It is an iterative negotiation process based on three interaction strategies aimed to (i) elaborate the system outcomes through iterative steps (<em>Iterative Exploration</em>), (ii) explain the AI system’s behavior or decisions (<em>Clarification</em>), and (iii) allow non-expert users to trigger simple retraining of the AI model (<em>Reconfiguration</em>). This interaction paradigm is exploited in the redesign of an existing AI-based tool for microscopic analysis of the nasal mucosa. The resulting tool is tested with rhinocytologists. The article discusses the analysis of the results of the conducted evaluation and outlines lessons learned that are relevant for AI in medicine.</p></div>","PeriodicalId":55458,"journal":{"name":"Artificial Intelligence in Medicine","volume":"155 ","pages":"Article 102933"},"PeriodicalIF":6.1,"publicationDate":"2024-07-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S0933365724001751/pdfft?md5=b223655b36875d5261335f6e3519aed6&pid=1-s2.0-S0933365724001751-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141840791","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Artificial Intelligence in Medicine
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1