Artificial Intelligence in Medicine最新文献_第2页

Design and use of a Denoising Convolutional Autoencoder for reconstructing electrocardiogram signals at super resolution 超分辨率重构心电图信号降噪卷积自编码器的设计与使用。

IF 6.1 2区医学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Artificial Intelligence in Medicine

Pub Date : 2025-02-01 DOI: 10.1016/j.artmed.2024.103058

Ugo Lomoio , Pierangelo Veltri , Pietro Hiram Guzzi , Pietro Liò

Electrocardiogram signals play a pivotal role in cardiovascular diagnostics, providing essential information on electrical hearth activity. However, inherent noise and limited resolution can hinder an accurate interpretation of the recordings. In this paper an advanced Denoising Convolutional Autoencoder designed to process electrocardiogram signals, generating super-resolution reconstructions is proposed; this is followed by in-depth analysis of the enhanced signals. The autoencoder receives a signal window (of 5 s) sampled at 50 Hz (low resolution) as input and reconstructs a denoised super-resolution signal at 500 Hz. The proposed autoencoder is applied to publicly available datasets, demonstrating optimal performance in reconstructing high-resolution signals from very low-resolution inputs sampled at 50 Hz. The results were then compared with current state-of-the-art for electrocardiogram super-resolution, demonstrating the effectiveness of the proposed method. The method achieves a signal-to-noise ratio of 12.20 dB, a mean squared error of 0.0044, and a root mean squared error of 4.86%, which significantly outperforms current state-of-the-art alternatives. This framework can effectively enhance hidden information within signals, aiding in the detection of heart-related diseases.

心电图信号在心血管诊断中起着关键作用，提供了关于电床活动的基本信息。然而，固有的噪声和有限的分辨率会阻碍对记录的准确解释。本文提出了一种先进的去噪卷积自编码器，用于处理心电图信号，产生超分辨率重构；接下来是对增强信号的深入分析。自编码器接收50 Hz（低分辨率）采样的信号窗口（5秒）作为输入，并在500 Hz重建去噪的超分辨率信号。所提出的自编码器应用于公开可用的数据集，展示了从50 Hz采样的极低分辨率输入重建高分辨率信号的最佳性能。然后将结果与当前最先进的心电图超分辨率进行比较，证明了所提出方法的有效性。该方法的信噪比为12.20 dB，均方根误差为0.0044，均方根误差为4.86%，明显优于目前最先进的替代方法。这个框架可以有效地增强信号中的隐藏信息，帮助检测心脏相关疾病。

{"title":"Design and use of a Denoising Convolutional Autoencoder for reconstructing electrocardiogram signals at super resolution","authors":"Ugo Lomoio , Pierangelo Veltri , Pietro Hiram Guzzi , Pietro Liò","doi":"10.1016/j.artmed.2024.103058","DOIUrl":"10.1016/j.artmed.2024.103058","url":null,"abstract":"<div><div>Electrocardiogram signals play a pivotal role in cardiovascular diagnostics, providing essential information on electrical hearth activity. However, inherent noise and limited resolution can hinder an accurate interpretation of the recordings. In this paper an advanced Denoising Convolutional Autoencoder designed to process electrocardiogram signals, generating super-resolution reconstructions is proposed; this is followed by in-depth analysis of the enhanced signals. The autoencoder receives a signal window (of 5 s) sampled at 50 Hz (low resolution) as input and reconstructs a denoised super-resolution signal at 500 Hz. The proposed autoencoder is applied to publicly available datasets, demonstrating optimal performance in reconstructing high-resolution signals from very low-resolution inputs sampled at 50 Hz. The results were then compared with current state-of-the-art for electrocardiogram super-resolution, demonstrating the effectiveness of the proposed method. The method achieves a signal-to-noise ratio of 12.20 dB, a mean squared error of 0.0044, and a root mean squared error of 4.86%, which significantly outperforms current state-of-the-art alternatives. This framework can effectively enhance hidden information within signals, aiding in the detection of heart-related diseases.</div></div>","PeriodicalId":55458,"journal":{"name":"Artificial Intelligence in Medicine","volume":"160 ","pages":"Article 103058"},"PeriodicalIF":6.1,"publicationDate":"2025-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142915779","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

A systematic review on the roles of remote diagnosis in telemedicine system: Coherent taxonomy, insights, recommendations, and open research directions for intelligent healthcare solutions 远程诊断在远程医疗系统中的作用综述：智能医疗解决方案的一致分类、见解、建议和开放研究方向。

IF 6.1 2区医学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Artificial Intelligence in Medicine

Pub Date : 2025-02-01 DOI: 10.1016/j.artmed.2024.103057

Sura Saad Mohsin , Omar H. Salman , Abdulrahman Ahmed Jasim , Mohammed A. Al-Nouman , Ammar Riadh Kairaldeen

<div><h3>Background</h3><div>The term ‘remote diagnosis’ in telemedicine describes the procedure wherein medical practitioners diagnose patients remotely by using telecommunications technology. With this method, patients can obtain medical care without having to physically visit a hospital, which can be helpful for people who live in distant places or have restricted mobility. When people in the past had health issues, they were usually sent to the hospital, where they received clinical examinations, diagnoses, and treatment at the facility. Thus, hospitals were overcrowded because of the increase in the number of patients or in the death of some very ill patients given that the completion of medical operations required a significant amount of time.</div></div><div><h3>Objective</h3><div>This research aims to provide a literature review study and an in-depth analysis to (1) investigate the procedure and roles of remote diagnosis in telemedicine; (2) review the technical tools and technologies used in remote diagnosis; (3) review the diseases diagnosed remotely in telemedicine; (4) compose a crossover taxonomy among diseases, technologies, and telemedicine; (5) present lists of input variables, vital signs, data and output decisions already applied in remote diagnosis; (6) Summarize the performance assessment measures utilized to assess and validate remote diagnosis models; and (7) identify and categorize open research issues while providing recommendations for future advancements in intelligent remote diagnosis within telemedicine systems.</div></div><div><h3>Methods</h3><div>A systematic search was conducted using online libraries for articles published from 1 January 2016 to 13 September 2023 in IEEE, PubMed, Science Direct, Springer, and Web of Science. Notably, searches were limited to articles in the English language. The papers examine remote diagnosis in telemedicine, the technologies employed for this function, and the ramifications of diagnosing patients outside hospital settings. Each selected study was synthesized to furnish proof about the implementation of remote diagnostics in telemedicine.</div></div><div><h3>Results</h3><div>A new crossover taxonomy between the most important diagnosed diseases and technologies used for this purpose and their relationship with telemedicine tiers is proposed. The functions executed at each tier are elucidated. Additionally, a compilation of diagnostic technologies is provided. Additionally, open research difficulties, advantages of remote diagnosis in telemedicine, and suggestions for future research prospects that require attention are systematically organized and presented.</div></div><div><h3>Conclusions</h3><div>This study reviews the role of remote diagnosis in telemedicine, with a focus on key technologies and current approaches. This study highlights research challenges, provides recommendations for future directions, and addresses research gaps and limitations to provide a clear vision of r

背景：远程医疗中的“远程诊断”一词描述了医生利用电信技术远程诊断患者的过程。通过这种方法，患者无需亲自前往医院即可获得医疗服务，这对居住在偏远地区或行动不便的人很有帮助。过去，当人们出现健康问题时，他们通常被送到医院，在那里接受临床检查、诊断和治疗。因此，由于病人人数增加或一些病重病人死亡，医院人满为患，因为完成医疗手术需要很长时间。目的：本研究旨在通过文献综述和深入分析：(1)探讨远程诊断在远程医疗中的程序和作用；(2)综述远程诊断的技术工具和技术；(3)对远程医疗远程诊断疾病进行综述；(4)构建疾病、技术和远程医疗的交叉分类；(5)列出已应用于远程诊断的输入变量、生命体征、数据和输出决策；(6)总结了用于评估和验证远程诊断模型的性能评估指标；(7)识别和分类开放的研究问题，同时为远程医疗系统中智能远程诊断的未来发展提供建议。方法：系统检索2016年1月1日至2023年9月13日在IEEE、PubMed、Science Direct、b施普林格和Web of Science上发表的文章。值得注意的是，搜索仅限于英语中的文章。论文检查远程医疗中的远程诊断，用于此功能的技术，以及诊断医院外患者的后果。每个选定的研究被综合起来，为远程医疗中远程诊断的实施提供证据。结果：提出了最重要的诊断疾病和用于此目的的技术之间的新交叉分类及其与远程医疗级别的关系。阐明了在每一层执行的函数。此外，还提供了诊断技术的汇编。此外，系统地整理和提出了开放式研究难点、远程医疗中远程诊断的优势以及需要注意的未来研究前景建议。结论：本研究综述了远程诊断在远程医疗中的作用，重点介绍了远程诊断的关键技术和目前的方法。本研究强调了研究的挑战，提出了未来方向的建议，并解决了研究的差距和局限性，为远程医疗中的远程诊断提供了一个清晰的愿景。这项研究强调了现有研究的优势，并为新的方向和智能医疗解决方案开辟了可能性。

{"title":"A systematic review on the roles of remote diagnosis in telemedicine system: Coherent taxonomy, insights, recommendations, and open research directions for intelligent healthcare solutions","authors":"Sura Saad Mohsin , Omar H. Salman , Abdulrahman Ahmed Jasim , Mohammed A. Al-Nouman , Ammar Riadh Kairaldeen","doi":"10.1016/j.artmed.2024.103057","DOIUrl":"10.1016/j.artmed.2024.103057","url":null,"abstract":"<div><h3>Background</h3><div>The term ‘remote diagnosis’ in telemedicine describes the procedure wherein medical practitioners diagnose patients remotely by using telecommunications technology. With this method, patients can obtain medical care without having to physically visit a hospital, which can be helpful for people who live in distant places or have restricted mobility. When people in the past had health issues, they were usually sent to the hospital, where they received clinical examinations, diagnoses, and treatment at the facility. Thus, hospitals were overcrowded because of the increase in the number of patients or in the death of some very ill patients given that the completion of medical operations required a significant amount of time.</div></div><div><h3>Objective</h3><div>This research aims to provide a literature review study and an in-depth analysis to (1) investigate the procedure and roles of remote diagnosis in telemedicine; (2) review the technical tools and technologies used in remote diagnosis; (3) review the diseases diagnosed remotely in telemedicine; (4) compose a crossover taxonomy among diseases, technologies, and telemedicine; (5) present lists of input variables, vital signs, data and output decisions already applied in remote diagnosis; (6) Summarize the performance assessment measures utilized to assess and validate remote diagnosis models; and (7) identify and categorize open research issues while providing recommendations for future advancements in intelligent remote diagnosis within telemedicine systems.</div></div><div><h3>Methods</h3><div>A systematic search was conducted using online libraries for articles published from 1 January 2016 to 13 September 2023 in IEEE, PubMed, Science Direct, Springer, and Web of Science. Notably, searches were limited to articles in the English language. The papers examine remote diagnosis in telemedicine, the technologies employed for this function, and the ramifications of diagnosing patients outside hospital settings. Each selected study was synthesized to furnish proof about the implementation of remote diagnostics in telemedicine.</div></div><div><h3>Results</h3><div>A new crossover taxonomy between the most important diagnosed diseases and technologies used for this purpose and their relationship with telemedicine tiers is proposed. The functions executed at each tier are elucidated. Additionally, a compilation of diagnostic technologies is provided. Additionally, open research difficulties, advantages of remote diagnosis in telemedicine, and suggestions for future research prospects that require attention are systematically organized and presented.</div></div><div><h3>Conclusions</h3><div>This study reviews the role of remote diagnosis in telemedicine, with a focus on key technologies and current approaches. This study highlights research challenges, provides recommendations for future directions, and addresses research gaps and limitations to provide a clear vision of r","PeriodicalId":55458,"journal":{"name":"Artificial Intelligence in Medicine","volume":"160 ","pages":"Article 103057"},"PeriodicalIF":6.1,"publicationDate":"2025-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142873446","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Concordance-based Predictive Uncertainty (CPU)-Index: Proof-of-concept with application towards improved specificity of lung cancers on low dose screening CT 基于一致性的预测不确定性(CPU)-指数：用于提高肺癌低剂量筛查CT特异性的概念验证。

IF 6.1 2区医学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Artificial Intelligence in Medicine

Pub Date : 2025-02-01 DOI: 10.1016/j.artmed.2024.103055

Yuqi Wang , Aarzu Gupta , Fakrul Islam Tushar , Breylon Riley , Avivah Wang , Tina D. Tailor , Stacy Tantum , Jian-Guo Liu , Mustafa R. Bashir , Joseph Y. Lo , Kyle J. Lafata

In this paper, we introduce a novel concordance-based predictive uncertainty (CPU)-Index, which integrates insights from subgroup analysis and personalized AI time-to-event models. Through its application in refining lung cancer screening (LCS) predictions generated by an individualized AI time-to-event model trained with fused data of low dose CT (LDCT) radiomics with patient demographics, we demonstrate its effectiveness, resulting in improved risk assessment compared to the Lung CT Screening Reporting & Data System (Lung-RADS). Subgroup-based Lung-RADS faces challenges in representing individual variations and relies on a limited set of predefined characteristics, resulting in variable predictions. Conversely, personalized AI time-to-event models are hindered by transparency issues and biases from censored data. By measuring the prediction consistency between subgroup analysis and AI time-to-event models, the CPU-Index framework offers a nuanced evaluation of the bias–variance trade-off and improves the transparency and reliability of predictions. Consistency was estimated by the concordance index of subgroup analysis-based similarity rank and model prediction similarity rank. Subgroup analysis-based similarity loss was defined as the sum-of-the-difference between Lung-RADS and feature-level 0-1 loss. Model prediction similarity loss was defined as squared loss. To test our approach, we identified 3,326 patients who underwent LDCT for LCS from 1/1/2015 to 6/30/2020 with confirmation of lung cancer on pathology within one year. For each LDCT image, the lesion associated with a Lung-RADS score was detected using a pretrained deep learning model from Medical Open Network for AI (MONAI), from which radiomic features were extracted. Radiomics were optimally fused with patient demographics via a positional encoding scheme and used to train a neural multi-task logistic regression time-to-event model that predicts malignancy. Performance was maximized when radiomics features were fused with positionally encoded demographic features. In this configuration, our algorithm raised the AUC from 0.81 ± 0.04 to 0.89 ± 0.02. Compared to standard Lung-RADS, our approach reduced the False-Positive-Rate from 0.41 ± 0.02 to 0.30 ± 0.12 while maintaining the same False-Negative-Rate. Our methodology enhances lung cancer risk assessment by estimating prediction uncertainty and adjusting accordingly. Furthermore, the optimal integration of radiomics and patient demographics improved overall diagnostic performance, indicating their complementary nature.

在本文中，我们引入了一种新的基于一致性的预测不确定性（CPU）指数，它集成了来自子组分析和个性化AI时间到事件模型的见解。通过将其应用于细化肺癌筛查（LCS）预测，该预测由低剂量CT （LDCT）放射组学与患者人口统计学融合数据训练的个性化AI时间到事件模型生成，我们证明了其有效性，与肺CT筛查报告和数据系统（lung - rads）相比，其风险评估得到了改善。基于亚组的Lung-RADS在表示个体变化方面面临挑战，并且依赖于一组有限的预定义特征，从而导致变量预测。相反，个性化的人工智能时间到事件模型受到透明度问题和审查数据偏见的阻碍。通过测量子组分析和AI时间到事件模型之间的预测一致性，CPU-Index框架提供了对偏差-方差权衡的细致评估，并提高了预测的透明度和可靠性。通过基于子群分析的相似度排序和模型预测相似度排序的一致性指数来估计一致性。基于亚组分析的相似性损失被定义为Lung-RADS与特征级0-1损失的差值之和。模型预测相似度损失定义为平方损失。为了验证我们的方法，我们确定了3326例在2015年1月1日至2020年6月30日期间接受LDCT检查的LCS患者，这些患者在一年内病理证实为肺癌。对于每张LDCT图像，使用来自AI医学开放网络（MONAI）的预训练深度学习模型检测与肺部rads评分相关的病变，并从中提取放射学特征。放射组学通过位置编码方案与患者人口统计学最佳融合，并用于训练预测恶性肿瘤的神经多任务逻辑回归时间-事件模型。当放射组学特征与位置编码的人口特征融合时，性能最大化。在这个配置中，我们的算法将AUC从0.81±0.04提高到0.89±0.02。与标准Lung-RADS相比，我们的方法将假阳性率从0.41±0.02降低到0.30±0.12，同时保持相同的假阴性率。我们的方法通过估计预测不确定性并进行相应调整来提高肺癌风险评估。此外，放射组学和患者人口统计学的最佳整合提高了整体诊断性能，表明它们的互补性。

{"title":"Concordance-based Predictive Uncertainty (CPU)-Index: Proof-of-concept with application towards improved specificity of lung cancers on low dose screening CT","authors":"Yuqi Wang , Aarzu Gupta , Fakrul Islam Tushar , Breylon Riley , Avivah Wang , Tina D. Tailor , Stacy Tantum , Jian-Guo Liu , Mustafa R. Bashir , Joseph Y. Lo , Kyle J. Lafata","doi":"10.1016/j.artmed.2024.103055","DOIUrl":"10.1016/j.artmed.2024.103055","url":null,"abstract":"<div><div>In this paper, we introduce a novel concordance-based predictive uncertainty (CPU)-Index, which integrates insights from subgroup analysis and personalized AI time-to-event models. Through its application in refining lung cancer screening (LCS) predictions generated by an individualized AI time-to-event model trained with fused data of low dose CT (LDCT) radiomics with patient demographics, we demonstrate its effectiveness, resulting in improved risk assessment compared to the Lung CT Screening Reporting & Data System (Lung-RADS). Subgroup-based Lung-RADS faces challenges in representing individual variations and relies on a limited set of predefined characteristics, resulting in variable predictions. Conversely, personalized AI time-to-event models are hindered by transparency issues and biases from censored data. By measuring the prediction consistency between subgroup analysis and AI time-to-event models, the CPU-Index framework offers a nuanced evaluation of the bias–variance trade-off and improves the transparency and reliability of predictions. Consistency was estimated by the concordance index of subgroup analysis-based similarity rank and model prediction similarity rank. Subgroup analysis-based similarity loss was defined as the sum-of-the-difference between Lung-RADS and feature-level 0-1 loss. Model prediction similarity loss was defined as squared loss. To test our approach, we identified 3,326 patients who underwent LDCT for LCS from 1/1/2015 to 6/30/2020 with confirmation of lung cancer on pathology within one year. For each LDCT image, the lesion associated with a Lung-RADS score was detected using a pretrained deep learning model from Medical Open Network for AI (MONAI), from which radiomic features were extracted. Radiomics were optimally fused with patient demographics via a positional encoding scheme and used to train a neural multi-task logistic regression time-to-event model that predicts malignancy. Performance was maximized when radiomics features were fused with positionally encoded demographic features. In this configuration, our algorithm raised the AUC from 0.81 ± 0.04 to 0.89 ± 0.02. Compared to standard Lung-RADS, our approach reduced the False-Positive-Rate from 0.41 ± 0.02 to 0.30 ± 0.12 while maintaining the same False-Negative-Rate. Our methodology enhances lung cancer risk assessment by estimating prediction uncertainty and adjusting accordingly. Furthermore, the optimal integration of radiomics and patient demographics improved overall diagnostic performance, indicating their complementary nature.</div></div>","PeriodicalId":55458,"journal":{"name":"Artificial Intelligence in Medicine","volume":"160 ","pages":"Article 103055"},"PeriodicalIF":6.1,"publicationDate":"2025-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142900660","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Advances in diagnosis and prognosis of bacteraemia, bloodstream infection, and sepsis using machine learning: A comprehensive living literature review 利用机器学习在菌血症、血流感染和败血症的诊断和预后方面的进展：一篇全面的文献综述。

IF 6.1 2区医学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Artificial Intelligence in Medicine

Pub Date : 2025-02-01 DOI: 10.1016/j.artmed.2024.103008

Hernandez B. , Ming D.K. , Rawson T.M. , Bolton W. , Wilson R. , Vasikasin V. , Daniels J. , Rodriguez-Manzano J. , Davies F.J. , Georgiou P. , Holmes A.H.

<div><h3>Background:</h3><div>Blood-related infections are a significant concern in healthcare. They can lead to serious medical complications and even death if not promptly diagnosed and treated. Throughout time, medical research has sought to identify clinical factors and strategies to improve the management of these conditions. The increasing adoption of electronic health records has led to a wealth of electronically available medical information and predictive models have emerged as invaluable tools. This manuscript offers a detailed survey of machine-learning techniques used for the diagnosis and prognosis of bacteraemia, bloodstream infections, and sepsis shedding light on their efficacy, potential limitations, and the intricacies of their integration into clinical practice.</div></div><div><h3>Methods:</h3><div>This study presents a comprehensive analysis derived from a thorough search across prominent databases, namely EMBASE, Google Scholar, PubMed, Scopus, and Web of Science, spanning from their inception dates to October 25, 2023. Eligibility assessment was conducted independently by investigators, with inclusion criteria encompassing peer-reviewed articles and pertinent non-peer-reviewed literature. Clinical and technical data were meticulously extracted and integrated into a registry, facilitating a holistic examination of the subject matter. To maintain currency and comprehensiveness, readers are encouraged to contribute manuscript suggestions and/or reports for integration into this living registry.</div></div><div><h3>Results:</h3><div>While machine learning (ML) models exhibit promise in advanced disease stages such as sepsis, early stages remain underexplored due to data limitations. Biochemical markers emerge as pivotal predictors during early stages such as bacteraemia, or bloodstream infections, while vital signs assume significance in sepsis prognosis. Integrating temporal trend information into conventional machine learning models appears to enhance performance. Unfortunately, sequential deep learning models face challenges, showing minimal performance improvements and significant drops in external datasets, potentially due to learning missing patterns within the scarce data available rather than understanding disease dynamics. Real-life implementation receives limited attention, as meeting design requirements proves challenging within existing healthcare infrastructure. The data collected in an event-based fashion during clinical practice is insufficient to fully harness the potential of these data-hungry models. Despite limitations, opportunities abound in leveraging flexible models and exploiting real-time non-invasive data collection technologies such as wearable devices or microneedles. Addressing research gaps in early disease stages, harnessing patient history data often underused, and embracing continual diagnostics beyond treatment initiation are crucial for improving healthcare decision-making support and adoption

背景：血液相关感染是医疗保健中的一个重要问题。如果不及时诊断和治疗，它们可能导致严重的医疗并发症，甚至死亡。长期以来，医学研究一直在寻求确定临床因素和策略，以改善这些疾病的管理。越来越多地采用电子健康记录导致了大量的电子医疗信息和预测模型已成为宝贵的工具。这篇手稿提供了用于诊断和预测菌血症、血液感染和败血症的机器学习技术的详细调查，揭示了它们的功效、潜在的局限性，以及它们融入临床实践的复杂性。方法：本研究对EMBASE、谷歌Scholar、PubMed、Scopus和Web of Science等知名数据库进行了全面的检索，从其建立日期到2023年10月25日。资格评估由研究者独立进行，纳入标准包括同行评议的文章和相关的非同行评议文献。临床和技术数据被精心提取并整合到登记处，促进对主题的全面检查。为了保持时效性和全面性，我们鼓励读者提供手稿建议和/或报告，以便整合到这个动态登记册中。结果：虽然机器学习（ML）模型在败血症等晚期疾病阶段表现出希望，但由于数据限制，早期阶段仍未得到充分探索。在脓毒症的早期阶段，如菌血症或血液感染，生化指标是关键的预测指标，而生命体征在脓毒症的预后中具有重要意义。将时间趋势信息集成到传统的机器学习模型中似乎可以提高性能。不幸的是，序列深度学习模型面临着挑战，表现出最小的性能改进和外部数据集的显著下降，这可能是由于在可用的稀缺数据中学习缺失的模式，而不是理解疾病动态。现实生活中的实现受到的关注有限，因为在现有的医疗保健基础设施中满足设计要求具有挑战性。在临床实践中以基于事件的方式收集的数据不足以充分利用这些数据饥渴模型的潜力。尽管存在局限性，但利用灵活的模型和利用实时非侵入性数据收集技术（如可穿戴设备或微针）的机会仍然很多。解决疾病早期阶段的研究差距，利用经常未被充分利用的患者病史数据，并在治疗开始后进行持续诊断，对于改善整个管理途径的医疗保健决策支持和采用至关重要。结论：这项全面的调查阐明了ML在血液相关感染管理中的应用前景，为未来的研究和临床实践提供了见解。实施临床基于ml的临床决策支持系统需要平衡研究与实际考虑。目前的方法往往导致复杂的模型缺乏透明度和实际验证。集成到医疗保健系统中面临着监管、隐私和信任方面的挑战。清晰的演示和对标准的遵守对于增强对真实医疗保健应用程序的机器学习模型的信心至关重要。

{"title":"Advances in diagnosis and prognosis of bacteraemia, bloodstream infection, and sepsis using machine learning: A comprehensive living literature review","authors":"Hernandez B. , Ming D.K. , Rawson T.M. , Bolton W. , Wilson R. , Vasikasin V. , Daniels J. , Rodriguez-Manzano J. , Davies F.J. , Georgiou P. , Holmes A.H.","doi":"10.1016/j.artmed.2024.103008","DOIUrl":"10.1016/j.artmed.2024.103008","url":null,"abstract":"<div><h3>Background:</h3><div>Blood-related infections are a significant concern in healthcare. They can lead to serious medical complications and even death if not promptly diagnosed and treated. Throughout time, medical research has sought to identify clinical factors and strategies to improve the management of these conditions. The increasing adoption of electronic health records has led to a wealth of electronically available medical information and predictive models have emerged as invaluable tools. This manuscript offers a detailed survey of machine-learning techniques used for the diagnosis and prognosis of bacteraemia, bloodstream infections, and sepsis shedding light on their efficacy, potential limitations, and the intricacies of their integration into clinical practice.</div></div><div><h3>Methods:</h3><div>This study presents a comprehensive analysis derived from a thorough search across prominent databases, namely EMBASE, Google Scholar, PubMed, Scopus, and Web of Science, spanning from their inception dates to October 25, 2023. Eligibility assessment was conducted independently by investigators, with inclusion criteria encompassing peer-reviewed articles and pertinent non-peer-reviewed literature. Clinical and technical data were meticulously extracted and integrated into a registry, facilitating a holistic examination of the subject matter. To maintain currency and comprehensiveness, readers are encouraged to contribute manuscript suggestions and/or reports for integration into this living registry.</div></div><div><h3>Results:</h3><div>While machine learning (ML) models exhibit promise in advanced disease stages such as sepsis, early stages remain underexplored due to data limitations. Biochemical markers emerge as pivotal predictors during early stages such as bacteraemia, or bloodstream infections, while vital signs assume significance in sepsis prognosis. Integrating temporal trend information into conventional machine learning models appears to enhance performance. Unfortunately, sequential deep learning models face challenges, showing minimal performance improvements and significant drops in external datasets, potentially due to learning missing patterns within the scarce data available rather than understanding disease dynamics. Real-life implementation receives limited attention, as meeting design requirements proves challenging within existing healthcare infrastructure. The data collected in an event-based fashion during clinical practice is insufficient to fully harness the potential of these data-hungry models. Despite limitations, opportunities abound in leveraging flexible models and exploiting real-time non-invasive data collection technologies such as wearable devices or microneedles. Addressing research gaps in early disease stages, harnessing patient history data often underused, and embracing continual diagnostics beyond treatment initiation are crucial for improving healthcare decision-making support and adoption","PeriodicalId":55458,"journal":{"name":"Artificial Intelligence in Medicine","volume":"160 ","pages":"Article 103008"},"PeriodicalIF":6.1,"publicationDate":"2025-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142873515","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Practical X-ray gastric cancer diagnostic support using refined stochastic data augmentation and hard boundary box training

IF 6.1 2区医学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Artificial Intelligence in Medicine

Pub Date : 2025-02-01 DOI: 10.1016/j.artmed.2025.103075

Hideaki Okamoto , Quan Huu Cap , Takakiyo Nomura , Kazuhito Nabeshima , Jun Hashimoto , Hitoshi Iyatomi

Endoscopy is widely used to diagnose gastric cancer and has a high diagnostic performance, but it must be performed by a physician, which limits the number of people who can be diagnosed. In contrast, gastric X-rays can be taken by radiographers, thus allowing a much larger number of patients to undergo imaging. However, the diagnosis of X-ray images relies heavily on the expertise and experience of physicians, and few machine learning methods have been developed to assist in this process. We propose a novel and practical gastric cancer diagnostic support system for gastric X-ray images that will enable more people to be screened. The system is based on a general deep learning-based object detection model and incorporates two novel techniques: refined probabilistic stomach image augmentation (R-sGAIA) and hard boundary box training (HBBT). R-sGAIA enhances the probabilistic gastric fold region and provides more learning patterns for cancer detection models. HBBT is an efficient training method that improves model performance by allowing the use of unannotated negative (i.e., healthy control) samples, which are typically unusable in conventional detection models. The proposed system achieved a sensitivity (SE) for gastric cancer of 90.2%, higher than that of an expert (85.5%). Under these conditions, two out of five candidate boxes identified by the system were cancerous (precision = 42.5%), with an image processing speed of 0.51 s per image. The system also outperformed methods using the same object detection model and state-of-the-art data augmentation by showing a 5.9-point improvement in the F1 score. In summary, this system efficiently identifies areas for radiologists to examine within a practical time frame, thus significantly reducing their workload.

{"title":"Practical X-ray gastric cancer diagnostic support using refined stochastic data augmentation and hard boundary box training","authors":"Hideaki Okamoto , Quan Huu Cap , Takakiyo Nomura , Kazuhito Nabeshima , Jun Hashimoto , Hitoshi Iyatomi","doi":"10.1016/j.artmed.2025.103075","DOIUrl":"10.1016/j.artmed.2025.103075","url":null,"abstract":"<div><div>Endoscopy is widely used to diagnose gastric cancer and has a high diagnostic performance, but it must be performed by a physician, which limits the number of people who can be diagnosed. In contrast, gastric X-rays can be taken by radiographers, thus allowing a much larger number of patients to undergo imaging. However, the diagnosis of X-ray images relies heavily on the expertise and experience of physicians, and few machine learning methods have been developed to assist in this process. We propose a novel and practical gastric cancer diagnostic support system for gastric X-ray images that will enable more people to be screened. The system is based on a general deep learning-based object detection model and incorporates two novel techniques: refined probabilistic stomach image augmentation (R-sGAIA) and hard boundary box training (HBBT). R-sGAIA enhances the probabilistic gastric fold region and provides more learning patterns for cancer detection models. HBBT is an efficient training method that improves model performance by allowing the use of unannotated negative (i.e., healthy control) samples, which are typically unusable in conventional detection models. The proposed system achieved a sensitivity (SE) for gastric cancer of 90.2%, higher than that of an expert (85.5%). Under these conditions, two out of five candidate boxes identified by the system were cancerous (precision = 42.5%), with an image processing speed of 0.51 s per image. The system also outperformed methods using the same object detection model and state-of-the-art data augmentation by showing a 5.9-point improvement in the F1 score. In summary, this system efficiently identifies areas for radiologists to examine within a practical time frame, thus significantly reducing their workload.</div></div>","PeriodicalId":55458,"journal":{"name":"Artificial Intelligence in Medicine","volume":"161 ","pages":"Article 103075"},"PeriodicalIF":6.1,"publicationDate":"2025-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143326565","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Intrinsic-dimension analysis for guiding dimensionality reduction and data fusion in multi-omics data processing 本征维度分析用于指导多组学数据处理中的降维和数据融合。

IF 6.1 2区医学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Artificial Intelligence in Medicine

Pub Date : 2025-02-01 DOI: 10.1016/j.artmed.2024.103049

Jessica Gliozzo , Mauricio Soto-Gomez , Valentina Guarino , Arturo Bonometti , Alberto Cabri , Emanuele Cavalleri , Justin Reese , Peter N. Robinson , Marco Mesiti , Giorgio Valentini , Elena Casiraghi

Multi-omics data have revolutionized biomedical research by providing a comprehensive understanding of biological systems and the molecular mechanisms of disease development. However, analyzing multi-omics data is challenging due to high dimensionality and limited sample sizes, necessitating proper data-reduction pipelines to ensure reliable analyses. Additionally, its multimodal nature requires effective data-integration pipelines.

While several dimensionality reduction and data fusion algorithms have been proposed, crucial aspects are often overlooked. Specifically, the choice of projection space dimension is typically heuristic and uniformly applied across all omics, neglecting the unique high dimension small sample size challenges faced by individual omics.

This paper introduces a novel multi-modal dimensionality reduction pipeline tailored to individual views. By leveraging intrinsic dimensionality estimators, we assess the curse-of-dimensionality impact on each view and propose a two-step reduction strategy for significantly affected views, combining feature selection with feature extraction.

Compared to traditional uniform reduction pipelines in a crucial and supervised multi-omics analysis setting, our approach shows significant improvement. Additionally, we explore three effective unsupervised multi-omics data fusion methods rooted in the main data fusion strategies to gain insights into their performance under crucial, yet overlooked, settings.

通过提供对生物系统和疾病发展的分子机制的全面理解，多组学数据已经彻底改变了生物医学研究。然而，由于高维度和有限的样本量，分析多组学数据具有挑战性，需要适当的数据缩减管道来确保可靠的分析。此外，它的多模式特性需要有效的数据集成管道。虽然提出了几种降维和数据融合算法，但往往忽略了关键的方面。具体而言，投影空间维度的选择通常是启发式的，并统一应用于所有组学，而忽略了个体组学面临的独特的高维小样本量挑战。本文介绍了一种针对单个视图定制的新型多模态降维管道。通过利用内在维数估计器，我们评估了对每个视图的维数影响，并提出了一种两步约简策略，将特征选择与特征提取相结合。与传统的统一还原管道相比，在关键和监督的多组学分析环境中，我们的方法显示出显着的改进。此外，我们探索了三种有效的无监督多组学数据融合方法，这些方法植根于主要的数据融合策略，以深入了解它们在关键但被忽视的设置下的性能。

{"title":"Intrinsic-dimension analysis for guiding dimensionality reduction and data fusion in multi-omics data processing","authors":"Jessica Gliozzo , Mauricio Soto-Gomez , Valentina Guarino , Arturo Bonometti , Alberto Cabri , Emanuele Cavalleri , Justin Reese , Peter N. Robinson , Marco Mesiti , Giorgio Valentini , Elena Casiraghi","doi":"10.1016/j.artmed.2024.103049","DOIUrl":"10.1016/j.artmed.2024.103049","url":null,"abstract":"<div><div>Multi-omics data have revolutionized biomedical research by providing a comprehensive understanding of biological systems and the molecular mechanisms of disease development. However, analyzing multi-omics data is challenging due to high dimensionality and limited sample sizes, necessitating proper data-reduction pipelines to ensure reliable analyses. Additionally, its multimodal nature requires effective data-integration pipelines.</div><div>While several dimensionality reduction and data fusion algorithms have been proposed, crucial aspects are often overlooked. Specifically, the choice of projection space dimension is typically heuristic and uniformly applied across all omics, neglecting the unique high dimension small sample size challenges faced by individual omics.</div><div>This paper introduces a novel multi-modal dimensionality reduction pipeline tailored to individual views. By leveraging intrinsic dimensionality estimators, we assess the curse-of-dimensionality impact on each view and propose a two-step reduction strategy for significantly affected views, combining feature selection with feature extraction.</div><div>Compared to traditional uniform reduction pipelines in a crucial and supervised multi-omics analysis setting, our approach shows significant improvement. Additionally, we explore three effective unsupervised multi-omics data fusion methods rooted in the main data fusion strategies to gain insights into their performance under crucial, yet overlooked, settings.</div></div>","PeriodicalId":55458,"journal":{"name":"Artificial Intelligence in Medicine","volume":"160 ","pages":"Article 103049"},"PeriodicalIF":6.1,"publicationDate":"2025-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142824829","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Computer model for gait assessments in Parkinson's patients using a fuzzy inference model and inertial sensors 基于模糊推理模型和惯性传感器的帕金森患者步态评估计算机模型。

IF 6.1 2区医学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Artificial Intelligence in Medicine

Pub Date : 2025-02-01 DOI: 10.1016/j.artmed.2024.103059

Luis Pastor Sánchez-Fernández , Luis Alejandro Sánchez-Pérez , Juan Manuel Martínez-Hernández

Patients with Parkinson's disease (PD) in the moderate and severe stages can present several walk alterations. They can show slow movements and difficulty initiating, varying, or interrupting their gait; freezing; short steps; speed changes; shuffling; little arm swing; and festinating gait. The Movement Disorder Society-Unified Parkinson's Disease Rating Scale (MDS-UPDRS) has a good reputation for uniformly evaluating motor and non-motor aspects of PD. However, the motor clinical assessment depends on visual observations, the results are qualitative, and subtle differences are not identified. This study presents a fuzzy inference model for gait assessments in PD patients with detailed descriptions of signal processing and eight biomechanical indicators computations; as such, other authors can replicate the presented methods. The computer model uses 334 bilateral measurements of 58 Parkinson's patients and 15 healthy control subjects performed over one year. The computer model validations are based on physician evaluations in real-time and post-analysis using an extensive database of videos and signals. The assessment results are explainable, quantitative, and qualitative, increasing their acceptance and use in clinical environments. The computer system design considers three expert motor evaluations, including the PD patients' evolutions; this facilitates correlation with medication doses and appropriate intervals for follow-up medical consultations. The assessments include three qualitative gait conditions of MDS-UPDRS—normal, slight, and mild—as well as a numerical evaluation of up to two decimal places.

帕金森病患者（PD）在中度和重度阶段可以表现出几种行走改变。他们动作缓慢，难以启动、改变或中断他们的步态；冻结;短的步骤;速度变化;洗牌;小胳膊摆动；还有欢庆的步态。运动障碍学会统一帕金森病评定量表（MDS-UPDRS）在统一评估帕金森病的运动和非运动方面享有良好声誉。然而，运动临床评估依赖于视觉观察，结果是定性的，微妙的差异不确定。本研究提出了一种用于PD患者步态评估的模糊推理模型，详细描述了信号处理和8个生物力学指标的计算；因此，其他作者可以复制所提出的方法。该计算机模型使用了58名帕金森患者和15名健康对照者在一年内进行的334次双侧测量。计算机模型的验证是基于医生的实时评估，并使用广泛的视频和信号数据库进行事后分析。评估结果是可解释的、定量的和定性的，增加了它们在临床环境中的接受和使用。计算机系统设计考虑了三种专家运动评价，包括PD患者的进化；这有助于与药物剂量和后续医疗咨询的适当间隔进行关联。评估包括mds - updrs的三种定性步态条件-正常，轻微和轻微-以及高达小数点后两位的数值评估。

{"title":"Computer model for gait assessments in Parkinson's patients using a fuzzy inference model and inertial sensors","authors":"Luis Pastor Sánchez-Fernández , Luis Alejandro Sánchez-Pérez , Juan Manuel Martínez-Hernández","doi":"10.1016/j.artmed.2024.103059","DOIUrl":"10.1016/j.artmed.2024.103059","url":null,"abstract":"<div><div>Patients with Parkinson's disease (PD) in the moderate and severe stages can present several walk alterations. They can show slow movements and difficulty initiating, varying, or interrupting their gait; freezing; short steps; speed changes; shuffling; little arm swing; and festinating gait. The Movement Disorder Society-Unified Parkinson's Disease Rating Scale (MDS-UPDRS) has a good reputation for uniformly evaluating motor and non-motor aspects of PD. However, the motor clinical assessment depends on visual observations, the results are qualitative, and subtle differences are not identified. This study presents a fuzzy inference model for gait assessments in PD patients with detailed descriptions of signal processing and eight biomechanical indicators computations; as such, other authors can replicate the presented methods. The computer model uses 334 bilateral measurements of 58 Parkinson's patients and 15 healthy control subjects performed over one year. The computer model validations are based on physician evaluations in real-time and post-analysis using an extensive database of videos and signals. The assessment results are explainable, quantitative, and qualitative, increasing their acceptance and use in clinical environments. The computer system design considers three expert motor evaluations, including the PD patients' evolutions; this facilitates correlation with medication doses and appropriate intervals for follow-up medical consultations. The assessments include three qualitative gait conditions of MDS-UPDRS—normal, slight, and mild—as well as a numerical evaluation of up to two decimal places.</div></div>","PeriodicalId":55458,"journal":{"name":"Artificial Intelligence in Medicine","volume":"160 ","pages":"Article 103059"},"PeriodicalIF":6.1,"publicationDate":"2025-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142900658","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Fraud detection in healthcare claims using machine learning: A systematic review 使用机器学习的医疗保健索赔欺诈检测：系统回顾。

IF 6.1 2区医学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Artificial Intelligence in Medicine

Pub Date : 2025-02-01 DOI: 10.1016/j.artmed.2024.103061

Anli du Preez , Sanmitra Bhattacharya , Peter Beling , Edward Bowen

Objective:

Identifying fraud in healthcare programs is crucial, as an estimated 3%–10% of the total healthcare expenditures are lost to fraudulent activities. This study presents a systematic literature review of machine learning techniques applied to fraud detection in health insurance claims. We aim to analyze the data and methodologies documented in the literature over the past two decades, providing insights into research challenges and opportunities.

Methods:

We identified research studies on health insurance fraud detection using machine learning approaches from databases such as Google Scholar, Springer-Link journals, Elsevier, PubMed, Excerpta Medica Database (EMBASE), Scopus, the Association for Computing Machinery (ACM) Digital Library, and the Institute of Electrical and Electronics Engineers (IEEE) Xplore Digital Library. We included only articles that presented experimental results of machine learning-based approaches applied to healthcare claims. From the reviewed articles, 137 were selected for the final qualitative and quantitative analyses.

Results:

In recent years, there has been a surge in publications centered on the use of machine learning to detect health insurance fraud. Among these studies, those focused on the detection of fraud committed by healthcare providers was the most prevalent, followed by fraud committed by patients. A wide variety of machine learning algorithms are highlighted in these studies, ranging from unsupervised (41 studies) and supervised methods (94 studies), to hybrid approaches (12 studies). While traditional machine learning approaches remain dominant in this research area, the adoption of advanced deep learning techniques is on the rise. Considering the type of healthcare claims data used, 30 studies utilized private data sources, while the rest used publicly available datasets. Data from 16 countries were utilized, with a majority coming from the United States (96 studies), followed by China (11 studies) and Australia (5 studies).

Discussion and Conclusion:

Detecting fraud in healthcare claims using machine learning presents several challenges. These include inconsistent data, absence of data standardization and integration, privacy concerns, and a limited number of labeled fraudulent cases to train models on. Future work should focus on enhancing transparency in data preparation, promoting the sharing of fraud investigation outcomes by authorities, and developing benchmark datasets to enhance accessibility and comparability. Furthermore, innovative techniques in data sampling, feature encoding methods for training machine learning models, and exploring the latest advancements in deep learning can significantly advance research in health insurance fraud detection.

目的：识别医疗保健计划中的欺诈行为至关重要，因为估计有3%-10%的医疗保健总支出损失在欺诈活动中。本研究对机器学习技术在健康保险索赔欺诈检测中的应用进行了系统的文献综述。我们的目标是分析过去二十年来文献中记录的数据和方法，为研究挑战和机遇提供见解。方法：我们从b谷歌Scholar、Springer-Link期刊、Elsevier、PubMed、摘录医学数据库（EMBASE）、Scopus、计算机械协会（ACM）数字图书馆和电气与电子工程师协会（IEEE） Xplore数字图书馆等数据库中确定了使用机器学习方法进行医疗保险欺诈检测的研究。我们只收录了将基于机器学习的方法应用于医疗保健索赔的实验结果的文章。从审查的文章中，选择137篇进行最后的定性和定量分析。结果：近年来，以使用机器学习检测健康保险欺诈为中心的出版物激增。在这些研究中，关注医疗保健提供者欺诈行为的研究最为普遍，其次是患者欺诈行为。这些研究强调了各种各样的机器学习算法，从无监督（41项研究）和监督方法（94项研究）到混合方法（12项研究）。虽然传统的机器学习方法在这一研究领域仍然占主导地位，但先进的深度学习技术的采用正在上升。考虑到所使用的医疗保健索赔数据的类型，30项研究使用了私人数据源，而其余研究使用了公开可用的数据集。本研究使用了来自16个国家的数据，其中大部分来自美国（96项研究），其次是中国（11项研究）和澳大利亚（5项研究）。讨论和结论：使用机器学习检测医疗保健索赔中的欺诈存在几个挑战。这些问题包括不一致的数据、缺乏数据标准化和集成、隐私问题以及用于训练模型的标记欺诈案例数量有限。未来的工作应侧重于提高数据准备的透明度，促进当局共享欺诈调查结果，并开发基准数据集以提高可及性和可比性。此外，数据采样的创新技术、用于训练机器学习模型的特征编码方法以及探索深度学习的最新进展可以显著推进医疗保险欺诈检测的研究。

{"title":"Fraud detection in healthcare claims using machine learning: A systematic review","authors":"Anli du Preez , Sanmitra Bhattacharya , Peter Beling , Edward Bowen","doi":"10.1016/j.artmed.2024.103061","DOIUrl":"10.1016/j.artmed.2024.103061","url":null,"abstract":"<div><h3>Objective:</h3><div>Identifying fraud in healthcare programs is crucial, as an estimated 3%–10% of the total healthcare expenditures are lost to fraudulent activities. This study presents a systematic literature review of machine learning techniques applied to fraud detection in health insurance claims. We aim to analyze the data and methodologies documented in the literature over the past two decades, providing insights into research challenges and opportunities.</div></div><div><h3>Methods:</h3><div>We identified research studies on health insurance fraud detection using machine learning approaches from databases such as Google Scholar, Springer-Link journals, Elsevier, PubMed, Excerpta Medica Database (EMBASE), Scopus, the Association for Computing Machinery (ACM) Digital Library, and the Institute of Electrical and Electronics Engineers (IEEE) Xplore Digital Library. We included only articles that presented experimental results of machine learning-based approaches applied to healthcare claims. From the reviewed articles, 137 were selected for the final qualitative and quantitative analyses.</div></div><div><h3>Results:</h3><div>In recent years, there has been a surge in publications centered on the use of machine learning to detect health insurance fraud. Among these studies, those focused on the detection of fraud committed by healthcare providers was the most prevalent, followed by fraud committed by patients. A wide variety of machine learning algorithms are highlighted in these studies, ranging from unsupervised (41 studies) and supervised methods (94 studies), to hybrid approaches (12 studies). While traditional machine learning approaches remain dominant in this research area, the adoption of advanced deep learning techniques is on the rise. Considering the type of healthcare claims data used, 30 studies utilized private data sources, while the rest used publicly available datasets. Data from 16 countries were utilized, with a majority coming from the United States (96 studies), followed by China (11 studies) and Australia (5 studies).</div></div><div><h3>Discussion and Conclusion:</h3><div>Detecting fraud in healthcare claims using machine learning presents several challenges. These include inconsistent data, absence of data standardization and integration, privacy concerns, and a limited number of labeled fraudulent cases to train models on. Future work should focus on enhancing transparency in data preparation, promoting the sharing of fraud investigation outcomes by authorities, and developing benchmark datasets to enhance accessibility and comparability. Furthermore, innovative techniques in data sampling, feature encoding methods for training machine learning models, and exploring the latest advancements in deep learning can significantly advance research in health insurance fraud detection.</div></div>","PeriodicalId":55458,"journal":{"name":"Artificial Intelligence in Medicine","volume":"160 ","pages":"Article 103061"},"PeriodicalIF":6.1,"publicationDate":"2025-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142933715","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Prediction of radiological decision errors from longitudinal analysis of gaze and image features 从凝视和图像特征的纵向分析预测放射学决策误差。

IF 6.1 2区医学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Artificial Intelligence in Medicine

Pub Date : 2025-02-01 DOI: 10.1016/j.artmed.2024.103051

Anna Anikina , Diliara Ibragimova , Tamerlan Mustafaev , Claudia Mello-Thoms , Bulat Ibragimov

Medical imaging, particularly radiography, is an indispensable part of diagnosing many chest diseases. Final diagnoses are made by radiologists based on images, but the decision-making process is always associated with a risk of incorrect interpretation. Incorrectly interpreted data can lead to delays in treatment, a prescription of inappropriate therapy, or even a completely missed diagnosis. In this context, our study aims to determine whether it is possible to predict diagnostic errors made by radiologists using eye-tracking technology. For this purpose, we asked 4 radiologists with different levels of experience to analyze 1000 images covering a wide range of chest diseases. Using eye-tracking data, we calculated the radiologists’ gaze fixation points and generated feature vectors based on this data to describe the radiologists’ gaze behavior during image analysis. Additionally, we emulated the process of revealing the read images following radiologists’ gaze data to create a more comprehensive picture of their analysis. Then we applied a recurrent neural network to predict diagnostic errors. Our results showed a 0.7755 ROC AUC score, demonstrating a significant potential for this approach in enhancing the accuracy of diagnostic error recognition.

医学影像，特别是x线摄影，是诊断许多胸部疾病不可或缺的一部分。最终诊断是由放射科医生根据图像做出的，但决策过程总是伴随着错误解释的风险。错误解读的数据可能导致治疗延误，开出不适当的治疗处方，甚至完全错过诊断。在此背景下，我们的研究旨在确定是否有可能预测放射科医生使用眼动追踪技术所犯的诊断错误。为此，我们请了4位不同经验水平的放射科医生分析了1000张涵盖各种胸部疾病的图像。利用眼动追踪数据，计算放射科医生注视点，并基于这些数据生成特征向量来描述放射科医生在图像分析过程中的注视行为。此外，我们模拟了根据放射科医生的注视数据显示读取图像的过程，以创建他们分析的更全面的图像。然后应用递归神经网络预测诊断错误。我们的结果显示，ROC AUC得分为0.7755，表明该方法在提高诊断错误识别的准确性方面具有显著的潜力。

{"title":"Prediction of radiological decision errors from longitudinal analysis of gaze and image features","authors":"Anna Anikina , Diliara Ibragimova , Tamerlan Mustafaev , Claudia Mello-Thoms , Bulat Ibragimov","doi":"10.1016/j.artmed.2024.103051","DOIUrl":"10.1016/j.artmed.2024.103051","url":null,"abstract":"<div><div>Medical imaging, particularly radiography, is an indispensable part of diagnosing many chest diseases. Final diagnoses are made by radiologists based on images, but the decision-making process is always associated with a risk of incorrect interpretation. Incorrectly interpreted data can lead to delays in treatment, a prescription of inappropriate therapy, or even a completely missed diagnosis. In this context, our study aims to determine whether it is possible to predict diagnostic errors made by radiologists using eye-tracking technology. For this purpose, we asked 4 radiologists with different levels of experience to analyze 1000 images covering a wide range of chest diseases. Using eye-tracking data, we calculated the radiologists’ gaze fixation points and generated feature vectors based on this data to describe the radiologists’ gaze behavior during image analysis. Additionally, we emulated the process of revealing the read images following radiologists’ gaze data to create a more comprehensive picture of their analysis. Then we applied a recurrent neural network to predict diagnostic errors. Our results showed a 0.7755 ROC AUC score, demonstrating a significant potential for this approach in enhancing the accuracy of diagnostic error recognition.</div></div>","PeriodicalId":55458,"journal":{"name":"Artificial Intelligence in Medicine","volume":"160 ","pages":"Article 103051"},"PeriodicalIF":6.1,"publicationDate":"2025-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142873483","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Rough hypervolume-driven feature selection with groupwise intelligent sampling for detecting clinical characterization of lupus nephritis 利用分组智能采样进行粗糙超卷积特征选择，检测狼疮性肾炎的临床特征。

IF 6.1 2区医学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Artificial Intelligence in Medicine

Pub Date : 2025-02-01 DOI: 10.1016/j.artmed.2024.103042

Xinsen Zhou , Yi Chen , Ali Asghar Heidari , Huiling Chen , Xiaowei Chen

Systemic lupus erythematosus (SLE) is an autoimmune inflammatory disease. Lupus nephritis (LN) is a major risk factor for morbidity and mortality in SLE. Proliferative and pure membranous LN have different prognoses and may require different treatments. This study proposes a binary rough hypervolume-driven spherical evolution algorithm with groupwise intelligent sampling (bRGSE). The efficient dimensionality reduction capability of the bRGSE is verified across twelve datasets. These datasets are from the public datasets, with feature dimensions ranging from seven hundred to fifty thousand. The experimental results indicate that bRGSE performs better than seven high-performing alternatives. Then, the bRGSE was combined with adaptive boosting (AdaBoost) to form a new model (bRGSE_AdaBoost), which analyzed clinical records collected from 110 patients with LN. Experimental results show that the proposed bRGSE_AdaBoost can identify the most critical indicators, including urine latent blood, white blood cells, endogenous creatinine clearing rate, and age. These indicators may help differentiate between proliferative LN and membranous LN. The proposed bRGSE algorithm is an efficient dimensionality reduction method. The developed bRGSE_AdaBoost model, a computer-aided model, achieved an accuracy of 96.687 % and is expected to provide early warning for the treatment and diagnosis of LN.

系统性红斑狼疮（SLE）是一种自身免疫性炎症疾病。狼疮性肾炎（LN）是系统性红斑狼疮发病和死亡的主要危险因素。增生性 LN 和纯膜性 LN 的预后不同，可能需要不同的治疗方法。本研究提出了一种具有分组智能采样功能的二元粗糙超卷积驱动球形进化算法（bRGSE）。bRGSE 的高效降维能力在 12 个数据集上得到了验证。这些数据集来自公共数据集，特征维度从七百到五万不等。实验结果表明，bRGSE 的表现优于七个表现优异的替代方案。然后，bRGSE 与自适应提升（AdaBoost）相结合，形成了一个新模型（bRGSE_AdaBoost），该模型分析了从 110 名 LN 患者收集的临床记录。实验结果表明，所提出的 bRGSE_AdaBoost 可以识别最关键的指标，包括尿潜血、白细胞、内源性肌酐清除率和年龄。这些指标有助于区分增生性 LN 和膜性 LN。所提出的 bRGSE 算法是一种高效的降维方法。所开发的 bRGSE_AdaBoost 计算机辅助模型的准确率达到 96.687%，有望为 LN 的治疗和诊断提供早期预警。

{"title":"Rough hypervolume-driven feature selection with groupwise intelligent sampling for detecting clinical characterization of lupus nephritis","authors":"Xinsen Zhou , Yi Chen , Ali Asghar Heidari , Huiling Chen , Xiaowei Chen","doi":"10.1016/j.artmed.2024.103042","DOIUrl":"10.1016/j.artmed.2024.103042","url":null,"abstract":"<div><div>Systemic lupus erythematosus (SLE) is an autoimmune inflammatory disease. Lupus nephritis (LN) is a major risk factor for morbidity and mortality in SLE. Proliferative and pure membranous LN have different prognoses and may require different treatments. This study proposes a binary rough hypervolume-driven spherical evolution algorithm with groupwise intelligent sampling (bRGSE). The efficient dimensionality reduction capability of the bRGSE is verified across twelve datasets. These datasets are from the public datasets, with feature dimensions ranging from seven hundred to fifty thousand. The experimental results indicate that bRGSE performs better than seven high-performing alternatives. Then, the bRGSE was combined with adaptive boosting (AdaBoost) to form a new model (bRGSE_AdaBoost), which analyzed clinical records collected from 110 patients with LN. Experimental results show that the proposed bRGSE_AdaBoost can identify the most critical indicators, including urine latent blood, white blood cells, endogenous creatinine clearing rate, and age. These indicators may help differentiate between proliferative LN and membranous LN. The proposed bRGSE algorithm is an efficient dimensionality reduction method. The developed bRGSE_AdaBoost model, a computer-aided model, achieved an accuracy of 96.687 % and is expected to provide early warning for the treatment and diagnosis of LN.</div></div>","PeriodicalId":55458,"journal":{"name":"Artificial Intelligence in Medicine","volume":"160 ","pages":"Article 103042"},"PeriodicalIF":6.1,"publicationDate":"2025-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142824848","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0