首页 > 最新文献

Artificial Intelligence in Medicine最新文献

英文 中文
Finger-aware Artificial Neural Network for predicting arthritis in Patients with hand pain
IF 6.1 2区 医学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-02-14 DOI: 10.1016/j.artmed.2025.103077
Hwa-Ah-Ni Lee , Geun-Hyeong Kim , Seung Park , In Ah Choi , Hyun Woo Kwon , Hansol Moon , Jae Hyun Jung , Chulhan Kim
Arthritis is an inflammatory condition associated with joint damage, the incidence of which is increasing worldwide. In severe cases, arthritis can result in the restriction of joint movement, thereby affecting daily activities; as such, early and accurate diagnosis crucial to ensure effective treatment and management. Advances in imaging technologies used for arthritis diagnosis, particularly Single Photon Emission Computed Tomography/Computed Tomography (SPECT/CT), have enabled the quantitative measurement of joint inflammation using SUVmax. To the best of our knowledge, this is the first study to apply deep learning to SUVmax to predict the development of hand arthritis. We developed a transformer-based Finger-aware Artificial Neural Network (FANN) to predict arthritis in patients experiencing hand pain, including finger embedding, and to share unique finger-specific information between hands. Compared to conventional machine learning models, the FANN model demonstrated superior performance, achieving an area under the receiver operating characteristic curve of 0.85, accuracy of 0.79, precision of 0.87, recall of 0.79, and F1-score of 0.83. Furthermore, analysis using the SHapley Additive exPlanations (SHAP) algorithm revealed that the FANN predictions were most significantly influenced by the proximal interphalangeal joints of the right hand, in which arthritis is the most clinically prevalent. These findings indicate that the FANN significantly enhances arthritis prediction, representing a promising tool for clinical decision-making in arthritis diagnosis.
{"title":"Finger-aware Artificial Neural Network for predicting arthritis in Patients with hand pain","authors":"Hwa-Ah-Ni Lee ,&nbsp;Geun-Hyeong Kim ,&nbsp;Seung Park ,&nbsp;In Ah Choi ,&nbsp;Hyun Woo Kwon ,&nbsp;Hansol Moon ,&nbsp;Jae Hyun Jung ,&nbsp;Chulhan Kim","doi":"10.1016/j.artmed.2025.103077","DOIUrl":"10.1016/j.artmed.2025.103077","url":null,"abstract":"<div><div>Arthritis is an inflammatory condition associated with joint damage, the incidence of which is increasing worldwide. In severe cases, arthritis can result in the restriction of joint movement, thereby affecting daily activities; as such, early and accurate diagnosis crucial to ensure effective treatment and management. Advances in imaging technologies used for arthritis diagnosis, particularly Single Photon Emission Computed Tomography/Computed Tomography (SPECT/CT), have enabled the quantitative measurement of joint inflammation using <span><math><msub><mrow><mtext>SUV</mtext></mrow><mrow><mtext>max</mtext></mrow></msub></math></span>. To the best of our knowledge, this is the first study to apply deep learning to <span><math><msub><mrow><mtext>SUV</mtext></mrow><mrow><mtext>max</mtext></mrow></msub></math></span> to predict the development of hand arthritis. We developed a transformer-based Finger-aware Artificial Neural Network (FANN) to predict arthritis in patients experiencing hand pain, including finger embedding, and to share unique finger-specific information between hands. Compared to conventional machine learning models, the FANN model demonstrated superior performance, achieving an area under the receiver operating characteristic curve of 0.85, accuracy of 0.79, precision of 0.87, recall of 0.79, and F1-score of 0.83. Furthermore, analysis using the SHapley Additive exPlanations (SHAP) algorithm revealed that the FANN predictions were most significantly influenced by the proximal interphalangeal joints of the right hand, in which arthritis is the most clinically prevalent. These findings indicate that the FANN significantly enhances arthritis prediction, representing a promising tool for clinical decision-making in arthritis diagnosis.</div></div>","PeriodicalId":55458,"journal":{"name":"Artificial Intelligence in Medicine","volume":"162 ","pages":"Article 103077"},"PeriodicalIF":6.1,"publicationDate":"2025-02-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143437715","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Empowering large language models for automated clinical assessment with generation-augmented retrieval and hierarchical chain-of-thought
IF 6.1 2区 医学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-02-12 DOI: 10.1016/j.artmed.2025.103078
Zhanzhong Gu , Wenjing Jia , Massimo Piccardi , Ping Yu

Background:

Understanding and extracting valuable information from electronic health records (EHRs) is important for improving healthcare delivery and health outcomes. Large language models (LLMs) have demonstrated significant proficiency in natural language understanding and processing, offering promises for automating the typically labor-intensive and time-consuming analytical tasks with EHRs. Despite the active application of LLMs in the healthcare setting, many foundation models lack real-world healthcare relevance. Applying LLMs to EHRs is still in its early stage. To advance this field, in this study, we pioneer a generation-augmented prompting paradigm “GAPrompt” to empower generic LLMs for automated clinical assessment, in particular, quantitative stroke severity assessment, using data extracted from EHRs.

Methods:

The GAPrompt paradigm comprises five components: (i) prompt-driven selection of LLMs, (ii) generation-augmented construction of a knowledge base, (iii) summary-based generation-augmented retrieval (SGAR); (iv) inferencing with a hierarchical chain-of-thought (HCoT), and (v) ensembling of multiple generations.

Results:

GAPrompt addresses the limitations of generic LLMs in clinical applications in a progressive manner. It efficiently evaluates the applicability of LLMs in specific tasks through LLM selection prompting, enhances their understanding of task-specific knowledge from the constructed knowledge base, improves the accuracy of knowledge and demonstration retrieval via SGAR, elevates LLM inference precision through HCoT, enhances generation robustness, and reduces hallucinations of LLM via ensembling. Experiment results demonstrate the capability of our method to empower LLMs to automatically assess EHRs and generate quantitative clinical assessment results.

Conclusion:

Our study highlights the applicability of enhancing the capabilities of foundation LLMs in medical domain-specific tasks, i.e., automated quantitative analysis of EHRs, addressing the challenges of labor-intensive and often manually conducted quantitative assessment of stroke in clinical practice and research. This approach offers a practical and accessible GAPrompt paradigm for researchers and industry practitioners seeking to leverage the power of LLMs in domain-specific applications. Its utility extends beyond the medical domain, applicable to a wide range of fields.
{"title":"Empowering large language models for automated clinical assessment with generation-augmented retrieval and hierarchical chain-of-thought","authors":"Zhanzhong Gu ,&nbsp;Wenjing Jia ,&nbsp;Massimo Piccardi ,&nbsp;Ping Yu","doi":"10.1016/j.artmed.2025.103078","DOIUrl":"10.1016/j.artmed.2025.103078","url":null,"abstract":"<div><h3>Background:</h3><div>Understanding and extracting valuable information from electronic health records (EHRs) is important for improving healthcare delivery and health outcomes. Large language models (LLMs) have demonstrated significant proficiency in natural language understanding and processing, offering promises for automating the typically labor-intensive and time-consuming analytical tasks with EHRs. Despite the active application of LLMs in the healthcare setting, many foundation models lack real-world healthcare relevance. Applying LLMs to EHRs is still in its early stage. To advance this field, in this study, we pioneer a generation-augmented prompting paradigm “GAPrompt” to empower generic LLMs for automated clinical assessment, in particular, quantitative stroke severity assessment, using data extracted from EHRs.</div></div><div><h3>Methods:</h3><div>The GAPrompt paradigm comprises five components: (i) prompt-driven selection of LLMs, (ii) generation-augmented construction of a knowledge base, (iii) summary-based generation-augmented retrieval (SGAR); (iv) inferencing with a hierarchical chain-of-thought (HCoT), and (v) ensembling of multiple generations.</div></div><div><h3>Results:</h3><div>GAPrompt addresses the limitations of generic LLMs in clinical applications in a progressive manner. It efficiently evaluates the applicability of LLMs in specific tasks through LLM selection prompting, enhances their understanding of task-specific knowledge from the constructed knowledge base, improves the accuracy of knowledge and demonstration retrieval via SGAR, elevates LLM inference precision through HCoT, enhances generation robustness, and reduces hallucinations of LLM via ensembling. Experiment results demonstrate the capability of our method to empower LLMs to automatically assess EHRs and generate quantitative clinical assessment results.</div></div><div><h3>Conclusion:</h3><div>Our study highlights the applicability of enhancing the capabilities of foundation LLMs in medical domain-specific tasks, <em>i.e.</em>, automated quantitative analysis of EHRs, addressing the challenges of labor-intensive and often manually conducted quantitative assessment of stroke in clinical practice and research. This approach offers a practical and accessible GAPrompt paradigm for researchers and industry practitioners seeking to leverage the power of LLMs in domain-specific applications. Its utility extends beyond the medical domain, applicable to a wide range of fields.</div></div>","PeriodicalId":55458,"journal":{"name":"Artificial Intelligence in Medicine","volume":"162 ","pages":"Article 103078"},"PeriodicalIF":6.1,"publicationDate":"2025-02-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143437716","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Artificial intelligence-driven approaches in antibiotic stewardship programs and optimizing prescription practices: A systematic review
IF 6.1 2区 医学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-02-12 DOI: 10.1016/j.artmed.2025.103089
Hamid Harandi , Maryam Shafaati , Mohammadreza Salehi , Mohammad Mahdi Roozbahani , Keyhan Mohammadi , Samaneh Akbarpour , Ramin Rahimnia , Gholamreza Hassanpour , Yasin Rahmani , Arash Seifi
Antimicrobial stewardship programs (ASPs) are essential in optimizing the use of antibiotics to address the global concern of antimicrobial resistance (AMR). Artificial intelligence (AI) and machine learning (ML) have emerged as promising tools for enhancing ASPs efficiency by improving antibiotic prescription accuracy, resistance prediction, and dosage optimization. This systematic review evaluated the application of AI-driven ASPs, focusing on their methodologies, outcomes, and challenges. We searched all of the databases in PubMed, Scopus, Web of Science, and Embase using keywords related to “AI” and “antibiotic.” We only included studies that used AI and ML algorithms in ASPs, with the main criteria being empirical antibiotic selection, dose adjustment, and ASP adherence. There were no limits on time, setting, or language. Two authors independently screened studies for inclusion and assessed their risk of bias using the Newcastle Ottawa Scale (NOS) Assessment tool for observational studies. Implementation studies underscored AI's potential for improving antimicrobial stewardship programs. Two studies showed that logistic regression, boosted-tree models, and gradient-boosting machines could effectively describe the difference between patients who needed to change their antibiotic regimen and those who did not. Twenty-four studies have confirmed the role of machine learning in optimizing empirical antibiotic selection, predicting resistance, and enhancing therapy appropriateness, all of which have the potential to reduce mortality rates. Additionally, machine learning algorithms showed promise in optimizing antibiotic dosing, particularly for vancomycin. This systematic review aimed to highlight various AI models, their applications in ASPs, and the resulting impact on healthcare outcomes. Machine learning and AI models effectively enhance antibiotic stewardship by optimizing patient interventions, empirical antibiotic selection, resistance prediction, and dosing. However, it subtly draws attention to the differences between high-income countries (HICs) and low- and middle-income countries (LMICs), highlighting the structural difficulties that LMICs confront while simultaneously highlighting the progress made in HICs.
{"title":"Artificial intelligence-driven approaches in antibiotic stewardship programs and optimizing prescription practices: A systematic review","authors":"Hamid Harandi ,&nbsp;Maryam Shafaati ,&nbsp;Mohammadreza Salehi ,&nbsp;Mohammad Mahdi Roozbahani ,&nbsp;Keyhan Mohammadi ,&nbsp;Samaneh Akbarpour ,&nbsp;Ramin Rahimnia ,&nbsp;Gholamreza Hassanpour ,&nbsp;Yasin Rahmani ,&nbsp;Arash Seifi","doi":"10.1016/j.artmed.2025.103089","DOIUrl":"10.1016/j.artmed.2025.103089","url":null,"abstract":"<div><div>Antimicrobial stewardship programs (ASPs) are essential in optimizing the use of antibiotics to address the global concern of antimicrobial resistance (AMR). Artificial intelligence (AI) and machine learning (ML) have emerged as promising tools for enhancing ASPs efficiency by improving antibiotic prescription accuracy, resistance prediction, and dosage optimization. This systematic review evaluated the application of AI-driven ASPs, focusing on their methodologies, outcomes, and challenges. We searched all of the databases in PubMed, Scopus, Web of Science, and Embase using keywords related to “AI” and “antibiotic.” We only included studies that used AI and ML algorithms in ASPs, with the main criteria being empirical antibiotic selection, dose adjustment, and ASP adherence. There were no limits on time, setting, or language. Two authors independently screened studies for inclusion and assessed their risk of bias using the Newcastle Ottawa Scale (NOS) Assessment tool for observational studies. Implementation studies underscored AI's potential for improving antimicrobial stewardship programs. Two studies showed that logistic regression, boosted-tree models, and gradient-boosting machines could effectively describe the difference between patients who needed to change their antibiotic regimen and those who did not. Twenty-four studies have confirmed the role of machine learning in optimizing empirical antibiotic selection, predicting resistance, and enhancing therapy appropriateness, all of which have the potential to reduce mortality rates. Additionally, machine learning algorithms showed promise in optimizing antibiotic dosing, particularly for vancomycin. This systematic review aimed to highlight various AI models, their applications in ASPs, and the resulting impact on healthcare outcomes. Machine learning and AI models effectively enhance antibiotic stewardship by optimizing patient interventions, empirical antibiotic selection, resistance prediction, and dosing. However, it subtly draws attention to the differences between high-income countries (HICs) and low- and middle-income countries (LMICs), highlighting the structural difficulties that LMICs confront while simultaneously highlighting the progress made in HICs.</div></div>","PeriodicalId":55458,"journal":{"name":"Artificial Intelligence in Medicine","volume":"162 ","pages":"Article 103089"},"PeriodicalIF":6.1,"publicationDate":"2025-02-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143421785","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Development of decision support tools by model order reduction for active endovascular navigation
IF 6.1 2区 医学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-02-07 DOI: 10.1016/j.artmed.2025.103080
Arif Badrou , Arnaud Duval , Jérôme Szewczyk , Raphaël Blanc , Nicolas Tardif , Nahiène Hamila , Anthony Gravouil , Aline Bel-Brunon
Endovascular therapies enable minimally invasive treatment of vascular pathologies by guiding long tools towards the target area. However, certain pathways, such as the Supra-Aortic Trunks (SATs), present complex trajectories that make navigation challenging. To improve catheterization access to these challenging targets, an active guidewire composed of Shape Memory Alloy has been developed. Our study focuses on navigating this device and associated catheters to reach neurovascular targets via the left carotid artery. In previous work, a finite element model was used to simulate the navigation of the active guidewire and catheters from the aortic arch to the branching of the left carotid artery in patient-specific aortas. However, these numerical simulations are computationally intensive, limiting their feasibility for real-time navigation assistance. To address this, we present the development of numerical charts that enable real-time computation based on high-fidelity FE simulations. These charts predict: (1) the behavior of the active guidewire, and (2) the navigation of the guidewire and catheters within specific anatomical configurations, based on guidewire and navigation parameters. Using the High Order Proper Generalized Decomposition (HOPGD) method, these charts achieve accurate real-time predictions with errors below 5 % and a response time of 103 seconds, based on a limited number of preliminary high-fidelity computations. These findings could significantly contribute to the development of clinically applicable methods to enhance endovascular procedures and the advance the broader field of neurovascular interventions.
{"title":"Development of decision support tools by model order reduction for active endovascular navigation","authors":"Arif Badrou ,&nbsp;Arnaud Duval ,&nbsp;Jérôme Szewczyk ,&nbsp;Raphaël Blanc ,&nbsp;Nicolas Tardif ,&nbsp;Nahiène Hamila ,&nbsp;Anthony Gravouil ,&nbsp;Aline Bel-Brunon","doi":"10.1016/j.artmed.2025.103080","DOIUrl":"10.1016/j.artmed.2025.103080","url":null,"abstract":"<div><div>Endovascular therapies enable minimally invasive treatment of vascular pathologies by guiding long tools towards the target area. However, certain pathways, such as the Supra-Aortic Trunks (SATs), present complex trajectories that make navigation challenging. To improve catheterization access to these challenging targets, an active guidewire composed of Shape Memory Alloy has been developed. Our study focuses on navigating this device and associated catheters to reach neurovascular targets via the left carotid artery. In previous work, a finite element model was used to simulate the navigation of the active guidewire and catheters from the aortic arch to the branching of the left carotid artery in patient-specific aortas. However, these numerical simulations are computationally intensive, limiting their feasibility for real-time navigation assistance. To address this, we present the development of numerical charts that enable real-time computation based on high-fidelity FE simulations. These charts predict: (1) the behavior of the active guidewire, and (2) the navigation of the guidewire and catheters within specific anatomical configurations, based on guidewire and navigation parameters. Using the High Order Proper Generalized Decomposition (HOPGD) method, these charts achieve accurate real-time predictions with errors below 5 % and a response time of <span><math><msup><mn>10</mn><mrow><mo>−</mo><mn>3</mn></mrow></msup></math></span> seconds, based on a limited number of preliminary high-fidelity computations. These findings could significantly contribute to the development of clinically applicable methods to enhance endovascular procedures and the advance the broader field of neurovascular interventions.</div></div>","PeriodicalId":55458,"journal":{"name":"Artificial Intelligence in Medicine","volume":"161 ","pages":"Article 103080"},"PeriodicalIF":6.1,"publicationDate":"2025-02-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143378610","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
CircWaveDL: Modeling of optical coherence tomography images based on a new supervised tensor-based dictionary learning for classification of macular abnormalities CircWaveDL:基于新的监督张量字典学习的光学相干断层扫描图像建模,用于黄斑异常分类。
IF 6.1 2区 医学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-02-01 DOI: 10.1016/j.artmed.2024.103060
Roya Arian , Alireza Vard , Rahele Kafieh , Gerlind Plonka , Hossein Rabbani
Modeling Optical Coherence Tomography (OCT) images is crucial for numerous image processing applications and aids ophthalmologists in the early detection of macular abnormalities. Sparse representation-based models, particularly dictionary learning (DL), play a pivotal role in image modeling. Traditional DL methods often transform higher-order tensors into vectors and then aggregate them into a matrix, which overlooks the inherent multi-dimensional structure of the data. To address this limitation, tensor-based DL approaches have been introduced. In this study, we present a novel tensor-based DL algorithm, CircWaveDL, for OCT classification, where both the training data and the dictionary are modeled as higher-order tensors. We named our approach CircWaveDL to reflect the use of CircWave atoms for dictionary initialization, rather than random initialization. CircWave has previously shown effectiveness in OCT classification, making it a fitting basis function for our DL method. The algorithm employs CANDECOMP/PARAFAC (CP) decomposition to factorize each tensor into lower dimensions. We then learn a sub-dictionary for each class using its respective training tensor. For testing, a test tensor is reconstructed with each sub-dictionary, and each test B-scan is assigned to the class that yields the minimal residual error. To evaluate the model's generalizability, we tested it across three distinct databases. Additionally, we introduce a new heatmap generation technique based on averaging the most significant atoms of the learned sub-dictionaries. This approach highlights that selecting an appropriate sub-dictionary for reconstructing test B-scans improves reconstructions, emphasizing the distinctive features of different classes. CircWaveDL demonstrated strong generalizability across external validation datasets, outperforming previous classification methods. It achieved accuracies of 92.5 %, 86.1 %, and 89.3 % on datasets 1, 2, and 3, respectively, showcasing its efficacy in OCT image classification.
光学相干断层扫描(OCT)图像建模对于许多图像处理应用至关重要,并有助于眼科医生早期发现黄斑异常。基于稀疏表示的模型,特别是字典学习(DL),在图像建模中起着关键作用。传统的深度学习方法通常是将高阶张量转换为向量,然后将它们聚合成矩阵,这忽略了数据固有的多维结构。为了解决这一限制,引入了基于张量的深度学习方法。在这项研究中,我们提出了一种新的基于张量的DL算法CircWaveDL,用于OCT分类,其中训练数据和字典都被建模为高阶张量。我们将我们的方法命名为CircWaveDL,以反映使用CircWave原子进行字典初始化,而不是随机初始化。CircWave之前在OCT分类中显示了有效性,使其成为我们DL方法的拟合基函数。该算法采用CANDECOMP/PARAFAC (CP)分解将每个张量分解成较低的维数。然后,我们使用每个类各自的训练张量为其学习子字典。对于测试,使用每个子字典重构一个测试张量,并将每个测试b扫描分配给产生最小残差的类。为了评估模型的通用性,我们在三个不同的数据库中对其进行了测试。此外,我们还引入了一种新的热图生成技术,该技术基于对学习到的子字典中最重要的原子进行平均。这种方法强调选择合适的子字典来重建测试b扫描可以改善重建,强调不同类的独特特征。CircWaveDL在外部验证数据集上表现出很强的泛化能力,优于以前的分类方法。它在数据集1、2和3上分别达到92.5%、86.1%和89.3%的准确率,显示了它在OCT图像分类中的有效性。
{"title":"CircWaveDL: Modeling of optical coherence tomography images based on a new supervised tensor-based dictionary learning for classification of macular abnormalities","authors":"Roya Arian ,&nbsp;Alireza Vard ,&nbsp;Rahele Kafieh ,&nbsp;Gerlind Plonka ,&nbsp;Hossein Rabbani","doi":"10.1016/j.artmed.2024.103060","DOIUrl":"10.1016/j.artmed.2024.103060","url":null,"abstract":"<div><div>Modeling Optical Coherence Tomography (OCT) images is crucial for numerous image processing applications and aids ophthalmologists in the early detection of macular abnormalities. Sparse representation-based models, particularly dictionary learning (DL), play a pivotal role in image modeling. Traditional DL methods often transform higher-order tensors into vectors and then aggregate them into a matrix, which overlooks the inherent multi-dimensional structure of the data. To address this limitation, tensor-based DL approaches have been introduced. In this study, we present a novel tensor-based DL algorithm, CircWaveDL, for OCT classification, where both the training data and the dictionary are modeled as higher-order tensors. We named our approach CircWaveDL to reflect the use of CircWave atoms for dictionary initialization, rather than random initialization. CircWave has previously shown effectiveness in OCT classification, making it a fitting basis function for our DL method. The algorithm employs CANDECOMP/PARAFAC (CP) decomposition to factorize each tensor into lower dimensions. We then learn a sub-dictionary for each class using its respective training tensor. For testing, a test tensor is reconstructed with each sub-dictionary, and each test B-scan is assigned to the class that yields the minimal residual error. To evaluate the model's generalizability, we tested it across three distinct databases. Additionally, we introduce a new heatmap generation technique based on averaging the most significant atoms of the learned sub-dictionaries. This approach highlights that selecting an appropriate sub-dictionary for reconstructing test B-scans improves reconstructions, emphasizing the distinctive features of different classes. CircWaveDL demonstrated strong generalizability across external validation datasets, outperforming previous classification methods. It achieved accuracies of 92.5 %, 86.1 %, and 89.3 % on datasets 1, 2, and 3, respectively, showcasing its efficacy in OCT image classification.</div></div>","PeriodicalId":55458,"journal":{"name":"Artificial Intelligence in Medicine","volume":"160 ","pages":"Article 103060"},"PeriodicalIF":6.1,"publicationDate":"2025-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142973365","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Disentangled global and local features of multi-source data variational autoencoder: An interpretable model for diagnosing IgAN via multi-source Raman spectral fusion techniques 多源数据变分自编码器的解纠缠全局和局部特征:基于多源拉曼光谱融合技术诊断IgAN的可解释模型。
IF 6.1 2区 医学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-02-01 DOI: 10.1016/j.artmed.2024.103053
Wei Shuai , Xuecong Tian , Enguang Zuo , Xueqin Zhang , Chen Lu , Jin Gu , Chen Chen , Xiaoyi Lv , Cheng Chen
A single Raman spectrum reflects limited molecular information. Effective fusion of the Raman spectra of serum and urine source domains helps to obtain richer feature information. However, most of the current studies on immunoglobulin A nephropathy (IgAN) based on Raman spectroscopy are based on small sample data and low signal-to-noise ratio. If a multi-source data fusion strategy is directly adopted, it may even reduce the accuracy of disease diagnosis. To this end, this paper proposes a data enhancement and spectral optimization method based on variational autoencoders to obtain reconstructed Raman spectra with doubled sample size and improved signal-to-noise ratio. In the diagnosis of IgAN in multi-source domain Raman spectra, this paper builds a global and local feature decoupled variational autoencoder (DMSGL-VAE) model based on multi-source data. First, the statistical features after spectral segmentation are extracted, and the latent variables obtained by the variational encoder are decoupled through the decoupling module. The global representation and local representation obtained represent the global shared information and local unique information of the serum and urine source domains, respectively. Then, the cross-source reconstruction loss and decoupling loss are used to constrain the decoupling, and the effectiveness of the decoupling is proved quantitatively and qualitatively. Finally, the features of different source domains were integrated to diagnose IgAN, and the results were analyzed for important features using the SHapley Additive exPlanations algorithm. The experimental results showed that the AUC value of the DMSGL-VAE model for diagnosing IgAN on the test set was as high as 0.9958. The SHAP algorithm was used to further prove that proteins, hydroxybutyrate, and guanine are likely to be common biological fingerprint substances for the diagnosis of IgAN by serum and urine Raman spectroscopy. In summary, the DMSGL-VAE model designed based on Raman spectroscopy in this paper can achieve rapid, non-invasive, and accurate screening of IgAN in terms of classification performance. And interpretable analysis may help doctors further understand IgAN and make more efficient diagnostic measures in the future.
单个拉曼光谱反映有限的分子信息。有效融合血清和尿液源域拉曼光谱有助于获得更丰富的特征信息。然而,目前基于拉曼光谱的免疫球蛋白A肾病(IgAN)研究大多基于小样本数据和低信噪比。如果直接采用多源数据融合策略,甚至可能降低疾病诊断的准确性。为此,本文提出了一种基于变分自编码器的数据增强和光谱优化方法,以获得双倍样本量和提高信噪比的重构拉曼光谱。在多源域拉曼光谱IgAN诊断中,建立了基于多源数据的全局和局部特征解耦变分自编码器(DMSGL-VAE)模型。首先提取光谱分割后的统计特征,通过解耦模块对变分编码器得到的潜变量进行解耦;得到的全局表示和局部表示分别表示血清源域和尿源域的全局共享信息和局部唯一信息。然后,利用交叉源重构损耗和去耦损耗对去耦进行约束,定量和定性地证明了去耦的有效性。最后,综合不同源域的特征进行IgAN诊断,并利用SHapley加性解释算法对诊断结果进行重要特征分析。实验结果表明,DMSGL-VAE模型在测试集上诊断IgAN的AUC值高达0.9958。利用SHAP算法进一步证明蛋白质、羟丁酸盐和鸟嘌呤可能是血清和尿液拉曼光谱诊断IgAN的常见生物指纹物质。综上所述,本文基于拉曼光谱设计的DMSGL-VAE模型在分类性能上可以实现对IgAN的快速、无创、准确筛选。可解释的分析可以帮助医生进一步了解IgAN,并在未来制定更有效的诊断措施。
{"title":"Disentangled global and local features of multi-source data variational autoencoder: An interpretable model for diagnosing IgAN via multi-source Raman spectral fusion techniques","authors":"Wei Shuai ,&nbsp;Xuecong Tian ,&nbsp;Enguang Zuo ,&nbsp;Xueqin Zhang ,&nbsp;Chen Lu ,&nbsp;Jin Gu ,&nbsp;Chen Chen ,&nbsp;Xiaoyi Lv ,&nbsp;Cheng Chen","doi":"10.1016/j.artmed.2024.103053","DOIUrl":"10.1016/j.artmed.2024.103053","url":null,"abstract":"<div><div>A single Raman spectrum reflects limited molecular information. Effective fusion of the Raman spectra of serum and urine source domains helps to obtain richer feature information. However, most of the current studies on immunoglobulin A nephropathy (IgAN) based on Raman spectroscopy are based on small sample data and low signal-to-noise ratio. If a multi-source data fusion strategy is directly adopted, it may even reduce the accuracy of disease diagnosis. To this end, this paper proposes a data enhancement and spectral optimization method based on variational autoencoders to obtain reconstructed Raman spectra with doubled sample size and improved signal-to-noise ratio. In the diagnosis of IgAN in multi-source domain Raman spectra, this paper builds a global and local feature decoupled variational autoencoder (DMSGL-VAE) model based on multi-source data. First, the statistical features after spectral segmentation are extracted, and the latent variables obtained by the variational encoder are decoupled through the decoupling module. The global representation and local representation obtained represent the global shared information and local unique information of the serum and urine source domains, respectively. Then, the cross-source reconstruction loss and decoupling loss are used to constrain the decoupling, and the effectiveness of the decoupling is proved quantitatively and qualitatively. Finally, the features of different source domains were integrated to diagnose IgAN, and the results were analyzed for important features using the SHapley Additive exPlanations algorithm. The experimental results showed that the AUC value of the DMSGL-VAE model for diagnosing IgAN on the test set was as high as 0.9958. The SHAP algorithm was used to further prove that proteins, hydroxybutyrate, and guanine are likely to be common biological fingerprint substances for the diagnosis of IgAN by serum and urine Raman spectroscopy. In summary, the DMSGL-VAE model designed based on Raman spectroscopy in this paper can achieve rapid, non-invasive, and accurate screening of IgAN in terms of classification performance. And interpretable analysis may help doctors further understand IgAN and make more efficient diagnostic measures in the future.</div></div>","PeriodicalId":55458,"journal":{"name":"Artificial Intelligence in Medicine","volume":"160 ","pages":"Article 103053"},"PeriodicalIF":6.1,"publicationDate":"2025-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142866489","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Leveraging deep-learning and unconventional data for real-time surveillance, forecasting, and early warning of respiratory pathogens outbreak
IF 6.1 2区 医学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-02-01 DOI: 10.1016/j.artmed.2025.103076
Z. Movahedi Nia , L. Seyyed-Kalantari , M. Goitom , B. Mellado , A. Ahmadi , A. Asgary , J. Orbinski , J. Wu , J.D. Kong

Background

Controlling re-emerging outbreaks such as COVID-19 is a critical concern to global health. Disease forecasting solutions are extremely beneficial to public health emergency management. This work aims to design and deploy a framework for real-time surveillance, prediction, forecasting, and early warning of respiratory disease. To this end, we selected southern African countries and Canadian provinces, along with COVID-19 and influenza as our case studies.

Methodology

Six different datasets were collected for different provinces of Canada: number of influenza cases, number of COVID-19 cases, Google Trends, Reddit posts, satellite air quality data, and weather data. Moreover, five different data sources were collected for southern African countries whose COVID-19 number of cases were significantly correlated with each other: number of COVID-19 infections, Google Trends, Wiki Trends, Google News, and satellite air quality data. For each infectious disease, i.e. COVID-19 and Influenza for Canada and COVID-19 for southern African countries, data was processed, scaled, and fed into the deep learning model which included four layers, namely, a Convolutional Neural Network (CNN), a Graph Neural Network (GNN), a Gated Recurrent Unit (GRU), and a linear Neural Network (NN). Hyperparameters were optimized to provide an accurate 56-day-ahead prediction of the number of cases.

Result

The accuracy of our models in real-time surveillance, prediction, forecasting, and early warning of respiratory diseases are evaluated against state-of-the-art models, through Root Mean Square Error (RMSE), coefficient of determination (R2-score), and correlation coefficient. Our model improves R2-score, RMSE, and correlation by up to 55.98 %, 39.71 %, and 44.47 % for 56 days-ahead COVID-19 prediction in Ontario, 34.87 %, 25.52 %, 50.91 % for 8 weeks-ahead influenza prediction in Quebec, and 51.04 %, 32.04 %, and 28.74 % for 56 days-ahead COVID-19 prediction in South Africa, respectively.

Conclusion

This work presents a framework that automatically collects data from unconventional sources, and builds an early warning system for COVID-19 and influenza outbreaks. The result is extremely helpful to policy-makers and health officials for preparedness and rapid response against future outbreaks.
{"title":"Leveraging deep-learning and unconventional data for real-time surveillance, forecasting, and early warning of respiratory pathogens outbreak","authors":"Z. Movahedi Nia ,&nbsp;L. Seyyed-Kalantari ,&nbsp;M. Goitom ,&nbsp;B. Mellado ,&nbsp;A. Ahmadi ,&nbsp;A. Asgary ,&nbsp;J. Orbinski ,&nbsp;J. Wu ,&nbsp;J.D. Kong","doi":"10.1016/j.artmed.2025.103076","DOIUrl":"10.1016/j.artmed.2025.103076","url":null,"abstract":"<div><h3>Background</h3><div>Controlling re-emerging outbreaks such as COVID-19 is a critical concern to global health. Disease forecasting solutions are extremely beneficial to public health emergency management. This work aims to design and deploy a framework for real-time surveillance, prediction, forecasting, and early warning of respiratory disease. To this end, we selected southern African countries and Canadian provinces, along with COVID-19 and influenza as our case studies.</div></div><div><h3>Methodology</h3><div>Six different datasets were collected for different provinces of Canada: number of influenza cases, number of COVID-19 cases, Google Trends, Reddit posts, satellite air quality data, and weather data. Moreover, five different data sources were collected for southern African countries whose COVID-19 number of cases were significantly correlated with each other: number of COVID-19 infections, Google Trends, Wiki Trends, Google News, and satellite air quality data. For each infectious disease, i.e. COVID-19 and Influenza for Canada and COVID-19 for southern African countries, data was processed, scaled, and fed into the deep learning model which included four layers, namely, a Convolutional Neural Network (CNN), a Graph Neural Network (GNN), a Gated Recurrent Unit (GRU), and a linear Neural Network (NN). Hyperparameters were optimized to provide an accurate 56-day-ahead prediction of the number of cases.</div></div><div><h3>Result</h3><div>The accuracy of our models in real-time surveillance, prediction, forecasting, and early warning of respiratory diseases are evaluated against state-of-the-art models, through Root Mean Square Error (RMSE), coefficient of determination (R2-score), and correlation coefficient. Our model improves R2-score, RMSE, and correlation by up to 55.98 %, 39.71 %, and 44.47 % for 56 days-ahead COVID-19 prediction in Ontario, 34.87 %, 25.52 %, 50.91 % for 8 weeks-ahead influenza prediction in Quebec, and 51.04 %, 32.04 %, and 28.74 % for 56 days-ahead COVID-19 prediction in South Africa, respectively.</div></div><div><h3>Conclusion</h3><div>This work presents a framework that automatically collects data from unconventional sources, and builds an early warning system for COVID-19 and influenza outbreaks. The result is extremely helpful to policy-makers and health officials for preparedness and rapid response against future outbreaks.</div></div>","PeriodicalId":55458,"journal":{"name":"Artificial Intelligence in Medicine","volume":"161 ","pages":"Article 103076"},"PeriodicalIF":6.1,"publicationDate":"2025-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143167903","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
ItpCtrl-AI: End-to-end interpretable and controllable artificial intelligence by modeling radiologists’ intentions ItpCtrl-AI:通过模拟放射科医生的意图,实现端到端可解释和可控的人工智能。
IF 6.1 2区 医学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-02-01 DOI: 10.1016/j.artmed.2024.103054
Trong-Thang Pham , Jacob Brecheisen , Carol C. Wu , Hien Nguyen , Zhigang Deng , Donald Adjeroh , Gianfranco Doretto , Arabinda Choudhary , Ngan Le
Using Deep Learning in computer-aided diagnosis systems has been of great interest due to its impressive performance in the general domain and medical domain. However, a notable challenge is the lack of explainability of many advanced models, which poses risks in critical applications such as diagnosing findings in CXR. To address this problem, we propose ItpCtrl-AI, a novel end-to-end interpretable and controllable framework that mirrors the decision-making process of the radiologist. By emulating the eye gaze patterns of radiologists, our framework initially determines the focal areas and assesses the significance of each pixel within those regions. As a result, the model generates an attention heatmap representing radiologists’ attention, which is then used to extract attended visual information to diagnose the findings. By allowing the directional input, our framework is controllable by the user. Furthermore, by displaying the eye gaze heatmap which guides the diagnostic conclusion, the underlying rationale behind the model’s decision is revealed, thereby making it interpretable.
In addition to developing an interpretable and controllable framework, our work includes the creation of a dataset, named Diagnosed-Gaze++, which aligns medical findings with eye gaze data. Our extensive experimentation validates the effectiveness of our approach in generating accurate attention heatmaps and diagnoses. The experimental results show that our model not only accurately identifies medical findings but also precisely produces the eye gaze attention of radiologists. The dataset, models, and source code will be made publicly available upon acceptance.
由于深度学习在通用领域和医学领域的出色表现,在计算机辅助诊断系统中使用深度学习已经引起了人们的极大兴趣。然而,一个值得注意的挑战是许多高级模型缺乏可解释性,这对关键应用(如CXR中的诊断结果)构成了风险。为了解决这个问题,我们提出了ItpCtrl-AI,这是一个新的端到端可解释和可控的框架,反映了放射科医生的决策过程。通过模拟放射科医生的眼睛注视模式,我们的框架最初确定焦点区域,并评估这些区域内每个像素的重要性。结果,该模型生成了一个代表放射科医生注意力的注意力热图,然后用于提取被关注的视觉信息来诊断发现。通过允许方向输入,我们的框架是由用户控制的。此外,通过显示引导诊断结论的眼睛注视热图,揭示了模型决策背后的基本原理,从而使其具有可解释性。除了开发一个可解释和可控的框架外,我们的工作还包括创建一个名为“诊断-凝视++”的数据集,该数据集将医学发现与眼睛凝视数据结合起来。我们广泛的实验验证了我们的方法在生成准确的注意力热图和诊断方面的有效性。实验结果表明,该模型不仅能准确地识别医学特征,而且能准确地引起放射科医生的目光注意。数据集、模型和源代码将在接受后公开提供。
{"title":"ItpCtrl-AI: End-to-end interpretable and controllable artificial intelligence by modeling radiologists’ intentions","authors":"Trong-Thang Pham ,&nbsp;Jacob Brecheisen ,&nbsp;Carol C. Wu ,&nbsp;Hien Nguyen ,&nbsp;Zhigang Deng ,&nbsp;Donald Adjeroh ,&nbsp;Gianfranco Doretto ,&nbsp;Arabinda Choudhary ,&nbsp;Ngan Le","doi":"10.1016/j.artmed.2024.103054","DOIUrl":"10.1016/j.artmed.2024.103054","url":null,"abstract":"<div><div>Using Deep Learning in computer-aided diagnosis systems has been of great interest due to its impressive performance in the general domain and medical domain. However, a notable challenge is the lack of explainability of many advanced models, which poses risks in critical applications such as diagnosing findings in CXR. To address this problem, we propose ItpCtrl-AI, a novel <em>end-to-end interpretable and controllable framework that mirrors the decision-making process of the radiologist</em>. By emulating the eye gaze patterns of radiologists, our framework initially determines the focal areas and assesses the significance of each pixel within those regions. As a result, the model generates an attention heatmap representing radiologists’ attention, which is then used to extract attended visual information to diagnose the findings. By allowing the directional input, our framework is controllable by the user. Furthermore, by displaying the eye gaze heatmap which guides the diagnostic conclusion, the underlying rationale behind the model’s decision is revealed, thereby making it interpretable.</div><div>In addition to developing an interpretable and controllable framework, our work includes the creation of a dataset, named Diagnosed-Gaze++, which aligns medical findings with eye gaze data. Our extensive experimentation validates the effectiveness of our approach in generating accurate attention heatmaps and diagnoses. The experimental results show that our model not only accurately identifies medical findings but also precisely produces the eye gaze attention of radiologists. The dataset, models, and source code will be made publicly available upon acceptance.</div></div>","PeriodicalId":55458,"journal":{"name":"Artificial Intelligence in Medicine","volume":"160 ","pages":"Article 103054"},"PeriodicalIF":6.1,"publicationDate":"2025-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142848409","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
AI-enabled clinical decision support tools for mental healthcare: A product review 用于精神卫生保健的人工智能临床决策支持工具:产品综述。
IF 6.1 2区 医学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-02-01 DOI: 10.1016/j.artmed.2024.103052
Anne-Kathrin Kleine , Eesha Kokje , Pia Hummelsberger , Eva Lermer , Insa Schaffernak , Susanne Gaube
The review seeks to promote transparency in the availability of regulated AI-enabled Clinical Decision Support Systems (AI-CDSS) for mental healthcare. From 84 potential products, seven fulfilled the inclusion criteria. The products can be categorized into three major areas: diagnosis of autism spectrum disorder (ASD) based on clinical history, behavioral, and eye-tracking data; diagnosis of multiple disorders based on conversational data; and medication selection based on clinical history and genetic data. We found five scientific articles evaluating the devices' performance and external validity. The average completeness of reporting, indicated by 52 % adherence to the Consolidated Standards of Reporting Trials Artificial Intelligence (CONSORT-AI) checklist, was modest, signaling room for improvement in reporting quality. Our findings stress the importance of obtaining regulatory approval, adhering to scientific standards, and staying up-to-date with the latest changes in the regulatory landscape. Refining regulatory guidelines and implementing effective tracking systems for AI-CDSS could enhance transparency and oversight in the field.
该审查旨在提高用于精神卫生保健的受监管的人工智能临床决策支持系统(AI-CDSS)可用性的透明度。从84个潜在产品中,有7个符合纳入标准。这些产品可以分为三个主要领域:基于临床病史、行为和眼动追踪数据的自闭症谱系障碍(ASD)诊断;基于会话数据的多种疾病诊断以及基于临床病史和基因数据的药物选择。我们找到了五篇科学文章来评估设备的性能和外部有效性。报告的平均完整性,通过52%的遵守报告试验综合标准人工智能(consortium - ai)检查表来表示,是适度的,表明报告质量有改进的空间。我们的研究结果强调了获得监管机构批准、坚持科学标准以及紧跟监管环境最新变化的重要性。完善人工智能cdss的监管准则和实施有效的跟踪系统可以提高该领域的透明度和监督。
{"title":"AI-enabled clinical decision support tools for mental healthcare: A product review","authors":"Anne-Kathrin Kleine ,&nbsp;Eesha Kokje ,&nbsp;Pia Hummelsberger ,&nbsp;Eva Lermer ,&nbsp;Insa Schaffernak ,&nbsp;Susanne Gaube","doi":"10.1016/j.artmed.2024.103052","DOIUrl":"10.1016/j.artmed.2024.103052","url":null,"abstract":"<div><div>The review seeks to promote transparency in the availability of regulated AI-enabled Clinical Decision Support Systems (AI-CDSS) for mental healthcare. From 84 potential products, seven fulfilled the inclusion criteria. The products can be categorized into three major areas: diagnosis of autism spectrum disorder (ASD) based on clinical history, behavioral, and eye-tracking data; diagnosis of multiple disorders based on conversational data; and medication selection based on clinical history and genetic data. We found five scientific articles evaluating the devices' performance and external validity. The average completeness of reporting, indicated by 52 % adherence to the Consolidated Standards of Reporting Trials Artificial Intelligence (CONSORT-AI) checklist, was modest, signaling room for improvement in reporting quality. Our findings stress the importance of obtaining regulatory approval, adhering to scientific standards, and staying up-to-date with the latest changes in the regulatory landscape. Refining regulatory guidelines and implementing effective tracking systems for AI-CDSS could enhance transparency and oversight in the field.</div></div>","PeriodicalId":55458,"journal":{"name":"Artificial Intelligence in Medicine","volume":"160 ","pages":"Article 103052"},"PeriodicalIF":6.1,"publicationDate":"2025-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142815086","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Design and use of a Denoising Convolutional Autoencoder for reconstructing electrocardiogram signals at super resolution 超分辨率重构心电图信号降噪卷积自编码器的设计与使用。
IF 6.1 2区 医学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-02-01 DOI: 10.1016/j.artmed.2024.103058
Ugo Lomoio , Pierangelo Veltri , Pietro Hiram Guzzi , Pietro Liò
Electrocardiogram signals play a pivotal role in cardiovascular diagnostics, providing essential information on electrical hearth activity. However, inherent noise and limited resolution can hinder an accurate interpretation of the recordings. In this paper an advanced Denoising Convolutional Autoencoder designed to process electrocardiogram signals, generating super-resolution reconstructions is proposed; this is followed by in-depth analysis of the enhanced signals. The autoencoder receives a signal window (of 5 s) sampled at 50 Hz (low resolution) as input and reconstructs a denoised super-resolution signal at 500 Hz. The proposed autoencoder is applied to publicly available datasets, demonstrating optimal performance in reconstructing high-resolution signals from very low-resolution inputs sampled at 50 Hz. The results were then compared with current state-of-the-art for electrocardiogram super-resolution, demonstrating the effectiveness of the proposed method. The method achieves a signal-to-noise ratio of 12.20 dB, a mean squared error of 0.0044, and a root mean squared error of 4.86%, which significantly outperforms current state-of-the-art alternatives. This framework can effectively enhance hidden information within signals, aiding in the detection of heart-related diseases.
心电图信号在心血管诊断中起着关键作用,提供了关于电床活动的基本信息。然而,固有的噪声和有限的分辨率会阻碍对记录的准确解释。本文提出了一种先进的去噪卷积自编码器,用于处理心电图信号,产生超分辨率重构;接下来是对增强信号的深入分析。自编码器接收50 Hz(低分辨率)采样的信号窗口(5秒)作为输入,并在500 Hz重建去噪的超分辨率信号。所提出的自编码器应用于公开可用的数据集,展示了从50 Hz采样的极低分辨率输入重建高分辨率信号的最佳性能。然后将结果与当前最先进的心电图超分辨率进行比较,证明了所提出方法的有效性。该方法的信噪比为12.20 dB,均方根误差为0.0044,均方根误差为4.86%,明显优于目前最先进的替代方法。这个框架可以有效地增强信号中的隐藏信息,帮助检测心脏相关疾病。
{"title":"Design and use of a Denoising Convolutional Autoencoder for reconstructing electrocardiogram signals at super resolution","authors":"Ugo Lomoio ,&nbsp;Pierangelo Veltri ,&nbsp;Pietro Hiram Guzzi ,&nbsp;Pietro Liò","doi":"10.1016/j.artmed.2024.103058","DOIUrl":"10.1016/j.artmed.2024.103058","url":null,"abstract":"<div><div>Electrocardiogram signals play a pivotal role in cardiovascular diagnostics, providing essential information on electrical hearth activity. However, inherent noise and limited resolution can hinder an accurate interpretation of the recordings. In this paper an advanced Denoising Convolutional Autoencoder designed to process electrocardiogram signals, generating super-resolution reconstructions is proposed; this is followed by in-depth analysis of the enhanced signals. The autoencoder receives a signal window (of 5 s) sampled at 50 Hz (low resolution) as input and reconstructs a denoised super-resolution signal at 500 Hz. The proposed autoencoder is applied to publicly available datasets, demonstrating optimal performance in reconstructing high-resolution signals from very low-resolution inputs sampled at 50 Hz. The results were then compared with current state-of-the-art for electrocardiogram super-resolution, demonstrating the effectiveness of the proposed method. The method achieves a signal-to-noise ratio of 12.20 dB, a mean squared error of 0.0044, and a root mean squared error of 4.86%, which significantly outperforms current state-of-the-art alternatives. This framework can effectively enhance hidden information within signals, aiding in the detection of heart-related diseases.</div></div>","PeriodicalId":55458,"journal":{"name":"Artificial Intelligence in Medicine","volume":"160 ","pages":"Article 103058"},"PeriodicalIF":6.1,"publicationDate":"2025-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142915779","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Artificial Intelligence in Medicine
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1