Pub Date : 2024-08-22DOI: 10.1016/j.artmed.2024.102968
Leo Huang , Wai Hoh Tang , Rahman Attar , Claudia Gore , Hywel C. Williams , Adnan Custovic , Reiko J. Tanaka
Various studies have been published on the remote assessment of eczema severity from digital camera images. Successful deployment of an accurate and robust AI-powered tool for such purposes can aid the formulation of eczema treatment plans and assist in patient monitoring. This review aims to provide an overview of the quality of published studies on this topic and to identify challenges and suggestions to improve the robustness and reliability of existing tools. We identified 25 articles from the Scopus database that aimed to assess eczema severity automatically from digital camera images by eczema area detection (n=13), which is important for prior delineation of the most relevant clinical features, and/or severity prediction (n=12). Deep learning methods (n=14) were more commonly used in recent years over conventional machine learning (n=11). A set of 20 pre-defined criteria were used for critical appraisal in this study. Study quality was hindered in many cases due to dataset challenges, with only 28% of studies reporting patient age range and 16% reporting skin phototype range. Furthermore, 52% of studies utilised solely non-public datasets and only 17% provided open-source access to code repositories, making validation of experimental results a significant challenge. In terms of algorithm design, attempts to improve model accuracy and process automation are widely reported. However, there remains limited implementation of methods for explicitly improving model trustworthiness and robustness. There is a need for a high-quality dataset with a sufficient number of bias-free images and consistent labels, as well as improved image analytics methods, to enhance the state of remote eczema severity assessment algorithms. Improving the interpretability and explainability of developed tools will further improve long-term reliability and trustworthiness.
{"title":"Remote Assessment of Eczema Severity via AI-powered Skin Image Analytics: A Systematic Review","authors":"Leo Huang , Wai Hoh Tang , Rahman Attar , Claudia Gore , Hywel C. Williams , Adnan Custovic , Reiko J. Tanaka","doi":"10.1016/j.artmed.2024.102968","DOIUrl":"10.1016/j.artmed.2024.102968","url":null,"abstract":"<div><p>Various studies have been published on the remote assessment of eczema severity from digital camera images. Successful deployment of an accurate and robust AI-powered tool for such purposes can aid the formulation of eczema treatment plans and assist in patient monitoring. This review aims to provide an overview of the quality of published studies on this topic and to identify challenges and suggestions to improve the robustness and reliability of existing tools. We identified 25 articles from the Scopus database that aimed to assess eczema severity automatically from digital camera images by eczema area detection (<em>n</em>=13), which is important for prior delineation of the most relevant clinical features, and/or severity prediction (<em>n</em>=12). Deep learning methods (<em>n</em>=14) were more commonly used in recent years over conventional machine learning (<em>n</em>=11). A set of 20 pre-defined criteria were used for critical appraisal in this study. Study quality was hindered in many cases due to dataset challenges, with only 28% of studies reporting patient age range and 16% reporting skin phototype range. Furthermore, 52% of studies utilised solely non-public datasets and only 17% provided open-source access to code repositories, making validation of experimental results a significant challenge. In terms of algorithm design, attempts to improve model accuracy and process automation are widely reported. However, there remains limited implementation of methods for explicitly improving model trustworthiness and robustness. There is a need for a high-quality dataset with a sufficient number of bias-free images and consistent labels, as well as improved image analytics methods, to enhance the state of remote eczema severity assessment algorithms. Improving the interpretability and explainability of developed tools will further improve long-term reliability and trustworthiness.</p></div>","PeriodicalId":55458,"journal":{"name":"Artificial Intelligence in Medicine","volume":"156 ","pages":"Article 102968"},"PeriodicalIF":6.1,"publicationDate":"2024-08-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S0933365724002100/pdfft?md5=6f68224a44ecdb3cc5e6bc9ec5d98ae9&pid=1-s2.0-S0933365724002100-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142097633","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-08-20DOI: 10.1016/j.artmed.2024.102951
Zeynep Hilal Kilimci, Mustafa Yalcin
Anticancer peptides (ACPs) are a class of molecules that have gained significant attention in the field of cancer research and therapy. ACPs are short chains of amino acids, the building blocks of proteins, and they possess the ability to selectively target and kill cancer cells. One of the key advantages of ACPs is their ability to selectively target cancer cells while sparing healthy cells to a greater extent. This selectivity is often attributed to differences in the surface properties of cancer cells compared to normal cells. That is why ACPs are being investigated as potential candidates for cancer therapy. ACPs may be used alone or in combination with other treatment modalities like chemotherapy and radiation therapy. While ACPs hold promise as a novel approach to cancer treatment, there are challenges to overcome, including optimizing their stability, improving selectivity, and enhancing their delivery to cancer cells, continuous increasing in number of peptide sequences, developing a reliable and precise prediction model. In this work, we propose an efficient transformer-based framework to identify ACPs for by performing accurate a reliable and precise prediction model. For this purpose, four different transformer models, namely ESM, ProtBERT, BioBERT, and SciBERT are employed to detect ACPs from amino acid sequences. To demonstrate the contribution of the proposed framework, extensive experiments are carried on widely-used datasets in the literature, two versions of AntiCp2, cACP-DeepGram, ACP-740. Experiment results show the usage of proposed model enhances classification accuracy when compared to the literature studies. The proposed framework, ESM, exhibits 96.45% of accuracy for AntiCp2 dataset, 97.66% of accuracy for cACP-DeepGram dataset, and 88.51% of accuracy for ACP-740 dataset, thence determining new state-of-the-art. The code of proposed framework is publicly available at github (https://github.com/mstf-yalcin/acp-esm).
{"title":"ACP-ESM: A novel framework for classification of anticancer peptides using protein-oriented transformer approach","authors":"Zeynep Hilal Kilimci, Mustafa Yalcin","doi":"10.1016/j.artmed.2024.102951","DOIUrl":"10.1016/j.artmed.2024.102951","url":null,"abstract":"<div><p>Anticancer peptides (ACPs) are a class of molecules that have gained significant attention in the field of cancer research and therapy. ACPs are short chains of amino acids, the building blocks of proteins, and they possess the ability to selectively target and kill cancer cells. One of the key advantages of ACPs is their ability to selectively target cancer cells while sparing healthy cells to a greater extent. This selectivity is often attributed to differences in the surface properties of cancer cells compared to normal cells. That is why ACPs are being investigated as potential candidates for cancer therapy. ACPs may be used alone or in combination with other treatment modalities like chemotherapy and radiation therapy. While ACPs hold promise as a novel approach to cancer treatment, there are challenges to overcome, including optimizing their stability, improving selectivity, and enhancing their delivery to cancer cells, continuous increasing in number of peptide sequences, developing a reliable and precise prediction model. In this work, we propose an efficient transformer-based framework to identify ACPs for by performing accurate a reliable and precise prediction model. For this purpose, four different transformer models, namely ESM, ProtBERT, BioBERT, and SciBERT are employed to detect ACPs from amino acid sequences. To demonstrate the contribution of the proposed framework, extensive experiments are carried on widely-used datasets in the literature, two versions of AntiCp2, cACP-DeepGram, ACP-740. Experiment results show the usage of proposed model enhances classification accuracy when compared to the literature studies. The proposed framework, ESM, exhibits 96.45% of accuracy for AntiCp2 dataset, 97.66% of accuracy for cACP-DeepGram dataset, and 88.51% of accuracy for ACP-740 dataset, thence determining new state-of-the-art. The code of proposed framework is publicly available at github (<span><span>https://github.com/mstf-yalcin/acp-esm</span><svg><path></path></svg></span>).</p></div>","PeriodicalId":55458,"journal":{"name":"Artificial Intelligence in Medicine","volume":"156 ","pages":"Article 102951"},"PeriodicalIF":6.1,"publicationDate":"2024-08-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142021583","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-08-20DOI: 10.1016/j.artmed.2024.102962
Álvaro Torres-Martos , Augusto Anguita-Ruiz , Mireia Bustos-Aibar , Alberto Ramírez-Mena , María Arteaga , Gloria Bueno , Rosaura Leis , Concepción M. Aguilera , Rafael Alcalá , Jesús Alcalá-Fdez
Pediatric obesity can drastically heighten the risk of cardiometabolic alterations later in life, with insulin resistance standing as the cornerstone linking adiposity to the increased cardiovascular risk. Puberty has been pointed out as a critical stage after which obesity-associated insulin resistance is more difficult to revert. Timely prediction of insulin resistance in pediatric obesity is therefore vital for mitigating the risk of its associated comorbidities. The construction of effective and robust predictive systems for a complex health outcome like insulin resistance during the early stages of life demands the adoption of longitudinal designs for more causal inferences, and the integration of factors of varying nature involved in its onset. In this work, we propose an eXplainable Artificial Intelligence-based decision support pipeline for early diagnosis of insulin resistance in a longitudinal cohort of 90 children. For that, we leverage multi-omics (genomics and epigenomics) and clinical data from the pre-pubertal stage. Different data layers combinations, pre-processing techniques (missing values, feature selection, class imbalance, etc.), algorithms, training procedures were considered following good practices for Machine Learning. SHapley Additive exPlanations were provided for specialists to understand both the decision-making mechanisms of the system and the impact of the features on each automatic decision, an essential issue in high-risk areas such as this one where system decisions may affect people’s lives. The system showed a relevant predictive ability (AUC and G-mean of 0.92). A deep exploration, both at the global and the local level, revealed promising biomarkers of insulin resistance in our population, highlighting classical markers, such as Body Mass Index z-score or leptin/adiponectin ratio, and novel ones such as methylation patterns of relevant genes, such as HDAC4, PTPRN2, MATN2, RASGRF1 and EBF1. Our findings highlight the importance of integrating multi-omics data and following eXplainable Artificial Intelligence trends when building decision support systems.
小儿肥胖会大大增加日后发生心血管代谢变化的风险,而胰岛素抵抗是将肥胖与心血管风险增加联系起来的基石。有人指出,青春期是一个关键阶段,过了青春期,肥胖引起的胰岛素抵抗就更难恢复。因此,及时预测小儿肥胖症的胰岛素抵抗对于降低其相关并发症的风险至关重要。要为生命早期阶段胰岛素抵抗这种复杂的健康结果构建有效、稳健的预测系统,就必须采用纵向设计来进行更多的因果推断,并整合导致胰岛素抵抗发生的各种因素。在这项工作中,我们提出了一种基于人工智能的可扩展决策支持管道,用于在 90 名儿童的纵向队列中对胰岛素抵抗进行早期诊断。为此,我们利用了青春期前阶段的多组学(基因组学和表观基因组学)和临床数据。根据机器学习的良好实践,我们考虑了不同的数据层组合、预处理技术(缺失值、特征选择、类不平衡等)、算法和训练程序。还为专家提供了 "SHapley Additive exPlanations",以便他们了解系统的决策机制以及特征对每个自动决策的影响,这在像本项目这样的高风险领域是一个至关重要的问题,因为系统的决策可能会影响到人们的生命。该系统显示了相关的预测能力(AUC 和 G 均值均为 0.92)。在全局和局部层面的深入探索揭示了我们人群中胰岛素抵抗的潜在生物标志物,其中既有经典标志物,如体重指数 z 值或瘦素/脂联素比率,也有新标志物,如相关基因的甲基化模式,如 HDAC4、PTPRN2、MATN2、RASGRF1 和 EBF1。我们的研究结果凸显了在构建决策支持系统时整合多组学数据和遵循易用人工智能趋势的重要性。
{"title":"Multiomics and eXplainable artificial intelligence for decision support in insulin resistance early diagnosis: A pediatric population-based longitudinal study","authors":"Álvaro Torres-Martos , Augusto Anguita-Ruiz , Mireia Bustos-Aibar , Alberto Ramírez-Mena , María Arteaga , Gloria Bueno , Rosaura Leis , Concepción M. Aguilera , Rafael Alcalá , Jesús Alcalá-Fdez","doi":"10.1016/j.artmed.2024.102962","DOIUrl":"10.1016/j.artmed.2024.102962","url":null,"abstract":"<div><p>Pediatric obesity can drastically heighten the risk of cardiometabolic alterations later in life, with insulin resistance standing as the cornerstone linking adiposity to the increased cardiovascular risk. Puberty has been pointed out as a critical stage after which obesity-associated insulin resistance is more difficult to revert. Timely prediction of insulin resistance in pediatric obesity is therefore vital for mitigating the risk of its associated comorbidities. The construction of effective and robust predictive systems for a complex health outcome like insulin resistance during the early stages of life demands the adoption of longitudinal designs for more causal inferences, and the integration of factors of varying nature involved in its onset. In this work, we propose an eXplainable Artificial Intelligence-based decision support pipeline for early diagnosis of insulin resistance in a longitudinal cohort of 90 children. For that, we leverage multi-omics (genomics and epigenomics) and clinical data from the pre-pubertal stage. Different data layers combinations, pre-processing techniques (missing values, feature selection, class imbalance, etc.), algorithms, training procedures were considered following good practices for Machine Learning. SHapley Additive exPlanations were provided for specialists to understand both the decision-making mechanisms of the system and the impact of the features on each automatic decision, an essential issue in high-risk areas such as this one where system decisions may affect people’s lives. The system showed a relevant predictive ability (AUC and G-mean of 0.92). A deep exploration, both at the global and the local level, revealed promising biomarkers of insulin resistance in our population, highlighting classical markers, such as Body Mass Index z-score or leptin/adiponectin ratio, and novel ones such as methylation patterns of relevant genes, such as <em>HDAC4</em>, <em>PTPRN2</em>, <em>MATN2</em>, <em>RASGRF1</em> and <em>EBF1</em>. Our findings highlight the importance of integrating multi-omics data and following eXplainable Artificial Intelligence trends when building decision support systems.</p></div>","PeriodicalId":55458,"journal":{"name":"Artificial Intelligence in Medicine","volume":"156 ","pages":"Article 102962"},"PeriodicalIF":6.1,"publicationDate":"2024-08-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S0933365724002045/pdfft?md5=8524adfb2927cd0dbcacbe5577c03b65&pid=1-s2.0-S0933365724002045-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142050034","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-08-20DOI: 10.1016/j.artmed.2024.102967
Yaoqian Sun , Lei Sang , Dan Wu , Shilin He , Yani Chen , Huilong Duan , Han Chen , Xudong Lu
Background
Assigning International Classification of Diseases (ICD) codes to clinical texts is a common and crucial practice in patient classification, hospital management, and further statistics analysis. Current auto-coding methods mainly transfer this task to a multi-label classification problem. Such solutions are suffering from high-dimensional mapping space and excessive redundant information in long clinical texts. To alleviate such a situation, we introduce text summarization methods to the ICD coding regime and apply text matching to select ICD codes.
Method
We focus on the tenth revision of the ICD (ICD-10) coding and design a novel summarization-based approach (SuM) with an end-to-end strategy to efficiently assign ICD-10 code to clinical texts. In this approach, a knowledge-guided pointer network is purposed to distill and summarize key information in clinical texts precisely. Then a matching model with matching-aggregation architecture follows to align the summary result with code, tuning the one-vs-all scenario to one-vs-one matching so that the large-label-space obstacle laid in classification approaches would be avoided.
Result
The 12,788 ICD-10 coded discharge summaries from a Chinese hospital were collected to evaluate the proposed approach. Compared with existing methods, the purposed model achieves the greatest coding results with Micro AUC of 0.9548, MRR@10 of 0.7977, Precision@10 of 0.0944, and Recall@10 of 0.9439 for the TOP-50 Dataset. Results on the FULL-Dataset remain consistent. Also, the proposed knowledge encoder and applied end-to-end strategy are proven to facilitate the whole model to gain efficacy in selecting the most suitable code.
Conclusion
The proposed automatic ICD-10 code assignment approach via text summarization can effectively capture critical messages in long clinical texts and improve the performance of ICD-10 coding of clinical texts.
{"title":"Enhanced ICD-10 code assignment of clinical texts: A summarization-based approach","authors":"Yaoqian Sun , Lei Sang , Dan Wu , Shilin He , Yani Chen , Huilong Duan , Han Chen , Xudong Lu","doi":"10.1016/j.artmed.2024.102967","DOIUrl":"10.1016/j.artmed.2024.102967","url":null,"abstract":"<div><h3>Background</h3><p>Assigning International Classification of Diseases (ICD) codes to clinical texts is a common and crucial practice in patient classification, hospital management, and further statistics analysis. Current auto-coding methods mainly transfer this task to a multi-label classification problem. Such solutions are suffering from high-dimensional mapping space and excessive redundant information in long clinical texts. To alleviate such a situation, we introduce text summarization methods to the ICD coding regime and apply text matching to select ICD codes.</p></div><div><h3>Method</h3><p>We focus on the tenth revision of the ICD (ICD-10) coding and design a novel summarization-based approach (SuM) with an end-to-end strategy to efficiently assign ICD-10 code to clinical texts. In this approach, a knowledge-guided pointer network is purposed to distill and summarize key information in clinical texts precisely. Then a matching model with matching-aggregation architecture follows to align the summary result with code, tuning the one-vs-all scenario to one-vs-one matching so that the large-label-space obstacle laid in classification approaches would be avoided.</p></div><div><h3>Result</h3><p>The 12,788 ICD-10 coded discharge summaries from a Chinese hospital were collected to evaluate the proposed approach. Compared with existing methods, the purposed model achieves the greatest coding results with Micro AUC of 0.9548, MRR@10 of 0.7977, Precision@10 of 0.0944, and Recall@10 of 0.9439 for the TOP-50 Dataset. Results on the FULL-Dataset remain consistent. Also, the proposed knowledge encoder and applied end-to-end strategy are proven to facilitate the whole model to gain efficacy in selecting the most suitable code.</p></div><div><h3>Conclusion</h3><p>The proposed automatic ICD-10 code assignment approach via text summarization can effectively capture critical messages in long clinical texts and improve the performance of ICD-10 coding of clinical texts.</p></div>","PeriodicalId":55458,"journal":{"name":"Artificial Intelligence in Medicine","volume":"156 ","pages":"Article 102967"},"PeriodicalIF":6.1,"publicationDate":"2024-08-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142088621","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-08-18DOI: 10.1016/j.artmed.2024.102961
Miao Liao , Shuanhu Di , Yuqian Zhao , Wei Liang , Zhen Yang
Dose prediction is a crucial step in automated radiotherapy planning for liver cancer. Several deep learning-based approaches for dose prediction have been proposed to enhance the design efficiency and quality of radiotherapy plan. However, these approaches usually take CT images and contours of organs at risk (OARs) and planning target volume (PTV) as a multi-channel input and is thus difficult to extract sufficient feature information from each input, which results in unsatisfactory dose distribution. In this paper, we propose a novel dose prediction network for liver cancer based on hierarchical feature fusion and interactive attention. A feature extraction module is first constructed to extract multi-scale features from different inputs, and a hierarchical feature fusion module is then built to fuse these multi-scale features hierarchically. A decoder based on attention mechanism is designed to gradually reconstruct the fused features into dose distribution. Additionally, we design an autoencoder network to generate a perceptual loss during training stage, which is used to improve the accuracy of dose prediction. The proposed method is tested on private clinical dataset and obtains HI and CI of 0.31 and 0.87, respectively. The experimental results are better than those by several existing methods, indicating that the dose distribution generated by the proposed method is close to that approved in clinics. The codes are available at https://github.com/hired-ld/FA-Net.
剂量预测是肝癌自动化放疗计划的关键步骤。目前已提出了几种基于深度学习的剂量预测方法,以提高放疗计划的设计效率和质量。然而,这些方法通常将CT图像以及危险器官(OAR)和计划靶体积(PTV)的轮廓作为多通道输入,因此很难从每个输入中提取足够的特征信息,从而导致剂量分布不理想。本文提出了一种基于分层特征融合和交互关注的新型肝癌剂量预测网络。首先构建一个特征提取模块,从不同输入中提取多尺度特征,然后构建一个分层特征融合模块,对这些多尺度特征进行分层融合。我们设计了一个基于注意力机制的解码器,将融合后的特征逐步重构为剂量分布。此外,我们还设计了一个自动编码器网络,在训练阶段产生感知损失,用于提高剂量预测的准确性。所提出的方法在私人临床数据集上进行了测试,得到的 HI 和 CI 分别为 0.31 和 0.87。实验结果优于现有的几种方法,表明所提方法生成的剂量分布接近临床认可的剂量分布。代码见 https://github.com/hired-ld/FA-Net。
{"title":"FA-Net: A hierarchical feature fusion and interactive attention-based network for dose prediction in liver cancer patients","authors":"Miao Liao , Shuanhu Di , Yuqian Zhao , Wei Liang , Zhen Yang","doi":"10.1016/j.artmed.2024.102961","DOIUrl":"10.1016/j.artmed.2024.102961","url":null,"abstract":"<div><p>Dose prediction is a crucial step in automated radiotherapy planning for liver cancer. Several deep learning-based approaches for dose prediction have been proposed to enhance the design efficiency and quality of radiotherapy plan. However, these approaches usually take CT images and contours of organs at risk (OARs) and planning target volume (PTV) as a multi-channel input and is thus difficult to extract sufficient feature information from each input, which results in unsatisfactory dose distribution. In this paper, we propose a novel dose prediction network for liver cancer based on hierarchical feature fusion and interactive attention. A feature extraction module is first constructed to extract multi-scale features from different inputs, and a hierarchical feature fusion module is then built to fuse these multi-scale features hierarchically. A decoder based on attention mechanism is designed to gradually reconstruct the fused features into dose distribution. Additionally, we design an autoencoder network to generate a perceptual loss during training stage, which is used to improve the accuracy of dose prediction. The proposed method is tested on private clinical dataset and obtains HI and CI of 0.31 and 0.87, respectively. The experimental results are better than those by several existing methods, indicating that the dose distribution generated by the proposed method is close to that approved in clinics. The codes are available at <span><span>https://github.com/hired-ld/FA-Net</span><svg><path></path></svg></span>.</p></div>","PeriodicalId":55458,"journal":{"name":"Artificial Intelligence in Medicine","volume":"156 ","pages":"Article 102961"},"PeriodicalIF":6.1,"publicationDate":"2024-08-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142050032","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-08-16DOI: 10.1016/j.artmed.2024.102949
Eva Pachetti , Sara Colantonio
The lack of annotated medical images limits the performance of deep learning models, which usually need large-scale labelled datasets. Few-shot learning techniques can reduce data scarcity issues and enhance medical image analysis speed and robustness. This systematic review gives a comprehensive overview of few-shot learning methods for medical image analysis, aiming to establish a standard methodological pipeline for future research reference. With a particular emphasis on the role of meta-learning, we analysed 80 relevant articles published from 2018 to 2023, conducting a risk of bias assessment and extracting relevant information, especially regarding the employed learning techniques. From this, we delineated a comprehensive methodological pipeline shared among all studies. In addition, we performed a statistical analysis of the studies’ results concerning the clinical task and the meta-learning method employed while also presenting supplemental information such as imaging modalities and model robustness evaluation techniques. We discussed the findings of our analysis, providing a deep insight into the limitations of the state-of-the-art methods and the most promising approaches. Drawing on our investigation, we yielded recommendations on potential future research directions aiming to bridge the gap between research and clinical practice.
{"title":"A systematic review of few-shot learning in medical imaging","authors":"Eva Pachetti , Sara Colantonio","doi":"10.1016/j.artmed.2024.102949","DOIUrl":"10.1016/j.artmed.2024.102949","url":null,"abstract":"<div><p>The lack of annotated medical images limits the performance of deep learning models, which usually need large-scale labelled datasets. Few-shot learning techniques can reduce data scarcity issues and enhance medical image analysis speed and robustness. This systematic review gives a comprehensive overview of few-shot learning methods for medical image analysis, aiming to establish a standard methodological pipeline for future research reference. With a particular emphasis on the role of meta-learning, we analysed 80 relevant articles published from 2018 to 2023, conducting a risk of bias assessment and extracting relevant information, especially regarding the employed learning techniques. From this, we delineated a comprehensive methodological pipeline shared among all studies. In addition, we performed a statistical analysis of the studies’ results concerning the clinical task and the meta-learning method employed while also presenting supplemental information such as imaging modalities and model robustness evaluation techniques. We discussed the findings of our analysis, providing a deep insight into the limitations of the state-of-the-art methods and the most promising approaches. Drawing on our investigation, we yielded recommendations on potential future research directions aiming to bridge the gap between research and clinical practice.</p></div>","PeriodicalId":55458,"journal":{"name":"Artificial Intelligence in Medicine","volume":"156 ","pages":"Article 102949"},"PeriodicalIF":6.1,"publicationDate":"2024-08-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S093336572400191X/pdfft?md5=7f631ad6f02006409c385680e691e86a&pid=1-s2.0-S093336572400191X-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142040237","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-08-15DOI: 10.1016/j.artmed.2024.102953
Alper Idrisoglu , Ana Luiza Dallora , Abbas Cheddad , Peter Anderberg , Andreas Jakobsson , Johan Sanmartin Berglund
<div><h3>Background</h3><p>Chronic obstructive pulmonary disease (COPD) is a severe condition affecting millions worldwide, leading to numerous annual deaths. The absence of significant symptoms in its early stages promotes high underdiagnosis rates for the affected people. Besides pulmonary function failure, another harmful problem of COPD is the systemic effects, e.g., heart failure or voice distortion. However, the systemic effects of COPD might provide valuable information for early detection. In other words, symptoms caused by systemic effects could be helpful to detect the condition in its early stages.</p></div><div><h3>Objective</h3><p>The proposed study aims to explore whether the voice features extracted from the vowel “a” utterance carry any information that can be predictive of COPD by employing Machine Learning (ML) on a newly collected voice dataset.</p></div><div><h3>Methods</h3><p>Forty-eight participants were recruited from the pool of research clinic visitors at Blekinge Institute of Technology (BTH) in Sweden between January 2022 and May 2023. A dataset consisting of 1246 recordings from 48 participants was gathered. The collection of voice recordings containing the vowel “a” utterance commenced following an information and consent meeting with each participant using the <em>VoiceDiagnostic</em> application. The collected voice data was subjected to silence segment removal, feature extraction of baseline acoustic features, and Mel Frequency Cepstrum Coefficients (MFCC). Sociodemographic data was also collected from the participants. Three ML models were investigated for the binary classification of COPD and healthy controls: Random Forest (RF), Support Vector Machine (SVM), and CatBoost (CB). A nested k-fold cross-validation approach was employed. Additionally, the hyperparameters were optimized using grid-search on each ML model. For best performance assessment, accuracy, F1-score, precision, and recall metrics were computed. Afterward, we further examined the best classifier by utilizing the Area Under the Curve (AUC), Average Precision (AP), and SHapley Additive exPlanations (SHAP) feature-importance measures.</p></div><div><h3>Results</h3><p>The classifiers RF, SVM, and CB achieved a maximum accuracy of 77 %, 69 %, and 78 % on the test set and 93 %, 78 % and 97 % on the validation set, respectively. The CB classifier outperformed RF and SVM. After further investigation of the best-performing classifier, CB demonstrated the highest performance, producing an AUC of 82 % and AP of 76 %. In addition to age and gender, the mean values of baseline acoustic and MFCC features demonstrate high importance and deterministic characteristics for classification performance in both test and validation sets, though in varied order.</p></div><div><h3>Conclusion</h3><p>This study concludes that the utterance of vowel “a” recordings contain information that can be captured by the CatBoost classifier with high accuracy for the classification o
背景 慢性阻塞性肺疾病(COPD)是一种严重的疾病,影响着全球数百万人,每年导致大量死亡。由于慢性阻塞性肺病早期没有明显症状,因此患者的诊断率很低。除肺功能衰竭外,慢性阻塞性肺病的另一个有害问题是全身影响,如心力衰竭或声音失真。然而,慢性阻塞性肺病的全身影响可能为早期检测提供有价值的信息。本研究旨在通过在新收集的语音数据集上使用机器学习(ML)技术,探讨从元音 "a "的语音中提取的语音特征是否包含任何可预测慢性阻塞性肺病的信息。方法在 2022 年 1 月至 2023 年 5 月期间,从瑞典布莱金厄理工学院(Blekinge Institute of Technology,BTH)的研究诊所访客中招募了 48 名参与者。数据集由 48 名参与者的 1246 份录音组成。在使用 VoiceDiagnostic 应用程序与每位参与者进行信息交流并征得同意后,开始收集包含元音 "a "的语音记录。收集到的语音数据经过了静音段去除、基线声学特征提取和梅尔频率倒频谱系数(MFCC)处理。此外,还收集了参与者的社会人口学数据。针对慢性阻塞性肺病和健康对照组的二元分类,研究了三种 ML 模型:随机森林 (RF)、支持向量机 (SVM) 和 CatBoost (CB)。采用了嵌套 k 倍交叉验证方法。此外,还在每个多模型上使用网格搜索对超参数进行了优化。为了评估最佳性能,我们计算了准确率、F1 分数、精确度和召回率指标。结果RF、SVM 和 CB 分类器在测试集上的最高准确率分别为 77%、69% 和 78%,在验证集上的最高准确率分别为 93%、78% 和 97%。CB 分类器的表现优于 RF 和 SVM。在对表现最好的分类器进行进一步研究后,CB 表现最好,其 AUC 为 82%,AP 为 76%。除了年龄和性别外,基线声学特征和 MFCC 特征的平均值在测试集和验证集中都显示出了对分类性能的高度重要性和确定性特征,尽管顺序有所不同。此外,基线声学和 MFCC 特征与年龄和性别信息相结合,可用于分类目的,并有利于医疗保健对慢性阻塞性肺病诊断的决策支持。
{"title":"COPDVD: Automated classification of chronic obstructive pulmonary disease on a new collected and evaluated voice dataset","authors":"Alper Idrisoglu , Ana Luiza Dallora , Abbas Cheddad , Peter Anderberg , Andreas Jakobsson , Johan Sanmartin Berglund","doi":"10.1016/j.artmed.2024.102953","DOIUrl":"10.1016/j.artmed.2024.102953","url":null,"abstract":"<div><h3>Background</h3><p>Chronic obstructive pulmonary disease (COPD) is a severe condition affecting millions worldwide, leading to numerous annual deaths. The absence of significant symptoms in its early stages promotes high underdiagnosis rates for the affected people. Besides pulmonary function failure, another harmful problem of COPD is the systemic effects, e.g., heart failure or voice distortion. However, the systemic effects of COPD might provide valuable information for early detection. In other words, symptoms caused by systemic effects could be helpful to detect the condition in its early stages.</p></div><div><h3>Objective</h3><p>The proposed study aims to explore whether the voice features extracted from the vowel “a” utterance carry any information that can be predictive of COPD by employing Machine Learning (ML) on a newly collected voice dataset.</p></div><div><h3>Methods</h3><p>Forty-eight participants were recruited from the pool of research clinic visitors at Blekinge Institute of Technology (BTH) in Sweden between January 2022 and May 2023. A dataset consisting of 1246 recordings from 48 participants was gathered. The collection of voice recordings containing the vowel “a” utterance commenced following an information and consent meeting with each participant using the <em>VoiceDiagnostic</em> application. The collected voice data was subjected to silence segment removal, feature extraction of baseline acoustic features, and Mel Frequency Cepstrum Coefficients (MFCC). Sociodemographic data was also collected from the participants. Three ML models were investigated for the binary classification of COPD and healthy controls: Random Forest (RF), Support Vector Machine (SVM), and CatBoost (CB). A nested k-fold cross-validation approach was employed. Additionally, the hyperparameters were optimized using grid-search on each ML model. For best performance assessment, accuracy, F1-score, precision, and recall metrics were computed. Afterward, we further examined the best classifier by utilizing the Area Under the Curve (AUC), Average Precision (AP), and SHapley Additive exPlanations (SHAP) feature-importance measures.</p></div><div><h3>Results</h3><p>The classifiers RF, SVM, and CB achieved a maximum accuracy of 77 %, 69 %, and 78 % on the test set and 93 %, 78 % and 97 % on the validation set, respectively. The CB classifier outperformed RF and SVM. After further investigation of the best-performing classifier, CB demonstrated the highest performance, producing an AUC of 82 % and AP of 76 %. In addition to age and gender, the mean values of baseline acoustic and MFCC features demonstrate high importance and deterministic characteristics for classification performance in both test and validation sets, though in varied order.</p></div><div><h3>Conclusion</h3><p>This study concludes that the utterance of vowel “a” recordings contain information that can be captured by the CatBoost classifier with high accuracy for the classification o","PeriodicalId":55458,"journal":{"name":"Artificial Intelligence in Medicine","volume":"156 ","pages":"Article 102953"},"PeriodicalIF":6.1,"publicationDate":"2024-08-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S0933365724001957/pdfft?md5=91d584b7c0bf3f6fbf86fae6ca0a54d6&pid=1-s2.0-S0933365724001957-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142117556","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
The advanced learning paradigm, learning using privileged information (LUPI), leverages information in training that is not present at the time of prediction. In this study, we developed privileged logistic regression (PLR) models under the LUPI paradigm to detect acute respiratory distress syndrome (ARDS), with mechanical ventilation variables or chest x-ray image features employed in the privileged domain and electronic health records in the base domain. In model training, the objective of privileged logistic regression was designed to incorporate data from the privileged domain and encourage knowledge transfer across the privileged and base domains. An asymptotic analysis was also performed, yielding sufficient conditions under which the addition of privileged information increases the rate of convergence in the proposed model. Results for ARDS detection show that PLR models achieve better classification performances than logistic regression models trained solely on the base domain, even when privileged information is partially available. Furthermore, PLR models demonstrate performance on par with or superior to state-of-the-art models under the LUPI paradigm. As the proposed models are effective, easy to interpret, and highly explainable, they are ideal for other clinical applications where privileged information is at least partially available.
高级学习范式,即使用特权信息学习(LUPI),在训练中利用预测时不存在的信息。在本研究中,我们在 LUPI 范式下开发了特权逻辑回归(PLR)模型来检测急性呼吸窘迫综合征(ARDS),特权域采用机械通气变量或胸部 X 光图像特征,基础域采用电子健康记录。在模型训练中,特权逻辑回归的目标是纳入特权域的数据,并鼓励在特权域和基础域之间进行知识转移。此外,还进行了渐近分析,得出了增加特权信息可提高拟议模型收敛速度的充分条件。ARDS 检测结果表明,即使在特权信息部分可用的情况下,PLR 模型的分类性能也优于仅在基域上训练的逻辑回归模型。此外,在 LUPI 范式下,PLR 模型的性能与最先进的模型相当或更胜一筹。由于所提出的模型有效、易于解释、可解释性强,因此非常适合于至少有部分特权信息的其他临床应用。
{"title":"Learning using privileged information with logistic regression on acute respiratory distress syndrome detection","authors":"Zijun Gao , Shuyang Cheng , Emily Wittrup , Jonathan Gryak , Kayvan Najarian","doi":"10.1016/j.artmed.2024.102947","DOIUrl":"10.1016/j.artmed.2024.102947","url":null,"abstract":"<div><p>The advanced learning paradigm, learning using privileged information (LUPI), leverages information in training that is not present at the time of prediction. In this study, we developed privileged logistic regression (PLR) models under the LUPI paradigm to detect acute respiratory distress syndrome (ARDS), with mechanical ventilation variables or chest x-ray image features employed in the privileged domain and electronic health records in the base domain. In model training, the objective of privileged logistic regression was designed to incorporate data from the privileged domain and encourage knowledge transfer across the privileged and base domains. An asymptotic analysis was also performed, yielding sufficient conditions under which the addition of privileged information increases the rate of convergence in the proposed model. Results for ARDS detection show that PLR models achieve better classification performances than logistic regression models trained solely on the base domain, even when privileged information is partially available. Furthermore, PLR models demonstrate performance on par with or superior to state-of-the-art models under the LUPI paradigm. As the proposed models are effective, easy to interpret, and highly explainable, they are ideal for other clinical applications where privileged information is at least partially available.</p></div>","PeriodicalId":55458,"journal":{"name":"Artificial Intelligence in Medicine","volume":"156 ","pages":"Article 102947"},"PeriodicalIF":6.1,"publicationDate":"2024-08-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S0933365724001891/pdfft?md5=a2efcd8bad011ba494040f8f74dd7135&pid=1-s2.0-S0933365724001891-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142088620","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-08-14DOI: 10.1016/j.artmed.2024.102954
Rosa Sicilia , Linlin Shen , Alejandro Rodríguez-González , KC Santosh , Peter J.F. Lucas
{"title":"Introduction to the special issue on IEEE CBMS 2022 mining healthcare: AI and machine learning for biomedicine","authors":"Rosa Sicilia , Linlin Shen , Alejandro Rodríguez-González , KC Santosh , Peter J.F. Lucas","doi":"10.1016/j.artmed.2024.102954","DOIUrl":"10.1016/j.artmed.2024.102954","url":null,"abstract":"","PeriodicalId":55458,"journal":{"name":"Artificial Intelligence in Medicine","volume":"156 ","pages":"Article 102954"},"PeriodicalIF":6.1,"publicationDate":"2024-08-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142127458","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}