
Journal of Imaging: Latest Publications

Adjunct Automated Breast Ultrasound in Mammographic Screening: A Systematic Review and Meta-Analysis.
IF 2.7 Q3 IMAGING SCIENCE & PHOTOGRAPHIC TECHNOLOGY Pub Date : 2025-12-22 DOI: 10.3390/jimaging12010003
Ghufran Jassim, Fahad AlZayani, Suchita Dsilva

Mammographic sensitivity is reduced in women with dense breasts, leading to missed cancers and a higher burden of interval cancers. Automated breast ultrasound (ABUS) and ultrasound tomography (UST) have been introduced as supplemental breast imaging modalities, but primary studies are heterogeneous, and previous reviews have not focused on screening settings or on head-to-head comparisons with handheld ultrasound (HHUS). We systematically searched PubMed, Embase, Web of Science and the Cochrane Library for studies from 1 January 2000 to 31 May 2025 evaluating ABUS or UST as adjuncts to mammographic screening. Two reviewers independently selected studies and assessed risk of bias. When at least two clinically comparable studies were available, we pooled sensitivity and specificity using random-effects bivariate meta-analysis. Eighteen studies (just over 20,000 screening or recall episodes) met the inclusion criteria; 16 evaluated ABUS/ABVS and 2 UST. Adding ABUS to mammography increased sensitivity by 6-35 percentage points and improved cancer detection by 2.4-4.3 per 1000 women with dense breasts, with higher recall rates and modest reductions in specificity. When ABUS was compared directly with HHUS, pooled sensitivity was 0.90 and specificity 0.89, with HHUS showing slightly lower sensitivity and slightly higher specificity. Only two studies had an overall low risk of bias, and heterogeneity (particularly for specificity) was substantial. ABUS is a practical and scalable adjunct to mammography that increases cancer detection in women with dense breasts, with an expected trade-off of higher recall and modest specificity loss. Its comparative diagnostic accuracy appears broadly non-inferior to HHUS. However, the predominance of high-risk-of-bias studies and between-study heterogeneity means that high-quality population-based trials and standardised reporting are still required before widespread implementation in organised screening programmes.
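The pooling step described above can be illustrated with a simplified sketch. The review pooled sensitivity and specificity with a random-effects bivariate model; the snippet below shows the simpler univariate DerSimonian-Laird approach on the logit scale, which conveys the idea of weighting studies by within- plus between-study variance. The function name and the variance approximation are illustrative, not the authors' code:

```python
import math

def logit(p):
    return math.log(p / (1 - p))

def pool_logit_dl(props, ns):
    """DerSimonian-Laird random-effects pooling of proportions on the
    logit scale. props: per-study sensitivities (or specificities);
    ns: per-study sample sizes."""
    ys = [logit(p) for p in props]
    # approximate within-study variance of a logit-transformed proportion
    vs = [1.0 / (n * p * (1 - p)) for p, n in zip(props, ns)]
    ws = [1.0 / v for v in vs]
    y_fixed = sum(w * y for w, y in zip(ws, ys)) / sum(ws)
    q = sum(w * (y - y_fixed) ** 2 for w, y in zip(ws, ys))
    df = len(ys) - 1
    c = sum(ws) - sum(w * w for w in ws) / sum(ws)
    tau2 = max(0.0, (q - df) / c)  # between-study variance estimate
    ws_re = [1.0 / (v + tau2) for v in vs]
    y_re = sum(w * y for w, y in zip(ws_re, ys)) / sum(ws_re)
    return 1.0 / (1.0 + math.exp(-y_re))  # back-transform to a proportion
```

A full bivariate analysis would additionally model the correlation between logit-sensitivity and logit-specificity across studies, as the Reitsma model does.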

Citations: 0
Development of a Multispectral Image Database in Visible-Near-Infrared for Demosaicking and Machine Learning Applications.
IF 2.7 Q3 IMAGING SCIENCE & PHOTOGRAPHIC TECHNOLOGY Pub Date : 2025-12-20 DOI: 10.3390/jimaging12010002
Vahid Mohammadi, Sovi Guillaume Sodjinou, Pierre Gouton

The use of multispectral (MS) imaging is growing fast across many research fields. However, one obstacle researchers face is the limited availability of multispectral image databases. This arises from two factors: multispectral cameras are a relatively recent technology, and they are not widely available. Hence, the development of an image database is crucial for research on multispectral images. This study takes advantage of two high-end visible and near-infrared MS cameras, based on filter-array technology and developed on the PImRob platform at the University of Burgundy, to provide a freely accessible database. The database includes high-resolution MS images of different plants and weeds, along with annotated images and masks. Both the original raw images and the demosaicked images are provided. The database has been developed for research on demosaicking techniques, segmentation algorithms, and deep learning for crop/weed discrimination.
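As a hedged illustration of what demosaicking a multispectral filter array (MSFA) raw image involves, the sketch below reconstructs each band by nearest-neighbour interpolation. The pattern layout and function name are hypothetical and not taken from the database's actual pipeline:

```python
import numpy as np

def demosaick_nearest(raw, pattern):
    """Nearest-neighbour demosaicking for a multispectral filter array.
    raw: 2-D mosaic image; pattern: small 2-D array of band indices
    that tiles the sensor."""
    h, w = raw.shape
    ph, pw = pattern.shape
    bands = int(pattern.max()) + 1
    # tile the MSFA pattern over the full sensor
    tiled = np.tile(pattern, (h // ph + 1, w // pw + 1))[:h, :w]
    out = np.zeros((bands, h, w), dtype=raw.dtype)
    yy, xx = np.mgrid[0:h, 0:w]
    for b in range(bands):
        ys, xs = np.nonzero(tiled == b)
        # brute-force nearest sampled pixel of band b for every location
        # (fine for a sketch; use a KD-tree for real image sizes)
        d = (yy[..., None] - ys) ** 2 + (xx[..., None] - xs) ** 2
        nearest = d.argmin(axis=-1)
        out[b] = raw[ys[nearest], xs[nearest]]
    return out
```

Bilinear or learned demosaicking would replace the nearest-sample lookup with a weighted combination of neighbouring samples of the same band.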

Citations: 0
Non-Destructive Mangosteen Volume Estimation via Multi-View Instance Segmentation and Hybrid Geometric Modeling.
IF 2.7 Q3 IMAGING SCIENCE & PHOTOGRAPHIC TECHNOLOGY Pub Date : 2025-12-19 DOI: 10.3390/jimaging12010001
Wattanapong Kurdthongmee, Arsanchai Sukkuea, Md Eshrat E Alahi, Qi Zeng

In precision agriculture, accurate, non-destructive estimation of fruit volume is crucial for quality grading, yield prediction, and post-harvest management. While vision-based methods have proven useful, fruits with complex geometry, such as mangosteen (Garcinia mangostana L.), remain difficult because their large calyx confounds traditional form-modeling methods. Traditional geometric solutions such as ellipsoid approximations, diameter-height estimation, and shape-from-silhouette reconstruction often fail because the irregular calyx generates asymmetric protrusions that violate their basic form assumptions. We offer a novel study framework employing both multi-view instance segmentation and hybrid geometric feature modeling to quantitatively model mangosteen volume with conventional 2D imaging. A You Only Look Once (YOLO)-based segmentation model was employed to explicitly separate the fruit body from the calyx. Calyx inclusion resulted in dense geometric noise and reduced model performance (R² < 0.40). We trained eight regression models on a curated and augmented 900-image dataset (train N = 720, test N = 180). The models used single-view and multi-view geometric regressors (V ∝ A^1.5), polynomial hybrid configurations, ellipsoid-based approximations, and hybrid feature formulations. Multi-view models consistently outperformed single-view models, and the average predictive accuracy improved from R² = 0.6493 to R² = 0.7290. The best model is a hybrid linear regression model combining side- and bottom-area features (A_s^1.5, A_b^1.5) with ellipsoid-derived volume estimation (V_ellipsoid), which achieved R² = 0.7290, a Mean Absolute Percentage Error (MAPE) of 16.04%, and a Root Mean Square Error (RMSE) of 31.9 cm³ on the test set. These results confirm the proposed model as a low-cost, interpretable, and flexible approach for real-time fruit volume estimation, ready for incorporation into automated sorting and grading systems in post-harvest processing pipelines.
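The best-performing hybrid feature model can be sketched as an ordinary least-squares fit over the features named in the abstract (A_s^1.5, A_b^1.5, and the ellipsoid volume). The function names and all numeric values below are illustrative, not the authors' fitted coefficients:

```python
import numpy as np

def fit_hybrid_volume_model(A_side, A_bottom, V_ellipsoid, V_true):
    """Least-squares fit of the hybrid feature model
    V ≈ b0 + b1*A_side**1.5 + b2*A_bottom**1.5 + b3*V_ellipsoid."""
    X = np.column_stack([
        np.ones_like(A_side),  # intercept
        A_side ** 1.5,         # side-view area feature
        A_bottom ** 1.5,       # bottom-view area feature
        V_ellipsoid,           # ellipsoid-derived volume estimate
    ])
    coef, *_ = np.linalg.lstsq(X, V_true, rcond=None)
    return coef

def predict_volume(coef, A_side, A_bottom, V_ellipsoid):
    return (coef[0] + coef[1] * A_side ** 1.5
            + coef[2] * A_bottom ** 1.5 + coef[3] * V_ellipsoid)
```

The V ∝ A^1.5 form reflects dimensional analysis: a volume scales as the 3/2 power of a projected area for geometrically similar shapes.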

Citations: 0
A Structured Review and Quantitative Profiling of Public Brain MRI Datasets for Foundation Model Development.
IF 2.7 Q3 IMAGING SCIENCE & PHOTOGRAPHIC TECHNOLOGY Pub Date : 2025-12-18 DOI: 10.3390/jimaging11120454
Minh Sao Khue Luu, Margaret V Benedichuk, Ekaterina I Roppert, Roman M Kenzhin, Bair N Tuchinov

The development of foundation models for brain MRI depends critically on the scale, diversity, and consistency of available data, yet systematic assessments of these factors remain scarce. In this study, we analyze 54 publicly accessible brain MRI datasets encompassing over 538,031 scans to provide a structured, multi-level overview tailored to foundation model development. At the dataset level, we characterize modality composition, disease coverage, and dataset scale, revealing strong imbalances between large healthy cohorts and smaller clinical populations. At the image level, we quantify voxel spacing, orientation, and intensity distributions across 14 representative datasets, demonstrating substantial heterogeneity that can influence representation learning. We then perform a quantitative evaluation of preprocessing variability, examining how intensity normalization, bias field correction, skull stripping, spatial registration, and interpolation alter voxel statistics and geometry. While these steps improve within-dataset consistency, residual differences persist between datasets. Finally, a feature-space case study using a 3D DenseNet121 shows measurable residual covariate shift after standardized preprocessing, confirming that harmonization alone cannot eliminate inter-dataset bias. Together, these analyses provide a unified characterization of variability in public brain MRI resources and emphasize the need for preprocessing-aware and domain-adaptive strategies in the design of generalizable brain MRI foundation models.
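One of the preprocessing steps examined above, intensity normalization, can be sketched minimally as a z-score transform inside an optional brain mask. This is a generic formulation, not the paper's exact protocol:

```python
import numpy as np

def zscore_normalize(volume, mask=None):
    """Z-score intensity normalization, a common MRI harmonization step.
    Statistics are computed inside a brain mask when one is supplied,
    so background voxels do not skew the mean and standard deviation."""
    vals = volume[mask] if mask is not None else volume
    mu, sigma = vals.mean(), vals.std()
    return (volume - mu) / (sigma + 1e-8)  # epsilon guards flat volumes
```

As the study's feature-space analysis suggests, such per-volume normalization improves within-dataset consistency but cannot by itself remove inter-dataset covariate shift.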

Citations: 0
Salient Object Detection in Optical Remote Sensing Images Based on Hierarchical Semantic Interaction.
IF 2.7 Q3 IMAGING SCIENCE & PHOTOGRAPHIC TECHNOLOGY Pub Date : 2025-12-17 DOI: 10.3390/jimaging11120453
Jingfan Xu, Qi Zhang, Jinwen Xing, Mingquan Zhou, Guohua Geng

Existing salient object detection methods for optical remote sensing images still face certain limitations due to complex background variations, significant scale discrepancies among targets, severe background interference, and diverse topological structures. On the one hand, the feature transmission process often neglects the constraints and complementary effects of high-level features on low-level features, leading to insufficient feature interaction and weakened model representation. On the other hand, decoder architectures generally rely on simple cascaded structures, which fail to adequately exploit and utilize contextual information. To address these challenges, this study proposes a Hierarchical Semantic Interaction Module to enhance salient object detection performance in optical remote sensing scenarios. The module introduces foreground content modeling and a hierarchical semantic interaction mechanism within a multi-scale feature space, reinforcing the synergy and complementarity among features at different levels. This effectively highlights multi-scale and multi-type salient regions in complex backgrounds. Extensive experiments on multiple optical remote sensing datasets demonstrate the effectiveness of the proposed method. Specifically, on the EORSSD dataset, our full model integrating both CA and PA modules improves the max F-measure from 0.8826 to 0.9100 (↑2.74%), increases maxE from 0.9603 to 0.9727 (↑1.24%), and enhances the S-measure from 0.9026 to 0.9295 (↑2.69%) compared with the baseline. These results clearly demonstrate the effectiveness of the proposed modules and verify the robustness and strong generalization capability of our method in complex remote sensing scenarios.
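The max F-measure reported above is conventionally computed by sweeping a binarization threshold over the predicted saliency map. A minimal sketch follows, using the common beta² = 0.3 weighting (an assumption; the abstract does not state the value used):

```python
import numpy as np

def max_f_measure(saliency, gt, beta2=0.3, steps=255):
    """Max F-measure of a saliency map in [0, 1] against a binary
    ground-truth mask, maximized over binarization thresholds."""
    best = 0.0
    for t in np.linspace(0.0, 1.0, steps, endpoint=False):
        pred = saliency > t
        tp = np.logical_and(pred, gt).sum()
        if tp == 0:
            continue  # precision/recall undefined or zero at this threshold
        prec = tp / pred.sum()
        rec = tp / gt.sum()
        f = (1 + beta2) * prec * rec / (beta2 * prec + rec)
        best = max(best, f)
    return best
```

The S-measure and E-measure quoted alongside it are structure- and alignment-aware metrics that complement this purely pixel-wise score.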

Citations: 0
SRE-FMaps: A Sinkhorn-Regularized Elastic Functional Map Framework for Non-Isometric 3D Shape Matching.
IF 2.7 Q3 IMAGING SCIENCE & PHOTOGRAPHIC TECHNOLOGY Pub Date : 2025-12-16 DOI: 10.3390/jimaging11120452
Dan Zhang, Yue Zhang, Ning Wang, Dong Zhao

Precise 3D shape correspondence is a fundamental prerequisite for critical applications ranging from medical anatomical modeling to visual recognition. However, non-isometric 3D shape matching remains a challenging task due to the limited sensitivity of traditional Laplace-Beltrami (LB) bases to local geometric deformations such as stretching and bending. To address these limitations, this paper proposes a Sinkhorn-Regularized Elastic Functional Map framework (SRE-FMaps) that integrates entropy-regularized optimal transport with an elastic thin-shell energy basis. First, a sparse Sinkhorn transport plan is adopted to initialize a bijective correspondence with linear computational complexity. Then, a non-orthogonal elastic basis, derived from the Hessian of thin-shell deformation energy, is introduced to enhance high-frequency feature perception. Finally, correspondence stability is quantified through a cosine-based elastic distance metric, enabling retrieval and classification. Experiments on the SHREC2015, McGill, and Face datasets demonstrate that SRE-FMaps reduces the correspondence error by a maximum of 32% and achieves an average of 92.3% classification accuracy (with a peak of 94.74% on the Face dataset). Moreover, the framework exhibits superior robustness, yielding a recall of up to 91.67% and an F1-score of 0.94, effectively handling bending, stretching, and folding deformations compared with conventional LB-based functional map pipelines. The proposed framework provides a scalable solution for non-isometric shape correspondence in medical modeling, 3D reconstruction, and visual recognition.
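The entropy-regularized optimal-transport initialization at the heart of the framework can be sketched with the classic Sinkhorn iterations. A dense version is shown for clarity; the paper uses a sparse transport plan to reach linear complexity:

```python
import numpy as np

def sinkhorn(cost, a, b, eps=0.05, iters=200):
    """Entropy-regularized optimal transport via Sinkhorn iterations.
    cost: (n, m) cost matrix; a, b: source/target marginals summing to 1.
    Returns a transport plan whose marginals approximate (a, b)."""
    K = np.exp(-cost / eps)       # Gibbs kernel
    u = np.ones_like(a)
    for _ in range(iters):
        v = b / (K.T @ u)         # scale columns toward marginal b
        u = a / (K @ v)           # scale rows toward marginal a
    return u[:, None] * K * v[None, :]
```

Smaller eps yields a sharper, more nearly bijective plan at the cost of slower convergence, which is why regularization strength is a key tuning knob in such pipelines.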

Citations: 0
Sensory Representation of Neural Networks Using Sound and Color for Medical Imaging Segmentation.
IF 2.7 Q3 IMAGING SCIENCE & PHOTOGRAPHIC TECHNOLOGY Pub Date : 2025-12-15 DOI: 10.3390/jimaging11120449
Irenel Lopo Da Silva, Nicolas Francisco Lori, José Manuel Ferreira Machado

This paper introduces a novel framework for sensory representation of brain imaging data, combining deep learning-based segmentation with multimodal visual and auditory outputs. Structural magnetic resonance imaging (MRI) predictions are converted into color-coded maps and stereophonic/MIDI sonifications, enabling intuitive interpretation of cortical activation patterns. High-precision U-Net models efficiently generate these outputs, supporting clinical decision-making, cognitive research, and creative applications. Spatial, intensity, and anomalous features are encoded into perceivable visual and auditory cues, facilitating early detection and introducing the concept of "auditory biomarkers" for potential pathological identification. Despite current limitations, including dataset size, absence of clinical validation, and heuristic-based sonification, the pipeline demonstrates technical feasibility and robustness. Future work will focus on clinical user studies, the application of functional MRI (fMRI) time-series for dynamic sonification, and the integration of real-time emotional feedback in cinematic contexts. This multisensory approach offers a promising avenue for enhancing the interpretability of complex neuroimaging data across medical, research, and artistic domains.
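As a hedged sketch of the intensity-to-pitch encoding such MIDI sonification implies (the mapping range and note bounds below are assumptions, not the authors' heuristic):

```python
def intensity_to_midi(intensity, low=36, high=84):
    """Map a normalized voxel intensity in [0, 1] to a MIDI note number.
    A linear heuristic: low/high bounds (C2..C6 here) are illustrative."""
    intensity = min(max(intensity, 0.0), 1.0)  # clamp out-of-range values
    return int(round(low + intensity * (high - low)))

def midi_to_hz(note):
    """Standard equal-temperament conversion (A4 = MIDI 69 = 440 Hz)."""
    return 440.0 * 2 ** ((note - 69) / 12)
```

Spatial position could analogously drive stereo panning, giving the stereophonic component of the output the abstract describes.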

Citations: 0
Application of Generative Adversarial Networks to Improve COVID-19 Classification on Ultrasound Images.
IF 2.7 Q3 IMAGING SCIENCE & PHOTOGRAPHIC TECHNOLOGY Pub Date : 2025-12-15 DOI: 10.3390/jimaging11120451
Pedro Sérgio Tôrres Figueiredo Silva, Antonio Mauricio Ferreira Leite Miranda de Sá, Wagner Coelho de Albuquerque Pereira, Leonardo Bonato Felix, José Manoel de Seixas

COVID-19 screening is crucial for the early diagnosis and treatment of the disease, with lung ultrasound serving as a cost-effective alternative to other imaging techniques. Given the dependency on medical expertise and experience to accurately identify patterns in ultrasound exams, deep learning techniques have been explored for automatically classifying patients' conditions. However, the limited availability of public medical databases remains a significant obstacle to the development of more advanced models. To address this data scarcity, the study proposes a method that leverages generative adversarial networks (GANs) to generate synthetic lung ultrasound images, which are subsequently used to train frame-based classification models. Two types of GANs are considered: Wasserstein GANs (WGAN) and Pix2Pix. Distribution-analysis tools show that the synthetic data follow a distribution close to that of the original data. The classification models trained with synthetic data achieved a peak accuracy of 96.32% ± 4.17%, significantly outperforming the maximum accuracy of 82.69% ± 10.42% obtained when training only with the original data. Furthermore, the best results are comparable to, and in some cases surpass, those reported in recent related studies.
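The Wasserstein objective underlying the WGAN variant can be stated compactly: the critic maximizes the mean score gap between real and generated samples, while the generator pushes its samples' scores up. A minimal sketch on plain critic-score lists (the scores are toy floats standing in for network outputs):

```python
def wgan_losses(real_scores, fake_scores):
    """Wasserstein GAN losses computed from critic scores.

    The critic maximizes E[D(real)] - E[D(fake)], i.e. minimizes the
    negated gap; the generator minimizes -E[D(fake)]. Weight clipping or
    a gradient penalty (not shown) keeps the critic approximately
    1-Lipschitz in practice.
    """
    mean = lambda xs: sum(xs) / len(xs)
    critic_loss = mean(fake_scores) - mean(real_scores)
    generator_loss = -mean(fake_scores)
    return critic_loss, generator_loss
```

A strongly negative critic loss indicates the critic currently separates real frames from synthetic ones; as training converges, the gap narrows.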

Open-access PDF: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12733862/pdf/
Citations: 0
Applying Radiomics to Predict Outcomes in Patients with High-Grade Retroperitoneal Sarcoma Treated with Preoperative Radiotherapy.
IF 2.7 Q3 IMAGING SCIENCE & PHOTOGRAPHIC TECHNOLOGY Pub Date : 2025-12-15 DOI: 10.3390/jimaging11120450
Adel Shahnam, Nicholas Hardcastle, David E Gyorki, Katrina M Ingley, Krystel Tran, Catherine Mitchell, Sarat Chander, Julie Chu, Michael Henderson, Alan Herschtal, Mathias Bressel, Jeremy Lewin

Retroperitoneal sarcomas (RPS) are rare tumours, primarily treated with surgical resection; however, recurrences are frequent. Combining clinical factors with CT-derived radiomic features could enhance treatment stratification and personalization. This study aims to assess whether radiomic features provide additional prognostic value beyond clinicopathological features in patients with high-risk RPS treated with preoperative radiotherapy. This retrospective study included patients aged 18 or older with non-recurrent and non-metastatic RPS treated with preoperative radiotherapy between 2008 and 2016. Hazard ratios (HR) were calculated using Cox proportional hazards regression to assess the impact of clinical and radiomic features on time-to-event outcomes. Predictive accuracy was assessed with c-statistics. Radiomic analysis was performed on the high-risk group (undifferentiated pleomorphic sarcoma, well-differentiated/de-differentiated liposarcoma, or grade 2/3 leiomyosarcoma). Seventy-two patients were included, with a median follow-up of 3.7 years; the 5-year overall survival (OS) was 67%. Multivariable analysis showed that older age (HR: 1.3 per 5-year increase, p = 0.04), grade 3 (HR: 180.3, p = 0.02), and larger tumours (HR: 4.0 per 10 cm increase, p = 0.02) predicted worse OS. In the higher-risk group, the c-statistic for the clinical model was 0.59 for time to distant metastasis (TDM) and 0.56 for OS. Among 27 radiomic features, kurtosis improved OS prediction (c-statistic 0.69, p = 0.013), and Neighbourhood Gray-Tone Difference Matrix (NGTDM) busyness improved it to 0.73 (p = 0.036). Kurtosis also improved TDM prediction (c-statistic 0.72, p = 0.023). Radiomic features may complement clinicopathological factors in predicting overall survival and time to distant metastasis in high-risk retroperitoneal sarcoma. These exploratory findings warrant validation in larger, multi-institutional studies.
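The c-statistic reported here is Harrell's concordance index, which scores how often a higher predicted risk coincides with an earlier event across comparable patient pairs. A minimal pure-Python sketch, assuming right-censored data encoded as observed times plus event indicators (pairs whose earlier subject is censored are skipped; risk ties count one half):

```python
def concordance_index(times, events, risk_scores):
    """Harrell's c-statistic for right-censored survival data.

    A pair (i, j) is comparable when subject i has the shorter observed
    time and experienced the event; the pair is concordant when subject i
    also carries the higher predicted risk. Ties in risk count 0.5.
    """
    concordant = 0.0
    comparable = 0
    for i in range(len(times)):
        for j in range(len(times)):
            if times[i] < times[j] and events[i]:
                comparable += 1
                if risk_scores[i] > risk_scores[j]:
                    concordant += 1.0
                elif risk_scores[i] == risk_scores[j]:
                    concordant += 0.5
    if comparable == 0:
        raise ValueError("no comparable pairs")
    return concordant / comparable
```

A value of 0.5 corresponds to chance discrimination, which puts the clinical-model values of 0.56-0.59 and the radiomics-augmented values of 0.69-0.73 in context.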

Open-access PDF: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12733741/pdf/
Citations: 0
Dual-Path Convolutional Neural Network with Squeeze-and-Excitation Attention for Lung and Colon Histopathology Classification.
IF 2.7 Q3 IMAGING SCIENCE & PHOTOGRAPHIC TECHNOLOGY Pub Date : 2025-12-14 DOI: 10.3390/jimaging11120448
Helala AlShehri

Lung and colon cancers remain among the leading causes of cancer-related mortality worldwide, underscoring the need for rapid and accurate histopathological diagnosis. Manual examination of biopsy slides is often time-consuming and prone to inter-observer variability, which highlights the importance of developing reliable and explainable automated diagnostic systems. This study presents DPCSE-Net, a lightweight dual-path convolutional neural network enhanced with a squeeze-and-excitation (SE) attention mechanism for lung and colon cancer classification. The dual-path structure captures both fine-grained cellular textures and global contextual information through multiscale feature extraction, while the SE attention module adaptively recalibrates channel responses to emphasize discriminative features. To enhance transparency and interpretability, Gradient-weighted Class Activation Mapping (Grad-CAM), attention heatmaps, and Integrated Gradients are employed to visualize class-specific activation patterns and verify that the model's focus aligns with diagnostically relevant tissue regions. Evaluated on the publicly available LC25000 dataset, DPCSE-Net achieved state-of-the-art performance with 99.88% accuracy and F1-score, while maintaining low computational complexity. Ablation experiments confirmed the contribution of the dual-path design and SE module, and qualitative analyses demonstrated the model's strong interpretability. These results establish DPCSE-Net as an accurate, efficient, and explainable framework for computer-aided histopathological diagnosis, supporting the broader goals of explainable AI in computer vision.
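The SE recalibration step described above is compact enough to sketch directly: squeeze each channel to a scalar by global average pooling, pass the scalars through a two-layer bottleneck, and rescale each channel by the resulting sigmoid gate. The weight shapes below are illustrative stand-ins, not learned parameters from the paper:

```python
import math

def squeeze_excite(feature_maps, w1, w2):
    """Squeeze-and-Excitation channel recalibration (minimal sketch).

    feature_maps: list of C channels, each an H x W nested list.
    w1: reduction weights of shape (C // r) x C; w2: expansion weights
    of shape C x (C // r). Both are illustrative, not learned here.
    """
    # Squeeze: global average pooling, one scalar per channel.
    z = [sum(sum(row) for row in ch) / (len(ch) * len(ch[0]))
         for ch in feature_maps]
    # Excitation: FC -> ReLU -> FC -> sigmoid, one gate per channel.
    hidden = [max(0.0, sum(w * x for w, x in zip(row, z))) for row in w1]
    gates = [1.0 / (1.0 + math.exp(-sum(w * h for w, h in zip(row, hidden))))
             for row in w2]
    # Scale: reweight every spatial position of each channel by its gate.
    return [[[v * g for v in row] for row in ch]
            for ch, g in zip(feature_maps, gates)]
```

With zero expansion weights every gate is sigmoid(0) = 0.5, i.e. uniform attenuation; trained weights instead let the network emphasize discriminative channels, which is the adaptive recalibration the abstract refers to.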

Open-access PDF: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12733646/pdf/
Citations: 0