Pub Date: 2024-11-01 | Epub Date: 2024-06-25 | DOI: 10.1007/s11548-024-03169-0
Keita Takeda, Tomoya Sakai, Eiji Mitate
To address the background-bias problem in computer-aided cytology caused by microscopic slide deterioration, this article proposes a deep learning approach for cell segmentation and background removal without requiring cell annotation. A U-Net-based model was trained to separate cells from the background in an unsupervised manner by leveraging the redundancy of the background and the sparsity of cells in liquid-based cytology (LBC) images. The experimental results demonstrate that the U-Net-based model trained on a small set of cytology images can exclude background features and accurately segment cells. This capability is beneficial for debiasing the detection and classification of the cells of interest in oral LBC. Slide deterioration can significantly affect deep learning-based cell classification. Our proposed method effectively removes background features without the cost of cell annotation, thereby enabling accurate cytological diagnosis through deep learning of microscopic slide images.
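The abstract does not spell out the training objective; purely as an illustration of how background redundancy and cell sparsity might be exploited without annotations, a minimal PyTorch-style sketch could look like the following (the `unet` model, the batch-median background surrogate, and the loss weight are assumptions, not the authors' implementation):

```python
# Illustrative sketch only: a hypothetical loss that rewards a U-Net for putting
# sparse cells into the foreground mask while the remaining background stays
# explainable by a redundant (batch-median) background model.
import torch
import torch.nn.functional as F

def separation_loss(unet, images, sparsity_weight=0.01):
    """images: (B, C, H, W) batch of LBC patches scaled to [0, 1]."""
    mask = torch.sigmoid(unet(images))            # soft foreground (cell) mask
    background = images * (1.0 - mask)            # what the mask leaves behind
    # Background redundancy: each masked-out image should match a shared,
    # batch-wise median background (a crude stationarity surrogate).
    bg_model = background.median(dim=0, keepdim=True).values
    redundancy_term = F.l1_loss(background, bg_model.expand_as(background))
    # Cell sparsity: penalize the area covered by the foreground mask.
    sparsity_term = mask.mean()
    return redundancy_term + sparsity_weight * sparsity_term
```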
{"title":"Background removal for debiasing computer-aided cytological diagnosis.","authors":"Keita Takeda, Tomoya Sakai, Eiji Mitate","doi":"10.1007/s11548-024-03169-0","DOIUrl":"10.1007/s11548-024-03169-0","url":null,"abstract":"<p><p>To address the background-bias problem in computer-aided cytology caused by microscopic slide deterioration, this article proposes a deep learning approach for cell segmentation and background removal without requiring cell annotation. A U-Net-based model was trained to separate cells from the background in an unsupervised manner by leveraging the redundancy of the background and the sparsity of cells in liquid-based cytology (LBC) images. The experimental results demonstrate that the U-Net-based model trained on a small set of cytology images can exclude background features and accurately segment cells. This capability is beneficial for debiasing in the detection and classification of the cells of interest in oral LBC. Slide deterioration can significantly affect deep learning-based cell classification. Our proposed method effectively removes background features at no cost of cell annotation, thereby enabling accurate cytological diagnosis through the deep learning of microscopic slide images.</p>","PeriodicalId":51251,"journal":{"name":"International Journal of Computer Assisted Radiology and Surgery","volume":" ","pages":"2165-2174"},"PeriodicalIF":2.3,"publicationDate":"2024-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11541310/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141452132","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Purpose: Manual annotations for training deep learning models in auto-segmentation are time-intensive. This study introduces a hybrid representation-enhanced sampling strategy that integrates both density and diversity criteria within an uncertainty-based Bayesian active learning (BAL) framework to reduce annotation efforts by selecting the most informative training samples.
Methods: The experiments are performed on two lower extremity datasets of MRI and CT images, focusing on the segmentation of the femur, pelvis, sacrum, quadriceps femoris, hamstrings, adductors, sartorius, and iliopsoas, utilizing a U-Net-based BAL framework. Our method selects uncertain samples with high density and diversity for manual revision, optimizing for maximal similarity to unlabeled instances and minimal similarity to existing training data. We assess accuracy and efficiency using the Dice score and a proposed metric called reduced annotation cost (RAC), respectively. We further evaluate the impact of various acquisition rules on BAL performance and design an ablation study to estimate the effectiveness of each criterion.
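As a rough illustration of the hybrid acquisition rule just described, the sketch below scores candidates by uncertainty, density (mean similarity to the unlabeled pool), and diversity (dissimilarity to the labeled set); the feature representation, the multiplicative combination, and the batch size k are illustrative assumptions rather than the paper's exact formulation:

```python
# Illustrative hybrid acquisition score: uncertainty x density x diversity.
import numpy as np

def select_informative(cand_feats, unlabeled_feats, labeled_feats, uncertainty, k=5):
    """Feature rows are assumed L2-normalized; uncertainty is one value per candidate."""
    density = (cand_feats @ unlabeled_feats.T).mean(axis=1)       # representative of the pool
    diversity = 1.0 - (cand_feats @ labeled_feats.T).max(axis=1)  # far from existing labels
    score = uncertainty * density * diversity
    return np.argsort(score)[::-1][:k]  # indices of the k samples to send for manual revision
```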
Results: In the MRI and CT datasets, our method was superior or comparable to existing ones, achieving a 0.8% Dice and 1.0% RAC increase in CT (statistically significant) and a 0.8% Dice and 1.1% RAC increase in MRI (not statistically significant) under volume-wise acquisition. Our ablation study indicates that combining density and diversity criteria enhances the efficiency of BAL in musculoskeletal segmentation compared with using either criterion alone.
Conclusion: Our sampling method proved effective in reducing annotation costs in image segmentation tasks. The combination of the proposed method and our BAL framework provides a semi-automatic way to annotate medical image datasets efficiently.
{"title":"Hybrid representation-enhanced sampling for Bayesian active learning in musculoskeletal segmentation of lower extremities.","authors":"Ganping Li, Yoshito Otake, Mazen Soufi, Masashi Taniguchi, Masahide Yagi, Noriaki Ichihashi, Keisuke Uemura, Masaki Takao, Nobuhiko Sugano, Yoshinobu Sato","doi":"10.1007/s11548-024-03065-7","DOIUrl":"10.1007/s11548-024-03065-7","url":null,"abstract":"<p><strong>Purpose: </strong>Manual annotations for training deep learning models in auto-segmentation are time-intensive. This study introduces a hybrid representation-enhanced sampling strategy that integrates both density and diversity criteria within an uncertainty-based Bayesian active learning (BAL) framework to reduce annotation efforts by selecting the most informative training samples.</p><p><strong>Methods: </strong>The experiments are performed on two lower extremity datasets of MRI and CT images, focusing on the segmentation of the femur, pelvis, sacrum, quadriceps femoris, hamstrings, adductors, sartorius, and iliopsoas, utilizing a U-net-based BAL framework. Our method selects uncertain samples with high density and diversity for manual revision, optimizing for maximal similarity to unlabeled instances and minimal similarity to existing training data. We assess the accuracy and efficiency using dice and a proposed metric called reduced annotation cost (RAC), respectively. We further evaluate the impact of various acquisition rules on BAL performance and design an ablation study for effectiveness estimation.</p><p><strong>Results: </strong>In MRI and CT datasets, our method was superior or comparable to existing ones, achieving a 0.8% dice and 1.0% RAC increase in CT (statistically significant), and a 0.8% dice and 1.1% RAC increase in MRI (not statistically significant) in volume-wise acquisition. Our ablation study indicates that combining density and diversity criteria enhances the efficiency of BAL in musculoskeletal segmentation compared to using either criterion alone.</p><p><strong>Conclusion: </strong>Our sampling method is proven efficient in reducing annotation costs in image segmentation tasks. The combination of the proposed method and our BAL framework provides a semi-automatic way for efficient annotation of medical image datasets.</p>","PeriodicalId":51251,"journal":{"name":"International Journal of Computer Assisted Radiology and Surgery","volume":" ","pages":"2177-2186"},"PeriodicalIF":2.3,"publicationDate":"2024-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139571189","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Purpose: A large amount of research has been conducted on classifying medical images using deep learning, and thyroid tissue images can also be classified by cancer type. Deep learning requires large amounts of data, but not every medical institution can collect enough data on its own. In such cases, a classifier trained at an institution that has sufficient data can be reused at other institutions. However, when using data from multiple institutions, the feature distributions need to be unified, because the data characteristics differ owing to differences in acquisition conditions.
Methods: To unify the feature distributions, the data from Institution T are transformed with a semi-supervised CycleGAN so that their distribution becomes closer to that of Institution S. The proposed method extends CycleGAN by taking the class-wise feature distributions into account so that the domain transformation is appropriate for classification. In addition, to address the imbalance arising from the different numbers of samples per cancer type, several imbalance-handling methods are applied to the semi-supervised CycleGAN.
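The Results below single out focal loss as the most effective imbalance-handling method; a standard focal-loss sketch (in the sense of Lin et al., with gamma and alpha chosen purely for illustration, not taken from the paper) is:

```python
# Standard multi-class focal loss; gamma/alpha are illustrative defaults.
import torch
import torch.nn.functional as F

def focal_loss(logits, targets, gamma=2.0, alpha=0.25):
    """logits: (B, num_classes); targets: (B,) integer class labels."""
    log_probs = F.log_softmax(logits, dim=1)
    log_pt = log_probs.gather(1, targets.unsqueeze(1)).squeeze(1)  # log p_t of true class
    pt = log_pt.exp()
    # Down-weight easy, well-classified samples so the rare classes drive the loss.
    return (-alpha * (1.0 - pt) ** gamma * log_pt).mean()
```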
Results: The experimental results showed that classification performance improved when the dataset from Institution S was used as training data and the test dataset from Institution T was classified after domain transformation. Among the methods addressing class imbalance, focal loss contributed the most to improving the mean F1 score.
Conclusion: The proposed method achieved domain transformation of thyroid tissue images between the two domains while retaining the class-relevant features across domains, and it showed the best F1 score, with significant differences compared with the other methods. Addressing the class imbalance of the dataset enhanced the proposed method further.
{"title":"Domain transformation using semi-supervised CycleGAN for improving performance of classifying thyroid tissue images.","authors":"Yoshihito Ichiuji, Shingo Mabu, Satomi Hatta, Kunihiro Inai, Shohei Higuchi, Shoji Kido","doi":"10.1007/s11548-024-03061-x","DOIUrl":"10.1007/s11548-024-03061-x","url":null,"abstract":"<p><strong>Purpose: </strong>A large number of research has been conducted on the classification of medical images using deep learning. The thyroid tissue images can be also classified by cancer types. Deep learning requires a large amount of data, but every medical institution cannot collect sufficient number of data for deep learning. In that case, we can consider a case where a classifier trained at a certain medical institution that has a sufficient number of data is reused at other institutions. However, when using data from multiple institutions, it is necessary to unify the feature distribution because the feature of the data differs due to differences in data acquisition conditions.</p><p><strong>Methods: </strong>To unify the feature distribution, the data from Institution T are transformed to have the closer distribution to that from Institution S by applying a domain transformation using semi-supervised CycleGAN. The proposed method enhances CycleGAN considering the feature distribution of classes for making appropriate domain transformation for classification. In addition, to address the problem of imbalanced data with different numbers of data for each cancer type, several methods dealing with imbalanced data are applied to semi-supervised CycleGAN.</p><p><strong>Results: </strong>The experimental results showed that the classification performance was enhanced when the dataset from Institution S was used as training data and the testing dataset from Institution T was classified after applying domain transformation. In addition, focal loss contributed to improving the mean F1 score the best as a method that addresses the class imbalance.</p><p><strong>Conclusion: </strong>The proposed method achieved the domain transformation of thyroid tissue images between two domains, where it retained the important features related to the classes across domains and showed the best F1 score with significant differences compared with other methods. In addition, the proposed method was further enhanced by addressing the class imbalance of the dataset.</p>","PeriodicalId":51251,"journal":{"name":"International Journal of Computer Assisted Radiology and Surgery","volume":" ","pages":"2153-2163"},"PeriodicalIF":2.3,"publicationDate":"2024-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139492884","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date: 2024-11-01 | Epub Date: 2024-03-23 | DOI: 10.1007/s11548-024-03077-3
Wenqi Zhou, Xinzhou Li, Fatemeh Zabihollahy, David S Lu, Holden H Wu
Purpose: Accurate and rapid needle localization on 3D magnetic resonance imaging (MRI) is critical for MRI-guided percutaneous interventions. The current workflow requires manual needle localization on 3D MRI, which is time-consuming and cumbersome. Automatic methods using 2D deep learning networks for needle segmentation require manual image plane localization, while 3D networks are challenged by the need for sufficient training datasets. This work aimed to develop an automatic deep learning-based pipeline for accurate and rapid 3D needle localization on in vivo intra-procedural 3D MRI using a limited training dataset.
Methods: The proposed automatic pipeline adopted Shifted Window (Swin) Transformers and employed a coarse-to-fine segmentation strategy: (1) initial 3D needle feature segmentation with the 3D Swin UNEt TRansformer (UNETR); (2) generation of a 2D reformatted image containing the needle feature; (3) fine 2D needle feature segmentation with a 2D Swin Transformer and calculation of the 3D needle tip position and axis orientation. Pre-training and data augmentation were performed to improve network training. The pipeline was evaluated via cross-validation with 49 in vivo intra-procedural 3D MR images from preclinical pig experiments. The needle tip and axis localization errors were compared with human intra-reader variation using the Wilcoxon signed-rank test, with p < 0.05 considered significant.
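Step (3) ends with computing the 3D tip position and axis orientation from the segmented needle feature; one straightforward way to do this (an assumption for illustration, not necessarily the authors' exact procedure) is a principal-axis fit of the segmented voxel coordinates:

```python
# Hypothetical tip/axis computation from a binary needle segmentation.
import numpy as np

def needle_tip_and_axis(mask, voxel_spacing=(1.0, 1.0, 1.0)):
    """mask: binary 3D array of the segmented needle feature."""
    coords = np.argwhere(mask) * np.asarray(voxel_spacing)   # voxel indices -> mm
    center = coords.mean(axis=0)
    # The first principal direction of the voxel cloud approximates the needle axis.
    _, _, vt = np.linalg.svd(coords - center, full_matrices=False)
    axis = vt[0] / np.linalg.norm(vt[0])
    # Take the extreme point along the axis as the tip (sign depends on insertion side).
    projections = (coords - center) @ axis
    tip = coords[np.argmax(projections)]
    return tip, axis
```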
Results: The average end-to-end computational time for the pipeline was 6 s per 3D volume. The median Dice scores of the 3D Swin UNETR and 2D Swin Transformer in the pipeline were 0.80 and 0.93, respectively. The median 3D needle tip and axis localization errors were 1.48 mm (1.09 pixels) and 0.98°, respectively. Needle tip localization errors were significantly smaller than human intra-reader variation (median 1.70 mm; p < 0.01).
Conclusion: The proposed automatic pipeline achieved rapid pixel-level 3D needle localization on intra-procedural 3D MRI without requiring a large 3D training dataset and has the potential to assist MRI-guided percutaneous interventions.
{"title":"Deep learning-based automatic pipeline for 3D needle localization on intra-procedural 3D MRI.","authors":"Wenqi Zhou, Xinzhou Li, Fatemeh Zabihollahy, David S Lu, Holden H Wu","doi":"10.1007/s11548-024-03077-3","DOIUrl":"10.1007/s11548-024-03077-3","url":null,"abstract":"<p><strong>Purpose: </strong>Accurate and rapid needle localization on 3D magnetic resonance imaging (MRI) is critical for MRI-guided percutaneous interventions. The current workflow requires manual needle localization on 3D MRI, which is time-consuming and cumbersome. Automatic methods using 2D deep learning networks for needle segmentation require manual image plane localization, while 3D networks are challenged by the need for sufficient training datasets. This work aimed to develop an automatic deep learning-based pipeline for accurate and rapid 3D needle localization on in vivo intra-procedural 3D MRI using a limited training dataset.</p><p><strong>Methods: </strong>The proposed automatic pipeline adopted Shifted Window (Swin) Transformers and employed a coarse-to-fine segmentation strategy: (1) initial 3D needle feature segmentation with 3D Swin UNEt TRansfomer (UNETR); (2) generation of a 2D reformatted image containing the needle feature; (3) fine 2D needle feature segmentation with 2D Swin Transformer and calculation of 3D needle tip position and axis orientation. Pre-training and data augmentation were performed to improve network training. The pipeline was evaluated via cross-validation with 49 in vivo intra-procedural 3D MR images from preclinical pig experiments. The needle tip and axis localization errors were compared with human intra-reader variation using the Wilcoxon signed rank test, with p < 0.05 considered significant.</p><p><strong>Results: </strong>The average end-to-end computational time for the pipeline was 6 s per 3D volume. The median Dice scores of the 3D Swin UNETR and 2D Swin Transformer in the pipeline were 0.80 and 0.93, respectively. The median 3D needle tip and axis localization errors were 1.48 mm (1.09 pixels) and 0.98°, respectively. Needle tip localization errors were significantly smaller than human intra-reader variation (median 1.70 mm; p < 0.01).</p><p><strong>Conclusion: </strong>The proposed automatic pipeline achieved rapid pixel-level 3D needle localization on intra-procedural 3D MRI without requiring a large 3D training dataset and has the potential to assist MRI-guided percutaneous interventions.</p>","PeriodicalId":51251,"journal":{"name":"International Journal of Computer Assisted Radiology and Surgery","volume":" ","pages":"2227-2237"},"PeriodicalIF":2.3,"publicationDate":"2024-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11541278/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140195078","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Purpose: The visualization of an anomaly area is easier in anomaly detection methods that use generative models rather than classification models. However, achieving both anomaly detection accuracy and a clear visualization of anomalous areas is challenging. This study aimed to establish a method that combines both detection accuracy and clear visualization of anomalous areas using a generative adversarial network (GAN).
Methods: In this study, StyleGAN2 with adaptive discriminator augmentation (StyleGAN2-ADA), which can generate high-resolution, high-quality images from a limited number of training images, was used as the image generation model, and the pixel-to-style-to-pixel (pSp) encoder was used to convert images into intermediate latent variables. We combined existing methods for training and proposed a method for calculating anomaly scores using the intermediate latent variables. The proposed approach, which combines these two components, is called high-quality anomaly GAN (HQ-AnoGAN).
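The exact anomaly score of HQ-AnoGAN is not reproduced in this abstract; the sketch below shows one common AnoGAN-style formulation under the stated setup, combining image-space reconstruction error with a latent-space term (the encoder/generator interfaces and the weighting are assumptions):

```python
# Hypothetical AnoGAN-style score: encode, regenerate, compare in image and latent space.
import torch
import torch.nn.functional as F

def anomaly_score(encoder, generator, image, latent_weight=0.1):
    """encoder: pSp-style image -> intermediate latents; generator: latents -> image."""
    with torch.no_grad():
        latents = encoder(image)                   # intermediate latent variables
        reconstruction = generator(latents)
        latents_recon = encoder(reconstruction)    # re-encode the reconstruction
    image_term = F.l1_loss(reconstruction, image)  # how well the normal model explains the input
    latent_term = F.l1_loss(latents_recon, latents)
    return (image_term + latent_weight * latent_term).item()
```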
Results: The experimental results obtained using three datasets demonstrated that HQ-AnoGAN has equal or better detection accuracy than the existing methods. Visualizations of abnormal areas using the generated images showed that HQ-AnoGAN generates more natural images than the existing methods and localizes abnormal areas more accurately in qualitative evaluation.
Conclusion: In this study, HQ-AnoGAN, comprising StyleGAN2-ADA and the pSp encoder together with an optimal anomaly score calculation method, was proposed. The experimental results show that HQ-AnoGAN achieves both high anomaly detection accuracy and clear visualization of abnormal areas; thus, HQ-AnoGAN demonstrates significant potential for application in medical imaging diagnosis where an explanation of the diagnosis is required.
{"title":"High-quality semi-supervised anomaly detection with generative adversarial networks.","authors":"Yuki Sato, Junya Sato, Noriyuki Tomiyama, Shoji Kido","doi":"10.1007/s11548-023-03031-9","DOIUrl":"10.1007/s11548-023-03031-9","url":null,"abstract":"<p><strong>Purpose: </strong>The visualization of an anomaly area is easier in anomaly detection methods that use generative models rather than classification models. However, achieving both anomaly detection accuracy and a clear visualization of anomalous areas is challenging. This study aimed to establish a method that combines both detection accuracy and clear visualization of anomalous areas using a generative adversarial network (GAN).</p><p><strong>Methods: </strong>In this study, StyleGAN2 with adaptive discriminator augmentation (StyleGAN2-ADA), which can generate high-resolution and high-quality images with limited number of datasets, was used as the image generation model, and pixel-to-style-to-pixel (pSp) encoder was used to convert images into intermediate latent variables. We combined existing methods for training and proposed a method for calculating anomaly scores using intermediate latent variables. The proposed method, which combines these two methods, is called high-quality anomaly GAN (HQ-AnoGAN).</p><p><strong>Results: </strong>The experimental results obtained using three datasets demonstrated that HQ-AnoGAN has equal or better detection accuracy than the existing methods. The results of the visualization of abnormal areas using the generated images showed that HQ-AnoGAN could generate more natural images than the existing methods and was qualitatively more accurate in the visualization of abnormal areas.</p><p><strong>Conclusion: </strong>In this study, HQ-AnoGAN comprising StyleGAN2-ADA and pSp encoder was proposed with an optimal anomaly score calculation method. The experimental results show that HQ-AnoGAN can achieve both high abnormality detection accuracy and clear visualization of abnormal areas; thus, HQ-AnoGAN demonstrates significant potential for application in medical imaging diagnosis cases where an explanation of diagnosis is required.</p>","PeriodicalId":51251,"journal":{"name":"International Journal of Computer Assisted Radiology and Surgery","volume":" ","pages":"2121-2131"},"PeriodicalIF":2.3,"publicationDate":"2024-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"71523347","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date: 2024-11-01 | Epub Date: 2024-05-18 | DOI: 10.1007/s11548-024-03166-3
Joël L Lavanchy, Sanat Ramesh, Diego Dall'Alba, Cristians Gonzalez, Paolo Fiorini, Beat P Müller-Stich, Philipp C Nett, Jacques Marescaux, Didier Mutter, Nicolas Padoy
Purpose: Most studies on surgical activity recognition utilizing artificial intelligence (AI) have focused mainly on recognizing one type of activity from small and mono-centric surgical video datasets. It remains speculative whether those models would generalize to other centers.
Methods: In this work, we introduce a large multi-centric multi-activity dataset consisting of 140 surgical videos (MultiBypass140) of laparoscopic Roux-en-Y gastric bypass (LRYGB) surgeries performed at two medical centers, i.e., the University Hospital of Strasbourg, France (StrasBypass70) and Inselspital, Bern University Hospital, Switzerland (BernBypass70). The dataset has been fully annotated with phases and steps by two board-certified surgeons. Furthermore, we assess the generalizability and benchmark different deep learning models for the task of phase and step recognition in 7 experimental studies: (1) Training and evaluation on BernBypass70; (2) Training and evaluation on StrasBypass70; (3) Training and evaluation on the joint MultiBypass140 dataset; (4) Training on BernBypass70, evaluation on StrasBypass70; (5) Training on StrasBypass70, evaluation on BernBypass70; (6) Training on MultiBypass140, evaluation on BernBypass70; and (7) Training on MultiBypass140, evaluation on StrasBypass70.
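For reference, the seven configurations reduce to (training set, evaluation set) pairs; the sketch below expresses them that way, with `train_fn`/`eval_fn` as placeholders for whatever model pipeline is used (within-center experiments are assumed to use held-out splits of the named dataset):

```python
# The seven configurations as (training set, evaluation sets) pairs.
EXPERIMENTS = {
    1: ("BernBypass70", ["BernBypass70"]),
    2: ("StrasBypass70", ["StrasBypass70"]),
    3: ("MultiBypass140", ["MultiBypass140"]),
    4: ("BernBypass70", ["StrasBypass70"]),
    5: ("StrasBypass70", ["BernBypass70"]),
    6: ("MultiBypass140", ["BernBypass70"]),
    7: ("MultiBypass140", ["StrasBypass70"]),
}

def run_all(train_fn, eval_fn):
    """train_fn(dataset_name) -> model; eval_fn(model, dataset_name) -> metrics dict."""
    results = {}
    for exp_id, (train_set, eval_sets) in EXPERIMENTS.items():
        model = train_fn(train_set)
        results[exp_id] = {name: eval_fn(model, name) for name in eval_sets}
    return results
```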
Results: The model's performance is markedly influenced by the training data. The worst results were obtained in experiments (4) and (5), confirming the limited generalization capabilities of models trained on mono-centric data. The use of multi-centric training data, experiments (6) and (7), improves the generalization capabilities of the models, bringing them beyond the level of independent mono-centric training and validation (experiments (1) and (2)).
Conclusion: MultiBypass140 shows considerable variation in surgical technique and workflow of LRYGB procedures between centers. Therefore, generalization experiments demonstrate a remarkable difference in model performance. These results highlight the importance of multi-centric datasets for AI model generalization to account for variance in surgical technique and workflows. The dataset and code are publicly available at https://github.com/CAMMA-public/MultiBypass140.
{"title":"Challenges in multi-centric generalization: phase and step recognition in Roux-en-Y gastric bypass surgery.","authors":"Joël L Lavanchy, Sanat Ramesh, Diego Dall'Alba, Cristians Gonzalez, Paolo Fiorini, Beat P Müller-Stich, Philipp C Nett, Jacques Marescaux, Didier Mutter, Nicolas Padoy","doi":"10.1007/s11548-024-03166-3","DOIUrl":"10.1007/s11548-024-03166-3","url":null,"abstract":"<p><strong>Purpose: </strong>Most studies on surgical activity recognition utilizing artificial intelligence (AI) have focused mainly on recognizing one type of activity from small and mono-centric surgical video datasets. It remains speculative whether those models would generalize to other centers.</p><p><strong>Methods: </strong>In this work, we introduce a large multi-centric multi-activity dataset consisting of 140 surgical videos (MultiBypass140) of laparoscopic Roux-en-Y gastric bypass (LRYGB) surgeries performed at two medical centers, i.e., the University Hospital of Strasbourg, France (StrasBypass70) and Inselspital, Bern University Hospital, Switzerland (BernBypass70). The dataset has been fully annotated with phases and steps by two board-certified surgeons. Furthermore, we assess the generalizability and benchmark different deep learning models for the task of phase and step recognition in 7 experimental studies: (1) Training and evaluation on BernBypass70; (2) Training and evaluation on StrasBypass70; (3) Training and evaluation on the joint MultiBypass140 dataset; (4) Training on BernBypass70, evaluation on StrasBypass70; (5) Training on StrasBypass70, evaluation on BernBypass70; Training on MultiBypass140, (6) evaluation on BernBypass70 and (7) evaluation on StrasBypass70.</p><p><strong>Results: </strong>The model's performance is markedly influenced by the training data. The worst results were obtained in experiments (4) and (5) confirming the limited generalization capabilities of models trained on mono-centric data. The use of multi-centric training data, experiments (6) and (7), improves the generalization capabilities of the models, bringing them beyond the level of independent mono-centric training and validation (experiments (1) and (2)).</p><p><strong>Conclusion: </strong>MultiBypass140 shows considerable variation in surgical technique and workflow of LRYGB procedures between centers. Therefore, generalization experiments demonstrate a remarkable difference in model performance. These results highlight the importance of multi-centric datasets for AI model generalization to account for variance in surgical technique and workflows. The dataset and code are publicly available at https://github.com/CAMMA-public/MultiBypass140.</p>","PeriodicalId":51251,"journal":{"name":"International Journal of Computer Assisted Radiology and Surgery","volume":" ","pages":"2249-2257"},"PeriodicalIF":2.3,"publicationDate":"2024-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11541311/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140959178","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Purpose: Osteochondritis dissecans (OCD) of the humeral capitellum is a common cause of elbow disorders, particularly among young throwing athletes. Conservative treatment is preferred for managing OCD, and early intervention significantly influences the likelihood of complete disease resolution. The purpose of this study is to develop a deep learning-based classification model in ultrasound images for computer-aided diagnosis.
Methods: This paper proposes a deep learning-based OCD classification method in ultrasound images. The proposed method first detects the humeral capitellum using YOLO and then estimates the OCD probability of the detected region using VGG16. We hypothesize that performance is improved by eliminating unnecessary regions. To validate the proposed method, it was applied to 158 subjects (OCD: 67, normal: 91) using five-fold cross-validation.
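A hedged sketch of the two-stage pipeline described above, with the detector and classifier passed in as callables and the crop size and resize helper as illustrative assumptions rather than the paper's implementation:

```python
# Hypothetical two-stage inference: detect the capitellum, then classify the crop.
import numpy as np

def classify_ocd(image, detector, classifier, input_size=(224, 224)):
    """detector(image) -> (x1, y1, x2, y2); classifier(crop) -> OCD probability."""
    x1, y1, x2, y2 = detector(image)        # YOLO-style capitellum bounding box
    crop = resize_to(image[y1:y2, x1:x2], input_size)  # discard irrelevant regions
    return float(classifier(crop))          # VGG16-style probability head

def resize_to(crop, size):
    # Nearest-neighbor resize by index sampling; a real pipeline would use an
    # interpolating resize (e.g., from OpenCV or PIL).
    h, w = crop.shape[:2]
    rows = np.linspace(0, h - 1, size[0]).astype(int)
    cols = np.linspace(0, w - 1, size[1]).astype(int)
    return crop[np.ix_(rows, cols)]
```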
Results: The study demonstrated that the humeral capitellum detection achieved a mean average precision (mAP) of over 0.95, while OCD probability estimation achieved an average accuracy of 0.890, precision of 0.888, recall of 0.927, F1 score of 0.894, and an area under the curve (AUC) of 0.962. In contrast, when the classification model was constructed for the entire image, the accuracy, precision, recall, F1 score, and AUC were 0.806, 0.806, 0.932, 0.843, and 0.928, respectively. These findings suggest the high-performance potential of the proposed model for OCD classification in ultrasound images.
Conclusion: This paper introduces a deep learning-based OCD classification method. The experimental results emphasize the effectiveness of focusing on the humeral capitellum for OCD classification in ultrasound images. Future work should evaluate the effectiveness of the proposed method when used by physicians during medical check-ups for OCD.
{"title":"Deep learning-based osteochondritis dissecans detection in ultrasound images with humeral capitellum localization.","authors":"Kenta Sasaki, Daisuke Fujita, Kenta Takatsuji, Yoshihiro Kotoura, Masataka Minami, Yusuke Kobayashi, Tsuyoshi Sukenari, Yoshikazu Kida, Kenji Takahashi, Syoji Kobashi","doi":"10.1007/s11548-023-03040-8","DOIUrl":"10.1007/s11548-023-03040-8","url":null,"abstract":"<p><strong>Purpose: </strong>Osteochondritis dissecans (OCD) of the humeral capitellum is a common cause of elbow disorders, particularly among young throwing athletes. Conservative treatment is the preferred treatment for managing OCD, and early intervention significantly influences the possibility of complete disease resolution. The purpose of this study is to develop a deep learning-based classification model in ultrasound images for computer-aided diagnosis.</p><p><strong>Methods: </strong>This paper proposes a deep learning-based OCD classification method in ultrasound images. The proposed method first detects the humeral capitellum detection using YOLO and then estimates the OCD probability of the detected region probability using VGG16. We hypothesis that the performance will be improved by eliminating unnecessary regions. To validate the performance of the proposed method, it was applied to 158 subjects (OCD: 67, Normal: 91) using five-fold-cross-validation.</p><p><strong>Results: </strong>The study demonstrated that the humeral capitellum detection achieved a mean average precision (mAP) of over 0.95, while OCD probability estimation achieved an average accuracy of 0.890, precision of 0.888, recall of 0.927, F1 score of 0.894, and an area under the curve (AUC) of 0.962. On the other hand, when the classification model was constructed for the entire image, accuracy, precision, recall, F1 score, and AUC were 0.806, 0.806, 0.932, 0.843, and 0.928, respectively. The findings suggest the high-performance potential of the proposed model for OCD classification in ultrasonic images.</p><p><strong>Conclusion: </strong>This paper introduces a deep learning-based OCD classification method. The experimental results emphasize the effectiveness of focusing on the humeral capitellum for OCD classification in ultrasound images. Future work should involve evaluating the effectiveness of employing the proposed method by physicians during medical check-ups for OCD.</p>","PeriodicalId":51251,"journal":{"name":"International Journal of Computer Assisted Radiology and Surgery","volume":" ","pages":"2143-2152"},"PeriodicalIF":2.3,"publicationDate":"2024-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11541362/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139486838","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date: 2024-11-01 | Epub Date: 2024-09-25 | DOI: 10.1007/s11548-024-03248-2
Marzieh Ershad Langroodi, Xi Liu, Mark R Tousignant, Anthony M Jarc
Purpose: Surgical skill evaluation that relies on subjective scoring of surgical videos can be time-consuming and inconsistent across raters. We demonstrate differentiated opportunities for objective evaluation to improve surgeon training and performance.
Methods: Subjective evaluation was performed using the Global Evaluative Assessment of Robotic Skills (GEARS) from both expert and crowd raters, whereas objective evaluation used objective performance indicators (OPIs) derived from da Vinci surgical systems. Classifiers were trained for each evaluation method to distinguish between surgical expertise levels. This study includes one clinical task from a case series of robotic-assisted sleeve gastrectomy procedures performed by a single surgeon, and two training tasks performed by novice and expert surgeons, i.e., surgeons with no experience in robotic-assisted surgery (RAS) and those with more than 500 RAS procedures.
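The classifier family is not specified in this abstract; as a hedged sketch of the comparison pattern only (logistic regression and five-fold cross-validation are illustrative choices, not necessarily the study's), one could fit one model per feature set and compare cross-validated accuracy:

```python
# Comparison pattern only: one classifier per feature set, same labels and folds.
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import StratifiedKFold, cross_val_score

def compare_feature_sets(opi_features, gears_scores, labels, n_splits=5):
    """labels: 0 = novice (or early cases), 1 = expert (or late cases)."""
    cv = StratifiedKFold(n_splits=n_splits, shuffle=True, random_state=0)
    results = {}
    for name, X in {"OPI": opi_features, "GEARS": gears_scores}.items():
        model = LogisticRegression(max_iter=1000)
        scores = cross_val_score(model, X, labels, cv=cv, scoring="accuracy")
        results[name] = (scores.mean(), scores.std())
    return results
```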
Results: When comparing expert and novice skill levels, the OPI-based classifier showed significantly higher accuracy than the GEARS-based classifier on the more complex dissection task (OPI 0.93 ± 0.08 vs. GEARS 0.67 ± 0.18; 95% CI, 0.16-0.37; p = 0.02), but no significant difference was observed on the simpler suturing task. For the single-surgeon case series, both classifiers performed well when differentiating between early and late group cases with smaller group sizes and larger intervals between groups (OPI 0.9 ± 0.08; GEARS 0.87 ± 0.12; 95% CI, 0.02-0.04; p = 0.67). When the group size was increased to include more cases, thereby leaving smaller intervals between groups, OPIs demonstrated significantly higher accuracy (OPI 0.97 ± 0.06; GEARS 0.76 ± 0.07; 95% CI, 0.12-0.28; p = 0.004) in differentiating between early and late cases.
Conclusions: Objective methods for skill evaluation in RAS outperform subjective methods when (1) differentiating expertise in a technically challenging training task, and (2) identifying more granular differences along early versus late phases of a surgeon learning curve within a clinical task. Objective methods offer an opportunity for more accessible and scalable skill evaluation in RAS.
{"title":"Objective performance indicators versus GEARS: an opportunity for more accurate assessment of surgical skill.","authors":"Marzieh Ershad Langroodi, Xi Liu, Mark R Tousignant, Anthony M Jarc","doi":"10.1007/s11548-024-03248-2","DOIUrl":"10.1007/s11548-024-03248-2","url":null,"abstract":"<p><strong>Purpose: </strong>Surgical skill evaluation that relies on subjective scoring of surgical videos can be time-consuming and inconsistent across raters. We demonstrate differentiated opportunities for objective evaluation to improve surgeon training and performance.</p><p><strong>Methods: </strong>Subjective evaluation was performed using the Global evaluative assessment of robotic skills (GEARS) from both expert and crowd raters; whereas, objective evaluation used objective performance indicators (OPIs) derived from da Vinci surgical systems. Classifiers were trained for each evaluation method to distinguish between surgical expertise levels. This study includes one clinical task from a case series of robotic-assisted sleeve gastrectomy procedures performed by a single surgeon, and two training tasks performed by novice and expert surgeons, i.e., surgeons with no experience in robotic-assisted surgery (RAS) and those with more than 500 RAS procedures.</p><p><strong>Results: </strong>When comparing expert and novice skill levels, OPI-based classifier showed significantly higher accuracy than GEARS-based classifier on the more complex dissection task (OPI 0.93 ± 0.08 vs. GEARS 0.67 ± 0.18; 95% CI, 0.16-0.37; p = 0.02), but no significant difference was shown on the simpler suturing task. For the single-surgeon case series, both classifiers performed well when differentiating between early and late group cases with smaller group sizes and larger intervals between groups (OPI 0.9 ± 0.08; GEARS 0.87 ± 0.12; 95% CI, 0.02-0.04; p = 0.67). When increasing the group size to include more cases, thereby having smaller intervals between groups, OPIs demonstrated significantly higher accuracy (OPI 0.97 ± 0.06; GEARS 0.76 ± 0.07; 95% CI, 0.12-0.28; p = 0.004) in differentiating between the early/late cases.</p><p><strong>Conclusions: </strong>Objective methods for skill evaluation in RAS outperform subjective methods when (1) differentiating expertise in a technically challenging training task, and (2) identifying more granular differences along early versus late phases of a surgeon learning curve within a clinical task. Objective methods offer an opportunity for more accessible and scalable skill evaluation in RAS.</p>","PeriodicalId":51251,"journal":{"name":"International Journal of Computer Assisted Radiology and Surgery","volume":" ","pages":"2259-2267"},"PeriodicalIF":2.3,"publicationDate":"2024-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142332054","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date: 2024-11-01 | Epub Date: 2024-06-07 | DOI: 10.1007/s11548-024-03193-0
E Cramer, A B Kucharski, J Kreimeier, S Andreß, S Li, C Walk, F Merkl, J Högl, P Wucherer, P Stefan, R von Eisenhart-Rothe, P Enste, D Roth
Purpose: We aim to investigate the integration of augmented reality (AR) within the context of increasingly complex surgical procedures and instrument handling toward the transition to smart operating rooms (OR). In contrast to cumbersome paper-based surgical instrument manuals still used in the OR, we wish to provide surgical staff with an AR head-mounted display that provides in-situ visualization and guidance throughout the assembly process of surgical instruments. Our requirement analysis supports the development and provides guidelines for its transfer into surgical practice.
Methods: A three-phase user-centered design approach was applied with online interviews, an observational study, and a workshop with two focus groups with scrub nurses, circulating nurses, surgeons, manufacturers, clinic IT staff, and members of the sterilization department. The requirement analysis was based on key criteria for usability. The data were analyzed via structured content analysis.
Results: We identified twelve main problems with the current use of paper manuals. Major issues included sterile users' inability to directly handle non-sterile manuals, missing details, and excessive text information, potentially delaying procedure performance. Major requirements for AR-driven guidance fall into the categories of design, practicability, control, and integration into the current workflow. Additionally, further recommendations for technical development could be obtained.
Conclusion: Our insights outline a comprehensive spectrum of requirements that are essential for the successful implementation of AI- and AR-driven guidance for assembling surgical instruments. The consistently appreciative evaluation by stakeholders underscores the profound potential of AR and AI technology as valuable assistance and guidance.
{"title":"Requirement analysis for an AI-based AR assistance system for surgical tools in the operating room: stakeholder requirements and technical perspectives.","authors":"E Cramer, A B Kucharski, J Kreimeier, S Andreß, S Li, C Walk, F Merkl, J Högl, P Wucherer, P Stefan, R von Eisenhart-Rothe, P Enste, D Roth","doi":"10.1007/s11548-024-03193-0","DOIUrl":"10.1007/s11548-024-03193-0","url":null,"abstract":"<p><strong>Purpose: </strong>We aim to investigate the integration of augmented reality (AR) within the context of increasingly complex surgical procedures and instrument handling toward the transition to smart operating rooms (OR). In contrast to cumbersome paper-based surgical instrument manuals still used in the OR, we wish to provide surgical staff with an AR head-mounted display that provides in-situ visualization and guidance throughout the assembly process of surgical instruments. Our requirement analysis supports the development and provides guidelines for its transfer into surgical practice.</p><p><strong>Methods: </strong>A three-phase user-centered design approach was applied with online interviews, an observational study, and a workshop with two focus groups with scrub nurses, circulating nurses, surgeons, manufacturers, clinic IT staff, and members of the sterilization department. The requirement analysis was based on key criteria for usability. The data were analyzed via structured content analysis.</p><p><strong>Results: </strong>We identified twelve main problems with the current use of paper manuals. Major issues included sterile users' inability to directly handle non-sterile manuals, missing details, and excessive text information, potentially delaying procedure performance. Major requirements for AR-driven guidance fall into the categories of design, practicability, control, and integration into the current workflow. Additionally, further recommendations for technical development could be obtained.</p><p><strong>Conclusion: </strong>In conclusion, our insights have outlined a comprehensive spectrum of requirements that are essential for the successful implementation of an AI- and AR-driven guidance for assembling surgical instruments. The consistently appreciative evaluation by stakeholders underscores the profound potential of AR and AI technology as valuable assistance and guidance.</p>","PeriodicalId":51251,"journal":{"name":"International Journal of Computer Assisted Radiology and Surgery","volume":" ","pages":"2287-2296"},"PeriodicalIF":2.3,"publicationDate":"2024-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11541324/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141285346","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}