Patch-wise vector quantization for unsupervised medical anomaly detection

IF 3.9 3区计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pattern Recognition Letters Pub Date : 2024-06-28 DOI:10.1016/j.patrec.2024.06.028

Taejune Kim , Yun-Gyoo Lee , Inho Jeong , Soo-Youn Ham , Simon S. Woo

{"title":"Patch-wise vector quantization for unsupervised medical anomaly detection","authors":"Taejune Kim , Yun-Gyoo Lee , Inho Jeong , Soo-Youn Ham , Simon S. Woo","doi":"10.1016/j.patrec.2024.06.028","DOIUrl":null,"url":null,"abstract":"<div><p>Radiography images inherently possess globally consistent structures while exhibiting significant diversity in local anatomical regions, making it challenging to model their normal features through unsupervised anomaly detection. Since unsupervised anomaly detection methods localize anomalies by utilizing discrepancies between learned normal features and input abnormal features, previous studies introduce a memory structure to capture the normal features of radiography images. However, these approaches store extremely localized image segments in their memory, causing the model to represent both normal and pathological features with the stored components. This poses a significant challenge in unsupervised anomaly detection by reducing the disparity between learned features and abnormal features. Furthermore, with the diverse settings in radiography imaging, the above issue is exacerbated: more diversity in the normal images results in stronger representation of pathological features. To resolve the issues above, we propose a novel pathology detection method called Patch-wise Vector Quantization (P-VQ). Unlike the previous methods, P-VQ learns vector-quantized representations of normal “patches” while preserving its spatial information by incorporating vector similarity metric. Furthermore, we introduce a novel method for selecting features in the memory to further enhance the robustness against diverse imaging settings. P-VQ even mitigates the “index collapse” problem of vector quantization by proposing top-<span><math><mrow><mi>k</mi><mtext>%</mtext></mrow></math></span> dropout. Our extensive experiments on the BMAD benchmark demonstrate the superior performance of P-VQ against existing state-of-the-art methods.</p></div>","PeriodicalId":54638,"journal":{"name":"Pattern Recognition Letters","volume":"184 ","pages":"Pages 205-211"},"PeriodicalIF":3.9000,"publicationDate":"2024-06-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Pattern Recognition Letters","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S016786552400196X","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

Abstract

Radiography images inherently possess globally consistent structures while exhibiting significant diversity in local anatomical regions, making it challenging to model their normal features through unsupervised anomaly detection. Since unsupervised anomaly detection methods localize anomalies by utilizing discrepancies between learned normal features and input abnormal features, previous studies introduce a memory structure to capture the normal features of radiography images. However, these approaches store extremely localized image segments in their memory, causing the model to represent both normal and pathological features with the stored components. This poses a significant challenge in unsupervised anomaly detection by reducing the disparity between learned features and abnormal features. Furthermore, with the diverse settings in radiography imaging, the above issue is exacerbated: more diversity in the normal images results in stronger representation of pathological features. To resolve the issues above, we propose a novel pathology detection method called Patch-wise Vector Quantization (P-VQ). Unlike the previous methods, P-VQ learns vector-quantized representations of normal “patches” while preserving its spatial information by incorporating vector similarity metric. Furthermore, we introduce a novel method for selecting features in the memory to further enhance the robustness against diverse imaging settings. P-VQ even mitigates the “index collapse” problem of vector quantization by proposing top- $k %$ dropout. Our extensive experiments on the BMAD benchmark demonstrate the superior performance of P-VQ against existing state-of-the-art methods.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

用于无监督医学异常检测的补丁矢量量化技术

放射成像图像本身具有全局一致的结构，但在局部解剖区域却表现出显著的多样性，这使得通过无监督异常检测对其正常特征建模具有挑战性。由于无监督异常检测方法是利用学习到的正常特征与输入的异常特征之间的差异来定位异常的，因此之前的研究引入了一种记忆结构来捕捉放射影像的正常特征。然而，这些方法会在内存中存储极其局部化的图像片段，导致模型既要用存储的组件来表示正常特征，又要用存储的组件来表示病理特征。这给无监督异常检测带来了巨大挑战，因为要减少学习到的特征与异常特征之间的差异。此外，由于放射成像的设置多种多样，上述问题更加严重：正常图像的多样性越多，病理特征的代表性就越强。为了解决上述问题，我们提出了一种新的病理检测方法--"补丁式矢量量化（Patch-wise Vector Quantization，P-VQ）"。与之前的方法不同，P-VQ 在学习正常 "斑块 "的矢量量化表示的同时，还通过矢量相似度量来保留其空间信息。此外，我们还引入了一种在内存中选择特征的新方法，以进一步增强对不同成像环境的鲁棒性。P-VQ 甚至通过提出 top-k% dropout 来缓解矢量量化的 "索引崩溃 "问题。我们在 BMAD 基准上进行的大量实验证明，与现有的先进方法相比，P-VQ 的性能更加优越。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Pattern Recognition Letters 工程技术-计算机：人工智能

CiteScore

12.40

自引率

5.90%

发文量

287

审稿时长

9.1 months

期刊介绍： Pattern Recognition Letters aims at rapid publication of concise articles of a broad interest in pattern recognition. Subject areas include all the current fields of interest represented by the Technical Committees of the International Association of Pattern Recognition, and other developing themes involving learning and recognition.