Investigating Vision Transformer Models for Low-Resolution Medical Image Recognition

2021 18th International Computer Conference on Wavelet Active Media Technology and Information Processing (ICCWAMTIP) Pub Date : 2021-12-17 DOI:10.1109/ICCWAMTIP53232.2021.9674065

Isaac Adjei-Mensah, Xiaoling Zhang, Adu Asare Baffour, Isaac Osei Agyemang, S. B. Yussif, B. L. Y. Agbley, Collins Sey

引用次数: 3

Abstract

Vision Transformers use self-attention techniques to learn long-range spatial relations to focus on the relevant parts of an image. They have achieved state-of-the-art results in many computer vision tasks. Recently, some methods have to leverage Vision Transformer-based models to tackle tasks in medical imaging. However, Vision Transformer emphasizes the low-resolution features due to the repetitive downsamplings, which result in a loss or lack of detailed localization information, making it highly unfit for low-level image recognition. In this paper, we investigate the performance of Vision Transformer on low-level medical images and contrast it with convolutional neural networks. The experimental results show that Convolutional Neural Network outperforms the Vision Transformer-based models on all four datasets.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

研究用于低分辨率医学图像识别的视觉变形模型

视觉变形器使用自关注技术来学习远程空间关系，以聚焦于图像的相关部分。他们在许多计算机视觉任务中取得了最先进的成果。最近，一些方法必须利用基于视觉变换的模型来解决医学成像中的任务。然而，Vision Transformer强调由于重复下采样而导致的低分辨率特征，这导致丢失或缺乏详细的定位信息，使其非常不适合低级图像识别。在本文中，我们研究了Vision Transformer在低水平医学图像上的性能，并将其与卷积神经网络进行了对比。实验结果表明，卷积神经网络在所有四种数据集上都优于基于视觉变换的模型。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

2021 18th International Computer Conference on Wavelet Active Media Technology and Information Processing (ICCWAMTIP)

自引率

0.00%

发文量

期刊最新文献

Joint Modulation and Coding Recognition Using Deep Learning Chinese Short Text Classification Based On Deep Learning Solving TPS by SA Based on Probabilistic Double Crossover Operator Personalized Federated Learning with Gradient Similarity Implicit Certificate Based Signcryption for a Secure Data Sharing in Clouds