增强肝癌诊断的鲁棒性：具有轻量级融合和有效数据增强功能的多模态对比学习器

ACM transactions on computing for healthcare Pub Date : 2023-12-30 DOI:10.1145/3639414

Pei-Xuan Li, Hsun-Ping Hsieh, Chiang Fan Yang, Ding-You Wu, Ching-Chung Ko

{"title":"增强肝癌诊断的鲁棒性：具有轻量级融合和有效数据增强功能的多模态对比学习器","authors":"Pei-Xuan Li, Hsun-Ping Hsieh, Chiang Fan Yang, Ding-You Wu, Ching-Chung Ko","doi":"10.1145/3639414","DOIUrl":null,"url":null,"abstract":"This paper explores the application of self-supervised contrastive learning in the medical domain, focusing on classification of multi-modality Magnetic Resonance (MR) images. To address the challenges of limited and hard-to-annotate medical data, we introduce multi-modality data augmentation (MDA) and cross-modality group convolution (CGC). In the pre-training phase, we leverage Simple Siamese networks to maximize the similarity between two augmented MR images from a patient, without a handcrafted pretext task. Our approach also combines 3D and 2D group convolution with a channel shuffle operation to efficiently incorporate different modalities of image features. Evaluation on liver MR images from a well-known hospital in Taiwan demonstrates a significant improvement over previous methods. This work contributes to advancing multi-modality contrastive learning, particularly in the context of medical imaging, offering enhanced tools for analyzing complex image data.","PeriodicalId":72043,"journal":{"name":"ACM transactions on computing for healthcare","volume":" 19","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2023-12-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Enhancing Robust Liver Cancer Diagnosis: A Contrastive Multi-Modality Learner with Lightweight Fusion and Effective Data Augmentation\",\"authors\":\"Pei-Xuan Li, Hsun-Ping Hsieh, Chiang Fan Yang, Ding-You Wu, Ching-Chung Ko\",\"doi\":\"10.1145/3639414\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper explores the application of self-supervised contrastive learning in the medical domain, focusing on classification of multi-modality Magnetic Resonance (MR) images. To address the challenges of limited and hard-to-annotate medical data, we introduce multi-modality data augmentation (MDA) and cross-modality group convolution (CGC). In the pre-training phase, we leverage Simple Siamese networks to maximize the similarity between two augmented MR images from a patient, without a handcrafted pretext task. Our approach also combines 3D and 2D group convolution with a channel shuffle operation to efficiently incorporate different modalities of image features. Evaluation on liver MR images from a well-known hospital in Taiwan demonstrates a significant improvement over previous methods. This work contributes to advancing multi-modality contrastive learning, particularly in the context of medical imaging, offering enhanced tools for analyzing complex image data.\",\"PeriodicalId\":72043,\"journal\":{\"name\":\"ACM transactions on computing for healthcare\",\"volume\":\" 19\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-12-30\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"ACM transactions on computing for healthcare\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3639414\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"ACM transactions on computing for healthcare","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3639414","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

本文探讨了自监督对比学习在医疗领域的应用，重点是多模态磁共振（MR）图像的分类。为了应对医疗数据有限且难以标注的挑战，我们引入了多模态数据增强（MDA）和跨模态群卷积（CGC）。在预训练阶段，我们利用简单连体网络（Simple Siamese networks）来最大化患者两幅增强磁共振图像之间的相似性，而无需手工制作借口任务。我们的方法还将三维和二维群卷积与通道洗牌操作相结合，有效地整合了不同模式的图像特征。在台湾一家知名医院的肝脏磁共振图像上进行的评估表明，我们的方法比以前的方法有了显著的改进。这项工作有助于推进多模态对比学习，尤其是在医学成像方面，为分析复杂图像数据提供了更强大的工具。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Enhancing Robust Liver Cancer Diagnosis: A Contrastive Multi-Modality Learner with Lightweight Fusion and Effective Data Augmentation

This paper explores the application of self-supervised contrastive learning in the medical domain, focusing on classification of multi-modality Magnetic Resonance (MR) images. To address the challenges of limited and hard-to-annotate medical data, we introduce multi-modality data augmentation (MDA) and cross-modality group convolution (CGC). In the pre-training phase, we leverage Simple Siamese networks to maximize the similarity between two augmented MR images from a patient, without a handcrafted pretext task. Our approach also combines 3D and 2D group convolution with a channel shuffle operation to efficiently incorporate different modalities of image features. Evaluation on liver MR images from a well-known hospital in Taiwan demonstrates a significant improvement over previous methods. This work contributes to advancing multi-modality contrastive learning, particularly in the context of medical imaging, offering enhanced tools for analyzing complex image data.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

ACM transactions on computing for healthcare

CiteScore

10.30

自引率

0.00%

发文量