Data augmentation, labelling, and imperfections : second MICCAI workshop, DALI 2022, held in conjunction with MICCAI 2022, Singapore, September 22, 2022, proceedings. DALI (Workshop) (2nd : 2022 : Singapore)最新文献

Long-Tailed Classification of Thorax Diseases on Chest X-Ray: A New Benchmark Study. 胸部 X 光片胸腔疾病的长尾分类：新基准研究

Data augmentation, labelling, and imperfections : second MICCAI workshop, DALI 2022, held in conjunction with MICCAI 2022, Singapore, September 22, 2022, proceedings. DALI (Workshop) (2nd : 2022 : Singapore)

Pub Date : 2022-09-01 Epub Date: 2022-09-16 DOI: 10.1007/978-3-031-17027-0_3

Gregory Holste, Song Wang, Ziyu Jiang, Thomas C Shen, George Shih, Ronald M Summers, Yifan Peng, Zhangyang Wang

Imaging exams, such as chest radiography, will yield a small set of common findings and a much larger set of uncommon findings. While a trained radiologist can learn the visual presentation of rare conditions by studying a few representative examples, teaching a machine to learn from such a "long-tailed" distribution is much more difficult, as standard methods would be easily biased toward the most frequent classes. In this paper, we present a comprehensive benchmark study of the long-tailed learning problem in the specific domain of thorax diseases on chest X-rays. We focus on learning from naturally distributed chest X-ray data, optimizing classification accuracy over not only the common "head" classes, but also the rare yet critical "tail" classes. To accomplish this, we introduce a challenging new long-tailed chest X-ray benchmark to facilitate research on developing long-tailed learning methods for medical image classification. The benchmark consists of two chest X-ray datasets for 19- and 20-way thorax disease classification, containing classes with as many as 53,000 and as few as 7 labeled training images. We evaluate both standard and state-of-the-art long-tailed learning methods on this new benchmark, analyzing which aspects of these methods are most beneficial for long-tailed medical image classification and summarizing insights for future algorithm design. The datasets, trained models, and code are available at https://github.com/VITA-Group/LongTailCXR.

成像检查（如胸片）会产生一小部分常见的检查结果和一大部分不常见的检查结果。虽然训练有素的放射科医生可以通过研究一些有代表性的例子来学习罕见病症的视觉表现，但让机器从这种 "长尾 "分布中学习却要困难得多，因为标准方法很容易偏向最常见的类别。在本文中，我们针对胸部 X 光片上的胸部疾病这一特定领域的长尾学习问题进行了全面的基准研究。我们的研究重点是从自然分布的胸部 X 光数据中学习，不仅要优化常见 "头部 "类别的分类准确性，还要优化罕见但关键的 "尾部 "类别的分类准确性。为此，我们引入了一个具有挑战性的新长尾胸部 X 光基准，以促进医学图像分类长尾学习方法的开发研究。该基准由两个胸部 X 光数据集组成，分别用于 19 路和 20 路胸部疾病分类，包含多达 53,000 个类别和少至 7 个标记的训练图像。我们在这个新基准上评估了标准的和最先进的长尾学习方法，分析了这些方法的哪些方面最有利于长尾医学图像分类，并总结了对未来算法设计的启示。数据集、训练模型和代码可在 https://github.com/VITA-Group/LongTailCXR 上获取。

{"title":"Long-Tailed Classification of Thorax Diseases on Chest X-Ray: A New Benchmark Study.","authors":"Gregory Holste, Song Wang, Ziyu Jiang, Thomas C Shen, George Shih, Ronald M Summers, Yifan Peng, Zhangyang Wang","doi":"10.1007/978-3-031-17027-0_3","DOIUrl":"10.1007/978-3-031-17027-0_3","url":null,"abstract":"Imaging exams, such as chest radiography, will yield a small set of common findings and a much larger set of uncommon findings. While a trained radiologist can learn the visual presentation of rare conditions by studying a few representative examples, teaching a machine to learn from such a \"long-tailed\" distribution is much more difficult, as standard methods would be easily biased toward the most frequent classes. In this paper, we present a comprehensive benchmark study of the long-tailed learning problem in the specific domain of thorax diseases on chest X-rays. We focus on learning from naturally distributed chest X-ray data, optimizing classification accuracy over not only the common \"head\" classes, but also the rare yet critical \"tail\" classes. To accomplish this, we introduce a challenging new long-tailed chest X-ray benchmark to facilitate research on developing long-tailed learning methods for medical image classification. The benchmark consists of two chest X-ray datasets for 19- and 20-way thorax disease classification, containing classes with as many as 53,000 and as few as 7 labeled training images. We evaluate both standard and state-of-the-art long-tailed learning methods on this new benchmark, analyzing which aspects of these methods are most beneficial for long-tailed medical image classification and summarizing insights for future algorithm design. The datasets, trained models, and code are available at https://github.com/VITA-Group/LongTailCXR.","PeriodicalId":93741,"journal":{"name":"Data augmentation, labelling, and imperfections : second MICCAI workshop, DALI 2022, held in conjunction with MICCAI 2022, Singapore, September 22, 2022, proceedings. DALI (Workshop) (2nd : 2022 : Singapore)","volume":"13567 ","pages":"22-32"},"PeriodicalIF":0.0,"publicationDate":"2022-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9618235/pdf/nihms-1844023.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"40660013","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Few-Shot Learning Geometric Ensemble for Multi-label Classification of Chest X-Rays. 胸片多标记分类的少射学习几何集成。

Data augmentation, labelling, and imperfections : second MICCAI workshop, DALI 2022, held in conjunction with MICCAI 2022, Singapore, September 22, 2022, proceedings. DALI (Workshop) (2nd : 2022 : Singapore)

Pub Date : 2022-09-01 Epub Date: 2022-09-16 DOI: 10.1007/978-3-031-17027-0_12

Dana Moukheiber, Saurabh Mahindre, Lama Moukheiber, Mira Moukheiber, Song Wang, Chunwei Ma, George Shih, Yifan Peng, Mingchen Gao

This paper aims to identify uncommon cardiothoracic diseases and patterns on chest X-ray images. Training a machine learning model to classify rare diseases with multi-label indications is challenging without sufficient labeled training samples. Our model leverages the information from common diseases and adapts to perform on less common mentions. We propose to use multi-label few-shot learning (FSL) schemes including neighborhood component analysis loss, generating additional samples using distribution calibration and fine-tuning based on multi-label classification loss. We utilize the fact that the widely adopted nearest neighbor-based FSL schemes like ProtoNet are Voronoi diagrams in feature space. In our method, the Voronoi diagrams in the features space generated from multi-label schemes are combined into our geometric DeepVoro Multi-label ensemble. The improved performance in multi-label few-shot classification using the multi-label ensemble is demonstrated in our experiments (The code is publicly available at https://github.com/Saurabh7/Few-shot-learning-multilabel-cxray).

本文的目的是识别不常见的心胸疾病和胸片上的模式。在没有足够的标记训练样本的情况下，训练机器学习模型对具有多标签适应症的罕见病进行分类是具有挑战性的。我们的模型利用来自常见疾病的信息，并适应不太常见的提及。我们建议使用包含邻域成分分析损失的多标签少射学习(FSL)方案，使用分布校准和基于多标签分类损失的微调来生成额外的样本。我们利用了广泛采用的基于最近邻的FSL方案(如ProtoNet)是特征空间中的Voronoi图这一事实。在我们的方法中，由多标签方案生成的特征空间中的Voronoi图被组合到我们的几何DeepVoro多标签集成中。我们的实验证明了使用多标签集成在多标签少镜头分类中的改进性能(代码可在https://github.com/Saurabh7/Few-shot-learning-multilabel-cxray上公开获得)。

{"title":"Few-Shot Learning Geometric Ensemble for Multi-label Classification of Chest X-Rays.","authors":"Dana Moukheiber, Saurabh Mahindre, Lama Moukheiber, Mira Moukheiber, Song Wang, Chunwei Ma, George Shih, Yifan Peng, Mingchen Gao","doi":"10.1007/978-3-031-17027-0_12","DOIUrl":"https://doi.org/10.1007/978-3-031-17027-0_12","url":null,"abstract":"This paper aims to identify uncommon cardiothoracic diseases and patterns on chest X-ray images. Training a machine learning model to classify rare diseases with multi-label indications is challenging without sufficient labeled training samples. Our model leverages the information from common diseases and adapts to perform on less common mentions. We propose to use multi-label few-shot learning (FSL) schemes including neighborhood component analysis loss, generating additional samples using distribution calibration and fine-tuning based on multi-label classification loss. We utilize the fact that the widely adopted nearest neighbor-based FSL schemes like ProtoNet are Voronoi diagrams in feature space. In our method, the Voronoi diagrams in the features space generated from multi-label schemes are combined into our geometric DeepVoro Multi-label ensemble. The improved performance in multi-label few-shot classification using the multi-label ensemble is demonstrated in our experiments (The code is publicly available at https://github.com/Saurabh7/Few-shot-learning-multilabel-cxray).","PeriodicalId":93741,"journal":{"name":"Data augmentation, labelling, and imperfections : second MICCAI workshop, DALI 2022, held in conjunction with MICCAI 2022, Singapore, September 22, 2022, proceedings. DALI (Workshop) (2nd : 2022 : Singapore)","volume":"13567 ","pages":"112-122"},"PeriodicalIF":0.0,"publicationDate":"2022-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9652771/pdf/nihms-1846293.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"40490560","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 3

The Aesthetics of Imperfection in Everyday Life 日常生活中的不完美美学

Data augmentation, labelling, and imperfections : second MICCAI workshop, DALI 2022, held in conjunction with MICCAI 2022, Singapore, September 22, 2022, proceedings. DALI (Workshop) (2nd : 2022 : Singapore)

Pub Date : 2022-01-01 DOI: 10.5040/9781501380303.ch-001

Yuriko Saito