Ethan Tu, Jonathan Burkow, Andy Tsai, Joseph Junewick, Francisco A Perez, Jeffrey Otjen, Adam M Alessio
{"title":"Near-pair patch generative adversarial network for data augmentation of focal pathology object detection models.","authors":"Ethan Tu, Jonathan Burkow, Andy Tsai, Joseph Junewick, Francisco A Perez, Jeffrey Otjen, Adam M Alessio","doi":"10.1117/1.JMI.11.3.034505","DOIUrl":null,"url":null,"abstract":"<p><strong>Purpose: </strong>The limited volume of medical training data remains one of the leading challenges for machine learning for diagnostic applications. Object detectors that identify and localize pathologies require training with a large volume of labeled images, which are often expensive and time-consuming to curate. To reduce this challenge, we present a method to support distant supervision of object detectors through generation of synthetic pathology-present labeled images.</p><p><strong>Approach: </strong>Our method employs the previously proposed cyclic generative adversarial network (cycleGAN) with two key innovations: (1) use of \"near-pair\" pathology-present regions and pathology-absent regions from similar locations in the same subject for training and (2) the addition of a realism metric (Fréchet inception distance) to the generator loss term. We trained and tested this method with 2800 fracture-present and 2800 fracture-absent image patches from 704 unique pediatric chest radiographs. The trained model was then used to generate synthetic pathology-present images with exact knowledge of location (labels) of the pathology. These synthetic images provided an augmented training set for an object detector.</p><p><strong>Results: </strong>In an observer study, four pediatric radiologists used a five-point Likert scale indicating the likelihood of a real fracture (1 = definitely not a fracture and 5 = definitely a fracture) to grade a set of real fracture-absent, real fracture-present, and synthetic fracture-present images. The real fracture-absent images scored <math><mrow><mn>1.7</mn><mo>±</mo><mn>1.0</mn></mrow></math>, real fracture-present images <math><mrow><mn>4.1</mn><mo>±</mo><mn>1.2</mn></mrow></math>, and synthetic fracture-present images <math><mrow><mn>2.5</mn><mo>±</mo><mn>1.2</mn></mrow></math>. An object detector model (YOLOv5) trained on a mix of 500 real and 500 synthetic radiographs performed with a recall of <math><mrow><mn>0.57</mn><mo>±</mo><mn>0.05</mn></mrow></math> and an <math><mrow><mi>F</mi><mn>2</mn></mrow></math> score of <math><mrow><mn>0.59</mn><mo>±</mo><mn>0.05</mn></mrow></math>. In comparison, when trained on only 500 real radiographs, the recall and <math><mrow><mi>F</mi><mn>2</mn></mrow></math> score were <math><mrow><mn>0.49</mn><mo>±</mo><mn>0.06</mn></mrow></math> and <math><mrow><mn>0.53</mn><mo>±</mo><mn>0.06</mn></mrow></math>, respectively.</p><p><strong>Conclusions: </strong>Our proposed method generates visually realistic pathology and that provided improved object detector performance for the task of rib fracture detection.</p>","PeriodicalId":47707,"journal":{"name":"Journal of Medical Imaging","volume":"11 3","pages":"034505"},"PeriodicalIF":1.9000,"publicationDate":"2024-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11149891/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Medical Imaging","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1117/1.JMI.11.3.034505","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/6/4 0:00:00","PubModel":"Epub","JCR":"Q3","JCRName":"RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING","Score":null,"Total":0}
引用次数: 0
Abstract
Purpose: The limited volume of medical training data remains one of the leading challenges for machine learning for diagnostic applications. Object detectors that identify and localize pathologies require training with a large volume of labeled images, which are often expensive and time-consuming to curate. To reduce this challenge, we present a method to support distant supervision of object detectors through generation of synthetic pathology-present labeled images.
Approach: Our method employs the previously proposed cyclic generative adversarial network (cycleGAN) with two key innovations: (1) use of "near-pair" pathology-present regions and pathology-absent regions from similar locations in the same subject for training and (2) the addition of a realism metric (Fréchet inception distance) to the generator loss term. We trained and tested this method with 2800 fracture-present and 2800 fracture-absent image patches from 704 unique pediatric chest radiographs. The trained model was then used to generate synthetic pathology-present images with exact knowledge of location (labels) of the pathology. These synthetic images provided an augmented training set for an object detector.
Results: In an observer study, four pediatric radiologists used a five-point Likert scale indicating the likelihood of a real fracture (1 = definitely not a fracture and 5 = definitely a fracture) to grade a set of real fracture-absent, real fracture-present, and synthetic fracture-present images. The real fracture-absent images scored , real fracture-present images , and synthetic fracture-present images . An object detector model (YOLOv5) trained on a mix of 500 real and 500 synthetic radiographs performed with a recall of and an score of . In comparison, when trained on only 500 real radiographs, the recall and score were and , respectively.
Conclusions: Our proposed method generates visually realistic pathology and that provided improved object detector performance for the task of rib fracture detection.
期刊介绍:
JMI covers fundamental and translational research, as well as applications, focused on medical imaging, which continue to yield physical and biomedical advancements in the early detection, diagnostics, and therapy of disease as well as in the understanding of normal. The scope of JMI includes: Imaging physics, Tomographic reconstruction algorithms (such as those in CT and MRI), Image processing and deep learning, Computer-aided diagnosis and quantitative image analysis, Visualization and modeling, Picture archiving and communications systems (PACS), Image perception and observer performance, Technology assessment, Ultrasonic imaging, Image-guided procedures, Digital pathology, Biomedical applications of biomedical imaging. JMI allows for the peer-reviewed communication and archiving of scientific developments, translational and clinical applications, reviews, and recommendations for the field.