Martin Thißen, Thi Ngoc Diep Tran, Ben Joel Schönbein, Ute Trapp, Barbara Esteve Ratsch, Beate Egner, Romana Piat, Elke Hergenröther
{"title":"Enhancing Canine Musculoskeletal Diagnoses: Leveraging Synthetic Image Data for Pre-Training AI-Models on Visual Documentations","authors":"Martin Thißen, Thi Ngoc Diep Tran, Ben Joel Schönbein, Ute Trapp, Barbara Esteve Ratsch, Beate Egner, Romana Piat, Elke Hergenröther","doi":"arxiv-2409.08181","DOIUrl":null,"url":null,"abstract":"The examination of the musculoskeletal system in dogs is a challenging task\nin veterinary practice. In this work, a novel method has been developed that\nenables efficient documentation of a dog's condition through a visual\nrepresentation. However, since the visual documentation is new, there is no\nexisting training data. The objective of this work is therefore to mitigate the\nimpact of data scarcity in order to develop an AI-based diagnostic support\nsystem. To this end, the potential of synthetic data that mimics realistic\nvisual documentations of diseases for pre-training AI models is investigated.\nWe propose a method for generating synthetic image data that mimics realistic\nvisual documentations. Initially, a basic dataset containing three distinct\nclasses is generated, followed by the creation of a more sophisticated dataset\ncontaining 36 different classes. Both datasets are used for the pre-training of\nan AI model. Subsequently, an evaluation dataset is created, consisting of 250\nmanually created visual documentations for five different diseases. This\ndataset, along with a subset containing 25 examples. The obtained results on\nthe evaluation dataset containing 25 examples demonstrate a significant\nenhancement of approximately 10% in diagnosis accuracy when utilizing generated\nsynthetic images that mimic real-world visual documentations. However, these\nresults do not hold true for the larger evaluation dataset containing 250\nexamples, indicating that the advantages of using synthetic data for\npre-training an AI model emerge primarily when dealing with few examples of\nvisual documentations for a given disease. Overall, this work provides valuable\ninsights into mitigating the limitations imposed by limited training data\nthrough the strategic use of generated synthetic data, presenting an approach\napplicable beyond the canine musculoskeletal assessment domain.","PeriodicalId":501130,"journal":{"name":"arXiv - CS - Computer Vision and Pattern Recognition","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2024-09-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Computer Vision and Pattern Recognition","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2409.08181","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
The examination of the musculoskeletal system in dogs is a challenging task
in veterinary practice. In this work, a novel method has been developed that
enables efficient documentation of a dog's condition through a visual
representation. However, since the visual documentation is new, there is no
existing training data. The objective of this work is therefore to mitigate the
impact of data scarcity in order to develop an AI-based diagnostic support
system. To this end, the potential of synthetic data that mimics realistic
visual documentations of diseases for pre-training AI models is investigated.
We propose a method for generating synthetic image data that mimics realistic
visual documentations. Initially, a basic dataset containing three distinct
classes is generated, followed by the creation of a more sophisticated dataset
containing 36 different classes. Both datasets are used for the pre-training of
an AI model. Subsequently, an evaluation dataset is created, consisting of 250
manually created visual documentations for five different diseases. This
dataset, along with a subset containing 25 examples. The obtained results on
the evaluation dataset containing 25 examples demonstrate a significant
enhancement of approximately 10% in diagnosis accuracy when utilizing generated
synthetic images that mimic real-world visual documentations. However, these
results do not hold true for the larger evaluation dataset containing 250
examples, indicating that the advantages of using synthetic data for
pre-training an AI model emerge primarily when dealing with few examples of
visual documentations for a given disease. Overall, this work provides valuable
insights into mitigating the limitations imposed by limited training data
through the strategic use of generated synthetic data, presenting an approach
applicable beyond the canine musculoskeletal assessment domain.