加强犬类肌肉骨骼诊断:利用合成图像数据预训练视觉文档上的人工智能模型

Martin Thißen, Thi Ngoc Diep Tran, Ben Joel Schönbein, Ute Trapp, Barbara Esteve Ratsch, Beate Egner, Romana Piat, Elke Hergenröther
{"title":"加强犬类肌肉骨骼诊断:利用合成图像数据预训练视觉文档上的人工智能模型","authors":"Martin Thißen, Thi Ngoc Diep Tran, Ben Joel Schönbein, Ute Trapp, Barbara Esteve Ratsch, Beate Egner, Romana Piat, Elke Hergenröther","doi":"arxiv-2409.08181","DOIUrl":null,"url":null,"abstract":"The examination of the musculoskeletal system in dogs is a challenging task\nin veterinary practice. In this work, a novel method has been developed that\nenables efficient documentation of a dog's condition through a visual\nrepresentation. However, since the visual documentation is new, there is no\nexisting training data. The objective of this work is therefore to mitigate the\nimpact of data scarcity in order to develop an AI-based diagnostic support\nsystem. To this end, the potential of synthetic data that mimics realistic\nvisual documentations of diseases for pre-training AI models is investigated.\nWe propose a method for generating synthetic image data that mimics realistic\nvisual documentations. Initially, a basic dataset containing three distinct\nclasses is generated, followed by the creation of a more sophisticated dataset\ncontaining 36 different classes. Both datasets are used for the pre-training of\nan AI model. Subsequently, an evaluation dataset is created, consisting of 250\nmanually created visual documentations for five different diseases. This\ndataset, along with a subset containing 25 examples. The obtained results on\nthe evaluation dataset containing 25 examples demonstrate a significant\nenhancement of approximately 10% in diagnosis accuracy when utilizing generated\nsynthetic images that mimic real-world visual documentations. However, these\nresults do not hold true for the larger evaluation dataset containing 250\nexamples, indicating that the advantages of using synthetic data for\npre-training an AI model emerge primarily when dealing with few examples of\nvisual documentations for a given disease. Overall, this work provides valuable\ninsights into mitigating the limitations imposed by limited training data\nthrough the strategic use of generated synthetic data, presenting an approach\napplicable beyond the canine musculoskeletal assessment domain.","PeriodicalId":501130,"journal":{"name":"arXiv - CS - Computer Vision and Pattern Recognition","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2024-09-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Enhancing Canine Musculoskeletal Diagnoses: Leveraging Synthetic Image Data for Pre-Training AI-Models on Visual Documentations\",\"authors\":\"Martin Thißen, Thi Ngoc Diep Tran, Ben Joel Schönbein, Ute Trapp, Barbara Esteve Ratsch, Beate Egner, Romana Piat, Elke Hergenröther\",\"doi\":\"arxiv-2409.08181\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The examination of the musculoskeletal system in dogs is a challenging task\\nin veterinary practice. In this work, a novel method has been developed that\\nenables efficient documentation of a dog's condition through a visual\\nrepresentation. However, since the visual documentation is new, there is no\\nexisting training data. The objective of this work is therefore to mitigate the\\nimpact of data scarcity in order to develop an AI-based diagnostic support\\nsystem. To this end, the potential of synthetic data that mimics realistic\\nvisual documentations of diseases for pre-training AI models is investigated.\\nWe propose a method for generating synthetic image data that mimics realistic\\nvisual documentations. Initially, a basic dataset containing three distinct\\nclasses is generated, followed by the creation of a more sophisticated dataset\\ncontaining 36 different classes. Both datasets are used for the pre-training of\\nan AI model. Subsequently, an evaluation dataset is created, consisting of 250\\nmanually created visual documentations for five different diseases. This\\ndataset, along with a subset containing 25 examples. The obtained results on\\nthe evaluation dataset containing 25 examples demonstrate a significant\\nenhancement of approximately 10% in diagnosis accuracy when utilizing generated\\nsynthetic images that mimic real-world visual documentations. However, these\\nresults do not hold true for the larger evaluation dataset containing 250\\nexamples, indicating that the advantages of using synthetic data for\\npre-training an AI model emerge primarily when dealing with few examples of\\nvisual documentations for a given disease. Overall, this work provides valuable\\ninsights into mitigating the limitations imposed by limited training data\\nthrough the strategic use of generated synthetic data, presenting an approach\\napplicable beyond the canine musculoskeletal assessment domain.\",\"PeriodicalId\":501130,\"journal\":{\"name\":\"arXiv - CS - Computer Vision and Pattern Recognition\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-09-12\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"arXiv - CS - Computer Vision and Pattern Recognition\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/arxiv-2409.08181\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Computer Vision and Pattern Recognition","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2409.08181","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

在兽医实践中,对狗的肌肉骨骼系统进行检查是一项具有挑战性的任务。在这项工作中,我们开发了一种新方法,可通过视觉呈现有效记录狗的状况。然而,由于视觉记录是一项新技术,因此没有现成的训练数据。因此,这项工作的目标是减轻数据匮乏的影响,以开发基于人工智能的诊断支持系统。为此,我们研究了模拟现实疾病视觉文献的合成数据在预训练人工智能模型方面的潜力。首先,我们生成了一个包含三个不同类别的基本数据集,然后又创建了一个包含 36 个不同类别的更复杂数据集。这两个数据集都用于人工智能模型的预训练。随后,我们创建了一个评估数据集,由 250 个手动创建的五种不同疾病的视觉文档组成。该数据集以及包含 25 个示例的子集。在包含 25 个示例的评估数据集上获得的结果表明,利用模拟真实世界视觉文档生成的合成图像,诊断准确率显著提高了约 10%。然而,这些结果在包含 250 个示例的更大评估数据集上并不成立,这表明使用合成数据预训练人工智能模型的优势主要体现在处理特定疾病的少量视觉文档示例时。总之,这项工作为通过战略性地使用生成的合成数据来减轻有限的训练数据所带来的限制提供了宝贵的见解,提出了一种适用于犬肌肉骨骼评估领域以外的方法。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Enhancing Canine Musculoskeletal Diagnoses: Leveraging Synthetic Image Data for Pre-Training AI-Models on Visual Documentations
The examination of the musculoskeletal system in dogs is a challenging task in veterinary practice. In this work, a novel method has been developed that enables efficient documentation of a dog's condition through a visual representation. However, since the visual documentation is new, there is no existing training data. The objective of this work is therefore to mitigate the impact of data scarcity in order to develop an AI-based diagnostic support system. To this end, the potential of synthetic data that mimics realistic visual documentations of diseases for pre-training AI models is investigated. We propose a method for generating synthetic image data that mimics realistic visual documentations. Initially, a basic dataset containing three distinct classes is generated, followed by the creation of a more sophisticated dataset containing 36 different classes. Both datasets are used for the pre-training of an AI model. Subsequently, an evaluation dataset is created, consisting of 250 manually created visual documentations for five different diseases. This dataset, along with a subset containing 25 examples. The obtained results on the evaluation dataset containing 25 examples demonstrate a significant enhancement of approximately 10% in diagnosis accuracy when utilizing generated synthetic images that mimic real-world visual documentations. However, these results do not hold true for the larger evaluation dataset containing 250 examples, indicating that the advantages of using synthetic data for pre-training an AI model emerge primarily when dealing with few examples of visual documentations for a given disease. Overall, this work provides valuable insights into mitigating the limitations imposed by limited training data through the strategic use of generated synthetic data, presenting an approach applicable beyond the canine musculoskeletal assessment domain.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Massively Multi-Person 3D Human Motion Forecasting with Scene Context Qwen2-VL: Enhancing Vision-Language Model's Perception of the World at Any Resolution Precise Forecasting of Sky Images Using Spatial Warping JEAN: Joint Expression and Audio-guided NeRF-based Talking Face Generation Applications of Knowledge Distillation in Remote Sensing: A Survey
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1