Semantic segmentation dataset authoring with simplified labels.

IF 2.3 3区 医学 Q3 ENGINEERING, BIOMEDICAL International Journal of Computer Assisted Radiology and Surgery Pub Date : 2025-05-01 Epub Date: 2025-04-05 DOI:10.1007/s11548-024-03314-9
Leo Uramoto, Yuichiro Hayashi, Masahiro Oda, Takayuki Kitasaka, Kensaku Mori
{"title":"Semantic segmentation dataset authoring with simplified labels.","authors":"Leo Uramoto, Yuichiro Hayashi, Masahiro Oda, Takayuki Kitasaka, Kensaku Mori","doi":"10.1007/s11548-024-03314-9","DOIUrl":null,"url":null,"abstract":"<p><strong>Purpose: </strong>Semantic segmentation of laparoscopic images is a key problem in surgical scene understanding. Creating ground truth labels for semantic segmentation tasks is time consuming, and in the medical field a need for medical training of annotators adds further complications, leading to reliance on a small pool of experts. Previous research has focused on reducing the time to author datasets, by using spatially weak labels, pseudolabels, and synthetic data. In this paper, we address the difficulties caused by the need for medically trained annotators, hoping to enable non-medical annotators to participate in medical annotation tasks, to ease the creation of large datasets.</p><p><strong>Methods: </strong>We propose simplified labels, labels that are semantically weak. Our labels allow non-medical annotators to participate in medical dataset authoring, by lowering the need for medical expertise. We simulate authoring processes with mixtures of medical and non-medical annotators and measure the impact adding non-medical annotators has on accuracy. We also show that simplified labels offer a simple formulation for multi-dataset training.</p><p><strong>Results: </strong>We show that simplified labels are a viable approach to dataset authoring. Including non-medical annotators in the authoring process is beneficial, but medically trained annotators are worth multiple non-medical annotators, with maximal Dice score increases of 9.3% for 1 medically trained annotator and 6.9% for 3 non-medical annotators. We also show that the labels offer a simple formulation for multi-dataset training, even with no overlapping classes. We find that converting the labels of a secondary incompatible dataset into simplified labels and jointly training on both datasets improves performance.</p><p><strong>Conclusion: </strong>Simplified labels offer a framework that can be applied both to dataset authoring and to multi-dataset training. Using the proposed method, non-medical annotators can participate in semantic segmentation dataset authoring. Labels of incompatible datasets can be converted into simplified datasets, enabling multi-dataset training.</p>","PeriodicalId":51251,"journal":{"name":"International Journal of Computer Assisted Radiology and Surgery","volume":" ","pages":"1003-1013"},"PeriodicalIF":2.3000,"publicationDate":"2025-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12055892/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Computer Assisted Radiology and Surgery","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.1007/s11548-024-03314-9","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/4/5 0:00:00","PubModel":"Epub","JCR":"Q3","JCRName":"ENGINEERING, BIOMEDICAL","Score":null,"Total":0}
引用次数: 0

Abstract

Purpose: Semantic segmentation of laparoscopic images is a key problem in surgical scene understanding. Creating ground truth labels for semantic segmentation tasks is time consuming, and in the medical field a need for medical training of annotators adds further complications, leading to reliance on a small pool of experts. Previous research has focused on reducing the time to author datasets, by using spatially weak labels, pseudolabels, and synthetic data. In this paper, we address the difficulties caused by the need for medically trained annotators, hoping to enable non-medical annotators to participate in medical annotation tasks, to ease the creation of large datasets.

Methods: We propose simplified labels, labels that are semantically weak. Our labels allow non-medical annotators to participate in medical dataset authoring, by lowering the need for medical expertise. We simulate authoring processes with mixtures of medical and non-medical annotators and measure the impact adding non-medical annotators has on accuracy. We also show that simplified labels offer a simple formulation for multi-dataset training.

Results: We show that simplified labels are a viable approach to dataset authoring. Including non-medical annotators in the authoring process is beneficial, but medically trained annotators are worth multiple non-medical annotators, with maximal Dice score increases of 9.3% for 1 medically trained annotator and 6.9% for 3 non-medical annotators. We also show that the labels offer a simple formulation for multi-dataset training, even with no overlapping classes. We find that converting the labels of a secondary incompatible dataset into simplified labels and jointly training on both datasets improves performance.

Conclusion: Simplified labels offer a framework that can be applied both to dataset authoring and to multi-dataset training. Using the proposed method, non-medical annotators can participate in semantic segmentation dataset authoring. Labels of incompatible datasets can be converted into simplified datasets, enabling multi-dataset training.

Abstract Image

Abstract Image

Abstract Image

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
使用简化的标签创建语义分割数据集。
目的:腹腔镜图像的语义分割是手术场景理解的关键问题。为语义分割任务创建基础真值标签非常耗时,而且在医学领域,需要对注释者进行医学培训,这进一步增加了复杂性,导致对一小部分专家的依赖。以前的研究主要集中在通过使用空间弱标签、伪标签和合成数据来减少编写数据集的时间。在本文中,我们解决了由于需要受过医学训练的注释者而带来的困难,希望使非医学注释者也能参与到医学注释任务中来,从而简化大型数据集的创建。方法:我们提出简化标签,语义弱的标签。通过降低对医学专业知识的需求,我们的标签允许非医学注释者参与医学数据集的创作。我们模拟了混合使用医学和非医学注释器的创作过程,并测量了添加非医学注释器对准确性的影响。我们还表明,简化标签为多数据集训练提供了一个简单的公式。结果:我们表明简化标签是一种可行的数据集创作方法。在创作过程中加入非医学注释者是有益的,但医学训练的注释者值得多个非医学注释者,1个医学训练的注释者最大Dice分数提高9.3%,3个非医学注释者最大Dice分数提高6.9%。我们还表明,标签为多数据集训练提供了一个简单的公式,即使没有重叠的类。我们发现将次要不兼容数据集的标签转换为简化的标签并在两个数据集上联合训练可以提高性能。结论:简化标签提供了一个框架,既可以应用于数据集创作,也可以应用于多数据集训练。利用该方法,非医学标注者也可以参与到语义分割数据集的创作中。不兼容数据集的标签可以转换为简化的数据集,实现多数据集训练。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
International Journal of Computer Assisted Radiology and Surgery
International Journal of Computer Assisted Radiology and Surgery ENGINEERING, BIOMEDICAL-RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING
CiteScore
5.90
自引率
6.70%
发文量
243
审稿时长
6-12 weeks
期刊介绍: The International Journal for Computer Assisted Radiology and Surgery (IJCARS) is a peer-reviewed journal that provides a platform for closing the gap between medical and technical disciplines, and encourages interdisciplinary research and development activities in an international environment.
期刊最新文献
DyEndoVO: scene dynamics-aware pose estimation of endoscope in minimally invasive surgery. Using deep vision-language models improves multi-task performance in assistance applications for endoscopic ENT surgery. Quantifying the anatomical variability of the proximal femur. Colormap augmentation: a novel method for cross-modality domain generalization. Multimodal framework for swallow detection in video-fluoroscopic swallow studies using manometric pressure distributions from dysphagic patients.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1