Ao Chang, Xing Tao, Yuhao Huang, Xin Yang, Jiajun Zeng, Xinrui Zhou, Ruobing Huang, Dong Ni
{"title":"P<sup>2</sup>ED: A four-quadrant framework for progressive prompt enhancement in 3D interactive medical imaging segmentation.","authors":"Ao Chang, Xing Tao, Yuhao Huang, Xin Yang, Jiajun Zeng, Xinrui Zhou, Ruobing Huang, Dong Ni","doi":"10.1016/j.neunet.2024.106973","DOIUrl":null,"url":null,"abstract":"<p><p>Interactive segmentation allows active user participation to enhance output quality and resolve ambiguities. This may be especially indispensable to medical image segmentation to address complex anatomy and customization to varying user requirements. Existing approaches often encounter issues such as information dilution, limited adaptability to diverse user interactions, and insufficient response. To address these challenges, we present a novel 3D interactive framework P<sup>2</sup>ED that divides the task into four quadrants. It is equipped with a multi-granular prompt encrypted to extract prompt features from various hierarchical levels, along with a progressive hierarchical prompt decrypter to adaptively heighten the attention to the scarce prompt features along three spatial axes. Finally, it is appended by a calibration module to further align the prediction with user intentions. Extensive experiments demonstrate that the proposed P<sup>2</sup>ED achieves accurate results with fewer user interactions compared to state-of-the-art methods and is effective in promoting the upper limit of segmentation performance. The code will be released in https://github.com/chuyhu/P2ED.</p>","PeriodicalId":49763,"journal":{"name":"Neural Networks","volume":"183 ","pages":"106973"},"PeriodicalIF":6.0000,"publicationDate":"2025-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Neural Networks","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1016/j.neunet.2024.106973","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/12/3 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Interactive segmentation allows active user participation to enhance output quality and resolve ambiguities. This may be especially indispensable to medical image segmentation to address complex anatomy and customization to varying user requirements. Existing approaches often encounter issues such as information dilution, limited adaptability to diverse user interactions, and insufficient response. To address these challenges, we present a novel 3D interactive framework P2ED that divides the task into four quadrants. It is equipped with a multi-granular prompt encrypted to extract prompt features from various hierarchical levels, along with a progressive hierarchical prompt decrypter to adaptively heighten the attention to the scarce prompt features along three spatial axes. Finally, it is appended by a calibration module to further align the prediction with user intentions. Extensive experiments demonstrate that the proposed P2ED achieves accurate results with fewer user interactions compared to state-of-the-art methods and is effective in promoting the upper limit of segmentation performance. The code will be released in https://github.com/chuyhu/P2ED.
期刊介绍:
Neural Networks is a platform that aims to foster an international community of scholars and practitioners interested in neural networks, deep learning, and other approaches to artificial intelligence and machine learning. Our journal invites submissions covering various aspects of neural networks research, from computational neuroscience and cognitive modeling to mathematical analyses and engineering applications. By providing a forum for interdisciplinary discussions between biology and technology, we aim to encourage the development of biologically-inspired artificial intelligence.