Pengfang Li , Fang Liu , Licheng Jiao , Shuo Li , Xu Liu , Puhua Chen , Lingling Li , Zehua Hao
{"title":"LLM Knowledge-Driven Target Prototype Learning for Few-Shot Segmentation","authors":"Pengfang Li , Fang Liu , Licheng Jiao , Shuo Li , Xu Liu , Puhua Chen , Lingling Li , Zehua Hao","doi":"10.1016/j.knosys.2025.113149","DOIUrl":null,"url":null,"abstract":"<div><div>Few-Shot Segmentation (FSS) aims to segment new class objects in a query image with few support images. The prototype-based FSS methods first model a target prototype and then match it with the query feature for segmentation. Recent research has focused on mining visual features to model the prototype. However, modeling the target prototype using visual features alone is not sufficient to represent target objects due to appearance differences between targets in support and query images. To address this limitation, based on the generalizable knowledge implied in the Large Language Model (LLM), we propose an LLM Knowledge-Driven Target Prototype Learning method (KD-TPL) to learn a robust prototype for the target object in the query image. Specifically, a knowledge-driven semantic prior generator is constructed to mine semantic priors in the query image applied to LLM knowledge. Based on the modeled semantic priors, a knowledge-driven hybrid prototype learner is designed to learn a representative target prototype. A knowledge-driven query feature enhancer is developed to enhance the semantics of the query feature. Finally, competitive comparison and ablation experimental results on COCO-<span><math><mrow><mn>2</mn><msup><mrow><mn>0</mn></mrow><mrow><mi>i</mi></mrow></msup></mrow></math></span> and PASCAL-<span><math><msup><mrow><mn>5</mn></mrow><mrow><mi>i</mi></mrow></msup></math></span> demonstrate the effectiveness of our method.</div></div>","PeriodicalId":49939,"journal":{"name":"Knowledge-Based Systems","volume":"312 ","pages":"Article 113149"},"PeriodicalIF":7.2000,"publicationDate":"2025-02-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Knowledge-Based Systems","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0950705125001960","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Few-Shot Segmentation (FSS) aims to segment new class objects in a query image with few support images. The prototype-based FSS methods first model a target prototype and then match it with the query feature for segmentation. Recent research has focused on mining visual features to model the prototype. However, modeling the target prototype using visual features alone is not sufficient to represent target objects due to appearance differences between targets in support and query images. To address this limitation, based on the generalizable knowledge implied in the Large Language Model (LLM), we propose an LLM Knowledge-Driven Target Prototype Learning method (KD-TPL) to learn a robust prototype for the target object in the query image. Specifically, a knowledge-driven semantic prior generator is constructed to mine semantic priors in the query image applied to LLM knowledge. Based on the modeled semantic priors, a knowledge-driven hybrid prototype learner is designed to learn a representative target prototype. A knowledge-driven query feature enhancer is developed to enhance the semantics of the query feature. Finally, competitive comparison and ablation experimental results on COCO- and PASCAL- demonstrate the effectiveness of our method.
期刊介绍:
Knowledge-Based Systems, an international and interdisciplinary journal in artificial intelligence, publishes original, innovative, and creative research results in the field. It focuses on knowledge-based and other artificial intelligence techniques-based systems. The journal aims to support human prediction and decision-making through data science and computation techniques, provide a balanced coverage of theory and practical study, and encourage the development and implementation of knowledge-based intelligence models, methods, systems, and software tools. Applications in business, government, education, engineering, and healthcare are emphasized.