Tao Cheng, Hua Cheng, Yiquan Fang, Yufei Liu, Caiting Gao
{"title":"基于概念原型对比学习的少射文本分类","authors":"Tao Cheng, Hua Cheng, Yiquan Fang, Yufei Liu, Caiting Gao","doi":"10.3233/jifs-231570","DOIUrl":null,"url":null,"abstract":"As prototype-based Few-Shot Learning methods, Prototypical Network generates prototypes for each class in a low-resource state and classify by a metric module. Therefore, the quality of prototypes matters but they are inaccurate from the few support instances, and the domain-specific information of training data are harmful to the generalizability of prototypes. We propose a Conceptual Prototype (CP), which contains both rich instance and concept features. The numerous query data can inspire the few support instances. An interactive network is designed to leverage the interrelation between support set and query-detached set to acquire a rich Instance Prototype which is typical on the whole data. Besides, class labels are introduced to prototype by prompt engineering, which makes it more conceptual. The label-only concept makes prototype immune to domain-specific information in training phase to improve its generalizability. Based on CP, Conceptual Prototypical Contrastive Learning (CPCL) is proposed where PCL brings instances closer to its corresponding prototype and pushes away from other prototypes. “2-way 5-shot” experiments show that CPCL achieves 92.41% accuracy on ARSC dataset, 2.30% higher than other prototype-based models. Meanwhile, the 0-shot performance of CPCL is comparable to Induction Network in the 5-shot way, indicating that our model is adequate for 0-shot tasks.","PeriodicalId":54795,"journal":{"name":"Journal of Intelligent & Fuzzy Systems","volume":"2 1","pages":"0"},"PeriodicalIF":1.7000,"publicationDate":"2023-10-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"CPCL: Conceptual prototypical contrastive learning for Few-Shot text classification\",\"authors\":\"Tao Cheng, Hua Cheng, Yiquan Fang, Yufei Liu, Caiting Gao\",\"doi\":\"10.3233/jifs-231570\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"As prototype-based Few-Shot Learning methods, Prototypical Network generates prototypes for each class in a low-resource state and classify by a metric module. Therefore, the quality of prototypes matters but they are inaccurate from the few support instances, and the domain-specific information of training data are harmful to the generalizability of prototypes. We propose a Conceptual Prototype (CP), which contains both rich instance and concept features. The numerous query data can inspire the few support instances. An interactive network is designed to leverage the interrelation between support set and query-detached set to acquire a rich Instance Prototype which is typical on the whole data. Besides, class labels are introduced to prototype by prompt engineering, which makes it more conceptual. The label-only concept makes prototype immune to domain-specific information in training phase to improve its generalizability. Based on CP, Conceptual Prototypical Contrastive Learning (CPCL) is proposed where PCL brings instances closer to its corresponding prototype and pushes away from other prototypes. “2-way 5-shot” experiments show that CPCL achieves 92.41% accuracy on ARSC dataset, 2.30% higher than other prototype-based models. Meanwhile, the 0-shot performance of CPCL is comparable to Induction Network in the 5-shot way, indicating that our model is adequate for 0-shot tasks.\",\"PeriodicalId\":54795,\"journal\":{\"name\":\"Journal of Intelligent & Fuzzy Systems\",\"volume\":\"2 1\",\"pages\":\"0\"},\"PeriodicalIF\":1.7000,\"publicationDate\":\"2023-10-20\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Intelligent & Fuzzy Systems\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.3233/jifs-231570\",\"RegionNum\":4,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Intelligent & Fuzzy Systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3233/jifs-231570","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
CPCL: Conceptual prototypical contrastive learning for Few-Shot text classification
As prototype-based Few-Shot Learning methods, Prototypical Network generates prototypes for each class in a low-resource state and classify by a metric module. Therefore, the quality of prototypes matters but they are inaccurate from the few support instances, and the domain-specific information of training data are harmful to the generalizability of prototypes. We propose a Conceptual Prototype (CP), which contains both rich instance and concept features. The numerous query data can inspire the few support instances. An interactive network is designed to leverage the interrelation between support set and query-detached set to acquire a rich Instance Prototype which is typical on the whole data. Besides, class labels are introduced to prototype by prompt engineering, which makes it more conceptual. The label-only concept makes prototype immune to domain-specific information in training phase to improve its generalizability. Based on CP, Conceptual Prototypical Contrastive Learning (CPCL) is proposed where PCL brings instances closer to its corresponding prototype and pushes away from other prototypes. “2-way 5-shot” experiments show that CPCL achieves 92.41% accuracy on ARSC dataset, 2.30% higher than other prototype-based models. Meanwhile, the 0-shot performance of CPCL is comparable to Induction Network in the 5-shot way, indicating that our model is adequate for 0-shot tasks.
期刊介绍:
The purpose of the Journal of Intelligent & Fuzzy Systems: Applications in Engineering and Technology is to foster advancements of knowledge and help disseminate results concerning recent applications and case studies in the areas of fuzzy logic, intelligent systems, and web-based applications among working professionals and professionals in education and research, covering a broad cross-section of technical disciplines.