{"title":"Imbalanced Question Classification Using Generative Prototypes","authors":"Ning Wu, Shaochen Sun, Yunfeng Zou, Yang Yu","doi":"10.1109/ICRAE50850.2020.9310877","DOIUrl":null,"url":null,"abstract":"A practical question-answering (QA) system typically categorizes a new question into the frequently asked questions (FAQs) and returns the corresponding answer. Having imbalanced FAQs data is actually prevalent in general. This paper proposes a meta-learning method for imbalanced question classification. The basic idea is to generate virtual training data for zero-shot questions and then construct question prototypes for training a question classifier, thereby relieving the problem of data imbalance and improving performance of question classifier. Experiments show that the proposed method improves the overall classification performance both for English and Chinese QA tasks. Especially, the classification performance of zero annotated questions increased significantly, and the generative prototypes has minute impact on the performance of large annotated question test set.","PeriodicalId":296832,"journal":{"name":"2020 5th International Conference on Robotics and Automation Engineering (ICRAE)","volume":"153 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-11-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 5th International Conference on Robotics and Automation Engineering (ICRAE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICRAE50850.2020.9310877","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
A practical question-answering (QA) system typically categorizes a new question into the frequently asked questions (FAQs) and returns the corresponding answer. Having imbalanced FAQs data is actually prevalent in general. This paper proposes a meta-learning method for imbalanced question classification. The basic idea is to generate virtual training data for zero-shot questions and then construct question prototypes for training a question classifier, thereby relieving the problem of data imbalance and improving performance of question classifier. Experiments show that the proposed method improves the overall classification performance both for English and Chinese QA tasks. Especially, the classification performance of zero annotated questions increased significantly, and the generative prototypes has minute impact on the performance of large annotated question test set.