{"title":"A simple but practical method: How to improve the usage of entities in the Chinese question generation","authors":"Haoze Yang, Kunyao Lan, Jiawei You, Liping Shen","doi":"10.1109/IJCNN55064.2022.9891960","DOIUrl":null,"url":null,"abstract":"Answer-aware question generation aims to generate answerable questions from a given paragraph and answer. Most of the current models concatenated entity information into word embeddings to improve the model's learning ability for special entities, but this method is inefficient for utilizing these information and has accumulated errors. In addition, the majority of research focuses on English, with less exploration in languages such as Chinese. Combining the differences between languages, we propose three methods for incorporating entity information in paragraphs and answers into the training corpus. The corpus processed by these methods can enable the model to have the ability to learn entities autonomously. The experimental results show that our methods can improve most mainstream models and enhance the learning ability of the model for special entities.","PeriodicalId":106974,"journal":{"name":"2022 International Joint Conference on Neural Networks (IJCNN)","volume":"54 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 International Joint Conference on Neural Networks (IJCNN)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IJCNN55064.2022.9891960","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Answer-aware question generation aims to generate answerable questions from a given paragraph and answer. Most of the current models concatenated entity information into word embeddings to improve the model's learning ability for special entities, but this method is inefficient for utilizing these information and has accumulated errors. In addition, the majority of research focuses on English, with less exploration in languages such as Chinese. Combining the differences between languages, we propose three methods for incorporating entity information in paragraphs and answers into the training corpus. The corpus processed by these methods can enable the model to have the ability to learn entities autonomously. The experimental results show that our methods can improve most mainstream models and enhance the learning ability of the model for special entities.