Yulan Su, Yu Hong, Hongyu Zhu, Minhan Xu, Yifan Fan, Min Zhang
{"title":"Binary-perspective Asymmetrical Twin Gain: a Novel Evaluation Method for Question Generation","authors":"Yulan Su, Yu Hong, Hongyu Zhu, Minhan Xu, Yifan Fan, Min Zhang","doi":"10.1109/IJCNN55064.2022.9892106","DOIUrl":null,"url":null,"abstract":"We propose a novel evaluation method for Question Generation (QG) task. It is designed to verify the quality of the generated questions in terms of different references, including not only the manually-written questions (i.e., ground truth) but also their variants. Back translation is utilized to obtain the variants, and accordingly, they generally appear as paraphrases of the ground-truth examples. In particular, an Asymmetrical Twin Gain (ATG) is proposed for binary-perspective evaluation using the existing metrics, such as BLEU and ROUGE-L, respectively. It enables both the metrics to be observed from two perspectives, including the consistency between QG results and ground-truth examples, as well as that of variants. The experiments on the publicly-available benchmark SQuAD demonstrate the reliability of ATG. More importantly, ATG is proven effective for indicating the stable QG performance. It is noteworthy that the proposed binary-perspective evaluation is explored for assisting the conventional evaluation methods, instead of replacing them. The contribute can be identified as the additional insight into the robustness of QG when some slightly-different references (e.g., paraphrases) are offered for evaluation. All the models and source codes in the experiments will be made publicly available to support reproducible research.","PeriodicalId":106974,"journal":{"name":"2022 International Joint Conference on Neural Networks (IJCNN)","volume":"8 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 International Joint Conference on Neural Networks (IJCNN)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IJCNN55064.2022.9892106","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
We propose a novel evaluation method for Question Generation (QG) task. It is designed to verify the quality of the generated questions in terms of different references, including not only the manually-written questions (i.e., ground truth) but also their variants. Back translation is utilized to obtain the variants, and accordingly, they generally appear as paraphrases of the ground-truth examples. In particular, an Asymmetrical Twin Gain (ATG) is proposed for binary-perspective evaluation using the existing metrics, such as BLEU and ROUGE-L, respectively. It enables both the metrics to be observed from two perspectives, including the consistency between QG results and ground-truth examples, as well as that of variants. The experiments on the publicly-available benchmark SQuAD demonstrate the reliability of ATG. More importantly, ATG is proven effective for indicating the stable QG performance. It is noteworthy that the proposed binary-perspective evaluation is explored for assisting the conventional evaluation methods, instead of replacing them. The contribute can be identified as the additional insight into the robustness of QG when some slightly-different references (e.g., paraphrases) are offered for evaluation. All the models and source codes in the experiments will be made publicly available to support reproducible research.