{"title":"VLSP 2021 - VieCap4H挑战:越南医疗保健领域的自动图像标题生成","authors":"P. Phan","doi":"10.25073/2588-1086/vnucsce.364","DOIUrl":null,"url":null,"abstract":"Machine reading comprehension (MRC) is a challenging Natural Language Processing (NLP) research fieldand wide real-world applications. The great progress of this field in recents is mainly due to the emergence offew datasets for machine reading comprehension tasks with large sizes and deep learning. For the Vietnameselanguage, some datasets, such as UIT-ViQuAD [1] and UIT-ViNewsQA [2], most recently, UIT-ViQuAD 2.0 [3] - adataset of the competitive VLSP 2021-MRC Shared Task 1 . MRC systems must not only answer questions whennecessary but also tactfully abstain from answering when no answer is available according to the given passage.In this paper, we proposed two types of joint models for answerability prediction and pure-MRC prediction with/without a dependency mechanism to learn the correlation between a start position and end position in pure-MRCoutput prediction. Besides, we use ensemble models and a verification strategy by voting the best answer from thetop K answers of different models. Our proposed approach is evaluated on the benchmark VLSP 2021-MRC SharedTask challenge dataset UIT-ViQuAD 2.0 [3] shows that our approach is significantly better than the baseline.","PeriodicalId":416488,"journal":{"name":"VNU Journal of Science: Computer Science and Communication Engineering","volume":"47 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-12-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"VLSP 2021 - VieCap4H Challenge: Automatic Image Caption Generation for Healthcare Domain in Vietnamese\",\"authors\":\"P. Phan\",\"doi\":\"10.25073/2588-1086/vnucsce.364\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Machine reading comprehension (MRC) is a challenging Natural Language Processing (NLP) research fieldand wide real-world applications. The great progress of this field in recents is mainly due to the emergence offew datasets for machine reading comprehension tasks with large sizes and deep learning. For the Vietnameselanguage, some datasets, such as UIT-ViQuAD [1] and UIT-ViNewsQA [2], most recently, UIT-ViQuAD 2.0 [3] - adataset of the competitive VLSP 2021-MRC Shared Task 1 . MRC systems must not only answer questions whennecessary but also tactfully abstain from answering when no answer is available according to the given passage.In this paper, we proposed two types of joint models for answerability prediction and pure-MRC prediction with/without a dependency mechanism to learn the correlation between a start position and end position in pure-MRCoutput prediction. Besides, we use ensemble models and a verification strategy by voting the best answer from thetop K answers of different models. Our proposed approach is evaluated on the benchmark VLSP 2021-MRC SharedTask challenge dataset UIT-ViQuAD 2.0 [3] shows that our approach is significantly better than the baseline.\",\"PeriodicalId\":416488,\"journal\":{\"name\":\"VNU Journal of Science: Computer Science and Communication Engineering\",\"volume\":\"47 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-12-16\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"VNU Journal of Science: Computer Science and Communication Engineering\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.25073/2588-1086/vnucsce.364\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"VNU Journal of Science: Computer Science and Communication Engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.25073/2588-1086/vnucsce.364","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
VLSP 2021 - VieCap4H Challenge: Automatic Image Caption Generation for Healthcare Domain in Vietnamese
Machine reading comprehension (MRC) is a challenging Natural Language Processing (NLP) research fieldand wide real-world applications. The great progress of this field in recents is mainly due to the emergence offew datasets for machine reading comprehension tasks with large sizes and deep learning. For the Vietnameselanguage, some datasets, such as UIT-ViQuAD [1] and UIT-ViNewsQA [2], most recently, UIT-ViQuAD 2.0 [3] - adataset of the competitive VLSP 2021-MRC Shared Task 1 . MRC systems must not only answer questions whennecessary but also tactfully abstain from answering when no answer is available according to the given passage.In this paper, we proposed two types of joint models for answerability prediction and pure-MRC prediction with/without a dependency mechanism to learn the correlation between a start position and end position in pure-MRCoutput prediction. Besides, we use ensemble models and a verification strategy by voting the best answer from thetop K answers of different models. Our proposed approach is evaluated on the benchmark VLSP 2021-MRC SharedTask challenge dataset UIT-ViQuAD 2.0 [3] shows that our approach is significantly better than the baseline.