{"title":"Effectiveness of Pre-Trained Language Models for the Japanese Winograd Schema Challenge","authors":"Keigo Takahashi, Teruaki Oka, Mamoru Komachi","doi":"10.20965/jaciii.2023.p0511","DOIUrl":null,"url":null,"abstract":"This paper compares Japanese and multilingual language models (LMs) in a Japanese pronoun reference resolution task to determine the factors of LMs that contribute to Japanese pronoun resolution. Specifically, we tackle the Japanese Winograd schema challenge task (WSC task), which is a well-known pronoun reference resolution task. The Japanese WSC task requires inter-sentential analysis, which is more challenging to solve than intra-sentential analysis. A previous study evaluated pre-trained multilingual LMs in terms of training language on the target WSC task, including Japanese. However, the study did not perform pre-trained LM-wise evaluations, focusing on the training language-wise evaluations with a multilingual WSC task. Furthermore, it did not investigate the effectiveness of factors (e.g., model size, learning settings in the pre-training phase, or multilingualism) to improve the performance. In our study, we compare the performance of inter-sentential analysis on the Japanese WSC task for several pre-trained LMs, including multilingual ones. 
Our results confirm that XLM, a pre-trained LM on multiple languages, performs the best among all considered LMs, which we attribute to the amount of data in the pre-training phase.","PeriodicalId":45921,"journal":{"name":"Journal of Advanced Computational Intelligence and Intelligent Informatics","volume":"49 1","pages":"511-521"},"PeriodicalIF":0.7000,"publicationDate":"2023-05-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Advanced Computational Intelligence and Intelligent Informatics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.20965/jaciii.2023.p0511","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
Citations: 0
Abstract
This paper compares Japanese and multilingual language models (LMs) on a Japanese pronoun reference resolution task to identify the factors of LMs that contribute to Japanese pronoun resolution. Specifically, we tackle the Japanese Winograd Schema Challenge (WSC) task, a well-known pronoun reference resolution task. The Japanese WSC task requires inter-sentential analysis, which is more challenging than intra-sentential analysis. A previous study evaluated pre-trained multilingual LMs, in terms of training language, on WSC tasks in several target languages, including Japanese. However, that study compared training languages on a multilingual WSC task rather than comparing individual pre-trained LMs, and it did not investigate which factors (e.g., model size, learning settings in the pre-training phase, or multilingualism) improve performance. In our study, we compare the inter-sentential analysis performance of several pre-trained LMs, including multilingual ones, on the Japanese WSC task. Our results confirm that XLM, an LM pre-trained on multiple languages, performs best among all considered LMs, which we attribute to the amount of data used in its pre-training phase.
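The abstract does not detail the evaluation procedure, but a common way to apply a pre-trained LM to a WSC-style task is candidate scoring: substitute each candidate antecedent for the pronoun and keep the substitution the LM scores as most plausible. The sketch below is a hypothetical illustration of that generic scheme, not the authors' method; the `toy_score` function is a placeholder where a real evaluation would plug in a pre-trained LM's (pseudo-)log-likelihood, e.g. from XLM or a Japanese BERT.

```python
from typing import Callable, List, Sequence, Tuple

def resolve(sentence: str, slot: str, candidates: Sequence[str],
            score: Callable[[str], float]) -> str:
    """Fill the pronoun slot with each candidate; return the best-scoring one."""
    filled = [(c, sentence.replace(slot, c, 1)) for c in candidates]
    return max(filled, key=lambda pair: score(pair[1]))[0]

def wsc_accuracy(examples: List[Tuple[str, str, Sequence[str], str]],
                 score: Callable[[str], float]) -> float:
    """Fraction of examples where the top-scoring candidate is the gold antecedent."""
    correct = sum(
        resolve(sent, slot, cands, score) == gold
        for sent, slot, cands, gold in examples
    )
    return correct / len(examples)

# Placeholder scorer for demonstration only; a real setup would score the
# filled sentence with a pre-trained LM instead of this string check.
def toy_score(sentence: str) -> float:
    return 1.0 if "trophy is too big" in sentence else 0.0

# One English Winograd-style example; "_" marks the pronoun slot.
examples = [
    ("The trophy does not fit in the suitcase because _ is too big.",
     "_", ["the trophy", "the suitcase"], "the trophy"),
]
acc = wsc_accuracy(examples, toy_score)
```

Swapping `toy_score` for different pre-trained LMs, while holding `resolve` and the data fixed, is one way to make the kind of LM-wise comparison the study describes.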