A Novel Multi-Task Learning Approach for Context-Sensitive Compound Type Identification in Sanskrit

Proceedings of COLING. International Conference on Computational Linguistics Pub Date : 2022-08-22 DOI:10.48550/arXiv.2208.10310

Jivnesh Sandhan, Ashish Gupta, Hrishikesh Terdalkar, Tushar Sandhan, S. Samanta, L. Behera, Pawan Goyal

{"title":"A Novel Multi-Task Learning Approach for Context-Sensitive Compound Type Identification in Sanskrit","authors":"Jivnesh Sandhan, Ashish Gupta, Hrishikesh Terdalkar, Tushar Sandhan, S. Samanta, L. Behera, Pawan Goyal","doi":"10.48550/arXiv.2208.10310","DOIUrl":null,"url":null,"abstract":"The phenomenon of compounding is ubiquitous in Sanskrit. It serves for achieving brevity in expressing thoughts, while simultaneously enriching the lexical and structural formation of the language. In this work, we focus on the Sanskrit Compound Type Identification (SaCTI) task, where we consider the problem of identifying semantic relations between the components of a compound word. Earlier approaches solely rely on the lexical information obtained from the components and ignore the most crucial contextual and syntactic information useful for SaCTI. However, the SaCTI task is challenging primarily due to the implicitly encoded context-sensitive semantic relation between the compound components. Thus, we propose a novel multi-task learning architecture which incorporates the contextual information and enriches the complementary syntactic information using morphological tagging and dependency parsing as two auxiliary tasks. Experiments on the benchmark datasets for SaCTI show 6.1 points (Accuracy) and 7.7 points (F1-score) absolute gain compared to the state-of-the-art system. Further, our multi-lingual experiments demonstrate the efficacy of the proposed architecture in English and Marathi languages.","PeriodicalId":91381,"journal":{"name":"Proceedings of COLING. International Conference on Computational Linguistics","volume":"8 1","pages":"4071-4083"},"PeriodicalIF":0.0000,"publicationDate":"2022-08-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of COLING. International Conference on Computational Linguistics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.48550/arXiv.2208.10310","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 2

Abstract

The phenomenon of compounding is ubiquitous in Sanskrit. It serves for achieving brevity in expressing thoughts, while simultaneously enriching the lexical and structural formation of the language. In this work, we focus on the Sanskrit Compound Type Identification (SaCTI) task, where we consider the problem of identifying semantic relations between the components of a compound word. Earlier approaches solely rely on the lexical information obtained from the components and ignore the most crucial contextual and syntactic information useful for SaCTI. However, the SaCTI task is challenging primarily due to the implicitly encoded context-sensitive semantic relation between the compound components. Thus, we propose a novel multi-task learning architecture which incorporates the contextual information and enriches the complementary syntactic information using morphological tagging and dependency parsing as two auxiliary tasks. Experiments on the benchmark datasets for SaCTI show 6.1 points (Accuracy) and 7.7 points (F1-score) absolute gain compared to the state-of-the-art system. Further, our multi-lingual experiments demonstrate the efficacy of the proposed architecture in English and Marathi languages.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

一种新的上下文敏感的梵文复合型识别多任务学习方法

复合现象在梵语中无处不在。它的作用是达到表达思想的简洁，同时丰富语言的词汇和结构形式。在这项工作中，我们专注于梵语复合类型识别(SaCTI)任务，在该任务中，我们考虑了识别复合词组成部分之间语义关系的问题。早期的方法仅依赖于从组件获得的词汇信息，而忽略了对SaCTI有用的最重要的上下文和语法信息。然而，由于复合组件之间隐式编码的上下文敏感语义关系，SaCTI任务具有挑战性。因此，我们提出了一种新的多任务学习架构，该架构以形态标注和依存句法分析为辅助任务，融合语境信息并丰富互补句法信息。在SaCTI基准数据集上的实验显示，与最先进的系统相比，该系统的绝对增益为6.1分(精度)和7.7分(F1-score)。此外，我们的多语言实验证明了该架构在英语和马拉地语中的有效性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Proceedings of COLING. International Conference on Computational Linguistics

自引率

0.00%

发文量