Prompting Metalinguistic Awareness in Large Language Models: ChatGPT and Bias Effects on the Grammar of Italian and Italian Varieties

IF 0.1 | Zone 4, Literature | Q3 HISTORY | Verbum | Pub Date: 2023-12-20 | DOI: 10.15388/verb.42
Angelapia Massaro, Giuseppe Samo
Citations: 0

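Abstract

We explore ChatGPT’s handling of left-peripheral phenomena in Italian and Italian varieties through prompt engineering to investigate 1) forms of syntactic bias in the model, and 2) the model’s metalinguistic awareness in relation to reorderings of canonical clauses (e.g., Topics) and certain grammatical categories (object clitics). A further question concerns the content of the model’s sources of training data: how are minor languages included in the model’s training? The results of our investigation show that 1) the model seems to be biased against reorderings, labelling them as archaic even though this is not the case; 2) the model seems to have difficulties with coindexed elements such as clitics and their anaphoric status, labelling them as ‘not referring to any element in the phrase’; and 3) major languages still seem to be dominant, overshadowing the positive effects of including minor languages in the model’s training.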