A multimodal grammar of artificial intelligence: Measuring the gains and losses in generative artificial intelligence

B. Cope, M. Kalantzis
{"title":"A multimodal grammar of artificial intelligence: Measuring the gains and losses in generative artificial intelligence","authors":"B. Cope, M. Kalantzis","doi":"10.1177/26349795231221699","DOIUrl":null,"url":null,"abstract":"This paper analyzes the scope of Artificial Intelligence (AI) from the perspective of a multimodal grammar. Its focal point is Generative AI, a technology that puts so-called Large Language Models to work. The first part of the paper analyzes Generative AI, based as it is on the statistical probability of one token (a word or part of a word) following another. If the relation of tokens is meaningful, this is circumstantial and no more, because its mechanisms of statistical analysis eschew any theory of meaning. This is the case not only for the written text that Generative AI leverages, but by extension image and multimodal forms of meaning that it can generate. The AI can only work with non-textual forms of meaning after applying language labels, and to that extent is captive not only to the limits of probabilistic statistics but the limits of written language as well. While acknowledging gains arising from the brute statistical power of Generative AI, in its second part the paper goes on to map what is lost in its statistical and text-bound approaches to multimodal meaning-making. Our measure of these gains and losses is guided by the concept of grammar, defined here as a theory of the elemental patterns of meaning in the world—not just written text and speech, but also image, space, object, body, and sound. Ironically, a good deal of what is lost by Generative AI is computable. The third and final part of the paper briefly discusses educational applications of Generative AI. Given both its power and intrinsic limitations, we have been experimenting with the application of Generative AI in educational settings and the ways it might be put to pedagogical use. How does a grammatical analysis help us to identify the scope of worthwhile application? Finally, if more of human experience is computable than can be captured in text-bound AI, how might it be possible at the level of code to create a synthesis in which grammatical and multimodal approaches complement Generative AI?","PeriodicalId":153235,"journal":{"name":"Multimodality & Society","volume":"39 3","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2023-12-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Multimodality & Society","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1177/26349795231221699","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

This paper analyzes the scope of Artificial Intelligence (AI) from the perspective of a multimodal grammar. Its focal point is Generative AI, a technology that puts so-called Large Language Models to work. The first part of the paper analyzes Generative AI, based as it is on the statistical probability of one token (a word or part of a word) following another. If the relation of tokens is meaningful, this is circumstantial and no more, because its mechanisms of statistical analysis eschew any theory of meaning. This is the case not only for the written text that Generative AI leverages, but by extension image and multimodal forms of meaning that it can generate. The AI can only work with non-textual forms of meaning after applying language labels, and to that extent is captive not only to the limits of probabilistic statistics but the limits of written language as well. While acknowledging gains arising from the brute statistical power of Generative AI, in its second part the paper goes on to map what is lost in its statistical and text-bound approaches to multimodal meaning-making. Our measure of these gains and losses is guided by the concept of grammar, defined here as a theory of the elemental patterns of meaning in the world—not just written text and speech, but also image, space, object, body, and sound. Ironically, a good deal of what is lost by Generative AI is computable. The third and final part of the paper briefly discusses educational applications of Generative AI. Given both its power and intrinsic limitations, we have been experimenting with the application of Generative AI in educational settings and the ways it might be put to pedagogical use. How does a grammatical analysis help us to identify the scope of worthwhile application? Finally, if more of human experience is computable than can be captured in text-bound AI, how might it be possible at the level of code to create a synthesis in which grammatical and multimodal approaches complement Generative AI?
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
人工智能的多模式语法:衡量生成式人工智能的得失
本文从多模态语法的角度分析了人工智能(AI)的范围。其重点是生成式人工智能,这是一种将所谓的大型语言模型投入使用的技术。本文的第一部分分析了生成式人工智能,因为它是以一个标记(一个词或词的一部分)跟随另一个标记的统计概率为基础的。如果标记之间的关系是有意义的,那么这只是间接的,仅此而已,因为它的统计分析机制摒弃了任何意义理论。这不仅适用于生成式人工智能所利用的书面文本,也适用于它所能生成的图像和多模态意义形式。人工智能只能在应用语言标签后才能处理非文本形式的意义,因此不仅受到概率统计的限制,也受到书面语言的限制。在承认生成式人工智能的强大统计能力所带来的收益的同时,本文的第二部分将继续描绘其在多模态意义生成的统计和文本束缚方法中的损失。我们对这些得失的衡量是以语法概念为指导的,语法在这里被定义为世界意义要素模式的理论--不仅仅是书面文本和语音,还包括图像、空间、物体、身体和声音。具有讽刺意味的是,生成式人工智能所失去的很多东西都是可以计算的。本文的第三部分也是最后一部分简要讨论了生成式人工智能在教育领域的应用。鉴于生成式人工智能的强大功能和内在局限性,我们一直在尝试在教育环境中应用生成式人工智能,以及将其用于教学的方法。语法分析如何帮助我们确定有价值的应用范围?最后,如果可计算的人类经验比文本人工智能所能捕捉到的更多,那么如何在代码层面创造一种综合方法,使语法和多模态方法与生成式人工智能相辅相成?
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Exuberant and fantastical: How flythroughs are changing the way museums construct their identities and build relationships with their publics Enabling touch in an art museum: A curatorial reflection Methodological challenges and insights for researching embodied learning in museums Innovating visitor research within museums: Concepts, tools and practices Book review: Museums and technologies of presence
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1