Efficient and Equitable Natural Language Processing in the Age of Deep Learning (Dagstuhl Seminar 22232)

Jesse Dodge, Iryna Gurevych, Roy Schwartz, Emma Strubell, Betty van Aken
{"title":"Efficient and Equitable Natural Language Processing in the Age of Deep Learning (Dagstuhl Seminar 22232)","authors":"Jesse Dodge, Iryna Gurevych, Roy Schwartz, Emma Strubell, Betty van Aken","doi":"10.4230/DagRep.12.6.14","DOIUrl":null,"url":null,"abstract":"This report documents the program and the outcomes of Dagstuhl Seminar 22232 “Efficient and Equitable Natural Language Processing in the Age of Deep Learning”. Since 2012, the field of artificial intelligence (AI) has reported remarkable progress on a broad range of capabilities including object recognition, game playing, speech recognition, and machine translation. Much of this progress has been achieved by increasingly large and computationally intensive deep learning models: training costs for state-of-the-art deep learning models have increased 300,000 times between 2012 and 2018 [1]. Perhaps the epitome of this trend is the subfield of natural language processing (NLP) that over the past three years has experienced even sharper growth in model size and corresponding computational requirements in the word embedding approaches (e.g. ELMo, BERT, openGPT-2, Megatron-LM, T5, and GPT-3, one of the largest models ever trained with 175B dense parameters) that are now the basic building blocks of nearly all NLP models. Recent studies indicate that this trend is both environmentally unfriendly and prohibitively expensive, raising barriers to participation in NLP research [2, 3]. The goal of this seminar was to mitigate these concerns and promote equity of access in NLP.","PeriodicalId":91064,"journal":{"name":"Dagstuhl reports","volume":"12 1","pages":"14-27"},"PeriodicalIF":0.0000,"publicationDate":"2022-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Dagstuhl reports","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.4230/DagRep.12.6.14","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

This report documents the program and the outcomes of Dagstuhl Seminar 22232 “Efficient and Equitable Natural Language Processing in the Age of Deep Learning”. Since 2012, the field of artificial intelligence (AI) has reported remarkable progress on a broad range of capabilities including object recognition, game playing, speech recognition, and machine translation. Much of this progress has been achieved by increasingly large and computationally intensive deep learning models: training costs for state-of-the-art deep learning models have increased 300,000 times between 2012 and 2018 [1]. Perhaps the epitome of this trend is the subfield of natural language processing (NLP) that over the past three years has experienced even sharper growth in model size and corresponding computational requirements in the word embedding approaches (e.g. ELMo, BERT, openGPT-2, Megatron-LM, T5, and GPT-3, one of the largest models ever trained with 175B dense parameters) that are now the basic building blocks of nearly all NLP models. Recent studies indicate that this trend is both environmentally unfriendly and prohibitively expensive, raising barriers to participation in NLP research [2, 3]. The goal of this seminar was to mitigate these concerns and promote equity of access in NLP.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
深度学习时代高效公平的自然语言处理(Dagstuhl Seminar 22232)
本报告记录了Dagstuhl研讨会22232“深度学习时代的高效和公平的自然语言处理”的计划和成果。自2012年以来,人工智能(AI)领域在包括物体识别、游戏、语音识别和机器翻译在内的广泛能力方面取得了显著进展。这种进步很大程度上是通过日益庞大和计算密集型的深度学习模型实现的:最先进的深度学习模型的训练成本在2012年至2018年间增加了30万倍。也许这一趋势的缩影是自然语言处理(NLP)的子领域,在过去三年中,在词嵌入方法(例如ELMo, BERT, openGPT-2, Megatron-LM, T5和GPT-3,有史以来使用175B个密集参数训练的最大模型之一)中,模型大小和相应的计算需求经历了更急剧的增长,这些方法现在几乎是所有NLP模型的基本构建块。最近的研究表明,这种趋势既不环保,又昂贵,增加了参与NLP研究的障碍[2,3]。这次研讨会的目的是减轻这些担忧,促进自然语言学习的公平。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Computer Science Methods for Effective and Sustainable Simulation Studies (Dagstuhl Seminar 22401) Intelligent Security: Is "AI for Cybersecurity" a Blessing or a Curse (Dagstuhl Seminar 22412) Logic and Random Discrete Structures (Dagstuhl Seminar 22061) Security of Decentralized Financial Technologies (Dagstuhl Seminar 22421) Developmental Machine Learning: From Human Learning to Machines and Back (Dagstuhl Seminar 22422)
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1