Efficient and Equitable Natural Language Processing in the Age of Deep Learning (Dagstuhl Seminar 22232)

Dagstuhl reports Pub Date : 2022-01-01 DOI:10.4230/DagRep.12.6.14

Jesse Dodge, Iryna Gurevych, Roy Schwartz, Emma Strubell, Betty van Aken

{"title":"Efficient and Equitable Natural Language Processing in the Age of Deep Learning (Dagstuhl Seminar 22232)","authors":"Jesse Dodge, Iryna Gurevych, Roy Schwartz, Emma Strubell, Betty van Aken","doi":"10.4230/DagRep.12.6.14","DOIUrl":null,"url":null,"abstract":"This report documents the program and the outcomes of Dagstuhl Seminar 22232 “Efficient and Equitable Natural Language Processing in the Age of Deep Learning”. Since 2012, the field of artificial intelligence (AI) has reported remarkable progress on a broad range of capabilities including object recognition, game playing, speech recognition, and machine translation. Much of this progress has been achieved by increasingly large and computationally intensive deep learning models: training costs for state-of-the-art deep learning models have increased 300,000 times between 2012 and 2018 [1]. Perhaps the epitome of this trend is the subfield of natural language processing (NLP) that over the past three years has experienced even sharper growth in model size and corresponding computational requirements in the word embedding approaches (e.g. ELMo, BERT, openGPT-2, Megatron-LM, T5, and GPT-3, one of the largest models ever trained with 175B dense parameters) that are now the basic building blocks of nearly all NLP models. Recent studies indicate that this trend is both environmentally unfriendly and prohibitively expensive, raising barriers to participation in NLP research [2, 3]. The goal of this seminar was to mitigate these concerns and promote equity of access in NLP.","PeriodicalId":91064,"journal":{"name":"Dagstuhl reports","volume":"12 1","pages":"14-27"},"PeriodicalIF":0.0000,"publicationDate":"2022-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Dagstuhl reports","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.4230/DagRep.12.6.14","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

Abstract

This report documents the program and the outcomes of Dagstuhl Seminar 22232 “Efficient and Equitable Natural Language Processing in the Age of Deep Learning”. Since 2012, the field of artificial intelligence (AI) has reported remarkable progress on a broad range of capabilities including object recognition, game playing, speech recognition, and machine translation. Much of this progress has been achieved by increasingly large and computationally intensive deep learning models: training costs for state-of-the-art deep learning models have increased 300,000 times between 2012 and 2018 [1]. Perhaps the epitome of this trend is the subfield of natural language processing (NLP) that over the past three years has experienced even sharper growth in model size and corresponding computational requirements in the word embedding approaches (e.g. ELMo, BERT, openGPT-2, Megatron-LM, T5, and GPT-3, one of the largest models ever trained with 175B dense parameters) that are now the basic building blocks of nearly all NLP models. Recent studies indicate that this trend is both environmentally unfriendly and prohibitively expensive, raising barriers to participation in NLP research [2, 3]. The goal of this seminar was to mitigate these concerns and promote equity of access in NLP.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

深度学习时代高效公平的自然语言处理(Dagstuhl Seminar 22232)

本报告记录了Dagstuhl研讨会22232“深度学习时代的高效和公平的自然语言处理”的计划和成果。自2012年以来，人工智能(AI)领域在包括物体识别、游戏、语音识别和机器翻译在内的广泛能力方面取得了显著进展。这种进步很大程度上是通过日益庞大和计算密集型的深度学习模型实现的:最先进的深度学习模型的训练成本在2012年至2018年间增加了30万倍。也许这一趋势的缩影是自然语言处理(NLP)的子领域，在过去三年中，在词嵌入方法(例如ELMo, BERT, openGPT-2, Megatron-LM, T5和GPT-3，有史以来使用175B个密集参数训练的最大模型之一)中，模型大小和相应的计算需求经历了更急剧的增长，这些方法现在几乎是所有NLP模型的基本构建块。最近的研究表明，这种趋势既不环保，又昂贵，增加了参与NLP研究的障碍[2,3]。这次研讨会的目的是减轻这些担忧，促进自然语言学习的公平。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Dagstuhl reports

自引率

0.00%

发文量