首页 > 最新文献

Natural Language Engineering最新文献

英文 中文
Real-world sentence boundary detection using multitask learning: A case study on French 基于多任务学习的真实世界句子边界检测——以法语为例
IF 2.5 3区 计算机科学 Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2022-04-06 DOI: 10.1017/s1351324922000134
Kyungtae Lim, Jungyeul Park
We propose a novel approach for sentence boundary detection in text datasets in which boundaries are not evident (e.g., sentence fragments). Although detecting sentence boundaries without punctuation marks has rarely been explored in written text, current real-world textual data suffer from widespread lack of proper start/stop signaling. Herein, we annotate a dataset with linguistic information, such as parts of speech and named entity labels, to boost the sentence boundary detection task. Via experiments, we obtained F1 scores up to 98.07% using the proposed multitask neural model, including a score of 89.41% for sentences completely lacking punctuation marks. We also present an ablation study and provide a detailed analysis to demonstrate the effectiveness of the proposed multitask learning method.
我们提出了一种在边界不明显的文本数据集中(例如,句子片段)进行句子边界检测的新方法。尽管在书面文本中很少探索检测没有标点符号的句子边界,但当前现实世界的文本数据普遍缺乏正确的开始/停止信号。在此,我们用语言信息(如词性和命名实体标签)对数据集进行注释,以增强句子边界检测任务。通过实验,我们使用所提出的多任务神经模型获得了高达98.07%的F1分数,其中完全没有标点符号的句子的分数为89.41%。我们还介绍了一项消融研究,并提供了详细的分析,以证明所提出的多任务学习方法的有效性。
{"title":"Real-world sentence boundary detection using multitask learning: A case study on French","authors":"Kyungtae Lim, Jungyeul Park","doi":"10.1017/s1351324922000134","DOIUrl":"https://doi.org/10.1017/s1351324922000134","url":null,"abstract":"\u0000 We propose a novel approach for sentence boundary detection in text datasets in which boundaries are not evident (e.g., sentence fragments). Although detecting sentence boundaries without punctuation marks has rarely been explored in written text, current real-world textual data suffer from widespread lack of proper start/stop signaling. Herein, we annotate a dataset with linguistic information, such as parts of speech and named entity labels, to boost the sentence boundary detection task. Via experiments, we obtained F1 scores up to 98.07% using the proposed multitask neural model, including a score of 89.41% for sentences completely lacking punctuation marks. We also present an ablation study and provide a detailed analysis to demonstrate the effectiveness of the proposed multitask learning method.","PeriodicalId":49143,"journal":{"name":"Natural Language Engineering","volume":" ","pages":""},"PeriodicalIF":2.5,"publicationDate":"2022-04-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"43668233","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Gender bias in legal corpora and debiasing it 法律语料库中的性别偏见及其消除
IF 2.5 3区 计算机科学 Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2022-03-30 DOI: 10.1017/s1351324922000122
Nurullah Sevim, Furkan Şahinuç, Aykut Koç
Word embeddings have become important building blocks that are used profoundly in natural language processing (NLP). Despite their several advantages, word embeddings can unintentionally accommodate some gender- and ethnicity-based biases that are present within the corpora they are trained on. Therefore, ethical concerns have been raised since word embeddings are extensively used in several high-level algorithms. Studying such biases and debiasing them have recently become an important research endeavor. Various studies have been conducted to measure the extent of bias that word embeddings capture and to eradicate them. Concurrently, as another subfield that has started to gain traction recently, the applications of NLP in the field of law have started to increase and develop rapidly. As law has a direct and utmost effect on people’s lives, the issues of bias for NLP applications in legal domain are certainly important. However, to the best of our knowledge, bias issues have not yet been studied in the context of legal corpora. In this article, we approach the gender bias problem from the scope of legal text processing domain. Word embedding models that are trained on corpora composed by legal documents and legislation from different countries have been utilized to measure and eliminate gender bias in legal documents. Several methods have been employed to reveal the degree of gender bias and observe its variations over countries. Moreover, a debiasing method has been used to neutralize unwanted bias. The preservation of semantic coherence of the debiased vector space has also been demonstrated by using high-level tasks. Finally, overall results and their implications have been discussed in the scope of NLP in legal domain.
词嵌入已经成为自然语言处理(NLP)中广泛使用的重要组成部分。尽管词嵌入有一些优点,但它们可能会无意中适应一些基于性别和种族的偏见,这些偏见存在于它们所训练的语料库中。因此,由于词嵌入在一些高级算法中广泛使用,伦理问题已经提出。研究这些偏见并消除它们最近成为一项重要的研究工作。已经进行了各种各样的研究来测量词嵌入捕获的偏见程度并消除它们。与此同时,作为近年来兴起的另一个分支领域,自然语言处理在法律领域的应用也开始增多和迅速发展。由于法律对人们的生活有直接和最大的影响,自然语言处理在法律领域应用的偏见问题当然很重要。然而,据我们所知,偏见问题尚未在法律语料库的背景下进行研究。在本文中,我们从法律文本处理领域的角度来探讨性别偏见问题。利用对各国法律文件和立法组成的语料库进行训练的词嵌入模型来衡量和消除法律文件中的性别偏见。已经采用了几种方法来揭示性别偏见的程度,并观察其在各国之间的差异。此外,还采用了一种消除偏置的方法来消除不必要的偏置。通过使用高级任务也证明了去偏向量空间的语义一致性的保存。最后,在法律领域的自然语言处理范围内讨论了总体结果及其影响。
{"title":"Gender bias in legal corpora and debiasing it","authors":"Nurullah Sevim, Furkan Şahinuç, Aykut Koç","doi":"10.1017/s1351324922000122","DOIUrl":"https://doi.org/10.1017/s1351324922000122","url":null,"abstract":"\u0000 Word embeddings have become important building blocks that are used profoundly in natural language processing (NLP). Despite their several advantages, word embeddings can unintentionally accommodate some gender- and ethnicity-based biases that are present within the corpora they are trained on. Therefore, ethical concerns have been raised since word embeddings are extensively used in several high-level algorithms. Studying such biases and debiasing them have recently become an important research endeavor. Various studies have been conducted to measure the extent of bias that word embeddings capture and to eradicate them. Concurrently, as another subfield that has started to gain traction recently, the applications of NLP in the field of law have started to increase and develop rapidly. As law has a direct and utmost effect on people’s lives, the issues of bias for NLP applications in legal domain are certainly important. However, to the best of our knowledge, bias issues have not yet been studied in the context of legal corpora. In this article, we approach the gender bias problem from the scope of legal text processing domain. Word embedding models that are trained on corpora composed by legal documents and legislation from different countries have been utilized to measure and eliminate gender bias in legal documents. Several methods have been employed to reveal the degree of gender bias and observe its variations over countries. Moreover, a debiasing method has been used to neutralize unwanted bias. The preservation of semantic coherence of the debiased vector space has also been demonstrated by using high-level tasks. Finally, overall results and their implications have been discussed in the scope of NLP in legal domain.","PeriodicalId":49143,"journal":{"name":"Natural Language Engineering","volume":"1 1","pages":"449-482"},"PeriodicalIF":2.5,"publicationDate":"2022-03-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"78705462","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 10
Generating Arabic TAG for syntax-semantics analysis 生成用于语法语义分析的阿拉伯语TAG
IF 2.5 3区 计算机科学 Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2022-03-24 DOI: 10.1017/s1351324922000109
Chérifa Ben Khelil, C. Zribi, D. Duchier, Y. Parmentier
Arabic presents many challenges for automatic processing. Although several research studies have addressed some issues, electronic resources for processing Arabic remain relatively rare or not widely available. In this paper, we propose a Tree-adjoining grammar with a syntax-semantic interface. It is applied to the modern standard Arabic, but it can be easily adapted to other languages. This grammar named “ArabTAG V2.0” (Arabic Tree Adjoining Grammar) is semi-automatically generated by means of an abstract representation called meta-grammar. To ensure its development, ArabTAG V2.0 benefits from a grammar testing environment that uses a corpus of phenomena. Further experiments were performed to check the coverage of this grammar as well as the syntax-semantic analysis. The results showed that ArabTAG V2.0 can cover the majority of syntactical structures and different linguistic phenomena with a precision rate of 88.76%. Moreover, we were able to semantically analyze sentences and build their semantic representations with a precision rate of about 95.63%.
阿拉伯语对自动加工提出了许多挑战。虽然有几项研究已经解决了一些问题,但是处理阿拉伯文的电子资源仍然相对稀少或没有广泛使用。本文提出了一种具有语法-语义接口的树相邻语法。它适用于现代标准阿拉伯语,但它可以很容易地适应其他语言。这个名为“ArabTAG V2.0”(阿拉伯树相邻语法)的语法是通过称为元语法的抽象表示半自动生成的。为了确保其开发,ArabTAG V2.0受益于使用现象语料库的语法测试环境。进一步的实验验证了该语法的覆盖范围以及语法语义分析。结果表明,ArabTAG V2.0可以覆盖大部分的句法结构和不同的语言现象,准确率达到88.76%。此外,我们能够对句子进行语义分析并构建其语义表示,准确率约为95.63%。
{"title":"Generating Arabic TAG for syntax-semantics analysis","authors":"Chérifa Ben Khelil, C. Zribi, D. Duchier, Y. Parmentier","doi":"10.1017/s1351324922000109","DOIUrl":"https://doi.org/10.1017/s1351324922000109","url":null,"abstract":"\u0000 Arabic presents many challenges for automatic processing. Although several research studies have addressed some issues, electronic resources for processing Arabic remain relatively rare or not widely available. In this paper, we propose a Tree-adjoining grammar with a syntax-semantic interface. It is applied to the modern standard Arabic, but it can be easily adapted to other languages. This grammar named “ArabTAG V2.0” (Arabic Tree Adjoining Grammar) is semi-automatically generated by means of an abstract representation called meta-grammar. To ensure its development, ArabTAG V2.0 benefits from a grammar testing environment that uses a corpus of phenomena. Further experiments were performed to check the coverage of this grammar as well as the syntax-semantic analysis. The results showed that ArabTAG V2.0 can cover the majority of syntactical structures and different linguistic phenomena with a precision rate of 88.76%. Moreover, we were able to semantically analyze sentences and build their semantic representations with a precision rate of about 95.63%.","PeriodicalId":49143,"journal":{"name":"Natural Language Engineering","volume":"255 1","pages":"386-424"},"PeriodicalIF":2.5,"publicationDate":"2022-03-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"82292033","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
In-depth analysis of the impact of OCR errors on named entity recognition and linking 深入分析了OCR错误对命名实体识别和链接的影响
IF 2.5 3区 计算机科学 Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2022-03-18 DOI: 10.1017/s1351324922000110
Ahmed Hamdi, Elvys Linhares Pontes, Nicolas Sidère, Mickaël Coustaty, A. Doucet
Named entities (NEs) are among the most relevant type of information that can be used to properly index digital documents and thus easily retrieve them. It has long been observed that NEs are key to accessing the contents of digital library portals as they are contained in most user queries. However, most digitized documents are indexed through their optical character recognition (OCRed) version which include numerous errors. Although OCR engines have considerably improved over the last few years, OCR errors still considerably impact document access. Previous works were conducted to evaluate the impact of OCR errors on named entity recognition (NER) and named entity linking (NEL) techniques separately. In this article, we experimented with a variety of OCRed documents with different levels and types of OCR noise to assess in depth the impact of OCR on named entity processing. We provide a deep analysis of OCR errors that impact the performance of NER and NEL. We then present the resulting exhaustive study and subsequent recommendations on the adequate documents, the OCR quality levels, and the post-OCR correction strategies required to perform reliable NER and NEL.
命名实体(NEs)是最相关的信息类型之一,可用于正确地索引数字文档,从而轻松地检索它们。长期以来,人们一直观察到网元是访问数字图书馆门户内容的关键,因为它们包含在大多数用户查询中。然而,大多数数字化文档都是通过光学字符识别(OCRed)版本进行索引的,其中包含许多错误。尽管OCR引擎在过去几年中有了很大的改进,但是OCR错误仍然会严重影响文档访问。之前的研究分别评估了OCR错误对命名实体识别(NER)和命名实体链接(NEL)技术的影响。在本文中,我们对具有不同级别和类型OCR噪声的各种ocredd文档进行了实验,以深入评估OCR对命名实体处理的影响。我们对影响NER和NEL性能的OCR误差进行了深入分析。然后,我们提出了详尽的研究结果,并就适当的文件、OCR质量水平以及执行可靠的NER和NEL所需的OCR后校正策略提出了后续建议。
{"title":"In-depth analysis of the impact of OCR errors on named entity recognition and linking","authors":"Ahmed Hamdi, Elvys Linhares Pontes, Nicolas Sidère, Mickaël Coustaty, A. Doucet","doi":"10.1017/s1351324922000110","DOIUrl":"https://doi.org/10.1017/s1351324922000110","url":null,"abstract":"\u0000 Named entities (NEs) are among the most relevant type of information that can be used to properly index digital documents and thus easily retrieve them. It has long been observed that NEs are key to accessing the contents of digital library portals as they are contained in most user queries. However, most digitized documents are indexed through their optical character recognition (OCRed) version which include numerous errors. Although OCR engines have considerably improved over the last few years, OCR errors still considerably impact document access. Previous works were conducted to evaluate the impact of OCR errors on named entity recognition (NER) and named entity linking (NEL) techniques separately. In this article, we experimented with a variety of OCRed documents with different levels and types of OCR noise to assess in depth the impact of OCR on named entity processing. We provide a deep analysis of OCR errors that impact the performance of NER and NEL. We then present the resulting exhaustive study and subsequent recommendations on the adequate documents, the OCR quality levels, and the post-OCR correction strategies required to perform reliable NER and NEL.","PeriodicalId":49143,"journal":{"name":"Natural Language Engineering","volume":"84 1","pages":"425-448"},"PeriodicalIF":2.5,"publicationDate":"2022-03-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"83850798","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
MHeTRep: A multilingual semantically tagged health terms repository MHeTRep:多语言语义标记的运行状况术语存储库
IF 2.5 3区 计算机科学 Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2022-02-25 DOI: 10.1017/s1351324922000055
J. Vivaldi, H. Rodríguez
Abstract This paper presents MHeTRep, a multilingual medical terminology and the methodology followed for its compilation. The multilingual terminology is organised into one vocabulary for each language. All the terms in the collection are semantically tagged with a tagset corresponding to the top categories of Snomed-CT ontology. When possible, the individual terms are linked to their equivalent in the other languages. Even though many NLP resources and tools claim to be domain independent, their application to specific tasks can be restricted to specific domains, otherwise their performance degrades notably. As the accuracy of NLP resources drops heavily when applied in environments different from which they were built, a tuning to the new environment is needed. Usually, having a domain terminology facilitates and accelerates the adaptation of general domain NLP applications to a new domain. This is particularly important in medicine, a domain living moments of great expansion. The proposed method takes Snomed-CT as starting point. From this point and using 13 multilingual resources, covering the most relevant medical concepts such as drugs, anatomy, clinical findings and procedures, we built a large resource covering seven languages totalling more than two million semantically tagged terms. The resulting collection has been intensively evaluated in several ways for the involved languages and domain categories. Our hypothesis is that MHeTRep can be used advantageously over the original resources for a number of NLP use cases and likely extended to other languages.
摘要本文介绍了MHeTRep,一个多语言医学术语和方法遵循其编译。多语言术语为每种语言组织成一个词汇表。集合中的所有术语都使用与Snomed-CT本体的顶级类别相对应的标记集进行语义标记。在可能的情况下,将单个术语与其他语言中的对应术语链接起来。尽管许多NLP资源和工具声称是领域独立的,但它们对特定任务的应用可以限制在特定的领域,否则它们的性能会显著下降。由于NLP资源在不同于其构建环境的环境中应用时准确性会严重下降,因此需要对新环境进行调优。通常,拥有一个领域术语可以促进和加速一般领域NLP应用程序对新领域的适应。这在医学领域尤其重要,因为医学领域正处于巨大的扩张时期。该方法以Snomed-CT为起始点。从这一点出发,使用13种多语言资源,涵盖最相关的医学概念,如药物,解剖学,临床发现和程序,我们建立了一个涵盖七种语言的大型资源,总计超过200万个语义标记术语。对于所涉及的语言和领域类别,结果集已经以几种方式进行了深入评估。我们的假设是,在许多NLP用例中,MHeTRep可以比原始资源更有利地使用,并可能扩展到其他语言。
{"title":"MHeTRep: A multilingual semantically tagged health terms repository","authors":"J. Vivaldi, H. Rodríguez","doi":"10.1017/s1351324922000055","DOIUrl":"https://doi.org/10.1017/s1351324922000055","url":null,"abstract":"Abstract This paper presents MHeTRep, a multilingual medical terminology and the methodology followed for its compilation. The multilingual terminology is organised into one vocabulary for each language. All the terms in the collection are semantically tagged with a tagset corresponding to the top categories of Snomed-CT ontology. When possible, the individual terms are linked to their equivalent in the other languages. Even though many NLP resources and tools claim to be domain independent, their application to specific tasks can be restricted to specific domains, otherwise their performance degrades notably. As the accuracy of NLP resources drops heavily when applied in environments different from which they were built, a tuning to the new environment is needed. Usually, having a domain terminology facilitates and accelerates the adaptation of general domain NLP applications to a new domain. This is particularly important in medicine, a domain living moments of great expansion. The proposed method takes Snomed-CT as starting point. From this point and using 13 multilingual resources, covering the most relevant medical concepts such as drugs, anatomy, clinical findings and procedures, we built a large resource covering seven languages totalling more than two million semantically tagged terms. The resulting collection has been intensively evaluated in several ways for the involved languages and domain categories. Our hypothesis is that MHeTRep can be used advantageously over the original resources for a number of NLP use cases and likely extended to other languages.","PeriodicalId":49143,"journal":{"name":"Natural Language Engineering","volume":"29 1","pages":"1364 - 1401"},"PeriodicalIF":2.5,"publicationDate":"2022-02-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"44880191","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An empirical study of cyclical learning rate on neural machine translation 神经机器翻译周期学习率的实证研究
IF 2.5 3区 计算机科学 Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2022-02-09 DOI: 10.1017/s135132492200002x
Weixuan Wang, Choon Meng Lee, Jianfeng Liu, Talha Çolakoğlu, Wei Peng
In training deep learning networks, the optimizer and related learning rate are often used without much thought or with minimal tuning, even though it is crucial in ensuring a fast convergence to a good quality minimum of the loss function that can also generalize well on the test dataset. Drawing inspiration from the successful application of cyclical learning rate policy to computer vision tasks, we explore how cyclical learning rate can be applied to train transformer-based neural networks for neural machine translation. From our carefully designed experiments, we show that the choice of optimizers and the associated cyclical learning rate policy can have a significant impact on the performance. In addition, we establish guidelines when applying cyclical learning rates to neural machine translation tasks.
在训练深度学习网络时,优化器和相关的学习率通常没有经过太多的思考或进行最小的调整,即使它对于确保快速收敛到一个高质量的最小损失函数(也可以在测试数据集上很好地泛化)至关重要。从周期学习率策略在计算机视觉任务中的成功应用中获得灵感,我们探索了如何将周期学习率应用于训练基于变压器的神经网络,用于神经机器翻译。从我们精心设计的实验中,我们表明优化器的选择和相关的周期性学习率策略可以对性能产生重大影响。此外,我们在将周期性学习率应用于神经机器翻译任务时建立了指导方针。
{"title":"An empirical study of cyclical learning rate on neural machine translation","authors":"Weixuan Wang, Choon Meng Lee, Jianfeng Liu, Talha Çolakoğlu, Wei Peng","doi":"10.1017/s135132492200002x","DOIUrl":"https://doi.org/10.1017/s135132492200002x","url":null,"abstract":"\u0000 In training deep learning networks, the optimizer and related learning rate are often used without much thought or with minimal tuning, even though it is crucial in ensuring a fast convergence to a good quality minimum of the loss function that can also generalize well on the test dataset. Drawing inspiration from the successful application of cyclical learning rate policy to computer vision tasks, we explore how cyclical learning rate can be applied to train transformer-based neural networks for neural machine translation. From our carefully designed experiments, we show that the choice of optimizers and the associated cyclical learning rate policy can have a significant impact on the performance. In addition, we establish guidelines when applying cyclical learning rates to neural machine translation tasks.","PeriodicalId":49143,"journal":{"name":"Natural Language Engineering","volume":"88 1","pages":"316-336"},"PeriodicalIF":2.5,"publicationDate":"2022-02-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"80588155","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Emerging Trends: SOTA-Chasing 新兴趋势:SOTA追逐
IF 2.5 3区 计算机科学 Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2022-02-08 DOI: 10.1017/S1351324922000043
Kenneth Ward Church, Valia Kordoni
Abstract Many papers are chasing state-of-the-art (SOTA) numbers, and more will do so in the future. SOTA-chasing comes with many costs. SOTA-chasing squeezes out more promising opportunities such as coopetition and interdisciplinary collaboration. In addition, there is a risk that too much SOTA-chasing could lead to claims of superhuman performance, unrealistic expectations, and the next AI winter. Two root causes for SOTA-chasing will be discussed: (1) lack of leadership and (2) iffy reviewing processes. SOTA-chasing may be similar to the replication crisis in the scientific literature. The replication crisis is yet another example, like evaluation, of over-confidence in accepted practices and the scientific method, even when such practices lead to absurd consequences.
摘要许多论文都在追求最先进的(SOTA)数字,未来还会有更多的论文这样做。追逐SOTA需要付出很多代价。SOTA的追逐挤出了合作竞争和跨学科合作等更有前景的机会。此外,过多的SOTA追逐可能会导致超人的表现、不切实际的期望和下一个人工智能冬天的到来。将讨论SOTA追逐的两个根本原因:(1)缺乏领导力和(2)审查过程不确定。SOTA追逐可能类似于科学文献中的复制危机。复制危机是另一个例子,就像评估一样,对公认的做法和科学方法过于自信,即使这种做法会导致荒谬的后果。
{"title":"Emerging Trends: SOTA-Chasing","authors":"Kenneth Ward Church, Valia Kordoni","doi":"10.1017/S1351324922000043","DOIUrl":"https://doi.org/10.1017/S1351324922000043","url":null,"abstract":"Abstract Many papers are chasing state-of-the-art (SOTA) numbers, and more will do so in the future. SOTA-chasing comes with many costs. SOTA-chasing squeezes out more promising opportunities such as coopetition and interdisciplinary collaboration. In addition, there is a risk that too much SOTA-chasing could lead to claims of superhuman performance, unrealistic expectations, and the next AI winter. Two root causes for SOTA-chasing will be discussed: (1) lack of leadership and (2) iffy reviewing processes. SOTA-chasing may be similar to the replication crisis in the scientific literature. The replication crisis is yet another example, like evaluation, of over-confidence in accepted practices and the scientific method, even when such practices lead to absurd consequences.","PeriodicalId":49143,"journal":{"name":"Natural Language Engineering","volume":"28 1","pages":"249 - 269"},"PeriodicalIF":2.5,"publicationDate":"2022-02-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49477443","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 23
NLE volume 28 issue 2 Cover and Front matter NLE第28卷第2期封面和封面问题
IF 2.5 3区 计算机科学 Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2022-02-08 DOI: 10.1017/s1351324922000067
R. Mitkov, B. Boguraev
{"title":"NLE volume 28 issue 2 Cover and Front matter","authors":"R. Mitkov, B. Boguraev","doi":"10.1017/s1351324922000067","DOIUrl":"https://doi.org/10.1017/s1351324922000067","url":null,"abstract":"","PeriodicalId":49143,"journal":{"name":"Natural Language Engineering","volume":" ","pages":"f1 - f2"},"PeriodicalIF":2.5,"publicationDate":"2022-02-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"47897633","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
NLE volume 28 issue 2 Cover and Back matter NLE第28卷第2期封面和封底
IF 2.5 3区 计算机科学 Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2022-02-08 DOI: 10.1017/s1351324922000079
{"title":"NLE volume 28 issue 2 Cover and Back matter","authors":"","doi":"10.1017/s1351324922000079","DOIUrl":"https://doi.org/10.1017/s1351324922000079","url":null,"abstract":"","PeriodicalId":49143,"journal":{"name":"Natural Language Engineering","volume":" ","pages":"b1 - b2"},"PeriodicalIF":2.5,"publicationDate":"2022-02-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"44865303","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Towards improving the robustness of sequential labeling models against typographical adversarial examples using triplet loss 利用三重损失提高序列标记模型对排版对抗示例的鲁棒性
IF 2.5 3区 计算机科学 Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2022-02-04 DOI: 10.1017/s1351324921000486
Can Udomcharoenchaikit, P. Boonkwan, P. Vateekul
Many fundamentaltasks in natural language processing (NLP) such as part-of-speech tagging, text chunking, and named-entity recognition can be formulated as sequence labeling problems. Although neural sequence labeling models have shown excellent results on standard test sets, they are very brittle when presented with misspelled texts. In this paper, we introduce an adversarial training framework that enhances the robustness against typographical adversarial examples. We evaluate the robustness of sequence labeling models with an adversarial evaluation scheme that includes typographical adversarial examples. We generate two types of adversarial examples without access (black-box) or with full access (white-box) to the target model’s parameters. We conducted a series of extensive experiments on three languages (English, Thai, and German) across three sequence labeling tasks. Experiments show that the proposed adversarial training framework provides better resistance against adversarial examples on all tasks. We found that we can further improve the model’s robustness on the chunking task by including a triplet loss constraint.
自然语言处理(NLP)中的许多基本任务,如词性标注、文本分块和命名实体识别,都可以表述为序列标注问题。尽管神经序列标记模型在标准测试集上显示了出色的结果,但当出现拼写错误的文本时,它们非常脆弱。在本文中,我们引入了一个对抗训练框架,以增强对排版对抗示例的鲁棒性。我们评估序列标记模型的鲁棒性与一个对抗的评估方案,其中包括排版对抗的例子。我们生成了两种类型的对抗性示例,没有访问(黑盒)或完全访问(白盒)目标模型的参数。我们对三种语言(英语、泰语和德语)进行了一系列广泛的实验,涉及三个序列标记任务。实验表明,提出的对抗性训练框架在所有任务上都具有更好的抗对抗性。我们发现通过加入三元组损失约束可以进一步提高模型在分块任务上的鲁棒性。
{"title":"Towards improving the robustness of sequential labeling models against typographical adversarial examples using triplet loss","authors":"Can Udomcharoenchaikit, P. Boonkwan, P. Vateekul","doi":"10.1017/s1351324921000486","DOIUrl":"https://doi.org/10.1017/s1351324921000486","url":null,"abstract":"\u0000 Many fundamentaltasks in natural language processing (NLP) such as part-of-speech tagging, text chunking, and named-entity recognition can be formulated as sequence labeling problems. Although neural sequence labeling models have shown excellent results on standard test sets, they are very brittle when presented with misspelled texts. In this paper, we introduce an adversarial training framework that enhances the robustness against typographical adversarial examples. We evaluate the robustness of sequence labeling models with an adversarial evaluation scheme that includes typographical adversarial examples. We generate two types of adversarial examples without access (black-box) or with full access (white-box) to the target model’s parameters. We conducted a series of extensive experiments on three languages (English, Thai, and German) across three sequence labeling tasks. Experiments show that the proposed adversarial training framework provides better resistance against adversarial examples on all tasks. We found that we can further improve the model’s robustness on the chunking task by including a triplet loss constraint.","PeriodicalId":49143,"journal":{"name":"Natural Language Engineering","volume":"37 1","pages":"287-315"},"PeriodicalIF":2.5,"publicationDate":"2022-02-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"91318432","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Natural Language Engineering
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1