Natural Language Processing Research最新文献

英文中文

Neural Dialogue Generation Methods in Open Domain: A Survey 开放域神经对话生成方法综述

Natural Language Processing Research

Pub Date : 2021-03-01 DOI: 10.2991/NLPR.D.210223.001

Bin Sun, Kan Li

引用次数: 4

Bangla Text Sentiment Analysis Using Supervised Machine Learning with Extended Lexicon Dictionary 基于扩展词典的监督式机器学习孟加拉语文本情感分析

Natural Language Processing Research

Pub Date : 2021-03-01 DOI: 10.2991/NLPR.D.210316.001

Nitish Ranjan Bhowmik, M. Arifuzzaman, M. Mondal, Md. Saiful Islam

WiththeproliferationoftheInternet’ssocialdigitalcontent,sentimentanalysis(SA)hasgainedawideresearchinterestinnatural language processing (NLP). A few significant research has been done in Bangla language domain because of having intricate grammatical structure on text. This paper focuses on SA in the context of Bangla language. Firstly, a specific domain-based categorical weighted lexicon data dictionary (LDD) is developed for analyzing sentiments in Bangla. This LDD is developed by applying the concepts of normalization, tokenization, and stemming to two Bangla datasets available in GitHub repository. Secondly, a novel rule–based algorithm termed as Bangla Text Sentiment Score (BTSC) is developed for detecting sentence polarity. This algorithm considers parts of speech tagger words and special characters to generate a score of a word and thus that ofasentenceandablog.TheBTSCalgorithmalongwiththeLDDisappliedtoextractsentimentsbygeneratingscoresofthetwoBangladatasets.Thirdly,twofeaturematricesaredevelopedbyapplyingtermfrequency-inversedocumentfrequency(tf-idf)to thetwodatasets,andbyusingthecorrespondingBTSCscores.Next,supervisedmachinelearningclassifiersareappliedtothefeaturematrices

随着互联网社交数字内容的激增，情感分析(SA)在自然语言处理(NLP)领域得到了广泛的研究。由于孟加拉语语篇语法结构复杂，因此在孟加拉语领域的研究很少。本文主要研究孟加拉语语境下的SA。首先，开发了一个基于特定领域的分类加权词汇数据字典(LDD)，用于分析孟加拉语的情感。这个LDD是通过将规范化、标记化和词干提取的概念应用于GitHub存储库中的两个孟加拉语数据集来开发的。其次，提出了一种基于规则的孟加拉语文本情感评分(BTSC)算法来检测句子极性。该算法考虑词性、标注词和特殊字符来生成词的分数，从而生成句子和日志的分数。btscc算法与ddisc算法一起通过生成两个数据集的分数来提取情感。第三，通过对两个数据集应用术语频率-逆文档频率(tf-idf)，并使用相应的btscc分数来开发两个特征矩阵。接下来,supervisedmachinelearningclassifiersareappliedtothefeaturematrices

{"title":"Bangla Text Sentiment Analysis Using Supervised Machine Learning with Extended Lexicon Dictionary","authors":"Nitish Ranjan Bhowmik, M. Arifuzzaman, M. Mondal, Md. Saiful Islam","doi":"10.2991/NLPR.D.210316.001","DOIUrl":"https://doi.org/10.2991/NLPR.D.210316.001","url":null,"abstract":"WiththeproliferationoftheInternet’ssocialdigitalcontent,sentimentanalysis(SA)hasgainedawideresearchinterestinnatural language processing (NLP). A few significant research has been done in Bangla language domain because of having intricate grammatical structure on text. This paper focuses on SA in the context of Bangla language. Firstly, a specific domain-based categorical weighted lexicon data dictionary (LDD) is developed for analyzing sentiments in Bangla. This LDD is developed by applying the concepts of normalization, tokenization, and stemming to two Bangla datasets available in GitHub repository. Secondly, a novel rule–based algorithm termed as Bangla Text Sentiment Score (BTSC) is developed for detecting sentence polarity. This algorithm considers parts of speech tagger words and special characters to generate a score of a word and thus that ofasentenceandablog.TheBTSCalgorithmalongwiththeLDDisappliedtoextractsentimentsbygeneratingscoresofthetwoBangladatasets.Thirdly,twofeaturematricesaredevelopedbyapplyingtermfrequency-inversedocumentfrequency(tf-idf)to thetwodatasets,andbyusingthecorrespondingBTSCscores.Next,supervisedmachinelearningclassifiersareappliedtothefeaturematrices","PeriodicalId":332352,"journal":{"name":"Natural Language Processing Research","volume":"60 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125667774","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 24

Motivations, Methods and Metrics of Misinformation Detection: An NLP Perspective 错误信息检测的动机、方法和度量:NLP视角

Natural Language Processing Research

Pub Date : 2020-06-01 DOI: 10.2991/nlpr.d.200522.001

Qi Su, Mingyu Wan, Xiaoqian Liu, Chu-Ren Huang

ive summarization is also a relevant task that can be useful for facilitating misinformation detection. Specifically, the summarization model can be applied to identify the central claims of the input texts and serves as a feature extractor prior to misinformation detection. For example, Esmaeilzadeh et al. [24] use a text summarization model to first summarize an article and then input the summarized sequences into a RNN-based neural network to do misinformation detection. The experimental results are compared against the task using only the original texts, and finally demonstrate higher performance. Fact checking is the task of assessing the truthfulness of claims especially made by public figures such as politicians [25]. Usually, there is no clear distinction between misinformation detection and fact checking since both of them aim to assess the truthfulness of claims, thoughmisinformation detection usually focuses on certain pieces of information while fact checking is broader [26]. However, fact checking can also be a relevant task of misinformation detection when a piece of information contains claims that need to be verified as true or false. Rumor detection is often confused with fake news detection, since rumor refers to a statement consisting of unverified information at the posting time. Rumor detection task is then defined as separating personal statements into rumor or nonrumor [27]. Thus, rumor detection can also serve as another relevant task of misinformation detection to first detect worth-checking statements prior to classifying the statement as true or false. This can help mitigate the impact that subjective opinions or feelings have on the selection of statements that need to be further verified. Sentiment analysis is the task of extracting emotions from texts or user stances. The sentiment in the true and misrepresented information can be different, since publishers of misinformation focus more on the degree to impress the audience and the spreading speed of the information. Thus, misinformation typically either contains intense emotion which could easily resonate with the public, or Q. Su et al. / Natural Language Processing Research 1(1-2) 1–13 3 controversial statements aiming to evoke intense emotion among receivers. Thus, misinformation detection can also utilize emotion analysis through both the content and user comments. Guo et al. [28] propose a Emotion-based misinformation Detection framework to learn contentand comment-emotion representations for publishers and users respectively so as to exploit content and social emotions simultaneously for misinformation detection. 1.3. An Overview of the Survey This survey aims to present a comprehensive review on studying misinformation in terms of its characteristics and detection methods. It first introduces the related concepts and highlights the significance of misinformation detection. It then uses a two-dimensional model to decompose this task: the internal dimension of

摘要也是一项相关的任务，有助于促进错误信息的检测。具体来说，摘要模型可以用于识别输入文本的中心声明，并在错误信息检测之前作为特征提取器。例如，esmailzadeh等人[24]使用文本摘要模型首先对文章进行摘要，然后将摘要序列输入到基于rnn的神经网络中进行误信息检测。实验结果与仅使用原始文本的任务进行了比较，最终证明了更高的性能。事实核查的任务是评估言论的真实性，尤其是政治家等公众人物的言论[25]。通常，错误信息检测和事实检查之间没有明确的区别，因为它们都旨在评估声明的真实性，尽管错误信息检测通常侧重于某些信息，而事实检查则更广泛[26]。然而，当一条信息包含需要验证为真或假的声明时，事实检查也可能是错误信息检测的相关任务。谣言检测常常与假新闻检测相混淆，因为谣言是指在发布时由未经证实的信息组成的声明。然后将谣言检测任务定义为将个人陈述分为谣言或非谣言[27]。因此，谣言检测也可以作为错误信息检测的另一个相关任务，首先检测值得检查的语句，然后将其分类为真或假。这可以帮助减轻主观意见或感觉对需要进一步验证的陈述选择的影响。情感分析是从文本或用户立场中提取情感的任务。真实和虚假信息中的情绪可能是不同的，因为虚假信息的发布者更关注给受众留下深刻印象的程度和信息的传播速度。因此，错误信息通常要么包含强烈的情绪，很容易引起公众的共鸣，要么Q. Su等人/自然语言处理研究1(1-2)1- 13 3有争议的陈述，旨在唤起接受者的强烈情绪。因此，错误信息检测也可以通过内容和用户评论来利用情感分析。Guo等[28]提出了一种基于情感的错误信息检测框架，分别学习发布者和用户的内容和评论情感表征，从而同时利用内容和社会情感进行错误信息检测。1.3. 本调查旨在对错误信息的特征和检测方法进行全面的综述。首先介绍了相关概念，强调了误报检测的重要性。然后，它使用二维模型来分解该任务:描述性分析的内部维度(即低可信度信息的表征)和预测建模的外部维度(即错误信息的自动检测)。特别地，从检测方法、特征表示和模型构建方面回顾了公开可用的数据集和最先进的技术。最后，总结了错误信息检测面临的挑战，并对未来的错误信息检测工作提出了新的展望。

{"title":"Motivations, Methods and Metrics of Misinformation Detection: An NLP Perspective","authors":"Qi Su, Mingyu Wan, Xiaoqian Liu, Chu-Ren Huang","doi":"10.2991/nlpr.d.200522.001","DOIUrl":"https://doi.org/10.2991/nlpr.d.200522.001","url":null,"abstract":"ive summarization is also a relevant task that can be useful for facilitating misinformation detection. Specifically, the summarization model can be applied to identify the central claims of the input texts and serves as a feature extractor prior to misinformation detection. For example, Esmaeilzadeh et al. [24] use a text summarization model to first summarize an article and then input the summarized sequences into a RNN-based neural network to do misinformation detection. The experimental results are compared against the task using only the original texts, and finally demonstrate higher performance. Fact checking is the task of assessing the truthfulness of claims especially made by public figures such as politicians [25]. Usually, there is no clear distinction between misinformation detection and fact checking since both of them aim to assess the truthfulness of claims, thoughmisinformation detection usually focuses on certain pieces of information while fact checking is broader [26]. However, fact checking can also be a relevant task of misinformation detection when a piece of information contains claims that need to be verified as true or false. Rumor detection is often confused with fake news detection, since rumor refers to a statement consisting of unverified information at the posting time. Rumor detection task is then defined as separating personal statements into rumor or nonrumor [27]. Thus, rumor detection can also serve as another relevant task of misinformation detection to first detect worth-checking statements prior to classifying the statement as true or false. This can help mitigate the impact that subjective opinions or feelings have on the selection of statements that need to be further verified. Sentiment analysis is the task of extracting emotions from texts or user stances. The sentiment in the true and misrepresented information can be different, since publishers of misinformation focus more on the degree to impress the audience and the spreading speed of the information. Thus, misinformation typically either contains intense emotion which could easily resonate with the public, or Q. Su et al. / Natural Language Processing Research 1(1-2) 1–13 3 controversial statements aiming to evoke intense emotion among receivers. Thus, misinformation detection can also utilize emotion analysis through both the content and user comments. Guo et al. [28] propose a Emotion-based misinformation Detection framework to learn contentand comment-emotion representations for publishers and users respectively so as to exploit content and social emotions simultaneously for misinformation detection. 1.3. An Overview of the Survey This survey aims to present a comprehensive review on studying misinformation in terms of its characteristics and detection methods. It first introduces the related concepts and highlights the significance of misinformation detection. It then uses a two-dimensional model to decompose this task: the internal dimension of ","PeriodicalId":332352,"journal":{"name":"Natural Language Processing Research","volume":"36 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121587116","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 45

NLPR Journal Re-Launched NLPR期刊重新推出

Natural Language Processing Research

Pub Date : 1900-01-01 DOI: 10.55060/j.nlpre.221227.001

Kan Li

NLP applications promote and improve people’s lives, including smart customer service, smart home, and more. With the help of these apps, people can handle things more easily. As a result, there is a growing demand for some newer, better NLP applications. However, existing technologies are already struggling to meet a range of new demands of society. Despite the increasing number and wealth of research on NLP, there are still many intractable technical obstacles such as grammar production, lexical semantics, logical semantics, and so on.

NLP应用促进和改善了人们的生活，包括智能客户服务、智能家居等。在这些应用程序的帮助下，人们可以更轻松地处理事情。因此，对更新、更好的NLP应用程序的需求不断增长。然而，现有的技术已经很难满足社会的一系列新需求。尽管对自然语言处理的研究越来越多，越来越丰富，但仍然存在许多棘手的技术障碍，如语法生成、词汇语义、逻辑语义等。

引用次数: 0

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

Natural Language Processing Research

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀