Pub Date: 2021-03-01 DOI: 10.2991/NLPR.D.210223.001
Bin Sun, Kan Li
Title: Neural Dialogue Generation Methods in Open Domain: A Survey
Journal: Natural Language Processing Research
Pub Date: 2021-03-01 DOI: 10.2991/NLPR.D.210316.001
Nitish Ranjan Bhowmik, M. Arifuzzaman, M. Mondal, Md. Saiful Islam
Title: Bangla Text Sentiment Analysis Using Supervised Machine Learning with Extended Lexicon Dictionary
Journal: Natural Language Processing Research
With the proliferation of the Internet’s social digital content, sentiment analysis (SA) has gained wide research interest in natural language processing (NLP). Little significant research has been done in the Bangla language domain because of the intricate grammatical structure of Bangla text. This paper focuses on SA in the context of the Bangla language. Firstly, a specific domain-based categorical weighted lexicon data dictionary (LDD) is developed for analyzing sentiments in Bangla. This LDD is developed by applying normalization, tokenization, and stemming to two Bangla datasets available in a GitHub repository. Secondly, a novel rule-based algorithm termed Bangla Text Sentiment Score (BTSC) is developed for detecting sentence polarity. This algorithm considers part-of-speech-tagged words and special characters to generate a score for a word and thus for a sentence and a blog. The BTSC algorithm along with the LDD is applied to extract sentiments by generating scores of the two Bangla datasets. Thirdly, two feature matrices are developed by applying term frequency-inverse document frequency (tf-idf) to the two datasets, and by using the corresponding BTSC scores. Next, supervised machine learning classifiers are applied to the feature matrices
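The feature-building step described in the abstract above (tf-idf values combined with a lexicon-based sentiment score) can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration: the abstract does not give the BTSC rules or the LDD entries, so `LEXICON`, `NEGATORS`, the negation rule, the English stand-in tokens, and the function names `lexicon_score`, `tfidf_matrix`, and `build_features` are hypothetical and are not the authors' method.

```python
import math
from collections import Counter

# Hypothetical weighted lexicon (token -> polarity weight); English stand-ins, not the paper's LDD.
LEXICON = {"good": 1.0, "great": 2.0, "bad": -1.0, "terrible": -2.0}
# Assumed rule, not taken from the paper: a negator flips the sign of the next lexicon word.
NEGATORS = {"not", "never"}


def tokenize(text):
    """Lowercase and split on whitespace; a real pipeline would also normalize and stem."""
    return text.lower().split()


def lexicon_score(tokens):
    """Sum lexicon weights over the tokens, flipping the sign of a word that follows a negator."""
    score, flip = 0.0, False
    for tok in tokens:
        if tok in NEGATORS:
            flip = True
            continue
        weight = LEXICON.get(tok, 0.0)
        if weight:
            score += -weight if flip else weight
            flip = False
    return score


def tfidf_matrix(docs):
    """Return (vocab, rows) where rows[i][j] is tf * idf of vocab[j] in document i."""
    vocab = sorted({t for d in docs for t in d})
    n_docs = len(docs)
    doc_freq = Counter(t for d in docs for t in set(d))
    # idf = ln(N / df) + 1, so terms that occur in every document keep a nonzero weight.
    idf = {t: math.log(n_docs / doc_freq[t]) + 1.0 for t in vocab}
    rows = []
    for d in docs:
        counts = Counter(d)
        length = max(len(d), 1)  # guard against empty documents
        rows.append([counts[t] / length * idf[t] for t in vocab])
    return vocab, rows


def build_features(texts):
    """Append the lexicon score as one extra column after the tf-idf columns."""
    docs = [tokenize(t) for t in texts]
    vocab, rows = tfidf_matrix(docs)
    return vocab, [row + [lexicon_score(d)] for row, d in zip(rows, docs)]


texts = ["the food was great", "the service was not good", "terrible and bad"]
vocab, features = build_features(texts)
print([row[-1] for row in features])  # prints [2.0, -1.0, -3.0]
```

Each row of `features` could then be passed to any supervised classifier. Appending the score as an extra column is only one possible way to combine the two signals; the abstract states only that the feature matrices use tf-idf and the corresponding BTSC scores.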
Pub Date: 2020-06-01 DOI: 10.2991/nlpr.d.200522.001
Qi Su, Mingyu Wan, Xiaoqian Liu, Chu-Ren Huang
Title: Motivations, Methods and Metrics of Misinformation Detection: An NLP Perspective
Journal: Natural Language Processing Research 1(1-2), 1–13
Abstractive summarization is also a relevant task that can be useful for facilitating misinformation detection. Specifically, the summarization model can be applied to identify the central claims of the input texts and serve as a feature extractor prior to misinformation detection. For example, Esmaeilzadeh et al. [24] use a text summarization model to first summarize an article and then input the summarized sequences into an RNN-based neural network to do misinformation detection. The experimental results are compared against the same task using only the original texts, and demonstrate higher performance.

Fact checking is the task of assessing the truthfulness of claims, especially those made by public figures such as politicians [25]. Usually, there is no clear distinction between misinformation detection and fact checking since both of them aim to assess the truthfulness of claims, though misinformation detection usually focuses on certain pieces of information while fact checking is broader [26]. However, fact checking can also be a relevant task of misinformation detection when a piece of information contains claims that need to be verified as true or false.

Rumor detection is often confused with fake news detection, since a rumor refers to a statement consisting of information that is unverified at the posting time. The rumor detection task is then defined as separating personal statements into rumor or nonrumor [27]. Thus, rumor detection can also serve as another relevant task of misinformation detection, first detecting check-worthy statements prior to classifying each statement as true or false. This can help mitigate the impact that subjective opinions or feelings have on the selection of statements that need to be further verified.

Sentiment analysis is the task of extracting emotions from texts or user stances. The sentiment in true and misrepresented information can differ, since publishers of misinformation focus more on how strongly the information impresses the audience and how fast it spreads. Thus, misinformation typically contains either intense emotion that could easily resonate with the public, or controversial statements aiming to evoke intense emotion among receivers. Misinformation detection can therefore also utilize emotion analysis through both the content and user comments. Guo et al. [28] propose an Emotion-based misinformation Detection framework to learn content- and comment-emotion representations for publishers and users respectively, so as to exploit content and social emotions simultaneously for misinformation detection.

1.3. An Overview of the Survey

This survey aims to present a comprehensive review of the study of misinformation in terms of its characteristics and detection methods. It first introduces the related concepts and highlights the significance of misinformation detection. It then uses a two-dimensional model to decompose this task: the internal dimension of
Pub Date: 1900-01-01 DOI: 10.55060/j.nlpre.221227.001
Kan Li
Title: NLPR Journal Re-Launched
Journal: Natural Language Processing Research
NLP applications, including smart customer service, smart home, and more, promote and improve people’s lives. With the help of these apps, people can handle things more easily. As a result, there is a growing demand for newer, better NLP applications. However, existing technologies are already struggling to meet a range of new demands of society. Despite the increasing number and wealth of research on NLP, there are still many intractable technical obstacles such as grammar production, lexical semantics, logical semantics, and so on.