Bengali Informative Chatbot

Md. Kowsher, M. A. Alam, M. J. Uddin, Md. Rafiqul Islam, Nuruzzaman Pias, Abu Rayhan Md Saifullah
{"title":"Bengali Informative Chatbot","authors":"Md. Kowsher, M. A. Alam, M. J. Uddin, Md. Rafiqul Islam, Nuruzzaman Pias, Abu Rayhan Md Saifullah","doi":"10.1109/IC4ME247184.2019.9036585","DOIUrl":null,"url":null,"abstract":"Bengali Informative Chatbot (BIC) is an effective technique that helps a user to trace relevant information by Natural Language Processing (NLP). In this research paper, we introduce an algorithmic Bengali Informative Chatbot (BIC) based on information that is significant mathematically and statistically. This paper is demonstrated by two algorithms for finding out the lemmatization of Bengali words such as Trie and Dictionary Based Search by Removing Affix (DBSRA) as well as compared with Edit Distance for the exact lemmatization. We present the Bengali Anaphora resolution system using the Hobbs’ algorithm to get the correct expression of information. As the actions of chatbot replying algorithms, the TF-IDF and Cosine Similarity are developed to find out the accurate answer from the documents. In this study, we introduce a Bengali Language Toolkit (BLTK) and Bengali Language Expression (BRE) that make the easiest implication of our task. We have also developed Bengali root word’s corpus, synonym word’s corpus, stop word’s corpus and gathered 672 articles as questions and answers form the popular Bengali newspapers ‘The Daily Prothom Alo’ is our inserted information. For testing this system, we have created 19334 questions from the introduced information and got 97.22% accurate answer by proposed BIC.","PeriodicalId":368690,"journal":{"name":"2019 International Conference on Computer, Communication, Chemical, Materials and Electronic Engineering (IC4ME2)","volume":"222 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 International Conference on Computer, Communication, Chemical, Materials and Electronic Engineering (IC4ME2)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IC4ME247184.2019.9036585","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

Abstract

Bengali Informative Chatbot (BIC) is an effective technique that helps a user to trace relevant information by Natural Language Processing (NLP). In this research paper, we introduce an algorithmic Bengali Informative Chatbot (BIC) based on information that is significant mathematically and statistically. This paper is demonstrated by two algorithms for finding out the lemmatization of Bengali words such as Trie and Dictionary Based Search by Removing Affix (DBSRA) as well as compared with Edit Distance for the exact lemmatization. We present the Bengali Anaphora resolution system using the Hobbs’ algorithm to get the correct expression of information. As the actions of chatbot replying algorithms, the TF-IDF and Cosine Similarity are developed to find out the accurate answer from the documents. In this study, we introduce a Bengali Language Toolkit (BLTK) and Bengali Language Expression (BRE) that make the easiest implication of our task. We have also developed Bengali root word’s corpus, synonym word’s corpus, stop word’s corpus and gathered 672 articles as questions and answers form the popular Bengali newspapers ‘The Daily Prothom Alo’ is our inserted information. For testing this system, we have created 19334 questions from the introduced information and got 97.22% accurate answer by proposed BIC.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
孟加拉语信息聊天机器人
孟加拉语信息聊天机器人(BIC)是一种利用自然语言处理(NLP)帮助用户追踪相关信息的有效技术。在本文中,我们介绍了一种基于数学和统计意义的孟加拉语信息聊天机器人(BIC)算法。本文介绍了三列法(Trie)和基于字典的去词缀搜索法(DBSRA)这两种孟加拉语词的词序查找算法,并与编辑距离法(Edit Distance)进行了比较。本文提出了一种利用Hobbs算法求解孟加拉语回指的系统,以获得正确的信息表达。作为聊天机器人应答算法的动作,开发了TF-IDF和余弦相似度,从文档中找到准确的答案。在这项研究中,我们引入了一个孟加拉语工具包(BLTK)和孟加拉语表达(BRE),使我们的任务最简单的含义。我们还开发了孟加拉语词根词语料库、近义词语料库、停顿词语料库,并从孟加拉语流行报纸中收集了672篇文章作为问答,《每日问答》是我们的插入信息。在系统的测试中,我们从引入的信息中创建了19334个问题,并得到了97.22%的正确率。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Application of Si-NPs Extracted from the Padma River Sand of Rajshahi in Photovoltaic Cells Misadjustment Measurement with Normalized Weighted Noise Covariance based LMS Algorithm Design and Implementation of a Hospital Based Modern Healthcare Monitoring System on Android Platform Design and Simulation of PV Based Harmonic Compensator for Three Phase load Study of nonradiative recombination centers in GaAs:N δ-doped superlattices structures revealed by below-gap excitation light
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1