基于自然语言处理和信息检索技术的题库相似度搜索系统(QB3S

Md. Raihan Mia, A. S. M. Latiful Hoque
{"title":"基于自然语言处理和信息检索技术的题库相似度搜索系统(QB3S","authors":"Md. Raihan Mia, A. S. M. Latiful Hoque","doi":"10.1109/ICASERT.2019.8934449","DOIUrl":null,"url":null,"abstract":"Problem Based e-learning(PBeL) in bangla language is one of the most progressing areas of the use of ICT in education. Question Bank(QB) is the main component of any PBeL system. Searching similarity in the complex structure of QB is a challenging task in the development of PBeL system. We have been developed an efficient Question Bank Similarity Searching System(QB3S) to find similar questions, handle duplicate question and rank search result of a query input based on NLP and Information Retrieval techniques. QB3S has four modules: bangla documents processing, question structure analysis and clustered indexing by B+ tree , word-net construction and Information retrieval module. Lexical analysis, stemming by finite automata rules and stopwords removing have been used for bangla document processing. The most challenging procedures of QB3S were Analyzing the structure of data for clustered indexing in the sorted sequential file of the QB database with a B+ tree data structure and improved TF-IDF algorithm with weighted functionality. A Word-net has been used for handling synonyms. Vector Space Model(VSM) has been designed from the value of TF-IDF weighted matrix. By using cosine similarity product rule, we have been Calculated the similarity value between the query input and all mcq of DB from VSM. QB3S has been evaluated in some experimental dataset to find results by imposing different test cases. The accuracy of searching performance which has found to be satisfactory.","PeriodicalId":6613,"journal":{"name":"2019 1st International Conference on Advances in Science, Engineering and Robotics Technology (ICASERT)","volume":"25 1","pages":"1-7"},"PeriodicalIF":0.0000,"publicationDate":"2019-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"Question Bank Similarity Searching System (QB3S) Using NLP and Information Retrieval Technique\",\"authors\":\"Md. Raihan Mia, A. S. M. Latiful Hoque\",\"doi\":\"10.1109/ICASERT.2019.8934449\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Problem Based e-learning(PBeL) in bangla language is one of the most progressing areas of the use of ICT in education. Question Bank(QB) is the main component of any PBeL system. Searching similarity in the complex structure of QB is a challenging task in the development of PBeL system. We have been developed an efficient Question Bank Similarity Searching System(QB3S) to find similar questions, handle duplicate question and rank search result of a query input based on NLP and Information Retrieval techniques. QB3S has four modules: bangla documents processing, question structure analysis and clustered indexing by B+ tree , word-net construction and Information retrieval module. Lexical analysis, stemming by finite automata rules and stopwords removing have been used for bangla document processing. The most challenging procedures of QB3S were Analyzing the structure of data for clustered indexing in the sorted sequential file of the QB database with a B+ tree data structure and improved TF-IDF algorithm with weighted functionality. A Word-net has been used for handling synonyms. Vector Space Model(VSM) has been designed from the value of TF-IDF weighted matrix. By using cosine similarity product rule, we have been Calculated the similarity value between the query input and all mcq of DB from VSM. QB3S has been evaluated in some experimental dataset to find results by imposing different test cases. The accuracy of searching performance which has found to be satisfactory.\",\"PeriodicalId\":6613,\"journal\":{\"name\":\"2019 1st International Conference on Advances in Science, Engineering and Robotics Technology (ICASERT)\",\"volume\":\"25 1\",\"pages\":\"1-7\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-05-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2019 1st International Conference on Advances in Science, Engineering and Robotics Technology (ICASERT)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICASERT.2019.8934449\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 1st International Conference on Advances in Science, Engineering and Robotics Technology (ICASERT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICASERT.2019.8934449","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3

摘要

孟加拉语的基于问题的电子学习(PBeL)是在教育中使用信息通信技术的最进步的领域之一。题库(QB)是任何PBeL系统的主要组成部分。在QB的复杂结构中寻找相似度是PBeL系统开发中的一个具有挑战性的任务。基于自然语言处理和信息检索技术,我们开发了一个高效的题库相似度搜索系统(QB3S),用于查找相似问题、处理重复问题和对查询输入的搜索结果进行排序。QB3S有四个模块:孟加拉文文档处理、问题结构分析和B+树聚类索引、词网构建和信息检索模块。词法分析、有限自动机规则词干提取和停止词删除已被用于孟加拉语文档处理。QB3S最具挑战性的过程是利用B+树数据结构和带加权功能的改进TF-IDF算法对QB数据库的排序顺序文件进行数据结构分析并进行聚类索引。Word-net已被用于处理同义词。从TF-IDF加权矩阵的值出发,设计向量空间模型(VSM)。利用余弦相似积法则,计算出查询输入与VSM中DB的所有mcq之间的相似值。在一些实验数据集中对QB3S进行了评估,通过施加不同的测试用例来寻找结果。结果表明,该算法的搜索精度令人满意。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Question Bank Similarity Searching System (QB3S) Using NLP and Information Retrieval Technique
Problem Based e-learning(PBeL) in bangla language is one of the most progressing areas of the use of ICT in education. Question Bank(QB) is the main component of any PBeL system. Searching similarity in the complex structure of QB is a challenging task in the development of PBeL system. We have been developed an efficient Question Bank Similarity Searching System(QB3S) to find similar questions, handle duplicate question and rank search result of a query input based on NLP and Information Retrieval techniques. QB3S has four modules: bangla documents processing, question structure analysis and clustered indexing by B+ tree , word-net construction and Information retrieval module. Lexical analysis, stemming by finite automata rules and stopwords removing have been used for bangla document processing. The most challenging procedures of QB3S were Analyzing the structure of data for clustered indexing in the sorted sequential file of the QB database with a B+ tree data structure and improved TF-IDF algorithm with weighted functionality. A Word-net has been used for handling synonyms. Vector Space Model(VSM) has been designed from the value of TF-IDF weighted matrix. By using cosine similarity product rule, we have been Calculated the similarity value between the query input and all mcq of DB from VSM. QB3S has been evaluated in some experimental dataset to find results by imposing different test cases. The accuracy of searching performance which has found to be satisfactory.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Thickness Dependency of Zinc Selenide (ZnSe) Thin Film Deposited By Vacuum Evaporation Method Comparative Study of Enhancing Stability of Wind Farm attached to the Grid by PID Controller based STATCOM and Capacitor Bank Performance Analysis of a High Power Quality Single Phase AC-DC Buck Boost Converter RoboFI: Autonomous Path Follower Robot for Human Body Detection and Geolocalization for Search and Rescue Missions using Computer Vision and IoT Electrical Properties of CSS Deposited CdTe Thin Films for Solar Cell Applications
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1