SEMANTIC RETRIEVAL FOR INDONESIAN QURAN AUTOCOMPLETION

IF 0.9 Q4 COMPUTER SCIENCE, INFORMATION SYSTEMS Jordanian Journal of Computers and Information Technology Pub Date : 1900-01-01 DOI:10.5455/jjcit.71-1668279800
R. Rajagede, Kholid Haryono, Rizan Qardafil
{"title":"SEMANTIC RETRIEVAL FOR INDONESIAN QURAN AUTOCOMPLETION","authors":"R. Rajagede, Kholid Haryono, Rizan Qardafil","doi":"10.5455/jjcit.71-1668279800","DOIUrl":null,"url":null,"abstract":"Attending lectures is a common way to learn Islamic knowledge. The speaker talks in front of the forum, and participants take notes on the lecture material. Many participants listen to the lecture while taking notes either in books or on other digital devices to avoid forgetting the discussed topics. However, note-taking during the lecture can be challenging, with no complementing module from the speaker. Lecturers have different paces and varying ways of delivering. In addition, sometimes, participants cannot always focus during the lecture. Those factors can cause problems in the note-taking process: some details can be lost or even shift the meaning. For note-taking on sensitive topics, such as verses from the Quran, the note-taking process must be done carefully and avoid mistakes. In this study, we proposed an autocomplete system for the Indonesian translation of the Quran that will help the user in note-taking Islamic lectures. The user writes down words, the parts of the Quran verse that he hears, and the system will retrieve the most similar verse. With semantic retrieval, the user does not need to write down the exact words of the verses he heard. The system can also handle typographical-error that usually occur in note-taking. We use Fasttext and calculate the cosine distance between the query and verses for the retrieval process. We also performed several optimization steps to create a robust system for the production stage. The system is evaluated by comparing how close the returned verse is with the ground truth. The proposed method's result accuracy reached 79.41% for the top 5 retrieved verse and 85.29% for the top 10 retrieved verse.","PeriodicalId":36757,"journal":{"name":"Jordanian Journal of Computers and Information Technology","volume":null,"pages":null},"PeriodicalIF":0.9000,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Jordanian Journal of Computers and Information Technology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.5455/jjcit.71-1668279800","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0

Abstract

Attending lectures is a common way to learn Islamic knowledge. The speaker talks in front of the forum, and participants take notes on the lecture material. Many participants listen to the lecture while taking notes either in books or on other digital devices to avoid forgetting the discussed topics. However, note-taking during the lecture can be challenging, with no complementing module from the speaker. Lecturers have different paces and varying ways of delivering. In addition, sometimes, participants cannot always focus during the lecture. Those factors can cause problems in the note-taking process: some details can be lost or even shift the meaning. For note-taking on sensitive topics, such as verses from the Quran, the note-taking process must be done carefully and avoid mistakes. In this study, we proposed an autocomplete system for the Indonesian translation of the Quran that will help the user in note-taking Islamic lectures. The user writes down words, the parts of the Quran verse that he hears, and the system will retrieve the most similar verse. With semantic retrieval, the user does not need to write down the exact words of the verses he heard. The system can also handle typographical-error that usually occur in note-taking. We use Fasttext and calculate the cosine distance between the query and verses for the retrieval process. We also performed several optimization steps to create a robust system for the production stage. The system is evaluated by comparing how close the returned verse is with the ground truth. The proposed method's result accuracy reached 79.41% for the top 5 retrieved verse and 85.29% for the top 10 retrieved verse.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
印尼语古兰经自动补全的语义检索
参加讲座是学习伊斯兰知识的一种常见方式。演讲者在讲坛前演讲,参与者在演讲材料上做笔记。许多参与者一边听讲座,一边在书本上或其他电子设备上做笔记,以避免忘记讨论的主题。然而,在讲座期间做笔记可能是具有挑战性的,因为演讲者没有补充的模块。讲师有不同的节奏和不同的交付方式。此外,有时参与者在讲课过程中不能始终集中注意力。这些因素可能会在记笔记的过程中造成问题:一些细节可能会丢失,甚至会改变意思。对于敏感话题的笔记,比如《古兰经》的经文,笔记的过程必须仔细完成,避免错误。在这项研究中,我们提出了一个自动完成系统的古兰经印尼翻译,将有助于用户记笔记的伊斯兰讲座。用户写下单词,写下他听到的《古兰经》经文的部分,系统就会检索出最相似的经文。使用语义检索,用户不需要写下他听到的诗句的确切单词。该系统还可以处理笔记中经常出现的印刷错误。在检索过程中,我们使用Fasttext计算查询和段落之间的余弦距离。我们还执行了几个优化步骤,为生产阶段创建了一个健壮的系统。通过比较返回的诗句与基础真理的接近程度来评估该系统。该方法对检索前5位的结果准确率达到79.41%,对检索前10位的结果准确率达到85.29%。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Jordanian Journal of Computers and Information Technology
Jordanian Journal of Computers and Information Technology Computer Science-Computer Science (all)
CiteScore
3.10
自引率
25.00%
发文量
19
期刊最新文献
OPTIMAL ENERGY CONSUMPTION AND COST PERFORMANCE SOLUTION WITH DELAY CONSTRAINTS ON FOG COMPUTING ORTHOGONAL REGRESSED STEEPEST DESCENT DEEP PERCEPTIVE NEURAL LEARNING FOR IoT- AWARE SECURED BIG DATA COMMUNICATION AUTOMATIC DETECTION OF PNEUMONIA USING CONCATENATED CONVOLUTIONAL NEURAL NETWORK DESIGN OF A COMPACT BROADBAND ANTENNA USING CHARACTERISTIC MODE ANALYSIS FOR MICROWAVE APPLICATIONS Effectiveness of zero-shot models in automatic Arabic Poem generation
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1