Collaboration of Nazief & Adriani Stemming Algorithm with PostgreSQL Queries Parsing Method to Search for New Study Program Names

Indra Chaidir
{"title":"Collaboration of Nazief & Adriani Stemming Algorithm with PostgreSQL Queries Parsing Method to Search for New Study Program Names","authors":"Indra Chaidir","doi":"10.24114/cess.v8i2.48212","DOIUrl":null,"url":null,"abstract":"Penolakan usulan nama baru program studi vokasi pada Aplikasi Silemkerma di Direktorat Jenderal Pendidikan Tinggi Vokasi, Kementerian Pendidikan, Kebudayaan, Riset, dan Teknologi sering terjadi karena terdapat kemiripan nama program studi yang diusulkan dengan nama program studi yang sudah ada di dalam basis data. Banyak data tidak ditemukan karena filter data menggunakan metode konvensional dalam kasus ini menggunakan operator ILIKE dengan pola wildcard character % (percent), sedangkan data yang dicari tersedia di dalam basis data. Ini terjadi dikarenakan operator ILIKE tidak dapat membaca perubahan kata dari leksem/akar kata (root word) seperti \"pengelolaan\" dengan memiliki prefix dan suffix, dengan akar kata \"kelola\". Mengatasi permasalahan ini, penulis memanfaatkan Algoritma Nazief & Adriani untuk stemming agar mendapatkan leksem dari kalimat yang dimasukan. Hasil algoritma tersebut terus diolah menggunakan Metode Parsing Queries, salah satu metode Full Text Search yang ada pada basis data PostgresQL. Hasil penelitian ini dapat diimplementasikan pada Aplikasi tersebut.Rejection of new vocational study program name proposals in Silemkerma Application at the Directorate General of Vocational Higher Education, Ministry of Education, Culture, Research, and Technology often occurs because there is a similarity between the proposed study program name and the existing study program name in the database. Many data are not found because the data filter uses conventional methods in this case using the ILIKE operator with the wildcard character pattern % (percent), while the data sought is available in the database. This is because the ILIKE operator cannot read word changes from lexemes/root words such as \"pengelolaan\" which has a prefix and suffix, with the root word \"kelola\". Overcoming this problem, the author utilizes the Nazief & Adriani Algorithm for stemming in order to get lexemes from the sentences entered. The results of the algorithm are then processed using the Parsing Queries Method, one of the Full Text Search methods available in the PostgresQL database. The results of this research can be implemented in the application.","PeriodicalId":53361,"journal":{"name":"CESS Journal of Computer Engineering System and Science","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2023-07-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"CESS Journal of Computer Engineering System and Science","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.24114/cess.v8i2.48212","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Penolakan usulan nama baru program studi vokasi pada Aplikasi Silemkerma di Direktorat Jenderal Pendidikan Tinggi Vokasi, Kementerian Pendidikan, Kebudayaan, Riset, dan Teknologi sering terjadi karena terdapat kemiripan nama program studi yang diusulkan dengan nama program studi yang sudah ada di dalam basis data. Banyak data tidak ditemukan karena filter data menggunakan metode konvensional dalam kasus ini menggunakan operator ILIKE dengan pola wildcard character % (percent), sedangkan data yang dicari tersedia di dalam basis data. Ini terjadi dikarenakan operator ILIKE tidak dapat membaca perubahan kata dari leksem/akar kata (root word) seperti "pengelolaan" dengan memiliki prefix dan suffix, dengan akar kata "kelola". Mengatasi permasalahan ini, penulis memanfaatkan Algoritma Nazief & Adriani untuk stemming agar mendapatkan leksem dari kalimat yang dimasukan. Hasil algoritma tersebut terus diolah menggunakan Metode Parsing Queries, salah satu metode Full Text Search yang ada pada basis data PostgresQL. Hasil penelitian ini dapat diimplementasikan pada Aplikasi tersebut.Rejection of new vocational study program name proposals in Silemkerma Application at the Directorate General of Vocational Higher Education, Ministry of Education, Culture, Research, and Technology often occurs because there is a similarity between the proposed study program name and the existing study program name in the database. Many data are not found because the data filter uses conventional methods in this case using the ILIKE operator with the wildcard character pattern % (percent), while the data sought is available in the database. This is because the ILIKE operator cannot read word changes from lexemes/root words such as "pengelolaan" which has a prefix and suffix, with the root word "kelola". Overcoming this problem, the author utilizes the Nazief & Adriani Algorithm for stemming in order to get lexemes from the sentences entered. The results of the algorithm are then processed using the Parsing Queries Method, one of the Full Text Search methods available in the PostgresQL database. The results of this research can be implemented in the application.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Nazief和Adriani Stemming算法与PostgreSQL查询解析方法协作搜索新的研究项目名称
在vokasi高等教育总理事会、教育部、文化、研究和技术中,反对针对该项目的新名称的建议,往往是因为提议的研究项目的名称与数据库中已经存在的研究项目的名称相似。许多数据没有被发现,因为数据过滤器在这种情况下使用传统的方法来过滤数据,使用的是带有野猫模式% (percent)的ILIKE操作员,而检索的数据可以在数据库中找到。这是因为经营者无法通过prefix和suffix、词根“管理”等词根来读取单词的变化。为了解决这一问题,作者使用了亚得里亚尼算法来将单词压在插入句中。该算法继续使用PostgresQL数据库中的全文本搜索方法Parsing Queries进行处理。这项研究的结果可以在应用程序上实现。关于西勒姆克马高级教育指导、文化、研究和技术的新词汇学习计划的申请,因为在数据库中设立了一个类似的例子。许多数据尚未被发现,因为在这种情况下,数据过滤了有用的uses习惯方法,我喜欢使用野花模式的操作员,而sought数据在数据库中是可行的。这是因为我喜欢的操作员能够从lexemes/root words中读取单词changes,就像管理哪些prefix和suffix一样,带有根词“管理”。克服这些问题,author实用主义和Adriani算法扎根,以便从内部获得词汇。该算法的结果是使用Parsing Queries Method,在PostgresQL数据库中完整的文本搜索方法之一是可行的。这项研究的结果可以在应用中实现。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
40
审稿时长
4 weeks
期刊最新文献
Implementation of the Multimedia Development Life Cycle in Making Educational Games About Indonesia Data Mining Algorithm Decision Tree Itterative Dechotomiser 3 (ID3) for Classification of Stroke Implementation of Weight Aggregated Sum Product Assessment (WASPAS) on the Selection of Online English Course Platforms Usability of Brain Tumor Detection Using the DNN (Deep Neural Network) Method Based on Medical Image on DICOM Performance Comparison Analysis of Multi Prime RSA and Multi Power RSA
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1