Learning to rank for determining relevant document in Indonesian-English cross language information retrieval using BM25

2014 International Conference on Advanced Computer Science and Information System Pub Date : 2014-10-01 DOI:10.1109/ICACSIS.2014.7065896

Syandra Sari, M. Adriani

引用次数: 11

Abstract

One important task in cross-language information retrieval (CLIR) is to determine the relevance of a document from a number of documents based on user query. In this paper we applied pointwise learning to rank in SVM (Support Vector Machine) to determine the relevance of a document and used BM25 (Best Match 25) ranking function for selecting words as features. We did the experiment in Indonesian-English CLIR The results show an average ability of SVM to identify relevant documents is 88.51%, while the average accuracy of SVM to identify non relevant documents is 88%.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

学习用BM25在印尼语-英语跨语言信息检索中排序确定相关文档

跨语言信息检索(CLIR)中的一项重要任务是根据用户查询从大量文档中确定文档的相关性。在本文中，我们应用点向学习对SVM(支持向量机)进行排序来确定文档的相关性，并使用BM25(最佳匹配25)排序函数来选择单词作为特征。我们在印尼语-英语CLIR中进行了实验，结果表明SVM识别相关文档的平均准确率为88.51%，而SVM识别非相关文档的平均准确率为88%。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

2014 International Conference on Advanced Computer Science and Information System

自引率

0.00%

发文量