{"title":"改进基于英语-印地语的跨语言信息检索性能","authors":"Saurabh Varshney, Jyoti Bajpai","doi":"10.1109/MITE.2013.6756354","DOIUrl":null,"url":null,"abstract":"The hurdle problem in Cross Language Information Retrieval (CLIR) is the poor performance when compared to monolingual performance in terms of average precision. The main reasons behind the poor performance of CLIR are query term mismatching, multiple representations of query terms and un-translated query terms. In this paper, we are putting our effort to solve the given problem which is discussed in detail. The limitations are needed to be addressed in order to increase the performance of the CLIR system. By analyzing those methods the architecture for English-Hindi CLIR system is proposed. Pre and post query expansion is used to improve the performance of English-Hindi CLIR system using English and Hindi WordNet, Local Expansion using initial query, definition based pre query expansion and keyword ranking. The pre and post query expansion helps to improving the performance of English-Hindi CLIR system and based upon past experiences the proposed approach retrieves more relevant information. All experiments are performed on FIRE 2010 (Forum of Information Retrieval Evaluation) datasets. The experimental results show that the proposed approach gives equal/better performance of English-Hindi CLIR system compared to monolingual performance and also helps in overcoming existing problems and outperforms the existing English-Hindi CLIR system in terms of average precision.","PeriodicalId":284844,"journal":{"name":"2013 IEEE International Conference in MOOC, Innovation and Technology in Education (MITE)","volume":"22 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2013-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":"{\"title\":\"Improving Retrieval performance of English-Hindi based Cross-Language Information Retrieval\",\"authors\":\"Saurabh Varshney, Jyoti Bajpai\",\"doi\":\"10.1109/MITE.2013.6756354\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The hurdle problem in Cross Language Information Retrieval (CLIR) is the poor performance when compared to monolingual performance in terms of average precision. The main reasons behind the poor performance of CLIR are query term mismatching, multiple representations of query terms and un-translated query terms. In this paper, we are putting our effort to solve the given problem which is discussed in detail. The limitations are needed to be addressed in order to increase the performance of the CLIR system. By analyzing those methods the architecture for English-Hindi CLIR system is proposed. Pre and post query expansion is used to improve the performance of English-Hindi CLIR system using English and Hindi WordNet, Local Expansion using initial query, definition based pre query expansion and keyword ranking. The pre and post query expansion helps to improving the performance of English-Hindi CLIR system and based upon past experiences the proposed approach retrieves more relevant information. All experiments are performed on FIRE 2010 (Forum of Information Retrieval Evaluation) datasets. The experimental results show that the proposed approach gives equal/better performance of English-Hindi CLIR system compared to monolingual performance and also helps in overcoming existing problems and outperforms the existing English-Hindi CLIR system in terms of average precision.\",\"PeriodicalId\":284844,\"journal\":{\"name\":\"2013 IEEE International Conference in MOOC, Innovation and Technology in Education (MITE)\",\"volume\":\"22 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2013-12-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"5\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2013 IEEE International Conference in MOOC, Innovation and Technology in Education (MITE)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/MITE.2013.6756354\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2013 IEEE International Conference in MOOC, Innovation and Technology in Education (MITE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/MITE.2013.6756354","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Improving Retrieval performance of English-Hindi based Cross-Language Information Retrieval
The hurdle problem in Cross Language Information Retrieval (CLIR) is the poor performance when compared to monolingual performance in terms of average precision. The main reasons behind the poor performance of CLIR are query term mismatching, multiple representations of query terms and un-translated query terms. In this paper, we are putting our effort to solve the given problem which is discussed in detail. The limitations are needed to be addressed in order to increase the performance of the CLIR system. By analyzing those methods the architecture for English-Hindi CLIR system is proposed. Pre and post query expansion is used to improve the performance of English-Hindi CLIR system using English and Hindi WordNet, Local Expansion using initial query, definition based pre query expansion and keyword ranking. The pre and post query expansion helps to improving the performance of English-Hindi CLIR system and based upon past experiences the proposed approach retrieves more relevant information. All experiments are performed on FIRE 2010 (Forum of Information Retrieval Evaluation) datasets. The experimental results show that the proposed approach gives equal/better performance of English-Hindi CLIR system compared to monolingual performance and also helps in overcoming existing problems and outperforms the existing English-Hindi CLIR system in terms of average precision.