{"title":"基于局部上下文和分布模型的文本信息检索中的查询扩展","authors":"Fabiano Tavares da Silva, J. Maia","doi":"10.6025/jdim/2019/17/6/313-320","DOIUrl":null,"url":null,"abstract":"The Semantic Distributional Model is based on the frequency of contexts of use of language terms in large open corpus such as the web, to establish similarity or the relationship between words. These relationships or similarities can be used to add terms when expanding queries. The idea explored in this paper is that, for queries in closed collections of text documents, a posterior filter based on the restricted vocabulary of the collection can improve the effectiveness of automatic query expansion. This idea is developed and evaluated in publicly available benchmarks presenting promising results. Subject Categories and Descriptors: [H.3.3 Information Search and Retrieval]; Query formulation: [I.2.7 Natural Language Processing] Text analysis [F.4.2 Grammars and Other Rewriting Systems]; Grammar types General Terms: Distributional Semantic Model, Information Retrieval, Local Context Analysis.","PeriodicalId":303976,"journal":{"name":"J. Digit. Inf. Manag.","volume":"17 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"11","resultStr":"{\"title\":\"Query Expansion in Text Information Retrieval with Local Context and Distributional Model\",\"authors\":\"Fabiano Tavares da Silva, J. Maia\",\"doi\":\"10.6025/jdim/2019/17/6/313-320\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The Semantic Distributional Model is based on the frequency of contexts of use of language terms in large open corpus such as the web, to establish similarity or the relationship between words. These relationships or similarities can be used to add terms when expanding queries. The idea explored in this paper is that, for queries in closed collections of text documents, a posterior filter based on the restricted vocabulary of the collection can improve the effectiveness of automatic query expansion. This idea is developed and evaluated in publicly available benchmarks presenting promising results. Subject Categories and Descriptors: [H.3.3 Information Search and Retrieval]; Query formulation: [I.2.7 Natural Language Processing] Text analysis [F.4.2 Grammars and Other Rewriting Systems]; Grammar types General Terms: Distributional Semantic Model, Information Retrieval, Local Context Analysis.\",\"PeriodicalId\":303976,\"journal\":{\"name\":\"J. Digit. Inf. Manag.\",\"volume\":\"17 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-12-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"11\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"J. Digit. Inf. Manag.\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.6025/jdim/2019/17/6/313-320\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"J. Digit. Inf. Manag.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.6025/jdim/2019/17/6/313-320","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Query Expansion in Text Information Retrieval with Local Context and Distributional Model
The Semantic Distributional Model is based on the frequency of contexts of use of language terms in large open corpus such as the web, to establish similarity or the relationship between words. These relationships or similarities can be used to add terms when expanding queries. The idea explored in this paper is that, for queries in closed collections of text documents, a posterior filter based on the restricted vocabulary of the collection can improve the effectiveness of automatic query expansion. This idea is developed and evaluated in publicly available benchmarks presenting promising results. Subject Categories and Descriptors: [H.3.3 Information Search and Retrieval]; Query formulation: [I.2.7 Natural Language Processing] Text analysis [F.4.2 Grammars and Other Rewriting Systems]; Grammar types General Terms: Distributional Semantic Model, Information Retrieval, Local Context Analysis.