Authors: J. Nhacuongue, M. Dutra
Journal: Informacao & Sociedade-Estudos
Published: 2020-05-26
DOI: https://doi.org/10.22478/ufpb.1809-4783.2020v30n2.50756
Terminology in Information Retrieval Systems Based on WordNet.PT
This article results from post-doctoral research conducted at the Universidade Federal de Santa Catarina. Its goal is to propose information retrieval strategies, based on natural language processing, that extract semantic relations from WordNet.Pt and use them to represent documents and users’ search expressions. The approach is qualitative and exploratory, applied to ambiguity problems in information retrieval; procedurally, it is a bibliographic study. The discussion is motivated by the problem of low precision and high recall in user searches, which stems both from the lack of semantic correspondence between search expressions and indexing terms and from the failure to determine the semantic similarity between document terms that, although lexically different, have the same meaning. The research is justified by the advantage of developing systems that combine natural language and controlled language for interactive search. Although partial, the results point to important progress in resolving lexical ambiguity through semantic relations in the representation of documents and user searches. On the one hand, this restricts the search space and consequently improves precision. On the other hand, it supports query expansion by suggesting equivalent terms from controlled vocabularies and from natural language and its variants.
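The query expansion idea mentioned in the abstract can be illustrated with a minimal sketch. It does not reproduce the authors' method or any WordNet.Pt interface: the synonym sets in `TOY_SYNSETS` are a hypothetical hand-made stand-in for synsets, and the helper names (`build_index`, `expand_query`, `matches`) are assumptions introduced only for this illustration.

```python
from typing import Dict, List, Set

# Hypothetical toy synonym sets standing in for synsets from a lexical
# database; they are NOT taken from WordNet.Pt.
TOY_SYNSETS: List[Set[str]] = [
    {"car", "automobile", "auto"},
    {"physician", "doctor"},
]


def build_index(synsets: List[Set[str]]) -> Dict[str, Set[str]]:
    """Map every term to the union of the synonym sets it belongs to."""
    index: Dict[str, Set[str]] = {}
    for synset in synsets:
        for term in synset:
            index.setdefault(term, set()).update(synset)
    return index


def expand_query(query: str, index: Dict[str, Set[str]]) -> List[List[str]]:
    """Replace each query term with a sorted group of equivalent terms.

    Terms missing from the index are kept as single-term groups.
    """
    return [sorted(index.get(term, {term})) for term in query.lower().split()]


def matches(document: str, expanded: List[List[str]]) -> bool:
    """A document matches when it contains at least one term of every group."""
    tokens = set(document.lower().split())
    return all(tokens & set(group) for group in expanded)


index = build_index(TOY_SYNSETS)
expanded = expand_query("car physician", index)
print(expanded)
print(matches("the doctor bought an automobile", expanded))
```

In this sketch the document is retrieved even though it shares no literal term with the query, which is the lexically different but semantically equivalent case the abstract describes. Requiring one hit per group keeps the match conjunctive, so expansion widens each term without dropping any concept from the query.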