Chia-Yang Chang, Yan-Ting Lin, Shie-Jue Lee, Chih-Chin Lai
{"title":"Information Retrieval Based on Word Semantic Clustering","authors":"Chia-Yang Chang, Yan-Ting Lin, Shie-Jue Lee, Chih-Chin Lai","doi":"10.1109/CISP-BMEI.2018.8633017","DOIUrl":null,"url":null,"abstract":"Information retrieval is an important topic in the modern age. With the advance of Internet, it is more and more easy to retrieve other people's writings or publications. However, how to retrieve desirable information efficiently is a challenging work. Traditional methods like vector space model or bag-of-words are short of providing a good solution due to the incapability of handling the semantics of words satisfactorily. In this paper, we propose a new method for information retrieval. We use Word2vec to transform the words into word vectors which are able to represent the semantic relationship among different words. By considering the semantic of words and clustering the word vectors into concepts, information retrieval can be done effectively.","PeriodicalId":117227,"journal":{"name":"2018 11th International Congress on Image and Signal Processing, BioMedical Engineering and Informatics (CISP-BMEI)","volume":"83 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 11th International Congress on Image and Signal Processing, BioMedical Engineering and Informatics (CISP-BMEI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CISP-BMEI.2018.8633017","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Information retrieval is an important topic in the modern age. With the advance of Internet, it is more and more easy to retrieve other people's writings or publications. However, how to retrieve desirable information efficiently is a challenging work. Traditional methods like vector space model or bag-of-words are short of providing a good solution due to the incapability of handling the semantics of words satisfactorily. In this paper, we propose a new method for information retrieval. We use Word2vec to transform the words into word vectors which are able to represent the semantic relationship among different words. By considering the semantic of words and clustering the word vectors into concepts, information retrieval can be done effectively.