{"title":"通用句子编码器在员工数据语义搜索中的应用","authors":"Divyam Sheth, A. R. Gupta, L. D'mello","doi":"10.1109/iccica52458.2021.9697114","DOIUrl":null,"url":null,"abstract":"This paper describes an application that performs a semantic search on an employee database. It helps Human Resources employees to target relevant people for their events and trainings. Syntactic or lexical searching involves keyword matching but does not match synonyms and other contextually related data. By using regular keyword search, a document either contains the given word or not, and there is no middle ground. Semantic Search allows the matching of data contextually linked with the search term. High dimensional vectors, also known as embeddings, are generated for a complete sentence and are then used for searching. Under the hood, Google’s Universal Sentence Encoder. The Universal Sentence Encoder encodes text into high dimensional vectors that can be used for text classification, semantic similarity, clustering, and other natural language tasks to provide better performance of the model as compared to a custom trained Convolution Neural Network which also requires more training data.","PeriodicalId":327193,"journal":{"name":"2021 International Conference on Computational Intelligence and Computing Applications (ICCICA)","volume":"10 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-11-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Using Universal Sentence Encoder for Semantic Search of Employee Data\",\"authors\":\"Divyam Sheth, A. R. Gupta, L. D'mello\",\"doi\":\"10.1109/iccica52458.2021.9697114\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper describes an application that performs a semantic search on an employee database. It helps Human Resources employees to target relevant people for their events and trainings. Syntactic or lexical searching involves keyword matching but does not match synonyms and other contextually related data. By using regular keyword search, a document either contains the given word or not, and there is no middle ground. Semantic Search allows the matching of data contextually linked with the search term. High dimensional vectors, also known as embeddings, are generated for a complete sentence and are then used for searching. Under the hood, Google’s Universal Sentence Encoder. The Universal Sentence Encoder encodes text into high dimensional vectors that can be used for text classification, semantic similarity, clustering, and other natural language tasks to provide better performance of the model as compared to a custom trained Convolution Neural Network which also requires more training data.\",\"PeriodicalId\":327193,\"journal\":{\"name\":\"2021 International Conference on Computational Intelligence and Computing Applications (ICCICA)\",\"volume\":\"10 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-11-26\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 International Conference on Computational Intelligence and Computing Applications (ICCICA)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/iccica52458.2021.9697114\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 International Conference on Computational Intelligence and Computing Applications (ICCICA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/iccica52458.2021.9697114","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Using Universal Sentence Encoder for Semantic Search of Employee Data
This paper describes an application that performs a semantic search on an employee database. It helps Human Resources employees to target relevant people for their events and trainings. Syntactic or lexical searching involves keyword matching but does not match synonyms and other contextually related data. By using regular keyword search, a document either contains the given word or not, and there is no middle ground. Semantic Search allows the matching of data contextually linked with the search term. High dimensional vectors, also known as embeddings, are generated for a complete sentence and are then used for searching. Under the hood, Google’s Universal Sentence Encoder. The Universal Sentence Encoder encodes text into high dimensional vectors that can be used for text classification, semantic similarity, clustering, and other natural language tasks to provide better performance of the model as compared to a custom trained Convolution Neural Network which also requires more training data.