{"title":"NLP关系查询及其应用","authors":"Andrei Stoica, K. Pu, Heidar Davoudi","doi":"10.1109/IRI49571.2020.00064","DOIUrl":null,"url":null,"abstract":"Recent advances in natural language processing have shown the effectiveness of statistical and neural networkbased algorithms in a deep understanding of textual data. We demonstrate that the result of NLP analysis on text documents can enrich relational data in a way so that structured queries can be used to derive further value from text data. In this paper, we present how we can perform analytics on a scientific research dataset based on both the relational data and NLP topic modeling. The integrated NLP features together with the classical relational query constructs allow one to explore the topic structure of the DBLP dataset with flexibility and precision.","PeriodicalId":93159,"journal":{"name":"2020 IEEE 21st International Conference on Information Reuse and Integration for Data Science : IRI 2020 : proceedings : virtual conference, 11-13 August 2020. IEEE International Conference on Information Reuse and Integration (21st : 2...","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2020-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"NLP Relational Queries and Its Application\",\"authors\":\"Andrei Stoica, K. Pu, Heidar Davoudi\",\"doi\":\"10.1109/IRI49571.2020.00064\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Recent advances in natural language processing have shown the effectiveness of statistical and neural networkbased algorithms in a deep understanding of textual data. We demonstrate that the result of NLP analysis on text documents can enrich relational data in a way so that structured queries can be used to derive further value from text data. In this paper, we present how we can perform analytics on a scientific research dataset based on both the relational data and NLP topic modeling. The integrated NLP features together with the classical relational query constructs allow one to explore the topic structure of the DBLP dataset with flexibility and precision.\",\"PeriodicalId\":93159,\"journal\":{\"name\":\"2020 IEEE 21st International Conference on Information Reuse and Integration for Data Science : IRI 2020 : proceedings : virtual conference, 11-13 August 2020. IEEE International Conference on Information Reuse and Integration (21st : 2...\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-08-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 IEEE 21st International Conference on Information Reuse and Integration for Data Science : IRI 2020 : proceedings : virtual conference, 11-13 August 2020. IEEE International Conference on Information Reuse and Integration (21st : 2...\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/IRI49571.2020.00064\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 IEEE 21st International Conference on Information Reuse and Integration for Data Science : IRI 2020 : proceedings : virtual conference, 11-13 August 2020. IEEE International Conference on Information Reuse and Integration (21st : 2...","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IRI49571.2020.00064","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Recent advances in natural language processing have shown the effectiveness of statistical and neural networkbased algorithms in a deep understanding of textual data. We demonstrate that the result of NLP analysis on text documents can enrich relational data in a way so that structured queries can be used to derive further value from text data. In this paper, we present how we can perform analytics on a scientific research dataset based on both the relational data and NLP topic modeling. The integrated NLP features together with the classical relational query constructs allow one to explore the topic structure of the DBLP dataset with flexibility and precision.