Supaporn Tantanasiriwong, S. Guha, P. Janecek, C. Haruechaiyasak, L. Azzopardi
{"title":"基于混合主题模型和共被引选择的跨领域引文推荐","authors":"Supaporn Tantanasiriwong, S. Guha, P. Janecek, C. Haruechaiyasak, L. Azzopardi","doi":"10.1504/IJDMMM.2017.086566","DOIUrl":null,"url":null,"abstract":"Cross-domain recommendations are of growing importance in the research community. An application of particular interest is to recommend a set of relevant research papers as citations for a given patent. This paper proposes an approach for cross-domain citation recommendation based on the hybrid topic model and co-citation selection. Using the topic model, relevant terms from documents could be clustered into the same topics. In addition, the co-citation selection technique will help select citations based on a set of highly similar patents. To evaluate the performance, we compared our proposed approach with the traditional baseline approaches using a corpus of patents collected for different technological fields of biotechnology, environmental technology, medical technology and nanotechnology. Experimental results show our cross domain citation recommendation yields a higher performance in predicting relevant publication citations than all baseline approaches.","PeriodicalId":43061,"journal":{"name":"International Journal of Data Mining Modelling and Management","volume":"37 1","pages":"220-236"},"PeriodicalIF":0.4000,"publicationDate":"2017-09-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Cross-domain citation recommendation based on hybrid topic model and co-citation selection citation selection\",\"authors\":\"Supaporn Tantanasiriwong, S. Guha, P. Janecek, C. Haruechaiyasak, L. Azzopardi\",\"doi\":\"10.1504/IJDMMM.2017.086566\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Cross-domain recommendations are of growing importance in the research community. An application of particular interest is to recommend a set of relevant research papers as citations for a given patent. This paper proposes an approach for cross-domain citation recommendation based on the hybrid topic model and co-citation selection. Using the topic model, relevant terms from documents could be clustered into the same topics. In addition, the co-citation selection technique will help select citations based on a set of highly similar patents. To evaluate the performance, we compared our proposed approach with the traditional baseline approaches using a corpus of patents collected for different technological fields of biotechnology, environmental technology, medical technology and nanotechnology. Experimental results show our cross domain citation recommendation yields a higher performance in predicting relevant publication citations than all baseline approaches.\",\"PeriodicalId\":43061,\"journal\":{\"name\":\"International Journal of Data Mining Modelling and Management\",\"volume\":\"37 1\",\"pages\":\"220-236\"},\"PeriodicalIF\":0.4000,\"publicationDate\":\"2017-09-13\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International Journal of Data Mining Modelling and Management\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1504/IJDMMM.2017.086566\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q4\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Data Mining Modelling and Management","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1504/IJDMMM.2017.086566","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
Cross-domain citation recommendation based on hybrid topic model and co-citation selection citation selection
Cross-domain recommendations are of growing importance in the research community. An application of particular interest is to recommend a set of relevant research papers as citations for a given patent. This paper proposes an approach for cross-domain citation recommendation based on the hybrid topic model and co-citation selection. Using the topic model, relevant terms from documents could be clustered into the same topics. In addition, the co-citation selection technique will help select citations based on a set of highly similar patents. To evaluate the performance, we compared our proposed approach with the traditional baseline approaches using a corpus of patents collected for different technological fields of biotechnology, environmental technology, medical technology and nanotechnology. Experimental results show our cross domain citation recommendation yields a higher performance in predicting relevant publication citations than all baseline approaches.
期刊介绍:
Facilitating transformation from data to information to knowledge is paramount for organisations. Companies are flooded with data and conflicting information, but with limited real usable knowledge. However, rarely should a process be looked at from limited angles or in parts. Isolated islands of data mining, modelling and management (DMMM) should be connected. IJDMMM highlightes integration of DMMM, statistics/machine learning/databases, each element of data chain management, types of information, algorithms in software; from data pre-processing to post-processing; between theory and applications. Topics covered include: -Artificial intelligence- Biomedical science- Business analytics/intelligence, process modelling- Computer science, database management systems- Data management, mining, modelling, warehousing- Engineering- Environmental science, environment (ecoinformatics)- Information systems/technology, telecommunications/networking- Management science, operations research, mathematics/statistics- Social sciences- Business/economics, (computational) finance- Healthcare, medicine, pharmaceuticals- (Computational) chemistry, biology (bioinformatics)- Sustainable mobility systems, intelligent transportation systems- National security