{"title":"Enhancing link prediction through node embedding and ensemble learning","authors":"Zhongyuan Chen, Yongji Wang","doi":"10.1007/s10115-024-02203-6","DOIUrl":null,"url":null,"abstract":"<p>Social networks, characterized by their dynamic and continually evolving nature, present challenges for effective link prediction (LP) due to the constant addition of nodes and connections. In response to this, we propose a novel approach to LP in social networks through Node Embedding and Ensemble Learning (LP-NEEL). Our method constructs a transition matrix from the network’s adjacency matrix and computes similarity measures between node pairs. Utilizing node2vec embedding, we extract features from nodes and generate edge embeddings by computing the inner product of node embeddings for each edge. This process yields a well-labeled dataset suitable for LP tasks. To mitigate overfitting, we balance the dataset by ensuring an equal number of negative and positive samples edge samples during both the testing and training phases. Leveraging this balanced dataset, we employ the XGBoost machine learning algorithm for final link prediction. Extensive experimentation across six social network datasets validates the efficacy of our approach, demonstrating improved predictive performance compared to existing methods.</p>","PeriodicalId":54749,"journal":{"name":"Knowledge and Information Systems","volume":"9 1","pages":""},"PeriodicalIF":2.5000,"publicationDate":"2024-08-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Knowledge and Information Systems","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1007/s10115-024-02203-6","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Social networks, characterized by their dynamic and continually evolving nature, present challenges for effective link prediction (LP) due to the constant addition of nodes and connections. In response to this, we propose a novel approach to LP in social networks through Node Embedding and Ensemble Learning (LP-NEEL). Our method constructs a transition matrix from the network’s adjacency matrix and computes similarity measures between node pairs. Utilizing node2vec embedding, we extract features from nodes and generate edge embeddings by computing the inner product of node embeddings for each edge. This process yields a well-labeled dataset suitable for LP tasks. To mitigate overfitting, we balance the dataset by ensuring an equal number of negative and positive samples edge samples during both the testing and training phases. Leveraging this balanced dataset, we employ the XGBoost machine learning algorithm for final link prediction. Extensive experimentation across six social network datasets validates the efficacy of our approach, demonstrating improved predictive performance compared to existing methods.
期刊介绍:
Knowledge and Information Systems (KAIS) provides an international forum for researchers and professionals to share their knowledge and report new advances on all topics related to knowledge systems and advanced information systems. This monthly peer-reviewed archival journal publishes state-of-the-art research reports on emerging topics in KAIS, reviews of important techniques in related areas, and application papers of interest to a general readership.