{"title":"GMaglev: Graph-friendly Consistent Hashing for Distributed Social Graph Partition","authors":"Miaomiao Cheng, Jiahui Zhang, Shuibing Long","doi":"10.1109/IDITR57726.2023.10145968","DOIUrl":null,"url":null,"abstract":"Consistent hashing plays an important role in distributed systems as well as in distributed graph systems. It assumes the role of load balancing and data relocation for system expansion and contraction. Graph partitioning is used by many distributed graph systems, which can improve the system’s performance. In distributed systems, expansion and contraction cause data relocation. Data relocation consumes CPU and leads to the unreliability of online services.The system’s performance using the graph partition algorithm to optimize data distribution will inevitably decline with data relocation. Data migration due to capacity expansion is accidental, and system efficiency is a long-term indicator. We present GMaglev to find a way to balance the data migration and the fanout optimization. Experiments on multiple datasets show the effectiveness of GMaglev algorithm. In the case of only 5% increase in data relocation radio, fanout is reduced by 20%.","PeriodicalId":272880,"journal":{"name":"2023 2nd International Conference on Innovations and Development of Information Technologies and Robotics (IDITR)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 2nd International Conference on Innovations and Development of Information Technologies and Robotics (IDITR)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IDITR57726.2023.10145968","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Consistent hashing plays an important role in distributed systems as well as in distributed graph systems. It assumes the role of load balancing and data relocation for system expansion and contraction. Graph partitioning is used by many distributed graph systems, which can improve the system’s performance. In distributed systems, expansion and contraction cause data relocation. Data relocation consumes CPU and leads to the unreliability of online services.The system’s performance using the graph partition algorithm to optimize data distribution will inevitably decline with data relocation. Data migration due to capacity expansion is accidental, and system efficiency is a long-term indicator. We present GMaglev to find a way to balance the data migration and the fanout optimization. Experiments on multiple datasets show the effectiveness of GMaglev algorithm. In the case of only 5% increase in data relocation radio, fanout is reduced by 20%.