{"title":"DGERCL: A Dynamic Graph Embedding Approach for Root Cause Localization in Microservice Systems","authors":"Han Cheng;Qian Li;Bingchen Liu;Shijun Liu;Li Pan","doi":"10.1109/TSC.2024.3437742","DOIUrl":null,"url":null,"abstract":"Root cause localization in microservice systems refers to finding the root cause that causes system anomalies using system information. Many methods construct a graph structure and perform random walk on it to localize the root cause. This is not suitable for larger systems due to the high computational overhead. Besides, the constructed graph is usually static which mismatches with evolving metrics. Different metrics also contribute differently to determining root cause. To address these challenges, we have developed DGERCL, a novel method that employs dynamic graph embedding to localize root causes in microservice systems. We construct a dynamic graph where nodes, edges, and features correspond to microservices, invocations, and metrics. DGERCL first gets invocation information by aggregating node embedding and features via a trainable structure. An LSTM then processes invocation information to update node embedding. We also propose a neighbor information aggregation method to enrich structure information and a self-attention-inspired mechanism to leverage the importance of metrics for better mining metrics information. Finally, a classifier maps node embedding learned by LSTM to possibilities belonging to root cause. We conduct comprehensive experiments on two microservice benchmarks. Our model achieves good results which demonstrates the effectiveness of DGERCL.","PeriodicalId":13255,"journal":{"name":"IEEE Transactions on Services Computing","volume":"17 6","pages":"3417-3428"},"PeriodicalIF":5.8000,"publicationDate":"2024-08-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Services Computing","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10621623/","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
Root cause localization in microservice systems refers to finding the root cause that causes system anomalies using system information. Many methods construct a graph structure and perform random walk on it to localize the root cause. This is not suitable for larger systems due to the high computational overhead. Besides, the constructed graph is usually static which mismatches with evolving metrics. Different metrics also contribute differently to determining root cause. To address these challenges, we have developed DGERCL, a novel method that employs dynamic graph embedding to localize root causes in microservice systems. We construct a dynamic graph where nodes, edges, and features correspond to microservices, invocations, and metrics. DGERCL first gets invocation information by aggregating node embedding and features via a trainable structure. An LSTM then processes invocation information to update node embedding. We also propose a neighbor information aggregation method to enrich structure information and a self-attention-inspired mechanism to leverage the importance of metrics for better mining metrics information. Finally, a classifier maps node embedding learned by LSTM to possibilities belonging to root cause. We conduct comprehensive experiments on two microservice benchmarks. Our model achieves good results which demonstrates the effectiveness of DGERCL.
期刊介绍:
IEEE Transactions on Services Computing encompasses the computing and software aspects of the science and technology of services innovation research and development. It places emphasis on algorithmic, mathematical, statistical, and computational methods central to services computing. Topics covered include Service Oriented Architecture, Web Services, Business Process Integration, Solution Performance Management, and Services Operations and Management. The transactions address mathematical foundations, security, privacy, agreement, contract, discovery, negotiation, collaboration, and quality of service for web services. It also covers areas like composite web service creation, business and scientific applications, standards, utility models, business process modeling, integration, collaboration, and more in the realm of Services Computing.