{"title":"A Communication-Contention-Aware Privacy-Preserving Workflow Scheduling Method for Geo-Distributed Datacenters","authors":"Xinyue Shu;Quanwang Wu;MengChu Zhou;Junhao Wen","doi":"10.1109/TSC.2024.3407595","DOIUrl":null,"url":null,"abstract":"Owing to real-world demands for global collaboration and increasing volumes of data to be analyzed, many data-intensive workflow applications are deployed in geographically distributed (geo-distributed) datacenters (DCs). In such an environment, inter-DC bandwidths are much slower than intra-DC ones, and how to effectively schedule inter-DC data communication without contention is crucial to a workflow's execution time. Meanwhile, the diversity of data privacy requirements in geo-distributed DCs causes an additional challenge. This article introduces a workflow scheduling model for geo-distributed DCs where inter-DC communications are explicitly considered and data privacy must be protected. A Communication-contention-Aware Privacy-preserving Scheduling (CAPS) method is proposed to solve it for the first time. CAPS distributes workflow tasks to DCs via a simulated annealing method such that privacy constraints are respected and the overall inter-DC data transmission time is minimized. It adopts a list scheduling heuristic to schedule tasks and data communications to computation and network resources. In experiments, CAPS is compared against leading-edge methods with realistic workflows and network settings. The results reveal that it can reduce workflow makespan by 7.08-87.53% in comparison with its peers, while guaranteeing data privacy and resolving all the communication contention issues, which has not been seen in the existing work.","PeriodicalId":13255,"journal":{"name":"IEEE Transactions on Services Computing","volume":"17 5","pages":"1887-1898"},"PeriodicalIF":5.8000,"publicationDate":"2024-06-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Services Computing","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10546273/","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
Owing to real-world demands for global collaboration and increasing volumes of data to be analyzed, many data-intensive workflow applications are deployed in geographically distributed (geo-distributed) datacenters (DCs). In such an environment, inter-DC bandwidths are much slower than intra-DC ones, and how to effectively schedule inter-DC data communication without contention is crucial to a workflow's execution time. Meanwhile, the diversity of data privacy requirements in geo-distributed DCs causes an additional challenge. This article introduces a workflow scheduling model for geo-distributed DCs where inter-DC communications are explicitly considered and data privacy must be protected. A Communication-contention-Aware Privacy-preserving Scheduling (CAPS) method is proposed to solve it for the first time. CAPS distributes workflow tasks to DCs via a simulated annealing method such that privacy constraints are respected and the overall inter-DC data transmission time is minimized. It adopts a list scheduling heuristic to schedule tasks and data communications to computation and network resources. In experiments, CAPS is compared against leading-edge methods with realistic workflows and network settings. The results reveal that it can reduce workflow makespan by 7.08-87.53% in comparison with its peers, while guaranteeing data privacy and resolving all the communication contention issues, which has not been seen in the existing work.
期刊介绍:
IEEE Transactions on Services Computing encompasses the computing and software aspects of the science and technology of services innovation research and development. It places emphasis on algorithmic, mathematical, statistical, and computational methods central to services computing. Topics covered include Service Oriented Architecture, Web Services, Business Process Integration, Solution Performance Management, and Services Operations and Management. The transactions address mathematical foundations, security, privacy, agreement, contract, discovery, negotiation, collaboration, and quality of service for web services. It also covers areas like composite web service creation, business and scientific applications, standards, utility models, business process modeling, integration, collaboration, and more in the realm of Services Computing.