A Communication-Contention-Aware Privacy-Preserving Workflow Scheduling Method for Geo-Distributed Datacenters

IF 5.8 2区 计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS IEEE Transactions on Services Computing Pub Date : 2024-06-03 DOI:10.1109/TSC.2024.3407595
Xinyue Shu;Quanwang Wu;MengChu Zhou;Junhao Wen
{"title":"A Communication-Contention-Aware Privacy-Preserving Workflow Scheduling Method for Geo-Distributed Datacenters","authors":"Xinyue Shu;Quanwang Wu;MengChu Zhou;Junhao Wen","doi":"10.1109/TSC.2024.3407595","DOIUrl":null,"url":null,"abstract":"Owing to real-world demands for global collaboration and increasing volumes of data to be analyzed, many data-intensive workflow applications are deployed in geographically distributed (geo-distributed) datacenters (DCs). In such an environment, inter-DC bandwidths are much slower than intra-DC ones, and how to effectively schedule inter-DC data communication without contention is crucial to a workflow's execution time. Meanwhile, the diversity of data privacy requirements in geo-distributed DCs causes an additional challenge. This article introduces a workflow scheduling model for geo-distributed DCs where inter-DC communications are explicitly considered and data privacy must be protected. A Communication-contention-Aware Privacy-preserving Scheduling (CAPS) method is proposed to solve it for the first time. CAPS distributes workflow tasks to DCs via a simulated annealing method such that privacy constraints are respected and the overall inter-DC data transmission time is minimized. It adopts a list scheduling heuristic to schedule tasks and data communications to computation and network resources. In experiments, CAPS is compared against leading-edge methods with realistic workflows and network settings. The results reveal that it can reduce workflow makespan by 7.08-87.53% in comparison with its peers, while guaranteeing data privacy and resolving all the communication contention issues, which has not been seen in the existing work.","PeriodicalId":13255,"journal":{"name":"IEEE Transactions on Services Computing","volume":"17 5","pages":"1887-1898"},"PeriodicalIF":5.8000,"publicationDate":"2024-06-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Services Computing","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10546273/","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0

Abstract

Owing to real-world demands for global collaboration and increasing volumes of data to be analyzed, many data-intensive workflow applications are deployed in geographically distributed (geo-distributed) datacenters (DCs). In such an environment, inter-DC bandwidths are much slower than intra-DC ones, and how to effectively schedule inter-DC data communication without contention is crucial to a workflow's execution time. Meanwhile, the diversity of data privacy requirements in geo-distributed DCs causes an additional challenge. This article introduces a workflow scheduling model for geo-distributed DCs where inter-DC communications are explicitly considered and data privacy must be protected. A Communication-contention-Aware Privacy-preserving Scheduling (CAPS) method is proposed to solve it for the first time. CAPS distributes workflow tasks to DCs via a simulated annealing method such that privacy constraints are respected and the overall inter-DC data transmission time is minimized. It adopts a list scheduling heuristic to schedule tasks and data communications to computation and network resources. In experiments, CAPS is compared against leading-edge methods with realistic workflows and network settings. The results reveal that it can reduce workflow makespan by 7.08-87.53% in comparison with its peers, while guaranteeing data privacy and resolving all the communication contention issues, which has not been seen in the existing work.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
面向地理分布式数据中心的通信-保留-隐私保护工作流调度方法
由于现实世界对全球协作的需求以及需要分析的数据量不断增加,许多数据密集型工作流应用程序都部署在地理上分布(geo-distributed)的数据中心(DC)中。在这种环境下,数据中心间的带宽比数据中心内的带宽要慢得多,如何有效地安排数据中心间的数据通信而不发生争用,对工作流的执行时间至关重要。同时,地理分布式数据中心对数据隐私要求的多样性也带来了额外的挑战。本文介绍了一种地理分布式数据中心的工作流调度模型,其中明确考虑了数据中心间的通信,并且必须保护数据隐私。本文首次提出了通信-保留隐私调度(CAPS)方法来解决这一问题。CAPS 通过模拟退火方法将工作流任务分配到 DC,从而使隐私约束得到尊重,并使整个 DC 间数据传输时间最小化。它采用列表调度启发式将任务和数据通信调度到计算和网络资源上。在实验中,CAPS 与采用现实工作流和网络设置的先进方法进行了比较。实验结果表明,与同类方法相比,CAPS 能将工作流的时间跨度缩短 7.08%-87.53%,同时还能保证数据隐私并解决所有通信争用问题,这在现有研究中是没有的。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
IEEE Transactions on Services Computing
IEEE Transactions on Services Computing COMPUTER SCIENCE, INFORMATION SYSTEMS-COMPUTER SCIENCE, SOFTWARE ENGINEERING
CiteScore
11.50
自引率
6.20%
发文量
278
审稿时长
>12 weeks
期刊介绍: IEEE Transactions on Services Computing encompasses the computing and software aspects of the science and technology of services innovation research and development. It places emphasis on algorithmic, mathematical, statistical, and computational methods central to services computing. Topics covered include Service Oriented Architecture, Web Services, Business Process Integration, Solution Performance Management, and Services Operations and Management. The transactions address mathematical foundations, security, privacy, agreement, contract, discovery, negotiation, collaboration, and quality of service for web services. It also covers areas like composite web service creation, business and scientific applications, standards, utility models, business process modeling, integration, collaboration, and more in the realm of Services Computing.
期刊最新文献
NER-AD: Noise-Robust Reconstruction Enhanced by Representation-Learning for Metric Anomaly Detection in Online Service Systems Decentralized Load Balancing in Urban Edge computing with Spatial Modeling Age-Aware Big Data Query Evaluation for Analytic Services in Serverless Edge Clouds PMLSF: A Privacy-Preserving Machine Learning Service Framework for Efficient Task Allocation in Mobile Crowdsensing Online Microservice Deployment in Edge Networks via Multiobjective Deep Reinforcement Learning
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1