Balanced double queues for GC work-stealing on weak memory models

Michihiro Horie, H. Horii, Kazunori Ogata, Tamiya Onodera
{"title":"Balanced double queues for GC work-stealing on weak memory models","authors":"Michihiro Horie, H. Horii, Kazunori Ogata, Tamiya Onodera","doi":"10.1145/3210563.3210570","DOIUrl":null,"url":null,"abstract":"Work-stealing is promising for scheduling and balancing parallel workloads. It has a wide range of applicability on middleware, libraries, and runtime systems of programming languages. OpenJDK uses work-stealing for copying garbage collection (GC) to balance copying tasks among GC threads. Each thread has its own queue to store tasks. When a thread has no task in its queue, it acts as a thief and attempts to steal a task from another thread's queue. However, this work-stealing algorithm requires expensive memory fences for pushing, popping, and stealing tasks, especially on weak memory models such as POWER and ARM. To address this problem, we propose a work-stealing algorithm that uses double queues. Each GC thread has a public queue that is accessible from other GC threads and a private queue that is only accessible by itself. Pushing and popping tasks in the private queue are free from expensive memory fences. The most significant point in our algorithm is providing a mechanism to maintain the load balance on the basis of the use of double queues. We developed a prototype implementation for parallel GC in OpenJDK8 for ppc64le. We evaluated our algorithm by using SPECjbb2015, SPECjvm2008, TPC-DS, and Apache DayTrader.","PeriodicalId":420262,"journal":{"name":"Proceedings of the 2018 ACM SIGPLAN International Symposium on Memory Management","volume":"25 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-06-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2018 ACM SIGPLAN International Symposium on Memory Management","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3210563.3210570","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3

Abstract

Work-stealing is promising for scheduling and balancing parallel workloads. It has a wide range of applicability on middleware, libraries, and runtime systems of programming languages. OpenJDK uses work-stealing for copying garbage collection (GC) to balance copying tasks among GC threads. Each thread has its own queue to store tasks. When a thread has no task in its queue, it acts as a thief and attempts to steal a task from another thread's queue. However, this work-stealing algorithm requires expensive memory fences for pushing, popping, and stealing tasks, especially on weak memory models such as POWER and ARM. To address this problem, we propose a work-stealing algorithm that uses double queues. Each GC thread has a public queue that is accessible from other GC threads and a private queue that is only accessible by itself. Pushing and popping tasks in the private queue are free from expensive memory fences. The most significant point in our algorithm is providing a mechanism to maintain the load balance on the basis of the use of double queues. We developed a prototype implementation for parallel GC in OpenJDK8 for ppc64le. We evaluated our algorithm by using SPECjbb2015, SPECjvm2008, TPC-DS, and Apache DayTrader.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
在弱内存模型上用于GC工作窃取的平衡双队列
偷工作对于调度和平衡并行工作负载很有希望。它在中间件、库和编程语言的运行时系统上具有广泛的适用性。OpenJDK使用工作窃取来复制垃圾收集(GC),以平衡GC线程之间的复制任务。每个线程都有自己的队列来存储任务。当线程的队列中没有任务时,它就像小偷一样,试图从另一个线程的队列中窃取任务。然而,这种窃取工作的算法需要昂贵的内存围栏来推送、弹出和窃取任务,特别是在POWER和ARM等弱内存模型上。为了解决这个问题,我们提出了一个使用双队列的工作窃取算法。每个GC线程都有一个可供其他GC线程访问的公共队列和一个只能由自己访问的私有队列。在私有队列中推送和弹出任务不需要昂贵的内存屏障。我们的算法中最重要的一点是提供了一种机制,在使用双队列的基础上维持负载平衡。我们在OpenJDK8中为ppc64le开发了一个并行GC的原型实现。我们使用SPECjbb2015、SPECjvm2008、TPC-DS和Apache DayTrader来评估我们的算法。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
FRC: a high-performance concurrent parallel deferred reference counter for C++ mPart: miss-ratio curve guided partitioning in key-value stores Detailed heap profiling OMR: out-of-core MapReduce for large data sets Hardware-software co-optimization of memory management in dynamic languages
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1