首页 > 最新文献

Proceedings of the Eleventh European Conference on Computer Systems最新文献

英文 中文
EuroSys '22: Seventeenth European Conference on Computer Systems, Rennes, France, April 5 - 8, 2022 EuroSys '22:第十七届欧洲计算机系统会议,法国雷恩,2022年4月5日至8日
Pub Date : 2022-01-01 DOI: 10.1145/3492321
{"title":"EuroSys '22: Seventeenth European Conference on Computer Systems, Rennes, France, April 5 - 8, 2022","authors":"","doi":"10.1145/3492321","DOIUrl":"https://doi.org/10.1145/3492321","url":null,"abstract":"","PeriodicalId":20737,"journal":{"name":"Proceedings of the Eleventh European Conference on Computer Systems","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2022-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"91322469","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
EuroSys '21: Sixteenth European Conference on Computer Systems, Online Event, United Kingdom, April 26-28, 2021 EuroSys '21:第十六届欧洲计算机系统会议,在线活动,英国,2021年4月26-28日
Pub Date : 2021-01-01 DOI: 10.1145/3447786
{"title":"EuroSys '21: Sixteenth European Conference on Computer Systems, Online Event, United Kingdom, April 26-28, 2021","authors":"","doi":"10.1145/3447786","DOIUrl":"https://doi.org/10.1145/3447786","url":null,"abstract":"","PeriodicalId":20737,"journal":{"name":"Proceedings of the Eleventh European Conference on Computer Systems","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2021-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"84301512","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
EuroSys '20: Fifteenth EuroSys Conference 2020, Heraklion, Greece, April 27-30, 2020 2020年第15届EuroSys会议,伊拉克利翁,希腊,2020年4月27-30日
Pub Date : 2020-01-01 DOI: 10.1145/3342195
{"title":"EuroSys '20: Fifteenth EuroSys Conference 2020, Heraklion, Greece, April 27-30, 2020","authors":"","doi":"10.1145/3342195","DOIUrl":"https://doi.org/10.1145/3342195","url":null,"abstract":"","PeriodicalId":20737,"journal":{"name":"Proceedings of the Eleventh European Conference on Computer Systems","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2020-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"78763375","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Application-specific quantum for multi-core platform scheduler 用于多核平台调度器的特定应用程序量子
Pub Date : 2016-04-18 DOI: 10.1145/2901318.2901340
Boris Teabe, A. Tchana, D. Hagimont
Scheduling has a significant influence on application performance. Deciding on a quantum length can be very tricky, especially when concurrent applications have various characteristics. This is actually the case in virtualized cloud computing environments where virtual machines from different users are colocated on the same physical machine. We claim that in a multi-core virtualized platform, different quantum lengths should be associated with different application types. We apply this principle in a new scheduler called AQL_Sched. We identified 5 main application types and experimentally found the best quantum length for each of them. Dynamically, AQL_Sched associates an application type with each virtual CPU (vCPU) and schedules vCPUs according to their type on physical CPU (pCPU) pools with the best quantum length. Therefore, each vCPU is scheduled on a pCPU with the best quantum length. We implemented a prototype of AQL_Sched in Xen and we evaluated it with various reference benchmarks (SPECweb2009, SPECmail2009, SPEC CPU2006, and PARSEC). The evaluation results show that AQL_Sched outperforms Xen's credit scheduler. For instance, up to 20%, 10% and 15% of performance improvements have been obtained with SPECweb2009, SPEC CPU2006 and PARSEC, respectively.
调度对应用程序的性能影响很大。确定量子长度可能非常棘手,特别是当并发应用程序具有各种特征时。这实际上是虚拟化云计算环境中的情况,其中来自不同用户的虚拟机位于同一物理机器上。我们认为,在多核虚拟化平台中,不同的量子长度应该与不同的应用程序类型相关联。我们在名为AQL_Sched的新调度器中应用了这一原则。我们确定了5种主要的应用类型,并通过实验找到了每种应用类型的最佳量子长度。AQL_Sched动态地将应用程序类型与每个虚拟CPU (vCPU)关联,并根据其类型在最佳量子长度的物理CPU (pCPU)池上调度vCPU。因此,每个vCPU被调度到一个量子长度最佳的pCPU上。我们在Xen中实现了AQL_Sched的原型,并使用各种参考基准(SPECweb2009、SPECmail2009、SPEC CPU2006和PARSEC)对其进行了评估。评估结果表明,aql_scheed优于Xen的信用调度程序。例如,SPECweb2009、SPEC CPU2006和PARSEC分别获得了高达20%、10%和15%的性能改进。
{"title":"Application-specific quantum for multi-core platform scheduler","authors":"Boris Teabe, A. Tchana, D. Hagimont","doi":"10.1145/2901318.2901340","DOIUrl":"https://doi.org/10.1145/2901318.2901340","url":null,"abstract":"Scheduling has a significant influence on application performance. Deciding on a quantum length can be very tricky, especially when concurrent applications have various characteristics. This is actually the case in virtualized cloud computing environments where virtual machines from different users are colocated on the same physical machine. We claim that in a multi-core virtualized platform, different quantum lengths should be associated with different application types. We apply this principle in a new scheduler called AQL_Sched. We identified 5 main application types and experimentally found the best quantum length for each of them. Dynamically, AQL_Sched associates an application type with each virtual CPU (vCPU) and schedules vCPUs according to their type on physical CPU (pCPU) pools with the best quantum length. Therefore, each vCPU is scheduled on a pCPU with the best quantum length. We implemented a prototype of AQL_Sched in Xen and we evaluated it with various reference benchmarks (SPECweb2009, SPECmail2009, SPEC CPU2006, and PARSEC). The evaluation results show that AQL_Sched outperforms Xen's credit scheduler. For instance, up to 20%, 10% and 15% of performance improvements have been obtained with SPECweb2009, SPEC CPU2006 and PARSEC, respectively.","PeriodicalId":20737,"journal":{"name":"Proceedings of the Eleventh European Conference on Computer Systems","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2016-04-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"75426718","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 22
Picocenter: supporting long-lived, mostly-idle applications in cloud environments Picocenter:在云环境中支持长寿命、大部分空闲的应用程序
Pub Date : 2016-04-18 DOI: 10.1145/2901318.2901345
L. Zhang, J. Litton, Frank Cangialosi, Theophilus A. Benson, Dave Levin, A. Mislove
Cloud computing has evolved to meet user demands, from arbitrary VMs offered by IaaS to the narrow application interfaces of PaaS. Unfortunately, there exists an intermediate point that is not well met by today's offerings: users who wish to run arbitrary, already available binaries (as opposed to rewriting their own application for a PaaS) yet expect their applications to be long-lived but mostly idle (as opposed to the always-on VM of IaaS). For example, end users who wish to run their own email or DNS server. In this paper, we explore an alternative approach for cloud computation based on a process-like abstraction rather than a virtual machine abstraction, thereby gaining the scalability and efficiency of PaaS along with the generality of IaaS. We present the design of Picocenter, a hosting infrastructure for such applications that enables use of legacy applications. The key technical challenge in Picocenter is enabling fast swapping of applications to and from cloud storage (since, by definition, applications are largely idle, we expect them to spend the majority of their time swapped out). We develop an ActiveSet technique that prefetches the application's predicted memory working set when reviving an application. An evaluation on EC2 demonstrates that using ActiveSet, Picocenter is able to swap in applications in under 250 ms even when they are stored in S3 while swapped out.
云计算已经发展到满足用户需求,从IaaS提供的任意vm到PaaS的狭窄应用程序接口。不幸的是,目前的产品没有很好地满足一个中间点:用户希望运行任意的、已经可用的二进制文件(而不是为PaaS重写自己的应用程序),但又希望他们的应用程序寿命长,但大部分时间是空闲的(而不是IaaS的永远在线的VM)。例如,希望运行自己的电子邮件或DNS服务器的最终用户。在本文中,我们探索了一种基于类过程抽象而不是虚拟机抽象的云计算替代方法,从而获得了PaaS的可扩展性和效率以及IaaS的通用性。我们介绍了Picocenter的设计,这是一种用于此类应用程序的托管基础设施,可以使用遗留应用程序。Picocenter的关键技术挑战是实现应用程序与云存储之间的快速交换(因为,根据定义,应用程序基本上是空闲的,我们预计它们将花费大部分时间交换)。我们开发了一种ActiveSet技术,可以在恢复应用程序时预取应用程序的预测内存工作集。对EC2的评估表明,使用ActiveSet, Picocenter能够在250毫秒内交换应用程序,即使它们在交换时存储在S3中。
{"title":"Picocenter: supporting long-lived, mostly-idle applications in cloud environments","authors":"L. Zhang, J. Litton, Frank Cangialosi, Theophilus A. Benson, Dave Levin, A. Mislove","doi":"10.1145/2901318.2901345","DOIUrl":"https://doi.org/10.1145/2901318.2901345","url":null,"abstract":"Cloud computing has evolved to meet user demands, from arbitrary VMs offered by IaaS to the narrow application interfaces of PaaS. Unfortunately, there exists an intermediate point that is not well met by today's offerings: users who wish to run arbitrary, already available binaries (as opposed to rewriting their own application for a PaaS) yet expect their applications to be long-lived but mostly idle (as opposed to the always-on VM of IaaS). For example, end users who wish to run their own email or DNS server. In this paper, we explore an alternative approach for cloud computation based on a process-like abstraction rather than a virtual machine abstraction, thereby gaining the scalability and efficiency of PaaS along with the generality of IaaS. We present the design of Picocenter, a hosting infrastructure for such applications that enables use of legacy applications. The key technical challenge in Picocenter is enabling fast swapping of applications to and from cloud storage (since, by definition, applications are largely idle, we expect them to spend the majority of their time swapped out). We develop an ActiveSet technique that prefetches the application's predicted memory working set when reviving an application. An evaluation on EC2 demonstrates that using ActiveSet, Picocenter is able to swap in applications in under 250 ms even when they are stored in S3 while swapped out.","PeriodicalId":20737,"journal":{"name":"Proceedings of the Eleventh European Conference on Computer Systems","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2016-04-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"84513847","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 31
Practical condition synchronization for transactional memory 事务性内存的实际条件同步
Pub Date : 2016-04-18 DOI: 10.1145/2901318.2901342
Chao Wang, Michael F. Spear
Few transactional memory implementations allow for condition synchronization among transactions. The problems are many, most notably the lack of consensus about a single appropriate linguistic construct, and the lack of mechanisms that are compatible with hardware transactional memory. In this paper, we introduce a broadly useful mechanism for supporting condition synchronization among transactions. Our mechanism supports a number of linguistic constructs for coordinating transactions, and does so without introducing overhead on in-flight hardware transactions. Experiments show that our mechanisms work well, and that the diversity of linguistic constructs allows programmers to chose the technique that is best suited to a particular application.
很少有事务性内存实现允许事务间的条件同步。问题有很多,最明显的是缺乏对单一适当的语言结构的共识,以及缺乏与硬件事务性内存兼容的机制。在本文中,我们引入了一种广泛使用的机制来支持事务之间的条件同步。我们的机制支持许多用于协调事务的语言结构,并且不会在运行中的硬件事务上引入开销。实验表明,我们的机制运行良好,语言结构的多样性允许程序员选择最适合特定应用程序的技术。
{"title":"Practical condition synchronization for transactional memory","authors":"Chao Wang, Michael F. Spear","doi":"10.1145/2901318.2901342","DOIUrl":"https://doi.org/10.1145/2901318.2901342","url":null,"abstract":"Few transactional memory implementations allow for condition synchronization among transactions. The problems are many, most notably the lack of consensus about a single appropriate linguistic construct, and the lack of mechanisms that are compatible with hardware transactional memory. In this paper, we introduce a broadly useful mechanism for supporting condition synchronization among transactions. Our mechanism supports a number of linguistic constructs for coordinating transactions, and does so without introducing overhead on in-flight hardware transactions. Experiments show that our mechanisms work well, and that the diversity of linguistic constructs allows programmers to chose the technique that is best suited to a particular application.","PeriodicalId":20737,"journal":{"name":"Proceedings of the Eleventh European Conference on Computer Systems","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2016-04-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"85628277","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Sinter: low-bandwidth remote access for the visually-impaired 烧结:为视障人士提供低带宽远程访问
Pub Date : 2016-04-18 DOI: 10.1145/2901318.2901335
Syed Masum Billah, Donald E. Porter, I. Ramakrishnan
Computer users commonly use applications designed for different operating systems (OSes). For instance, a Mac user may access a cloud-based Windows remote desktop to run an application required for her job. Current remote access protocols do not work well with screen readers, creating a disproportionate burden for users with visual impairments. These users' productivity depends on features of a specific screen reader, and readers are locked-in to a specific OS. The only current option is to run a different screen reader on each platform, which harms productivity. This paper describes a framework, called Sinter, that efficiently and seamlessly supports remote, cross-platform screen reading, without modifying the application or the screen reader. Sinter addresses these problems with a platform-independent intermediate representation (IR) of a remote application's user interface (UI). The Sinter IR encapsulates platform-specific accessibility code on the remote system, facilitates development of additional accessibility features, and is simple enough to be reconstructed and read on any client platform. In the example above, Sinter allows a Mac-only reader to read remote Windows applications. Sinter supports low-bandwidth, remote access to a wide range of applications, including Microsoft Word and Apple Mail, with both Windows and OS X clients and servers, as well as a web browser client. Sinter's IR-level programming model facilitates development of accessibility features and other enhancements, transparently to the remote application and reader. Sinter's latency is low enough for practical use, even over a relatively slow network connection.
计算机用户通常使用针对不同操作系统(os)设计的应用程序。例如,Mac用户可以访问基于云的Windows远程桌面来运行其工作所需的应用程序。目前的远程访问协议不能很好地与屏幕阅读器一起工作,给有视觉障碍的用户造成了不成比例的负担。这些用户的工作效率取决于特定屏幕阅读器的功能,而阅读器被锁定在特定的操作系统上。目前唯一的选择是在每个平台上运行不同的屏幕阅读器,这会损害工作效率。本文描述了一个名为Sinter的框架,它可以高效无缝地支持远程跨平台屏幕读取,而无需修改应用程序或屏幕阅读器。Sinter通过远程应用程序用户界面(UI)的独立于平台的中间表示(IR)来解决这些问题。Sinter IR在远程系统上封装了特定于平台的可访问性代码,促进了其他可访问性特性的开发,并且非常简单,可以在任何客户机平台上重构和读取。在上面的例子中,Sinter允许mac阅读器读取远程Windows应用程序。烧结支持低带宽,远程访问广泛的应用程序,包括微软Word和苹果邮件,与Windows和OS X客户端和服务器,以及web浏览器客户端。Sinter的ir级编程模型促进了可访问性特性和其他增强功能的开发,对远程应用程序和阅读器透明。烧结的延迟足够低,可以实际使用,即使在相对较慢的网络连接上。
{"title":"Sinter: low-bandwidth remote access for the visually-impaired","authors":"Syed Masum Billah, Donald E. Porter, I. Ramakrishnan","doi":"10.1145/2901318.2901335","DOIUrl":"https://doi.org/10.1145/2901318.2901335","url":null,"abstract":"Computer users commonly use applications designed for different operating systems (OSes). For instance, a Mac user may access a cloud-based Windows remote desktop to run an application required for her job. Current remote access protocols do not work well with screen readers, creating a disproportionate burden for users with visual impairments. These users' productivity depends on features of a specific screen reader, and readers are locked-in to a specific OS. The only current option is to run a different screen reader on each platform, which harms productivity. This paper describes a framework, called Sinter, that efficiently and seamlessly supports remote, cross-platform screen reading, without modifying the application or the screen reader. Sinter addresses these problems with a platform-independent intermediate representation (IR) of a remote application's user interface (UI). The Sinter IR encapsulates platform-specific accessibility code on the remote system, facilitates development of additional accessibility features, and is simple enough to be reconstructed and read on any client platform. In the example above, Sinter allows a Mac-only reader to read remote Windows applications. Sinter supports low-bandwidth, remote access to a wide range of applications, including Microsoft Word and Apple Mail, with both Windows and OS X clients and servers, as well as a web browser client. Sinter's IR-level programming model facilitates development of accessibility features and other enhancements, transparently to the remote application and reader. Sinter's latency is low enough for practical use, even over a relatively slow network connection.","PeriodicalId":20737,"journal":{"name":"Proceedings of the Eleventh European Conference on Computer Systems","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2016-04-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"81981516","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 11
zExpander: a key-value cache with both high performance and fewer misses zExpander:一个键值缓存,具有高性能和更少的丢失
Pub Date : 2016-04-18 DOI: 10.1145/2901318.2901332
Xingbo Wu, Li Zhang, Yandong Wang, Yufei Ren, M. Hack, Song Jiang
While key-value (KV) cache, such as memcached, dedicates a large volume of expensive memory to holding performance-critical data, it is important to improve memory efficiency, or to reduce cache miss ratio without adding more memory. As we find that optimizing replacement algorithms is of limited effect for this purpose, a promising approach is to use a compact data organization and data compression to increase effective cache size. However, this approach has the risk of degrading the cache's performance due to additional computation cost. A common perception is that a high-performance KV cache is not compatible with use of data compacting techniques. In this paper, we show that, by leveraging highly skewed data access pattern common in real-world KV cache workloads, we can both reduce miss ratio through improved memory efficiency and maintain high performance for a KV cache. Specifically, we design and implement a KV cache system, named zExpander, which dynamically partitions the cache into two sub-caches. One serves frequently accessed data for high performance, and the other compacts data and metadata for high memory efficiency to reduce misses. Experiments show that zExpander can increase memcached's effective cache size by up to 2x and reduce miss ratio by up to 46%. When integrated with a cache of a higher performance, its advantages remain. For example, with 24 threads on a YCSB workload zExpander can achieve throughput of 32 million RPS with 36% of its cache misses removed.
虽然键值(KV)缓存(如memcached)将大量昂贵的内存用于保存性能关键型数据,但提高内存效率或在不添加更多内存的情况下减少缓存缺失率非常重要。由于我们发现优化替换算法对于这一目的的效果有限,一个有前途的方法是使用紧凑的数据组织和数据压缩来增加有效的缓存大小。但是,由于额外的计算成本,这种方法有降低缓存性能的风险。一个普遍的看法是高性能KV缓存与数据压缩技术的使用不兼容。在本文中,我们表明,通过利用现实世界中KV缓存工作负载中常见的高度倾斜的数据访问模式,我们既可以通过提高内存效率来降低丢失率,又可以保持KV缓存的高性能。具体来说,我们设计并实现了一个KV缓存系统zExpander,它动态地将缓存划分为两个子缓存。一个为高性能提供频繁访问的数据,另一个为高内存效率压缩数据和元数据,以减少丢失。实验表明,zExpander可以将memcached的有效缓存大小提高2倍,并将丢失率降低46%。当与更高性能的缓存集成时,其优势仍然存在。例如,在YCSB工作负载上使用24个线程时,zExpander可以实现3200万RPS的吞吐量,并删除36%的缓存缺失。
{"title":"zExpander: a key-value cache with both high performance and fewer misses","authors":"Xingbo Wu, Li Zhang, Yandong Wang, Yufei Ren, M. Hack, Song Jiang","doi":"10.1145/2901318.2901332","DOIUrl":"https://doi.org/10.1145/2901318.2901332","url":null,"abstract":"While key-value (KV) cache, such as memcached, dedicates a large volume of expensive memory to holding performance-critical data, it is important to improve memory efficiency, or to reduce cache miss ratio without adding more memory. As we find that optimizing replacement algorithms is of limited effect for this purpose, a promising approach is to use a compact data organization and data compression to increase effective cache size. However, this approach has the risk of degrading the cache's performance due to additional computation cost. A common perception is that a high-performance KV cache is not compatible with use of data compacting techniques. In this paper, we show that, by leveraging highly skewed data access pattern common in real-world KV cache workloads, we can both reduce miss ratio through improved memory efficiency and maintain high performance for a KV cache. Specifically, we design and implement a KV cache system, named zExpander, which dynamically partitions the cache into two sub-caches. One serves frequently accessed data for high performance, and the other compacts data and metadata for high memory efficiency to reduce misses. Experiments show that zExpander can increase memcached's effective cache size by up to 2x and reduce miss ratio by up to 46%. When integrated with a cache of a higher performance, its advantages remain. For example, with 24 threads on a YCSB workload zExpander can achieve throughput of 32 million RPS with 36% of its cache misses removed.","PeriodicalId":20737,"journal":{"name":"Proceedings of the Eleventh European Conference on Computer Systems","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2016-04-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"84097172","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 34
Exploiting variability for energy optimization of parallel programs 利用可变性进行并行程序的能量优化
Pub Date : 2016-04-18 DOI: 10.1145/2901318.2901329
W. Lavrijsen, Costin Iancu, W. A. Jong, Xin Chen, K. Schwan
In this paper we present optimizations that use DVFS mechanisms to reduce the total energy usage in scientific applications. Our main insight is that noise is intrinsic to large scale parallel executions and it appears whenever shared resources are contended. The presence of noise allows us to identify and manipulate any program regions amenable to DVFS. When compared to previous energy optimizations that make per core decisions using predictions of the running time, our scheme uses a qualitative approach to recognize the signature of executions amenable to DVFS. By recognizing the "shape of variability" we can optimize codes with highly dynamic behavior, which pose challenges to all existing DVFS techniques. We validate our approach using offline and online analyses for one-sided and two-sided communication paradigms. We have applied our methods to NWChem, and we show best case improvements in energy use of 12% at no loss in performance when using online optimizations running on 720 Haswell cores with one-sided communication. With NWChem on MPI two-sided and offline analysis, capturing the initialization, we find energy savings of up to 20%, with less than 1% performance cost.
在本文中,我们提出了使用DVFS机制来减少科学应用中总能源使用的优化方法。我们的主要见解是,噪声是大规模并行执行所固有的,每当共享资源争用时就会出现。噪声的存在使我们能够识别和操纵任何适合DVFS的程序区域。与之前使用运行时间预测来做出每个核心决策的能源优化相比,我们的方案使用定性方法来识别适合DVFS的执行签名。通过识别“变异性的形状”,我们可以优化具有高动态行为的代码,这对所有现有的DVFS技术提出了挑战。我们通过对单边和双边交流范例的离线和在线分析来验证我们的方法。我们已经将我们的方法应用于NWChem,我们展示了在720个Haswell内核上运行的单向通信在线优化时,在不损失性能的情况下,能源使用改善了12%。通过NWChem对MPI的双边和离线分析,捕获初始化,我们发现节能高达20%,性能成本不到1%。
{"title":"Exploiting variability for energy optimization of parallel programs","authors":"W. Lavrijsen, Costin Iancu, W. A. Jong, Xin Chen, K. Schwan","doi":"10.1145/2901318.2901329","DOIUrl":"https://doi.org/10.1145/2901318.2901329","url":null,"abstract":"In this paper we present optimizations that use DVFS mechanisms to reduce the total energy usage in scientific applications. Our main insight is that noise is intrinsic to large scale parallel executions and it appears whenever shared resources are contended. The presence of noise allows us to identify and manipulate any program regions amenable to DVFS. When compared to previous energy optimizations that make per core decisions using predictions of the running time, our scheme uses a qualitative approach to recognize the signature of executions amenable to DVFS. By recognizing the \"shape of variability\" we can optimize codes with highly dynamic behavior, which pose challenges to all existing DVFS techniques. We validate our approach using offline and online analyses for one-sided and two-sided communication paradigms. We have applied our methods to NWChem, and we show best case improvements in energy use of 12% at no loss in performance when using online optimizations running on 720 Haswell cores with one-sided communication. With NWChem on MPI two-sided and offline analysis, capturing the initialization, we find energy savings of up to 20%, with less than 1% performance cost.","PeriodicalId":20737,"journal":{"name":"Proceedings of the Eleventh European Conference on Computer Systems","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2016-04-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"88000982","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
Proceedings of the Eleventh European Conference on Computer Systems 第十一届欧洲计算机系统会议论文集
Pub Date : 2016-04-18 DOI: 10.1145/2901318
Cristian Cadar, P. Pietzuch, K. Keeton, R. Rodrigues
Welcome to EuroSys 2016, held at Imperial College London, UK! This year's program includes 38 wonderful papers that cover a wide range of topics, including multicore systems and concurrency, distributed machine learning, studies of familiar operating system abstractions, heterogeneous and non-volatile memory systems, data center networking, novel techniques for energy and power optimization, and experiences from production systems.
欢迎参加在英国伦敦帝国理工学院举行的EuroSys 2016 !今年的课程包括38篇精彩的论文,涵盖了广泛的主题,包括多核系统和并发,分布式机器学习,熟悉的操作系统抽象研究,异构和非易失性存储系统,数据中心网络,能源和电源优化的新技术,以及生产系统的经验。
{"title":"Proceedings of the Eleventh European Conference on Computer Systems","authors":"Cristian Cadar, P. Pietzuch, K. Keeton, R. Rodrigues","doi":"10.1145/2901318","DOIUrl":"https://doi.org/10.1145/2901318","url":null,"abstract":"Welcome to EuroSys 2016, held at Imperial College London, UK! This year's program includes 38 wonderful papers that cover a wide range of topics, including multicore systems and concurrency, distributed machine learning, studies of familiar operating system abstractions, heterogeneous and non-volatile memory systems, data center networking, novel techniques for energy and power optimization, and experiences from production systems.","PeriodicalId":20737,"journal":{"name":"Proceedings of the Eleventh European Conference on Computer Systems","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2016-04-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"87854342","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
期刊
Proceedings of the Eleventh European Conference on Computer Systems
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1