Proceedings of the 20th International Middleware Conference最新文献

英文中文

Medley 混合泳

Proceedings of the 20th International Middleware Conference

Pub Date : 2019-12-09 DOI: 10.1093/gmo/9781561592630.article.18254

Rui Yang, Shichu Zhu, Yifei Li, Indranil Gupta

引用次数: 0

Boosting concurrency in Parallel State Machine Replication 提高并行状态机复制中的并发性

Proceedings of the 20th International Middleware Conference

Pub Date : 2019-12-09 DOI: 10.1145/3361525.3361549

Ian Aragon Escobar, E. Alchieri, F. Dotti, F. Pedone

State machine replication (SMR) is a well-known approach to implementing fault-tolerant services, providing high availability and strong consistency. To boost the performance of SMR, some proposals execute independent commands concurrently, while dependent commands execute sequentially in the total delivery order. The most general approach to handling command dependencies resorts to a directed acyclic graph (DAG), where nodes represent commands and edges represent dependencies. In this paper we show that due to the command arrival and multithreaded execution rates of SMR, a highly concurrent implementation of a DAG is needed. We show that a typical coarse-grained DAG implementation, where the whole graph is a critical section, results in a bottleneck in the replica. We propose two improvements to the coarse-grained DAG approach: fine-grained algorithms, using lock-coupling, and lock-free algorithms. Our fine-grain algorithms lock individual vertices in the DAG. The lock-free algorithms use nonblocking synchronization, with atomic operations, and lazy synchronization to postpone physical removal of nodes. All algorithms were integrated in a parallel SMR prototype. Experimental evaluation revealed that the fine-grained algorithms are also subject to a bottleneck. The lock-free implementation, however, sports linear speedup with the number of working threads, in some cases scaling up to 64 threads.

状态机复制(SMR)是一种众所周知的实现容错服务的方法，提供高可用性和强一致性。为了提高SMR的性能，一些建议并发执行独立命令，而依赖命令在总交付顺序中顺序执行。处理命令依赖关系的最通用方法是使用有向无环图(DAG)，其中节点表示命令，边表示依赖关系。在本文中，我们展示了由于SMR的命令到达和多线程执行速率，需要DAG的高度并发实现。我们展示了一个典型的粗粒度DAG实现，其中整个图是一个关键部分，会导致副本出现瓶颈。我们提出了对粗粒度DAG方法的两种改进:使用锁耦合的细粒度算法和无锁算法。我们的细粒度算法锁定DAG中的各个顶点。无锁算法使用非阻塞同步(带有原子操作)和延迟同步来延迟节点的物理移除。将所有算法集成到一个并行SMR原型中。实验评估表明，细粒度算法也会受到瓶颈的影响。但是，无锁实现随着工作线程的数量呈线性加速，在某些情况下可扩展到64个线程。

{"title":"Boosting concurrency in Parallel State Machine Replication","authors":"Ian Aragon Escobar, E. Alchieri, F. Dotti, F. Pedone","doi":"10.1145/3361525.3361549","DOIUrl":"https://doi.org/10.1145/3361525.3361549","url":null,"abstract":"State machine replication (SMR) is a well-known approach to implementing fault-tolerant services, providing high availability and strong consistency. To boost the performance of SMR, some proposals execute independent commands concurrently, while dependent commands execute sequentially in the total delivery order. The most general approach to handling command dependencies resorts to a directed acyclic graph (DAG), where nodes represent commands and edges represent dependencies. In this paper we show that due to the command arrival and multithreaded execution rates of SMR, a highly concurrent implementation of a DAG is needed. We show that a typical coarse-grained DAG implementation, where the whole graph is a critical section, results in a bottleneck in the replica. We propose two improvements to the coarse-grained DAG approach: fine-grained algorithms, using lock-coupling, and lock-free algorithms. Our fine-grain algorithms lock individual vertices in the DAG. The lock-free algorithms use nonblocking synchronization, with atomic operations, and lazy synchronization to postpone physical removal of nodes. All algorithms were integrated in a parallel SMR prototype. Experimental evaluation revealed that the fine-grained algorithms are also subject to a bottleneck. The lock-free implementation, however, sports linear speedup with the number of working threads, in some cases scaling up to 64 threads.","PeriodicalId":381253,"journal":{"name":"Proceedings of the 20th International Middleware Conference","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-12-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"120961158","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 14

ProvMark: A Provenance Expressiveness Benchmarking System ProvMark:一个来源表达性基准测试系统

Proceedings of the 20th International Middleware Conference

Pub Date : 2019-09-24 DOI: 10.1145/3361525.3361552

Sheung Chi Chan, J. Cheney, Pramod Bhatotia, Thomas Pasquier, Ashish Gehani, Hassaan Irshad, Lucian Carata, M. Seltzer

System level provenance is of widespread interest for applications such as security enforcement and information protection. However, testing the correctness or completeness of provenance capture tools is challenging and currently done manually. In some cases there is not even a clear consensus about what behavior is correct. We present an automated tool, ProvMark, that uses an existing provenance system as a black box and reliably identifies the provenance graph structure recorded for a given activity, by a reduction to subgraph isomorphism problems handled by an external solver. ProvMark is a beginning step in the much needed area of testing and comparing the expressiveness of provenance systems. We demonstrate ProvMark's usefuless in comparing three capture systems with different architectures and distinct design philosophies.

系统级溯源对于诸如安全执行和信息保护等应用具有广泛的兴趣。然而，测试来源捕获工具的正确性或完整性是具有挑战性的，目前是手工完成的。在某些情况下，对于什么行为是正确的，甚至没有明确的共识。我们提出了一个自动化工具，ProvMark，它使用一个现有的来源系统作为一个黑盒，并可靠地识别为给定活动记录的来源图结构，通过减少子图同构问题，由外部求解器处理。ProvMark是测试和比较来源系统的表达能力这一急需领域的开始。我们在比较具有不同架构和不同设计理念的三种捕获系统时展示了ProvMark的无用性。

引用次数: 6

FfDL: A Flexible Multi-tenant Deep Learning Platform FfDL:灵活的多租户深度学习平台

Proceedings of the 20th International Middleware Conference

Pub Date : 2019-09-14 DOI: 10.1145/3361525.3361538

K. R. Jayaram, Vinod Muthusamy, Parijat Dube, Vatche Isahagian, Chen Wang, Benjamin Herta, S. Boag, Diana Arroyo, A. Tantawi, Archit Verma, Falk Pollok, Rania Y. Khalaf

Deep learning (DL) is becoming increasingly popular in several application domains and has made several new application features involving computer vision, speech recognition and synthesis, self-driving automobiles, drug design, etc. feasible and accurate. As a result, large scale "on-premise" and "cloud-hosted" deep learning platforms have become essential infrastructure in many organizations. These systems accept, schedule, manage and execute DL training jobs at scale. This paper describes the design, implementation and our experiences with FfDL, a DL platform used at IBM. We describe how our design balances dependability with scalability, elasticity, flexibility and efficiency. We examine FfDL qualitatively through a retrospective look at the lessons learned from building, operating, and supporting FfDL; and quantitatively through a detailed empirical evaluation of FfDL, including the overheads introduced by the platform for various DL models, the load and performance observed in a real case study using FfDL within our organization, the frequency of various faults observed including faults that we did not anticipate, and experiments demonstrating the benefits of various scheduling policies. FfDL has been open-sourced.

深度学习(Deep learning, DL)在多个应用领域越来越受欢迎，并使计算机视觉、语音识别与合成、自动驾驶汽车、药物设计等一些新的应用特征变得可行和准确。因此，大规模的“内部部署”和“云托管”深度学习平台已成为许多组织必不可少的基础设施。这些系统接受、安排、管理和执行大规模的深度学习训练任务。本文介绍了IBM的深度学习平台FfDL的设计、实现和使用经验。我们描述了我们的设计如何平衡可靠性与可扩展性、弹性、灵活性和效率。我们通过回顾从构建、运营和支持FfDL中学到的经验教训来定性地检查FfDL;并通过对FfDL进行详细的定量实证评估，包括平台为各种DL模型引入的开销，在我们组织中使用FfDL的实际案例研究中观察到的负载和性能，观察到的各种故障的频率，包括我们没有预料到的故障，以及展示各种调度策略好处的实验。FfDL是开源的。

{"title":"FfDL: A Flexible Multi-tenant Deep Learning Platform","authors":"K. R. Jayaram, Vinod Muthusamy, Parijat Dube, Vatche Isahagian, Chen Wang, Benjamin Herta, S. Boag, Diana Arroyo, A. Tantawi, Archit Verma, Falk Pollok, Rania Y. Khalaf","doi":"10.1145/3361525.3361538","DOIUrl":"https://doi.org/10.1145/3361525.3361538","url":null,"abstract":"Deep learning (DL) is becoming increasingly popular in several application domains and has made several new application features involving computer vision, speech recognition and synthesis, self-driving automobiles, drug design, etc. feasible and accurate. As a result, large scale \"on-premise\" and \"cloud-hosted\" deep learning platforms have become essential infrastructure in many organizations. These systems accept, schedule, manage and execute DL training jobs at scale. This paper describes the design, implementation and our experiences with FfDL, a DL platform used at IBM. We describe how our design balances dependability with scalability, elasticity, flexibility and efficiency. We examine FfDL qualitatively through a retrospective look at the lessons learned from building, operating, and supporting FfDL; and quantitatively through a detailed empirical evaluation of FfDL, including the overheads introduced by the platform for various DL models, the load and performance observed in a real case study using FfDL within our organization, the frequency of various faults observed including faults that we did not anticipate, and experiments demonstrating the benefits of various scheduling policies. FfDL has been open-sourced.","PeriodicalId":381253,"journal":{"name":"Proceedings of the 20th International Middleware Conference","volume":"69 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-09-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115573510","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 16

Differential Approximation and Sprinting for Multi-Priority Big Data Engines 多优先级大数据引擎的微分逼近与冲刺

Proceedings of the 20th International Middleware Conference

Pub Date : 2019-09-12 DOI: 10.1145/3361525.3361547

R. Birke, Isabelly Rocha, Juan F. Pérez, V. Schiavoni, P. Felber, L. Chen

Today's big data clusters based on the MapReduce paradigm are capable of executing analysis jobs with multiple priorities, providing differential latency guarantees. Traces from production systems show that the latency advantage of high-priority jobs comes at the cost of severe latency degradation of low-priority jobs as well as daunting resource waste caused by repetitive eviction and re-execution of low-priority jobs. We advocate a new resource management design that exploits the idea of differential approximation and sprinting. The unique combination of approximation and sprinting avoids the eviction of low-priority jobs and its consequent latency degradation and resource waste. To this end, we designed, implemented and evaluated DiAS, an extension of the Spark processing engine to support deflate jobs by dropping tasks and to sprint jobs. Our experiments on scenarios with two and three priority classes indicate that DiAS achieves up to 90% and 60% latency reduction for low- and high-priority jobs, respectively. DiAS not only eliminates resource waste but also (surprisingly) lowers energy consumption up to 30% at only a marginal accuracy loss for low-priority jobs.

如今，基于MapReduce范式的大数据集群能够执行具有多个优先级的分析任务，并提供差异延迟保证。来自生产系统的跟踪显示，高优先级作业的延迟优势是以低优先级作业的严重延迟退化以及低优先级作业的重复退出和重新执行所造成的令人生畏的资源浪费为代价的。我们提倡一种新的资源管理设计，利用微分逼近和冲刺的思想。近似和冲刺的独特组合避免了低优先级作业的淘汰以及随之而来的延迟退化和资源浪费。为此，我们设计、实现并评估了DiAS，这是Spark处理引擎的一个扩展，支持通过丢弃任务来压缩作业和冲刺作业。我们对具有两个和三个优先级类别的场景进行的实验表明，DiAS分别为低优先级和高优先级作业实现了高达90%和60%的延迟减少。DiAS不仅消除了资源浪费，而且(令人惊讶的是)在低优先级作业的精度损失很小的情况下，降低了高达30%的能耗。

{"title":"Differential Approximation and Sprinting for Multi-Priority Big Data Engines","authors":"R. Birke, Isabelly Rocha, Juan F. Pérez, V. Schiavoni, P. Felber, L. Chen","doi":"10.1145/3361525.3361547","DOIUrl":"https://doi.org/10.1145/3361525.3361547","url":null,"abstract":"Today's big data clusters based on the MapReduce paradigm are capable of executing analysis jobs with multiple priorities, providing differential latency guarantees. Traces from production systems show that the latency advantage of high-priority jobs comes at the cost of severe latency degradation of low-priority jobs as well as daunting resource waste caused by repetitive eviction and re-execution of low-priority jobs. We advocate a new resource management design that exploits the idea of differential approximation and sprinting. The unique combination of approximation and sprinting avoids the eviction of low-priority jobs and its consequent latency degradation and resource waste. To this end, we designed, implemented and evaluated DiAS, an extension of the Spark processing engine to support deflate jobs by dropping tasks and to sprint jobs. Our experiments on scenarios with two and three priority classes indicate that DiAS achieves up to 90% and 60% latency reduction for low- and high-priority jobs, respectively. DiAS not only eliminates resource waste but also (surprisingly) lowers energy consumption up to 30% at only a marginal accuracy loss for low-priority jobs.","PeriodicalId":381253,"journal":{"name":"Proceedings of the 20th International Middleware Conference","volume":"112 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-09-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116075689","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 2

Pando: Personal Volunteer Computing in Browsers Pando:浏览器中的个人志愿计算

Proceedings of the 20th International Middleware Conference

Pub Date : 2018-03-22 DOI: 10.1145/3361525.3361539

Erick Lavoie, L. Hendren, F. Desprez, M. Correia

The large penetration and continued growth in ownership of personal electronic devices represents a freely available and largely untapped source of computing power. To leverage those, we present Pando, a new volunteer computing tool based on a declarative concurrent programming model and implemented using JavaScript, WebRTC, and WebSockets. This tool enables a dynamically varying number of failure-prone personal devices contributed by volunteers to parallelize the application of a function on a stream of values, by using the devices' browsers. We show that Pando can provide throughput improvements compared to a single personal device, on a variety of compute-bound applications including animation rendering and image processing. We also show the flexibility of our approach by deploying Pando on personal devices connected over a local network, on Grid5000, a French-wide computing grid in a virtual private network, and seven PlanetLab nodes distributed in a wide area network over Europe.

个人电子设备的大量普及和所有权的持续增长，代表了一种可以免费获得但基本上尚未开发的计算能力来源。为了利用这些，我们提出了Pando，一个新的志愿者计算工具，基于声明式并发编程模型，并使用JavaScript, WebRTC和WebSockets实现。通过使用设备的浏览器，该工具可以使志愿者提供的易发生故障的个人设备的数量动态变化，从而在值流上并行化一个函数的应用程序。我们展示了Pando可以在各种计算绑定应用程序(包括动画渲染和图像处理)上提供比单个个人设备更高的吞吐量。我们还通过将Pando部署在通过本地网络连接的个人设备上，在Grid5000(虚拟专用网中的法国范围计算网格)和分布在欧洲广域网中的七个PlanetLab节点上，展示了我们方法的灵活性。

引用次数: 15

Proceedings of the 20th International Middleware Conference 第20届国际中间件会议论文集

Proceedings of the 20th International Middleware Conference

Pub Date : 2012-12-03 DOI: 10.1145/3361525

P. Narasimhan, P. Triantafillou

This edition marks the 13th ACM/IFIP/USENIX Middleware Conference. The first conference was held in the Lake District of England in 1998, and its origins reflected the growing importance of middleware and the realization that middleware represented an active, rigorous, growing and evolving research discipline in its own right. The definition of the term "middleware" has also evolved in the past decade, but retains, at its core, the notion of different levels/layers of abstractions in distributed-computing systems. Since its inception, the Middleware Conference has remained a premier forum for the discussion of innovations and recent advances in the design, implementation, experimentation, deployment, and usage of middleware systems.

本次会议是第13届ACM/IFIP/USENIX中间件会议。第一届会议于1998年在英格兰湖区举行，它的起源反映了中间件日益增长的重要性，以及中间件代表了一个积极的、严谨的、不断发展和发展的研究学科的认识。“中间件”一词的定义在过去十年中也有所发展，但其核心仍然是分布式计算系统中不同层次/抽象层的概念。自成立以来，中间件大会一直是讨论中间件系统的设计、实现、实验、部署和使用方面的创新和最新进展的主要论坛。

引用次数: 0

首页上一页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

Proceedings of the 20th International Middleware Conference

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀