2019 Design, Automation & Test in Europe Conference & Exhibition (DATE)最新文献

英文中文

Physical Synthesis of Flow-Based Microfluidic Biochips Considering Distributed Channel Storage 考虑分布式通道存储的流动微流控生物芯片的物理合成

2019 Design, Automation & Test in Europe Conference & Exhibition (DATE)

Pub Date : 2019-03-25 DOI: 10.23919/DATE.2019.8715269

Zhisheng Chen, Xing Huang, Wenzhong Guo, Bing Li, Tsung-Yi Ho, Ulf Schlichtmann

Flow-based microfluidic biochips (FBMBs) have attracted much attention over the past decade. On such a micrometer-scale platform, various biochemical applications, also called bioas-says, can be processed concurrently and automatically. To improve execution efficiency and reduce fabrication cost, a distributed channel-storage architecture (DCSA) can be implemented on this platform, where fluid samples can be cached temporarily in flow channels close to components. Although DCSA can improve the execution efficiency of FBMBs significantly, it requires a careful arrangement of fluid samples to enable the channels to fulfill the dual functions of transportation and caching. In this paper, we formulate the first flow-layer physical design problem considering DCSA, and propose a top-down synthesis algorithm to generate efficient solutions considering execution efficiency, washing, and resource usage simultaneously. Experimental results demonstrate that the proposed algorithm leads to a shorter execution time, less flow-channel length, and a higher efficiency of on-chip resource utilization for biochemical applications compared with a direct approach to incorporate distributed storage into existing frameworks.

近十年来，基于流动的微流控生物芯片(FBMBs)受到了广泛的关注。在这样一个微米级的平台上，各种生化应用(也称为bioas-say)可以同时自动处理。为了提高执行效率并降低制造成本，该平台可以实现分布式通道存储架构(DCSA)，其中流体样品可以临时缓存在靠近组件的流动通道中。虽然DCSA可以显著提高FBMBs的执行效率，但它需要对流体样本进行精心的布置，使通道能够实现运输和缓存的双重功能。在本文中，我们提出了考虑DCSA的第一个流层物理设计问题，并提出了一种自顶向下的综合算法，以同时考虑执行效率、洗涤和资源使用，生成高效的解决方案。实验结果表明，与直接将分布式存储整合到现有框架中的方法相比，该算法可缩短生化应用的执行时间，缩短流通道长度，提高片上资源利用效率。

{"title":"Physical Synthesis of Flow-Based Microfluidic Biochips Considering Distributed Channel Storage","authors":"Zhisheng Chen, Xing Huang, Wenzhong Guo, Bing Li, Tsung-Yi Ho, Ulf Schlichtmann","doi":"10.23919/DATE.2019.8715269","DOIUrl":"https://doi.org/10.23919/DATE.2019.8715269","url":null,"abstract":"Flow-based microfluidic biochips (FBMBs) have attracted much attention over the past decade. On such a micrometer-scale platform, various biochemical applications, also called bioas-says, can be processed concurrently and automatically. To improve execution efficiency and reduce fabrication cost, a distributed channel-storage architecture (DCSA) can be implemented on this platform, where fluid samples can be cached temporarily in flow channels close to components. Although DCSA can improve the execution efficiency of FBMBs significantly, it requires a careful arrangement of fluid samples to enable the channels to fulfill the dual functions of transportation and caching. In this paper, we formulate the first flow-layer physical design problem considering DCSA, and propose a top-down synthesis algorithm to generate efficient solutions considering execution efficiency, washing, and resource usage simultaneously. Experimental results demonstrate that the proposed algorithm leads to a shorter execution time, less flow-channel length, and a higher efficiency of on-chip resource utilization for biochemical applications compared with a direct approach to incorporate distributed storage into existing frameworks.","PeriodicalId":445778,"journal":{"name":"2019 Design, Automation & Test in Europe Conference & Exhibition (DATE)","volume":"107 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-03-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123246042","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 16

Near-Data Acceleration of Privacy-Preserving Biomarker Search with 3D-Stacked Memory 基于3d堆叠记忆体的近数据加速隐私保护生物标记搜索

2019 Design, Automation & Test in Europe Conference & Exhibition (DATE)

Pub Date : 2019-03-25 DOI: 10.23919/DATE.2019.8715108

A. O. Glova, Itir Akgun, Shuangchen Li, Xing Hu, Yuan Xie

Homomorphic encryption is a promising technology for enabling various privacy-preserving applications such as secure biomarker search. However, current implementations are not practical due to large performance overheads. A homomorphic encryption scheme has recently been proposed that allows bitwise comparison without the computationally-intensive multiplication and bootstrapping operations. Even so, this scheme still suffers from memory-bound performance bottleneck due to large ciphertext expansion. In this work, we propose HEGA, a near-data processing architecture that leverages this scheme with 3D-stacked memory to accelerate privacy-preserving biomarker search. We observe that homomorphic encryption-based search, like other emerging applications, can greatly benefit from the large throughput, capacity, and energy savings of 3D-stacked memory-based near-data processing architectures. Our near-data acceleration solution can speed up biomarker search by 6.3 × with 5.7× energy savings compared to an 8-core Intel Xeon processor.

同态加密是一种很有前途的技术，可以实现各种隐私保护应用，如安全生物标记搜索。然而，由于性能开销大，目前的实现并不实用。最近提出了一种同态加密方案，允许按位比较，而不需要计算密集型的乘法和自举操作。尽管如此，由于密文扩展较大，该方案仍然存在内存受限的性能瓶颈。在这项工作中，我们提出了HEGA，一种近数据处理架构，利用该方案与3d堆叠内存来加速保护隐私的生物标记物搜索。我们观察到，与其他新兴应用程序一样，基于同态加密的搜索可以极大地受益于基于3d堆叠内存的近数据处理架构的大吞吐量、容量和节能。与8核英特尔至强处理器相比，我们的近数据加速解决方案可以将生物标记物搜索速度提高6.3倍，节省5.7倍的能源。

引用次数: 8

Multicore Early Design Stage Guaranteed Performance Estimates for the Space Domain 多核早期设计阶段保证空间域的性能估计

2019 Design, Automation & Test in Europe Conference & Exhibition (DATE)

Pub Date : 2019-03-25 DOI: 10.23919/DATE.2019.8715273

Mikel Fernández, Gabriel Fernandez, J. Abella, F. Cazorla

The ability to produce early guaranteed performance (worst-case execution time) estimates for multicores, i.e. before software from different providers gets integrated onto the same critical system, is pivotal. This helps reducing lately-detected costly-to-handle timing violations. An existing methodology creates ‘copy’ (surrogate) applications from the execution in isolation of each target application. Surrogate applications can be used to upperbound multicore contention delay, and hence WCET estimates in multicores. However, this methodology has only been shown to work on a simulation environment. In this paper we show the work we have carried out to adapt this technology to a real multicore processor for the space domain.

为多核系统提供早期保证性能(最坏情况下的执行时间)估计的能力是至关重要的，例如，在来自不同供应商的软件被集成到同一个关键系统之前。这有助于减少最近检测到的处理代价高昂的时间违规。现有方法在独立于每个目标应用程序的情况下，从执行中创建“复制”(代理)应用程序。代理应用程序可用于上限多核争用延迟，从而在多核中估计WCET。然而，这种方法只在模拟环境中被证明有效。在本文中，我们展示了我们所做的工作，将该技术应用于空间领域的真正的多核处理器。

引用次数: 0

Rebooting Our Computing Models 重新启动我们的计算模型

2019 Design, Automation & Test in Europe Conference & Exhibition (DATE)

Pub Date : 2019-03-25 DOI: 10.23919/DATE.2019.8715167

Patsy Cadareanu, C. N.Reddy, C. G. Almudever, A. Khanna, A. Raychowdhury, S. Datta, K. Bertels, Vijayakrishan Narayanan, M. Ventra, P. Gaillardon

Innovative and new computing paradigms must be considered as we reach the limits of von Neumann computing caused by the growth in necessary data processing. This paper provides an introduction to three emerging computing models that have established themselves as likely post-CMOS and post-von Neumann solutions. The first of these ideas is quantum computing, for which we discuss the challenges and potential of quantum computer architectures. Next, a computational system using intrinsic oscillators is introduced and an example is provided which shows its superiority in comparison to a typical von Neumann computational system. Finally, digital memcomputing using self-organizing logic gates is explained and then discussed as a method for optimization problems and machine learning.

由于必要数据处理的增长，我们达到了冯·诺伊曼计算的极限，因此必须考虑创新和新的计算范式。本文介绍了三种新兴的计算模型，这些模型已经建立起来，可能是后cmos和后冯·诺伊曼解决方案。第一个想法是量子计算，我们讨论了量子计算机架构的挑战和潜力。其次，介绍了一个使用本征振子的计算系统，并举例说明了它与典型冯·诺依曼计算系统相比的优越性。最后，解释了使用自组织逻辑门的数字memcomputing，然后讨论了作为优化问题和机器学习的方法。

引用次数: 5

Prediction-Based Task Migration on S-NUCA Many-Cores 基于预测的S-NUCA多核任务迁移

2019 Design, Automation & Test in Europe Conference & Exhibition (DATE)

Pub Date : 2019-03-25 DOI: 10.23919/DATE.2019.8714974

Martin Rapp, A. Pathania, T. Mitra, J. Henkel

Performance of a task running on a many-core with distributed shared Last-Level Cache (LLC) strongly depends on two factors: the power budget needed to guarantee thermally safe operation and the LLC latency. The task’s thread-to-core mapping determines both the factors. Arrival and departure of tasks on a many-core deployed in an open system can change its state significantly in terms of available cores and power budget. Task migrations can thereupon be used as a tool to keep the many-core operating at the peak performance. Furthermore, the relative impacts of power budget and LLC latency on a task’s performance can change with its different execution phases mandating its migration on-the-fly.We propose the first run-time algorithm PCMig that increases the performance of a many-core with distributed shared LLC by migrating tasks based on their phases and the many-core’s state. PCMig is based on a performance-prediction model that predicts the performance impact of migrations. PCMig results in up to 16 % reduction in the average response time compared to the state-of-the-art.

在具有分布式共享最后一级缓存(LLC)的多核上运行的任务的性能在很大程度上取决于两个因素:保证热安全操作所需的功率预算和LLC延迟。任务的线程到核映射决定了这两个因素。在开放系统中部署的多核上的任务到达和离开可以在可用核和功率预算方面显着改变其状态。因此，任务迁移可以作为一种工具来保持多核运行在最佳性能。此外，功率预算和LLC延迟对任务性能的相对影响可以随着其不同的执行阶段而变化，从而强制其动态迁移。我们提出了第一个运行时算法PCMig，该算法通过基于阶段和多核状态迁移任务来提高具有分布式共享LLC的多核系统的性能。PCMig基于一个性能预测模型，该模型预测迁移对性能的影响。与最先进的技术相比，PCMig的平均响应时间减少了16%。

引用次数: 16

RDF: Reconfigurable Dataflow RDF:可重构数据流

2019 Design, Automation & Test in Europe Conference & Exhibition (DATE)

Pub Date : 2019-03-25 DOI: 10.23919/DATE.2019.8714987

Pascal Fradet, A. Girault, R. Krishnaswamy, X. Nicollin, Arash Shafiei

Dataflow Models of Computation (MoCs) are widely used in embedded systems, including multimedia processing, digital signal processing, telecommunications, and automatic control. In a dataflow MoC, an application is specified as a graph of actors connected by FIFO channels. One of the most popular dataflow MoCs, Synchronous Dataflow (SDF), provides static analyses to guarantee boundedness and liveness, which are key properties for embedded systems. However, SDF (and most of its variants) lacks the capability to express the dynamism needed by modern streaming applications. In particular, the applications mentioned above have a strong need for reconfigurability to accommodate changes in the input data, the control objectives, or the environment.We address this need by proposing a new MoC called Reconfigurable Dataflow (RDF). RDF extends SDF with transformation rules that specify how the topology and actors of the graph may be reconfigured. Starting from an initial RDF graph and a set of transformation rules, an arbitrary number of new RDF graphs can be generated at runtime. A key feature of RDF is that it can be statically analyzed to guarantee that all possible graphs generated at runtime will be consistent and live. We introduce the RDF MoC, describe its associated static analyses, and outline its implementation.

数据流计算模型(moc)广泛应用于嵌入式系统，包括多媒体处理、数字信号处理、电信和自动控制等。在数据流MoC中，应用程序被指定为由FIFO通道连接的参与者的图形。同步数据流(SDF)是最流行的数据流moc之一，它提供静态分析以保证有界性和活动性，这是嵌入式系统的关键属性。然而，SDF(及其大多数变体)缺乏表达现代流应用程序所需的动态性的能力。特别是，上面提到的应用程序强烈需要可重构性，以适应输入数据、控制目标或环境中的更改。我们通过提出一种称为可重构数据流(RDF)的新MoC来解决这一需求。RDF用指定如何重新配置图的拓扑和参与者的转换规则扩展了SDF。从初始RDF图和一组转换规则开始，可以在运行时生成任意数量的新RDF图。RDF的一个关键特性是可以对其进行静态分析，以保证在运行时生成的所有可能的图都是一致且有效的。我们将介绍RDF MoC，描述其相关的静态分析，并概述其实现。

{"title":"RDF: Reconfigurable Dataflow","authors":"Pascal Fradet, A. Girault, R. Krishnaswamy, X. Nicollin, Arash Shafiei","doi":"10.23919/DATE.2019.8714987","DOIUrl":"https://doi.org/10.23919/DATE.2019.8714987","url":null,"abstract":"Dataflow Models of Computation (MoCs) are widely used in embedded systems, including multimedia processing, digital signal processing, telecommunications, and automatic control. In a dataflow MoC, an application is specified as a graph of actors connected by FIFO channels. One of the most popular dataflow MoCs, Synchronous Dataflow (SDF), provides static analyses to guarantee boundedness and liveness, which are key properties for embedded systems. However, SDF (and most of its variants) lacks the capability to express the dynamism needed by modern streaming applications. In particular, the applications mentioned above have a strong need for reconfigurability to accommodate changes in the input data, the control objectives, or the environment.We address this need by proposing a new MoC called Reconfigurable Dataflow (RDF). RDF extends SDF with transformation rules that specify how the topology and actors of the graph may be reconfigured. Starting from an initial RDF graph and a set of transformation rules, an arbitrary number of new RDF graphs can be generated at runtime. A key feature of RDF is that it can be statically analyzed to guarantee that all possible graphs generated at runtime will be consistent and live. We introduce the RDF MoC, describe its associated static analyses, and outline its implementation.","PeriodicalId":445778,"journal":{"name":"2019 Design, Automation & Test in Europe Conference & Exhibition (DATE)","volume":"3 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-03-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131337476","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 6

Empirical Evaluation of IC3-Based Model Checking Techniques on Verilog RTL Designs 基于ic3的Verilog RTL设计模型检验技术的实证评价

2019 Design, Automation & Test in Europe Conference & Exhibition (DATE)

Pub Date : 2019-03-25 DOI: 10.23919/DATE.2019.8715289

Aman Goel, K. Sakallah

IC3-based algorithms have emerged as effective scalable approaches for hardware model checking. In this paper we evaluate six implementations of IC3-based model checkers on a diverse set of publicly-available and proprietary industrial Verilog RTL designs. Four of the six verifiers we examined operate at the bit level and two employ abstraction to take advantage of word-level RTL semantics. Overall, the word-level verifier employing data abstraction outperformed the others, especially on the large industrial designs. The analysis helped us identify several key insights on the techniques underlying these tools, their strengths and weaknesses, differences and commonalities, and opportunities for improvement.

基于ic3的算法已成为硬件模型检查的有效可扩展方法。在本文中，我们在一系列公开可用和专有的工业Verilog RTL设计上评估了六种基于ic3的模型检查器的实现。我们检查的六个验证器中有四个在位级操作，两个使用抽象来利用字级RTL语义。总体而言，采用数据抽象的字级验证器优于其他验证器，特别是在大型工业设计上。分析帮助我们确定了这些工具背后的技术、它们的优势和劣势、差异和共性，以及改进的机会的几个关键见解。

引用次数: 10

Cost/Privacy Co-optimization in Smart Energy Grids 智能电网的成本/隐私协同优化

2019 Design, Automation & Test in Europe Conference & Exhibition (DATE)

Pub Date : 2019-03-25 DOI: 10.23919/DATE.2019.8715181

Alma Pröbstl, Sangyoung Park, S. Steinhorst, S. Chakraborty

The smart energy grid features real-time monitoring of electricity usage such that it can control the generation and distribution of electricity as well as utilize dynamic pricing in response to the demands. For this purpose, smart metering systems continuously monitor the electricity usage of customers, and report it back to the Utility Provider (UP). This raises privacy concerns regarding the undesired exposure of human activity and time-of-use of home appliances. Photovoltaics (PV) and a residential Electrical Energy Storage (EES) have proven to be effective in mitigating the privacy concerns. However, this comes at several costs: Installation of PV and EES, their subsequent aging and the possibly increased electricity cost. We quantify the trade-off between privacy exposure and financial costs by formulating a stochastic dynamic programming problem. Our analysis shows that i) there is a quantifiable trade-off between the financial cost and privacy leakage, ii) proper control of the system is crucial for both metrics, iii) a strategy solely focusing on privacy results in high financial costs, and iv) that for a typical residential setting, the costs for a trade-off solution lie in the range of 600 US$-1700 US$. As the load flattening has a peak shaving effect desirable for UPs, increasing privacy is mutually beneficial for both, customers and UPs.

智能电网的特点是对电力使用情况进行实时监控，从而控制电力的产生和分配，并根据需求利用动态定价。为此，智能计量系统持续监测客户的用电量，并将其报告给公用事业提供商(UP)。这引起了人们对人类活动和家用电器使用时间的不期望暴露的隐私担忧。光伏(PV)和住宅电能存储(EES)已被证明在减轻隐私问题方面是有效的。然而，这需要付出一些代价:安装PV和EES，它们随后的老化以及可能增加的电力成本。我们量化隐私暴露和财务成本之间的权衡通过制定一个随机动态规划问题。我们的分析表明，i)财务成本和隐私泄露之间存在可量化的权衡，ii)对系统的适当控制对这两个指标都至关重要，iii)仅关注隐私的策略会导致高财务成本，iv)对于典型的住宅环境，权衡解决方案的成本在600美元至1700美元之间。由于负载平坦化对UPs具有可取的削峰效果，因此增加隐私对客户和UPs都是互利的。

{"title":"Cost/Privacy Co-optimization in Smart Energy Grids","authors":"Alma Pröbstl, Sangyoung Park, S. Steinhorst, S. Chakraborty","doi":"10.23919/DATE.2019.8715181","DOIUrl":"https://doi.org/10.23919/DATE.2019.8715181","url":null,"abstract":"The smart energy grid features real-time monitoring of electricity usage such that it can control the generation and distribution of electricity as well as utilize dynamic pricing in response to the demands. For this purpose, smart metering systems continuously monitor the electricity usage of customers, and report it back to the Utility Provider (UP). This raises privacy concerns regarding the undesired exposure of human activity and time-of-use of home appliances. Photovoltaics (PV) and a residential Electrical Energy Storage (EES) have proven to be effective in mitigating the privacy concerns. However, this comes at several costs: Installation of PV and EES, their subsequent aging and the possibly increased electricity cost. We quantify the trade-off between privacy exposure and financial costs by formulating a stochastic dynamic programming problem. Our analysis shows that i) there is a quantifiable trade-off between the financial cost and privacy leakage, ii) proper control of the system is crucial for both metrics, iii) a strategy solely focusing on privacy results in high financial costs, and iv) that for a typical residential setting, the costs for a trade-off solution lie in the range of 600 US$-1700 US$. As the load flattening has a peak shaving effect desirable for UPs, increasing privacy is mutually beneficial for both, customers and UPs.","PeriodicalId":445778,"journal":{"name":"2019 Design, Automation & Test in Europe Conference & Exhibition (DATE)","volume":"84 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-03-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122939309","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 2

A Security Architecture for RISC-V based IoT Devices 基于RISC-V的物联网设备安全架构

2019 Design, Automation & Test in Europe Conference & Exhibition (DATE)

Pub Date : 2019-03-25 DOI: 10.23919/DATE.2019.8714822

L. Auer, Christian Skubich, Matthias Hiller

New IoT applications are demanding for more and more performance in embedded devices while their deployment and operation poses strict power constraints. We present the security concept for a customizable Internet of Things (IoT) platform based on the RISC-V ISA and developed by several Fraunhofer Institutes. It integrates a range of peripherals with a scalable computing subsystem as a three dimensional System-in-Package (3D-SiP).The security features aim for a medium security level and target the requirements of the IoT market. Our security architecture extends given implementations to enable secure deployment, operation, and update. Core security features are secure boot, an authenticated watchdog timer, and key management.The Universal Sensor Platform (USeP) SoC is developed for GLOBALFOUNDRIES’ 22FDX technology and aims to provide a platform for Small and Medium-sized Enterprises (SMEs) that typically do not have access to advanced microelectronics and integration know-how, and are therefore limited to Commercial Off-The-Shelf (COTS) products.

新的物联网应用对嵌入式设备的性能要求越来越高，而它们的部署和运行受到严格的功率限制。我们提出了一个基于RISC-V ISA的可定制物联网(IoT)平台的安全概念，该平台由几个弗劳恩霍夫研究所开发。它将一系列外设与可扩展的计算子系统集成为三维系统级封装(3D-SiP)。安全特性的目标是中等安全级别，并针对物联网市场的需求。我们的安全体系结构扩展了给定的实现，以支持安全的部署、操作和更新。核心安全特性是安全引导、经过身份验证的看门狗计时器和密钥管理。通用传感器平台(USeP) SoC是为GLOBALFOUNDRIES的22FDX技术开发的，旨在为中小型企业(sme)提供一个平台，这些企业通常无法获得先进的微电子和集成技术，因此仅限于商用现货(COTS)产品。

引用次数: 5

Enhancing Two-Phase Cooling Efficiency through Thermal-Aware Workload Mapping for Power-Hungry Servers 通过热感知工作负载映射提高两相冷却效率的高能耗服务器

2019 Design, Automation & Test in Europe Conference & Exhibition (DATE)

Pub Date : 2019-03-25 DOI: 10.23919/DATE.2019.8715033

Arman Iranfar, A. Pahlevan, Marina Zapater, David Atienza Alonso

The power density and, consequently, power hungriness of server processors is growing by the day. Traditional air cooling systems fail to cope with such high heat densities, whereas single-phase liquid-cooling still requires high mass flow-rate, high pumping power, and large facility size. On the contrary, in a micro-scale gravity-driven thermosyphon attached on top of a processor, the refrigerant, absorbing the heat, turns into a two-phase mixture. The vapor-liquid mixture exchanges heat with a coolant at the condenser side, turns back to liquid state, and descends thanks to gravity, eliminating the need for pumping power. However, similar to other cooling technologies, thermosyphon efficiency can considerably vary with respect to workload performance requirements and thermal profile, in addition to the platform features, such as packaging and die floorplan. In this work, we first address the workload- and platform-aware design of a two-phase thermosyphon. Then, we propose a thermal-aware workload mapping strategy considering the potential and limitations of a two-phase thermosyphon to further minimize hot spots and spatial thermal gradients. Our experiments, performed on an 8-core Intel Xeon E5 CPU reveal, on average, up to 10°C reduction in thermal hot spots, and 45% reduction in the maximum spatial thermal gradient on the die. Moreover, our design and mapping strategy are able to decrease the chiller cooling power at least by 45%.

服务器处理器的功率密度和耗电量日益增加。传统的空气冷却系统无法应对如此高的热密度，而单相液体冷却仍然需要高质量流量，高泵送功率和大设施尺寸。相反，在微型重力驱动的热虹吸管中，安装在处理器的顶部，制冷剂吸收热量，变成两相混合物。蒸汽-液体混合物与冷凝器侧的冷却剂交换热量，变回液体状态，并由于重力下降，无需泵送动力。然而，与其他冷却技术类似，除了封装和模具平面设计等平台特性外，热虹吸的效率也会因工作负载性能要求和热剖面而有很大差异。在这项工作中，我们首先解决了两相热虹吸管的工作负载和平台感知设计。然后，考虑到两相热虹吸的潜力和局限性，我们提出了一种热感知工作负载映射策略，以进一步减少热点和空间热梯度。我们在8核Intel Xeon E5 CPU上进行的实验显示，平均而言，热热点降低了10°C，并且在芯片上的最大空间热梯度降低了45%。此外，我们的设计和绘图策略能够将冷水机的冷却功率降低至少45%。

{"title":"Enhancing Two-Phase Cooling Efficiency through Thermal-Aware Workload Mapping for Power-Hungry Servers","authors":"Arman Iranfar, A. Pahlevan, Marina Zapater, David Atienza Alonso","doi":"10.23919/DATE.2019.8715033","DOIUrl":"https://doi.org/10.23919/DATE.2019.8715033","url":null,"abstract":"The power density and, consequently, power hungriness of server processors is growing by the day. Traditional air cooling systems fail to cope with such high heat densities, whereas single-phase liquid-cooling still requires high mass flow-rate, high pumping power, and large facility size. On the contrary, in a micro-scale gravity-driven thermosyphon attached on top of a processor, the refrigerant, absorbing the heat, turns into a two-phase mixture. The vapor-liquid mixture exchanges heat with a coolant at the condenser side, turns back to liquid state, and descends thanks to gravity, eliminating the need for pumping power. However, similar to other cooling technologies, thermosyphon efficiency can considerably vary with respect to workload performance requirements and thermal profile, in addition to the platform features, such as packaging and die floorplan. In this work, we first address the workload- and platform-aware design of a two-phase thermosyphon. Then, we propose a thermal-aware workload mapping strategy considering the potential and limitations of a two-phase thermosyphon to further minimize hot spots and spatial thermal gradients. Our experiments, performed on an 8-core Intel Xeon E5 CPU reveal, on average, up to 10°C reduction in thermal hot spots, and 45% reduction in the maximum spatial thermal gradient on the die. Moreover, our design and mapping strategy are able to decrease the chiller cooling power at least by 45%.","PeriodicalId":445778,"journal":{"name":"2019 Design, Automation & Test in Europe Conference & Exhibition (DATE)","volume":"217 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-03-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128934555","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 3

首页上一页

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

2019 Design, Automation & Test in Europe Conference & Exhibition (DATE)

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀