首页 > 最新文献

2018 55th ACM/ESDA/IEEE Design Automation Conference (DAC)最新文献

英文 中文
Wear Leveling for Crossbar Resistive Memory 横条电阻式记忆器的磨损平整
Pub Date : 2018-06-01 DOI: 10.1145/3195970.3196138
Wen Wen, Youtao Zhang, Jun Yang
Resistive Memory (ReRAM) is an emerging non-volatile memory technology that has many advantages over conventional DRAM. ReRAM crossbar has the smallest 4F2 planar cell size and thus is widely adopted for constructing dense memory with large capacity. However, ReRAM crossbar suffers from large sneaky currents and IR drop. To ensure write reliability, ReRAM write drivers choose larger than ideal write voltages, which over-SET/over-RESET many cells at runtime and lead to severely degraded chip lifetime.In this paper, we propose XWL, a novel table based wear leveling scheme for ReRAM crossbars. We study the correlation between write endurance and voltage stress in ReRAM crossbar. By estimating and tracking the effective write stress to different rows at runtime, XWL chooses the ones that are stressed the most to mitigate. Our experimental results show that, on average, XWL improves the ReRAM crossbar lifetime by 324% over the baseline, with only 6.1% performance overhead.
电阻式存储器(ReRAM)是一种新兴的非易失性存储器技术,与传统的DRAM相比具有许多优点。ReRAM交叉棒具有最小的4F2平面单元尺寸,因此被广泛用于构建大容量的密集存储器。然而,ReRAM横条受到大的潜流和红外下降的影响。为了确保写入可靠性,ReRAM写入驱动程序选择大于理想的写入电压,这会在运行时过度设置/过度重置许多单元,并导致芯片寿命严重降低。在本文中,我们提出了一种新的基于表的ram横梁磨损平衡方案XWL。研究了ReRAM交叉棒的写入寿命与电压应力的关系。通过在运行时估计和跟踪对不同行的有效写压力,XWL选择压力最大的行来减轻压力。我们的实验结果表明,平均而言,XWL使ReRAM交叉杆寿命比基线提高了324%,而性能开销仅为6.1%。
{"title":"Wear Leveling for Crossbar Resistive Memory","authors":"Wen Wen, Youtao Zhang, Jun Yang","doi":"10.1145/3195970.3196138","DOIUrl":"https://doi.org/10.1145/3195970.3196138","url":null,"abstract":"Resistive Memory (ReRAM) is an emerging non-volatile memory technology that has many advantages over conventional DRAM. ReRAM crossbar has the smallest 4F2 planar cell size and thus is widely adopted for constructing dense memory with large capacity. However, ReRAM crossbar suffers from large sneaky currents and IR drop. To ensure write reliability, ReRAM write drivers choose larger than ideal write voltages, which over-SET/over-RESET many cells at runtime and lead to severely degraded chip lifetime.In this paper, we propose XWL, a novel table based wear leveling scheme for ReRAM crossbars. We study the correlation between write endurance and voltage stress in ReRAM crossbar. By estimating and tracking the effective write stress to different rows at runtime, XWL chooses the ones that are stressed the most to mitigate. Our experimental results show that, on average, XWL improves the ReRAM crossbar lifetime by 324% over the baseline, with only 6.1% performance overhead.","PeriodicalId":6491,"journal":{"name":"2018 55th ACM/ESDA/IEEE Design Automation Conference (DAC)","volume":"13 1","pages":"1-6"},"PeriodicalIF":0.0,"publicationDate":"2018-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"73041671","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 30
Efficient Winograd-based Convolution Kernel Implementation on Edge Devices 边缘设备上基于winograd卷积核的高效实现
Pub Date : 2018-06-01 DOI: 10.1145/3195970.3196041
A. Xygkis, Lazaros Papadopoulos, D. Moloney, D. Soudris, Sofiane Yous
The implementation of Convolutional Neural Networks on edge Internet of Things (IoT) devices is a significant programming challenge, due to the limited computational resources and the real-time requirements of modern applications. This work focuses on the efficient implementation of the Winograd convolution, based on a set of application-independent and Winograd-specific software techniques for improving the utilization of the edge devices computational resources. The proposed techniques were evaluated in Intel/Movidius Myriad2 platform, using 4 CNNs of various computational requirements. The results show significant performance improvements, up to 54%, over other convolution algorithms.
由于有限的计算资源和现代应用的实时性要求,在边缘物联网(IoT)设备上实现卷积神经网络是一个重大的编程挑战。这项工作的重点是有效实现Winograd卷积,基于一组独立于应用程序和Winograd特定的软件技术,以提高边缘设备计算资源的利用率。在Intel/Movidius Myriad2平台上,使用4个不同计算需求的cnn对所提出的技术进行了评估。结果表明,与其他卷积算法相比,该算法的性能提高了54%。
{"title":"Efficient Winograd-based Convolution Kernel Implementation on Edge Devices","authors":"A. Xygkis, Lazaros Papadopoulos, D. Moloney, D. Soudris, Sofiane Yous","doi":"10.1145/3195970.3196041","DOIUrl":"https://doi.org/10.1145/3195970.3196041","url":null,"abstract":"The implementation of Convolutional Neural Networks on edge Internet of Things (IoT) devices is a significant programming challenge, due to the limited computational resources and the real-time requirements of modern applications. This work focuses on the efficient implementation of the Winograd convolution, based on a set of application-independent and Winograd-specific software techniques for improving the utilization of the edge devices computational resources. The proposed techniques were evaluated in Intel/Movidius Myriad2 platform, using 4 CNNs of various computational requirements. The results show significant performance improvements, up to 54%, over other convolution algorithms.","PeriodicalId":6491,"journal":{"name":"2018 55th ACM/ESDA/IEEE Design Automation Conference (DAC)","volume":"3 1","pages":"1-6"},"PeriodicalIF":0.0,"publicationDate":"2018-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"78854206","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 35
Active Forwarding: Eliminate IOMMU Address Translation for Accelerator-rich Architectures 主动转发:消除IOMMU地址转换的加速器丰富的架构
Pub Date : 2018-06-01 DOI: 10.1145/3195970.3195984
H. Fu, Po-Han Wang, Chia-Lin Yang
Accelerator-rich architectures employ IOMMUs to support unified virtual address, but researches show that they fail to meet the performance and energy requirements of accelerators. Instead of optimizing the speed/energy of IOMMU address translation, this work tackles the issue from a new perspective, eliminating the need for translation with an active forwarding (AF) mechanism that forwards input data of accelerators directly from the CPU cache to the scratchpad memory of the accelerator. Results show that on average, AF can provide 8% performance improvement compared to the state-of-the-art mechanism, hostPageWalk, and reduce 22.1% accelerator power.
富加速器架构采用IOMMUs来支持统一的虚拟地址,但研究表明它们不能满足加速器对性能和能量的要求。这项工作没有优化IOMMU地址转换的速度/能量,而是从一个新的角度解决了这个问题,通过主动转发(AF)机制消除了转换的需要,该机制将加速器的输入数据直接从CPU缓存转发到加速器的刮板存储器。结果表明,与最先进的机制hostPageWalk相比,AF平均可以提供8%的性能提升,并减少22.1%的加速器功率。
{"title":"Active Forwarding: Eliminate IOMMU Address Translation for Accelerator-rich Architectures","authors":"H. Fu, Po-Han Wang, Chia-Lin Yang","doi":"10.1145/3195970.3195984","DOIUrl":"https://doi.org/10.1145/3195970.3195984","url":null,"abstract":"Accelerator-rich architectures employ IOMMUs to support unified virtual address, but researches show that they fail to meet the performance and energy requirements of accelerators. Instead of optimizing the speed/energy of IOMMU address translation, this work tackles the issue from a new perspective, eliminating the need for translation with an active forwarding (AF) mechanism that forwards input data of accelerators directly from the CPU cache to the scratchpad memory of the accelerator. Results show that on average, AF can provide 8% performance improvement compared to the state-of-the-art mechanism, hostPageWalk, and reduce 22.1% accelerator power.","PeriodicalId":6491,"journal":{"name":"2018 55th ACM/ESDA/IEEE Design Automation Conference (DAC)","volume":"43 1","pages":"1-6"},"PeriodicalIF":0.0,"publicationDate":"2018-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"78994053","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Inducing Local Timing Fault Through EM Injection 电磁注入诱发本地定时故障
Pub Date : 2018-06-01 DOI: 10.1145/3195970.3196064
M. Ghodrati, Bilgiday Yuce, S. Gujar, Chinmay Deshpande, L. Nazhandali, P. Schaumont
Electromagnetic fault injection (EMFI) is an efficient class of physical attacks that can compromise the immunity of secure cryptographic algorithms. Despite successful EMFI attacks, the effects of electromagnetic injection (EM) on a processor are not well understood. This paper presents a bottom-up analysis of EMFI effects on a RISC microprocessor. We study these effects at three levels: at the wire-level, at the chip-network level, and at the gate-level considering parameters such as EM-injection location and timing. We conclude that EMFI induces local timing errors implying current timing attack detection and prevention techniques can be adapted to overcome EMFI.
电磁故障注入(EMFI)是一类有效的物理攻击,它可以破坏安全密码算法的免疫力。尽管成功的EMFI攻击,电磁注入(EM)对处理器的影响还不是很清楚。本文从下至上的角度分析了EMFI对RISC微处理器的影响。我们在三个层面研究这些影响:在导线层面,在芯片网络层面,在考虑电磁注入位置和时间等参数的栅极层面。我们得出结论,EMFI会引起局部定时错误,这意味着当前的定时攻击检测和预防技术可以适应克服EMFI。
{"title":"Inducing Local Timing Fault Through EM Injection","authors":"M. Ghodrati, Bilgiday Yuce, S. Gujar, Chinmay Deshpande, L. Nazhandali, P. Schaumont","doi":"10.1145/3195970.3196064","DOIUrl":"https://doi.org/10.1145/3195970.3196064","url":null,"abstract":"Electromagnetic fault injection (EMFI) is an efficient class of physical attacks that can compromise the immunity of secure cryptographic algorithms. Despite successful EMFI attacks, the effects of electromagnetic injection (EM) on a processor are not well understood. This paper presents a bottom-up analysis of EMFI effects on a RISC microprocessor. We study these effects at three levels: at the wire-level, at the chip-network level, and at the gate-level considering parameters such as EM-injection location and timing. We conclude that EMFI induces local timing errors implying current timing attack detection and prevention techniques can be adapted to overcome EMFI.","PeriodicalId":6491,"journal":{"name":"2018 55th ACM/ESDA/IEEE Design Automation Conference (DAC)","volume":"17 6 1","pages":"1-6"},"PeriodicalIF":0.0,"publicationDate":"2018-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"78493591","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 12
DAC 2018 Committees
Pub Date : 2018-06-01 DOI: 10.1109/dac.2018.8465920
{"title":"DAC 2018 Committees","authors":"","doi":"10.1109/dac.2018.8465920","DOIUrl":"https://doi.org/10.1109/dac.2018.8465920","url":null,"abstract":"","PeriodicalId":6491,"journal":{"name":"2018 55th ACM/ESDA/IEEE Design Automation Conference (DAC)","volume":"98 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2018-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"81113800","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
DAC 2018 TOC
Pub Date : 2018-06-01 DOI: 10.1109/dac.2018.8465776
{"title":"DAC 2018 TOC","authors":"","doi":"10.1109/dac.2018.8465776","DOIUrl":"https://doi.org/10.1109/dac.2018.8465776","url":null,"abstract":"","PeriodicalId":6491,"journal":{"name":"2018 55th ACM/ESDA/IEEE Design Automation Conference (DAC)","volume":"28 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2018-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"81148709","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
DAC 2018 Commentary
Pub Date : 2018-06-01 DOI: 10.1109/dac.2018.8465864
{"title":"DAC 2018 Commentary","authors":"","doi":"10.1109/dac.2018.8465864","DOIUrl":"https://doi.org/10.1109/dac.2018.8465864","url":null,"abstract":"","PeriodicalId":6491,"journal":{"name":"2018 55th ACM/ESDA/IEEE Design Automation Conference (DAC)","volume":"29 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2018-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"81771840","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Thermal-aware Optimizations of ReRAM-based Neuromorphic Computing Systems 基于reram的神经形态计算系统的热感知优化
Pub Date : 2018-06-01 DOI: 10.1145/3195970.3196128
Majed Valad Beigi, G. Memik
ReRAM-based systems are attractive implementation alternatives for neuromorphic computing because of their high speed and low design cost. In this work, we investigate the impact of temperature on the ReRAM-based neuromorphic architectures and show how varying temperatures have a negative impact on the computation accuracy. We first classify ReRAM crossbar cells based on their temperature and identify effective neural network weights that have large impacts on network outputs. Then, we propose a novel temperature-aware training and mapping scheme to prevent the effective weights from being mapped to hot cells to restore the system accuracy. Evaluation results for a two-layer neural network show that our scheme can improve the system accuracy by up to 39.2%.
基于reram的系统由于其高速度和低设计成本而成为神经形态计算的有吸引力的实现方案。在这项工作中,我们研究了温度对基于reram的神经形态架构的影响,并展示了温度变化如何对计算精度产生负面影响。我们首先根据温度对ReRAM交叉栏单元进行分类,并确定对网络输出有较大影响的有效神经网络权重。然后,我们提出了一种新的温度感知训练和映射方案,以防止有效权值被映射到热单元,以恢复系统的精度。对两层神经网络的评估结果表明,该方案可将系统准确率提高39.2%。
{"title":"Thermal-aware Optimizations of ReRAM-based Neuromorphic Computing Systems","authors":"Majed Valad Beigi, G. Memik","doi":"10.1145/3195970.3196128","DOIUrl":"https://doi.org/10.1145/3195970.3196128","url":null,"abstract":"ReRAM-based systems are attractive implementation alternatives for neuromorphic computing because of their high speed and low design cost. In this work, we investigate the impact of temperature on the ReRAM-based neuromorphic architectures and show how varying temperatures have a negative impact on the computation accuracy. We first classify ReRAM crossbar cells based on their temperature and identify effective neural network weights that have large impacts on network outputs. Then, we propose a novel temperature-aware training and mapping scheme to prevent the effective weights from being mapped to hot cells to restore the system accuracy. Evaluation results for a two-layer neural network show that our scheme can improve the system accuracy by up to 39.2%.","PeriodicalId":6491,"journal":{"name":"2018 55th ACM/ESDA/IEEE Design Automation Conference (DAC)","volume":"84 1","pages":"1-6"},"PeriodicalIF":0.0,"publicationDate":"2018-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"82104622","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 30
Power-based Side-Channel Instruction-level Disassembler 基于功率的侧信道指令级反汇编器
Pub Date : 2018-06-01 DOI: 10.1145/3195970.3196094
Jungmin Park, Xiaolin Xu, Yier Jin, Domenic Forte, M. Tehranipoor
Modern embedded computing devices are vulnerable against mal-ware and software piracy due to insufficient security scrutiny and the complications of continuous patching. To detect malicious activity as well as protecting the integrity of executable software, it is necessary to monitor the operation of such devices. In this paper, we propose a disassembler based on power-based side-channel to analyze the real-time operation of embedded systems at instruction-level granularity. The proposed disassembler obtains templates from an original device (e.g., IoT home security system, smart thermostat, etc.) and utilizes machine learning algorithms to uniquely identify instructions executed on the device. The feature selection using Kullback-Leibler (KL) divergence and the dimensional reduction using PCA in the time-frequency domain are proposed to increase the identification accuracy. Moreover, a hierarchical classification framework is proposed to reduce the computational complexity associated with large instruction sets. In addition, covariate shifts caused by different environmental measurements and device-to-device variations are minimized by our covariate shift adaptation technique. We implement this disassembler on an AVR 8-bit microcontroller. Experimental results demonstrate that our proposed disassembler can recognize test instructions including register names with a success rate no lower than 99.03% with quadratic discriminant analysis (QDA).
由于安全审查不足和持续修补的复杂性,现代嵌入式计算设备容易受到恶意软件和软件盗版的攻击。为了检测恶意活动以及保护可执行软件的完整性,有必要监视这些设备的操作。本文提出了一种基于功率侧信道的反汇编器,在指令级粒度上分析嵌入式系统的实时运行。所提出的反汇编器从原始设备(例如,IoT家庭安全系统,智能恒温器等)获取模板,并利用机器学习算法来唯一识别在设备上执行的指令。提出了利用Kullback-Leibler (KL)散度进行特征选择和利用PCA在时频域进行降维的方法来提高识别精度。此外,为了降低大型指令集的计算复杂度,提出了一种分层分类框架。此外,由不同的环境测量和设备到设备的变化引起的协变量位移通过我们的协变量位移适应技术最小化。我们在AVR 8位微控制器上实现了该反汇编器。实验结果表明,采用二次判别分析(quadratic discriminant analysis, QDA),该反汇编器可以识别包含寄存器名的测试指令,成功率不低于99.03%。
{"title":"Power-based Side-Channel Instruction-level Disassembler","authors":"Jungmin Park, Xiaolin Xu, Yier Jin, Domenic Forte, M. Tehranipoor","doi":"10.1145/3195970.3196094","DOIUrl":"https://doi.org/10.1145/3195970.3196094","url":null,"abstract":"Modern embedded computing devices are vulnerable against mal-ware and software piracy due to insufficient security scrutiny and the complications of continuous patching. To detect malicious activity as well as protecting the integrity of executable software, it is necessary to monitor the operation of such devices. In this paper, we propose a disassembler based on power-based side-channel to analyze the real-time operation of embedded systems at instruction-level granularity. The proposed disassembler obtains templates from an original device (e.g., IoT home security system, smart thermostat, etc.) and utilizes machine learning algorithms to uniquely identify instructions executed on the device. The feature selection using Kullback-Leibler (KL) divergence and the dimensional reduction using PCA in the time-frequency domain are proposed to increase the identification accuracy. Moreover, a hierarchical classification framework is proposed to reduce the computational complexity associated with large instruction sets. In addition, covariate shifts caused by different environmental measurements and device-to-device variations are minimized by our covariate shift adaptation technique. We implement this disassembler on an AVR 8-bit microcontroller. Experimental results demonstrate that our proposed disassembler can recognize test instructions including register names with a success rate no lower than 99.03% with quadratic discriminant analysis (QDA).","PeriodicalId":6491,"journal":{"name":"2018 55th ACM/ESDA/IEEE Design Automation Conference (DAC)","volume":"7 1","pages":"1-6"},"PeriodicalIF":0.0,"publicationDate":"2018-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"82251966","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 57
Optimized I/O Determinism for Emerging NVM-based NVMe SSD in an Enterprise System 企业级系统新兴NVMe SSD I/O确定性优化
Pub Date : 2018-06-01 DOI: 10.1145/3195970.3196085
Seonbong Kim, Joon-Sung Yang
Non-volatile memory express (NVMe) over peripheral component interconnect express (PCIe) has been adopted in the storage system to provide low latency and high throughput. NVMe allows a host system to reduce latency because it offers a high parallel operation and optimized command processing flow. In addition, an introduction of emerging non-volatile memory (NVM) significantly reduces the solid state drive (SSD) latency. The latency reduction in the host system and SSD makes a relative ratio of PCIe fabric latency to total I/O latency considerably grow. Therefore, this paper proposes a novel I/O optimization method using the PCIe feature, virtual channel. Unlike conventional approaches with the same priority data path, based on SSD's internal latency, an emerging NVM-based NVMe SSD with the proposed architecture selects a prioritized virtual channel to provide deterministic I/O latency. Experimental results show that the proposed method with phase-change memory (PCM) SSD improves I/O determinism by processing 45~74% more commands within the predictable I/O latency than a conventional PCM SSD.
存储系统采用NVMe (Non-volatile memory express) over PCIe (peripheral component interconnect express)技术,提供低时延和高吞吐量。NVMe允许主机系统减少延迟,因为它提供了高并行操作和优化的命令处理流。此外,新兴的非易失性内存(NVM)的引入显著降低了固态驱动器(SSD)的延迟。主机系统和SSD的延迟减少使得PCIe结构延迟与总I/O延迟的相对比率大大增加。因此,本文提出了一种利用PCIe特性的新型I/O优化方法——虚拟通道。与具有相同优先级数据路径的传统方法不同,基于SSD的内部延迟,新兴的基于nvvm的NVMe SSD采用所提出的架构选择优先虚拟通道来提供确定的I/O延迟。实验结果表明,在可预测的I/O延迟内,该方法比传统的PCM SSD多处理45~74%的命令,提高了I/O确定性。
{"title":"Optimized I/O Determinism for Emerging NVM-based NVMe SSD in an Enterprise System","authors":"Seonbong Kim, Joon-Sung Yang","doi":"10.1145/3195970.3196085","DOIUrl":"https://doi.org/10.1145/3195970.3196085","url":null,"abstract":"Non-volatile memory express (NVMe) over peripheral component interconnect express (PCIe) has been adopted in the storage system to provide low latency and high throughput. NVMe allows a host system to reduce latency because it offers a high parallel operation and optimized command processing flow. In addition, an introduction of emerging non-volatile memory (NVM) significantly reduces the solid state drive (SSD) latency. The latency reduction in the host system and SSD makes a relative ratio of PCIe fabric latency to total I/O latency considerably grow. Therefore, this paper proposes a novel I/O optimization method using the PCIe feature, virtual channel. Unlike conventional approaches with the same priority data path, based on SSD's internal latency, an emerging NVM-based NVMe SSD with the proposed architecture selects a prioritized virtual channel to provide deterministic I/O latency. Experimental results show that the proposed method with phase-change memory (PCM) SSD improves I/O determinism by processing 45~74% more commands within the predictable I/O latency than a conventional PCM SSD.","PeriodicalId":6491,"journal":{"name":"2018 55th ACM/ESDA/IEEE Design Automation Conference (DAC)","volume":"123 1 1","pages":"1-6"},"PeriodicalIF":0.0,"publicationDate":"2018-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"82877180","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 14
期刊
2018 55th ACM/ESDA/IEEE Design Automation Conference (DAC)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1