首页 > 最新文献

2017 30th IEEE International System-on-Chip Conference (SOCC)最新文献

英文 中文
A 2.4-GHz dual-mode resizing power amplifier with a constant conductance output matching 具有恒电导输出匹配的2.4 ghz双模调整尺寸功率放大器
Pub Date : 2017-09-01 DOI: 10.1109/SOCC.2017.8226053
Wei-Lun Ou, Yu-Kai Tsai, Po-Yen Tseng, Liang-Hung Lu
A 2.4-GHz dual-mode power amplifier with transistor resizing is proposed. The proposed technique keeps the conductance for optimum power-matching constant in both modes. By introducing transformers and capacitors, the 50-ohm load is matched to near-optimal impedances for size-scaling MOS transistors. Using a 0.18-pm CMOS process, the proposed PA demonstrates a PAE enhancement of 2.7×/2.3× respectively at 6.5-dB/9-dB transmitting power back-off.
提出了一种可调整晶体管尺寸的2.4 ghz双模功率放大器。该方法在两种模式下均保持电导恒定以达到最佳功率匹配。通过引入变压器和电容器,50欧姆负载匹配到接近最佳阻抗的尺寸MOS晶体管。采用0.18 pm CMOS工艺,在6.5 db /9 db发射功率下,所提出的PA的PAE分别提高了2.7倍/2.3倍。
{"title":"A 2.4-GHz dual-mode resizing power amplifier with a constant conductance output matching","authors":"Wei-Lun Ou, Yu-Kai Tsai, Po-Yen Tseng, Liang-Hung Lu","doi":"10.1109/SOCC.2017.8226053","DOIUrl":"https://doi.org/10.1109/SOCC.2017.8226053","url":null,"abstract":"A 2.4-GHz dual-mode power amplifier with transistor resizing is proposed. The proposed technique keeps the conductance for optimum power-matching constant in both modes. By introducing transformers and capacitors, the 50-ohm load is matched to near-optimal impedances for size-scaling MOS transistors. Using a 0.18-pm CMOS process, the proposed PA demonstrates a PAE enhancement of 2.7×/2.3× respectively at 6.5-dB/9-dB transmitting power back-off.","PeriodicalId":366264,"journal":{"name":"2017 30th IEEE International System-on-Chip Conference (SOCC)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130114583","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
A multi-format floating-point multiplier for power-efficient operations 一种多格式浮点乘法器,用于节能操作
Pub Date : 2017-09-01 DOI: 10.1109/SOCC.2017.8226076
A. Nannarelli
In this work, we present a radix-16 multi-format multiplier to multiply 64-bit unsigned integer operands, double-precision and single-precision operands. The multiplier is sectioned in two lanes such that two single-precision multiplications can be computed in parallel. Radix-16 is chosen for the reduced number of partial products and the resulting power savings. The experimental results show that high power efficiency is obtained by issuing two single-precision multiplications per cycle. Moreover, by converting the double-precision numbers which fit to single-precision, further energy can be saved.
在这项工作中,我们提出了一个基数-16多格式乘法器,用于乘以64位无符号整数操作数,双精度和单精度操作数。乘法器分成两道,使得两个单精度乘法可以并行计算。选择基数-16是为了减少部分产品的数量和由此产生的电力节省。实验结果表明,通过每个周期进行两次单精度乘法,可以获得较高的功率效率。此外,通过将适合的双精度数转换为单精度数,可以进一步节省能量。
{"title":"A multi-format floating-point multiplier for power-efficient operations","authors":"A. Nannarelli","doi":"10.1109/SOCC.2017.8226076","DOIUrl":"https://doi.org/10.1109/SOCC.2017.8226076","url":null,"abstract":"In this work, we present a radix-16 multi-format multiplier to multiply 64-bit unsigned integer operands, double-precision and single-precision operands. The multiplier is sectioned in two lanes such that two single-precision multiplications can be computed in parallel. Radix-16 is chosen for the reduced number of partial products and the resulting power savings. The experimental results show that high power efficiency is obtained by issuing two single-precision multiplications per cycle. Moreover, by converting the double-precision numbers which fit to single-precision, further energy can be saved.","PeriodicalId":366264,"journal":{"name":"2017 30th IEEE International System-on-Chip Conference (SOCC)","volume":"74 31","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134196947","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
An ultra high density pseudo dual-port SRAM in 16nm FINFET process for graphics processors 一种用于图形处理器的16nm FINFET工艺的超高密度伪双端口SRAM
Pub Date : 2017-09-01 DOI: 10.1109/SOCC.2017.8225996
V. Nautiyal, G. Singla, Lalita Gupta, S. Dwivedi, M. Kinkade
In recent times, graphic, audio and video definition has improved due to significant advancement in complex algorithms and video processing techniques. These techniques require heterogeneous and multi-core processors because of their complex computation abilities. Dual-port memories have become an essential component of CPUs because multi-core processors require significant data transfer. However, dual-port memories come at a cost of increased area and leakage. In this paper, an ultra-high-density dual-port SRAM (RADPUHD) architecture is proposed which addresses area and leakage challenges. It is designed and fabricated in 16nm technology. This paper presents use of a single-port bitcell to achieve functionality of dual-port SRAM thus improving area efficiency. The use of latches for Port B signals instead of full flip-flops further reduces area. The proposed design is a bolt-on wrapper around a 6T single-port SRAM. This design achieved a memory density of 8.1Mb/mm2 chip area and achieved 53% area savings and approximately 60% leakage savings when compared to an 8T dual-port SRAM that was also fabricated in 16nm. Silicon results show that the proposed circuit is functional down to a minimum operating voltage of 520mV.
近年来,由于复杂算法和视频处理技术的显著进步,图形、音频和视频清晰度得到了改善。这些技术需要异构和多核处理器,因为它们具有复杂的计算能力。由于多核处理器需要大量的数据传输,双端口存储器已成为cpu的重要组成部分。然而,双端口存储器的代价是增加面积和泄漏。本文提出了一种解决面积和泄漏问题的超高密度双端口SRAM (RADPUHD)架构。它采用16nm技术设计和制造。本文提出使用单端口位元来实现双端口SRAM的功能,从而提高面积效率。使用锁存器的端口B信号,而不是完整的触发器进一步减少面积。提出的设计是围绕6T单端口SRAM的螺栓连接包装。该设计实现了8.1Mb/mm2芯片面积的内存密度,与同样采用16nm工艺制造的8T双端口SRAM相比,节省了53%的面积和大约60%的泄漏。硅测试结果表明,该电路的工作电压可低至520mV。
{"title":"An ultra high density pseudo dual-port SRAM in 16nm FINFET process for graphics processors","authors":"V. Nautiyal, G. Singla, Lalita Gupta, S. Dwivedi, M. Kinkade","doi":"10.1109/SOCC.2017.8225996","DOIUrl":"https://doi.org/10.1109/SOCC.2017.8225996","url":null,"abstract":"In recent times, graphic, audio and video definition has improved due to significant advancement in complex algorithms and video processing techniques. These techniques require heterogeneous and multi-core processors because of their complex computation abilities. Dual-port memories have become an essential component of CPUs because multi-core processors require significant data transfer. However, dual-port memories come at a cost of increased area and leakage. In this paper, an ultra-high-density dual-port SRAM (RADPUHD) architecture is proposed which addresses area and leakage challenges. It is designed and fabricated in 16nm technology. This paper presents use of a single-port bitcell to achieve functionality of dual-port SRAM thus improving area efficiency. The use of latches for Port B signals instead of full flip-flops further reduces area. The proposed design is a bolt-on wrapper around a 6T single-port SRAM. This design achieved a memory density of 8.1Mb/mm2 chip area and achieved 53% area savings and approximately 60% leakage savings when compared to an 8T dual-port SRAM that was also fabricated in 16nm. Silicon results show that the proposed circuit is functional down to a minimum operating voltage of 520mV.","PeriodicalId":366264,"journal":{"name":"2017 30th IEEE International System-on-Chip Conference (SOCC)","volume":"546 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123918708","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
A graph based synthesis procedure for linear analog function 一个基于图的线性模拟函数的合成程序
Pub Date : 2017-09-01 DOI: 10.1109/SOCC.2017.8226071
M. Bhanja, B. Ray
This paper presents a graph based synthesis procedure for reconfigurable linear analog function. A n-level weighted binary tree structure has been used to represent nth order linear network. Root of the binary tree has two children nodes with weights of first order lowpass filter (LPF) and first order highpass filter (HPF). Traversing through each possible path in the tree implements one filter type. The level 2 binary tree has been transformed to a hexagonal closed graph. This conversion has been done to map the proposed synthesis procedure into field programmable analog array (FPAA), which demonstrates the reusuability and programmability. First order LPF and HPF has been used as basic building blocks, whereas a hexagonal structure is denoted as a configurable analog block (CAB) of the FPAA. The hexagonal topology of the FPAA gives the versatile connectivity between two adjacent CABS of the FPAA. Performance has been verified through SPICE simulations.
提出了一种基于图的可重构线性模拟函数的合成方法。采用n级加权二叉树结构表示n阶线性网络。二叉树的根有两个子节点,权值分别为一阶低通滤波器(LPF)和一阶高通滤波器(HPF)。遍历树中的每个可能路径实现一种过滤器类型。将二级二叉树转化为六边形闭图。这种转换已经完成了将所提出的合成过程映射到现场可编程模拟阵列(FPAA),证明了可重用性和可编程性。一阶LPF和HPF被用作FPAA的基本构件,而六边形结构被表示为FPAA的可配置模拟块(CAB)。FPAA的六边形拓扑结构提供了FPAA两个相邻cab之间的通用连接。通过SPICE仿真验证了其性能。
{"title":"A graph based synthesis procedure for linear analog function","authors":"M. Bhanja, B. Ray","doi":"10.1109/SOCC.2017.8226071","DOIUrl":"https://doi.org/10.1109/SOCC.2017.8226071","url":null,"abstract":"This paper presents a graph based synthesis procedure for reconfigurable linear analog function. A n-level weighted binary tree structure has been used to represent nth order linear network. Root of the binary tree has two children nodes with weights of first order lowpass filter (LPF) and first order highpass filter (HPF). Traversing through each possible path in the tree implements one filter type. The level 2 binary tree has been transformed to a hexagonal closed graph. This conversion has been done to map the proposed synthesis procedure into field programmable analog array (FPAA), which demonstrates the reusuability and programmability. First order LPF and HPF has been used as basic building blocks, whereas a hexagonal structure is denoted as a configurable analog block (CAB) of the FPAA. The hexagonal topology of the FPAA gives the versatile connectivity between two adjacent CABS of the FPAA. Performance has been verified through SPICE simulations.","PeriodicalId":366264,"journal":{"name":"2017 30th IEEE International System-on-Chip Conference (SOCC)","volume":"86 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129159990","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
Application specific component-service-aware trace generation on Android-QEMU Android-QEMU上特定于应用程序的组件服务感知跟踪生成
Pub Date : 2017-09-01 DOI: 10.1109/SOCC.2017.8226069
H. Wei, C. King, Bhaskar Das, Mei-Chiao Peng, Chen-Chieh Wang, Hsun-Lun Huang, Juin-Ming Lu
Programmers depend on virtual platforms, such as Android-QEMU, to build and test their applications as well as system software for the ease of development efforts. It is easy to add tracing modules to the virtual platforms to dump the execution trace of the guest system, which can then be used to estimate and evaluate the performance of alternative system designs. However, tracing in virtual platforms are performed at the architecture level, which makes it difficult to generate traces for specific applications due to the lack of high-level software information. This problem becomes even more challenging for Android systems, which use component-based design strategies, where applications request services from other components. In this paper, we propose a novel Android-QEMU tracing system, which follows the invocations among the service components, generates component-service-aware traces only for specific applications. The evaluation results show that the proposed system improves 152% in simulation time and saves 33% of storage space in average compared to the static QEMU-Tracer.
程序员依赖虚拟平台,如Android-QEMU,来构建和测试他们的应用程序以及系统软件,以简化开发工作。很容易向虚拟平台添加跟踪模块,以转储客户机系统的执行跟踪,然后可以使用这些跟踪来估计和评估备选系统设计的性能。然而,虚拟平台中的跟踪是在体系结构级别执行的,这使得由于缺乏高级软件信息而难以为特定应用程序生成跟踪。对于使用基于组件的设计策略的Android系统来说,这个问题变得更加具有挑战性,其中应用程序从其他组件请求服务。在本文中,我们提出了一种新的Android-QEMU跟踪系统,该系统跟踪服务组件之间的调用,仅为特定的应用程序生成组件服务感知跟踪。评估结果表明,与静态QEMU-Tracer相比,该系统的仿真时间提高了152%,平均节省了33%的存储空间。
{"title":"Application specific component-service-aware trace generation on Android-QEMU","authors":"H. Wei, C. King, Bhaskar Das, Mei-Chiao Peng, Chen-Chieh Wang, Hsun-Lun Huang, Juin-Ming Lu","doi":"10.1109/SOCC.2017.8226069","DOIUrl":"https://doi.org/10.1109/SOCC.2017.8226069","url":null,"abstract":"Programmers depend on virtual platforms, such as Android-QEMU, to build and test their applications as well as system software for the ease of development efforts. It is easy to add tracing modules to the virtual platforms to dump the execution trace of the guest system, which can then be used to estimate and evaluate the performance of alternative system designs. However, tracing in virtual platforms are performed at the architecture level, which makes it difficult to generate traces for specific applications due to the lack of high-level software information. This problem becomes even more challenging for Android systems, which use component-based design strategies, where applications request services from other components. In this paper, we propose a novel Android-QEMU tracing system, which follows the invocations among the service components, generates component-service-aware traces only for specific applications. The evaluation results show that the proposed system improves 152% in simulation time and saves 33% of storage space in average compared to the static QEMU-Tracer.","PeriodicalId":366264,"journal":{"name":"2017 30th IEEE International System-on-Chip Conference (SOCC)","volume":"8 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123357015","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Cache attacks and countermeasures for NTRUEncrypt on MPSoCs: Post-quantum resistance for the IoT mpsoc上NTRUEncrypt的缓存攻击及对策:物联网的后量子阻力
Pub Date : 2017-09-01 DOI: 10.1109/SOCC.2017.8226020
Martha Johanna Sepúlveda, A. Zankl, Oliver Mischke
Public-key cryptography (PKC), widely used to protect communication in the Internet of Things (IoT), is the basis for establishing secured communication channels between multiple parties. The foreseeable breakthrough of quantum computers represents a risk for many PKC ecosystems. Almost all approaches in use today rely on the hardness of factoring large integers or computing (elliptic-curve) discrete logarithms. It is known that cryptography based on these problems can be broken in polynomial time by Shors algorithm, once a large enough quantum computer is built. In order to prepare for such an event, the integration of quantum-resistant cryptography on devices operating in the IoT is mandatory to achieve long-term security. Due to their limited resources, tight performance requirements and long-term life-cycles, this is especially challenging for Multi-Processor System-on-Chips (MPSoCs) operating in this context. At the same time, it must be provided that well-known implementation attacks, such as those targeting a cipher's execution time or its use of the processor cache, are inhibited, as they've successfully been used to attack cryptosystems in the pre-quantum era. Hence, this work presents an analysis of the security-critical polynomial multiplication routine within the NTRU algorithm and its susceptibility to timing and cache attacks. We also propose two different countermeasures to harden systems with or without caches against said attacks, and include the evaluation of the respective overheads. We demonstrate that security against timing and cache attacks can be achieved with reasonable overheads depending on the chosen parameters of NTRU.
公钥加密(Public-key cryptography, PKC)是在多方之间建立安全通信通道的基础,在物联网(IoT)中广泛用于保护通信。量子计算机可预见的突破对许多PKC生态系统来说是一种风险。目前使用的几乎所有方法都依赖于分解大整数或计算(椭圆曲线)离散对数的难度。众所周知,一旦建立了足够大的量子计算机,基于这些问题的密码学可以在多项式时间内被Shors算法打破。为了应对此类事件,必须在物联网中运行的设备上集成抗量子加密技术,以实现长期安全。由于其有限的资源、严格的性能要求和长期的生命周期,这对于在这种情况下运行的多处理器片上系统(mpsoc)来说尤其具有挑战性。与此同时,必须提供众所周知的实现攻击,例如针对密码执行时间或处理器缓存使用的攻击,被抑制,因为它们已经成功地用于攻击前量子时代的密码系统。因此,这项工作提出了NTRU算法中的安全关键多项式乘法例程及其对定时和缓存攻击的敏感性的分析。我们还提出了两种不同的对策来强化系统(有或没有缓存)以抵御上述攻击,并包括对各自开销的评估。我们证明,根据所选择的NTRU参数,可以通过合理的开销来实现对定时和缓存攻击的安全性。
{"title":"Cache attacks and countermeasures for NTRUEncrypt on MPSoCs: Post-quantum resistance for the IoT","authors":"Martha Johanna Sepúlveda, A. Zankl, Oliver Mischke","doi":"10.1109/SOCC.2017.8226020","DOIUrl":"https://doi.org/10.1109/SOCC.2017.8226020","url":null,"abstract":"Public-key cryptography (PKC), widely used to protect communication in the Internet of Things (IoT), is the basis for establishing secured communication channels between multiple parties. The foreseeable breakthrough of quantum computers represents a risk for many PKC ecosystems. Almost all approaches in use today rely on the hardness of factoring large integers or computing (elliptic-curve) discrete logarithms. It is known that cryptography based on these problems can be broken in polynomial time by Shors algorithm, once a large enough quantum computer is built. In order to prepare for such an event, the integration of quantum-resistant cryptography on devices operating in the IoT is mandatory to achieve long-term security. Due to their limited resources, tight performance requirements and long-term life-cycles, this is especially challenging for Multi-Processor System-on-Chips (MPSoCs) operating in this context. At the same time, it must be provided that well-known implementation attacks, such as those targeting a cipher's execution time or its use of the processor cache, are inhibited, as they've successfully been used to attack cryptosystems in the pre-quantum era. Hence, this work presents an analysis of the security-critical polynomial multiplication routine within the NTRU algorithm and its susceptibility to timing and cache attacks. We also propose two different countermeasures to harden systems with or without caches against said attacks, and include the evaluation of the respective overheads. We demonstrate that security against timing and cache attacks can be achieved with reasonable overheads depending on the chosen parameters of NTRU.","PeriodicalId":366264,"journal":{"name":"2017 30th IEEE International System-on-Chip Conference (SOCC)","volume":"19 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129153443","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 10
The triangle of power density, circuit degradation and reliability 功率密度、电路退化和可靠性的三角关系
Pub Date : 2017-07-01 DOI: 10.1109/MC.2017.214
J. Henkel, P. Montuschi
Power density will stay a major challenge for the foreseeable future. Despite orders-of-magnitude-improved efficiency, power consumption per area is sharply rising, mainly due to the limits of voltage scaling. To investigate the physical implications of high power densities, we must distinguish between peak and average temperatures and temporal and spatial thermal gradients because they trigger circuit-aging mechanisms and eventually jeopardize the reliability of an on-chip system. The talk starts by presenting some basic interdependencies in the triangle of power density, circuit degradation and reliability and continues with some solutions to mitigate the problem via, among others, power density-aware resource management and efficient power budgeting.
在可预见的未来,功率密度仍将是一个主要挑战。尽管效率有了数量级的提高,但单位面积的功耗却急剧上升,这主要是由于电压缩放的限制。为了研究高功率密度的物理影响,我们必须区分峰值和平均温度以及时间和空间热梯度,因为它们会触发电路老化机制,并最终危及片上系统的可靠性。讲座首先介绍了功率密度、电路退化和可靠性三角关系中的一些基本相互依赖关系,并继续介绍了通过功率密度感知资源管理和有效的功率预算来缓解问题的一些解决方案。
{"title":"The triangle of power density, circuit degradation and reliability","authors":"J. Henkel, P. Montuschi","doi":"10.1109/MC.2017.214","DOIUrl":"https://doi.org/10.1109/MC.2017.214","url":null,"abstract":"Power density will stay a major challenge for the foreseeable future. Despite orders-of-magnitude-improved efficiency, power consumption per area is sharply rising, mainly due to the limits of voltage scaling. To investigate the physical implications of high power densities, we must distinguish between peak and average temperatures and temporal and spatial thermal gradients because they trigger circuit-aging mechanisms and eventually jeopardize the reliability of an on-chip system. The talk starts by presenting some basic interdependencies in the triangle of power density, circuit degradation and reliability and continues with some solutions to mitigate the problem via, among others, power density-aware resource management and efficient power budgeting.","PeriodicalId":366264,"journal":{"name":"2017 30th IEEE International System-on-Chip Conference (SOCC)","volume":"81 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131970645","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
MobiCore: An adaptive hybrid approach for power-efficient CPU management on Android devices MobiCore:一种用于Android设备的高效CPU管理的自适应混合方法
Pub Date : 2017-02-21 DOI: 10.1109/SOCC.2017.8226044
Lucie Broyde, Kent W. Nixon, Xiang Chen, Hai Helen Li, Yiran Chen
Smartphones are becoming essential devices used for various types of applications in our daily life. To satisfy the ever-increasing performance requirement, the number of CPU cores in a phone keeps growing, which imposes a great impact on its power consumption. This work presents a series of analysis to understand how the current Android resource management policy adjusts CPU features. Our results indicate a significant improvement margin for CPU power efficiency in modern Android smartphones. We then propose MobiCore — a power-efficient CPU management scheme that can optimize the use of Dynamic and Frequency Voltage Scaling (DVFS) and the Dynamic Core Scaling (DCS) techniques with a sensitive control on CPU bandwidth. The measurements on the real systems prove that MobiCore can achieve substantial CPU power reduction compared to state-of-the-art architectures.
智能手机正在成为我们日常生活中各种应用程序的必备设备。为了满足不断提高的性能要求,手机的CPU内核数量不断增加,这对手机的功耗产生了很大的影响。这项工作提出了一系列的分析,以了解当前Android资源管理策略如何调整CPU功能。我们的研究结果表明,现代Android智能手机的CPU功率效率有显著提高。然后,我们提出了MobiCore -一种节能的CPU管理方案,可以优化动态和频率电压缩放(DVFS)和动态核心缩放(DCS)技术的使用,并对CPU带宽进行敏感控制。在实际系统上的测量证明,与最先进的架构相比,MobiCore可以大幅降低CPU功耗。
{"title":"MobiCore: An adaptive hybrid approach for power-efficient CPU management on Android devices","authors":"Lucie Broyde, Kent W. Nixon, Xiang Chen, Hai Helen Li, Yiran Chen","doi":"10.1109/SOCC.2017.8226044","DOIUrl":"https://doi.org/10.1109/SOCC.2017.8226044","url":null,"abstract":"Smartphones are becoming essential devices used for various types of applications in our daily life. To satisfy the ever-increasing performance requirement, the number of CPU cores in a phone keeps growing, which imposes a great impact on its power consumption. This work presents a series of analysis to understand how the current Android resource management policy adjusts CPU features. Our results indicate a significant improvement margin for CPU power efficiency in modern Android smartphones. We then propose MobiCore — a power-efficient CPU management scheme that can optimize the use of Dynamic and Frequency Voltage Scaling (DVFS) and the Dynamic Core Scaling (DCS) techniques with a sensitive control on CPU bandwidth. The measurements on the real systems prove that MobiCore can achieve substantial CPU power reduction compared to state-of-the-art architectures.","PeriodicalId":366264,"journal":{"name":"2017 30th IEEE International System-on-Chip Conference (SOCC)","volume":"135 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-02-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124205992","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Asynchronous 1R-1W dual-port SRAM by using single-port SRAM in 28nm UTBB-FDSOI technology 异步1R-1W双端口SRAM采用单端口SRAM在28nm UTBB-FDSOI技术
Pub Date : 2016-12-01 DOI: 10.1109/SOCC.2017.8225994
Harsh Rawat, K. Bharath, Alexander Fell
With the advancement in technology nodes, the number of components operating in different clock domains in a System on Chip (SoC) increases. Asynchronous multi-port memory with dedicated write and read ports is used to allow data to cross clock domain boundaries. The dual-port memory architecture introduced in this paper, is based on the Single-Port SRAM (SP-SRAM) that can be generated in larger capacities with better performance statistics compared to the Dual-Port SRAM (DP-SRAM). The proposed design has been evaluated by comparing existing dual-port 1R-1W and 2RW designs in 28nm Ultra Thin Body and Box Fully Depleted Silicon on Insulator (UTBB-FDSOI) technology. A memory with a capacity of 2048 words with 64 bits, shows 15%, 35%, 28% and 4.5% improvement in read power, write power, read-write power consumption and performance respectively over conventional 1R-1W DP-SRAM with equal area. The synthesis with area optimizations applied instead, shows an area advantage of 50% over conventional 1R-1W DP-SRAM, but with a degradation in performance.
随着技术节点的进步,在片上系统(SoC)中工作在不同时钟域的组件数量增加。异步多端口内存与专用的写和读端口被用来允许数据跨越时钟域边界。本文介绍的双端口内存架构是基于单端口SRAM (SP-SRAM),与双端口SRAM (DP-SRAM)相比,它可以产生更大的容量和更好的性能统计数据。通过比较现有的双端口1R-1W和2RW设计在28nm超薄机身和盒式完全耗尽绝缘体上硅(UTBB-FDSOI)技术,对所提出的设计进行了评估。与同等面积的1R-1W DP-SRAM相比,2048字64位内存的读功率、写功率、读写功耗和性能分别提高了15%、35%、28%和4.5%。与传统的1R-1W DP-SRAM相比,采用面积优化的合成显示出50%的面积优势,但性能有所下降。
{"title":"Asynchronous 1R-1W dual-port SRAM by using single-port SRAM in 28nm UTBB-FDSOI technology","authors":"Harsh Rawat, K. Bharath, Alexander Fell","doi":"10.1109/SOCC.2017.8225994","DOIUrl":"https://doi.org/10.1109/SOCC.2017.8225994","url":null,"abstract":"With the advancement in technology nodes, the number of components operating in different clock domains in a System on Chip (SoC) increases. Asynchronous multi-port memory with dedicated write and read ports is used to allow data to cross clock domain boundaries. The dual-port memory architecture introduced in this paper, is based on the Single-Port SRAM (SP-SRAM) that can be generated in larger capacities with better performance statistics compared to the Dual-Port SRAM (DP-SRAM). The proposed design has been evaluated by comparing existing dual-port 1R-1W and 2RW designs in 28nm Ultra Thin Body and Box Fully Depleted Silicon on Insulator (UTBB-FDSOI) technology. A memory with a capacity of 2048 words with 64 bits, shows 15%, 35%, 28% and 4.5% improvement in read power, write power, read-write power consumption and performance respectively over conventional 1R-1W DP-SRAM with equal area. The synthesis with area optimizations applied instead, shows an area advantage of 50% over conventional 1R-1W DP-SRAM, but with a degradation in performance.","PeriodicalId":366264,"journal":{"name":"2017 30th IEEE International System-on-Chip Conference (SOCC)","volume":"48 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130013917","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Pin accessibility evaluating model for improving routability of VLSI designs 提高超大规模集成电路设计可达性的引脚可达性评估模型
Pub Date : 1900-01-01 DOI: 10.1109/SOCC.2017.8226007
Hong-Yan Su, S. Nishizawa, Yan-Shiun Wu, Jun Shiomi, Yih-Lang Li, H. Onodera
Pin accessibility influences the routability of a design at the stage of block/chip assembling. The estimation model for pin accessibility in previous researches counts the total number of intersections between each pin and M2 routing tracks. It does not consider the variation of pin accessibility as the spacing between a pin and its neighboring pins and metal wires changes. Besides, it cannot properly deal with the off-grid pin access neither. In this paper, we propose a general model for pin accessibility estimation. In the model, all directions to connect to the boundary of a pin are under estimation. Off-grid pin access is also available. Experimental results show that the reduction rate of minimum area to complete the routing of a circuit can be 7.0% on average. Due to the diminishment of required area for routing, the total number of vias for higher metal layer also decrease under the same area constraint.
在块/芯片组装阶段,引脚可达性影响设计的可达性。以往研究的引脚可达性估计模型计算每个引脚与M2路由轨迹的交集总数。它没有考虑引脚与相邻引脚和金属线间距的变化对引脚可及性的影响。此外,它也不能很好地处理离网引脚接入。在本文中,我们提出了一个通用的引脚可达性估计模型。在该模型中,对连接到引脚边界的所有方向都进行了估计。离网引脚访问也可用。实验结果表明,完成电路布线的最小面积缩减率平均可达7.0%。由于走线所需面积的减小,在相同面积约束下,较高金属层的通孔总数也减少。
{"title":"Pin accessibility evaluating model for improving routability of VLSI designs","authors":"Hong-Yan Su, S. Nishizawa, Yan-Shiun Wu, Jun Shiomi, Yih-Lang Li, H. Onodera","doi":"10.1109/SOCC.2017.8226007","DOIUrl":"https://doi.org/10.1109/SOCC.2017.8226007","url":null,"abstract":"Pin accessibility influences the routability of a design at the stage of block/chip assembling. The estimation model for pin accessibility in previous researches counts the total number of intersections between each pin and M2 routing tracks. It does not consider the variation of pin accessibility as the spacing between a pin and its neighboring pins and metal wires changes. Besides, it cannot properly deal with the off-grid pin access neither. In this paper, we propose a general model for pin accessibility estimation. In the model, all directions to connect to the boundary of a pin are under estimation. Off-grid pin access is also available. Experimental results show that the reduction rate of minimum area to complete the routing of a circuit can be 7.0% on average. Due to the diminishment of required area for routing, the total number of vias for higher metal layer also decrease under the same area constraint.","PeriodicalId":366264,"journal":{"name":"2017 30th IEEE International System-on-Chip Conference (SOCC)","volume":"40 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121075225","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
期刊
2017 30th IEEE International System-on-Chip Conference (SOCC)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1