首页 > 最新文献

2015 IEEE Symposium in Low-Power and High-Speed Chips (COOL CHIPS XVIII)最新文献

英文 中文
Electronic Paper Display update scheduler for extremely low power non-volatile embedded systems 电子纸显示更新调度超低功耗非易失性嵌入式系统
Pub Date : 2015-04-13 DOI: 10.1109/CoolChips.2015.7158661
Yusuke Shirota, S. Yoshimura, Tatsunori Kanai
Aggressive use of low power modes in embedded systems using emerging non-volatile or low power compute state retainable devices can greatly reduce its power consumption of idle-state. However, in general, non-volatile devices require comparatively large power to switch between the stable states. Therefore, to realize extremely low power mobile platforms with powerful multimedia application processor running solely on photovoltaic-power, mitigating power consumption of its active-state is the next issue. Replacing power hungry conventional LCDs with non-volatile displays is inevitable in realizing such low power platforms, but naive replacement is insufficient. As such, low power control cognizant of non-volatile device properties is necessary[2]. We propose a display update request scheduling scheme designed for a promising non-volatile display: Electronic Paper Display(EPD) and give deep analysis of power consumption. Proposed scheme dynamically rearranges update requests ill-suited for EPDs to localized and collision-free low power consuming requests at the device driver level, reducing EPD-based tablet's energy consumption by up to 49% without requiring application specific modifications.
在使用新兴的非易失性或低功耗计算状态可保留器件的嵌入式系统中积极使用低功耗模式可以大大降低其空闲状态的功耗。然而,一般来说,非易失性器件需要相对较大的功率才能在稳定状态之间切换。因此,要实现具有强大的多媒体应用处理器的超低功耗移动平台,降低其活动状态的功耗是下一步要解决的问题。用非易失性显示器取代耗电的传统lcd是实现这种低功耗平台的必然选择,但单纯的替换是不够的。因此,认识到器件非易失性的低功耗控制是必要的[2]。针对一种具有发展前景的非易失性显示器——电子纸显示器(EPD),提出了一种显示更新请求调度方案,并对其功耗进行了深入分析。提出的方案动态地将不适合epd的更新请求重新安排为设备驱动级的本地化和无冲突的低功耗请求,在不需要特定应用修改的情况下,将基于epd的平板电脑的能耗降低高达49%。
{"title":"Electronic Paper Display update scheduler for extremely low power non-volatile embedded systems","authors":"Yusuke Shirota, S. Yoshimura, Tatsunori Kanai","doi":"10.1109/CoolChips.2015.7158661","DOIUrl":"https://doi.org/10.1109/CoolChips.2015.7158661","url":null,"abstract":"Aggressive use of low power modes in embedded systems using emerging non-volatile or low power compute state retainable devices can greatly reduce its power consumption of idle-state. However, in general, non-volatile devices require comparatively large power to switch between the stable states. Therefore, to realize extremely low power mobile platforms with powerful multimedia application processor running solely on photovoltaic-power, mitigating power consumption of its active-state is the next issue. Replacing power hungry conventional LCDs with non-volatile displays is inevitable in realizing such low power platforms, but naive replacement is insufficient. As such, low power control cognizant of non-volatile device properties is necessary[2]. We propose a display update request scheduling scheme designed for a promising non-volatile display: Electronic Paper Display(EPD) and give deep analysis of power consumption. Proposed scheme dynamically rearranges update requests ill-suited for EPDs to localized and collision-free low power consuming requests at the device driver level, reducing EPD-based tablet's energy consumption by up to 49% without requiring application specific modifications.","PeriodicalId":358999,"journal":{"name":"2015 IEEE Symposium in Low-Power and High-Speed Chips (COOL CHIPS XVIII)","volume":"932 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-04-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123288737","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Panel discussions computing technology for autonomous driving 小组讨论自动驾驶计算技术
Pub Date : 2015-04-13 DOI: 10.1109/CoolChips.2015.7158654
Shinpei Kato
Autonomous driving is becoming more and more multidisciplinary. Not only vehicular technologies but also computing, networking, and data management technologies are involved in autonomous driving. Of particular interest includes the trade-off between in-vehicle computing and cloud computing to support artificial intelligence of autonomous driving. Perception and planning of autonomy requires high-performance computing while battery-driven vehicles must consider power problems. Offloading such computations onto the cloud could be a drastic solution, though safety and reliability of driving remain major concerns. Data management is also a grand challenge of autonomous driving. In particular, high-precision maps are considered to be the common infrastructure to self-localize vehicles and efficiently route them to their destinations. Unfortunately, current navigation systems are not well compatible to high-precision maps and the sustainable management of map data also remains an open problem. These problems of autonomous driving are not dedicated to particular technologies but need to be addressed by tight coordination of multiple technologies. This panel gathers experts from multiple areas across vehicles, computing platforms, maps, and consumer electronics.
自动驾驶正变得越来越多学科。自动驾驶不仅涉及车辆技术,还涉及计算、网络和数据管理技术。特别令人感兴趣的包括车载计算和云计算之间的权衡,以支持自动驾驶的人工智能。自动驾驶的感知和规划需要高性能计算,而电池驱动的汽车必须考虑电源问题。将这样的计算卸载到云端可能是一个极端的解决方案,尽管驾驶的安全性和可靠性仍然是主要问题。数据管理也是自动驾驶的一大挑战。特别是,高精度地图被认为是自动定位车辆并有效地将其路由到目的地的通用基础设施。不幸的是,目前的导航系统不能很好地与高精度地图兼容,地图数据的可持续管理也仍然是一个悬而未决的问题。自动驾驶的这些问题并不是某一种技术所能解决的,而是需要多种技术的紧密配合才能解决的。该小组汇集了来自汽车、计算平台、地图和消费电子等多个领域的专家。
{"title":"Panel discussions computing technology for autonomous driving","authors":"Shinpei Kato","doi":"10.1109/CoolChips.2015.7158654","DOIUrl":"https://doi.org/10.1109/CoolChips.2015.7158654","url":null,"abstract":"Autonomous driving is becoming more and more multidisciplinary. Not only vehicular technologies but also computing, networking, and data management technologies are involved in autonomous driving. Of particular interest includes the trade-off between in-vehicle computing and cloud computing to support artificial intelligence of autonomous driving. Perception and planning of autonomy requires high-performance computing while battery-driven vehicles must consider power problems. Offloading such computations onto the cloud could be a drastic solution, though safety and reliability of driving remain major concerns. Data management is also a grand challenge of autonomous driving. In particular, high-precision maps are considered to be the common infrastructure to self-localize vehicles and efficiently route them to their destinations. Unfortunately, current navigation systems are not well compatible to high-precision maps and the sustainable management of map data also remains an open problem. These problems of autonomous driving are not dedicated to particular technologies but need to be addressed by tight coordination of multiple technologies. This panel gathers experts from multiple areas across vehicles, computing platforms, maps, and consumer electronics.","PeriodicalId":358999,"journal":{"name":"2015 IEEE Symposium in Low-Power and High-Speed Chips (COOL CHIPS XVIII)","volume":"43 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-04-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128076111","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Power management on 14 nm Intel® Core− M processor 电源管理的14纳米英特尔®酷睿−M处理器
Pub Date : 2015-04-13 DOI: 10.1109/CoolChips.2015.7158657
A. Deval, Avinash Ananthakrishnan, Craig Forbell
The desire to deliver breakthrough performance in a tablet form factor required several innovations in the 14nm Intel® flagship Core™ processor (Broadwell). Better frequency control algorithms including duty cycling graphics cores were developed to improve energy efficiency. New power sharing algorithms were developed to maximize performance of multiple compute domains within tight thermal and power delivery constraints. Innovations resulted in upto 50% increase in performance and upto 25% improvement in battery life over a Haswell system thermally constrained to a 4.5W fanless form factor.
为了在平板电脑上实现突破性的性能,需要在14纳米英特尔®旗舰酷睿™处理器(Broadwell)上进行几项创新。开发了更好的频率控制算法,包括占空比图形内核,以提高能源效率。开发了新的功率共享算法,以在严格的热和功率传输约束下最大化多个计算域的性能。与受限于4.5W无风扇外形的Haswell系统相比,创新使性能提高了50%,电池寿命提高了25%。
{"title":"Power management on 14 nm Intel® Core− M processor","authors":"A. Deval, Avinash Ananthakrishnan, Craig Forbell","doi":"10.1109/CoolChips.2015.7158657","DOIUrl":"https://doi.org/10.1109/CoolChips.2015.7158657","url":null,"abstract":"The desire to deliver breakthrough performance in a tablet form factor required several innovations in the 14nm Intel® flagship Core™ processor (Broadwell). Better frequency control algorithms including duty cycling graphics cores were developed to improve energy efficiency. New power sharing algorithms were developed to maximize performance of multiple compute domains within tight thermal and power delivery constraints. Innovations resulted in upto 50% increase in performance and upto 25% improvement in battery life over a Haswell system thermally constrained to a 4.5W fanless form factor.","PeriodicalId":358999,"journal":{"name":"2015 IEEE Symposium in Low-Power and High-Speed Chips (COOL CHIPS XVIII)","volume":"57 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-04-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124522056","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 9
An energy-efficient FPGA-based soft-core processor with a configurable word size ECC arithmetic accelerator 具有可配置字长ECC算法加速器的高效节能fpga软核处理器
Pub Date : 2015-04-13 DOI: 10.1109/CoolChips.2015.7158664
Aiko Iwasaki, Yuichiro Shibata, K. Oguri, Ryuichi Harasawa
This paper proposes an FPGA-based soft core processor architecture equipped with a configurable accelerator to speed up GF(2m) arithmetic for elliptic curve cryptography (ECC) systems. Focusing on the fact the number of operations required for GF(2m) arithmetic is influenced by the relationship between the irreducible polynomial and the machine word size, we propose an approach where the word size of the accelerator is tailored to a given irreducible polynomial. The evaluation results reveal that the performance and the energy efficiency of GF(2m) multiplication including reduction can be improved by up to 6.67 times and 5.24 times, respectively.
本文提出了一种基于fpga的软核处理器体系结构,并配备了可配置的加速器来加速椭圆曲线密码(ECC)系统的GF(2m)算法。考虑到GF(2m)算法所需的运算次数受到不可约多项式和机器字长之间关系的影响,我们提出了一种针对给定不可约多项式定制加速器字长的方法。评价结果表明,GF(2m)乘法(含减量)的性能和能效分别可提高6.67倍和5.24倍。
{"title":"An energy-efficient FPGA-based soft-core processor with a configurable word size ECC arithmetic accelerator","authors":"Aiko Iwasaki, Yuichiro Shibata, K. Oguri, Ryuichi Harasawa","doi":"10.1109/CoolChips.2015.7158664","DOIUrl":"https://doi.org/10.1109/CoolChips.2015.7158664","url":null,"abstract":"This paper proposes an FPGA-based soft core processor architecture equipped with a configurable accelerator to speed up GF(2m) arithmetic for elliptic curve cryptography (ECC) systems. Focusing on the fact the number of operations required for GF(2m) arithmetic is influenced by the relationship between the irreducible polynomial and the machine word size, we propose an approach where the word size of the accelerator is tailored to a given irreducible polynomial. The evaluation results reveal that the performance and the energy efficiency of GF(2m) multiplication including reduction can be improved by up to 6.67 times and 5.24 times, respectively.","PeriodicalId":358999,"journal":{"name":"2015 IEEE Symposium in Low-Power and High-Speed Chips (COOL CHIPS XVIII)","volume":"14 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-04-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128041082","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Fined-grained body biasing for frequency scaling in advanced SOI processes 高级SOI过程中用于频率标度的细粒度体偏置
Pub Date : 2015-04-13 DOI: 10.1109/CoolChips.2015.7158655
Johannes Maximilian Kühn, H. Amano, O. Bringmann, W. Rosenstiel
New SOI processes offer unprecedented flexibility in regard to low-power and performance. The use of fine-grained body biasing in STMicro's 28nm UTBB-FDSOI is evaluated for a Dynamically Reconfigurable Processor design in a frequency scaling scenario. Three different strategies are evaluated for Processing Elements: Static, programmable and dynamic body biasing. Fine-grained body biasing significantly mitigates increased leakage currents of forward body biasing between 42.85% to 64.5% on average. This makes static body biasing a viable low-cost option and makes dynamic body biasing worthwhile even at short time periods.
新的SOI工艺在低功耗和性能方面提供了前所未有的灵活性。在频率缩放场景下,对意法半导体28nm UTBB-FDSOI中细粒度体偏置的使用进行了评估,用于动态可重构处理器设计。对加工单元进行了三种不同的策略评估:静态、可编程和动态车身偏置。细粒体偏置显著减轻了前倾体偏置增加的泄漏电流,平均在42.85% ~ 64.5%之间。这使得静态身体偏置成为可行的低成本选择,而动态身体偏置即使在短时间内也是值得的。
{"title":"Fined-grained body biasing for frequency scaling in advanced SOI processes","authors":"Johannes Maximilian Kühn, H. Amano, O. Bringmann, W. Rosenstiel","doi":"10.1109/CoolChips.2015.7158655","DOIUrl":"https://doi.org/10.1109/CoolChips.2015.7158655","url":null,"abstract":"New SOI processes offer unprecedented flexibility in regard to low-power and performance. The use of fine-grained body biasing in STMicro's 28nm UTBB-FDSOI is evaluated for a Dynamically Reconfigurable Processor design in a frequency scaling scenario. Three different strategies are evaluated for Processing Elements: Static, programmable and dynamic body biasing. Fine-grained body biasing significantly mitigates increased leakage currents of forward body biasing between 42.85% to 64.5% on average. This makes static body biasing a viable low-cost option and makes dynamic body biasing worthwhile even at short time periods.","PeriodicalId":358999,"journal":{"name":"2015 IEEE Symposium in Low-Power and High-Speed Chips (COOL CHIPS XVIII)","volume":"25 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-04-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124499608","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
A keypoint-level parallel pipelined object recognition processor with gaze activation image sensor for mobile smart glasses system 基于注视激活图像传感器的移动智能眼镜关键点级并行流水线目标识别处理器
Pub Date : 2015-04-13 DOI: 10.1109/CoolChips.2015.7158531
Injoon Hong, Dongjoo Shin, Youchang Kim, Kyeongryeol Bong, Seongwook Park, K. Lee, H. Yoo
In this paper, a low-power real-time gaze-activated object recognition processor is proposed for a battery-powered smart glasses system. For high energy efficiency, we propose keypoint-level pipelined architecture to increase the hardware utilziation which results in significant power reduction of the real-time recognition processor. In addition, low-power gaze-activation image sensor with mixed-mode architecture is proposed for the glass user's gaze estimation. Therefore, only the small image region where the glasses user is seeing needs to be processed by the recognition processor leading to further power reduction. As a result, the proposed object recognition processor shows 30fps real-time performance only with 75mW power consumption, which is 3.5x and 4.4x smaller power than the state-of-the-art works.
本文提出了一种用于电池供电的智能眼镜系统的低功耗实时注视激活目标识别处理器。为了提高能效,我们提出了关键点级流水线架构,以提高硬件利用率,从而显著降低实时识别处理器的功耗。此外,提出了一种混合模式结构的低功耗注视激活图像传感器,用于眼镜使用者的注视估计。因此,识别处理器只需要处理眼镜使用者所看到的小图像区域,从而进一步降低功耗。因此,该目标识别处理器的实时性能为30fps,功耗为75mW,分别比现有产品低3.5倍和4.4倍。
{"title":"A keypoint-level parallel pipelined object recognition processor with gaze activation image sensor for mobile smart glasses system","authors":"Injoon Hong, Dongjoo Shin, Youchang Kim, Kyeongryeol Bong, Seongwook Park, K. Lee, H. Yoo","doi":"10.1109/CoolChips.2015.7158531","DOIUrl":"https://doi.org/10.1109/CoolChips.2015.7158531","url":null,"abstract":"In this paper, a low-power real-time gaze-activated object recognition processor is proposed for a battery-powered smart glasses system. For high energy efficiency, we propose keypoint-level pipelined architecture to increase the hardware utilziation which results in significant power reduction of the real-time recognition processor. In addition, low-power gaze-activation image sensor with mixed-mode architecture is proposed for the glass user's gaze estimation. Therefore, only the small image region where the glasses user is seeing needs to be processed by the recognition processor leading to further power reduction. As a result, the proposed object recognition processor shows 30fps real-time performance only with 75mW power consumption, which is 3.5x and 4.4x smaller power than the state-of-the-art works.","PeriodicalId":358999,"journal":{"name":"2015 IEEE Symposium in Low-Power and High-Speed Chips (COOL CHIPS XVIII)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-04-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129508935","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
TURO: A lightweight turn-guided routing scheme for 3D NoCs TURO:用于3D noc的轻量级转弯引导路由方案
Pub Date : 2015-04-13 DOI: 10.1109/CoolChips.2015.7158665
Jun Zhou, Huawei Li, Tiancheng Wang, Ying Wang, Xiaowei Li
Deadlock is a common problem in 3D networks-on-chip. In this paper, we propose a lightweight and deadlock-free turn-guided routing scheme named TURO without requiring any virtual channels, which is a minimal routing guided by a new 3D turn model NeoOE. The theoretical analysis and experimental results show that TURO possesses improved adaptivity, higher performance and lower overhead compared with the state-of-the-art routing schemes using other 3D turn models.
死锁是3D片上网络中的一个常见问题。本文提出了一种不需要任何虚拟通道的轻量化无死锁的转弯引导路由方案TURO,这是一种由新的三维转弯模型NeoOE引导的最小路由方案。理论分析和实验结果表明,与目前采用其他三维转弯模型的路径方案相比,TURO具有更好的自适应性、更高的性能和更低的开销。
{"title":"TURO: A lightweight turn-guided routing scheme for 3D NoCs","authors":"Jun Zhou, Huawei Li, Tiancheng Wang, Ying Wang, Xiaowei Li","doi":"10.1109/CoolChips.2015.7158665","DOIUrl":"https://doi.org/10.1109/CoolChips.2015.7158665","url":null,"abstract":"Deadlock is a common problem in 3D networks-on-chip. In this paper, we propose a lightweight and deadlock-free turn-guided routing scheme named TURO without requiring any virtual channels, which is a minimal routing guided by a new 3D turn model NeoOE. The theoretical analysis and experimental results show that TURO possesses improved adaptivity, higher performance and lower overhead compared with the state-of-the-art routing schemes using other 3D turn models.","PeriodicalId":358999,"journal":{"name":"2015 IEEE Symposium in Low-Power and High-Speed Chips (COOL CHIPS XVIII)","volume":"13 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-04-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122180245","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
A leakage current monitor circuit using silicon on thin BOX MOSFET for dynamic back gate bias control 一种利用薄盒硅MOSFET进行动态后门偏置控制的漏电流监测电路
Pub Date : 2015-04-13 DOI: 10.1109/CoolChips.2015.7158656
Hayate Okuhara, K. Usami, H. Amano
A leakage current monitor circuit was developed for dynamic back gate bias control of CMOS LSI with Silicon on Thin BOX (SOTB) technology. By using the SOTB technology, sensors or wearable devices can suppress the leakage power by giving deep reverse body bias when they are not used. Once an event occurs, they must turn to the operational mode by changing the body bias quickly. According to the real chip evaluation, it takes hundreds of micro seconds, and the wake-up time is difficult to be estimated. The proposed detector using a leakage current monitor circuit guarantees that the target module is ready to be operational. The target body bias voltage for operation can be controlled by the bias voltage of the detector domain, which is computed with an expression in advance. SPICE simulation reveals that formulation is done and power overhead is only 42.7-42.9nW in the room temperature. Compensation equations for various temperatures are also shown.
采用薄盒上硅(Silicon on Thin BOX, SOTB)技术,研制了一种用于动态控制CMOS LSI后门偏置的漏电流监测电路。通过使用SOTB技术,传感器或可穿戴设备可以在不使用时通过施加深度反向体偏压来抑制泄漏功率。一旦事件发生,他们必须通过迅速改变身体偏见来转向操作模式。根据真实芯片评估,需要数百微秒,唤醒时间难以估计。所提出的检测器采用漏电流监测电路,保证目标模块准备好可操作。目标体的工作偏置电压可以通过探测器域的偏置电压来控制,该偏置电压是用预先计算好的表达式来控制的。SPICE仿真结果表明,在室温下,配方完成,功率开销仅为42.7-42.9nW。还给出了不同温度下的补偿方程。
{"title":"A leakage current monitor circuit using silicon on thin BOX MOSFET for dynamic back gate bias control","authors":"Hayate Okuhara, K. Usami, H. Amano","doi":"10.1109/CoolChips.2015.7158656","DOIUrl":"https://doi.org/10.1109/CoolChips.2015.7158656","url":null,"abstract":"A leakage current monitor circuit was developed for dynamic back gate bias control of CMOS LSI with Silicon on Thin BOX (SOTB) technology. By using the SOTB technology, sensors or wearable devices can suppress the leakage power by giving deep reverse body bias when they are not used. Once an event occurs, they must turn to the operational mode by changing the body bias quickly. According to the real chip evaluation, it takes hundreds of micro seconds, and the wake-up time is difficult to be estimated. The proposed detector using a leakage current monitor circuit guarantees that the target module is ready to be operational. The target body bias voltage for operation can be controlled by the bias voltage of the detector domain, which is computed with an expression in advance. SPICE simulation reveals that formulation is done and power overhead is only 42.7-42.9nW in the room temperature. Compensation equations for various temperatures are also shown.","PeriodicalId":358999,"journal":{"name":"2015 IEEE Symposium in Low-Power and High-Speed Chips (COOL CHIPS XVIII)","volume":"96 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-04-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129878529","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Lowering the complexity of k-means clustering by BFS-dijkstra method for graph computing 用BFS-dijkstra方法降低图计算k-means聚类的复杂度
Pub Date : 2015-04-13 DOI: 10.1109/CoolChips.2015.7158653
A. Zhang, Jun Yao, Y. Nakashima
K-means is a method of vector quantization, which is now popularly used for clustering analysis in massive data mining. Due to its heavily computational-intensive feature for iteratively re-computing and sorting distances, the execution of k-means takes a huge amount of time, especially when processing large graph data such as the practical social networks. This paper studies an alternative method to emulate the k-clustering from another view, in which the vertices in a graph are partitioned into k farthest clusters. This method can be implementable in a breadth-first-search (BFS) form and then becomes easily parallelizable. Our result shows that our BFS-based k-clustering achieves more than 100x speeds than the traditional partitioning in the open-source graphlab project.
K-means是一种矢量量化方法,目前广泛用于海量数据挖掘中的聚类分析。由于其迭代重新计算和排序距离的大量计算密集型特征,k-means的执行需要花费大量的时间,特别是在处理大型图形数据(如实际的社交网络)时。本文从另一个角度研究了一种模拟k-聚类的替代方法,该方法将图中的顶点划分为k个最远的聚类。这种方法可以以广度优先搜索(BFS)的形式实现,然后变得容易并行化。我们的结果表明,在开源graphlab项目中,基于bfs的k-clustering比传统分区实现了100倍以上的速度。
{"title":"Lowering the complexity of k-means clustering by BFS-dijkstra method for graph computing","authors":"A. Zhang, Jun Yao, Y. Nakashima","doi":"10.1109/CoolChips.2015.7158653","DOIUrl":"https://doi.org/10.1109/CoolChips.2015.7158653","url":null,"abstract":"K-means is a method of vector quantization, which is now popularly used for clustering analysis in massive data mining. Due to its heavily computational-intensive feature for iteratively re-computing and sorting distances, the execution of k-means takes a huge amount of time, especially when processing large graph data such as the practical social networks. This paper studies an alternative method to emulate the k-clustering from another view, in which the vertices in a graph are partitioned into k farthest clusters. This method can be implementable in a breadth-first-search (BFS) form and then becomes easily parallelizable. Our result shows that our BFS-based k-clustering achieves more than 100x speeds than the traditional partitioning in the open-source graphlab project.","PeriodicalId":358999,"journal":{"name":"2015 IEEE Symposium in Low-Power and High-Speed Chips (COOL CHIPS XVIII)","volume":"110 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-04-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124106645","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
An energy efficient hybrid FPGA-GPU based embedded platform to accelerate face recognition application 基于FPGA-GPU的高能效混合嵌入式平台,加速人脸识别应用
Pub Date : 2015-04-13 DOI: 10.1109/CoolChips.2015.7158532
S. Rethinagiri, Oscar Palomar, J. Moreno, O. Unsal, A. Cristal
Nowadays face recognition application is widely used in various industries such as traffic, safety, medical engineering, etc. In this paper, we propose a power and energy efficient heterogeneous platform to accelerate face recognition applications. To achieve this efficiency, we propose a novel hybrid platform which consists of a Xilinx Zynq (ARM+FPGA) and an NVidia's Jetson TK1 (ARM+GPU) coupled with PCIe card. In this application, we optimized local binary pattern and eigenvalue based face detection and recognition in order to achieve a speedup of 69x when compared to sequential execution on the ARM core, 4.8x against Zynq platform (ARM+FPGA), 3.2x against NVidia platform (ARM+GPU) and 40% more energy efficient against sequential execution.
目前,人脸识别应用已广泛应用于交通、安全、医疗工程等各个行业。在本文中,我们提出了一个功率和能源效率高的异构平台来加速人脸识别的应用。为了实现这种效率,我们提出了一种新的混合平台,该平台由Xilinx Zynq (ARM+FPGA)和NVidia的Jetson TK1 (ARM+GPU)以及PCIe卡组成。在本应用中,我们优化了基于局部二进制模式和特征值的人脸检测和识别,与在ARM核心上的顺序执行相比,速度提高了69倍,与Zynq平台(ARM+FPGA)相比提高了4.8倍,与NVidia平台(ARM+GPU)相比提高了3.2倍,与顺序执行相比节能了40%。
{"title":"An energy efficient hybrid FPGA-GPU based embedded platform to accelerate face recognition application","authors":"S. Rethinagiri, Oscar Palomar, J. Moreno, O. Unsal, A. Cristal","doi":"10.1109/CoolChips.2015.7158532","DOIUrl":"https://doi.org/10.1109/CoolChips.2015.7158532","url":null,"abstract":"Nowadays face recognition application is widely used in various industries such as traffic, safety, medical engineering, etc. In this paper, we propose a power and energy efficient heterogeneous platform to accelerate face recognition applications. To achieve this efficiency, we propose a novel hybrid platform which consists of a Xilinx Zynq (ARM+FPGA) and an NVidia's Jetson TK1 (ARM+GPU) coupled with PCIe card. In this application, we optimized local binary pattern and eigenvalue based face detection and recognition in order to achieve a speedup of 69x when compared to sequential execution on the ARM core, 4.8x against Zynq platform (ARM+FPGA), 3.2x against NVidia platform (ARM+GPU) and 40% more energy efficient against sequential execution.","PeriodicalId":358999,"journal":{"name":"2015 IEEE Symposium in Low-Power and High-Speed Chips (COOL CHIPS XVIII)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-04-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127429876","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 9
期刊
2015 IEEE Symposium in Low-Power and High-Speed Chips (COOL CHIPS XVIII)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1