首页 > 最新文献

2022 IEEE 15th International Symposium on Embedded Multicore/Many-core Systems-on-Chip (MCSoC)最新文献

英文 中文
Exploring the Potential of Error-permissive Communication in Multi-FPGA-based Edge Computing 探索基于多fpga的边缘计算中容错通信的潜力
Akram Ben Ahmed, Ryousei Takano, Takahiro Hirofuchi
In recent years, Edge Computing has been attracting a lot of attention as it has shown its ability to mitigate some issues found in Cloud Computing (e.g., high latency, centralization issues, programmability, etc.), Moving computation from the Cloud to the Edge has resulted in an exponential increase in the number of devices to be connected. With such an increase, interconnection has become a major design and performance factor, especially in terms of power efficiency. Thus, the question that a lot of research has been trying to answer is how to connect such devices in a power-efficient way while making sure to maintain high-performance and accuracy at the lowest cost. In this paper, we investigate the Error-permissive communication paradigm and its potential to answer the aforementioned question. In particular, we focus on the importance of using this paradigm on Multi-FPGA systems which have gained significant attention in Edge Computing and can be a good candidate for Error-permissive communication. We also present a preliminary evaluation of a novel concept to reduce the power consumption by undervolting the supply voltage of the serial transceivers, and we show its potential to reduce the total FPGA power consumption by up to 26%.
近年来,边缘计算吸引了大量的关注,因为它已经显示出它能够缓解云计算中发现的一些问题(例如,高延迟,集中化问题,可编程性等)。将计算从云计算转移到边缘导致要连接的设备数量呈指数级增长。随着这种增长,互连已成为一个主要的设计和性能因素,特别是在功率效率方面。因此,许多研究一直试图回答的问题是如何以一种节能的方式连接这些设备,同时确保以最低的成本保持高性能和准确性。在本文中,我们研究了允许错误的通信范式及其回答上述问题的潜力。特别是,我们专注于在多fpga系统上使用这种范式的重要性,这些系统在边缘计算中得到了极大的关注,并且可以成为容错通信的良好候选者。我们还对通过降低串行收发器的供电电压来降低功耗的新概念进行了初步评估,并展示了其将FPGA总功耗降低高达26%的潜力。
{"title":"Exploring the Potential of Error-permissive Communication in Multi-FPGA-based Edge Computing","authors":"Akram Ben Ahmed, Ryousei Takano, Takahiro Hirofuchi","doi":"10.1109/MCSoC57363.2022.00024","DOIUrl":"https://doi.org/10.1109/MCSoC57363.2022.00024","url":null,"abstract":"In recent years, Edge Computing has been attracting a lot of attention as it has shown its ability to mitigate some issues found in Cloud Computing (e.g., high latency, centralization issues, programmability, etc.), Moving computation from the Cloud to the Edge has resulted in an exponential increase in the number of devices to be connected. With such an increase, interconnection has become a major design and performance factor, especially in terms of power efficiency. Thus, the question that a lot of research has been trying to answer is how to connect such devices in a power-efficient way while making sure to maintain high-performance and accuracy at the lowest cost. In this paper, we investigate the Error-permissive communication paradigm and its potential to answer the aforementioned question. In particular, we focus on the importance of using this paradigm on Multi-FPGA systems which have gained significant attention in Edge Computing and can be a good candidate for Error-permissive communication. We also present a preliminary evaluation of a novel concept to reduce the power consumption by undervolting the supply voltage of the serial transceivers, and we show its potential to reduce the total FPGA power consumption by up to 26%.","PeriodicalId":150801,"journal":{"name":"2022 IEEE 15th International Symposium on Embedded Multicore/Many-core Systems-on-Chip (MCSoC)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"120939215","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Toward EEG-Based Brain State Recognition for Personalized Neuromodulation 基于脑电图的个性化神经调节脑状态识别研究
Yu-Cheng Chang, Pin-Hsuan Chao, Sin-Horng Chen, Chun-Shu Wei
Repetitive transcranial magnetic stimulation (rTMS) is a non-invasive antidepressant neuromodulation therapy for treatment-resistant depression (TRD). However, the remission rate of patients remains unsatisfactory possibly due to the suboptimal configuration of conventional rTMS protocol. This work aims to design a close-loop TMS system and validate the practicability of brain-state-dependent stimulation based on real-time monitoring of electroencephalogram (EEG). We propose a novel method of phase estimation to enhance the precision of EEG phase-triggered firing of TMS. Our implementation supports subsequent studies on personalized brain-state-dependent neuromodulation for clinical applications.
重复经颅磁刺激(rTMS)是治疗难治性抑郁症(TRD)的一种非侵入性抗抑郁神经调节疗法。然而,患者的缓解率仍然不理想,这可能是由于传统rTMS方案的次优配置。本研究旨在设计一个闭环经颅磁刺激系统,验证基于脑电图实时监测的脑状态依赖性刺激的实用性。提出了一种新的相位估计方法,以提高脑电经颅磁刺激相触发的精度。我们的实现支持后续个性化脑状态依赖神经调节的临床应用研究。
{"title":"Toward EEG-Based Brain State Recognition for Personalized Neuromodulation","authors":"Yu-Cheng Chang, Pin-Hsuan Chao, Sin-Horng Chen, Chun-Shu Wei","doi":"10.1109/MCSoC57363.2022.00053","DOIUrl":"https://doi.org/10.1109/MCSoC57363.2022.00053","url":null,"abstract":"Repetitive transcranial magnetic stimulation (rTMS) is a non-invasive antidepressant neuromodulation therapy for treatment-resistant depression (TRD). However, the remission rate of patients remains unsatisfactory possibly due to the suboptimal configuration of conventional rTMS protocol. This work aims to design a close-loop TMS system and validate the practicability of brain-state-dependent stimulation based on real-time monitoring of electroencephalogram (EEG). We propose a novel method of phase estimation to enhance the precision of EEG phase-triggered firing of TMS. Our implementation supports subsequent studies on personalized brain-state-dependent neuromodulation for clinical applications.","PeriodicalId":150801,"journal":{"name":"2022 IEEE 15th International Symposium on Embedded Multicore/Many-core Systems-on-Chip (MCSoC)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121334140","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Automatic endometrial segmentation in ultrasound images using deep learning 基于深度学习的超声图像子宫内膜自动分割
Yiyang Liu, Boyuan Peng, Xin Zhu, Wenwen Wang, Qin Zhou, Shixuan Wang, Jingjing Jiang, Li Fang
Endometrial segmentation plays a vital role in the computerized evaluation of uterine ultrasonic images. Accurate segmentation of endometrial regions may improve the accuracy and efficiency of diagnosis. Recent studies have been focused on the employment of deep learning in medical image segmentation. In this study, we compared six models, including five convolutional neural networks with different network architectures (UNet, Segnet) and backbones (Resnet50, Vanilla CNN, VGG16) for the segmentation of endometrium, and one model called deep dual-resolution networks (DDRNets). The training and test datasets were composed of 840 and 210 images from 302 and 68 cases, respectively. Through validation, DRRNets demonstrated the best performance for endometrial segmentation with an average Dice coefficient (DSC) of 0.895.
子宫内膜分割在子宫超声图像的计算机评价中起着至关重要的作用。子宫内膜区域的准确分割可提高诊断的准确性和效率。近年来的研究重点是将深度学习应用于医学图像分割。在这项研究中,我们比较了六种模型,包括五种不同网络架构(UNet, Segnet)和骨干(Resnet50, Vanilla CNN, VGG16)的卷积神经网络用于子宫内膜分割,以及一种称为深度双分辨率网络(DDRNets)的模型。训练和测试数据集分别由来自302例和68例的840和210张图像组成。经验证,DRRNets在子宫内膜分割上表现最佳,平均Dice系数(DSC)为0.895。
{"title":"Automatic endometrial segmentation in ultrasound images using deep learning","authors":"Yiyang Liu, Boyuan Peng, Xin Zhu, Wenwen Wang, Qin Zhou, Shixuan Wang, Jingjing Jiang, Li Fang","doi":"10.1109/MCSoC57363.2022.00020","DOIUrl":"https://doi.org/10.1109/MCSoC57363.2022.00020","url":null,"abstract":"Endometrial segmentation plays a vital role in the computerized evaluation of uterine ultrasonic images. Accurate segmentation of endometrial regions may improve the accuracy and efficiency of diagnosis. Recent studies have been focused on the employment of deep learning in medical image segmentation. In this study, we compared six models, including five convolutional neural networks with different network architectures (UNet, Segnet) and backbones (Resnet50, Vanilla CNN, VGG16) for the segmentation of endometrium, and one model called deep dual-resolution networks (DDRNets). The training and test datasets were composed of 840 and 210 images from 302 and 68 cases, respectively. Through validation, DRRNets demonstrated the best performance for endometrial segmentation with an average Dice coefficient (DSC) of 0.895.","PeriodicalId":150801,"journal":{"name":"2022 IEEE 15th International Symposium on Embedded Multicore/Many-core Systems-on-Chip (MCSoC)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116210864","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Traffic-Aware Energy-Efficient Hybrid Input Buffer Design for On-Chip Routers 片上路由器的交通感知节能混合输入缓冲器设计
Yujie Gao, Yuan He, Xiaohan Yue, Haiyan Jiang, Xibo Wang
A growing number of cores per chip has driven the rapid adoption of increasingly complex Networks-on-Chip (NoCs) under diminishing power budgets. In such a situation, having more routers or having routers with higher radix is inevitable, which creates higher demands for input buffers while they already draw a significant amount of power. Thus, this paper introduces a hybrid input buffer design for on-chip routers attempting to shrink their power consumption while conserving performance. The key idea behind this proposal is to design input buffers with the network traffic characteristics in mind. As in our observations, a large portion of the network traffic is short packets, which means, it is fair to implement most of the input buffers with slow but less leaky devices (STT-MRAM) to suppress the static power consumption while still having most of the network traffic stored in fast but leaky SRAM devices to conserve the network performance. Our evaluations show that this hybrid design can achieve an average reduction of energy consumption per flit by 44.5% under 93.6% of the original router area and small degradation of the network performance.
在不断减少的功耗预算下,每个芯片的核心数量不断增加,推动了越来越复杂的片上网络(noc)的快速采用。在这种情况下,拥有更多的路由器或拥有更高基数的路由器是不可避免的,这会在已经消耗大量功率的情况下对输入缓冲区产生更高的需求。因此,本文介绍了一种混合输入缓冲器设计,用于片上路由器,试图在节省性能的同时减少其功耗。这个建议背后的关键思想是在设计输入缓冲区时考虑到网络流量的特征。正如我们的观察,大部分网络流量是短数据包,这意味着,使用缓慢但泄漏较少的设备(STT-MRAM)实现大多数输入缓冲区以抑制静态功耗是公平的,同时仍然将大多数网络流量存储在快速但泄漏的SRAM设备中以保持网络性能。我们的评估表明,这种混合设计可以在原始路由器面积的93.6%的情况下实现平均每飞行能耗降低44.5%,并且网络性能下降很小。
{"title":"Traffic-Aware Energy-Efficient Hybrid Input Buffer Design for On-Chip Routers","authors":"Yujie Gao, Yuan He, Xiaohan Yue, Haiyan Jiang, Xibo Wang","doi":"10.1109/MCSoC57363.2022.10023992","DOIUrl":"https://doi.org/10.1109/MCSoC57363.2022.10023992","url":null,"abstract":"A growing number of cores per chip has driven the rapid adoption of increasingly complex Networks-on-Chip (NoCs) under diminishing power budgets. In such a situation, having more routers or having routers with higher radix is inevitable, which creates higher demands for input buffers while they already draw a significant amount of power. Thus, this paper introduces a hybrid input buffer design for on-chip routers attempting to shrink their power consumption while conserving performance. The key idea behind this proposal is to design input buffers with the network traffic characteristics in mind. As in our observations, a large portion of the network traffic is short packets, which means, it is fair to implement most of the input buffers with slow but less leaky devices (STT-MRAM) to suppress the static power consumption while still having most of the network traffic stored in fast but leaky SRAM devices to conserve the network performance. Our evaluations show that this hybrid design can achieve an average reduction of energy consumption per flit by 44.5% under 93.6% of the original router area and small degradation of the network performance.","PeriodicalId":150801,"journal":{"name":"2022 IEEE 15th International Symposium on Embedded Multicore/Many-core Systems-on-Chip (MCSoC)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132221348","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Composite Lightweight Authenticated Encryption Based on LED Block Cipher and PHOTON Hash Function for IoT Devices 基于LED分组密码和光子哈希函数的物联网设备复合轻量级认证加密
M. Al-Shatari, F. Hussin, A. A. Aziz, M. S. Rohmad, Xuan-Tu Tran
IoT devices are being used in different environments recently. They are mostly resource-constrained, and therefore, their data security is crucial. Several lightweight cryptographic primitives were proposed to overcome the limitations of the devices while maintaining moderate security levels. Such primitives provide either encryption or authentication. The encryption must be authenticated by a Message Authentication Code (MA C) or hash function for better overall security. Therefore, an architecture of integrated lightweight authenticated encryption (AE) based on LED block cipher and PHOTON hash function is presented. LED and PHOTON architectures were combined while exploiting area-performance trade-offs and utilizing the shared internal functions. The architecture is designed in Verilog HDL, synthesized in Altera Quartus II and simulated on Field Programmable Gate Array (FPGA) devices. The individual design of LED utilizes 357 logic elements (LE) and PHOTON utilizes 852 LE resulting in a total of 1209 LE. The logic utilization of the proposed shared architecture is 1046 LE. The results reveal that 13.5 % reduction in logic area is achieved compared to the independent implementations of LED and PHOTON.
最近,物联网设备正在不同的环境中使用。他们大多资源有限,因此,他们的数据安全至关重要。提出了几个轻量级的加密原语,以克服设备的限制,同时保持适度的安全级别。这些原语提供加密或身份验证。加密必须通过消息验证码(Message Authentication Code, MA C)或散列函数进行身份验证,以获得更好的整体安全性。为此,提出了一种基于LED分组密码和光子哈希函数的集成轻量级认证加密(AE)体系结构。LED和光子架构结合在一起,同时利用面积性能权衡和共享的内部功能。该体系结构采用Verilog HDL进行设计,在Altera Quartus II中进行合成,并在现场可编程门阵列(FPGA)器件上进行仿真。LED的单独设计使用357个逻辑元件(LE),而PHOTON使用852个逻辑元件,总共使用1209个逻辑元件。所提出的共享体系结构的逻辑利用率为1046le。结果表明,与独立实现LED和光子相比,该电路的逻辑面积减少了13.5%。
{"title":"Composite Lightweight Authenticated Encryption Based on LED Block Cipher and PHOTON Hash Function for IoT Devices","authors":"M. Al-Shatari, F. Hussin, A. A. Aziz, M. S. Rohmad, Xuan-Tu Tran","doi":"10.1109/MCSoC57363.2022.00030","DOIUrl":"https://doi.org/10.1109/MCSoC57363.2022.00030","url":null,"abstract":"IoT devices are being used in different environments recently. They are mostly resource-constrained, and therefore, their data security is crucial. Several lightweight cryptographic primitives were proposed to overcome the limitations of the devices while maintaining moderate security levels. Such primitives provide either encryption or authentication. The encryption must be authenticated by a Message Authentication Code (MA C) or hash function for better overall security. Therefore, an architecture of integrated lightweight authenticated encryption (AE) based on LED block cipher and PHOTON hash function is presented. LED and PHOTON architectures were combined while exploiting area-performance trade-offs and utilizing the shared internal functions. The architecture is designed in Verilog HDL, synthesized in Altera Quartus II and simulated on Field Programmable Gate Array (FPGA) devices. The individual design of LED utilizes 357 logic elements (LE) and PHOTON utilizes 852 LE resulting in a total of 1209 LE. The logic utilization of the proposed shared architecture is 1046 LE. The results reveal that 13.5 % reduction in logic area is achieved compared to the independent implementations of LED and PHOTON.","PeriodicalId":150801,"journal":{"name":"2022 IEEE 15th International Symposium on Embedded Multicore/Many-core Systems-on-Chip (MCSoC)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115435717","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Message from the Chairs: Welcome to the 2022 IEEE 15th International Symposium on embedded Multicore/Many-core Systems-on-Chip (IEEE MCSoC-2022) 主持人致辞:欢迎参加IEEE第15届嵌入式多核/多核系统芯片国际研讨会(IEEE MCSoC-2022)
{"title":"Message from the Chairs: Welcome to the 2022 IEEE 15th International Symposium on embedded Multicore/Many-core Systems-on-Chip (IEEE MCSoC-2022)","authors":"","doi":"10.1109/mcsoc57363.2022.00005","DOIUrl":"https://doi.org/10.1109/mcsoc57363.2022.00005","url":null,"abstract":"","PeriodicalId":150801,"journal":{"name":"2022 IEEE 15th International Symposium on Embedded Multicore/Many-core Systems-on-Chip (MCSoC)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115589649","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Hybrid Opto-Electrical Floating-point Multiplier 一种混合光电浮点乘法器
Takumi Inaba, Takatsugu Ono, Koji Inoue, Satoshi Kawakami
The performance improvement by CMOS circuit technology is reaching its limits. Many researchers have been studying computing technologies that use emerging devices to challenge such critical issues. Nanophotonic technology is a promising candidate due to its ultra-low latency, high bandwidth, and low power natures. The advanced research activity of nanophotonic computing is to design hardware accelerators for AI inference applications. However, few considerations about nanophotonic accelerators for AI training applications have been conducted. The main reason is that state-of-the-art nanophotonic AI accelerators involve integer operations, whereas floating-point (FP) sum-of-products dominate the training process. However, to the best of the authors' knowledge, there are no optical circuits that target floating-point arithmetic units. This study proposes a novel Opto-Electrical Floating-point Multiplier (OEFM) toward ultra-low-latency, a power-efficient nanophotonic accelerator for AI training applications. We design a microarchitecture of OEFM, including a novel optical integer multiplier and other electrical components. Based on our evaluation framework, we analyze the calculation accuracy of the proposed multiplier and OEFM. Experimental results show that OEFM achieves a 56 % reduction in latency and a 41 % reduction in energy consumption compared with a conventional electrical circuit.
CMOS电路技术的性能提升已经达到极限。许多研究人员一直在研究使用新兴设备来挑战这些关键问题的计算技术。纳米光子技术由于其超低延迟、高带宽和低功耗的特性,是一个很有前途的候选技术。纳米光子计算的前沿研究活动是设计用于人工智能推理应用的硬件加速器。然而,纳米光子加速器在人工智能训练中的应用却很少。主要原因是最先进的纳米光子人工智能加速器涉及整数运算,而浮点(FP)乘积和在训练过程中占主导地位。然而,据作者所知,目前还没有针对浮点算术单位的光学电路。本研究提出了一种面向超低延迟的新型光电浮点乘法器(OEFM),一种用于人工智能训练应用的节能纳米光子加速器。我们设计了一个OEFM的微架构,包括一个新型的光学整数乘法器和其他电子元件。基于我们的评估框架,我们分析了所提乘数和OEFM的计算精度。实验结果表明,与传统电路相比,OEFM的延迟降低了56%,能耗降低了41%。
{"title":"A Hybrid Opto-Electrical Floating-point Multiplier","authors":"Takumi Inaba, Takatsugu Ono, Koji Inoue, Satoshi Kawakami","doi":"10.1109/MCSoC57363.2022.00057","DOIUrl":"https://doi.org/10.1109/MCSoC57363.2022.00057","url":null,"abstract":"The performance improvement by CMOS circuit technology is reaching its limits. Many researchers have been studying computing technologies that use emerging devices to challenge such critical issues. Nanophotonic technology is a promising candidate due to its ultra-low latency, high bandwidth, and low power natures. The advanced research activity of nanophotonic computing is to design hardware accelerators for AI inference applications. However, few considerations about nanophotonic accelerators for AI training applications have been conducted. The main reason is that state-of-the-art nanophotonic AI accelerators involve integer operations, whereas floating-point (FP) sum-of-products dominate the training process. However, to the best of the authors' knowledge, there are no optical circuits that target floating-point arithmetic units. This study proposes a novel Opto-Electrical Floating-point Multiplier (OEFM) toward ultra-low-latency, a power-efficient nanophotonic accelerator for AI training applications. We design a microarchitecture of OEFM, including a novel optical integer multiplier and other electrical components. Based on our evaluation framework, we analyze the calculation accuracy of the proposed multiplier and OEFM. Experimental results show that OEFM achieves a 56 % reduction in latency and a 41 % reduction in energy consumption compared with a conventional electrical circuit.","PeriodicalId":150801,"journal":{"name":"2022 IEEE 15th International Symposium on Embedded Multicore/Many-core Systems-on-Chip (MCSoC)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123622611","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Parameterizable mobile workloads for adaptable base station optimizations 用于自适应基站优化的可参数化移动工作负载
Julian Robledo, J. Castrillón
Recent works on 5G baseband processing systems address the optimization of applications with different require-ments of quality of service (QoS). The volume and heterogeneity of applications that have to be processed on a base station are growing and 5G introduces new use cases that push system designers towards more flexible and adaptable approaches. To investigate future network challenges of mobile communications, a good methodology for the generation of realistic workloads, that allows target optimizations of different traffic scenarios, is required. In this paper, we study the variation of real traffic data on multiple base stations and identify the main sources for the high variation of the 5G workloads. We propose a methodology for parameterizable workload generation for users with different QoS requirements that enables optimization techniques in base-band processing systems. We demonstrate the feasibility of our approach based on a virtual base station using a heterogeneous hardware model and various state-of-the-art mapping policies.
最近关于5G基带处理系统的工作解决了具有不同服务质量(QoS)要求的应用的优化问题。必须在基站上处理的应用程序的数量和异质性正在增长,5G引入了新的用例,推动系统设计人员采用更灵活和适应性更强的方法。为了研究移动通信的未来网络挑战,需要一种良好的方法来生成现实的工作负载,从而允许对不同的流量场景进行目标优化。本文研究了多个基站真实流量数据的变化,确定了5G工作负载高变化的主要来源。我们提出了一种为具有不同QoS要求的用户生成可参数化工作负载的方法,该方法使基带处理系统中的优化技术成为可能。我们展示了我们的方法的可行性,该方法基于使用异构硬件模型和各种最先进的映射策略的虚拟基站。
{"title":"Parameterizable mobile workloads for adaptable base station optimizations","authors":"Julian Robledo, J. Castrillón","doi":"10.1109/MCSoC57363.2022.00067","DOIUrl":"https://doi.org/10.1109/MCSoC57363.2022.00067","url":null,"abstract":"Recent works on 5G baseband processing systems address the optimization of applications with different require-ments of quality of service (QoS). The volume and heterogeneity of applications that have to be processed on a base station are growing and 5G introduces new use cases that push system designers towards more flexible and adaptable approaches. To investigate future network challenges of mobile communications, a good methodology for the generation of realistic workloads, that allows target optimizations of different traffic scenarios, is required. In this paper, we study the variation of real traffic data on multiple base stations and identify the main sources for the high variation of the 5G workloads. We propose a methodology for parameterizable workload generation for users with different QoS requirements that enables optimization techniques in base-band processing systems. We demonstrate the feasibility of our approach based on a virtual base station using a heterogeneous hardware model and various state-of-the-art mapping policies.","PeriodicalId":150801,"journal":{"name":"2022 IEEE 15th International Symposium on Embedded Multicore/Many-core Systems-on-Chip (MCSoC)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121826644","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Distributed Decision Fusion for Large Scale IoT- Ecosystem 大规模物联网生态系统的分布式决策融合
Ashwin Raut, Divesh Kumar, V. Chaurasiya, Manish Kumar
IoT data analytics have numerous applications that generate huge data to gain new insights and information. How-ever, this work remains challenging due to the heterogeneity of IoT data sources, unnecessary data processing, uncertainty in decision-making, data biasness, and ever-increasing data size. To overcome these challenges, we propose distributed decision fusion framework for the large-scale IoT ecosystem. The proposed framework has divided into three-level. The first and second level provides the local decision of the small individual ecosystem using the filter method-based feature selection and dynamic classifier selection criteria for decision making; whereas the third level fuses the collected decision from the small ecosystems using Majority voting, Weighted majority voting and distributed Naive Bayes classifier. Lastly, we illustrate performance of the proposed solution on the US-Accidents dataset.
物联网数据分析有许多应用程序,这些应用程序产生大量数据,以获得新的见解和信息。然而,由于物联网数据源的异质性、不必要的数据处理、决策的不确定性、数据偏差以及不断增加的数据大小,这项工作仍然具有挑战性。为了克服这些挑战,我们提出了大规模物联网生态系统的分布式决策融合框架。拟议的框架分为三个层次。第一级和第二级采用基于滤波方法的特征选择和动态分类器选择准则进行决策,提供小个体生态系统的局部决策;而第三层则使用多数投票、加权多数投票和分布式朴素贝叶斯分类器来融合从小生态系统收集的决策。最后,我们展示了所提出的解决方案在US-Accidents数据集上的性能。
{"title":"Distributed Decision Fusion for Large Scale IoT- Ecosystem","authors":"Ashwin Raut, Divesh Kumar, V. Chaurasiya, Manish Kumar","doi":"10.1109/MCSoC57363.2022.00027","DOIUrl":"https://doi.org/10.1109/MCSoC57363.2022.00027","url":null,"abstract":"IoT data analytics have numerous applications that generate huge data to gain new insights and information. How-ever, this work remains challenging due to the heterogeneity of IoT data sources, unnecessary data processing, uncertainty in decision-making, data biasness, and ever-increasing data size. To overcome these challenges, we propose distributed decision fusion framework for the large-scale IoT ecosystem. The proposed framework has divided into three-level. The first and second level provides the local decision of the small individual ecosystem using the filter method-based feature selection and dynamic classifier selection criteria for decision making; whereas the third level fuses the collected decision from the small ecosystems using Majority voting, Weighted majority voting and distributed Naive Bayes classifier. Lastly, we illustrate performance of the proposed solution on the US-Accidents dataset.","PeriodicalId":150801,"journal":{"name":"2022 IEEE 15th International Symposium on Embedded Multicore/Many-core Systems-on-Chip (MCSoC)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125545142","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A 94.5% Peak Efficiency, 14mV Output Ripple SC-Buck Step-Up Converter with 1.2-to-5V Output Achieving 20.2% Enhanced Power Efficiency in New PMU Architecture for SoCs 94.5%峰值效率,14mV输出纹波sc -降压升压转换器,1.2至5v输出,在新的soc PMU架构中实现20.2%的功率效率提升
Zhuoqi Guo, Yongchao Zhang, Meiling Hu, Zhongming Xue, Li Geng
For SoC and IoT systems, conventional power management unit (PMU) uses Buck converter and LDOs. With the increase of load current consumed by the digital circuits, the overall efficiency of the conventional architecture deteriorates due to the drop-voltage of the LDO. A new PMU architecture is proposed in this paper to break through the bottleneck of efficiency. The LDO is removed and a step-up converter provides the high voltage for the analog domain. The supply efficiency of the whole SoC could be improved because no LDO is needed for the current-hungry digital circuits. The new step-up converter named switched capacitor Buck (SCB) converter is presented by adding the switched capacitor network in the Buck converter. The most notable feature of SCB converter is that it achieves the step-up conversion with the Buck-like mode. It means that the transfer function, loop design, output ripple, and other characteristics are almost same as Buck converter, which help the implementation of proposed PMU. A prototype is implemented with a standard 0.18-μm CMOS technology. The whole system achieves high power efficiency of 94.5% For the system's power supply efficiency of SoC, the proposed PMU improves the efficiency by at least 12.5% than that of the conventional scheme.
对于SoC和物联网系统,传统的电源管理单元(PMU)使用Buck转换器和ldo。随着数字电路负载电流消耗的增加,由于LDO的降电压,传统架构的整体效率下降。为了突破效率瓶颈,本文提出了一种新的PMU结构。LDO被移除,一个升压转换器为模拟域提供高电压。由于电流饥渴型数字电路不需要LDO,因此可以提高整个SoC的供电效率。通过在Buck变换器中加入开关电容网络,提出了一种新的升压变换器——开关电容Buck变换器。SCB变换器最显著的特点是采用buck模式实现升压转换。这意味着传递函数、环路设计、输出纹波和其他特性几乎与Buck变换器相同,这有助于所提出的PMU的实现。样机采用标准的0.18 μm CMOS技术实现。对于系统SoC的供电效率,所提出的PMU比传统方案的效率提高了至少12.5%。
{"title":"A 94.5% Peak Efficiency, 14mV Output Ripple SC-Buck Step-Up Converter with 1.2-to-5V Output Achieving 20.2% Enhanced Power Efficiency in New PMU Architecture for SoCs","authors":"Zhuoqi Guo, Yongchao Zhang, Meiling Hu, Zhongming Xue, Li Geng","doi":"10.1109/MCSoC57363.2022.00034","DOIUrl":"https://doi.org/10.1109/MCSoC57363.2022.00034","url":null,"abstract":"For SoC and IoT systems, conventional power management unit (PMU) uses Buck converter and LDOs. With the increase of load current consumed by the digital circuits, the overall efficiency of the conventional architecture deteriorates due to the drop-voltage of the LDO. A new PMU architecture is proposed in this paper to break through the bottleneck of efficiency. The LDO is removed and a step-up converter provides the high voltage for the analog domain. The supply efficiency of the whole SoC could be improved because no LDO is needed for the current-hungry digital circuits. The new step-up converter named switched capacitor Buck (SCB) converter is presented by adding the switched capacitor network in the Buck converter. The most notable feature of SCB converter is that it achieves the step-up conversion with the Buck-like mode. It means that the transfer function, loop design, output ripple, and other characteristics are almost same as Buck converter, which help the implementation of proposed PMU. A prototype is implemented with a standard 0.18-μm CMOS technology. The whole system achieves high power efficiency of 94.5% For the system's power supply efficiency of SoC, the proposed PMU improves the efficiency by at least 12.5% than that of the conventional scheme.","PeriodicalId":150801,"journal":{"name":"2022 IEEE 15th International Symposium on Embedded Multicore/Many-core Systems-on-Chip (MCSoC)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132897232","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
2022 IEEE 15th International Symposium on Embedded Multicore/Many-core Systems-on-Chip (MCSoC)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1