首页 > 最新文献

2015 28th IEEE International System-on-Chip Conference (SOCC)最新文献

英文 中文
Multi-objective optimization of a low-noise antenna amplifier for multi-constellation satellite-navigation receivers 多星座卫星导航接收机低噪声天线放大器多目标优化
Pub Date : 2015-09-01 DOI: 10.1109/SOCC.2015.7406919
J. Dobes, J. Míchal, Jakub Popp, M. Grábner, F. Vejražka, J. Kákona
Although the major parts of function blocks for the satellite navigation receivers are fully integrated in a CMOS chip in most cases, it is convenient to create an antenna preamplifier as a separate circuit based on a low-noise pHEMT. Such an RF front end can be strongly optimized to attain a trade-off between the noise figure and transducer power gain. Furthermore, as all the principal navigation systems (GPS, GLONASS, Galileo, and Compass) work in similar frequency band (roughly from 1.1 to 1.7 GHz), it is reasonable to create this low-noise preamplifier for all of them. In the paper, a sophisticated method of the amplifier design is suggested based on multi-objective optimization. First, an extraction of pHEMT model parameters was performed, including comparisons among several models. The extraction was carried out by our original three-step robust identification procedure based on a combination of meta-heuristic and direct optimization methods. Second, a substantial improvement of a standard method for the multi-objective optimization is outlined. Third, the equations of passive elements of the circuit (including transmission lines and T splitters) were carefully defined using frequency dispersion of their parameters as Q, ESR, etc. Fourth, an optimal selection of the amplifier operating point and essential passive elements was performed using the previously improved goal attainment method. Finally, the s-parameters and noise figure of the proposed preamplifier were measured, and the third-order intermodulation products were also checked.
虽然卫星导航接收机功能模块的主要部分在大多数情况下完全集成在CMOS芯片中,但基于低噪声pHEMT创建天线前置放大器作为单独电路是方便的。这样的射频前端可以进行强优化,以实现噪声系数和换能器功率增益之间的权衡。此外,由于所有主要的导航系统(GPS, GLONASS, Galileo和Compass)工作在相似的频段(大约从1.1到1.7 GHz),因此为所有这些系统创建这种低噪声前置放大器是合理的。本文提出了一种基于多目标优化的放大器设计方法。首先,提取pHEMT模型参数,并对多个模型进行比较。提取过程采用原始的基于元启发式和直接优化方法相结合的三步鲁棒识别程序。其次,对多目标优化的标准方法进行了实质性改进。第三,仔细定义了电路中无源元件(包括传输线和T分路器)的方程,使用其参数Q、ESR等的频散度。第四,利用先前改进的目标实现方法对放大器工作点和基本无源元件进行优化选择。最后,测量了前置放大器的s参数和噪声系数,并对三阶互调积进行了检验。
{"title":"Multi-objective optimization of a low-noise antenna amplifier for multi-constellation satellite-navigation receivers","authors":"J. Dobes, J. Míchal, Jakub Popp, M. Grábner, F. Vejražka, J. Kákona","doi":"10.1109/SOCC.2015.7406919","DOIUrl":"https://doi.org/10.1109/SOCC.2015.7406919","url":null,"abstract":"Although the major parts of function blocks for the satellite navigation receivers are fully integrated in a CMOS chip in most cases, it is convenient to create an antenna preamplifier as a separate circuit based on a low-noise pHEMT. Such an RF front end can be strongly optimized to attain a trade-off between the noise figure and transducer power gain. Furthermore, as all the principal navigation systems (GPS, GLONASS, Galileo, and Compass) work in similar frequency band (roughly from 1.1 to 1.7 GHz), it is reasonable to create this low-noise preamplifier for all of them. In the paper, a sophisticated method of the amplifier design is suggested based on multi-objective optimization. First, an extraction of pHEMT model parameters was performed, including comparisons among several models. The extraction was carried out by our original three-step robust identification procedure based on a combination of meta-heuristic and direct optimization methods. Second, a substantial improvement of a standard method for the multi-objective optimization is outlined. Third, the equations of passive elements of the circuit (including transmission lines and T splitters) were carefully defined using frequency dispersion of their parameters as Q, ESR, etc. Fourth, an optimal selection of the amplifier operating point and essential passive elements was performed using the previously improved goal attainment method. Finally, the s-parameters and noise figure of the proposed preamplifier were measured, and the third-order intermodulation products were also checked.","PeriodicalId":329464,"journal":{"name":"2015 28th IEEE International System-on-Chip Conference (SOCC)","volume":"112 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131855865","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Session T4B: Tutorial: Emerging non-volatile memory: Device, circuit, and architecture 会话T4B:教程:新兴的非易失性存储器:器件、电路和体系结构
Pub Date : 2015-09-01 DOI: 10.1109/SOCC.2015.7406889
Guanyu Sun
In order to mitigate the problem of “memory wall”, various emerging non-volatile memory (NVM) technologies have been proposed to replace traditional ones. These emerging NVMs include STT-RAM, PCRAM, RRAM, RM, etc. Compared to traditional memory technologies, they have advantages of near-zero standby-power, high storage density, and non-volatility, which make them competitive for future memory hierarchy design. However, it is inefficient to directly apply these NVMs in existing memory architectures. On the one hand, these NVMs have their own limitations, such as long write latency, high write energy, limited write numbers, etc. Thus, proper architecture modification is required to adopt them into traditional memory hierarchy. On the other hand, the unique features of these NVMs enable new memory architectures in memory subsystem and also induce new challenges to be solved at the same time. In this tutorial, we first briefly review device level background of these emerging NVMs. Then, we introduce the tool NVMSim for their circuit level modeling. At last, we investigate their implication for memory architecture design.
为了缓解“内存墙”问题,人们提出了各种新兴的非易失性存储技术来取代传统的非易失性存储技术。这些新兴的nvm包括STT-RAM、PCRAM、RRAM、RM等。与传统存储技术相比,它们具有近乎零待机功耗、高存储密度和非易失性等优点,这使它们在未来的存储层次设计中具有竞争力。然而,在现有的内存体系结构中直接应用这些nvm是低效的。一方面,这些nvm有其自身的局限性,如写延迟长、写能量高、写数量有限等。因此,需要对体系结构进行适当的修改,以便将它们纳入传统的内存层次结构。另一方面,这些nvm的独特特性使得存储子系统中出现了新的存储架构,同时也带来了新的挑战。在本教程中,我们首先简要回顾这些新兴nvm的设备级背景。然后,我们介绍了NVMSim工具对它们的电路级建模。最后,探讨了它们对存储器体系结构设计的启示。
{"title":"Session T4B: Tutorial: Emerging non-volatile memory: Device, circuit, and architecture","authors":"Guanyu Sun","doi":"10.1109/SOCC.2015.7406889","DOIUrl":"https://doi.org/10.1109/SOCC.2015.7406889","url":null,"abstract":"In order to mitigate the problem of “memory wall”, various emerging non-volatile memory (NVM) technologies have been proposed to replace traditional ones. These emerging NVMs include STT-RAM, PCRAM, RRAM, RM, etc. Compared to traditional memory technologies, they have advantages of near-zero standby-power, high storage density, and non-volatility, which make them competitive for future memory hierarchy design. However, it is inefficient to directly apply these NVMs in existing memory architectures. On the one hand, these NVMs have their own limitations, such as long write latency, high write energy, limited write numbers, etc. Thus, proper architecture modification is required to adopt them into traditional memory hierarchy. On the other hand, the unique features of these NVMs enable new memory architectures in memory subsystem and also induce new challenges to be solved at the same time. In this tutorial, we first briefly review device level background of these emerging NVMs. Then, we introduce the tool NVMSim for their circuit level modeling. At last, we investigate their implication for memory architecture design.","PeriodicalId":329464,"journal":{"name":"2015 28th IEEE International System-on-Chip Conference (SOCC)","volume":"8 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134003594","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Formal equivalence checking between SLM and RTL descriptions SLM和RTL描述之间的形式化等价检验
Pub Date : 2015-09-01 DOI: 10.1109/SOCC.2015.7406927
Jian Hu, Tun Li, Sikun Li
The growing complexity of digital designs makes it harder to discover inconsistency between system level model (SLM) and register transfer level (RTL) model. Equivalence checking is a promising solution to verify that the RTL description meets the requirements of the corresponding SLM description. Finite State Machines with Data Paths (FSMD) based equivalence checking method is widely used in checking the equivalence between system level and RT level designs. The designs without mapping information can not be handled by many formal methods. Deep state sequences (DSS) based equivalence checking method is one of the state of the art FSMD-based methods that can handle designs without mapping information. DSS is the state sequence from start state to final state of FSMD without repeated paths. It proves the equivalence between SLM and RTL by comparing all the DSS-pairs of FSMDs in SLM and RTL. However the previous proposed DSS-based methods compared all the DSS-pairs blindly, which wasted most verification efforts on useless comparisons. This paper proposes a method to improve the DSS-based equivalence checking method by separating and comparing the corresponding potential equivalent DSS-pairs from all the generated paths to avoid blind comparisons. The promising experimental results show that the proposed method can improve the efficiency of DSS-based equivalence checking method.
随着数字化设计的日益复杂,系统级模型(SLM)和寄存器传输级模型(RTL)之间的不一致性越来越难发现。等价性检查是一种很有前途的解决方案,用于验证RTL描述是否满足相应的SLM描述的要求。基于数据路径有限状态机(FSMD)的等价性检验方法被广泛用于检验系统级和RT级设计之间的等价性。没有映射信息的设计是许多形式化方法无法处理的。基于深度状态序列(DSS)的等价性检验方法是目前基于fsmd的方法之一,可以处理没有映射信息的设计。DSS是FSMD从起始状态到最终状态的无重复路径的状态序列。通过比较SLM和RTL中fsmd的所有dss对,证明了SLM和RTL之间的等价性。然而,以往提出的基于dss的方法对所有dss对进行盲目比较,将大部分验证工作浪费在无用的比较上。本文提出了一种改进基于dss的等价性检验方法,从所有生成的路径中分离并比较对应的潜在等价dss对,以避免盲目比较。实验结果表明,该方法可以提高基于dss的等价性检验方法的效率。
{"title":"Formal equivalence checking between SLM and RTL descriptions","authors":"Jian Hu, Tun Li, Sikun Li","doi":"10.1109/SOCC.2015.7406927","DOIUrl":"https://doi.org/10.1109/SOCC.2015.7406927","url":null,"abstract":"The growing complexity of digital designs makes it harder to discover inconsistency between system level model (SLM) and register transfer level (RTL) model. Equivalence checking is a promising solution to verify that the RTL description meets the requirements of the corresponding SLM description. Finite State Machines with Data Paths (FSMD) based equivalence checking method is widely used in checking the equivalence between system level and RT level designs. The designs without mapping information can not be handled by many formal methods. Deep state sequences (DSS) based equivalence checking method is one of the state of the art FSMD-based methods that can handle designs without mapping information. DSS is the state sequence from start state to final state of FSMD without repeated paths. It proves the equivalence between SLM and RTL by comparing all the DSS-pairs of FSMDs in SLM and RTL. However the previous proposed DSS-based methods compared all the DSS-pairs blindly, which wasted most verification efforts on useless comparisons. This paper proposes a method to improve the DSS-based equivalence checking method by separating and comparing the corresponding potential equivalent DSS-pairs from all the generated paths to avoid blind comparisons. The promising experimental results show that the proposed method can improve the efficiency of DSS-based equivalence checking method.","PeriodicalId":329464,"journal":{"name":"2015 28th IEEE International System-on-Chip Conference (SOCC)","volume":"224 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132652454","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Statistical rare event analysis using smart sampling and parameter guidance 采用智能抽样和参数指导的统计罕见事件分析
Pub Date : 2015-09-01 DOI: 10.1109/SOCC.2015.7406910
Yue Zhao, Hosoon Shin, Hai-Bao Chen, S. Tan, G. Shi, Xin Li
In this paper, we propose a new efficient statistical method for failure probability estimation of analog circuits with rate failure events, which is a time-consuming process using the existing Monte Carlo method. On top of this, the new method can also provide the estimation of parameter regions to achieve targeted performance to facilitate the design process, which is missing in the traditional fast statistical methods such as the statistical blockage based method. The new method employs two new techniques to speed up the analysis. First, to reduce the large number of samples for rare event analysis, the new approach employs a smart sample selection scheme, which can consider the effectiveness of samples and well-coverage for the parameter space. As a result, it can reduce an additional simulation costs by pruning less effective samples while keeping the accuracy of failure estimation. Second, the new approach identifies the failure regions in terms of parameters to provide a good design guideline for designers and optimization tools. This is enabled by applying the variance based feature selection to find the dominant parameters. A quasi-random sampling with dominant parameters is then applied to determine in-spec boundaries of those parameters. In addition, we also provide the complete formula for the probability determinations of failure regions in the iterative failure region searching framework. We demonstrate the advantage of our proposed method using two test benches: 6T-SRAM reading failure diagnosis with 27 process parameters, charge pump operation failure diagnosis in a PLL circuit with 81 process parameters. Experimental results show that the new method can be 4X more accurate than the recently proposed REscope method. Furthermore, the new method reduces the simulation cost by 2X than the recursive statistical blockage (RSB) method with same accuracy level. Our approach also provides the precise guidance of diverse parameters with 1.21% estimation error.
本文提出了一种新的有效的统计方法来估计具有率失效事件的模拟电路的失效概率,而现有的蒙特卡罗方法是一个耗时的过程。除此之外,新方法还可以提供参数区域的估计,以达到目标性能,方便设计过程,这是传统快速统计方法如基于统计阻塞的方法所缺乏的。新方法采用了两种新技术来加快分析速度。首先,为了减少用于罕见事件分析的大量样本,该方法采用了一种智能样本选择方案,该方案可以考虑样本的有效性和参数空间的良好覆盖率。因此,在保持故障估计的准确性的同时,它可以通过修剪不太有效的样本来减少额外的模拟成本。其次,新方法根据参数确定故障区域,为设计人员和优化工具提供良好的设计指导。这可以通过应用基于方差的特征选择来找到主要参数来实现。然后应用具有优势参数的准随机抽样来确定这些参数的规格边界。此外,我们还提供了迭代失效区域搜索框架中失效区域概率确定的完整公式。我们通过两个测试平台证明了我们所提出的方法的优势:包含27个工艺参数的6T-SRAM读取故障诊断,以及包含81个工艺参数的锁相环电路中的电荷泵操作故障诊断。实验结果表明,新方法的精度比目前提出的recope方法提高了4倍。此外,在相同精度水平下,该方法比递归统计阻塞(RSB)方法的仿真成本降低了2倍。该方法在不同参数下也能提供精确的制导,估计误差为1.21%。
{"title":"Statistical rare event analysis using smart sampling and parameter guidance","authors":"Yue Zhao, Hosoon Shin, Hai-Bao Chen, S. Tan, G. Shi, Xin Li","doi":"10.1109/SOCC.2015.7406910","DOIUrl":"https://doi.org/10.1109/SOCC.2015.7406910","url":null,"abstract":"In this paper, we propose a new efficient statistical method for failure probability estimation of analog circuits with rate failure events, which is a time-consuming process using the existing Monte Carlo method. On top of this, the new method can also provide the estimation of parameter regions to achieve targeted performance to facilitate the design process, which is missing in the traditional fast statistical methods such as the statistical blockage based method. The new method employs two new techniques to speed up the analysis. First, to reduce the large number of samples for rare event analysis, the new approach employs a smart sample selection scheme, which can consider the effectiveness of samples and well-coverage for the parameter space. As a result, it can reduce an additional simulation costs by pruning less effective samples while keeping the accuracy of failure estimation. Second, the new approach identifies the failure regions in terms of parameters to provide a good design guideline for designers and optimization tools. This is enabled by applying the variance based feature selection to find the dominant parameters. A quasi-random sampling with dominant parameters is then applied to determine in-spec boundaries of those parameters. In addition, we also provide the complete formula for the probability determinations of failure regions in the iterative failure region searching framework. We demonstrate the advantage of our proposed method using two test benches: 6T-SRAM reading failure diagnosis with 27 process parameters, charge pump operation failure diagnosis in a PLL circuit with 81 process parameters. Experimental results show that the new method can be 4X more accurate than the recently proposed REscope method. Furthermore, the new method reduces the simulation cost by 2X than the recursive statistical blockage (RSB) method with same accuracy level. Our approach also provides the precise guidance of diverse parameters with 1.21% estimation error.","PeriodicalId":329464,"journal":{"name":"2015 28th IEEE International System-on-Chip Conference (SOCC)","volume":"38 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122730503","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Instruction decoders based on pattern factorization 基于模式分解的指令解码器
Pub Date : 2015-09-01 DOI: 10.1109/SOCC.2015.7406936
R. Santos, Renan Marks, Rafael Alves, Felipe Araujo, Renato Santos
This work presents the design of hardware instruction decoders based on the Pattern Based Instruction Word (PBIW) encoding technique. Instruction decoder circuits have been designed in the datapath of ρ-VEX and the Leon3 soft-core embedded processors. The PBIW encoding scheme focuses on extracting out patterns from original instructions at compiler time. The PBIW hardware decoder works on the processor datapath simplifying the decoding instruction logic by exploring the hardware parallelism between instruction decoding and register read. The experiments show that the instruction decoders based on the PBIW technique present small impacts on area, dynamic power, and timing (3%-10% decrease on clock frequency) on the processor design.
本文提出了基于模式指令字(PBIW)编码技术的硬件指令解码器的设计。在ρ-VEX和Leon3软核嵌入式处理器的数据通路上设计了指令解码器电路。PBIW编码方案侧重于在编译时从原始指令中提取模式。PBIW硬件解码器工作在处理器数据路径上,通过探索指令解码和寄存器读取之间的硬件并行性,简化了解码指令逻辑。实验表明,基于PBIW技术的指令解码器对处理器设计的面积、动态功率和时序影响较小(时钟频率降低3% ~ 10%)。
{"title":"Instruction decoders based on pattern factorization","authors":"R. Santos, Renan Marks, Rafael Alves, Felipe Araujo, Renato Santos","doi":"10.1109/SOCC.2015.7406936","DOIUrl":"https://doi.org/10.1109/SOCC.2015.7406936","url":null,"abstract":"This work presents the design of hardware instruction decoders based on the Pattern Based Instruction Word (PBIW) encoding technique. Instruction decoder circuits have been designed in the datapath of ρ-VEX and the Leon3 soft-core embedded processors. The PBIW encoding scheme focuses on extracting out patterns from original instructions at compiler time. The PBIW hardware decoder works on the processor datapath simplifying the decoding instruction logic by exploring the hardware parallelism between instruction decoding and register read. The experiments show that the instruction decoders based on the PBIW technique present small impacts on area, dynamic power, and timing (3%-10% decrease on clock frequency) on the processor design.","PeriodicalId":329464,"journal":{"name":"2015 28th IEEE International System-on-Chip Conference (SOCC)","volume":"566 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116248033","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Floorplan and congestion aware framework for optimal SRAM selection for memory subsystems 内存子系统最佳SRAM选择的平面图和拥塞感知框架
Pub Date : 2015-09-01 DOI: 10.1109/SOCC.2015.7406922
Gaurav Narang, Alexander Fell, P. Gupta, Anuj Grover
Embedded memories are the key contributor to the chip area, dynamic power dissipation and also form a significant part of critical path for high performance advanced SoCs. Therefore, optimal selection of memory instances becomes imperative for SoC designers. While EDA tools have evolved over the past years to optimally select standard logic cells depending on the timing and the power constraints, optimal memory selection is largely a manual process. We propose a framework to optimize power, performance, and area (PPA) of a memory subsystem (MSS) by including floorplan dependent delays and power consumption in interconnects and glue logic of the MSS in the pre-RTL stage. Through this framework, we demonstrate that for a 4 Mb assembly of SRAM instances, dynamic power is reduced by 44%, area by 49%, and leakage by 71% with the floorplan aware selection. The framework has the capability to use different estimates, when routing congestion is important (for example, in low cost processes with less number of metal layers). We also show that the interconnect delays are reduced by about 68% and dynamic power by 58%, if additional metal layers are available for routing compared to a low cost 6 metal process.
嵌入式存储器是芯片面积、动态功耗的关键贡献者,也是高性能先进soc关键路径的重要组成部分。因此,对于SoC设计人员来说,最佳选择内存实例变得至关重要。虽然EDA工具在过去的几年里已经发展到根据时间和功率限制来最佳地选择标准逻辑单元,但最佳内存选择主要是一个手动过程。我们提出了一个框架来优化内存子系统(MSS)的功率、性能和面积(PPA),该框架包括平面图相关的延迟和互连中的功耗,以及MSS在预rtl阶段的粘合逻辑。通过这个框架,我们证明了对于一个4mb的SRAM实例组件,在平面感知选择下,动态功率减少44%,面积减少49%,泄漏减少71%。当路由拥塞很重要时(例如,在具有较少金属层的低成本流程中),框架具有使用不同估计的能力。我们还表明,与低成本的6金属工艺相比,如果额外的金属层可用于路由,则互连延迟减少了约68%,动态功率减少了58%。
{"title":"Floorplan and congestion aware framework for optimal SRAM selection for memory subsystems","authors":"Gaurav Narang, Alexander Fell, P. Gupta, Anuj Grover","doi":"10.1109/SOCC.2015.7406922","DOIUrl":"https://doi.org/10.1109/SOCC.2015.7406922","url":null,"abstract":"Embedded memories are the key contributor to the chip area, dynamic power dissipation and also form a significant part of critical path for high performance advanced SoCs. Therefore, optimal selection of memory instances becomes imperative for SoC designers. While EDA tools have evolved over the past years to optimally select standard logic cells depending on the timing and the power constraints, optimal memory selection is largely a manual process. We propose a framework to optimize power, performance, and area (PPA) of a memory subsystem (MSS) by including floorplan dependent delays and power consumption in interconnects and glue logic of the MSS in the pre-RTL stage. Through this framework, we demonstrate that for a 4 Mb assembly of SRAM instances, dynamic power is reduced by 44%, area by 49%, and leakage by 71% with the floorplan aware selection. The framework has the capability to use different estimates, when routing congestion is important (for example, in low cost processes with less number of metal layers). We also show that the interconnect delays are reduced by about 68% and dynamic power by 58%, if additional metal layers are available for routing compared to a low cost 6 metal process.","PeriodicalId":329464,"journal":{"name":"2015 28th IEEE International System-on-Chip Conference (SOCC)","volume":"147 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131522394","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Design of a 12-bit 0.83 MS/s SAR ADC for an IPMI SoC 用于IPMI SoC的12位0.83 MS/s SAR ADC的设计
Pub Date : 2015-09-01 DOI: 10.1109/SOCC.2015.7406935
Han Zhou, Xiaoyan Gui, Peng Gao
A 12-bit successive approximation register analog to digital converter (SAR ADC) built in an intelligent platform management interface (IPMI) system on chip (SoC) is proposed in this paper. The work is designed and fabricated in SMIC 0.13 μm CMOS process, using the fully differential C-R hybrid digital to analog converter (DAC) structure. Scaling capacitor is adopted to improve the precision and reduce the chip area. Through the optimization of placement and routing, the DAC achieves high precision. Besides, a multi-stage comparator is designed, and an offset calibration technique with capacitors is applied, too. The measurement results show that the effective number of bits (ENOB) of this ADC reaches 9.711 bit at Nyquist frequency with 0.83 MHz sampling frequency and the total current of the ADC is 970 μA with a 3.3 V power supply.
提出了一种内置在片上智能平台管理接口(IPMI)系统中的12位逐次逼近寄存器模拟数字转换器(SAR ADC)。该器件采用中芯国际0.13 μm CMOS工艺,采用全差分C-R混合数模转换器(DAC)结构设计制作。采用缩放电容,提高了精度,减小了芯片面积。通过布局和布线的优化,实现了高精度。此外,还设计了多级比较器,并采用了电容偏置校准技术。测量结果表明,在0.83 MHz采样频率下,该ADC在奈奎斯特频率下的有效位元数(ENOB)达到9.711 bit,在3.3 V电源下,ADC的总电流为970 μA。
{"title":"Design of a 12-bit 0.83 MS/s SAR ADC for an IPMI SoC","authors":"Han Zhou, Xiaoyan Gui, Peng Gao","doi":"10.1109/SOCC.2015.7406935","DOIUrl":"https://doi.org/10.1109/SOCC.2015.7406935","url":null,"abstract":"A 12-bit successive approximation register analog to digital converter (SAR ADC) built in an intelligent platform management interface (IPMI) system on chip (SoC) is proposed in this paper. The work is designed and fabricated in SMIC 0.13 μm CMOS process, using the fully differential C-R hybrid digital to analog converter (DAC) structure. Scaling capacitor is adopted to improve the precision and reduce the chip area. Through the optimization of placement and routing, the DAC achieves high precision. Besides, a multi-stage comparator is designed, and an offset calibration technique with capacitors is applied, too. The measurement results show that the effective number of bits (ENOB) of this ADC reaches 9.711 bit at Nyquist frequency with 0.83 MHz sampling frequency and the total current of the ADC is 970 μA with a 3.3 V power supply.","PeriodicalId":329464,"journal":{"name":"2015 28th IEEE International System-on-Chip Conference (SOCC)","volume":"36 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131803235","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
"High-level synthesis and beyond - From datacenters to IoTs" “高级综合及超越——从数据中心到物联网”
Pub Date : 2015-09-01 DOI: 10.1109/SOCC.2015.7406893
J. Cong
Jason Cong received his B.S. degree in computer science from Peking University in 1985, his M.S. and Ph. D. degrees in computer science from the University of Illinois at Urbana-Champaign in 1987 and 1990, respectively. Currently, he is a Chancellor's Professor at the UCLA Computer Science Department and the director of Center for Domain-Specific Computing (CDSC). He served as the department chair from 2005 to 2008. Dr. Cong's research interests include synthesis of VLSI circuits and systems, energy-efficient computer architectures, reconfigurable systems, nanotechnology and systems, and highly scalable algorithms. He has over 400 publications in these areas, including 10 best paper awards, and the 2011 ACM/IEEE A. Richard Newton Technical Impact Award in Electric Design Automation. He was elected to an IEEE Fellow in 2000 and ACM Fellow in 2008. He is the recipient of the 2010 IEEE Circuits and System Society Technical Achievement Award “For seminal contributions to electronic design automation, especially in FPGA synthesis, VLSI interconnect optimization, and physical design automation.”
Jason Cong, 1985年获得北京大学计算机科学学士学位,1987年和1990年分别获得伊利诺伊大学厄巴纳-香槟分校计算机科学硕士和博士学位。目前,他是加州大学洛杉矶分校计算机科学系的校长教授和特定领域计算中心(CDSC)主任。2005年至2008年担任系主任。他的研究兴趣包括VLSI电路和系统的合成,节能计算机架构,可重构系统,纳米技术和系统,以及高度可扩展的算法。他在这些领域发表了400多篇论文,其中包括10篇最佳论文奖,以及2011年ACM/IEEE A.理查德牛顿电气设计自动化技术影响奖。2000年当选为IEEE Fellow, 2008年当选为ACM Fellow。他是2010年IEEE电路与系统学会技术成就奖的获得者,因为他对电子设计自动化的开创性贡献,特别是在FPGA合成、VLSI互连优化和物理设计自动化方面。
{"title":"\"High-level synthesis and beyond - From datacenters to IoTs\"","authors":"J. Cong","doi":"10.1109/SOCC.2015.7406893","DOIUrl":"https://doi.org/10.1109/SOCC.2015.7406893","url":null,"abstract":"Jason Cong received his B.S. degree in computer science from Peking University in 1985, his M.S. and Ph. D. degrees in computer science from the University of Illinois at Urbana-Champaign in 1987 and 1990, respectively. Currently, he is a Chancellor's Professor at the UCLA Computer Science Department and the director of Center for Domain-Specific Computing (CDSC). He served as the department chair from 2005 to 2008. Dr. Cong's research interests include synthesis of VLSI circuits and systems, energy-efficient computer architectures, reconfigurable systems, nanotechnology and systems, and highly scalable algorithms. He has over 400 publications in these areas, including 10 best paper awards, and the 2011 ACM/IEEE A. Richard Newton Technical Impact Award in Electric Design Automation. He was elected to an IEEE Fellow in 2000 and ACM Fellow in 2008. He is the recipient of the 2010 IEEE Circuits and System Society Technical Achievement Award “For seminal contributions to electronic design automation, especially in FPGA synthesis, VLSI interconnect optimization, and physical design automation.”","PeriodicalId":329464,"journal":{"name":"2015 28th IEEE International System-on-Chip Conference (SOCC)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132134731","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A 802.15.3c/802.11ad compliant 24 Gb/s FFT processor for 60 GHz communication systems 符合802.15.3c/802.11ad标准的24gb /s FFT处理器,适用于60 GHz通信系统
Pub Date : 2015-09-01 DOI: 10.1109/SOCC.2015.7406907
Henry Lopez Davila, Chun-Yi Liu, Wei-Chang Liu, Shen-Jui Huang, S. Jou, Sau-Gee Chen
This paper, we present a 24 Gb/s 512-point 8X-parallel FFT processor for 60 GHz communication systems. The proposed design is a pipelined Multipath Delay Feedback (MDF) radix-23 architecture, which exploits the parallelism of the multipath scheme together with pipeline technique to achieve a high throughput rate. Besides, the proposed FFT processor is implemented with an area efficient optimized multiplier architecture that avoid the need to store the twiddle factor in memory and a dynamic scaling technique to enhance the SQNR, allowing the FFT to operate with 16-QAM and 64-QAM for single carrier (SC) and orthogonal frequency-division multiplexing(OFDM) schemes. This FFT processor has been implemented in a SC/OFDM dual-mode baseband receiver satisfying the requirements of the 802.15.3c/802.11ad standard with a 40nm CMOS process. The post-layout implementation results show that the proposed FFT processor is able to achieve up to 24 Gb/s throughput rate at 500MHz clock with a power consumption of 87mW and area of 0.64mm2.
本文提出了一种用于60ghz通信系统的24gb /s 512点8x并行FFT处理器。提出的多路径延迟反馈(MDF)基数-23的流水线架构,利用多路径方案的并行性和流水线技术来实现高吞吐率。此外,所提出的FFT处理器采用面积高效优化乘法器架构实现,避免了在内存中存储旋转因子的需要,并采用动态缩放技术增强了SQNR,使FFT能够在单载波(SC)和正交频分复用(OFDM)方案下使用16-QAM和64-QAM。该FFT处理器已在满足802.15.3c/802.11ad标准要求的SC/OFDM双模基带接收机上实现,采用40nm CMOS工艺。布局后的实现结果表明,所提出的FFT处理器在500MHz时钟下能够实现高达24gb /s的吞吐率,功耗为87mW,面积为0.64mm2。
{"title":"A 802.15.3c/802.11ad compliant 24 Gb/s FFT processor for 60 GHz communication systems","authors":"Henry Lopez Davila, Chun-Yi Liu, Wei-Chang Liu, Shen-Jui Huang, S. Jou, Sau-Gee Chen","doi":"10.1109/SOCC.2015.7406907","DOIUrl":"https://doi.org/10.1109/SOCC.2015.7406907","url":null,"abstract":"This paper, we present a 24 Gb/s 512-point 8X-parallel FFT processor for 60 GHz communication systems. The proposed design is a pipelined Multipath Delay Feedback (MDF) radix-23 architecture, which exploits the parallelism of the multipath scheme together with pipeline technique to achieve a high throughput rate. Besides, the proposed FFT processor is implemented with an area efficient optimized multiplier architecture that avoid the need to store the twiddle factor in memory and a dynamic scaling technique to enhance the SQNR, allowing the FFT to operate with 16-QAM and 64-QAM for single carrier (SC) and orthogonal frequency-division multiplexing(OFDM) schemes. This FFT processor has been implemented in a SC/OFDM dual-mode baseband receiver satisfying the requirements of the 802.15.3c/802.11ad standard with a 40nm CMOS process. The post-layout implementation results show that the proposed FFT processor is able to achieve up to 24 Gb/s throughput rate at 500MHz clock with a power consumption of 87mW and area of 0.64mm2.","PeriodicalId":329464,"journal":{"name":"2015 28th IEEE International System-on-Chip Conference (SOCC)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129923680","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Session T3A: Tutorial: From frequency to time-average-frequency: A paradigm shift in the design of electronic systems 会议T3A:教程:从频率到时间平均频率:电子系统设计的范式转变
Pub Date : 2015-04-22 DOI: 10.1002/9781119102175
Liming Xiu
Clock technology is one of the four fundamental technologies in the field of IC design. It is the timekeeper and the driver of everything inside a silicon chip. Time-Average-Frequency Direct Period Synthesis is an emerging technology in the field of on-chip frequency synthesis (clock generation). It comprises a new concept, a rigorous mathematical theory, and a novel circuit architecture. Its aim is the two long-lasting problems in this field: arbitrary frequency generation and instantaneous frequency switching. The goal is to achieve these two features on-a-chip simultaneously and at a reasonable cost so that SoC chip architects have a powerful clock generator in their hands to create their innovations at higher levels. It is a component-level enabler for chip-system-architecture level innovations. On a large scale, since clock is omnipresent in electronic system, this technology introduces a paradigm shift in electronic system design. In this tutorial, the Time-Average-Frequency concept will be explained. The circuit architecture will be briefly reviewed. Its impact, the route from component-level enabler to system level innovations, will be illustrated through real application examples. The newly published book in May 2015: “From Frequency to Time-Average-Frequency: A Paradigm Shift in the Design of Electronic System (IEEE Press Series on Microelectronic Systems)” will be provided to each tutorial attendant.
时钟技术是集成电路设计领域的四大基础技术之一。它是计时器,也是硅芯片内一切事物的驱动器。时间均频直接周期合成是片上频率合成(时钟生成)领域的一项新兴技术。它包括一个新的概念,一个严格的数学理论,和一个新颖的电路结构。它的目标是解决该领域长期存在的两个问题:任意频率产生和瞬时频率切换。目标是以合理的成本同时在单片上实现这两个功能,以便SoC芯片架构师拥有强大的时钟生成器,从而在更高级别上进行创新。它是芯片系统架构级创新的组件级推动者。在大范围内,由于时钟在电子系统中无处不在,该技术引入了电子系统设计的范式转变。在本教程中,将解释时间平均频率的概念。我们将简要回顾电路结构。它的影响,即从组件级推动者到系统级创新的路线,将通过实际应用实例加以说明。2015年5月新出版的《从频率到时间平均频率:电子系统设计中的范式转变》(IEEE出版社微电子系统系列)将提供给每位辅导班。
{"title":"Session T3A: Tutorial: From frequency to time-average-frequency: A paradigm shift in the design of electronic systems","authors":"Liming Xiu","doi":"10.1002/9781119102175","DOIUrl":"https://doi.org/10.1002/9781119102175","url":null,"abstract":"Clock technology is one of the four fundamental technologies in the field of IC design. It is the timekeeper and the driver of everything inside a silicon chip. Time-Average-Frequency Direct Period Synthesis is an emerging technology in the field of on-chip frequency synthesis (clock generation). It comprises a new concept, a rigorous mathematical theory, and a novel circuit architecture. Its aim is the two long-lasting problems in this field: arbitrary frequency generation and instantaneous frequency switching. The goal is to achieve these two features on-a-chip simultaneously and at a reasonable cost so that SoC chip architects have a powerful clock generator in their hands to create their innovations at higher levels. It is a component-level enabler for chip-system-architecture level innovations. On a large scale, since clock is omnipresent in electronic system, this technology introduces a paradigm shift in electronic system design. In this tutorial, the Time-Average-Frequency concept will be explained. The circuit architecture will be briefly reviewed. Its impact, the route from component-level enabler to system level innovations, will be illustrated through real application examples. The newly published book in May 2015: “From Frequency to Time-Average-Frequency: A Paradigm Shift in the Design of Electronic System (IEEE Press Series on Microelectronic Systems)” will be provided to each tutorial attendant.","PeriodicalId":329464,"journal":{"name":"2015 28th IEEE International System-on-Chip Conference (SOCC)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-04-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115770800","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
期刊
2015 28th IEEE International System-on-Chip Conference (SOCC)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1