2007 25th International Conference on Computer Design最新文献

英文中文

Bounded model checking of embedded software in wireless cognitive radio systems 无线认知无线电系统中嵌入式软件的有界模型检验

2007 25th International Conference on Computer Design

Pub Date : 2007-10-01 DOI: 10.1109/ICCD.2007.4601875

Nannan He, M. Hsiao

We present a new verification approach that applies aggressive program slicing and a proof-based abstraction-refinement strategy to enhance the scalability of bounded model checking of embedded software. While many software model-checking tools use program slicing as a separate or optional step, our program slicing is integrated in the model construction and reduction process. And it is combined with the compilation optimization techniques so to compute a more accurate slice. We also explore a proof-based abstraction-refinement strategy using the under/overapproximation on our proposed software model, and propose a heuristic method of deciding new encoding size to refine the under-approximation. Experiments on C programs from wireless cognitive radio systems show this approach can greatly reduce the model size and shorten the solving time by the SAT-solver.

本文提出了一种新的验证方法，该方法采用主动程序切片和基于证明的抽象细化策略来增强嵌入式软件有界模型检验的可扩展性。虽然许多软件模型检查工具将程序切片作为单独或可选的步骤使用，但我们的程序切片集成在模型构建和约简过程中。并与编译优化技术相结合，计算出更精确的切片。我们还在我们提出的软件模型上探索了一种基于证明的抽象优化策略，该策略使用了欠逼近/过度逼近，并提出了一种确定新编码大小的启发式方法来改进欠逼近。在无线认知无线电系统的C程序上进行的实验表明，该方法可以大大减小模型尺寸，缩短求解时间。

引用次数: 13

Combining cluster sampling with single pass methods for efficient sampling regimen design 将聚类抽样与单次抽样相结合，设计有效的抽样方案

2007 25th International Conference on Computer Design

Pub Date : 2007-10-01 DOI: 10.1109/ICCD.2007.4601941

Paul D. Bryan, T. Conte

Microarchitectural simulation is orders of magnitude slower than native execution. As more elements are accurately modeled, problems associated with slow simulation are further exacerbated. Given these issues, many researchers have devised sampling techniques to reduce simulation time. When cluster sampling techniques are used, care must be taken to remove sampling and non-sampling biases. Researchers have devised clever methods for effectively reducing non-sampling bias, but little work has been proposed for efficient reduction of sampling bias (sampling regimen design). Traditionally, sampling regimen design has been an iterative process that required a full workload simulation for error comparison. In this study, a single-pass simulation technique for sampling regimen design is proposed. Using this method, thousands of sampling regimen candidates can be simultaneously evaluated. With this technique, simulation speed was increased by an average factor of 17 with a maximum increase of 73 times relative to the total workload simulation. Additionally, this technique allows the user to effectively estimate the sample error without running the entire workload.

微架构模拟比本地执行要慢几个数量级。随着越来越多的元素被精确建模，与缓慢模拟相关的问题将进一步加剧。考虑到这些问题，许多研究人员设计了采样技术来减少模拟时间。当使用聚类抽样技术时，必须注意去除抽样和非抽样偏差。研究者们已经设计出了一些巧妙的方法来有效地减少非抽样偏差，但对于有效地减少抽样偏差(抽样方案设计)的研究却很少。传统上，采样方案设计是一个迭代过程，需要全工作量模拟进行误差比较。本研究提出了一种单次模拟采样方案设计技术。使用这种方法，可以同时评估数千个采样方案候选。使用这种技术，相对于总工作负载模拟，仿真速度平均提高了17倍，最大提高了73倍。此外，该技术允许用户在不运行整个工作负载的情况下有效地估计样本误差。

{"title":"Combining cluster sampling with single pass methods for efficient sampling regimen design","authors":"Paul D. Bryan, T. Conte","doi":"10.1109/ICCD.2007.4601941","DOIUrl":"https://doi.org/10.1109/ICCD.2007.4601941","url":null,"abstract":"Microarchitectural simulation is orders of magnitude slower than native execution. As more elements are accurately modeled, problems associated with slow simulation are further exacerbated. Given these issues, many researchers have devised sampling techniques to reduce simulation time. When cluster sampling techniques are used, care must be taken to remove sampling and non-sampling biases. Researchers have devised clever methods for effectively reducing non-sampling bias, but little work has been proposed for efficient reduction of sampling bias (sampling regimen design). Traditionally, sampling regimen design has been an iterative process that required a full workload simulation for error comparison. In this study, a single-pass simulation technique for sampling regimen design is proposed. Using this method, thousands of sampling regimen candidates can be simultaneously evaluated. With this technique, simulation speed was increased by an average factor of 17 with a maximum increase of 73 times relative to the total workload simulation. Additionally, this technique allows the user to effectively estimate the sample error without running the entire workload.","PeriodicalId":6306,"journal":{"name":"2007 25th International Conference on Computer Design","volume":"6 1","pages":"472-479"},"PeriodicalIF":0.0,"publicationDate":"2007-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"91072438","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 5

2007 25th International Conference on Computer Design

Pub Date : 2007-10-01 DOI: 10.1109/ICCD.2007.4601957

N. Chabini, W. Wolf

An important problem is how to carry out register binding such that any register has to be bound to a set of variables such that the difference between their sizes is as small as possible. For the case of hardware implementations, satisfying this latter constraint will allow to reduce the complexity of the clock generation tree, and saving area occupied by the registers which is very important for the case of system-on-chip and some embedded systems. When registers are already built, satisfying this latter constraint would allow reducing power consumption due to useless switching activities that will happen into any register that is bound to variables with different sizes. Assuming that the size in bits of any variable is known, we propose in this paper exact algorithms to optimally solve this problem for the case of acyclic graphs. An extended version of this problem is how to solve it while controlling the number of variables to be assigned to a same register. We also propose exact algorithms to optimally solve this latter version of the problem. Experimental results are provided. We also test the impact of the proposed approach in the case of a hardware implementation using the design analyzer tool from Synopsys Inc.. Obtained results have shown that both area and power consumption have been reduced.

一个重要的问题是如何执行寄存器绑定，使任何寄存器都必须绑定到一组变量，从而使它们之间的大小差异尽可能小。对于硬件实现来说，满足后一种约束可以降低时钟生成树的复杂性，并节省寄存器占用的面积，这对于片上系统和某些嵌入式系统来说是非常重要的。当已经构建寄存器时，满足后一种约束将允许减少由于无用的切换活动而导致的功耗，这些活动将发生在绑定到不同大小变量的任何寄存器中。假设任意变量的比特大小是已知的，本文提出了一种精确的算法来最优地解决无环图的这个问题。这个问题的扩展版本是如何在控制要分配给同一寄存器的变量数量的同时解决它。我们还提出了精确的算法来最优地解决后一个版本的问题。给出了实验结果。我们还使用Synopsys Inc.的设计分析工具测试了在硬件实现的情况下所提出的方法的影响。得到的结果表明，面积和功耗都有所降低。

{"title":"Register binding guided by the size of variables","authors":"N. Chabini, W. Wolf","doi":"10.1109/ICCD.2007.4601957","DOIUrl":"https://doi.org/10.1109/ICCD.2007.4601957","url":null,"abstract":"An important problem is how to carry out register binding such that any register has to be bound to a set of variables such that the difference between their sizes is as small as possible. For the case of hardware implementations, satisfying this latter constraint will allow to reduce the complexity of the clock generation tree, and saving area occupied by the registers which is very important for the case of system-on-chip and some embedded systems. When registers are already built, satisfying this latter constraint would allow reducing power consumption due to useless switching activities that will happen into any register that is bound to variables with different sizes. Assuming that the size in bits of any variable is known, we propose in this paper exact algorithms to optimally solve this problem for the case of acyclic graphs. An extended version of this problem is how to solve it while controlling the number of variables to be assigned to a same register. We also propose exact algorithms to optimally solve this latter version of the problem. Experimental results are provided. We also test the impact of the proposed approach in the case of a hardware implementation using the design analyzer tool from Synopsys Inc.. Obtained results have shown that both area and power consumption have been reduced.","PeriodicalId":6306,"journal":{"name":"2007 25th International Conference on Computer Design","volume":"31 1","pages":"587-594"},"PeriodicalIF":0.0,"publicationDate":"2007-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"90108705","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 3

The challenge in testing MIMO in a Wi-Fi or WiMAX context 在Wi-Fi或WiMAX环境下测试MIMO的挑战

2007 25th International Conference on Computer Design

Pub Date : 2007-10-01 DOI: 10.1109/ICCD.2007.4601904

K. Vandrup

This session will include two invited industry presentations on challenges ahead in the design, manufacturing, and testing of wireless communications devices.

本次会议将包括两场受邀的行业演讲，内容涉及无线通信设备的设计、制造和测试方面的挑战。

引用次数: 0

Hardware design of a Binary Integer Decimal-based floating-point adder 基于二进制整数十进制的浮点加法器的硬件设计

2007 25th International Conference on Computer Design

Pub Date : 2007-10-01 DOI: 10.1109/ICCD.2007.4601915

C. Tsen, S. González-Navarro, M. Schulte

Because of the growing importance of decimal floating-point (DFP) arithmetic, specifications for it are included in the IEEE Draft Standard for Floating-point Arithmetic (IEEE P754). In this paper, we present a novel algorithm and hardware design for a DFP adder. The adder performs addition and subtraction on 64-bit operands that use the IEEE P754 binary encoding of DFP numbers, widely known as the binary integer decimal (BID) encoding. The BID adder uses a novel hardware component for decimal digit counting and an enhanced version of a previously published BID rounding unit. By adding more sophisticated control, operations are performed with variable latency to optimize for common cases. We show that a BID-based DFP adder design can be achieved with a modest area increase compared to a single 2-stage pipelined 64-bit fixed-point multiplier. Over 70% of the BID adderpsilas area is due the 64-bit fixed-point multiplier, which can be shared with a binary floating-point multiplier and hardware for other DFP operations. To our knowledge, this is the first hardware design for adding and subtracting IEEE P754 BID-encoded DFP numbers.

由于十进制浮点(DFP)算法的重要性日益增加，它的规范被包含在IEEE浮点算术标准草案(IEEE P754)中。本文提出了一种DFP加法器的新算法和硬件设计。加法器对使用IEEE P754 DFP数字二进制编码的64位操作数执行加法和减法，该编码被广泛称为二进制整数十进制(BID)编码。BID加法器使用新颖的硬件组件进行十进制数字计数和先前发布的BID舍入单元的增强版本。通过添加更复杂的控制，操作以可变延迟执行，以针对常见情况进行优化。我们表明，与单个2级流水线64位定点乘法器相比，基于bid的DFP加法器设计可以实现适度的面积增加。超过70%的BID附加区域是64位定点乘法器，它可以与二进制浮点乘法器和用于其他DFP操作的硬件共享。据我们所知，这是第一个用于添加和减去IEEE P754 bid编码DFP号码的硬件设计。

{"title":"Hardware design of a Binary Integer Decimal-based floating-point adder","authors":"C. Tsen, S. González-Navarro, M. Schulte","doi":"10.1109/ICCD.2007.4601915","DOIUrl":"https://doi.org/10.1109/ICCD.2007.4601915","url":null,"abstract":"Because of the growing importance of decimal floating-point (DFP) arithmetic, specifications for it are included in the IEEE Draft Standard for Floating-point Arithmetic (IEEE P754). In this paper, we present a novel algorithm and hardware design for a DFP adder. The adder performs addition and subtraction on 64-bit operands that use the IEEE P754 binary encoding of DFP numbers, widely known as the binary integer decimal (BID) encoding. The BID adder uses a novel hardware component for decimal digit counting and an enhanced version of a previously published BID rounding unit. By adding more sophisticated control, operations are performed with variable latency to optimize for common cases. We show that a BID-based DFP adder design can be achieved with a modest area increase compared to a single 2-stage pipelined 64-bit fixed-point multiplier. Over 70% of the BID adderpsilas area is due the 64-bit fixed-point multiplier, which can be shared with a binary floating-point multiplier and hardware for other DFP operations. To our knowledge, this is the first hardware design for adding and subtracting IEEE P754 BID-encoded DFP numbers.","PeriodicalId":6306,"journal":{"name":"2007 25th International Conference on Computer Design","volume":"18 1","pages":"288-295"},"PeriodicalIF":0.0,"publicationDate":"2007-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"78400419","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 22

Circuit-level mismatch modelling and yield optimization for CMOS analog circuits CMOS模拟电路的电路级失配建模和良率优化

2007 25th International Conference on Computer Design

Pub Date : 2007-10-01 DOI: 10.1109/ICCD.2007.4601948

Mingjing Chen, A. Orailoglu

A methodology for constructing circuit-level mismatch models and performing yield optimization is presented for CMOS analog circuits. The methodology combines statistical techniques with direct investigation of circuit behavior, and achieves model simplification and computational efficiency while ensuring sufficient accuracy. The circuit-level mismatch model can be used in performance characterization and yield estimation, both important in providing information for circuit reliability analysis. The proposed yield optimization technique consists of constructing and refining a yield model over the designable parameters, and ensures fast convergence to the global optimal design. The experimental results on two representative circuits confirm the efficiency and effectiveness of the proposed method.

提出了一种构建电路级失配模型并进行良率优化的方法。该方法将统计技术与电路行为的直接调查相结合，在保证足够精度的同时实现了模型简化和计算效率。电路级失配模型可用于性能表征和良率估计，两者都为电路可靠性分析提供重要信息。提出的成品率优化技术包括在可设计参数上构造和细化成品率模型，并保证快速收敛到全局最优设计。在两个代表性电路上的实验结果验证了该方法的有效性和有效性。

引用次数: 1

Analytical thermal placement for VLSI lifetime improvement and minimum performance variation 分析热安置VLSI寿命改善和最小的性能变化

2007 25th International Conference on Computer Design

Pub Date : 2007-10-01 DOI: 10.1109/ICCD.2007.4601882

A. Kahng, S. Kang, Wei Li, Bao Liu

DSM and nanometer VLSI designs are subject to an increasingly significant thermal effect on VLSI circuit lifetime and performance variation, which can be effectively subdued by VLSI placement. We propose analytical placement for accurate and efficient VLSI thermal optimization, and propose minimized maximum on-chip temperature as the thermal optimization objective for improved VLSI lifetime and minimized performance variation. We develop an effective analytical thermal placement technique, as well as an improved analytical placement technique with a new cell spreading function. Our experimental results show that our proposed analytical thermal placement achieves 17.85% and 30.77% maximum on-chip temperature variation reduction as well as 4.61% and 0.45% wirelength reduction respectively for the two industry design test cases compared with thermal-oblivious analytical placement, e.g., APlace.

DSM和纳米级VLSI设计受到越来越显著的热效应对VLSI电路寿命和性能变化的影响，这可以通过VLSI的放置有效地抑制。我们提出了精确和高效的VLSI热优化的分析布局，并提出最小化最大片上温度作为提高VLSI寿命和最小化性能变化的热优化目标。我们开发了一种有效的分析热放置技术，以及一种改进的分析放置技术，该技术具有新的细胞扩散功能。我们的实验结果表明，与APlace等无热分析封装相比，我们提出的分析式热封装在两种工业设计测试用例中，片上最大温度变化分别减少了17.85%和30.77%，无线长度分别减少了4.61%和0.45%。

引用次数: 5

Digital calibration of RF transceivers for I-Q imbalances and nonlinearity 射频收发器I-Q不平衡和非线性的数字校准

2007 25th International Conference on Computer Design

Pub Date : 2007-10-01 DOI: 10.1109/ICCD.2007.4601946

E. Acar, S. Ozev

As radio frequency (RF) devices become more complex, the specifications become more stringent. In order to guarantee successful operation and compliance to certain specifications, digital correction techniques that compensate the device impairments are needed. In this paper, we present an analytical digital in-phase (I) and quadrature (Q) imbalance and non-linear compression correction methodology that improves the system bit error rate (BER). The gain and phase imbalances are corrected by using the gain and phase imbalance test data obtained during the product testing. The non-linear compression term is removed using Newton's method. The proposed test methodology is applicable for both burst based systems and continuous systems. Simulation results indicate that the proposed method improves the BER even under harsh noise contamination. The computational overhead of the compensation technique is minimal.

随着射频(RF)设备变得越来越复杂，其规格也变得越来越严格。为了保证成功的操作和符合某些规范，需要补偿设备损伤的数字校正技术。在本文中，我们提出了一种分析数字同相(I)和正交(Q)不平衡和非线性压缩校正方法，以提高系统误码率(BER)。利用产品测试过程中获得的增益和相位不平衡测试数据对增益和相位不平衡进行校正。采用牛顿法去除非线性压缩项。所提出的测试方法既适用于突发系统，也适用于连续系统。仿真结果表明，该方法在较强的噪声污染下仍能提高误码率。补偿技术的计算开销是最小的。

引用次数: 13

Statistical timing analysis using Kernel smoothing 使用核平滑的统计时序分析

2007 25th International Conference on Computer Design

Pub Date : 2007-10-01 DOI: 10.1109/ICCD.2007.4601886

J. Wong, A. Davoodi, Vishal Khandelwal, Ankur Srivastava, M. Potkonjak

We have developed a new statistical timing analysis approach that does not impose any assumptions on the nature of manufacturing variability and takes into account an arbitrary model of spatial correlation as well as all types of functional correlations (e.g. reconvergence-based correlations). The starting point for statistical timing analysis is small scale Monte Carlo (MC) simulation. In order to speed-up the MC simulation process we use stratified balanced sampling and postprocessing of the simulation data using non-parametric kernel estimation. The MC simulation and the statistical analysis procedure are interleaved with the calculation of the critical paths. In order to speed up simulation, we identify and simulate only gates relevant for calculation of the clock cycle time. The application of statistical techniques enable not only accurate statistical timing analysis, but also stability and scalability analysis. The approach is evaluated using MCNC benchmarks and yields more than six orders of magnitude speed improvement compared with the standard MC simulation.

我们开发了一种新的统计时间分析方法，该方法不对制造变异性的性质施加任何假设，并考虑了任意的空间相关性模型以及所有类型的功能相关性(例如基于再收敛的相关性)。统计时序分析的起点是小尺度蒙特卡罗(MC)模拟。为了加快MC模拟过程，我们采用分层均衡采样和非参数核估计对模拟数据进行后处理。MC模拟和统计分析过程与关键路径的计算交织在一起。为了加快仿真速度，我们只识别和模拟与时钟周期时间计算相关的门。统计技术的应用不仅可以实现准确的统计时序分析，还可以进行稳定性和可扩展性分析。该方法使用MCNC基准测试进行了评估，与标准MC模拟相比，速度提高了6个数量级以上。

引用次数: 2

Modeling soft error effects considering process variations 考虑工艺变化的软误差影响建模

2007 25th International Conference on Computer Design

Pub Date : 2007-10-01 DOI: 10.1109/ICCD.2007.4601927

Chong Zhao, S. Dey

This paper addresses the aggregated effects of two types of variations that contribute to the reliability degradation. The first one is the increasing level of process variation; the second one is one particular type of environmental variation - the radiation-induced soft error. Their simultaneous presence can cause large negative performance impact. We present a statistical approach to model the generation and propagation of a transient soft error inside combinational circuits considering the existence of inter-die channel length variation in CMOS digital circuits. Experiment results have demonstrated that channel length variation can significantly aggravate the soft error effect, which can be accurately evaluated using the proposed methodology.

本文讨论了导致可靠性退化的两类变量的综合效应。第一个是工艺变化水平的增加;第二种是一种特殊类型的环境变化——辐射引起的软误差。它们同时存在可能会对性能造成很大的负面影响。考虑到CMOS数字电路中存在的芯片间通道长度变化，我们提出了一种统计方法来模拟组合电路中瞬态软误差的产生和传播。实验结果表明，信道长度的变化会显著加剧软误差效应，使用该方法可以准确地评估软误差效应。

引用次数: 4

首页上一页

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

2007 25th International Conference on Computer Design

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀