Proceedings Eleventh International Conference on VLSI Design最新文献

英文中文

Double pass-transistor logic for high performance wave pipeline circuits 双通晶体管逻辑用于高性能波管道电路

Proceedings Eleventh International Conference on VLSI Design

Pub Date : 1998-01-04 DOI: 10.1109/ICVD.1998.646655

R. Parthasarathy, R. Sridhar

Wave pipelining is a digital design technique that can be applied to combinational logic circuits to increase the throughput of the system without increasing the demand for storage space and power. The internal capacitances of the gates are used for storage. The gate library for wave pipelining should have input independent, functionality independent and load capacitance independent delays. Conventional static CMOS has input dependent delay and is not suitable for wave pipelining. The wave pipelining design technique requires path delay equalization along all paths from the input to output. Delay balancing is achieved in a design by means of a process called "tuning". Rough tuning, is performed to balance all the paths with the same number of gates and fine tuning is done to adjust the sizes of transistors in the driver gate for different loads. The design styles that have been proposed for wave pipelining have unbalanced input loading and this results in complex fine tuning process. In this paper double pass transistor logic style (DPL) gates are modified to form a library of basic gates having perfect input symmetry. The balanced input capacitance of the DPL gates makes the fine tuning process less computation intensive. A fine tuning method is presented in this paper for wave pipeline designs with DPL logic. An 8 bit adder was designed and the results are presented to show the performance efficiency of double pass transistor logic for wave pipelining.

波浪流水线是一种数字设计技术，可以应用于组合逻辑电路，在不增加存储空间和功率需求的情况下提高系统的吞吐量。栅极的内部电容用于存储。用于波形流水线的栅极库应具有与输入无关、与功能无关和与负载电容无关的延迟。传统的静态CMOS具有与输入相关的延迟，不适合波形流水线。波管道设计技术要求从输入到输出的所有路径上的路径延迟均衡。延迟平衡在设计中是通过一个叫做“调优”的过程来实现的。采用粗调谐来平衡具有相同栅极数的所有路径，并采用微调来调整驱动栅极中晶体管的尺寸以适应不同负载。波浪管道的设计方式存在输入负载不平衡的问题，这导致了复杂的微调过程。本文对双通型晶体管逻辑型(DPL)栅极进行了改进，形成了一个具有完全输入对称性的基本栅极库。DPL门的平衡输入电容使得微调过程的计算量更小。本文提出了一种基于DPL逻辑的波浪管道设计的微调方法。设计了一个8位加法器，并给出了双通晶体管逻辑用于波形流水线的性能效率。

{"title":"Double pass-transistor logic for high performance wave pipeline circuits","authors":"R. Parthasarathy, R. Sridhar","doi":"10.1109/ICVD.1998.646655","DOIUrl":"https://doi.org/10.1109/ICVD.1998.646655","url":null,"abstract":"Wave pipelining is a digital design technique that can be applied to combinational logic circuits to increase the throughput of the system without increasing the demand for storage space and power. The internal capacitances of the gates are used for storage. The gate library for wave pipelining should have input independent, functionality independent and load capacitance independent delays. Conventional static CMOS has input dependent delay and is not suitable for wave pipelining. The wave pipelining design technique requires path delay equalization along all paths from the input to output. Delay balancing is achieved in a design by means of a process called \"tuning\". Rough tuning, is performed to balance all the paths with the same number of gates and fine tuning is done to adjust the sizes of transistors in the driver gate for different loads. The design styles that have been proposed for wave pipelining have unbalanced input loading and this results in complex fine tuning process. In this paper double pass transistor logic style (DPL) gates are modified to form a library of basic gates having perfect input symmetry. The balanced input capacitance of the DPL gates makes the fine tuning process less computation intensive. A fine tuning method is presented in this paper for wave pipeline designs with DPL logic. An 8 bit adder was designed and the results are presented to show the performance efficiency of double pass transistor logic for wave pipelining.","PeriodicalId":139023,"journal":{"name":"Proceedings Eleventh International Conference on VLSI Design","volume":"93 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1998-01-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134128607","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 17

A framework for a parallel architecture dedicated to soft computing 一种专门用于软计算的并行架构框架

Proceedings Eleventh International Conference on VLSI Design

Pub Date : 1998-01-04 DOI: 10.1109/ICVD.1998.646625

G. Ascia, V. Catania

This paper presents the architecture of a parallel processor dedicated to real-time fuzzy application. The main features of the architecture are: a pre-computation phase of the positive degree of truth of the antecedent with fuzzy inputs; a detection phase of the active rules. The processing speed is up to 2.8 MFLIPS (256 Rules, 8 Antecedents, 1 Consequent). The silicon area estimated is 25 mm/sup 2/.

本文介绍了一种用于实时模糊应用的并行处理器体系结构。该体系结构的主要特点是:具有模糊输入的前词的正真度的预计算阶段;活动规则的检测阶段。处理速度高达2.8 MFLIPS(256条规则，8个前因式，1个后因式)。硅面积估计为25mm /sup /。

引用次数: 4

Finite state machines: a deeper look into synthesis optimization for VHDL 有限状态机:对VHDL合成优化的深入研究

Proceedings Eleventh International Conference on VLSI Design

Pub Date : 1998-01-04 DOI: 10.1109/ICVD.1998.646658

Vijay A. Nebhrajani, Nayan Suthar

This paper provides a deeper insight into the synthesis mechanism of VHDL tools. It examines three methods of writing VHDL code, and each of the three models finite state machines in a different way. There can be significant reductions in the VLSI area and improvements in performance by adopting a certain modeling style, but this is at the cost of writing low level VHDL code, thereby undermining the purpose of VHDL as the design, entry medium. However, there is a simpler approach, which is demonstrated by a software tool called vtvt which allows writing VHDL code at high level and optimizes for area and performance without the burden of writing and maintaining low level code.

本文对VHDL工具的合成机制提供了更深入的了解。它研究了编写VHDL代码的三种方法，并且这三种方法中的每一种都以不同的方式为有限状态机建模。采用某种建模风格可以显著减少VLSI的面积并提高性能，但这是以编写低级VHDL代码为代价的，从而破坏了VHDL作为设计入口介质的目的。然而，有一种更简单的方法，它由一个名为vtvt的软件工具演示，它允许在高层次上编写VHDL代码，并优化面积和性能，而无需编写和维护低级代码。

引用次数: 3

Hybrid testing schemes based on mutual and signature testing 基于互测试和签名测试的混合测试方案

Proceedings Eleventh International Conference on VLSI Design

Pub Date : 1998-01-04 DOI: 10.1109/ICVD.1998.646621

M. F. Abdulla, C. Ravikumar, Anshul Kumar

Signature based techniques have been well known for the built-in self-test of integrated systems. We propose a novel test architecture which uses a judicious combination of mutual testing and signature testing to achieve low test area overhead, low aliasing probability and low test application time. The proposed architecture is powerful for testing highly concurrent systems in applications such as iterative logic arrays, real-time systems, systolic arrays, and low-latency pipelines which tend to have a large number of functional modules of a similar nature. We provide graph-theoretic optimization algorithms to optimize the test area and test application time of the resulting test architecture.

基于签名的技术以集成系统的内置自检而闻名。我们提出了一种新的测试架构，将互测试和签名测试巧妙地结合起来，以达到低测试区域开销、低混叠概率和低测试应用时间的目的。所提出的体系结构对于测试应用程序中的高度并发系统非常强大，例如迭代逻辑阵列、实时系统、收缩阵列和低延迟管道，这些应用程序往往具有大量相似性质的功能模块。我们提供了图论优化算法来优化最终测试架构的测试区域和测试应用时间。

引用次数: 2

Automated AC (timing) characterization for digital circuit testing 用于数字电路测试的自动交流(定时)特性

Proceedings Eleventh International Conference on VLSI Design

Pub Date : 1998-01-04 DOI: 10.1109/ICVD.1998.646636

S. Balajee, A. Majhi

One of the major requirements for testing VLSI devices is the validation of its timing specifications. Timing specifications would typically include frequency, propagation delays, minimum pulse width, phase offsets, setup time and hold time measurements. Although parametric specifications may exist for a nominal speed (frequency) of operation of the digital device, it may be necessary to characterize the device under test (DUT) to determine the highest operating frequency of the DUT and the required environmental parameters to run at the highest frequency. Characterization involves measurement of setup time, hold time and pulse width of the signals. In this paper, we have presented an automated AC (timing) characterization flow for digital circuit testing. We have recommended a STIL (Standard Tester Interface Language) like syntax for the timing tests. Various timing data (setup and hold time, propagation delay etc.) are measured in the first pass of the characterization process and are automatically back annotated to the timing test flow to reduce the total test cycle time. The approach will also help in finding the maximum operating frequency of the DUT and speed binning (i.e., sorting the devices based on their operating frequency).

测试VLSI器件的主要要求之一是验证其时序规范。定时规格通常包括频率、传播延迟、最小脉冲宽度、相位偏移、设置时间和保持时间测量。虽然数字设备的标称运行速度(频率)可能存在参数规范，但可能有必要对被测设备(DUT)进行表征，以确定DUT的最高工作频率和在最高频率下运行所需的环境参数。表征包括信号的设置时间、保持时间和脉冲宽度的测量。在本文中，我们提出了一种用于数字电路测试的自动交流(时序)表征流程。我们建议使用STIL(标准测试人员接口语言)之类的语法来进行计时测试。各种定时数据(设置和保持时间，传播延迟等)在表征过程的第一次通过中测量，并自动返回注释到定时测试流中，以减少总测试周期时间。该方法还将有助于找到DUT的最大工作频率和速度分组(即根据其工作频率对设备进行排序)。

{"title":"Automated AC (timing) characterization for digital circuit testing","authors":"S. Balajee, A. Majhi","doi":"10.1109/ICVD.1998.646636","DOIUrl":"https://doi.org/10.1109/ICVD.1998.646636","url":null,"abstract":"One of the major requirements for testing VLSI devices is the validation of its timing specifications. Timing specifications would typically include frequency, propagation delays, minimum pulse width, phase offsets, setup time and hold time measurements. Although parametric specifications may exist for a nominal speed (frequency) of operation of the digital device, it may be necessary to characterize the device under test (DUT) to determine the highest operating frequency of the DUT and the required environmental parameters to run at the highest frequency. Characterization involves measurement of setup time, hold time and pulse width of the signals. In this paper, we have presented an automated AC (timing) characterization flow for digital circuit testing. We have recommended a STIL (Standard Tester Interface Language) like syntax for the timing tests. Various timing data (setup and hold time, propagation delay etc.) are measured in the first pass of the characterization process and are automatically back annotated to the timing test flow to reduce the total test cycle time. The approach will also help in finding the maximum operating frequency of the DUT and speed binning (i.e., sorting the devices based on their operating frequency).","PeriodicalId":139023,"journal":{"name":"Proceedings Eleventh International Conference on VLSI Design","volume":"26 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1998-01-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114601580","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 9

Computation of lower and upper bounds for switching activity: a unified approach 开关活动下界和上界的计算:一种统一的方法

Proceedings Eleventh International Conference on VLSI Design

Pub Date : 1998-01-04 DOI: 10.1109/ICVD.1998.646608

V. Krishna, R. Chandramouli, N. Ranganathan

Accurate switching activity estimation is crucial for power budgeting. It is impractical to obtain an accurate estimate by simulating the circuit for all possible inputs. An alternate approach would be to compute tight bounds for the switching activity. In this paper, we propose a non-simulative method to compute bounds for switching activity at the logic level. First, we show that the switching activity can be modeled as the Bayesian distance for an abstract two class problem. The computation of the upper and lower bounds for the switching activity is unified in to a single function, /spl psi/(/spl alpha/,p,/spl rho/), where /spl alpha/ is a parameter, /spl rho/ is the temporal correlation factor and p is the signal probability. The constraints on /spl alpha/ for /spl psi/(/spl alpha/,p,/spl rho/) to be tight upper and lower bounds are derived. The proposed approach computes bounds for individual gate switching. Experimental results are obtained by taking spatial and temporal correlations into account. The computations are simple and fast.

准确的开关活动估计是电力预算的关键。通过模拟电路中所有可能的输入来获得准确的估计是不切实际的。另一种方法是为切换活动计算严格的边界。在本文中，我们提出了一种非模拟的方法来计算逻辑层面上的切换活动边界。首先，我们证明了切换活动可以被建模为一个抽象的两类问题的贝叶斯距离。开关活动的上界和下界的计算统一为一个函数/spl psi/(/spl alpha/，p，/spl rho/)，其中/spl alpha/为参数，/spl rho/为时间相关因子，p为信号概率。推导了/spl psi/(/spl alpha/，p，/spl rho/)为紧上界和下界的/spl alpha/约束条件。该方法计算单个门开关的边界。实验结果考虑了空间相关性和时间相关性。计算简单、快速。

引用次数: 2

Diagnostic simulation of sequential circuits using fault sampling 时序电路故障采样诊断仿真

Proceedings Eleventh International Conference on VLSI Design

Pub Date : 1998-01-04 DOI: 10.1109/ICVD.1998.646652

S. Venkataraman, W. Fuchs, J. Patel

This paper describes a technique to accelerate diagnostic fault simulation of sequential circuits using fault sampling. Diagnostic fault simulation involves computing the indistinguishability relationship between all pairs of modeled faults. The input space is the set of all pairs of modeled faults, thus making the simulation computationally intensive. The diagnostic simulation process is accelerated by considering a sub-space of the input space that is obtained using fault sampling. Results on performance speedup and diagnostic resolution loss are provided for the ISCAS 89 benchmark circuits.

本文介绍了一种利用故障采样加速时序电路诊断故障仿真的技术。诊断故障仿真涉及计算所有建模故障对之间的不可分辨关系。输入空间是所有建模故障对的集合，因此仿真计算量很大。通过考虑故障采样获得的输入空间的子空间，加快了诊断仿真过程。给出了ISCAS 89基准电路的性能加速和诊断分辨率损失的结果。

引用次数: 2

Coding for low-power address and data busses: a source-coding framework and applications 低功耗地址和数据总线的编码:源代码编码框架和应用

Proceedings Eleventh International Conference on VLSI Design

Pub Date : 1998-01-04 DOI: 10.1109/ICVD.1998.646572

S. Ramprasad, Naresh R Shanbhag, I. Hajj

Presented in this paper is a source-coding framework for the design of coding schemes to reduce transition activity. These schemes are suited for high capacitance busses where the extra power dissipation due to the encoder and the decoder circuitry is offset by the power savings at the bus. A framework to characterize low-power encoding schemes is developed based upon the source-channel coding view. In this framework, a data source (characterized in a probabilistic manner) is passed through a decorrelating function f/sub 1/ first. Next, a variant of entropy coding function f/sub 2/ is employed, which reduces the transition activity. The framework is then employed to derive novel encoding schemes whereby practical forms for f/sub 1/ and f/sub 2/ are proposed. Simulation results with an encoding scheme for data busses indicate an average reduction in transition activity of 36%. This translates into a reduction in total power dissipation for bus capacitances greater than 14 pF/bit in 1.2 /spl mu/ CMOS technology and eight times more pourer savings compared to existing schemes with a typical value for bus capacitance of 50p F/bit. Simulation results with an encoding scheme for instruction address busses indicate an average reduction in transition activity by a factor of 3 times and 1.5 times over the Gray and TO coding schemes respectively.

本文提出了一种用于设计编码方案以减少转换活动的源编码框架。这些方案适用于高电容总线，其中由于编码器和解码器电路产生的额外功耗被总线上的功率节省所抵消。基于源信道编码的观点，提出了一种表征低功耗编码方案的框架。在这个框架中，数据源(以概率方式表征)首先通过一个去相关函数f/sub 1/。接下来，采用熵编码函数f/sub 2/的变体，减少了过渡活动。然后利用该框架推导出新的编码方案，从而提出f/sub 1/和f/sub 2/的实际形式。使用数据总线编码方案的仿真结果表明，转换活动平均减少36%。这意味着在1.2 /spl mu/ CMOS技术中，当母线电容大于14pf /bit时，总功耗降低，与现有的母线电容典型值为50pf /bit的方案相比，功耗节省了8倍。指令地址总线编码方案的仿真结果表明，与Gray和TO编码方案相比，转换活动平均减少了3倍和1.5倍。

{"title":"Coding for low-power address and data busses: a source-coding framework and applications","authors":"S. Ramprasad, Naresh R Shanbhag, I. Hajj","doi":"10.1109/ICVD.1998.646572","DOIUrl":"https://doi.org/10.1109/ICVD.1998.646572","url":null,"abstract":"Presented in this paper is a source-coding framework for the design of coding schemes to reduce transition activity. These schemes are suited for high capacitance busses where the extra power dissipation due to the encoder and the decoder circuitry is offset by the power savings at the bus. A framework to characterize low-power encoding schemes is developed based upon the source-channel coding view. In this framework, a data source (characterized in a probabilistic manner) is passed through a decorrelating function f/sub 1/ first. Next, a variant of entropy coding function f/sub 2/ is employed, which reduces the transition activity. The framework is then employed to derive novel encoding schemes whereby practical forms for f/sub 1/ and f/sub 2/ are proposed. Simulation results with an encoding scheme for data busses indicate an average reduction in transition activity of 36%. This translates into a reduction in total power dissipation for bus capacitances greater than 14 pF/bit in 1.2 /spl mu/ CMOS technology and eight times more pourer savings compared to existing schemes with a typical value for bus capacitance of 50p F/bit. Simulation results with an encoding scheme for instruction address busses indicate an average reduction in transition activity by a factor of 3 times and 1.5 times over the Gray and TO coding schemes respectively.","PeriodicalId":139023,"journal":{"name":"Proceedings Eleventh International Conference on VLSI Design","volume":"7 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1998-01-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125643294","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 11

Peripheral partitioning and tree decomposition for partial scan 局部扫描的外围分区和树分解

Proceedings Eleventh International Conference on VLSI Design

Pub Date : 1998-01-04 DOI: 10.1109/ICVD.1998.646599

A. Balakrishnan, S. Chakradhar

We propose a new partial scan technique that incurs significantly less area overhead than the pipeline technique (all feedback cycles including self-loops are broken) and yet achieves very high test coverage in short CPU times. Our proposal selects scan flip-flops so that the circuit satisfies two key properties in the test mode. First, the circuit is partitioned into peripherally interacting finite state machines (peripheral partitions). Peripheral partitions do not have combinational paths between flip-flops belonging to different partitions. Second, the flip-flop dependency graph (S-graph) of each peripheral partition has a tree structure. Our technique does not require self-loops to be broken. We believe that peripheral partitions with tree structure S-graphs inherently require low sequential test generation resources. We develop an efficient algorithm for peripheral partitioning and tree decomposition of the S-graph. The scan flip-flop selection algorithm iteratively partitions the S-graph into disjoint sub-graphs with the tree structure. We report results on all the large circuits in the ISCAS 89 benchmark set. These results show that our technique produces scan circuits for which very high (near 100%) fault efficiency is achievable in extremely short CPU times. The high fault efficiencies achieved by our technique are comparable to that of pipeline circuits. However, the area overhead for our technique is significantly less than the pipeline case.

我们提出了一种新的部分扫描技术，它比管道技术产生的面积开销要少得多(所有反馈周期包括自循环都被破坏)，但在短的CPU时间内实现了非常高的测试覆盖率。我们的建议选择扫描触发器，使电路在测试模式下满足两个关键特性。首先，将电路划分为外围交互有限状态机(外围分区)。外围分区在属于不同分区的触发器之间没有组合路径。其次，每个外围分区的触发器依赖图(s图)具有树状结构。我们的技术不需要打破自我循环。我们认为具有树结构s图的外围分区固有地需要较少的顺序测试生成资源。我们提出了一种高效的s图外围划分和树分解算法。扫描触发器选择算法迭代地将s图划分为树形结构的不相交子图。我们报告了ISCAS 89基准集中所有大型电路的结果。这些结果表明，我们的技术产生的扫描电路在极短的CPU时间内可以实现非常高(接近100%)的故障效率。该技术实现的高故障效率可与管道电路相媲美。然而，我们技术的面积开销明显小于管道情况。

{"title":"Peripheral partitioning and tree decomposition for partial scan","authors":"A. Balakrishnan, S. Chakradhar","doi":"10.1109/ICVD.1998.646599","DOIUrl":"https://doi.org/10.1109/ICVD.1998.646599","url":null,"abstract":"We propose a new partial scan technique that incurs significantly less area overhead than the pipeline technique (all feedback cycles including self-loops are broken) and yet achieves very high test coverage in short CPU times. Our proposal selects scan flip-flops so that the circuit satisfies two key properties in the test mode. First, the circuit is partitioned into peripherally interacting finite state machines (peripheral partitions). Peripheral partitions do not have combinational paths between flip-flops belonging to different partitions. Second, the flip-flop dependency graph (S-graph) of each peripheral partition has a tree structure. Our technique does not require self-loops to be broken. We believe that peripheral partitions with tree structure S-graphs inherently require low sequential test generation resources. We develop an efficient algorithm for peripheral partitioning and tree decomposition of the S-graph. The scan flip-flop selection algorithm iteratively partitions the S-graph into disjoint sub-graphs with the tree structure. We report results on all the large circuits in the ISCAS 89 benchmark set. These results show that our technique produces scan circuits for which very high (near 100%) fault efficiency is achievable in extremely short CPU times. The high fault efficiencies achieved by our technique are comparable to that of pipeline circuits. However, the area overhead for our technique is significantly less than the pipeline case.","PeriodicalId":139023,"journal":{"name":"Proceedings Eleventh International Conference on VLSI Design","volume":"77 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1998-01-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126260069","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 3

A retiming based relaxation heuristic for resource-constrained loop pipelining 一种基于重定时的资源约束循环流水线松弛启发式算法

Proceedings Eleventh International Conference on VLSI Design

Pub Date : 1998-01-04 DOI: 10.1109/ICVD.1998.646646

V. Srinivasan, R. Vemuri

This paper presents a fast and efficient heuristic for pipelining a loop under resource-constraints. The loop is represented as a dependence graph, G whose nodes are operations that are bound to available resources and edges denote the data dependencies between the operations. The data dependencies restrict the degree of parallelism that can be achieved while scheduling the graph. We propose a fast retiming based graph transformation technique which relates the data dependencies in the graph while maintaining functional equivalence. Relaxing data dependencies provides more flexibility for the scheduler to schedule operations, thereby leading to faster throughput. Our objective is to obtain a retimed graph which when scheduled achieves an optimal/near-optimal pipelined steady state throughput. A detailed algorithm is presented to solve the problem. We provide results that illustrate the effectiveness of our algorithm.

本文提出了一种快速有效的资源约束下循环管道化的启发式算法。该循环表示为依赖图G，其节点是绑定到可用资源的操作，边表示操作之间的数据依赖关系。数据依赖性限制了在调度图时可以实现的并行度。我们提出了一种快速的基于重定时的图转换技术，该技术在保持功能等价的同时将图中的数据依赖关系联系起来。放松数据依赖性为调度器调度操作提供了更大的灵活性，从而带来更快的吞吐量。我们的目标是获得一个重新定时的图，当调度时达到最优/接近最优的管道稳态吞吐量。提出了一种详细的算法来解决这一问题。我们提供的结果说明了我们的算法的有效性。

引用次数: 3

首页上一页

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

Proceedings Eleventh International Conference on VLSI Design

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀