首页 > 最新文献

13th IEEE International Symposium on Asynchronous Circuits and Systems (ASYNC'07)最新文献

英文 中文
Design of a High-Speed Asynchronous Turbo Decoder 高速异步Turbo解码器的设计
P. Golani, G. Dimou, M. Prakash, P. Beerel
This paper explores the advantages of high performance asynchronous circuits in a semi-custom standard cell environment for high-throughput turbo coding. Turbo codes are high-performance error correction codes used in applications where maximal information transfer is needed over a limited-bandwidth communication link in the presence of data corrupting noise. Specifically we designed an asynchronous high-speed turbo decoder that can be potentially used for new wireless communications protocols with close to OC-12 throughputs. The design has been implemented using a new static single-track-full-buffer (SSTFB) standard cell library in IBM 0.18 mum technology that provides low latency, fast cycle-time, and more robustness to noise than previously studied single-track full-buffer technology (STFB). A high-speed synchronous counterpart using the same high-speed architecture is designed in the same technology for comparison. The results demonstrate that for a variety of network constraints, the asynchronous design provides advantages in throughput per area. Moreover, the asynchronous design can support very low-latency network constraints not achievable with the synchronous alternative.
本文探讨了高性能异步电路在半自定义标准单元环境中用于高吞吐量turbo编码的优势。Turbo码是一种高性能纠错码,用于在存在数据损坏噪声的有限带宽通信链路上需要最大信息传输的应用。具体来说,我们设计了一个异步高速涡轮解码器,可以潜在地用于接近OC-12吞吐量的新无线通信协议。该设计使用IBM 0.18 mum技术中的新的静态单轨全缓冲(SSTFB)标准单元库来实现,该技术提供低延迟,快速周期时间,并且比以前研究的单轨全缓冲技术(STFB)更具抗噪声性。采用相同的技术设计了使用相同高速体系结构的高速同步对应物,以便进行比较。结果表明,对于各种网络约束,异步设计在每个区域的吞吐量方面具有优势。此外,异步设计可以支持同步替代方案无法实现的非常低延迟的网络约束。
{"title":"Design of a High-Speed Asynchronous Turbo Decoder","authors":"P. Golani, G. Dimou, M. Prakash, P. Beerel","doi":"10.1109/ASYNC.2007.16","DOIUrl":"https://doi.org/10.1109/ASYNC.2007.16","url":null,"abstract":"This paper explores the advantages of high performance asynchronous circuits in a semi-custom standard cell environment for high-throughput turbo coding. Turbo codes are high-performance error correction codes used in applications where maximal information transfer is needed over a limited-bandwidth communication link in the presence of data corrupting noise. Specifically we designed an asynchronous high-speed turbo decoder that can be potentially used for new wireless communications protocols with close to OC-12 throughputs. The design has been implemented using a new static single-track-full-buffer (SSTFB) standard cell library in IBM 0.18 mum technology that provides low latency, fast cycle-time, and more robustness to noise than previously studied single-track full-buffer technology (STFB). A high-speed synchronous counterpart using the same high-speed architecture is designed in the same technology for comparison. The results demonstrate that for a variety of network constraints, the asynchronous design provides advantages in throughput per area. Moreover, the asynchronous design can support very low-latency network constraints not achievable with the synchronous alternative.","PeriodicalId":136595,"journal":{"name":"13th IEEE International Symposium on Asynchronous Circuits and Systems (ASYNC'07)","volume":"93 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-03-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121436183","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 12
Demystifying Data-Driven and Pausible Clocking Schemes 揭开数据驱动和可调时钟方案的神秘面纱
R. Mullins, S. Moore
VLSI systems are often constructed from a multitude of independently clocked synchronous IP blocks. Unfortunately, while a synchronous design style may produce efficient block level implementations it does little to support their composition. The addition of asynchronous interfaces to each synchronous block is one way to simplify and strengthen their integration. Asynchronous interfaces allow blocks to be composed without the need to consider synchronisation failure rates, permit data-driven operation and provide greater freedom when designing on-chip buses and networks. This paper surveys the significant body of published work in this area. We highlight similarities between schemes that are often concealed by differences in specification or circuit style. We also present new local clock implementations and provide solutions to mitigate the effect of clock-tree insertion delays. The ultimate goal of this work is to permit multi-clock synchronous systems to be composed simply, robustly and efficiently.
VLSI系统通常由许多独立时钟同步IP块构成。不幸的是,虽然同步设计风格可能会产生高效的块级实现,但它几乎不支持它们的组合。向每个同步块添加异步接口是简化和加强其集成的一种方法。异步接口允许在不需要考虑同步故障率的情况下组成块,允许数据驱动操作,并在设计片上总线和网络时提供更大的自由度。本文综述了这一领域已发表的重要著作。我们强调方案之间的相似之处,这些相似之处往往被规格或电路风格的差异所掩盖。我们还提出了新的本地时钟实现,并提供了减轻时钟树插入延迟影响的解决方案。这项工作的最终目标是允许多时钟同步系统组成简单,健壮和有效。
{"title":"Demystifying Data-Driven and Pausible Clocking Schemes","authors":"R. Mullins, S. Moore","doi":"10.1109/ASYNC.2007.15","DOIUrl":"https://doi.org/10.1109/ASYNC.2007.15","url":null,"abstract":"VLSI systems are often constructed from a multitude of independently clocked synchronous IP blocks. Unfortunately, while a synchronous design style may produce efficient block level implementations it does little to support their composition. The addition of asynchronous interfaces to each synchronous block is one way to simplify and strengthen their integration. Asynchronous interfaces allow blocks to be composed without the need to consider synchronisation failure rates, permit data-driven operation and provide greater freedom when designing on-chip buses and networks. This paper surveys the significant body of published work in this area. We highlight similarities between schemes that are often concealed by differences in specification or circuit style. We also present new local clock implementations and provide solutions to mitigate the effect of clock-tree insertion delays. The ultimate goal of this work is to permit multi-clock synchronous systems to be composed simply, robustly and efficiently.","PeriodicalId":136595,"journal":{"name":"13th IEEE International Symposium on Asynchronous Circuits and Systems (ASYNC'07)","volume":"63 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-03-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123128744","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 72
A Jitter Attenuating Timing Chain 抖动衰减时序链
Suwen Yang, M. Greenstreet, Jihong Ren
A long chain of inverters and wire segments will amplify clock jitter and drop timing pulses due to intersymbol interference. We present a jitter attenuating buffer based on surfing techniques. Our buffer circuit consists of a few inverters with variable output strength that implement a simple, low-gain DLL. Chains of these surfing buffers attenuate jitter making them well suited for source-synchronous interfaces. Furthermore, our chains can be used to reliably transmit handshaking signals and support sliding-window protocols to improve the throughput of asynchronous communication.
由于码间干扰,一长串的逆变器和线段会放大时钟抖动并导致时序脉冲下降。提出了一种基于冲浪技术的抖动衰减缓冲器。我们的缓冲电路由几个可变输出强度的逆变器组成,这些逆变器实现了一个简单的低增益DLL。这些冲浪缓冲器的链衰减抖动,使它们非常适合于源同步接口。此外,我们的链可用于可靠地传输握手信号,并支持滑动窗口协议,以提高异步通信的吞吐量。
{"title":"A Jitter Attenuating Timing Chain","authors":"Suwen Yang, M. Greenstreet, Jihong Ren","doi":"10.1109/ASYNC.2007.8","DOIUrl":"https://doi.org/10.1109/ASYNC.2007.8","url":null,"abstract":"A long chain of inverters and wire segments will amplify clock jitter and drop timing pulses due to intersymbol interference. We present a jitter attenuating buffer based on surfing techniques. Our buffer circuit consists of a few inverters with variable output strength that implement a simple, low-gain DLL. Chains of these surfing buffers attenuate jitter making them well suited for source-synchronous interfaces. Furthermore, our chains can be used to reliably transmit handshaking signals and support sliding-window protocols to improve the throughput of asynchronous communication.","PeriodicalId":136595,"journal":{"name":"13th IEEE International Symposium on Asynchronous Circuits and Systems (ASYNC'07)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-03-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129426072","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
High Rate Wave-pipelined Asynchronous On-chip Bit-serial Data Link 高速率波管道异步片上位串行数据链
R. Dobkin, Y. Perelman, T. Liran, R. Ginosar, A. Kolodny
A high data rate asynchronous bit-serial link for long-range on-chip communication is presented. The data bit cycle time is equal to a single gate delay, enabling 67 Gbps throughput in 65 nm technology. The serial link incurs lower power and area costs relative to bit-parallel communications, and enables higher tolerance to PVT variations relative to synchronous links. The link uses differential dual-rail level encoding (LEDR) and current mode signaling over a low-crosstalk interconnect layout. Novel circuits used in the link are described, including a novel splitter shift register, a fast LEDR encoder, a high-speed toggle element, a channel driver with adaptive control and a differential channel receiver.
提出了一种用于远距离片上通信的高数据速率异步位串行链路。数据位周期时间等于单门延迟,在65纳米技术中实现67 Gbps的吞吐量。串行链路相对于位并行通信产生更低的功率和面积成本,并且相对于同步链路具有更高的PVT变化容忍度。该链路采用差分双轨电平编码(LEDR)和低串扰互连布局上的电流模式信令。描述了链路中使用的新型电路,包括新型分路移位寄存器,快速LEDR编码器,高速切换元件,具有自适应控制的信道驱动器和差分信道接收器。
{"title":"High Rate Wave-pipelined Asynchronous On-chip Bit-serial Data Link","authors":"R. Dobkin, Y. Perelman, T. Liran, R. Ginosar, A. Kolodny","doi":"10.1109/ASYNC.2007.20","DOIUrl":"https://doi.org/10.1109/ASYNC.2007.20","url":null,"abstract":"A high data rate asynchronous bit-serial link for long-range on-chip communication is presented. The data bit cycle time is equal to a single gate delay, enabling 67 Gbps throughput in 65 nm technology. The serial link incurs lower power and area costs relative to bit-parallel communications, and enables higher tolerance to PVT variations relative to synchronous links. The link uses differential dual-rail level encoding (LEDR) and current mode signaling over a low-crosstalk interconnect layout. Novel circuits used in the link are described, including a novel splitter shift register, a fast LEDR encoder, a high-speed toggle element, a channel driver with adaptive control and a differential channel receiver.","PeriodicalId":136595,"journal":{"name":"13th IEEE International Symposium on Asynchronous Circuits and Systems (ASYNC'07)","volume":"22 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-03-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114681524","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 40
On-chip samplers for test and debug of asynchronous circuits 用于异步电路测试和调试的片上采样器
Frankie Y. Liu, R. Ho, R. Drost, Scott M. Fairbanks
On-chip high-bandwidth sampling circuits supplement traditional test and debug techniques by non-invasively probing analog voltages for off-chip measurement. Existing circuits rely on sub-sampling techniques and thus require a synchronous clock. We extend these ideas to asynchronous circuits by combining an analog sampling head with a variable delay element and activating this circuit with an asynchronously triggered event. Repeated triggering events with different delays emulate sub-sampling. Simulations in a 180 nm technology of SRAM timing margins and GasP control failure modes show this technique can probe asynchronous signals with high fidelity.
片上高带宽采样电路通过非侵入式探测模拟电压来补充传统的测试和调试技术,用于片外测量。现有的电路依赖于子采样技术,因此需要同步时钟。我们将这些想法扩展到异步电路,通过将模拟采样头与可变延迟元件相结合,并使用异步触发事件激活该电路。具有不同延迟的重复触发事件模拟子采样。在180 nm的SRAM时序裕度和GasP控制失效模式上的仿真表明,该技术可以高保真地探测异步信号。
{"title":"On-chip samplers for test and debug of asynchronous circuits","authors":"Frankie Y. Liu, R. Ho, R. Drost, Scott M. Fairbanks","doi":"10.1109/ASYNC.2007.24","DOIUrl":"https://doi.org/10.1109/ASYNC.2007.24","url":null,"abstract":"On-chip high-bandwidth sampling circuits supplement traditional test and debug techniques by non-invasively probing analog voltages for off-chip measurement. Existing circuits rely on sub-sampling techniques and thus require a synchronous clock. We extend these ideas to asynchronous circuits by combining an analog sampling head with a variable delay element and activating this circuit with an asynchronously triggered event. Repeated triggering events with different delays emulate sub-sampling. Simulations in a 180 nm technology of SRAM timing margins and GasP control failure modes show this technique can probe asynchronous signals with high fidelity.","PeriodicalId":136595,"journal":{"name":"13th IEEE International Symposium on Asynchronous Circuits and Systems (ASYNC'07)","volume":"100 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-03-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114591733","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Formal Verification of CHP Specifications with CADP Illustration on an Asynchronous Network-on-Chip 用异步片上网络的CADP图示正式验证CHP规格
Gwen Salaün, Wendelin Serwe, Y. Thonnart, P. Vivet
Few formal verification techniques are currently available for asynchronous designs. In this paper, we describe a new approach for the formal verification of asynchronous architectures described in the high-level language CHP, by using model checking techniques provided by the CADP toolbox. Our proposal is based on an automatic translation from CHP into LOTOS, the process algebra used in CADP. A translator has been implemented, which handles full CHP including the specific probe operator. The CADP toolbox capabilities allow the designer to verify properties such as deadlock-freedom or protocol correctness on substantial systems. Our approach has been successfully applied to formally verify two complex designs. In this paper, we illustrate our technique on an asynchronous network-on-chip architecture. Its formal verification highlights the need to carefully design systems exhibiting non-deterministic behavior.
目前很少有正式的验证技术可用于异步设计。在本文中,我们通过使用CADP工具箱提供的模型检查技术,描述了一种用高级语言CHP描述的异步体系结构的形式化验证的新方法。我们的建议是基于从CHP到LOTOS的自动转换,这是CADP中使用的过程代数。已经实现了一个转换器,它处理完整的CHP,包括特定的探针操作符。CADP工具箱功能允许设计人员在实体系统上验证诸如死锁自由或协议正确性之类的属性。我们的方法已经成功地应用于正式验证两个复杂的设计。在本文中,我们在异步片上网络架构上演示了我们的技术。它的正式验证强调了仔细设计显示非确定性行为的系统的必要性。
{"title":"Formal Verification of CHP Specifications with CADP Illustration on an Asynchronous Network-on-Chip","authors":"Gwen Salaün, Wendelin Serwe, Y. Thonnart, P. Vivet","doi":"10.1109/ASYNC.2007.18","DOIUrl":"https://doi.org/10.1109/ASYNC.2007.18","url":null,"abstract":"Few formal verification techniques are currently available for asynchronous designs. In this paper, we describe a new approach for the formal verification of asynchronous architectures described in the high-level language CHP, by using model checking techniques provided by the CADP toolbox. Our proposal is based on an automatic translation from CHP into LOTOS, the process algebra used in CADP. A translator has been implemented, which handles full CHP including the specific probe operator. The CADP toolbox capabilities allow the designer to verify properties such as deadlock-freedom or protocol correctness on substantial systems. Our approach has been successfully applied to formally verify two complex designs. In this paper, we illustrate our technique on an asynchronous network-on-chip architecture. Its formal verification highlights the need to carefully design systems exhibiting non-deterministic behavior.","PeriodicalId":136595,"journal":{"name":"13th IEEE International Symposium on Asynchronous Circuits and Systems (ASYNC'07)","volume":"11 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-03-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125230582","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 35
The Design of a Genetic Muller C-Element 遗传Muller c -元的设计
Nam-Phuong D. Nguyen, Hiroyuki Kuwahara, C. Myers, J. Keener
Synthetic biology uses engineering principles to design circuits out of genetic materials that are inserted into bacteria to perform various tasks. While synthetic combinational Boolean logic gates have been constructed, there are many open issues in the design of sequential logic gates. One such gate common in most asynchronous circuits is the Muller C-element, which is used to synchronize multiple independent processes. This paper proposes a novel design for a genetic Muller C-element using transcriptional regulatory elements. The design of a genetic Muller C-element enables the construction of virtually any asynchronous circuit from genetic material. There are, however, many issues that complicate designs with genetic materials. These complications result in modifications being required to the normal digital design procedure. This paper presents two designs that are logically equivalent to a Muller C-element. Mathematical analysis and stochastic simulation, however, show that only one functions reliably.
合成生物学利用工程原理设计遗传物质的电路,将其插入细菌中以执行各种任务。虽然合成组合布尔逻辑门已经被构造出来,但是顺序逻辑门的设计还存在许多问题。在大多数异步电路中常见的一种门是Muller c元件,用于同步多个独立进程。本文提出了一种利用转录调控元件设计遗传Muller c元件的新方法。遗传穆勒c元件的设计使几乎任何异步电路的遗传物质的建设。然而,有许多问题使遗传材料的设计复杂化。这些复杂性导致需要对正常的数字设计过程进行修改。本文提出了两种逻辑上等价于Muller c元的设计。然而,数学分析和随机模拟表明,只有一个函数是可靠的。
{"title":"The Design of a Genetic Muller C-Element","authors":"Nam-Phuong D. Nguyen, Hiroyuki Kuwahara, C. Myers, J. Keener","doi":"10.1109/ASYNC.2007.27","DOIUrl":"https://doi.org/10.1109/ASYNC.2007.27","url":null,"abstract":"Synthetic biology uses engineering principles to design circuits out of genetic materials that are inserted into bacteria to perform various tasks. While synthetic combinational Boolean logic gates have been constructed, there are many open issues in the design of sequential logic gates. One such gate common in most asynchronous circuits is the Muller C-element, which is used to synchronize multiple independent processes. This paper proposes a novel design for a genetic Muller C-element using transcriptional regulatory elements. The design of a genetic Muller C-element enables the construction of virtually any asynchronous circuit from genetic material. There are, however, many issues that complicate designs with genetic materials. These complications result in modifications being required to the normal digital design procedure. This paper presents two designs that are logically equivalent to a Muller C-element. Mathematical analysis and stochastic simulation, however, show that only one functions reliably.","PeriodicalId":136595,"journal":{"name":"13th IEEE International Symposium on Asynchronous Circuits and Systems (ASYNC'07)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-03-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134270310","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 24
A High Resolution Flash Time-to-Digital Converter Taking Into Account Process Variability 一种考虑过程可变性的高分辨率闪存时间-数字转换器
N. Minas, D. Kinniment, K. Heron, G. Russell
Timing issues are a major concern in the design of high performance synchronous, asynchronous circuits and GALS. Investigations into the causes of many timing problems cannot be satisfactorily undertaken using external equipment due to its remoteness from the source of the potential problem; this necessitates the development of on-chip time measurement circuitry. Current techniques have the capability of resolving timing differences down to 5ps [1], however further improvement is impeded by process variations. This paper describes a flash Time to Digital Converter (TDC) suitable for on-chip implementation. The theory to overcome the effects of process variations, potentially permitting the time resolution down to one picosecond is described. Proof of concept is demonstrated by implementing the techniques in an FPGA, improving on the current resolution of FPGA implementation of a TDC.
时序问题是高性能同步、异步电路和GALS设计中的一个主要问题。由于外部设备远离潜在问题的根源,使用外部设备对许多定时问题的原因进行调查不能令人满意;这就需要开发片上时间测量电路。目前的技术有能力将时间差异解决到5ps[1],但是进一步的改进受到工艺变化的阻碍。本文介绍了一种适用于片上实现的闪存时间数字转换器(TDC)。描述了克服工艺变化影响的理论,可能允许时间分辨率降低到1皮秒。通过在FPGA中实现这些技术来验证概念,提高了FPGA实现TDC的当前分辨率。
{"title":"A High Resolution Flash Time-to-Digital Converter Taking Into Account Process Variability","authors":"N. Minas, D. Kinniment, K. Heron, G. Russell","doi":"10.1109/ASYNC.2007.7","DOIUrl":"https://doi.org/10.1109/ASYNC.2007.7","url":null,"abstract":"Timing issues are a major concern in the design of high performance synchronous, asynchronous circuits and GALS. Investigations into the causes of many timing problems cannot be satisfactorily undertaken using external equipment due to its remoteness from the source of the potential problem; this necessitates the development of on-chip time measurement circuitry. Current techniques have the capability of resolving timing differences down to 5ps [1], however further improvement is impeded by process variations. This paper describes a flash Time to Digital Converter (TDC) suitable for on-chip implementation. The theory to overcome the effects of process variations, potentially permitting the time resolution down to one picosecond is described. Proof of concept is demonstrated by implementing the techniques in an FPGA, improving on the current resolution of FPGA implementation of a TDC.","PeriodicalId":136595,"journal":{"name":"13th IEEE International Symposium on Asynchronous Circuits and Systems (ASYNC'07)","volume":"87 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-03-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123301891","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 22
Efficient Asynchronous Protocol Converters for Two-Phase Delay-Insensitive Global Communication 用于两相延迟不敏感全局通信的高效异步协议转换器
Amit Mitra, William F. McLaughlin, S. Nowick
As system-level interconnect incurs increasing penalties in latency, round-trip cycle time and power, and as timing-variability becomes an increasing design challenge, there is renewed interest in using two-phase delay-insensitive protocols for global system-level communication. However, in practice, when designing asynchronous systems, it is extremely inefficient to build local computation nodes with two-phase logic, hence four-phase computation blocks are typically used. This paper proposes a new architecture, and circuit-level implementations, for a family of asynchronous protocol converters, which efficiently convert between two- and four-phase protocols, thus facilitating system design with robust global two-phase protocols and local four-phase protocols. The main focus is on a level-encoded dual-rail (LEDR) two-phase protocol for global communication, and a four-phase return-to-zero (RZ) protocol for asynchronous computation blocks. However, with small modifications, the converters are extended to handle other common four-phase protocols, such as 1- of-4 and single-rail bundled data. The converters are highly robust, with almost entirely quasi delay- insensitive implementations, yet exhibit high performance and modest area overhead. Initial post-layout simulations in a 0.18 micron TSMC process are provided, both assuming a small computation block (8times8 combinational multiplier) as well as an empty computation block (FIFO stage).
由于系统级互连在延迟、往返周期时间和功耗方面的影响越来越大,并且时间可变性成为越来越大的设计挑战,因此人们对使用两阶段延迟不敏感协议进行全局系统级通信重新产生了兴趣。然而,在实际设计异步系统时,采用两阶段逻辑构建局部计算节点的效率极低,因此通常使用四阶段计算块。本文提出了一种新的异步协议转换器的结构和电路级实现,该转换器可以有效地在两阶段和四阶段协议之间进行转换,从而促进了具有鲁棒的全局两阶段协议和局部四阶段协议的系统设计。主要焦点是用于全局通信的电平编码双轨(LEDR)两阶段协议,以及用于异步计算块的四阶段归零(RZ)协议。然而,经过小的修改,转换器可以扩展到处理其他常见的四相协议,例如1- of-4和单轨捆绑数据。转换器具有高度鲁棒性,几乎完全是准延迟不敏感的实现,但表现出高性能和适度的面积开销。给出了在0.18微米TSMC工艺中初始的布局后模拟,假设一个小计算块(8times8组合乘法器)和一个空计算块(FIFO阶段)。
{"title":"Efficient Asynchronous Protocol Converters for Two-Phase Delay-Insensitive Global Communication","authors":"Amit Mitra, William F. McLaughlin, S. Nowick","doi":"10.1109/ASYNC.2007.17","DOIUrl":"https://doi.org/10.1109/ASYNC.2007.17","url":null,"abstract":"As system-level interconnect incurs increasing penalties in latency, round-trip cycle time and power, and as timing-variability becomes an increasing design challenge, there is renewed interest in using two-phase delay-insensitive protocols for global system-level communication. However, in practice, when designing asynchronous systems, it is extremely inefficient to build local computation nodes with two-phase logic, hence four-phase computation blocks are typically used. This paper proposes a new architecture, and circuit-level implementations, for a family of asynchronous protocol converters, which efficiently convert between two- and four-phase protocols, thus facilitating system design with robust global two-phase protocols and local four-phase protocols. The main focus is on a level-encoded dual-rail (LEDR) two-phase protocol for global communication, and a four-phase return-to-zero (RZ) protocol for asynchronous computation blocks. However, with small modifications, the converters are extended to handle other common four-phase protocols, such as 1- of-4 and single-rail bundled data. The converters are highly robust, with almost entirely quasi delay- insensitive implementations, yet exhibit high performance and modest area overhead. Initial post-layout simulations in a 0.18 micron TSMC process are provided, both assuming a small computation block (8times8 combinational multiplier) as well as an empty computation block (FIFO stage).","PeriodicalId":136595,"journal":{"name":"13th IEEE International Symposium on Asynchronous Circuits and Systems (ASYNC'07)","volume":"47 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-03-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124912471","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 30
A Cycle-Based Decomposition Method for Burst-Mode Asynchronous Controllers 突发模式异步控制器的循环分解方法
Melinda Y. Agyekum, S. Nowick
In this paper, a systematic and automated methodology is proposed for decomposing an asynchronous burst-mode (BM) controller into smaller sub-controllers, where each resulting sub-controller is activated on a communication channel. The proposed approach consists of a new decomposition algorithm, control micro-architecture and inter-controller communication protocol. This method has also been broadened to handle extended burst- mode (XBM) controllers. For both controller types, only a moderate amount of auxiliary hardware is required, and optimizations are proposed to eliminate or simplify this hardware. Initial runtime results for both burst-mode and extended burst- mode controllers are promising. Two of the largest BM benchmarks (dean-cache, scsi) were run using the Minimalist CAD tool and an optimized script. While the original controllers each timed out after 10 hours, the decomposition runs each completed in under 84 seconds. Further attempts to synthesize the original controllers using a suboptimal script succeeded, but with 16-200x greater runtime. Several XBM benchmarks were synthesized using the 3D CAD tool; one large complex controller (cdp-pl) was unable to complete while the decomposed run succeeded in under 197 seconds.
在本文中,提出了一种系统和自动化的方法,用于将异步突发模式(BM)控制器分解为更小的子控制器,其中每个子控制器在通信信道上被激活。该方法由一种新的分解算法、控制微结构和控制器间通信协议组成。这种方法也被扩展到处理扩展突发模式(XBM)控制器。对于这两种控制器类型,只需要适量的辅助硬件,并提出优化以消除或简化这些硬件。突发模式和扩展突发模式控制器的初始运行结果都是有希望的。两个最大的BM基准测试(dean-cache、scsi)是使用Minimalist CAD工具和优化的脚本运行的。虽然最初的控制器在10小时后都超时了,但分解运行在84秒内完成。使用次优脚本合成原始控制器的进一步尝试成功了,但运行时间增加了16-200倍。使用3D CAD工具合成了几个XBM基准;当分解运行在197秒内成功时,一个大型复杂控制器(cdp-pl)无法完成。
{"title":"A Cycle-Based Decomposition Method for Burst-Mode Asynchronous Controllers","authors":"Melinda Y. Agyekum, S. Nowick","doi":"10.1109/ASYNC.2007.6","DOIUrl":"https://doi.org/10.1109/ASYNC.2007.6","url":null,"abstract":"In this paper, a systematic and automated methodology is proposed for decomposing an asynchronous burst-mode (BM) controller into smaller sub-controllers, where each resulting sub-controller is activated on a communication channel. The proposed approach consists of a new decomposition algorithm, control micro-architecture and inter-controller communication protocol. This method has also been broadened to handle extended burst- mode (XBM) controllers. For both controller types, only a moderate amount of auxiliary hardware is required, and optimizations are proposed to eliminate or simplify this hardware. Initial runtime results for both burst-mode and extended burst- mode controllers are promising. Two of the largest BM benchmarks (dean-cache, scsi) were run using the Minimalist CAD tool and an optimized script. While the original controllers each timed out after 10 hours, the decomposition runs each completed in under 84 seconds. Further attempts to synthesize the original controllers using a suboptimal script succeeded, but with 16-200x greater runtime. Several XBM benchmarks were synthesized using the 3D CAD tool; one large complex controller (cdp-pl) was unable to complete while the decomposed run succeeded in under 197 seconds.","PeriodicalId":136595,"journal":{"name":"13th IEEE International Symposium on Asynchronous Circuits and Systems (ASYNC'07)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-03-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127262125","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
期刊
13th IEEE International Symposium on Asynchronous Circuits and Systems (ASYNC'07)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1