IEEE Computer Society Annual Symposium on VLSI: New Frontiers in VLSI Design (ISVLSI'05)最新文献

英文中文

High performance array processor for video decoding 用于视频解码的高性能阵列处理器

IEEE Computer Society Annual Symposium on VLSI: New Frontiers in VLSI Design (ISVLSI'05)

Pub Date : 2005-05-11 DOI: 10.1109/ISVLSI.2005.36

Jooheung Lee, N. Vijaykrishnan, M. J. Irwin

In this paper, high performance array processor for signal processing algorithms with high computational complexities is implemented using 0.16 /spl mu/m CMOS standard cell library. The proposed array processor consists of simple processing elements. The architectural benefits of highly regular, parallel, and pipelined processing elements simplify the design of complex signal processing systems and enable high throughput rate by massive parallel computations. We show the utility of the proposed architecture as a configurable core by mapping inverse discrete cosine transform (IDCT), motion compensation (MC), and inverse quantization (IQ) onto the proposed fabric. In addition, we propose a novel scheme that integrates the inverse quantization part of video decoding into the 2-D IDCT process simplifying computational logics. The results show that a high throughput rate to meet the real-time requirement is effectively achieved by exploiting the properties of both compressed video data statistics and the array processor architecture.

本文采用0.16 /spl mu/m CMOS标准单元库实现了用于高计算复杂度信号处理算法的高性能阵列处理器。所提出的阵列处理器由简单的处理元素组成。高度规则、并行和流水线处理元素的架构优势简化了复杂信号处理系统的设计，并通过大规模并行计算实现了高吞吐率。我们通过将反离散余弦变换(IDCT)、运动补偿(MC)和反量化(IQ)映射到所提议的结构上，展示了所提议的架构作为可配置核心的实用性。此外，我们提出了一种新的方案，将视频解码的逆量化部分集成到二维IDCT过程中，简化了计算逻辑。结果表明，利用压缩视频数据统计特性和阵列处理器结构，可以有效地实现满足实时性要求的高吞吐量。

引用次数: 3

Design and implementaion of a 2D-DCT architecture using coefficient distributed arithmetic [implementaion read implementation] 基于系数分布算法的2D-DCT结构设计与实现[实现读实现]

IEEE Computer Society Annual Symposium on VLSI: New Frontiers in VLSI Design (ISVLSI'05)

Pub Date : 2005-05-11 DOI: 10.1109/ISVLSI.2005.25

Soumik Ghosh, Soujanya Venigalla, M. Bayoumi

The paper describes the design and implementation of an 8 /spl times/8 2D DCT chip for use in low-power applications. The design exploits a coefficient distributed arithmetic (CoDA) scheme as opposed to the prevalent data distributed arithmetic (DDA) schemes to achieve low power consumption. The architecture uses no ROMs and uses minimum number of additions by exploiting the redundancy in the adder arrays. The described architecture for the CoDA scheme is implemented on FPGA and has been fabricated on silicon. The fabricated chip computes 8 /spl times/8 2D DCT @ 50 MHz consuming around 137mW of power.

本文介绍了一种用于低功耗应用的8 /spl倍/8二维DCT芯片的设计与实现。该设计利用系数分布式算法(CoDA)方案来实现低功耗，而不是流行的数据分布式算法(DDA)方案。该体系结构不使用rom，并通过利用加法器阵列中的冗余来使用最小数量的加法。所描述的CoDA方案的体系结构在FPGA上实现，并已在硅片上制造。制造的芯片在50 MHz时计算8 /spl次/8次2D DCT，消耗约137mW的功率。

引用次数: 25

Exploiting inter-processor data sharing for improving behavior of multi-processor SoCs 利用处理器间数据共享改善多处理器soc的性能

IEEE Computer Society Annual Symposium on VLSI: New Frontiers in VLSI Design (ISVLSI'05)

Pub Date : 2005-05-11 DOI: 10.1109/ISVLSI.2005.32

Guilin Chen, Guangyu Chen, O. Ozturk, M. Kandemir

Software-managed memories are important in realtime embedded environments where execution time predictability is an important requirement. With the proliferation of embedded multi-processor systems, software support for their memories is becoming an attractive research area in real-time embedded computing. One of the critical problems in embedded real-time multi-processor SoCs (system-on-a-chip) is to reduce the number of off-chip references. This is because frequent off-chip references can be very costly from both performance and power perspectives. In this paper, we propose a novel compiler-driven strategy for reducing the number of off-chip references, which is based on cooperation between the processors in the multi-processor architecture. Specifically, in the proposed strategy, the processors cache data in their local memories, under compiler control, on behalf of each other if doing so reduces the number of off-chip references.

软件管理的内存在实时嵌入式环境中非常重要，因为执行时间的可预测性是一个重要的需求。随着嵌入式多处理器系统的不断发展，对其存储器的软件支持已成为实时嵌入式计算领域的研究热点。嵌入式实时多处理器soc(片上系统)的关键问题之一是减少片外引用的数量。这是因为从性能和功耗的角度来看，频繁的芯片外引用可能会非常昂贵。在本文中，我们提出了一种新的编译器驱动策略来减少片外引用的数量，该策略基于多处理器架构中处理器之间的合作。具体来说，在提出的策略中，处理器在编译器的控制下，在本地内存中缓存数据，如果这样做可以减少芯片外引用的数量，那么处理器代表彼此缓存数据。

引用次数: 9

A comparative study on dicing of multiple project wafers 多项目晶圆切割的比较研究

IEEE Computer Society Annual Symposium on VLSI: New Frontiers in VLSI Design (ISVLSI'05)

Pub Date : 2005-05-11 DOI: 10.1109/ISVLSI.2005.3

Meng-Chiou Wu, Rung-Bin Lin

This paper carries out a comparative study on the methods of dicing multi-project wafers (MPW). Our dicing method results in using 40% fewer wafers both for low and high volume production.

本文对多项目晶圆的切割方法进行了比较研究。我们的切割方法可以减少40%的晶圆用于小批量和大批量生产。

引用次数: 10

A data-driven approach for embedded security 嵌入式安全的数据驱动方法

IEEE Computer Society Annual Symposium on VLSI: New Frontiers in VLSI Design (ISVLSI'05)

Pub Date : 2005-05-11 DOI: 10.1109/ISVLSI.2005.4

H. Saputra, O. Ozturk, N. Vijaykrishnan, M. Kandemir, R. Brooks

As embedded systems are being used in a wide variety of critical applications, providing security to data stored and processed in these systems has become an important issue. However, providing security incurs performance and power overheads that need to be limited in resource-constrained embedded environments. Consequently, architectural support to limit these overheads to be incurred only while storing or processing vital data is desirable. In this paper, we present an architecture that provides selective encryption protection for storage and processing protection to power analysis attacks for data marked as requiring security. Further, we show how the code can be transformed to reduce the overhead associated with protecting secure data.

随着嵌入式系统被广泛应用于各种关键应用，为这些系统中存储和处理的数据提供安全性已成为一个重要问题。然而，提供安全性会导致性能和电源开销，在资源受限的嵌入式环境中需要限制这些开销。因此，需要提供架构支持，将这些开销限制在仅在存储或处理重要数据时发生。在本文中，我们提出了一种体系结构，为存储提供选择性加密保护，并为标记为需要安全的数据提供功率分析攻击的处理保护。此外，我们将展示如何转换代码以减少与保护安全数据相关的开销。

引用次数: 10

Configurable multiprocessors for high-performance MPEG-4 video coding 用于高性能MPEG-4视频编码的可配置多处理器

IEEE Computer Society Annual Symposium on VLSI: New Frontiers in VLSI Design (ISVLSI'05)

Pub Date : 2005-05-11 DOI: 10.1109/ISVLSI.2005.24

V. Chouliaras, T. Jacobs, A. K. Kumaraswamy, J. Núñez-Yáñez

We investigate the performance improvement of a multithreaded MPEG-4 video encoder executing on a configurable, extensible, SoC multiprocessor. Architecture-level results indicate a significant reduction in the dynamic instruction count of the order of 83% for 16 processor contexts compared to the original single-thread implementation. We extended an open-source 32-bit RISC CPU to include hardware-based multi-processing primitives and associated support state and implemented a parametric, bus-based SoC multiprocessor as the target platform for the threaded video encoder.

我们研究了在可配置、可扩展、SoC多处理器上执行的多线程MPEG-4视频编码器的性能改进。体系结构级的结果表明，与原始的单线程实现相比，在16个处理器上下文中，动态指令计数显著减少了83%。我们扩展了一个开源的32位RISC CPU，包括基于硬件的多处理原语和相关的支持状态，并实现了一个参数化的、基于总线的SoC多处理器作为线程视频编码器的目标平台。

引用次数: 0

Using the nonlinear property of FSR and dictionary coding for reduction of test volume 利用FSR的非线性特性和字典编码来减小测试体积

IEEE Computer Society Annual Symposium on VLSI: New Frontiers in VLSI Design (ISVLSI'05)

Pub Date : 2005-05-11 DOI: 10.1109/ISVLSI.2005.75

Il-soo Lee, Jae-Hoon Jeong, A. Ambler

Using the nonlinear feedback shift register in testing is known to create a test set for combinational circuits instead of using the deterministic test set. The nonlinear property of feedback shift register is used differently here to reduce the test data volume for combinational circuits without using the nonlinear feedback shift register. In addition, a dictionary coding method is applied to further decrease a reduced test set. Results with benchmark circuits show a great improvement in the reduction of test data volume.

在测试中使用非线性反馈移位寄存器可以代替确定性测试集来创建组合电路的测试集。在不使用非线性反馈移位寄存器的情况下，利用反馈移位寄存器的非线性特性来减少组合电路的测试数据量。此外，还采用字典编码的方法对约简测试集进行进一步约简。基准电路的测试结果表明，在减少测试数据量方面有很大的改进。

引用次数: 0

Design of a real time system for nonlinear enhancement of video streams by an integrated neighborhood dependent approach 基于集成邻域依赖方法的视频流非线性增强实时系统设计

IEEE Computer Society Annual Symposium on VLSI: New Frontiers in VLSI Design (ISVLSI'05)

Pub Date : 2005-05-11 DOI: 10.1109/ISVLSI.2005.28

A. Livingston, H. T. Ngo, Ming Z. Zhang, Li Tao, V. Asari

In this paper, we propose an efficient VLSI architecture for real time enhancement of video containing non-uniform and low light conditions. The nonlinear transfer function is determined by the cumulative distribution function of the previous frame. A dataflow design technique is used to construct a pipelined multimodule architecture. The design is capable of processing 73 1024/spl times/1024 video frames per second.

在本文中，我们提出了一种高效的VLSI架构，用于包含非均匀和低光条件的视频的实时增强。非线性传递函数由前一框架的累积分布函数决定。采用数据流设计技术构建了一个流水线式的多模块体系结构。该设计能够每秒处理73 1024/spl次/1024帧视频。

引用次数: 2

eWatch: context sensitive system design case study eWatch:上下文敏感系统设计案例研究

IEEE Computer Society Annual Symposium on VLSI: New Frontiers in VLSI Design (ISVLSI'05)

Pub Date : 2005-05-11 DOI: 10.1109/ISVLSI.2005.31

A. Smailagic, D. Siewiorek, Uwe Maurer, Anthony G. Rowe, Karen P. Tang

In this paper, we introduce a novel context sensitive system design paradigm. Multiple sensors/computational architecture, in the form of our eWatch device, is used to infer the activities that the system is encountering, and can provide a platform for context-aware computing. We created an eWatch prototype that senses user activities and notifies them when important messages have arrived. An accelerometer and microphone provide inputs to a model of interruptibility. A vibration motor for tactile feedback and two ultra bright LEDs for visual feedback provide user notification through different vibration patterns and colors. eWatch is transparently integrated into the user's environment, and communicates via Bluetooth. This new class of integrated systems underscores the need for new forms of regularity, constraints, and design structure. Our results indicate the power of our method to accurately determine a meaningful context model while only requiring data from our eWatch device.

在本文中，我们引入了一种新的上下文敏感系统设计范式。以我们的eWatch设备的形式，使用多个传感器/计算架构来推断系统遇到的活动，并可以为上下文感知计算提供平台。我们创建了一个eWatch原型，可以感知用户活动，并在重要信息到达时通知他们。加速度计和传声器为可中断性模型提供输入。一个用于触觉反馈的振动电机和两个用于视觉反馈的超亮led通过不同的振动模式和颜色为用户提供通知。eWatch透明地集成到用户的环境中，并通过蓝牙进行通信。这种新型集成系统强调了对规则、约束和设计结构的新形式的需求。我们的结果表明，我们的方法可以准确地确定一个有意义的上下文模型，而只需要来自我们的eWatch设备的数据。

引用次数: 17

Boost logic : a high speed energy recovery circuit family 升压逻辑:一种高速能量回收电路系列

IEEE Computer Society Annual Symposium on VLSI: New Frontiers in VLSI Design (ISVLSI'05)

Pub Date : 2005-05-11 DOI: 10.1109/ISVLSI.2005.22

V. Sathe, M. Papaefthymiou, C. Ziesler

In this paper, we propose boost logic, a logic family which relies on voltage scaling, gate overdrive, and energy recovery techniques to achieve high energy efficiency at frequencies in the GHz range. The key feature of our design is the use of an energy recovering "boost" stage to provide an efficient gate overdrive to a highly voltage-scaled logic at near-threshold supply voltage. We have evaluated our logic family using simulation results from an 8-bit carry-save multiplier in a 0.13 /spl mu/m CMOS process with V/sub th/ = 340 mV at 1.4 GHz and a 1.1 V supply voltage, the boost multiplier dissipates 3.44 pJ per computation, achieving 57% energy savings with respect to its static CMOS counterpart. Using low V/sub th/ devices, boost logic has been verified to operate at 2 GHz with a 1.2 V voltage supply and 3.76 pJ energy dissipation per cycle.

在本文中，我们提出升压逻辑，这是一个逻辑家族，它依赖于电压缩放，栅极超速驱动和能量回收技术，以实现GHz范围内频率的高能效。我们设计的关键特点是使用能量回收“升压”级，在接近阈值的电源电压下为高电压比例逻辑提供有效的栅极超速驱动。我们使用在0.13 /spl mu/m CMOS工艺中使用V/sub / = 340 mV在1.4 GHz和1.1 V电源电压下的8位减持倍增器的仿真结果评估了我们的逻辑家族，每次计算增强倍增器耗散3.44 pJ，相对于其静态CMOS对应物，实现了57%的节能。使用低V/sub /器件，升压逻辑已被验证可在2 GHz下工作，电压为1.2 V，每周期能耗为3.76 pJ。

引用次数: 11

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

IEEE Computer Society Annual Symposium on VLSI: New Frontiers in VLSI Design (ISVLSI'05)

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀