Pub Date: 1995-11-01 · DOI: 10.1016/0165-6074(95)00017-I
Myung-Kyun Kim, Hyunsoo Yoon, S.R. Maeng
In this paper, we consider a class of log N stage interconnection networks called Bit-Permute Multistage Interconnection Networks (BPMINs), in which the ports of each switch of a stage differ in only one bit position of their labels. We describe the decomposition structure of the BPMINs and prove that all BPMINs are topologically equivalent and that some of them are functionally equivalent. We also identify a class of 2 log N stage rearrangeable networks, called symmetric BPMINs, in which two log N stage BPMINs are connected in sequence. The symmetric BPMINs may be either symmetric or asymmetric, and regular or irregular, in their inter-stage connections, and can be reduced to 2 log N − 1 stages by combining the two center stages. We show that the symmetric BPMINs constitute a larger class of rearrangeable networks than any previously known. We also propose a general routing algorithm for the symmetric BPMINs, obtained by slightly modifying the looping algorithm of the Benes network.
"Bit-permute multistage interconnection networks", Microprocessing and Microprogramming 41(7), pp. 449-468.
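The defining BPMIN property above (switch ports whose labels differ in exactly one bit position) amounts to a Hamming-distance-1 test on the labels. A minimal sketch in Python; the function name is ours, not from the paper:

```python
def differ_in_one_bit(a: int, b: int) -> bool:
    """True iff the binary labels a and b differ in exactly one bit position."""
    x = a ^ b
    # x has exactly one set bit iff it is nonzero and a power of two
    return x != 0 and (x & (x - 1)) == 0
```

For an N = 8 network, port labels 0b010 and 0b110 differ only in the high bit, so the test succeeds; labels 0b011 and 0b000 differ in two bits and fail.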
Pub Date: 1995-11-01 · DOI: 10.1016/0165-6074(95)00016-H
Shih-Hsu Huang , Yu-Chin Hsu , Yen-Jen Oyang
This paper describes a new scheduling algorithm for the automatic synthesis of the control blocks of control-dominated circuits. The proposed algorithm is distinctive in that it partitions a control/data flow graph (CDFG) into an equivalent state transition graph. It works on the CDFG to exploit operation relocation, chaining, duplication, and unification. The optimization goal is to schedule each execution path as fast as possible. Benchmark data show that this approach achieves better results than previous ones in terms of circuit speedup and the number of states and transitions.
"A new scheduling algorithm for synthesizing the control blocks of control-dominated circuits", Microprocessing and Microprogramming 41(7), pp. 501-519.
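The goal of scheduling each execution path as fast as possible builds on classic as-soon-as-possible (ASAP) scheduling over the dependence graph. The sketch below shows plain ASAP scheduling of a DAG, not the paper's full algorithm (which adds relocation, chaining, duplication and unification); names and interface are ours:

```python
from collections import defaultdict, deque

def asap_schedule(ops, deps):
    """Assign each operation the earliest control step its data
    dependencies allow (ASAP scheduling over a dependence DAG).

    ops  : iterable of operation names
    deps : list of (producer, consumer) edges
    Returns {op: step}, steps numbered from 0.
    """
    succs = defaultdict(list)
    indeg = {op: 0 for op in ops}
    for u, v in deps:
        succs[u].append(v)
        indeg[v] += 1
    step = {op: 0 for op in ops}                 # sources start at step 0
    ready = deque(op for op in ops if indeg[op] == 0)
    done = 0
    while ready:                                 # Kahn topological order
        u = ready.popleft()
        done += 1
        for v in succs[u]:
            step[v] = max(step[v], step[u] + 1)  # v must follow u
            indeg[v] -= 1
            if indeg[v] == 0:
                ready.append(v)
    assert done == len(step), "dependence graph has a cycle"
    return step
```

For a diamond-shaped dependence graph a → {b, c} → d, this yields steps 0, 1, 1, 2.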
Pub Date: 1995-11-01 · DOI: 10.1016/0165-6074(95)90002-0
"Calendar of forthcoming conferences and events", Microprocessing and Microprogramming 41(7), pp. 521-522.
Pub Date: 1995-11-01 · DOI: 10.1016/0165-6074(95)00018-J
Hong Shen
The mutual range-join of k sets S1, S2, …, Sk is the set containing all tuples (s1, s2, …, sk) that satisfy e1 ≤ |si − sj| ≤ e2 for all 1 ≤ i ≠ j ≤ k, where si ∈ Si and e1 ≤ e2 are fixed constants. This paper presents an efficient parallel algorithm for computing the k-set mutual range-join on hypercube computers. The proposed algorithm uses a fast method to determine whether the differences of all pairs among k given numbers lie within a given range, and applies the technique of permutation-based range-join [11]. To compute the mutual range-join of k sets S1, S2, …, Sk on a hypercube of p processors with O((n1 + n2 + ⋯ + nk)/p) local memory, where p ≤ |Si| = ni for 1 ≤ i ≤ k, the algorithm requires at most O((k log k/p) n1n2⋯nk) data comparisons in the worst case. The algorithm is implemented in PVM and its performance is extensively evaluated on various input data.
"Parallel K-set mutual range-join in hypercubes", Microprocessing and Microprogramming 41(7), pp. 443-448.
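The join condition itself is easy to state sequentially. The sketch below is a brute-force reference version of the k-set mutual range-join (not the paper's parallel hypercube algorithm), of the kind one might use to check a parallel implementation against; names are ours:

```python
from itertools import product

def mutual_range_join(sets, e1, e2):
    """All k-tuples (s1, ..., sk), one element per set, whose pairwise
    absolute differences all lie in [e1, e2]."""
    result = []
    for tup in product(*sets):
        # check every unordered pair in the tuple
        if all(e1 <= abs(a - b) <= e2
               for i, a in enumerate(tup)
               for b in tup[i + 1:]):
            result.append(tup)
    return result
```

This costs O(k^2 · n1n2⋯nk) comparisons sequentially, which is what makes the paper's O((k log k/p) n1n2⋯nk) parallel bound interesting.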
Pub Date: 1995-11-01 · DOI: 10.1016/0165-6074(95)00027-L
R.K. Arora , Vimal K. Khanna
We describe the design of a kernel for an inexpensive front-end processor that runs the lower layers of common communication protocols. Implementing a full-fledged kernel on a card requires large memory, expensive hardware and heavy processing overhead. We studied the requirements of the communication protocol layers which generally run as part of the host kernel, and realised that a workable kernel for the front-end processor, offering a subset of the features of the host kernel, could be implemented within a reasonable time. Such a kernel can provide the functions required by the communication protocol layers and run them within the constraints of inexpensive hardware. The aims of this research were manifold:
1. To explore the general set of requirements of common connection-oriented and connectionless protocols.
2. To design and present the algorithms of a kernel which can satisfy such requirements. Since the kernel is based on this general set of requirements, it is generic rather than specific to a single protocol running on the card; it can therefore be used to download different protocols onto the card.
3. To suggest implementation techniques that reduce memory and processing overhead on the host and improve the performance of the protocols running in the kernel.
4. To actually run a protocol on this kernel and compare its performance with an existing design.
"Design of a kernel for implementing communication protocols", Microprocessing and Microprogramming 41(7), pp. 469-485.
Pub Date: 1995-11-01 · DOI: 10.1016/0165-6074(95)00028-M
Enrico Macii, Massimo Poncino
Representing finite state systems by means of finite state machines (FSMs) is a common approach in VLSI circuit design. BDD-based algorithms have made possible the manipulation of FSMs with very large state spaces; however, when the representation of the set of reachable states grows too large, the original FSM is no longer manageable as a whole and needs to be decomposed into smaller sub-machines. Structural analysis of the circuit from which the FSM has been extracted has been shown to be very effective in determining good state variable partitions, which induce FSM decompositions for logic synthesis and formal verification applications. In this paper we propose FSM analysis techniques based on connectivity and spectral characteristics of the state machine which take into account the mutual dependency of the state variables but no longer depend on the structure of the underlying circuit; they may therefore be used in contexts other than sequential logic optimization and FSM verification. Experimental results are presented and discussed for the mcnc'91 FSM benchmarks and for the iscas'89 sequential circuits.
"Using connectivity and spectral methods to characterize the structure of sequential logic circuits", Microprocessing and Microprogramming 41(7), pp. 487-500.
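The connectivity side of such an analysis can be illustrated with a toy partitioner: build an undirected graph in which state variables are adjacent when one appears in the support of the other's next-state function, then take connected components as clusters. This is only in the spirit of the paper's method (names and interface are ours) and omits the spectral part entirely:

```python
def variable_partition(deps):
    """Cluster state variables by support connectivity.

    deps maps each variable to the set of variables its next-state
    function reads. Variables with (transitively) shared support end
    up in one cluster -- a crude connectivity-based partition.
    """
    # Build an undirected adjacency from the support sets.
    adj = {v: set() for v in deps}
    for v, support in deps.items():
        for u in support:
            adj.setdefault(u, set())
            adj[v].add(u)
            adj[u].add(v)
    seen, clusters = set(), []
    for v in adj:
        if v in seen:
            continue
        stack, comp = [v], set()
        while stack:                      # iterative DFS over one component
            x = stack.pop()
            if x in comp:
                continue
            comp.add(x)
            stack.extend(adj[x] - comp)
        seen |= comp
        clusters.append(comp)
    return clusters
```

Two mutually dependent variables land in the same cluster, and an independent variable forms its own, suggesting the FSM can be decomposed along that cut.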
Pub Date: 1995-10-01 · DOI: 10.1016/0165-6074(95)00147-G
K.A. Vissers, G. Essink, P.H.J. van Gerwen, P.J.M. Janssen, O. Popp, E. Riddersma, W.J.M. Smits, H.J.M. Veendrick
Programmable video signal processor ICs (VSPs) and dedicated programming tools have been developed for the real-time processing of digital video signals. A large number of applications have been built with boards containing several of these processors. Two implementations of the general architecture currently exist: VSP1 and VSP2. A single VSP chip contains several arithmetic and logic elements (ALEs) and memory elements; a complete switch matrix implements unconstrained communication between all elements in a single cycle. These processors are programmed with signal flow graphs, which can conveniently express multi-rate algorithms. The algorithms are then mapped onto a network of processors; mapping is decomposed into delay management, partitioning and scheduling, and the solution strategies for the partitioning and scheduling problems are illustrated. The processors have been applied to a number of industrially relevant video algorithms, including the complete processing of next-generation fully digital studio TV cameras and several image-improvement algorithms in medical applications. Results of the mapping are presented for a number of algorithms in the field of TV processing.
"Architecture and programming of two generations video signal processors", Microprocessing and Microprogramming 41(5), pp. 373-390.
Pub Date: 1995-10-01 · DOI: 10.1016/0165-6074(95)00025-J
Paul Feautrier
The problem of automatically generating programs for massively parallel computers is a very complicated one, mainly because there are many architectures, each of them seeming to pose its own particular compilation problem. The purpose of this paper is to propose a framework in which to discuss the compilation process, and to show that the features which affect it are few and generate a small number of combinations. The paper is oriented toward fine-grained parallelization of static control programs, with emphasis on dataflow analysis, scheduling and placement. When going from there to more general programs and to coarser parallelism, one encounters new problems, some of which are discussed in the conclusion.
"Compiling for massively parallel architectures: a perspective", Microprocessing and Microprogramming 41(5), pp. 425-439.
Pub Date: 1995-10-01 · DOI: 10.1016/0165-6074(95)90000-4
"Calendar of forthcoming conferences and events", Microprocessing and Microprogramming 41(5), pp. 441-442.