首页 > 最新文献

Proceedings 1998 International Conference on Parallel and Distributed Systems (Cat. No.98TB100250)最新文献

英文 中文
Design of a system to support security communication between a Web proxy and a CGI program based on PKI 基于PKI的Web代理与CGI程序安全通信系统的设计
Junseok Lee, Kisong Yoon
This paper discusses the design of a system to support security communications between a Web browser and a CGI program using PKI (public key infrastructure). TLS (transport layer security) supports security communications between a Web browser and a Web server, but this system supports security communications between a Web server and a CGI program as well as between a Web browser and a Web server. This system uses GSS (generic security service) API to communicate with PKI, and offers a Web user a Web proxy, and offers three library functions for CGI applications related to security. This makes it easier for a CGI developer to write a CGI program.
本文讨论了一个基于PKI (public key infrastructure,公钥基础设施)的支持Web浏览器和CGI程序之间安全通信的系统设计。TLS(传输层安全)支持Web浏览器和Web服务器之间的安全通信,但本系统支持Web服务器和CGI程序之间以及Web浏览器和Web服务器之间的安全通信。该系统使用GSS (generic security service) API与PKI通信,为Web用户提供Web代理,并为CGI应用提供了三个与安全相关的库功能。这使得CGI开发人员更容易编写CGI程序。
{"title":"Design of a system to support security communication between a Web proxy and a CGI program based on PKI","authors":"Junseok Lee, Kisong Yoon","doi":"10.1109/ICPADS.1998.741116","DOIUrl":"https://doi.org/10.1109/ICPADS.1998.741116","url":null,"abstract":"This paper discusses the design of a system to support security communications between a Web browser and a CGI program using PKI (public key infrastructure). TLS (transport layer security) supports security communications between a Web browser and a Web server, but this system supports security communications between a Web server and a CGI program as well as between a Web browser and a Web server. This system uses GSS (generic security service) API to communicate with PKI, and offers a Web user a Web proxy, and offers three library functions for CGI applications related to security. This makes it easier for a CGI developer to write a CGI program.","PeriodicalId":226947,"journal":{"name":"Proceedings 1998 International Conference on Parallel and Distributed Systems (Cat. No.98TB100250)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"1998-12-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128579672","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Object-oriented Ease-based parallel primitives in C++ c++中面向对象的基于简单的并行原语
T. MacKenzie, T. Dix
The paper presents our language C++-with-Ease, a superset of C++ with primitives for process creation and communication. The work adopts the semantics of the Ease parallel primitives, as defined by S.E. Zenith (1990), within the object oriented paradigm. The result is a general purpose, high level, imperative parallel programming language that allows the simple expression of parallel algorithms within a type-safe implementation. Our language implementation is best suited to homogeneous parallel MIMD machines, independent of architecture, but also runs under threads packages. C++-with-Ease addresses efficiency for message copying and provides extensions to allow the passing of arbitrary messages in a natural fashion within the paradigm. The implementation and associated message protocols are discussed.
本文介绍了我们的c++ -with-Ease语言,它是c++的一个超集,具有用于进程创建和通信的原语。这项工作采用了Ease并行原语的语义,如S.E. Zenith(1990)在面向对象范式中定义的那样。其结果是一种通用的、高级的、命令式并行编程语言,它允许在类型安全的实现中简单地表达并行算法。我们的语言实现最适合于同构并行MIMD机器,独立于体系结构,但也在线程包下运行。c++轻松解决了消息复制的效率问题,并提供了扩展,允许在范式内以自然的方式传递任意消息。讨论了实现和相关的消息协议。
{"title":"Object-oriented Ease-based parallel primitives in C++","authors":"T. MacKenzie, T. Dix","doi":"10.1109/ICPADS.1998.741145","DOIUrl":"https://doi.org/10.1109/ICPADS.1998.741145","url":null,"abstract":"The paper presents our language C++-with-Ease, a superset of C++ with primitives for process creation and communication. The work adopts the semantics of the Ease parallel primitives, as defined by S.E. Zenith (1990), within the object oriented paradigm. The result is a general purpose, high level, imperative parallel programming language that allows the simple expression of parallel algorithms within a type-safe implementation. Our language implementation is best suited to homogeneous parallel MIMD machines, independent of architecture, but also runs under threads packages. C++-with-Ease addresses efficiency for message copying and provides extensions to allow the passing of arbitrary messages in a natural fashion within the paradigm. The implementation and associated message protocols are discussed.","PeriodicalId":226947,"journal":{"name":"Proceedings 1998 International Conference on Parallel and Distributed Systems (Cat. No.98TB100250)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"1998-12-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124555904","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
IPLS: an intelligent parallel loop scheduling for multiprocessor systems 多处理器系统的智能并行循环调度
Yun-Woei Fann, Chao-Tung Yang, Chang-Jiun Tsai, S. Tseng
We propose a knowledge based approach for solving loop scheduling problems. A rule based system, called the IPLS, is developed by repertory grid and attribute ordering table to construct the knowledge base. The IPLS chooses an appropriate scheduling algorithm by inferring some features of loops and assigns parallel loops on multiprocessors for achieving high speedup. In addition, the refined system of IPLS can automatically adjust the attributes in a knowledge base according to profile information; therefore IPLS has feedback learning ability.
我们提出了一种基于知识的方法来解决循环调度问题。利用库存网格和属性排序表构建知识库,开发了一种基于规则的知识库系统。IPLS通过推断循环的某些特征来选择合适的调度算法,并在多处理器上分配并行循环,以达到较高的加速效果。此外,改进后的IPLS系统可以根据配置文件信息自动调整知识库中的属性;因此IPLS具有反馈学习能力。
{"title":"IPLS: an intelligent parallel loop scheduling for multiprocessor systems","authors":"Yun-Woei Fann, Chao-Tung Yang, Chang-Jiun Tsai, S. Tseng","doi":"10.1109/ICPADS.1998.741167","DOIUrl":"https://doi.org/10.1109/ICPADS.1998.741167","url":null,"abstract":"We propose a knowledge based approach for solving loop scheduling problems. A rule based system, called the IPLS, is developed by repertory grid and attribute ordering table to construct the knowledge base. The IPLS chooses an appropriate scheduling algorithm by inferring some features of loops and assigns parallel loops on multiprocessors for achieving high speedup. In addition, the refined system of IPLS can automatically adjust the attributes in a knowledge base according to profile information; therefore IPLS has feedback learning ability.","PeriodicalId":226947,"journal":{"name":"Proceedings 1998 International Conference on Parallel and Distributed Systems (Cat. No.98TB100250)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"1998-12-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125697091","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
Effective mechanisms to reduce the overhead of migratory sharing for linked-based cache coherence protocols in clustering multiprocessor architecture 集群多处理器架构下基于链接的缓存一致性协议迁移共享开销的有效降低机制
Der-Lin Pean, Jia-Rong Wu, Cheng Chen
Migratory-sharing data references will incur many cache misses that can be reduced by merging the invalidation/update requests and the cache misses. This paper presents effective software and hardware techniques to reduce the overhead of migratory-sharing references for the linked-based cache coherence protocols. The software scheme combines both compiler labeling and run time detection techniques. The hardware scheme uses the special access patterns in the linked-based protocoIs to detect migratory data objects. We have evaluated the performance on the linked-based program-driven simulation environment by using a set of SPLASH benchmarks. According to the simulation results, our software and hardware methods effectively enhanced the system performance up to 29% and 23% respectively by reducing the overhead of the migratory-sharing references.
迁移共享数据引用将导致许多缓存丢失,可以通过合并无效/更新请求和缓存丢失来减少缓存丢失。本文提出了有效的软件和硬件技术,以减少基于链接的缓存一致性协议的迁移共享引用的开销。该软件方案结合了编译器标记和运行时检测技术。硬件方案利用基于链接的协议中的特殊访问模式来检测迁移数据对象。我们通过使用一组SPLASH基准测试来评估基于链接的程序驱动模拟环境的性能。仿真结果表明,通过减少迁移共享引用的开销,我们的软件和硬件方法可以有效地将系统性能分别提高29%和23%。
{"title":"Effective mechanisms to reduce the overhead of migratory sharing for linked-based cache coherence protocols in clustering multiprocessor architecture","authors":"Der-Lin Pean, Jia-Rong Wu, Cheng Chen","doi":"10.1109/ICPADS.1998.741126","DOIUrl":"https://doi.org/10.1109/ICPADS.1998.741126","url":null,"abstract":"Migratory-sharing data references will incur many cache misses that can be reduced by merging the invalidation/update requests and the cache misses. This paper presents effective software and hardware techniques to reduce the overhead of migratory-sharing references for the linked-based cache coherence protocols. The software scheme combines both compiler labeling and run time detection techniques. The hardware scheme uses the special access patterns in the linked-based protocoIs to detect migratory data objects. We have evaluated the performance on the linked-based program-driven simulation environment by using a set of SPLASH benchmarks. According to the simulation results, our software and hardware methods effectively enhanced the system performance up to 29% and 23% respectively by reducing the overhead of the migratory-sharing references.","PeriodicalId":226947,"journal":{"name":"Proceedings 1998 International Conference on Parallel and Distributed Systems (Cat. No.98TB100250)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"1998-12-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132109069","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
User-defined telecooperation services 自定义远程合作服务
V. Gruhn, P. Herrmann, H. Krumm
A user-defined telecooperation service (UTS) provides service elements for application-specific communication and cooperation processes as well as integrated means for the service definition, adaptation, and management. It supports user groups with particular communication, cooperation, and coordination needs which may change over time and which may be that special, that the service maintenance can be carried out only by the users themselves in an economic and satisfactory way. The users may be organized in various open and closed groups. They dispose of personal computing equipment connected via wide-area telecommunication networks. The users participate only from time to time. Therefore, there is a partial and varying accessibility of users and user sites. Interactions are mainly based on asynchronous communication operations. The cooperation and coordination functions have to consider unreachable users. The paper introduces the notion of UTS. Fields of application are addressed. Moreover, we describe the principles of the service element definitions and outline the architecture of a supporting system.
用户自定义远程合作服务(UTS)为特定应用的通信和合作过程提供了服务元素,并为服务的定义、适配和管理提供了集成手段。它支持具有特定的通信、合作和协调需求的用户群体,这些需求可能随着时间的推移而变化,并且可能非常特殊,以至于只能由用户自己以经济和令人满意的方式进行服务维护。用户可以组织在各种开放和封闭组中。他们处理通过广域电信网络连接的个人计算设备。用户只是偶尔参与。因此,用户和用户站点的可访问性存在部分和变化。交互主要基于异步通信操作。合作和协调功能必须考虑到无法到达的用户。本文介绍了UTS的概念。讨论了应用领域。此外,我们还描述了服务元素定义的原则,并概述了支持系统的体系结构。
{"title":"User-defined telecooperation services","authors":"V. Gruhn, P. Herrmann, H. Krumm","doi":"10.1109/ICPADS.1998.741141","DOIUrl":"https://doi.org/10.1109/ICPADS.1998.741141","url":null,"abstract":"A user-defined telecooperation service (UTS) provides service elements for application-specific communication and cooperation processes as well as integrated means for the service definition, adaptation, and management. It supports user groups with particular communication, cooperation, and coordination needs which may change over time and which may be that special, that the service maintenance can be carried out only by the users themselves in an economic and satisfactory way. The users may be organized in various open and closed groups. They dispose of personal computing equipment connected via wide-area telecommunication networks. The users participate only from time to time. Therefore, there is a partial and varying accessibility of users and user sites. Interactions are mainly based on asynchronous communication operations. The cooperation and coordination functions have to consider unreachable users. The paper introduces the notion of UTS. Fields of application are addressed. Moreover, we describe the principles of the service element definitions and outline the architecture of a supporting system.","PeriodicalId":226947,"journal":{"name":"Proceedings 1998 International Conference on Parallel and Distributed Systems (Cat. No.98TB100250)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"1998-12-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130171683","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Flat indexing: a compilation technique to enhance parallelism of logic programs 平面索引:一种增强逻辑程序并行性的编译技术
Hiecheol Kim, J. Gaudiot
The paper presents a systematic approach to the compilation of logic programs for efficient clause indexing. As the kernel of the approach, we propose the indexing tree which provides a simple, but precise representation of average parallelism per node (i.e., choice point) as well as the amount of clause trials. It also provides the way to evaluate the number of the cases that the control is passed to the failure code by the indexing instruction such as switch on term, switch on constant, or switch on structure. By analyzing the indexing tree created when using the indexing scheme implemented in the WAM, we show the drawback of the WAM indexing scheme in terms of parallelism exposition and scheduling. Subsequently we propose a new indexing scheme, which we call Flat indexing. Experimental results show that over one half of the benchmarks benefit from the Flat indexing, such that compared with the WAM indexing scheme, the number of choice points is reduced by 15%. Moreover, the amount of failures which occur during the execution of indexing instructions is reduced by 35%.
本文提出了一种编写高效子句索引逻辑程序的系统方法。作为该方法的核心,我们提出了索引树,它提供了每个节点(即选择点)的平均并行度以及子句试验次数的简单而精确的表示。它还提供了一种方法来评估通过索引指令将控制传递给故障代码的情况的数量,例如开关项、开关常量或开关结构。通过分析在使用WAM中实现的索引方案时创建的索引树,我们展示了WAM索引方案在并行性展示和调度方面的缺点。随后,我们提出了一种新的索引方案,我们称之为平面索引。实验结果表明,超过一半的基准测试从Flat索引中受益,与WAM索引方案相比,选择点的数量减少了15%。此外,在执行索引指令期间发生的故障数量减少了35%。
{"title":"Flat indexing: a compilation technique to enhance parallelism of logic programs","authors":"Hiecheol Kim, J. Gaudiot","doi":"10.1109/ICPADS.1998.741166","DOIUrl":"https://doi.org/10.1109/ICPADS.1998.741166","url":null,"abstract":"The paper presents a systematic approach to the compilation of logic programs for efficient clause indexing. As the kernel of the approach, we propose the indexing tree which provides a simple, but precise representation of average parallelism per node (i.e., choice point) as well as the amount of clause trials. It also provides the way to evaluate the number of the cases that the control is passed to the failure code by the indexing instruction such as switch on term, switch on constant, or switch on structure. By analyzing the indexing tree created when using the indexing scheme implemented in the WAM, we show the drawback of the WAM indexing scheme in terms of parallelism exposition and scheduling. Subsequently we propose a new indexing scheme, which we call Flat indexing. Experimental results show that over one half of the benchmarks benefit from the Flat indexing, such that compared with the WAM indexing scheme, the number of choice points is reduced by 15%. Moreover, the amount of failures which occur during the execution of indexing instructions is reduced by 35%.","PeriodicalId":226947,"journal":{"name":"Proceedings 1998 International Conference on Parallel and Distributed Systems (Cat. No.98TB100250)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"1998-12-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133831183","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Alignment and distribution is NOT (always) NP-hard 对齐和分配并不(总是)np困难
Vincent Boudet, F. Rastello, Y. Robert
An efficient algorithm to simultaneously implement array alignment and data/computation distribution is introduced and evaluated. We re-visit previous work of Li and Chen (J. Li and M. Chen, 1990; 1991), and we show that their alignment step should not be conducted without preserving the potential parallelism. In other words, the optimal alignment may well sequentialize computations, whatever the distribution afterwards. We provide an efficient algorithm that handles alignment and data/computation distribution simultaneously. The good news is that several important instances of the whole alignment/distribution problem have polynomial complexity, while alignment itself is NP-complete (J. Li and M. Chen, 1990).
介绍并评价了一种同时实现阵列对齐和数据/计算分配的有效算法。我们重新审视Li和Chen之前的工作(J. Li and M. Chen, 1990;1991),我们表明,如果不保留潜在的并行性,他们的对齐步骤不应该进行。换句话说,无论之后的分布如何,最佳对齐都可以很好地将计算顺序化。我们提供了一种高效的算法,可以同时处理对齐和数据/计算分布。好消息是,整个对齐/分布问题的几个重要实例具有多项式复杂度,而对齐本身是np完全的(J. Li和M. Chen, 1990)。
{"title":"Alignment and distribution is NOT (always) NP-hard","authors":"Vincent Boudet, F. Rastello, Y. Robert","doi":"10.1109/ICPADS.1998.741148","DOIUrl":"https://doi.org/10.1109/ICPADS.1998.741148","url":null,"abstract":"An efficient algorithm to simultaneously implement array alignment and data/computation distribution is introduced and evaluated. We re-visit previous work of Li and Chen (J. Li and M. Chen, 1990; 1991), and we show that their alignment step should not be conducted without preserving the potential parallelism. In other words, the optimal alignment may well sequentialize computations, whatever the distribution afterwards. We provide an efficient algorithm that handles alignment and data/computation distribution simultaneously. The good news is that several important instances of the whole alignment/distribution problem have polynomial complexity, while alignment itself is NP-complete (J. Li and M. Chen, 1990).","PeriodicalId":226947,"journal":{"name":"Proceedings 1998 International Conference on Parallel and Distributed Systems (Cat. No.98TB100250)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"1998-12-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134275073","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
Proposal and verification of a workflow coordination model for core business 提出并验证核心业务的工作流协调模型
N. Kataoka, K. Kuroda, Tsutomu Ohkawa, H. Koizumi, N. Shiratori
As corporations continue to introduce workflow systems, different systems have appeared for processing of core-business tasks for information processing and clerical work. A number of different models for coordination of different workflow systems have been proposed; but workflows for core-business tasks constitute the central business processes of the company, and certain aspects of coordination of such workflows differ from other kinds of coordination. This paper clarifies these differences, and proposes a model for coordination, which takes these differences into consideration. In addition the application of this model to an actual system and confirmation of its effectiveness are also reported.
随着企业不断引入工作流系统,出现了不同的系统来处理核心业务任务,如信息处理和文书工作。已经提出了许多不同的工作流系统协调模型;但是,核心业务任务的工作流构成了公司的中心业务流程,并且这种工作流的协调在某些方面不同于其他类型的协调。本文阐明了这些差异,并提出了一个考虑这些差异的协调模型。此外,还报道了该模型在实际系统中的应用,并验证了其有效性。
{"title":"Proposal and verification of a workflow coordination model for core business","authors":"N. Kataoka, K. Kuroda, Tsutomu Ohkawa, H. Koizumi, N. Shiratori","doi":"10.1109/ICPADS.1998.741161","DOIUrl":"https://doi.org/10.1109/ICPADS.1998.741161","url":null,"abstract":"As corporations continue to introduce workflow systems, different systems have appeared for processing of core-business tasks for information processing and clerical work. A number of different models for coordination of different workflow systems have been proposed; but workflows for core-business tasks constitute the central business processes of the company, and certain aspects of coordination of such workflows differ from other kinds of coordination. This paper clarifies these differences, and proposes a model for coordination, which takes these differences into consideration. In addition the application of this model to an actual system and confirmation of its effectiveness are also reported.","PeriodicalId":226947,"journal":{"name":"Proceedings 1998 International Conference on Parallel and Distributed Systems (Cat. No.98TB100250)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"1998-12-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134323554","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Design and performance evaluation of an adaptive cache coherence protocol 自适应缓存一致性协议的设计与性能评价
W.-K. Hong, Nam-Hee Kim, Shin-Dug Kim
In shared-memory multiprocessor systems, the local caches which are used to tolerate the performance gap between processor and memory cause additional bus transactions to maintain the coherency of shared data. Especially, coherency misses and data traffic due to spatial locality and false sharing have a significant effect on the system performance. In this approach, an adaptive cache coherence protocol based on the sectored cache is introduced. It determines the size of a block to be migrated or invalidated dynamically, depending on the transfer mode, so that it can exploit the spatial locality and reduce useless data traffic due to false sharing at the same time. This protocol is evaluated via event-driven simulation, and its results show a 58% decrease in the data traffic and a 45% decrease in the cache miss ratio. Thus, the adaptive cache coherence protocol provides about a 56% improvement in the execution time.
在共享内存多处理器系统中,用于容忍处理器和内存之间性能差距的本地缓存会导致额外的总线事务来保持共享数据的一致性。特别是空间局部性和虚假共享导致的相干缺失和数据流量对系统性能的影响较大。在该方法中,引入了一种基于扇区缓存的自适应缓存一致性协议。它根据传输方式动态确定要迁移或无效的块的大小,从而利用空间局域性,同时减少由于错误共享而导致的无用数据流量。该协议通过事件驱动仿真进行了评估,其结果显示数据流量减少了58%,缓存丢失率减少了45%。因此,自适应缓存一致性协议在执行时间上提供了大约56%的改进。
{"title":"Design and performance evaluation of an adaptive cache coherence protocol","authors":"W.-K. Hong, Nam-Hee Kim, Shin-Dug Kim","doi":"10.1109/ICPADS.1998.741017","DOIUrl":"https://doi.org/10.1109/ICPADS.1998.741017","url":null,"abstract":"In shared-memory multiprocessor systems, the local caches which are used to tolerate the performance gap between processor and memory cause additional bus transactions to maintain the coherency of shared data. Especially, coherency misses and data traffic due to spatial locality and false sharing have a significant effect on the system performance. In this approach, an adaptive cache coherence protocol based on the sectored cache is introduced. It determines the size of a block to be migrated or invalidated dynamically, depending on the transfer mode, so that it can exploit the spatial locality and reduce useless data traffic due to false sharing at the same time. This protocol is evaluated via event-driven simulation, and its results show a 58% decrease in the data traffic and a 45% decrease in the cache miss ratio. Thus, the adaptive cache coherence protocol provides about a 56% improvement in the execution time.","PeriodicalId":226947,"journal":{"name":"Proceedings 1998 International Conference on Parallel and Distributed Systems (Cat. No.98TB100250)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"1998-12-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134619812","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An x86 load/store unit with aggressive scheduling of load/store operations 一种具有主动调度加载/存储操作的x86加载/存储单元
Hui-Yue Hwang, R.-Ming Shiu, J. Shann
Because of register-memory instruction set architecture and limited register set, there are significant amounts of memory access instructions in x86 microprocessors. As the higher issue degree of superscalar microprocessor is provided, an aggressive scheduling policy of load/store operations becomes crucial. We examine the scheduling policies of loads/stores on x86 superscalar microprocessors and propose a new aggressive scheduling policy called load speculation, which allows loads to precede the previous unsolved pending stores. Simulation results show that the load speculation achieves the higher performance in comparison with the traditional scheduling policies such as load bypassing and load forwarding. Furthermore, by reducing the pipeline stages, the load speculation can achieve even higher performance.
由于寄存器-内存指令集体系结构和有限的寄存器集,x86微处理器中存在大量的内存访问指令。随着超标量微处理器问题程度的提高,积极的加载/存储操作调度策略变得至关重要。我们研究了x86超标量微处理器上负载/存储的调度策略,并提出了一种新的主动调度策略,称为负载推测,它允许负载先于先前未解决的未决存储。仿真结果表明,与负载绕过和负载转发等传统调度策略相比,负载推测策略具有更高的性能。此外,通过减少管道阶段,负载推测可以实现更高的性能。
{"title":"An x86 load/store unit with aggressive scheduling of load/store operations","authors":"Hui-Yue Hwang, R.-Ming Shiu, J. Shann","doi":"10.1109/ICPADS.1998.741123","DOIUrl":"https://doi.org/10.1109/ICPADS.1998.741123","url":null,"abstract":"Because of register-memory instruction set architecture and limited register set, there are significant amounts of memory access instructions in x86 microprocessors. As the higher issue degree of superscalar microprocessor is provided, an aggressive scheduling policy of load/store operations becomes crucial. We examine the scheduling policies of loads/stores on x86 superscalar microprocessors and propose a new aggressive scheduling policy called load speculation, which allows loads to precede the previous unsolved pending stores. Simulation results show that the load speculation achieves the higher performance in comparison with the traditional scheduling policies such as load bypassing and load forwarding. Furthermore, by reducing the pipeline stages, the load speculation can achieve even higher performance.","PeriodicalId":226947,"journal":{"name":"Proceedings 1998 International Conference on Parallel and Distributed Systems (Cat. No.98TB100250)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"1998-12-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124301250","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
期刊
Proceedings 1998 International Conference on Parallel and Distributed Systems (Cat. No.98TB100250)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1