Proceedings International Conference on Parallel Processing: Latest Articles

An efficient fault-tolerant scheduling algorithm for real-time tasks with precedence constraints in heterogeneous systems
Pub Date : 2002-08-18 DOI: 10.1109/ICPP.2002.1040892
X. Qin, Hong Jiang, D. Swanson
In this paper, we investigate an efficient off-line scheduling algorithm in which real-time tasks with precedence constraints are executed in a heterogeneous environment. It provides more features and capabilities than existing algorithms that schedule only independent tasks in real-time homogeneous systems. In addition, the proposed algorithm takes the heterogeneities of computation, communication and reliability into account, thereby improving the reliability. To provide fault-tolerant capability, the algorithm employs a primary-backup copy scheme that enables the system to tolerate permanent failures in any single processor. In this scheme, a backup copy is allowed to overlap with other backup copies on the same processor, as long as their corresponding primary copies are allocated to different processors. Tasks are judiciously allocated to processors so as to reduce the schedule length as well as the reliability cost, defined to be the product of processor failure rate and task execution time. In addition, the time for detecting and handling a permanent fault is incorporated into the scheduling scheme, thus making the algorithm more practical. To quantify the combined performance of fault-tolerance and schedulability, the performability measure is introduced. Compared with the existing scheduling algorithms in the literature, our scheduling algorithm achieves an average of 16.4% improvement in reliability and an average of 49.3% improvement in performability.
Citations: 103
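The two rules stated in the abstract above (reliability cost as failure rate times execution time, and the condition under which backup copies may overlap) can be sketched as follows. This is a minimal illustration, not the authors' algorithm: the `Copy` record, its field names, and the helper functions are hypothetical and were chosen only for this example. The overlap rule relies on the single-processor failure assumption from the abstract: if the two primaries sit on different processors, at most one of them can fail, so at most one of the overlapping backups will ever have to run.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Copy:
    """One scheduled copy (primary or backup) of a task. Hypothetical record."""
    task: str
    processor: int  # processor the copy is assigned to
    start: float    # scheduled start time
    finish: float   # scheduled finish time


def reliability_cost(failure_rate: float, execution_time: float) -> float:
    """Reliability cost as defined in the abstract:
    processor failure rate multiplied by task execution time."""
    return failure_rate * execution_time


def intervals_overlap(a: Copy, b: Copy) -> bool:
    """True when the scheduled time intervals of the two copies intersect."""
    return a.start < b.finish and b.start < a.finish


def backups_may_coexist(backup_a: Copy, primary_a: Copy,
                        backup_b: Copy, primary_b: Copy) -> bool:
    """Check the overlap rule for two backup copies.

    Backups on different processors, or backups that do not intersect in
    time, never conflict. Backups that overlap on the same processor are
    allowed only if their primaries are on different processors.
    """
    if backup_a.processor != backup_b.processor:
        return True
    if not intervals_overlap(backup_a, backup_b):
        return True
    return primary_a.processor != primary_b.processor
```

A scheduler built on these helpers would call `backups_may_coexist` before placing a backup into a slot that is already partly occupied by another backup, and would sum `reliability_cost` over all assignments when comparing candidate schedules.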
Neuron-a wide-area service discovery infrastructure
Pub Date : 2002-08-18 DOI: 10.1109/ICPP.2002.1040902
Hung-Chang Hsiao, C. King
A wide-area service discovery infrastructure provides a repository in which services over a wide area can register themselves and clients everywhere can inquire about them. We discuss how to build such an infrastructure based on the peer-to-peer model. The proposed system, called Neuron, can be executed on top of a set of federated nodes across the global network and aggregate their resources to provide the discovery service. Neuron is self-organizing, self-tuning, and capable of tolerating failures of nodes and communication links. In addition, it allows the services to be described with arbitrary forms and the system load to be distributed evenly to the nodes. Neuron also supports event notification. We evaluated Neuron via simulation. The preliminary results show that service registration, discovery and service state advertising take at most O(log N) hops to complete.
Citations: 9
WebGOP: A framework for architecting and programming dynamic distributed Web applications
Pub Date : 2002-08-18 DOI: 10.1109/ICPP.2002.1040882
Xiaoxing Ma, A. Chan, Jian Lu
This paper presents a novel approach, called WebGOP, for architecture modeling and programming of web-based distributed applications. WebGOP uses the graph-oriented programming (GOP) model, under which the components of a distributed program are configured as a logical graph and implemented using a set of operations defined over the graph. WebGOP extends the application of GOP to the World Wide Web environment and provides more powerful architectural support. In WebGOP, the architecture graph is reified as an explicit object which is itself distributed over the network, providing a graph-oriented context for the execution of distributed applications. The programmer can specialize the type of a graph to represent a particular architecture style tailored for an application. WebGOP also has built-in support for flexible and dynamic architectures, including dynamic reconfiguration. We describe the WebGOP framework, a prototype implementation of the framework on top of SOAP, and a performance evaluation of the prototype. Results of the performance evaluation showed that the overhead introduced by WebGOP over SOAP is reasonable and acceptable.
Citations: 7
Optimal code size reduction for software-pipelined loops on DSP applications
Pub Date : 2002-08-18 DOI: 10.1109/ICPP.2002.1040919
Qingfeng Zhuge, Z. Shao, E. Sha
Code size expansion of software-pipelined loops is a critical problem for DSP systems with strict code size constraints. Some ad hoc code size reduction techniques have been used to try to reduce the prologue/epilogue produced by software pipelining. We present a fundamental understanding of the relationship between code size expansion and software pipelining. Based on the retiming concept, we present a powerful Code-size REDuction (CRED) technique and its application on various kinds of processors. We also provide CRED algorithms integrated with the software pipelining process. One advantage of our algorithms is that they can explore the trade-off space between "perfect" software pipelining and constrained code size. That is, the software pipelining process can be controlled to generate a schedule that takes the code size requirement into account. Experimental results show the effectiveness of our algorithms in both reducing the code size for software-pipelined loops and exploring the code size/performance trade-off space.
Citations: 13
Dynamic hybrid routing (DHR) in mobile ad hoc networks
Pub Date : 2002-08-18 DOI: 10.1109/ICPP.2002.1040896
Seungjin Park, B. V. Voorst
The route discovery and maintenance processes in wireless mobile networks are very expensive tasks due to the mobility of the host. Route discovery requires a considerable amount of resources and therefore it is wise to utilize the effort already invested in existing paths. This paper proposes a dynamic hybrid routing (DHR) protocol in ad hoc networks, which constructs paths only upon demand by taking attributes from both proactive and reactive algorithms. The goal of DHR is to re-use, whenever possible, portions of several existing paths when establishing a new path. The reusability is accomplished by using dynamic proactive zones (PZs), through which nearby existing path information is disseminated. By utilizing the information stored in PZs, considerable savings (in time and traffic) can be achieved over other on-demand routing algorithms that use flooding. In other route-finding algorithms, proactive zones are formed throughout the network and remain unchanged, whereas in DHR, routes are created and destroyed dynamically around the existing paths. Even though DHR may not find the shortest path between source and destination, it does reduce the amount of traffic needed to find a path and therefore increases the available bandwidth for data transfer.
Citations: 5
Hardware schemes for early register release
Pub Date : 2002-08-18 DOI: 10.1109/ICPP.2002.1040854
T. Monreal, V. Viñals, Antonio González, M. Valero
Register files are becoming one of the critical components of current out-of-order processors in terms of delay and power consumption, since these processors' potential to exploit instruction-level parallelism is closely tied to the size and number of ports of the register file. In conventional register renaming schemes, register releasing is conservatively done only after the instruction that redefines the same register is committed. Instead, we propose a scheme that releases registers as soon as the processor knows that there will be no further use of them. We present two early releasing hardware implementations with different performance/complexity trade-offs. Detailed cycle-level simulations show either a significant speedup for a given register file size, or a reduction in register file size for a given performance level.
Citations: 48
Computational geometry on the OTIS-Mesh optoelectronic computer
Pub Date : 2002-08-18 DOI: 10.1109/ICPP.2002.1040907
Chih-Fang Wang, S. Sahni
We develop efficient algorithms for problems in computational geometry (convex hull, smallest enclosing box, ECDF, two-set dominance, maximal points, all-nearest neighbor, and closest pair) on the OTIS-Mesh optoelectronic computer. We also demonstrate the algorithms for computing convex hull and prefix sum with condition on a multi-dimensional mesh, which are used to compute convex hull and ECDF respectively. We show that all these problems can be solved in O(√N) time even with N² inputs.
Citations: 8
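To make one of the problems named in the abstract above concrete, here is a sequential reference for maximal points in the plane. It is not the OTIS-Mesh parallel algorithm from the paper; it is a standard sort-and-sweep sketch, useful mainly as a correctness baseline. The function name and the dominance convention (a point is dominated when another, different point is at least as large in both coordinates) are assumptions made for this example.

```python
from typing import List, Tuple

Point = Tuple[float, float]


def maximal_points(points: List[Point]) -> List[Point]:
    """Return the points that no other point dominates.

    A point q dominates p when q != p, q.x >= p.x and q.y >= p.y.
    Sorting by x (then y) in descending order means every point that could
    dominate the current one has already been visited, so it is enough to
    compare against the largest y seen so far.
    """
    result: List[Point] = []
    best_y = float("-inf")
    for x, y in sorted(points, reverse=True):
        if y > best_y:  # strictly higher than every point to its right
            result.append((x, y))
            best_y = y
    return result
```

The sweep runs in O(n log n) time because of the sort. Exact duplicates are reported once, since the second copy is not strictly higher than the first.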
Region synchronization in message passing systems
Pub Date : 2002-08-18 DOI: 10.1109/ICPP.2002.1040883
Gurdip Singh, Ye Su
The development of correct synchronization code for distributed programs is a challenging task. In this paper, we propose an aspect-oriented technique for developing synchronization code for message passing systems. Our approach is to factor out synchronization as a separate aspect, synthesize synchronization code and then compose it with the functional code. Specifically, we allow the designer of an application to first design the functional code. The designer can then annotate the functional code with regions and specify a high-level "global invariant" that expresses the synchronization policy. A synchronization policy essentially gives the occupancy rules for the various regions. The solution to this problem, which we term the region synchronization problem, involves deriving a set of rules for entering and exiting each region. We provide a systematic translation of the invariant into a message passing algorithm for a point-to-point message passing system. We show that many existing synchronization problems can be specified as instances of the region synchronization problem. Hence, our algorithms can be used to solve a large class of synchronization problems.
Citations: 4
Enhancing write I/O performance of disk array RM2 tolerating double disk failures
Pub Date : 2002-08-18 DOI: 10.1109/ICPP.2002.1040876
Y. Nam, D. Kim, Tae-Young Choe, Chan-Ik Park
With a large number of internal disks and the rapid growth of disk capacity, storage systems become more susceptible to double disk failures. Thus, the need for such reliable storage systems as RAID6 is expected to gain in importance. However, RAID6 architectures such as RM2, P+Q, EVEN-ODD, and DATUM traditionally suffer from low write I/O performance caused by updating two distinct pieces of parity data associated with user data. To overcome this low write I/O performance, we propose an enhanced RM2 architecture which combines RM2, one of the well-known RAID6 architectures, with a Lazy Parity Update (LPU) technique. Extensive performance evaluations reveal that the write I/O performance of the proposed architecture is about two times higher than that of RM2 under various I/O workloads, with little degradation in reliability.
Citations: 10
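The abstract above does not describe how Lazy Parity Update works internally, so the following is only a generic sketch of deferring parity work, under loud assumptions: it uses a single XOR parity block per stripe (not RM2's two-parity layout), and the `LazyParityStripe` class and its methods are hypothetical names invented for this example. It shows the basic trade-off the abstract hints at: writes get cheaper because parity is not recomputed on every update, but the stripe is unprotected until the deferred update is flushed.

```python
from functools import reduce
from typing import List


def xor_blocks(blocks: List[bytes]) -> bytes:
    """Bytewise XOR of equally sized blocks."""
    return bytes(reduce(lambda x, y: x ^ y, column) for column in zip(*blocks))


class LazyParityStripe:
    """One stripe with a single XOR parity block and deferred parity updates."""

    def __init__(self, data_blocks: List[bytes]) -> None:
        self.data = [bytes(b) for b in data_blocks]
        self.parity = xor_blocks(self.data)
        self.dirty = False  # True while parity lags behind the data

    def write(self, index: int, new_block: bytes) -> None:
        """Update a data block without touching parity."""
        self.data[index] = bytes(new_block)
        self.dirty = True

    def flush(self) -> None:
        """Bring parity up to date, for example during idle time."""
        if self.dirty:
            self.parity = xor_blocks(self.data)
            self.dirty = False

    def recover(self, lost_index: int) -> bytes:
        """Rebuild one lost data block from the survivors and the parity."""
        if self.dirty:
            raise RuntimeError("parity is stale; flush() before recovery")
        survivors = [b for i, b in enumerate(self.data) if i != lost_index]
        return xor_blocks(survivors + [self.parity])
```

In this sketch several writes to the same stripe cost one parity computation at `flush()` time instead of one per write, which is where a deferred scheme can gain write throughput.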
Efficient global object space support for distributed JVM on cluster
Pub Date : 2002-08-18 DOI: 10.1109/ICPP.2002.1040893
W. Fang, Cho-Li Wang, F. Lau
We present the design of a global object space in a distributed Java Virtual Machine that supports parallel execution of a multi-threaded Java program on a cluster of computers. The global object space virtualizes a single Java object heap across machine boundaries to facilitate transparent object accesses. Based on the object connectivity information that is available at runtime, objects reachable from threads at different nodes, called distributed-shared objects, are detected. With the detection of distributed-shared objects, we can alleviate overheads in maintaining memory consistency within the global object space. Several runtime optimization methods have been incorporated in the global object space design, including an object home migration method that reallocates the home of a distributed-shared object, synchronized method migration that allows the remote execution of a synchronized method at the home node of its synchronized object, and object pushing that uses the object connectivity information to improve access locality.
Citations: 19