首页 > 最新文献

Proceedings Sixth Heterogeneous Computing Workshop (HCW'97)最新文献

英文 中文
Supporting fault-tolerance in heterogeneous distributed applications 支持异构分布式应用程序中的容错
Pub Date : 1997-04-01 DOI: 10.1109/HCW.1997.581421
P. Maheshwari, J. Ouyang
Heterogeneous computing opens up new challenges and opportunities in fields such as parallel and distributed processing, design of algorithms for applications, scheduling of parallel tasks, interconnection network technology and support for reliable distributed heterogeneous computing. A trend of supporting fault-tolerance in distributed computing systems is to incorporate fault-tolerance into applications at low cost, in terms of both run time performance and programming effort required to construct reliable application software. We present an approach for developing efficient reliable distributed applications for heterogeneous computing systems. We propose a library prototype, called H-Libra, to support fault-tolerance in heterogeneous systems with low run-time cost. Fault-tolerance is based on distributed consistent checkpointing and rollback-recovery integrated with a user-level network communication protocol. By employing novel mechanisms, minimum communication overhead is involved for taking a consistent distributed checkpoint and catching messages in transit during a checkpoint. By providing fault-tolerance transparency and a simple, easy to use high-level message-passing interface, H-Libra simplifies the development of reliable heterogeneous distributed applications.
异构计算在并行和分布式处理、应用程序算法设计、并行任务调度、互连网络技术和可靠的分布式异构计算支持等领域提出了新的挑战和机遇。在分布式计算系统中支持容错的一个趋势是以低成本将容错集成到应用程序中,无论是在运行时性能方面还是在构建可靠的应用程序软件所需的编程工作方面。我们提出了一种为异构计算系统开发高效可靠的分布式应用程序的方法。我们提出了一个名为H-Libra的库原型,以低运行时成本支持异构系统中的容错。容错基于与用户级网络通信协议集成的分布式一致检查点和回滚恢复。通过采用新的机制,采用一致的分布式检查点并在检查点期间捕获传输中的消息所涉及的通信开销最小。通过提供容错透明度和简单易用的高级消息传递接口,H-Libra简化了可靠的异构分布式应用程序的开发。
{"title":"Supporting fault-tolerance in heterogeneous distributed applications","authors":"P. Maheshwari, J. Ouyang","doi":"10.1109/HCW.1997.581421","DOIUrl":"https://doi.org/10.1109/HCW.1997.581421","url":null,"abstract":"Heterogeneous computing opens up new challenges and opportunities in fields such as parallel and distributed processing, design of algorithms for applications, scheduling of parallel tasks, interconnection network technology and support for reliable distributed heterogeneous computing. A trend of supporting fault-tolerance in distributed computing systems is to incorporate fault-tolerance into applications at low cost, in terms of both run time performance and programming effort required to construct reliable application software. We present an approach for developing efficient reliable distributed applications for heterogeneous computing systems. We propose a library prototype, called H-Libra, to support fault-tolerance in heterogeneous systems with low run-time cost. Fault-tolerance is based on distributed consistent checkpointing and rollback-recovery integrated with a user-level network communication protocol. By employing novel mechanisms, minimum communication overhead is involved for taking a consistent distributed checkpoint and catching messages in transit during a checkpoint. By providing fault-tolerance transparency and a simple, easy to use high-level message-passing interface, H-Libra simplifies the development of reliable heterogeneous distributed applications.","PeriodicalId":286909,"journal":{"name":"Proceedings Sixth Heterogeneous Computing Workshop (HCW'97)","volume":"21 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116828782","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Stochastic Petri nets applied to the performance evaluation of static task allocations in heterogeneous computing environments 随机Petri网应用于异构计算环境下静态任务分配的性能评价
Pub Date : 1997-04-01 DOI: 10.1109/HCW.1997.581420
A. McSpadden, N. Lopez-Benitez
A stochastic Petri net (SPN) is systematically constructed from a task graph whose component subtasks are statically allocated onto the processor suite of a heterogeneous computing system (HCS). Given that subtask execution times are exponentially distributed an exponential distribution can be generated for the overall completion time. In particular the enabling functions and rate functions used to specify the SPN model provide needed versatility to integrate processor heterogeneity, task priorities, allocation schemes, communication costs, and other factors characteristic of a HCS into a comprehensive performance analysis. The manner in which these parameters are incorporated into the SPN allows the model to be transformed into a testbed for optimization schemes and heuristics. The proposed approach can be applied to arbitrary task graphs including non-series-parallel.
在异构计算系统(HCS)的处理器套件上,系统地构造了一个随机Petri网(SPN)。如果子任务执行时间呈指数分布,则可以生成总体完成时间的指数分布。特别是用于指定SPN模型的启用函数和速率函数提供了所需的多功能性,可以将处理器异构性、任务优先级、分配方案、通信成本和HCS特征的其他因素集成到综合性能分析中。将这些参数纳入SPN的方式允许将模型转换为优化方案和启发式的测试平台。该方法可应用于任意任务图,包括非串并联任务图。
{"title":"Stochastic Petri nets applied to the performance evaluation of static task allocations in heterogeneous computing environments","authors":"A. McSpadden, N. Lopez-Benitez","doi":"10.1109/HCW.1997.581420","DOIUrl":"https://doi.org/10.1109/HCW.1997.581420","url":null,"abstract":"A stochastic Petri net (SPN) is systematically constructed from a task graph whose component subtasks are statically allocated onto the processor suite of a heterogeneous computing system (HCS). Given that subtask execution times are exponentially distributed an exponential distribution can be generated for the overall completion time. In particular the enabling functions and rate functions used to specify the SPN model provide needed versatility to integrate processor heterogeneity, task priorities, allocation schemes, communication costs, and other factors characteristic of a HCS into a comprehensive performance analysis. The manner in which these parameters are incorporated into the SPN allows the model to be transformed into a testbed for optimization schemes and heuristics. The proposed approach can be applied to arbitrary task graphs including non-series-parallel.","PeriodicalId":286909,"journal":{"name":"Proceedings Sixth Heterogeneous Computing Workshop (HCW'97)","volume":"40 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127446764","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 14
Exploiting multiple heterogeneous networks to reduce communication costs in parallel programs 利用多个异构网络来降低并行程序中的通信成本
Pub Date : 1997-04-01 DOI: 10.1109/HCW.1997.581412
JunSeong Kim, D. Lilja
The different types of messages used by a parallel application program executing in a distributed system can each have unique characteristics so that no single communication network can produce the lowest latency for all messages. For instance, short control messages may be sent with the lowest overhead on one type of network, such as Ethernet, while bulk data transfers may be better suited to a different type of network, such as Fibre Channel or HiPPI. In this paper, we investigate how to exploit multiple heterogeneous communication networks that interconnect the same set of processing nodes by dynamically selecting the best (lowest latency) network for each message based on the message size. We also show how to aggregate these multiple parallel networks into a single virtual network to further reduce the latency and increase the available bandwidth. We test this multiplexing and aggregation on a cluster of SGI multiprocessors interconnected with both Fibre Channel and Ethernet. We find that multiplexing between Ethernet and Fibre Channel can substantially reduce communication overhead in a synthetic benchmark compared to using either network alone. Aggregating these two networks into a single virtual network can further reduce communication delays for applications with many large messages. The best choice of either multiplexing or aggregation depends on the mix of message sizes in application program and the relative overheads of the two networks.
在分布式系统中执行的并行应用程序程序所使用的不同类型的消息都可以具有独特的特征,因此没有任何一个通信网络可以为所有消息产生最低的延迟。例如,在一种类型的网络(如以太网)上,短控制消息可能以最低的开销发送,而批量数据传输可能更适合于不同类型的网络(如光纤通道或HiPPI)。在本文中,我们研究了如何利用多个异构通信网络,这些网络通过基于消息大小动态选择每个消息的最佳(最低延迟)网络来互连同一组处理节点。我们还展示了如何将这些多个并行网络聚合为单个虚拟网络,以进一步减少延迟并增加可用带宽。我们在通过光纤通道和以太网相互连接的SGI多处理器集群上测试了这种多路复用和聚合。我们发现,与单独使用任何一种网络相比,在以太网和光纤通道之间进行多路复用可以大大减少综合基准测试中的通信开销。将这两个网络聚合到一个虚拟网络中可以进一步减少具有许多大消息的应用程序的通信延迟。多路复用或聚合的最佳选择取决于应用程序中消息大小的混合以及两个网络的相对开销。
{"title":"Exploiting multiple heterogeneous networks to reduce communication costs in parallel programs","authors":"JunSeong Kim, D. Lilja","doi":"10.1109/HCW.1997.581412","DOIUrl":"https://doi.org/10.1109/HCW.1997.581412","url":null,"abstract":"The different types of messages used by a parallel application program executing in a distributed system can each have unique characteristics so that no single communication network can produce the lowest latency for all messages. For instance, short control messages may be sent with the lowest overhead on one type of network, such as Ethernet, while bulk data transfers may be better suited to a different type of network, such as Fibre Channel or HiPPI. In this paper, we investigate how to exploit multiple heterogeneous communication networks that interconnect the same set of processing nodes by dynamically selecting the best (lowest latency) network for each message based on the message size. We also show how to aggregate these multiple parallel networks into a single virtual network to further reduce the latency and increase the available bandwidth. We test this multiplexing and aggregation on a cluster of SGI multiprocessors interconnected with both Fibre Channel and Ethernet. We find that multiplexing between Ethernet and Fibre Channel can substantially reduce communication overhead in a synthetic benchmark compared to using either network alone. Aggregating these two networks into a single virtual network can further reduce communication delays for applications with many large messages. The best choice of either multiplexing or aggregation depends on the mix of message sizes in application program and the relative overheads of the two networks.","PeriodicalId":286909,"journal":{"name":"Proceedings Sixth Heterogeneous Computing Workshop (HCW'97)","volume":"70 6 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129639766","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 14
On-line use of off-line derived mappings for iterative automatic target recognition tasks and a particular class of hardware platforms 在线使用离线派生映射迭代自动目标识别任务和特定类别的硬件平台
Pub Date : 1997-04-01 DOI: 10.1109/HCW.1997.581413
J. Budenske, R. Ramanujan, H. Siegel
Heterogeneous computing covers a great variety of situations. This study focuses on a particular application domain (iterative automatic target recognition tasks) and an associated specific class of dedicated heterogeneous hardware platforms. The contribution of this paper is that, for the computational environment considered, it presents a methodology for real-time on-line input-data dependent remappings of the application subtasks to the processors in the heterogeneous hardware platform using previously stored off-line statically determined mappings. That is, the operating system will be able to decide during the execution of the application whether or not to perform a remapping based on information generated by the application from its input data. If the decision is to remap, the operating system will be able to select a previously derived and stored mapping that is appropriate for the given state of the application (e.g., the number of objects it is currently tracking).
异构计算涵盖了各种各样的情况。本研究的重点是一个特定的应用领域(迭代自动目标识别任务)和相关的特定类别的专用异构硬件平台。本文的贡献在于,对于所考虑的计算环境,它提出了一种方法,可以使用以前存储的离线静态确定映射,将应用程序子任务实时在线地依赖于输入数据重新映射到异构硬件平台中的处理器。也就是说,操作系统将能够在应用程序执行期间决定是否根据应用程序从其输入数据生成的信息执行重新映射。如果决定重新映射,操作系统将能够选择先前派生和存储的映射,该映射适合于应用程序的给定状态(例如,它当前正在跟踪的对象数量)。
{"title":"On-line use of off-line derived mappings for iterative automatic target recognition tasks and a particular class of hardware platforms","authors":"J. Budenske, R. Ramanujan, H. Siegel","doi":"10.1109/HCW.1997.581413","DOIUrl":"https://doi.org/10.1109/HCW.1997.581413","url":null,"abstract":"Heterogeneous computing covers a great variety of situations. This study focuses on a particular application domain (iterative automatic target recognition tasks) and an associated specific class of dedicated heterogeneous hardware platforms. The contribution of this paper is that, for the computational environment considered, it presents a methodology for real-time on-line input-data dependent remappings of the application subtasks to the processors in the heterogeneous hardware platform using previously stored off-line statically determined mappings. That is, the operating system will be able to decide during the execution of the application whether or not to perform a remapping based on information generated by the application from its input data. If the decision is to remap, the operating system will be able to select a previously derived and stored mapping that is appropriate for the given state of the application (e.g., the number of objects it is currently tracking).","PeriodicalId":286909,"journal":{"name":"Proceedings Sixth Heterogeneous Computing Workshop (HCW'97)","volume":"18 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126545938","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 10
Dynamic load balancing of distributed SPMD computations with explicit message-passing 带有显式消息传递的分布式SPMD计算的动态负载平衡
Pub Date : 1997-04-01 DOI: 10.1109/HCW.1997.581406
M. Cermele, M. Colajanni, G. Necci
Distributed systems have the potentiality of becoming an alternative platform for parallel computations. However, there are still many obstacles to overcome, one of the most serious is that distributed systems typically consist of shared heterogeneous components with highly variable computational power. We present a load balancing support that checks the load status and, if necessary, adapts the workload to dynamic platform conditions through data migrations from overloaded to underloaded nodes. Unlike task migration supports for task parallelism and other data migration frameworks for master/slave-based parallel applications, our support works for the entire class of SPMD regular applications with explicit communications such as linear algebra problems, partial differential equation solvers, image processing algorithms. Although we considered several variants (three activation mechanisms, three load monitoring techniques and four decision policies), we implemented only the protocols that guarantee program consistency. The efficiency of the strategies is tested in the instance of two SPMD algorithms that are based on the PVM library enriched by special-purpose primitives for data management. As additional contribution, our research keeps the entire support for dynamic load balancing transparent to the programmer. The only visible interface of our support is the activation phase.
分布式系统有可能成为并行计算的替代平台。然而,仍然有许多障碍需要克服,其中最严重的是分布式系统通常由具有高度可变计算能力的共享异构组件组成。我们提供了一种负载平衡支持,可以检查负载状态,并在必要时通过从负载过重的节点到负载不足的节点的数据迁移,使工作负载适应动态平台条件。与支持任务并行的任务迁移和其他基于主/从的并行应用程序的数据迁移框架不同,我们的支持适用于具有显式通信的整个SPMD常规应用程序类,如线性代数问题、偏微分方程求解器、图像处理算法。尽管我们考虑了几种变体(三种激活机制、三种负载监视技术和四种决策策略),但我们只实现了保证程序一致性的协议。在两个基于PVM库的SPMD算法的实例中测试了这些策略的效率,这些PVM库由用于数据管理的专用原语充实。作为额外的贡献,我们的研究使动态负载平衡的整个支持对程序员透明。我们支持的唯一可见界面是激活阶段。
{"title":"Dynamic load balancing of distributed SPMD computations with explicit message-passing","authors":"M. Cermele, M. Colajanni, G. Necci","doi":"10.1109/HCW.1997.581406","DOIUrl":"https://doi.org/10.1109/HCW.1997.581406","url":null,"abstract":"Distributed systems have the potentiality of becoming an alternative platform for parallel computations. However, there are still many obstacles to overcome, one of the most serious is that distributed systems typically consist of shared heterogeneous components with highly variable computational power. We present a load balancing support that checks the load status and, if necessary, adapts the workload to dynamic platform conditions through data migrations from overloaded to underloaded nodes. Unlike task migration supports for task parallelism and other data migration frameworks for master/slave-based parallel applications, our support works for the entire class of SPMD regular applications with explicit communications such as linear algebra problems, partial differential equation solvers, image processing algorithms. Although we considered several variants (three activation mechanisms, three load monitoring techniques and four decision policies), we implemented only the protocols that guarantee program consistency. The efficiency of the strategies is tested in the instance of two SPMD algorithms that are based on the PVM library enriched by special-purpose primitives for data management. As additional contribution, our research keeps the entire support for dynamic load balancing transparent to the programmer. The only visible interface of our support is the activation phase.","PeriodicalId":286909,"journal":{"name":"Proceedings Sixth Heterogeneous Computing Workshop (HCW'97)","volume":"1212 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116484299","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 22
A performance and portability study of parallel applications using a distributed computing testbed 使用分布式计算测试平台的并行应用程序性能和可移植性研究
Pub Date : 1997-04-01 DOI: 10.1109/HCW.1997.581423
V. Morariu, Mathew Cunningham, Mark Letterman
A case study was conducted to examine the performance and portability of parallel applications, with an emphasis on data transfer among the processors in heterogeneous environments. Several parallel test programs using MPICH, a message passing interface (MPI) library, and the Linda parallel environment were developed to analyze communication performance and portability. These programs implement loosely and tightly synchronized communication models in which each processor exchanges data with two other processors. This data-exchange pattern mimics communication in certain parallel applications using striped partitioning of the computational domain. Tests were performed on an isolated, distributed computing testbed, a live development network and a symmetrical multiprocessing computer system. All network configurations used asynchronous transfer mode (ATM) network technologies. The testbed used in the study was a heterogeneous network consisting of various workstations and networking equipment. This paper presents an analysis of the results and recommendations for designing and implementing course-grained, parallel, scientific applications.
通过一个案例研究来检查并行应用程序的性能和可移植性,重点是异构环境中处理器之间的数据传输。利用MPICH、消息传递接口(MPI)库和Linda并行环境开发了多个并行测试程序,分析了通信性能和可移植性。这些程序实现了松散和紧密同步的通信模型,其中每个处理器与另外两个处理器交换数据。这种数据交换模式使用计算域的条纹划分来模拟某些并行应用程序中的通信。测试在一个隔离的分布式计算试验台、一个实时开发网络和一个对称的多处理计算机系统上进行。所有网络配置都使用了异步传输模式(ATM)网络技术。研究中使用的测试平台是一个由各种工作站和网络设备组成的异构网络。本文对结果进行了分析,并提出了设计和实现细粒度、并行、科学应用程序的建议。
{"title":"A performance and portability study of parallel applications using a distributed computing testbed","authors":"V. Morariu, Mathew Cunningham, Mark Letterman","doi":"10.1109/HCW.1997.581423","DOIUrl":"https://doi.org/10.1109/HCW.1997.581423","url":null,"abstract":"A case study was conducted to examine the performance and portability of parallel applications, with an emphasis on data transfer among the processors in heterogeneous environments. Several parallel test programs using MPICH, a message passing interface (MPI) library, and the Linda parallel environment were developed to analyze communication performance and portability. These programs implement loosely and tightly synchronized communication models in which each processor exchanges data with two other processors. This data-exchange pattern mimics communication in certain parallel applications using striped partitioning of the computational domain. Tests were performed on an isolated, distributed computing testbed, a live development network and a symmetrical multiprocessing computer system. All network configurations used asynchronous transfer mode (ATM) network technologies. The testbed used in the study was a heterogeneous network consisting of various workstations and networking equipment. This paper presents an analysis of the results and recommendations for designing and implementing course-grained, parallel, scientific applications.","PeriodicalId":286909,"journal":{"name":"Proceedings Sixth Heterogeneous Computing Workshop (HCW'97)","volume":"23 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115361460","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Practical issues in heterogeneous processing systems for military applications 军事应用异构处理系统的实际问题
Pub Date : 1997-04-01 DOI: 10.1109/HCW.1997.581418
G. Ladd
Heterogeneous parallel processing systems have been extensively used in embedded military applications due to their advantages in size, weight, power and hardware cost. This paper reviews the evolution of some of these systems and discusses design factors and tradeoffs which affect their application. As military systems have become more cost sensitive, and initial development more common than long term production, the use of commercial hardware and software has become more common. The rapid advances of computer technology seem likely to accelerate that trend in the future.
异构并行处理系统由于其在体积、重量、功耗和硬件成本方面的优势,在嵌入式军事应用中得到了广泛的应用。本文回顾了其中一些系统的发展,并讨论了影响其应用的设计因素和权衡。由于军事系统变得对成本更加敏感,并且初始开发比长期生产更常见,商用硬件和软件的使用变得更加普遍。计算机技术的飞速发展似乎会在未来加速这一趋势。
{"title":"Practical issues in heterogeneous processing systems for military applications","authors":"G. Ladd","doi":"10.1109/HCW.1997.581418","DOIUrl":"https://doi.org/10.1109/HCW.1997.581418","url":null,"abstract":"Heterogeneous parallel processing systems have been extensively used in embedded military applications due to their advantages in size, weight, power and hardware cost. This paper reviews the evolution of some of these systems and discusses design factors and tradeoffs which affect their application. As military systems have become more cost sensitive, and initial development more common than long term production, the use of commercial hardware and software has become more common. The rapid advances of computer technology seem likely to accelerate that trend in the future.","PeriodicalId":286909,"journal":{"name":"Proceedings Sixth Heterogeneous Computing Workshop (HCW'97)","volume":"221 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115654604","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Optimal task assignment in heterogeneous computing systems 异构计算系统中的最优任务分配
Pub Date : 1997-04-01 DOI: 10.1109/HCW.1997.581416
Muhammad Kafil, I. Ahmad
Distributed systems comprising networked heterogeneous workstations are now considered to be a viable choice for high-performance computing. For achieving a fast response time from such systems, an efficient assignment of the application tasks to the processors is imperative. The general assignment problem is known to be NP-hard, except in a few special cases with strict assumptions. While a large number of heuristic techniques have been suggested in the literature that can yield sub-optimal solutions in a reasonable amount of time, we aim to develop techniques for optimal solutions under relaxed assumptions. The basis of our research is a best-first search technique known as the A* algorithm from the area of artificial intelligence. The original search technique guarantees an optimal solution but is not feasible for problems of practically large sizes due to its high time and space complexity. We propose a number of algorithms based around the A* technique. The proposed algorithms also yield optimal solutions but are considerably faster. The first algorithm solves the assignment problem by using parallel processing. Parallelizing the assignment algorithm is a natural way to lower the time complexity, and we believe our algorithm to be novel in this regard. The second algorithm is based on a clustering based pre-processing technique that merges the high affinity tasks. Clustering reduces the problem size, which in turn reduces the state-space for the assignment algorithm. We also propose three heuristics which do not guarantee optimal solutions but provide near-optimal solutions and are considerably faster. By using our parallel formulation, the proposed clustering technique and the heuristics can also be parallelized to further improve their time complexity.
由网络异构工作站组成的分布式系统现在被认为是高性能计算的可行选择。为了从这样的系统获得快速的响应时间,必须将应用程序任务有效地分配给处理器。一般的分配问题是np困难的,除非在一些特殊情况下有严格的假设。虽然文献中已经提出了大量的启发式技术,可以在合理的时间内产生次优解,但我们的目标是开发在宽松假设下的最优解技术。我们研究的基础是一种最佳优先搜索技术,即人工智能领域的a *算法。原有的搜索技术保证了最优解,但由于时间和空间复杂度高,对于实际规模较大的问题不可行。我们提出了一些基于a *技术的算法。所提出的算法也产生最优解,但速度要快得多。第一种算法采用并行处理的方法解决分配问题。并行化分配算法是降低时间复杂度的一种自然方式,我们相信我们的算法在这方面是新颖的。第二种算法基于基于聚类的预处理技术,该技术将高亲和性任务合并在一起。聚类减少了问题的大小,从而减少了分配算法的状态空间。我们还提出了三种启发式算法,它们不保证最优解,但提供接近最优解,而且速度快得多。通过我们的并行公式,聚类技术和启发式算法也可以并行化,以进一步提高它们的时间复杂度。
{"title":"Optimal task assignment in heterogeneous computing systems","authors":"Muhammad Kafil, I. Ahmad","doi":"10.1109/HCW.1997.581416","DOIUrl":"https://doi.org/10.1109/HCW.1997.581416","url":null,"abstract":"Distributed systems comprising networked heterogeneous workstations are now considered to be a viable choice for high-performance computing. For achieving a fast response time from such systems, an efficient assignment of the application tasks to the processors is imperative. The general assignment problem is known to be NP-hard, except in a few special cases with strict assumptions. While a large number of heuristic techniques have been suggested in the literature that can yield sub-optimal solutions in a reasonable amount of time, we aim to develop techniques for optimal solutions under relaxed assumptions. The basis of our research is a best-first search technique known as the A* algorithm from the area of artificial intelligence. The original search technique guarantees an optimal solution but is not feasible for problems of practically large sizes due to its high time and space complexity. We propose a number of algorithms based around the A* technique. The proposed algorithms also yield optimal solutions but are considerably faster. The first algorithm solves the assignment problem by using parallel processing. Parallelizing the assignment algorithm is a natural way to lower the time complexity, and we believe our algorithm to be novel in this regard. The second algorithm is based on a clustering based pre-processing technique that merges the high affinity tasks. Clustering reduces the problem size, which in turn reduces the state-space for the assignment algorithm. We also propose three heuristics which do not guarantee optimal solutions but provide near-optimal solutions and are considerably faster. By using our parallel formulation, the proposed clustering technique and the heuristics can also be parallelized to further improve their time complexity.","PeriodicalId":286909,"journal":{"name":"Proceedings Sixth Heterogeneous Computing Workshop (HCW'97)","volume":"148 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131995755","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 43
期刊
Proceedings Sixth Heterogeneous Computing Workshop (HCW'97)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1