
2009 IEEE International Symposium on Parallel and Distributed Processing with Applications: Latest Publications

Semi-automatic Transformation of Sequential Code to Distributed Code Using Model Driven Architecture Approach
S. Karimi, Saeed Parsa
In this paper, a Model Driven Architecture (MDA) approach is applied to semi-automatically translate sequential programs into corresponding distributed code. The novelty of our work is the use of MDA in the process of translating serial code into distributed code. The transformation comprises automatic generation of a platform-independent model (PIM) and then a platform-specific model (PSM) from the sequential code. To generate the PIM, a meta-model defining the overall architecture of the resulting distributed code is developed; this meta-model serves as the basis for developing PIMs of the resulting distributed code. A set of transformation rules is defined to transform the resulting PIM into a corresponding PSM. These rules can be modified by the user, depending on the details of the underlying middleware used for distribution. The PIM provides a better understanding of the distributed code and helps the programmer modify the code more easily.
DOI: https://doi.org/10.1109/ISPA.2009.71 (published 2009-08-18)
Citations: 1
Efficient and Lightweight Data Integrity Check in In-Networking Storage Wireless Sensor Networks
Wei Ren, Yi Ren, Hui Zhang
In in-networking storage Wireless Sensor Networks, sensed data are stored locally for the long term and retrieved on demand rather than in real time. To maximize data survival, the sensed data are normally stored in a distributed manner at multiple nearby nodes. This raises the problem of how to check and guarantee the integrity of distributed data storage under resource constraints. In this paper, a technique called Two Granularity Linear Code (TGLC), which consists of Intra-codes and Inter-codes, is presented, and an efficient and lightweight data integrity check scheme based on TGLC is proposed. Data integrity can be checked by anyone who holds the short Inter-codes, and the checking credentials are short Intra-codes that are dynamically generated. The proposed scheme is efficient and lightweight, with low storage and communication overhead, while checking validity is maintained. Our conclusion is justified by extensive analysis.
DOI: https://doi.org/10.1109/ISPA.2009.103 (published 2009-08-18)
Citations: 1
The Conditional Diagnosability of Twisted Cubes under the Comparison Model
Shuming Zhou
In evaluating the fault tolerance of a network structure, it is essential to estimate the order of a maximal connected component of the network when faulty vertices may break its connectedness, and it is crucial to locate and replace the faulty processors to maintain the system's high reliability. Fault diagnosis is the process of identifying faulty processors in a system through testing. Conditional diagnosis requires that, for each processor v in a system, the processors directly connected to v do not all fail at the same time. In this paper, the conditional diagnosability of the twisted cube TQn under the comparison diagnosis model is shown to be 3n-5 when n > 6. Hence the conditional diagnosability of TQn is about three times its classical diagnosability.
DOI: https://doi.org/10.1080/00207160903477175 (published 2009-08-18)
Citations: 27
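The arithmetic behind the twisted-cube abstract's "three times" claim can be checked with a short sketch. The value 3n-5 and the condition n > 6 come from the abstract; the function names are hypothetical, and the classical diagnosability is assumed here to equal n, which is what the abstract's comparison implies but does not state explicitly.

```python
def conditional_diagnosability_tq(n: int) -> int:
    """Value 3n - 5 quoted in the abstract, which states it for n > 6."""
    if n <= 6:
        raise ValueError("the abstract states the result only for n > 6")
    return 3 * n - 5


def classical_diagnosability_tq(n: int) -> int:
    # Assumption: the classical diagnosability of TQn is n, as implied by
    # the abstract's "three times" comparison.
    return n


for n in (7, 8, 16, 64):
    conditional = conditional_diagnosability_tq(n)
    classical = classical_diagnosability_tq(n)
    # The ratio is 3 - 5/n, so it approaches 3 from below as n grows.
    print(n, conditional, classical, round(conditional / classical, 2))
```

Under that assumption the ratio is 3 - 5/n, so "three times" is an asymptotic statement rather than an exact one for small n.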
An Efficient Algorithm for Multimedia Delivery in Pervasive Space
S. Zhang, Zhuzhong Qian, M. Guo, Sanglu Lu
Service composition is an effective approach to multimedia delivery in pervasive environments. Previous work assumes one fixed functional path composed of several underlying services in a certain order. In practice, several functional paths can deliver multimedia at different quality levels from the source to the end user. Because pervasive space is dynamic and mobile, the system should generate a reliable, low-delay service path for multimedia delivery in real time. Since some multimedia service components change the data transmission volume, which strongly affects transmission delay, the media delivery problem becomes equivalent to the Multi-Constrained Path problem, which is known to be NP-complete. We propose an efficient algorithm, LD/RPath (Lowest Delay/Reliability Path), for adaptive multimedia delivery. LD/RPath generates a low-delay service path from several functional paths with a reliability guarantee. Experimental results show that LD/RPath performs well and is an effective algorithm for multimedia delivery in pervasive space.
DOI: https://doi.org/10.1109/ISPA.2009.34 (published 2009-08-18)
Citations: 1
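The problem described in the multimedia delivery abstract (lowest delay subject to a reliability bound, where a service may change the data volume) can be illustrated with a small exhaustive search. This is not the LD/RPath algorithm, whose details the abstract does not give; it is a brute-force sketch on a hypothetical service graph, and every node name, delay, reliability and volume factor below is invented for illustration.

```python
from typing import Dict, List, Optional, Tuple

# Hypothetical service graph:
# node -> list of (next_node, delay_per_unit_volume, reliability, volume_factor)
Graph = Dict[str, List[Tuple[str, float, float, float]]]


def best_path(graph: Graph, source: str, sink: str,
              min_reliability: float,
              volume: float = 1.0) -> Optional[Tuple[float, List[str]]]:
    """Return (delay, path) with the lowest delay whose reliability meets the bound."""
    best: Optional[Tuple[float, List[str]]] = None

    def dfs(node: str, vol: float, delay: float, rel: float, path: List[str]) -> None:
        nonlocal best
        if rel < min_reliability:
            return  # reliability can only decrease along a path
        if best is not None and delay >= best[0]:
            return  # delay can only increase along a path
        if node == sink:
            best = (delay, list(path))
            return
        for nxt, unit_delay, reliability, factor in graph.get(node, []):
            if nxt in path:
                continue  # keep the path simple
            path.append(nxt)
            # Link delay scales with the current volume; the service at the
            # far end then rescales the volume (e.g. transcoding halves it).
            dfs(nxt, vol * factor, delay + unit_delay * vol, rel * reliability, path)
            path.pop()

    dfs(source, volume, 0.0, 1.0, [source])
    return best


services: Graph = {
    "src": [("transcode", 2.0, 0.99, 0.5), ("user", 5.0, 0.80, 1.0), ("cache", 1.0, 0.90, 1.0)],
    "transcode": [("user", 4.0, 0.95, 1.0)],
    "cache": [("user", 2.0, 0.90, 1.0)],
}
print(best_path(services, "src", "user", min_reliability=0.9))
print(best_path(services, "src", "user", min_reliability=0.8))
```

Tightening the reliability bound changes which path wins, which is the interaction between the two constraints that makes the general problem hard; exhaustive search like this is only practical for very small graphs.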
An Identity-Based Restricted Deniable Authentication Protocol
Chengyu Fan, Shijie Zhou, Fagen Li
Deniable authentication allows the receiver to identify the source of received messages but not to prove it to any third party. However, deniability of the content, which is called restricted deniability in this paper, is a concern in electronic voting and similar applications. At present, most non-interactive deniable authentication protocols cannot resist the weakened key-compromise impersonation (W-KCI) attack. To address this problem, a non-interactive identity-based restricted deniable authentication protocol is proposed. It not only resists the W-KCI attack but also offers communication flexibility, and it meets security requirements such as correctness and restricted deniability. Therefore, this protocol can be applied in electronic voting.
DOI: https://doi.org/10.1109/ISPA.2009.113 (published 2009-08-18)
Citations: 4
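The basic deniability property in the first sentence of the deniable authentication abstract can be illustrated with a shared-key message authentication code. This is a deliberately simpler, swapped-in technique: the paper's protocol is identity-based and non-interactive, whereas the sketch below assumes a hypothetical pre-shared symmetric key and does not model restricted deniability or the W-KCI attack.

```python
import hashlib
import hmac
import secrets


def make_tag(key: bytes, message: bytes) -> bytes:
    """Compute an HMAC-SHA256 tag over the message."""
    return hmac.new(key, message, hashlib.sha256).digest()


def verify_tag(key: bytes, message: bytes, tag: bytes) -> bool:
    """Constant-time check that the tag matches the message under the key."""
    return hmac.compare_digest(make_tag(key, message), tag)


# Assumption: sender and receiver already share this key.
shared_key = secrets.token_bytes(32)

message = b"ballot: candidate A"
sender_tag = make_tag(shared_key, message)

# The receiver is convinced: only someone holding the key could have made the tag.
print(verify_tag(shared_key, message, sender_tag))

# The receiver holds the same key, so it can produce a valid tag for any message.
# A third party therefore cannot tell sender-made tags from receiver-made ones.
forged_message = b"ballot: candidate B"
forged_tag = make_tag(shared_key, forged_message)
print(verify_tag(shared_key, forged_message, forged_tag))
```

Because both parties can compute identical tags, a transcript proves nothing to an outsider, which is the sense in which the authentication is deniable.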
Composition and Reduction of Web Service Based on Dynamic Timed Colored Petri Nets
Yaojun Han, Xuemei Luo
An enterprise application system is a composite Web service that consists of a collection of Web services related by data and control flow. Formalization and modeling techniques and graphical tools are needed to construct reliable Web services and their applications. In this paper we present a formalization of Web services and an algorithm for constructing compositions. First, we propose a dynamic timed colored Petri net (DTCPN) to model and analyze a Web service. In this Petri net, the colors, which include parameters and the user's QoS (Quality of Service) requirements, represent data flow. The time delay of a transition is a function of the colors in its input place rather than a constant, which captures the dynamic property of Web services. The DTCPN allows the dynamic behavior of large and complex systems to be modeled. Second, we give an algorithm for constructing the composed DTCPN model of an application composed of Web services. To reduce model complexity and the state explosion problem in reachability analysis of Petri nets, we give a DTCPN reduction algorithm for four basic structures of Web service composition. Finally, we discuss the correctness and the time and cost performance of the Web service composition by reducing the DTCPN model and analyzing the reachable service graph.
DOI: https://doi.org/10.1109/ISPA.2009.21 (published 2009-08-18)
Citations: 5
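The central idea of the Petri net abstract, a transition whose delay is a function of the token's color rather than a constant, can be sketched in a few lines. The classes, place names and delay functions below are hypothetical and far simpler than a full DTCPN (one input place and one output place per transition, no guards, no reduction rules).

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List


@dataclass
class Token:
    color: Dict[str, float]  # e.g. request parameters and QoS requirements
    time: float = 0.0        # time at which the token becomes available


@dataclass
class Transition:
    name: str
    source: str
    target: str
    delay: Callable[[Dict[str, float]], float]  # delay depends on the token color


@dataclass
class Net:
    places: Dict[str, List[Token]] = field(default_factory=dict)

    def fire(self, transition: Transition) -> Token:
        """Move one token from the source place to the target place, adding its delay."""
        token = self.places[transition.source].pop(0)
        moved = Token(color=dict(token.color),
                      time=token.time + transition.delay(token.color))
        self.places.setdefault(transition.target, []).append(moved)
        return moved


# Two Web services composed in sequence; both delay functions are invented.
search = Transition("search", "start", "found",
                    delay=lambda c: 0.5 + 0.01 * c["size_kb"])
pay = Transition("pay", "found", "done",
                 delay=lambda c: 2.0 if c["secure"] else 1.0)

net = Net(places={"start": [Token(color={"size_kb": 100.0, "secure": 1.0})]})
net.fire(search)
result = net.fire(pay)
print(result.time)  # completion time of the sequential composition
```

Changing the token's color changes the completion time without changing the net, which is the "dynamic" aspect the abstract attributes to the model.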
Implementation of a Visual Modeling Tool for Defining Instance Aspect in Workflow
Jianxun Liu, Zefeng Zhu, Yiping Wen, Jinjun Chen
An instance-aspect oriented workflow management system vertically combines multiple workflow activity instances and submits them for execution as a whole according to some batch or combination logic. It is inspired by aspect-oriented programming and aims to improve the execution efficiency of business processes. Traditional workflow systems do not support workflow models with instance aspects. In our previous work, we studied workflow instance modeling technology. This paper studies the principles, methods and implementation of a visual GUI tool for modeling instance aspects in workflow. The tool is based on an open-source GUI tool, Together Workflow Editor, and extends it with instance aspect functionality.
DOI: https://doi.org/10.1109/ISPA.2009.59 (published 2009-08-18)
Citations: 1
A Virtualized Self-Adaptive Parallel Programming Framework for Heterogeneous High Productivity Computers
Hua Cheng, Zuoning Chen, Ninghui Sun, Fenbin Qi, Chaoqun Dong, Laiwang Cheng
This paper proposes a Virtualized Self-Adaptive Parallel Programming Framework (VAPPF) for heterogeneous high productivity computers, which is composed of a Virtualization-Based Runtime System (VRTS) and a Virtualized Adaptive Parallel Programming Model (VAPPM). The VRTS is composed of a Node-Level Virtual Machine Monitor (NVMM) and a System-Level Virtual Infrastructure (SVI). The VAPPM programming model is not only compatible with conventional data parallelism but also supports task parallelism. Moreover, with the concepts of Domains and the virtualized process Locale, the VRTS can map computation to processors according to a system-level resource view and performance model. By concealing hardware details through virtualization at both the runtime system level and the programming model level, the framework provides programmers with a middle-level view independent of hardware details. Programmers can program and debug against this middle-level view, and the runtime system then maps it onto the specific hardware environment. In this way, programming can be relatively decoupled from specific hardware architectures. The model achieves an efficient division of work between programmers and systems and can help improve the system's programmability, scalability, portability, robustness, performance, and productivity.
DOI: https://doi.org/10.1109/ISPA.2009.76 (published 2009-08-18)
Citations: 0
An Empirically Optimized Radix Sort for GPU
Bonan Huang, Jinlan Gao, Xiaoming Li
Graphics Processing Units (GPUs) that support general-purpose programs are promising platforms for high performance computing. However, the fundamental architectural differences between GPUs and CPUs, the complexity of the GPU platform and the diversity of GPU specifications have made generating highly efficient GPU code increasingly difficult. Manual code generation is time-consuming, and the result tends to be difficult to debug and maintain. On the other hand, the code generated by today's GPU compilers often performs much worse than the best hand-tuned code. A promising code generation strategy, implemented by systems such as ATLAS, FFTW, SPIRAL and X-Sort, uses empirical search to find the parameter values of the implementation, such as the tile size and instruction schedules, that deliver near-optimal performance on a particular machine. However, this approach has only proved successful on CPUs, where program performance is relatively better understood. Clearly, empirical search must be extended to general-purpose programs on GPUs. In this paper, we propose an empirical optimization technique for one of the most important sorting routines on GPUs, radix sort, that generates highly efficient code for a number of representative NVIDIA GPUs with a wide variety of architectural specifications. Our study focuses on the algorithmic parameters of radix sort that can be adapted to different environments and on the GPU architectural factors that affect its performance. We present an empirical optimization approach that is shown to find highly efficient code for different NVIDIA GPUs. Our results show that this approach is effective at accounting for the complex interactions between architectural characteristics, and that the resulting code performs significantly better than two radix sort implementations that have been shown to outperform other GPU sort routines, with a maximal speedup of 33.4%.
DOI: https://doi.org/10.1109/ISPA.2009.89 (published 2009-08-18)
Citations: 20
Web Service Locating Unit in RFID-Centric Anti-counterfeit System
Zhiyuan Tan, Xiangjian He, P. Nanda
The problem of piracy has troubled people's daily lives for hundreds of years and remains unresolved, even though many anti-counterfeit solutions have been applied. The emergence of Radio Frequency IDentification (RFID) technologies, however, offers a more reliable alternative for constructing authentication systems. This raises a further issue: how to simplify the deployment of an RFID-centric anti-counterfeit system over the Internet. In this article, we propose an approach, the Web Service Locating Unit (WSLU), that addresses this issue by managing many RFID-centric authentication services, each of which relies on web services.
DOI: 10.1109/ISPA.2009.94 (https://doi.org/10.1109/ISPA.2009.94). Published: 2009-08-18.
Citations: 0