2008 Ninth International Conference on Parallel and Distributed Computing, Applications and Technologies最新文献

英文中文

Application-Specific Disk I/O Optimisation for a Search Engine 搜索引擎特定于应用程序的磁盘I/O优化

2008 Ninth International Conference on Parallel and Distributed Computing, Applications and Technologies

Pub Date : 2008-12-01 DOI: 10.1109/PDCAT.2008.61

Xiangfei Jia, A. Trotman, Richard A. O'Keefe, Zhiyi Huang

Operating systems only provide general-purpose I/O optimisation since they have to service various types of applications. However, application level I/O optimisation can achieve better performance since an application has a better knowledge of how to optimise disk I/O for the application. In this paper we provide a solution for application-specific I/O for optimising a search engine. It shows a 28% improvement when compared to the general-purpose I/O optimisation of Linux. Our result also shows a 11% improvement when the Linux I/O optimisation is bypassed.

操作系统只提供通用I/O优化，因为它们必须服务于各种类型的应用程序。但是，应用程序级别的I/O优化可以实现更好的性能，因为应用程序更了解如何为应用程序优化磁盘I/O。在本文中，我们为优化搜索引擎提供了一个特定于应用程序的I/O解决方案。与Linux的通用I/O优化相比，它显示了28%的改进。我们的结果还显示，当绕过Linux I/O优化时，性能提高了11%。

引用次数: 15

A Dynamic Provisioning Framework for Multi-tier Internet Applications in Virtualized Data Center 面向虚拟化数据中心多层次互联网应用的动态供应框架

2008 Ninth International Conference on Parallel and Distributed Computing, Applications and Technologies

Pub Date : 2008-12-01 DOI: 10.1109/PDCAT.2008.74

Yi Jin, Xu Liu, Jianfeng Zhan, Shuang Gao

With the resurgence of virtualization technology, todaypsilas Internet data centers are shifting towards virtualized data centers. Internet applications tend to see dynamically varying workloads. To address the problem of performance management for multi-tier applications hosted in virtualized Internet data center, we propose a three-level automatic provisioning framework based on feedback control for multi-tier applications. Experiments demonstrate the effectiveness of our technique in SLA guarantees while obtaining improved resource utilization.

随着虚拟化技术的复兴，今天的互联网数据中心正在向虚拟化数据中心转变。Internet应用程序倾向于看到动态变化的工作负载。为了解决虚拟化Internet数据中心中托管的多层应用程序的性能管理问题，提出了一种基于反馈控制的多层应用程序的三级自动供应框架。实验证明了该技术在提高资源利用率的同时，在SLA保证方面的有效性。

引用次数: 0

A Parallel Algorithm for Block Tridiagonal Systems 块三对角系统的并行算法

2008 Ninth International Conference on Parallel and Distributed Computing, Applications and Technologies

Pub Date : 2008-12-01 DOI: 10.1109/PDCAT.2008.21

Heng Zhang, Wu Zhang, Xian-He Sun

A parallel algorithm, namely parallel block diagonal dominant (PBDD) algorithm, is proposed to solve block tridiagonal linear systems on multi-computers. This algorithm is based on divided-and-conquer idea of the PDD method. When the systems is strictly block diagonal dominant, the PBDD is highly parallel and provides approximate solutions that equals to the exact solutions within machine accuracy. The PBDD method has been implemented on a 64-node multi-computer. The analytic results match closely with the results measured from the numerical experiments.

提出了一种在多计算机上求解块三对角线性系统的并行算法，即并行块对角优势(PBDD)算法。该算法基于PDD方法的分而治之思想。当系统是严格的块对角线优势时，PBDD是高度平行的，并提供近似解，等于机器精度内的精确解。PBDD方法已在64节点多计算机上实现。分析结果与数值实验结果吻合较好。

引用次数: 3

A Framework for Concurrency in Numerical Simulations Using Lock Free Data Structures: The Graph Parallel Architecture GraPA 使用无锁数据结构的数值模拟中的并发框架:图并行体系结构图

2008 Ninth International Conference on Parallel and Distributed Computing, Applications and Technologies

Pub Date : 2008-12-01 DOI: 10.1109/PDCAT.2008.32

P. Klein, Dimo Maleshkov, D. Asenov

The development of numerical simulation software tools for the solution of real-world problems usually calls for domain experts in modeling. The GraPA framework, as an abstraction layer on top of hardware characteristics, supports modelers in two respects: one is the built-in support for co-processing of multiple models and the other is the generically delivered high performance achieved by implementing concurrency features of multicore and distributed memory architectures. Technically, GraPA is designed as a C++ template framework, where the modeler`s data structures and algorithms instantiate the framework. Using this approach, we handle parallel processing of lock-free data structures and message passing transparently to the modelers. In this paper, we report on the status of the implementation of GraPA and on its performance characteristics.

为解决现实问题而开发的数值模拟软件工具通常需要建模领域的专家。grapha框架作为硬件特征之上的抽象层，在两个方面支持建模器:一个是内置的对多个模型协同处理的支持，另一个是通过实现多核和分布式内存架构的并发特性实现的通用交付的高性能。从技术上讲，GraPA被设计为c++模板框架，其中建模器的数据结构和算法实例化了框架。使用这种方法，我们可以处理无锁数据结构的并行处理，并透明地将消息传递给建模者。在本文中，我们报告了GraPA的实现现状及其性能特征。

引用次数: 4

Redundant Radix-2r Number System for Accelerating Arithmetic Operations on the FPGAs fpga上加速算术运算的冗余基数-2r数系统

2008 Ninth International Conference on Parallel and Distributed Computing, Applications and Technologies

Pub Date : 2008-12-01 DOI: 10.1109/PDCAT.2008.13

K. Kawakami, K. Shigemoto, K. Nakano

The main contribution of this paper is to present hardware algorithms for redundant radix-2r number system in the FPGA to speed the arithmetic operations for numbers with many bits, which have applications in security systems such as RSA encryption and decryption. Our hardware algorithms accelerate arithmetic operations including addition, multiplication, and Montgomery modulo multiplication.Quite surprisingly, our hardware algorithms of the multiplication and Montgomery multiplication for two 1024-bit numbers runs only 64 clock cycles using redundant radix-216 number system. Also, the experimental results for Xilinx Virtex-II Pro Family FPGA XC2VP100-6 show that the clock frequency of our circuit is independent of the number of bits. The speed up factors of our hardware algorithm using the redundant number system over those using the conventional number system are 8.3 for 1024-bit addition, 3.4 for 1024-bit multiplication, and 2.5 for 1024-bit Montgomery modulo multiplication. Further, for 256-bit Montgomery modulo multiplication, our hardware algorithm runs in 0.38 mus, while a previously known implementation runs in 1.22 mus. Thus, our approach using redundant number system for arithmetic operations is very efficient.

本文的主要贡献是在FPGA上提出冗余基数-2r数系统的硬件算法，以加快对多比特数的算术运算，这些算法在RSA加解密等安全系统中具有应用。我们的硬件算法加速算术运算，包括加法，乘法和蒙哥马利模乘法。令人惊讶的是，我们的两个1024位数字的乘法和Montgomery乘法的硬件算法使用冗余基数-216数字系统仅运行64个时钟周期。此外，在Xilinx Virtex-II Pro系列FPGA XC2VP100-6上的实验结果表明，该电路的时钟频率与比特数无关。使用冗余数字系统的硬件算法比使用传统数字系统的硬件算法的速度因子为:1024位加法的速度因子为8.3,1024位乘法的速度因子为3.4,1024位Montgomery模乘法的速度因子为2.5。此外，对于256位Montgomery模乘法，我们的硬件算法运行时间为0.38 mus，而以前已知的实现运行时间为1.22 mus。因此，我们使用冗余数字系统进行算术运算的方法是非常有效的。

{"title":"Redundant Radix-2r Number System for Accelerating Arithmetic Operations on the FPGAs","authors":"K. Kawakami, K. Shigemoto, K. Nakano","doi":"10.1109/PDCAT.2008.13","DOIUrl":"https://doi.org/10.1109/PDCAT.2008.13","url":null,"abstract":"The main contribution of this paper is to present hardware algorithms for redundant radix-2r number system in the FPGA to speed the arithmetic operations for numbers with many bits, which have applications in security systems such as RSA encryption and decryption. Our hardware algorithms accelerate arithmetic operations including addition, multiplication, and Montgomery modulo multiplication.Quite surprisingly, our hardware algorithms of the multiplication and Montgomery multiplication for two 1024-bit numbers runs only 64 clock cycles using redundant radix-216 number system. Also, the experimental results for Xilinx Virtex-II Pro Family FPGA XC2VP100-6 show that the clock frequency of our circuit is independent of the number of bits. The speed up factors of our hardware algorithm using the redundant number system over those using the conventional number system are 8.3 for 1024-bit addition, 3.4 for 1024-bit multiplication, and 2.5 for 1024-bit Montgomery modulo multiplication. Further, for 256-bit Montgomery modulo multiplication, our hardware algorithm runs in 0.38 mus, while a previously known implementation runs in 1.22 mus. Thus, our approach using redundant number system for arithmetic operations is very efficient.","PeriodicalId":282779,"journal":{"name":"2008 Ninth International Conference on Parallel and Distributed Computing, Applications and Technologies","volume":"186 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2008-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123282467","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 7

DCSVS: Distributed Collaborative Set-Top-Box Video Service DCSVS:分布式协同机顶盒视频服务

2008 Ninth International Conference on Parallel and Distributed Computing, Applications and Technologies

Pub Date : 2008-12-01 DOI: 10.1109/PDCAT.2008.17

Chao Liu, Hao Chen, D. Ye

This paper presents a novel video service system DCSVS (distributed collaborative set-top-box video service), which encompasses several practical and effective solutions to both live and VoD (video-on-demand) services. DCSVS is established on an overlay DHT (distributed hash table) network, which improves Kademlia protocol to fit for real-time application. We use several types of pre-fetching to enhance continuity rate of VoD service, and we employ an effective scheduling algorithm based on transferring priorities. We design the inner-first and proxy-forward strategy to relieve transferring failure, and we use Window-based storage and hashing to achieve better system performance. Both theoretical analysis and experimental results show that strategies in DCSVS perform well in terms of efficiency and robustness, and could maintain a fine state and scalability in large-scale networking.

本文提出了一种新型的分布式协同机顶盒视频服务系统DCSVS (distributed collaborative set-顶盒视频服务)，该系统包含了直播和视频点播两种实用有效的解决方案。DCSVS建立在覆盖DHT(分布式哈希表)网络上，改进了Kademlia协议以适应实时应用。为了提高视频点播业务的连续性，我们采用了多种预取方式，并采用了一种有效的基于优先级转移的调度算法。我们设计了内优先和代理前向策略来减轻传输故障，并使用基于windows的存储和哈希来获得更好的系统性能。理论分析和实验结果表明，DCSVS中的策略具有良好的效率和鲁棒性，能够在大规模网络中保持良好的状态和可扩展性。

引用次数: 0

An Effective PSO-Based Node Localization Scheme for Wireless Sensor Networks 一种有效的基于pso的无线传感器网络节点定位方案

2008 Ninth International Conference on Parallel and Distributed Computing, Applications and Technologies

Pub Date : 2008-12-01 DOI: 10.1109/PDCAT.2008.73

Po-Jen Chuang, Cheng-Pei Wu

Wireless sensor networks (WSNs) usually employ different ranging techniques to measure the distance between an unknown node and its neighboring anchor nodes, and based on the measured distance to estimate the position of the unknown node. This paper presents an effective Particle Swarm Optimization (PSO)-based Localization Scheme using the Radio Signal Strength (RSS) ranging technique. Modified from the iterative multilateration algorithm, our scheme is unique in adopting the location data of remote anchors provided by the closest neighbor anchors of an unknown node to estimate the unknown nodepsilas position and using the PSO algorithm to further reduce error accumulation. The new scheme meanwhile takes in a modified DV-distance approach to raise the success ratios of locating unknown nodes. Compared with related schemes, our scheme is shown through simulations to perform constantly better in increasing localization success ratios and decreasing location errors -- at reduced cost.

无线传感器网络通常采用不同的测距技术来测量未知节点与其相邻锚节点之间的距离，并根据测量到的距离来估计未知节点的位置。提出了一种有效的基于粒子群优化(PSO)的无线电信号强度(RSS)测距方案。该方案在迭代迭代算法的基础上进行了改进，其独特之处在于利用未知节点最近邻锚点提供的远程锚点位置数据来估计未知节点的沉降位置，并使用粒子群算法进一步减少误差积累。同时采用改进的dv距离方法，提高了未知节点的定位成功率。仿真结果表明，该方案在提高定位成功率和降低定位误差方面不断取得较好的效果，且成本较低。

引用次数: 69

Parallelization and Acceleration Scheme of Multilevel Fast Multipole Method 多层快速多极法的并行化与加速方案

2008 Ninth International Conference on Parallel and Distributed Computing, Applications and Technologies

Pub Date : 2008-12-01 DOI: 10.1109/PDCAT.2008.34

Wu Wang, Yangde Feng, Xue-bin Chi

The iterative methods such as BiCGStab for solving electromagnetic field integer equations have a complexity of O(N2), which can be reduced to O(N logN) by multilevel fast multipole method (MLFMM). For large scale problems, MLFMM should be parallelized, and the iterative convergence can be accelerated by preconditioners such as incomplete inverse triangular factorization preconditioner. The interpolation based on spherical harmonic transform at each level of MLFMMpsilas octree can be further accelerated by FFT. Based on this acceleration scheme tested on distributed cluster, the results show this algorithm is feasible.

求解电磁场整数方程的BiCGStab等迭代方法的复杂度为0 (N2)，采用多层快速多极法(MLFMM)可将其简化为O(N logN)。对于大规模问题，MLFMM需要并行化，并且可以使用不完全逆三角分解预条件等预条件加速迭代收敛。基于MLFMMpsilas八叉树各层球谐变换的插值可以通过FFT进一步加速。将该加速方案在分布式集群上进行了测试，结果表明该算法是可行的。

引用次数: 2

Portable Object Thermal Awareness: Modeling Intelligent Sensor Networks for Cool Store Applications 便携式物体热感知:为冷库应用建模智能传感器网络

2008 Ninth International Conference on Parallel and Distributed Computing, Applications and Technologies

Pub Date : 2008-12-01 DOI: 10.1109/PDCAT.2008.37

N. Yamani, A. Al-Anbuky, A. Gyasi-Agyei

The recent technological advances have entrenched the potential benefits, when large population of wireless sensor nodes deployed in agricultural, industrial and environmental areas to predict the behavioral analysis of physical attributes such as temperature or gas. This work mainly focuses on the three dimensional temperature distribution of a specified field based on virtually deployed sensor nodes in a simulation environment. The parameters temperature and location are considered in the simulation model. In this work, we have evaluated the minimum number of nodes that are required to map the given space. Modeling & simulation has been dealt with in testing the network density on the space coverage. This work exploits a spatial correlation of temperature data in a given space. Finally the paper discusses the extension of approaches that leads to new research challenges due to the relationships between the obstacles within the environment.

当大量无线传感器节点部署在农业、工业和环境领域，以预测温度或气体等物理属性的行为分析时，最近的技术进步已经确立了潜在的好处。本文主要研究了在模拟环境下，基于虚拟部署的传感器节点对特定场的三维温度分布的影响。在仿真模型中考虑了温度和位置参数。在这项工作中，我们评估了映射给定空间所需的最小节点数。对网络密度在空间覆盖上的测试进行了建模和仿真。这项工作利用了给定空间内温度数据的空间相关性。最后，本文讨论了由于环境中障碍之间的关系而导致新的研究挑战的方法的扩展。

引用次数: 1

Popularity-Driven Dynamic Replica Placement in Hierarchical Data Grids 层次数据网格中受欢迎程度驱动的动态副本放置

2008 Ninth International Conference on Parallel and Distributed Computing, Applications and Technologies

Pub Date : 2008-12-01 DOI: 10.1109/PDCAT.2008.64

Mohammad Shorfuzzaman, P. Graham, Mehmet Rasit Eskicioglu

Data grids provide geographically distributed storage for large-scale data-intensive applications. Ensuring efficient access to such large and widely distributed datasets is hindered by high latencies. To speed up data access, data grid systems replicate data in multiple locations so a user can access the data from a nearby site. In addition to reducing data access time, replication also aims to use network and storage resources efficiently. While replication is a well-known technique, the problem of replica placement has not been widely studied for data grid environments. To obtain the best possible gains from replication, strategic placement of the replicas is critical. In a grid environment resource availability, network latency, and userspsila requests can vary. To address these issues a placement strategy is needed that adapts to dynamic behavior. This paper proposes a new dynamic replica placement algorithm for hierarchical data grids based on file ldquopopularityrdquo. Our goal is to place replicas close to the clients to reduce access time while using the network and storage efficiently thereby effectively balancing storage cost and access latency. We evaluate our algorithm using OptorSim which shows that our approach outperforms other techniques in terms of access time and bandwidth used.

数据网格为大规模数据密集型应用程序提供地理上的分布式存储。确保对如此庞大和广泛分布的数据集的有效访问受到高延迟的阻碍。为了加快数据访问速度，数据网格系统在多个位置复制数据，以便用户可以从附近的站点访问数据。除了减少数据访问时间外，复制的目的还在于有效利用网络和存储资源。虽然复制是一种众所周知的技术，但在数据网格环境中，副本放置问题还没有得到广泛的研究。为了从复制中获得最佳可能的收益，对副本的策略性放置至关重要。在网格环境中，资源可用性、网络延迟和用户请求可能会有所不同。为了解决这些问题，需要一个适应动态行为的放置策略。本文提出了一种基于当前流行的基于文件的分层数据网格动态副本放置算法。我们的目标是将副本放置在靠近客户端的位置，以减少访问时间，同时有效地使用网络和存储，从而有效地平衡存储成本和访问延迟。我们使用OptorSim对我们的算法进行了评估，结果表明我们的方法在访问时间和带宽方面优于其他技术。

{"title":"Popularity-Driven Dynamic Replica Placement in Hierarchical Data Grids","authors":"Mohammad Shorfuzzaman, P. Graham, Mehmet Rasit Eskicioglu","doi":"10.1109/PDCAT.2008.64","DOIUrl":"https://doi.org/10.1109/PDCAT.2008.64","url":null,"abstract":"Data grids provide geographically distributed storage for large-scale data-intensive applications. Ensuring efficient access to such large and widely distributed datasets is hindered by high latencies. To speed up data access, data grid systems replicate data in multiple locations so a user can access the data from a nearby site. In addition to reducing data access time, replication also aims to use network and storage resources efficiently. While replication is a well-known technique, the problem of replica placement has not been widely studied for data grid environments. To obtain the best possible gains from replication, strategic placement of the replicas is critical. In a grid environment resource availability, network latency, and userspsila requests can vary. To address these issues a placement strategy is needed that adapts to dynamic behavior. This paper proposes a new dynamic replica placement algorithm for hierarchical data grids based on file ldquopopularityrdquo. Our goal is to place replicas close to the clients to reduce access time while using the network and storage efficiently thereby effectively balancing storage cost and access latency. We evaluate our algorithm using OptorSim which shows that our approach outperforms other techniques in terms of access time and bandwidth used.","PeriodicalId":282779,"journal":{"name":"2008 Ninth International Conference on Parallel and Distributed Computing, Applications and Technologies","volume":"21 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2008-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121729598","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 19

首页上一页

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

2008 Ninth International Conference on Parallel and Distributed Computing, Applications and Technologies

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀