2016 17th International Conference on Parallel and Distributed Computing, Applications and Technologies (PDCAT)最新文献

英文中文

The Optimization of LRU Algorithm Based on Pre-Selection and Cache Prefetching of Files in Hybrid Cloud 混合云中基于文件预选和缓存预取的LRU算法优化

2016 17th International Conference on Parallel and Distributed Computing, Applications and Technologies (PDCAT)

Pub Date : 2016-12-01 DOI: 10.1109/PDCAT.2016.039

Shumeng Du, Chunlin Li, XiJun Mao, Wei Yan

In recent years, the research on caching in cloud environment has become an important research topic, and it has profound meaning to research the cache replacement algorithm in hybrid Cloud. There aren't enough considerations on some aspects, such as the selection of pending cache files, the prefetching of pending cache files among different clouds and the cost of recovery of files. Considering those shortages, this paper proposes an optimized LRU algorithm based on pre-selection and cache prefetching of files. This algorithm determines whether the file is to meet the pre-selection and cache prefetching conditions before adding a cache file, and it implements the LRU cache replacement algorithm which is based on priority. The algorithm divides the cache into multiple priority queues, and uses the LRU cache replacement algorithm to select the replacement file in each queue. Then select the files in each priority and put them together, select the file to perform replacement operation which has minimum probability of being accessed again. Compared with three typical cache replacement algorithm GD-Size, LRU, LFU, experimental results show that the cache replacement algorithm in this paper not only effectively save cost, but also greatly enhance the byte hit rate, delay savings rate and cache hit rate.

近年来，对云环境下缓存的研究已成为一个重要的研究课题，研究混合云下的缓存替换算法具有深远的意义。在挂起缓存文件的选择、不同云间挂起缓存文件的预取、文件的恢复成本等方面没有足够的考虑。针对这些不足，本文提出了一种基于文件预选和缓存预取的优化LRU算法。该算法在添加缓存文件前判断文件是否满足预选和预取缓存条件，实现基于优先级的LRU缓存替换算法。该算法将缓存划分为多个优先级队列，并使用LRU缓存替换算法在每个队列中选择替换文件。然后将各优先级的文件选择在一起，选择被再次访问概率最小的文件进行替换操作。实验结果表明，与GD-Size、LRU、LFU三种典型的缓存替换算法相比，本文的缓存替换算法不仅有效地节省了成本，而且大大提高了字节命中率、延迟节省率和缓存命中率。

{"title":"The Optimization of LRU Algorithm Based on Pre-Selection and Cache Prefetching of Files in Hybrid Cloud","authors":"Shumeng Du, Chunlin Li, XiJun Mao, Wei Yan","doi":"10.1109/PDCAT.2016.039","DOIUrl":"https://doi.org/10.1109/PDCAT.2016.039","url":null,"abstract":"In recent years, the research on caching in cloud environment has become an important research topic, and it has profound meaning to research the cache replacement algorithm in hybrid Cloud. There aren't enough considerations on some aspects, such as the selection of pending cache files, the prefetching of pending cache files among different clouds and the cost of recovery of files. Considering those shortages, this paper proposes an optimized LRU algorithm based on pre-selection and cache prefetching of files. This algorithm determines whether the file is to meet the pre-selection and cache prefetching conditions before adding a cache file, and it implements the LRU cache replacement algorithm which is based on priority. The algorithm divides the cache into multiple priority queues, and uses the LRU cache replacement algorithm to select the replacement file in each queue. Then select the files in each priority and put them together, select the file to perform replacement operation which has minimum probability of being accessed again. Compared with three typical cache replacement algorithm GD-Size, LRU, LFU, experimental results show that the cache replacement algorithm in this paper not only effectively save cost, but also greatly enhance the byte hit rate, delay savings rate and cache hit rate.","PeriodicalId":203925,"journal":{"name":"2016 17th International Conference on Parallel and Distributed Computing, Applications and Technologies (PDCAT)","volume":"39 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116806365","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 7

Accurate Evaluation of Bivariate Polynomials 二元多项式的精确求值

2016 17th International Conference on Parallel and Distributed Computing, Applications and Technologies (PDCAT)

Pub Date : 2016-12-01 DOI: 10.1109/PDCAT.2016.026

Peibing Du, Hao Jiang, Housen Li, Lizhi Cheng, Canqun Yang

Polynomials are widely used in scientific computing and engineering. In this paper, we present an accurate and fast compensated algorithm to evaluate bivariate polynomials with floating-point coefficients. This algorithm is applying error free transformations to the bivariate Horner scheme and sum the final decomposition accurately. We also prove the proposed algorithm's accuracy with forward error analysis that the accuracy of the computed result is similar to the result computed by the bivariate Horner scheme in twice the working precision. Numerical experiments illustrate the behavior and it has higher efficiency than the bivariate Horner scheme implemented in double-double library.

多项式在科学计算和工程中有着广泛的应用。本文提出了一种计算浮点系数二元多项式的快速、精确的补偿算法。该算法对二元Horner格式进行无误差变换，并准确地对最终分解求和。通过前向误差分析证明了算法的精度，计算结果与二元Horner格式计算结果在两倍的工作精度下相当。数值实验表明，该方法比双双库中实现的双变量Horner格式具有更高的效率。

引用次数: 4

NUMA-BTDM: A Thread Mapping Algorithm for Balanced Data Locality on NUMA Systems NUMA- btdm:一种NUMA系统上平衡数据局部性的线程映射算法

2016 17th International Conference on Parallel and Distributed Computing, Applications and Technologies (PDCAT)

Pub Date : 2016-12-01 DOI: 10.1109/PDCAT.2016.074

Iulia Stirb

Optimizing for Non-Uniform Memory Access (NUMA) systems could be considered inappropriate because hardware architecture aware optimizations are not portable. On the contrary, this paper supports the idea that developing NUMA aware optimizations improves performance and energy consumption on NUMA systems and that these optimizations may be considered portable when they are non static. This paper introduces NUMA Balanced Thread and Data Mapping (BTDM), an extension of PThreads4w API [1]. NUMA-BTDM employs balanced data locality concept, improving thread and data mapping for NUMA systems. The purpose is to combine task parallelism with balanced data locality in order to obtain both better performance and reduced energy consumption on NUMA systems at run-time. The implementation of NUMA-BTDM targets homogeneous architectures based on the energy model with constant energy consumption or on the energy model in which each core is powered from a separate source (architectures on which parallel execution may reduce energy consumption compared to serial execution).

对于非统一内存访问(NUMA)系统的优化可能被认为是不合适的，因为硬件架构感知的优化是不可移植的。相反，本文支持这样一种观点，即开发NUMA感知优化可以提高NUMA系统的性能和能耗，并且当这些优化是非静态的时，可以认为它们是可移植的。本文介绍了PThreads4w API[1]的扩展——NUMA平衡线程和数据映射(BTDM)。NUMA- btdm采用均衡的数据局部性概念，改进了NUMA系统的线程和数据映射。目的是将任务并行性与平衡的数据局部性结合起来，以便在NUMA系统运行时获得更好的性能和更低的能耗。NUMA-BTDM的实现目标是基于具有恒定能耗的能量模型或基于每个核心由单独来源供电的能量模型的同构架构(与串行执行相比，并行执行可能减少能耗的架构)。

引用次数: 5

Managing Broadband Access Network with a SDN-Based System 用sdn系统管理宽带接入网

2016 17th International Conference on Parallel and Distributed Computing, Applications and Technologies (PDCAT)

Pub Date : 2016-12-01 DOI: 10.1109/PDCAT.2016.022

Junpeng Guo, Xiaohan Gao, Rentao Gu

Admittedly, the broadband access network has been improved largely with the developing technologies, it is still facing challenges on managing and maintaining existed resources efficiently. In order to build up an intelligent and open network architecture, and solve the problem of heterogeneous networks consisting of devices from different vendors, we have worked out a web-based managing system implementing the concept of Software-Defined Network (SDN) and Network Functions Virtualization. The controlling plane is centered into the Controller layer and decoupled from the forwarding layer. The frame we proposed is also applicable for old routers, which do not support SDN, with an Agent on it to translate the OpenFlow messages. For a more intelligent routing schema, the controller is able to calculate with a fine-tuned ant colony optimization algorithm. At the top of the controller, the web-based managing system is accessible for operators, and they can manage the resource they possessed. With the above framework, we achieve the goal of an intelligent and open network architecture and verify it.

诚然，随着技术的发展，宽带接入网已经有了很大的改进，但如何有效地管理和维护现有的资源仍然是宽带接入网面临的挑战。为了构建智能、开放的网络架构，解决由不同厂商设备组成的异构网络的问题，我们设计了一个基于web的管理系统，实现了软件定义网络(SDN)和网络功能虚拟化的概念。控制平面以控制器层为中心，与转发层解耦。我们提出的帧也适用于不支持SDN的旧路由器，上面有一个代理来翻译OpenFlow消息。对于更智能的路由模式，控制器可以使用微调的蚁群优化算法进行计算。在控制器的顶部，操作员可以访问基于web的管理系统，他们可以管理自己拥有的资源。通过上述框架，我们实现了智能开放网络架构的目标，并对其进行了验证。

{"title":"Managing Broadband Access Network with a SDN-Based System","authors":"Junpeng Guo, Xiaohan Gao, Rentao Gu","doi":"10.1109/PDCAT.2016.022","DOIUrl":"https://doi.org/10.1109/PDCAT.2016.022","url":null,"abstract":"Admittedly, the broadband access network has been improved largely with the developing technologies, it is still facing challenges on managing and maintaining existed resources efficiently. In order to build up an intelligent and open network architecture, and solve the problem of heterogeneous networks consisting of devices from different vendors, we have worked out a web-based managing system implementing the concept of Software-Defined Network (SDN) and Network Functions Virtualization. The controlling plane is centered into the Controller layer and decoupled from the forwarding layer. The frame we proposed is also applicable for old routers, which do not support SDN, with an Agent on it to translate the OpenFlow messages. For a more intelligent routing schema, the controller is able to calculate with a fine-tuned ant colony optimization algorithm. At the top of the controller, the web-based managing system is accessible for operators, and they can manage the resource they possessed. With the above framework, we achieve the goal of an intelligent and open network architecture and verify it.","PeriodicalId":203925,"journal":{"name":"2016 17th International Conference on Parallel and Distributed Computing, Applications and Technologies (PDCAT)","volume":"64 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126287199","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 1

Green-Aware Online Resource Allocation for Geo-Distributed Cloud Data Centers on Multi-Source Energy

2016 17th International Conference on Parallel and Distributed Computing, Applications and Technologies (PDCAT)

Pub Date : 2016-12-01 DOI: 10.1109/PDCAT.2016.037

Huaiwen He, Hong Shen

Huge energy consumption of large-scale cloud data centers damages the environment with excessive carbon emission. More and more data center operators are seeking to reduce carbon footprint via various types of renewable energy sources. However, the intermittent availability of renewable energy source makes it quite challenging to cooperate the dynamic workload arrivals. In this paper, we investigate how to coordinate multi-type renewable energy (e.g. wind power and solar power) in order to reduce the long-term energy cost with spatio-temporal diversity of electricity price for geo-distributed cloud data centers under the constraints of service level agreement (SLA) and carbon footprints. To tackle the randomness of workload arrival, dynamic electricity price change and renewable energy generation, we first formulate the minimizing energy cost problem into a constrained stochastic optimization problem. Then, based on Lyapunov optimization technique, we design an online control algorithm which can work without long-term future system information for solving the problem. Finally, we evaluate the effectiveness of the algorithm with extensive simulations based on real-world workload traces, electricity price and historic climate data.

大规模云数据中心能耗巨大，碳排放超标，破坏环境。越来越多的数据中心运营商正在寻求通过各种可再生能源来减少碳足迹。然而，可再生能源的时断时续性使其在动态工作量到来时的合作具有很大的挑战性。本文研究了在服务水平协议(SLA)和碳足迹约束下，基于地理分布式云数据中心电价的时空差异，如何协调多类型可再生能源(如风能和太阳能)以降低长期能源成本。为了解决负荷到达、电价动态变化和可再生能源发电的随机性问题，首先将能量成本最小化问题转化为约束随机优化问题。然后，基于李雅普诺夫优化技术，设计了一种不需要长期未来系统信息的在线控制算法来解决问题。最后，我们通过基于现实世界工作量轨迹、电价和历史气候数据的广泛模拟来评估该算法的有效性。

{"title":"Green-Aware Online Resource Allocation for Geo-Distributed Cloud Data Centers on Multi-Source Energy","authors":"Huaiwen He, Hong Shen","doi":"10.1109/PDCAT.2016.037","DOIUrl":"https://doi.org/10.1109/PDCAT.2016.037","url":null,"abstract":"Huge energy consumption of large-scale cloud data centers damages the environment with excessive carbon emission. More and more data center operators are seeking to reduce carbon footprint via various types of renewable energy sources. However, the intermittent availability of renewable energy source makes it quite challenging to cooperate the dynamic workload arrivals. In this paper, we investigate how to coordinate multi-type renewable energy (e.g. wind power and solar power) in order to reduce the long-term energy cost with spatio-temporal diversity of electricity price for geo-distributed cloud data centers under the constraints of service level agreement (SLA) and carbon footprints. To tackle the randomness of workload arrival, dynamic electricity price change and renewable energy generation, we first formulate the minimizing energy cost problem into a constrained stochastic optimization problem. Then, based on Lyapunov optimization technique, we design an online control algorithm which can work without long-term future system information for solving the problem. Finally, we evaluate the effectiveness of the algorithm with extensive simulations based on real-world workload traces, electricity price and historic climate data.","PeriodicalId":203925,"journal":{"name":"2016 17th International Conference on Parallel and Distributed Computing, Applications and Technologies (PDCAT)","volume":"4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125747701","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 8

Dynamic Load Balancing for Physical Servers in Virtualized Environment 虚拟化环境下物理服务器的动态负载均衡

2016 17th International Conference on Parallel and Distributed Computing, Applications and Technologies (PDCAT)

Pub Date : 2016-12-01 DOI: 10.1109/PDCAT.2016.057

Mingming Zhang, Songyun Wang, Gaopan Huang, Yefei Li, S. Zhang, Zhuzhong Qian

Virtualization technology has brought new vitality to data centers but also brought some thorny issues. Virtualization technology creates an abstract intermediate layer, separating the upper layer applications from the underlying infrastructure, which cause some difficulties to the upper layer applications for the effective use of resources. Based on this observation, we propose a dynamic load balancing system in virtualized environment. The system adjusts the weights of virtual machines in real-time in order to balancing physical servers' load and improve data center efficiency. In addition, by monitoring physical machine failure information, we can evacuate physical machines as soon as they failed to ensure high availability of the data center. We design a virtualized environment monitoring system and propose an efficient algorithm. We evaluate the proposed system with real implementations which show DLB has rather good performance.

虚拟化技术给数据中心带来了新的活力，但也带来了一些棘手的问题。虚拟化技术创建了一个抽象的中间层，将上层应用程序与底层基础设施分开，这给上层应用程序有效利用资源带来了一些困难。在此基础上，提出了一种虚拟环境下的动态负载均衡系统。系统实时调整虚拟机的权重，平衡物理服务器的负载，提高数据中心的效率。此外，通过监控物理机故障信息，我们可以在物理机出现故障时及时撤离，确保数据中心的高可用性。设计了一个虚拟环境监测系统，并提出了一种高效的算法。用实际实现对该系统进行了评价，结果表明该系统具有较好的性能。

引用次数: 1

Energy Aware Scheduling on Heterogeneous Multiprocessors with DVFS and Duplication 具有DVFS和复制的异构多处理器的能量感知调度

2016 17th International Conference on Parallel and Distributed Computing, Applications and Technologies (PDCAT)

Pub Date : 2016-12-01 DOI: 10.1109/PDCAT.2016.036

Jagpreet Singh, Aditya Gujral, Harmandeep Singh, Jagbeer Singh, Nitin Auluck

Duplication and dynamic voltage/frequency scaling (DVFS) creates an interesting trade-off for scheduling task graphs on multiprocessors to improve energy consumption and schedule length (or makespan). With DVFS, tasks are made to run on low voltages, which decreases their computation power. However, it also increases their execution costs and hence, may increase the schedule length. Furthermore, applying DVFS on processors does not impact the communication delay/energy consumption. Duplicating a task on multiple processors reduces the communication delay among them, which further reduces the schedule length. Although duplication reduces the communication energy among processors, it also increases the overall computation energy. In this paper, we explore this trade-off between duplication and DVFS, and propose a polynomial time heuristic to schedule task graphs on heterogeneous multiprocessors. The tasks are carefully duplicated with DVFS to reduce its impact on the computation energy. The results demonstrate that the proposed algorithm is able to effectively balance the makespan and energy consumption over other algorithms in various scenarios.

复制和动态电压/频率缩放(DVFS)为多处理器上的调度任务图创建了一个有趣的权衡，以改善能耗和调度长度(或makespan)。使用DVFS，任务在低电压下运行，这降低了它们的计算能力。然而，它也增加了它们的执行成本，因此可能会增加调度长度。此外，在处理器上应用DVFS不会影响通信延迟/能耗。在多个处理器上复制一个任务可以减少它们之间的通信延迟，从而进一步减少调度长度。虽然重复减少了处理器之间的通信能量，但也增加了整体的计算能量。在本文中，我们探讨了重复和DVFS之间的权衡，并提出了一个多项式时间启发式方法来调度异构多处理器上的任务图。为了减少对计算能量的影响，在DVFS中仔细地重复了这些任务。结果表明，在不同的场景下，该算法能够有效地平衡最大完工时间和能量消耗。

{"title":"Energy Aware Scheduling on Heterogeneous Multiprocessors with DVFS and Duplication","authors":"Jagpreet Singh, Aditya Gujral, Harmandeep Singh, Jagbeer Singh, Nitin Auluck","doi":"10.1109/PDCAT.2016.036","DOIUrl":"https://doi.org/10.1109/PDCAT.2016.036","url":null,"abstract":"Duplication and dynamic voltage/frequency scaling (DVFS) creates an interesting trade-off for scheduling task graphs on multiprocessors to improve energy consumption and schedule length (or makespan). With DVFS, tasks are made to run on low voltages, which decreases their computation power. However, it also increases their execution costs and hence, may increase the schedule length. Furthermore, applying DVFS on processors does not impact the communication delay/energy consumption. Duplicating a task on multiple processors reduces the communication delay among them, which further reduces the schedule length. Although duplication reduces the communication energy among processors, it also increases the overall computation energy. In this paper, we explore this trade-off between duplication and DVFS, and propose a polynomial time heuristic to schedule task graphs on heterogeneous multiprocessors. The tasks are carefully duplicated with DVFS to reduce its impact on the computation energy. The results demonstrate that the proposed algorithm is able to effectively balance the makespan and energy consumption over other algorithms in various scenarios.","PeriodicalId":203925,"journal":{"name":"2016 17th International Conference on Parallel and Distributed Computing, Applications and Technologies (PDCAT)","volume":"64 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131744789","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 2

An Application Oriented Multi-Agent Based Approach to Dynamic Truck Scheduling at Cross-Dock 面向应用的多智能体交叉码头卡车动态调度方法

2016 17th International Conference on Parallel and Distributed Computing, Applications and Technologies (PDCAT)

Pub Date : 2016-12-01 DOI: 10.1109/PDCAT.2016.058

Houda Zouhaier, L. B. Said

Truck arrival management forms a very active stream of research and a crucial challenge for a cross-dock terminals. The study focuses on the truck congestion problem, which leads to a lower operation efficiency and a longer waiting time at the gate and at the yard. One of the operational measures to solve this problem is the truck appointment system. It is used to coordinate the major cross-dock planning activities and to regulate the arrival time of trucks at the cross-dock. When the trucker get an appointment time different to its preference time, then we are talking about a truck deviation time. Because the deviation will result in daily operations schedule, an optimization model for truck appointment was proposed in this paper. In the model, the truck deviation time was minimized subject to the constraints of resources availability including dock doors, yard zones, gate lanes, workforce and material handling systems. To solve the model, a method based multi-agent system to real-time truck scheduling, that take into account the uncertainty of arrival time as an operational characteristic, was designed. It ensures a negotiation among truck agents and resource agents. Lastly, a numerical experiments are provided to illustrate the validity of the model and to illustrate the working and benefit of our approach.

货车到达管理是一个非常活跃的研究方向，也是交叉码头的一个重要挑战。研究的重点是货车拥堵问题，该问题导致货车运行效率较低，在门口和场站等待时间较长。解决这一问题的操作措施之一是卡车预约制度。它用于协调主要的交叉码头规划活动，并规范卡车到达交叉码头的时间。当卡车司机得到的预约时间与其偏好时间不同时，我们讨论的是卡车偏离时间。由于这种偏差会影响日常作业计划，本文提出了车辆调度优化模型。在该模型中，受码头门、堆场区、门道、劳动力和物料搬运系统等资源可用性的约束，卡车偏离时间最小化。为求解该模型，设计了一种基于多智能体系统的卡车实时调度方法，该方法将到达时间的不确定性作为一种操作特征。它保证了卡车代理和资源代理之间的协商。最后，通过数值实验验证了该模型的有效性，并说明了该方法的有效性和有效性。

{"title":"An Application Oriented Multi-Agent Based Approach to Dynamic Truck Scheduling at Cross-Dock","authors":"Houda Zouhaier, L. B. Said","doi":"10.1109/PDCAT.2016.058","DOIUrl":"https://doi.org/10.1109/PDCAT.2016.058","url":null,"abstract":"Truck arrival management forms a very active stream of research and a crucial challenge for a cross-dock terminals. The study focuses on the truck congestion problem, which leads to a lower operation efficiency and a longer waiting time at the gate and at the yard. One of the operational measures to solve this problem is the truck appointment system. It is used to coordinate the major cross-dock planning activities and to regulate the arrival time of trucks at the cross-dock. When the trucker get an appointment time different to its preference time, then we are talking about a truck deviation time. Because the deviation will result in daily operations schedule, an optimization model for truck appointment was proposed in this paper. In the model, the truck deviation time was minimized subject to the constraints of resources availability including dock doors, yard zones, gate lanes, workforce and material handling systems. To solve the model, a method based multi-agent system to real-time truck scheduling, that take into account the uncertainty of arrival time as an operational characteristic, was designed. It ensures a negotiation among truck agents and resource agents. Lastly, a numerical experiments are provided to illustrate the validity of the model and to illustrate the working and benefit of our approach.","PeriodicalId":203925,"journal":{"name":"2016 17th International Conference on Parallel and Distributed Computing, Applications and Technologies (PDCAT)","volume":"31 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123191796","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 6

Making User-Level VMM for Deterministic Parallelism Nonblocking and Efficient 使用户级VMM实现确定性并行、非阻塞和高效

2016 17th International Conference on Parallel and Distributed Computing, Applications and Technologies (PDCAT)

Pub Date : 2016-12-01 DOI: 10.1109/PDCAT.2016.042

Yu Zhang, Jiange Zhang, Qiliang Zhang

Many parallel programs are intended to yield deterministic results, but unpredictable thread or process interleavings can lead to subtle bugs and nondeterminism. We proposed a producer-consumer virtual memory–Many parallel programs are intended to yield deterministic results, but unpredictable thread or process interleavings can lead to subtle bugs and nondeterminism. We proposed a producer-consumer virtual memory–SPMC–for efficient system-enforced deterministic parallelism, and prototyped the SPMC model and its software stack entirely in Linux user space, called DLinux. This paper summarizes the implementation policies and limitations in our previous DLinux. To reduce SPMC page fault overhead and suspend/resume overhead which severely degrade the performance of DLinux, we enhance the SPMC model with nonblocking test and direct read and write primitives. Based on the extended SPMC model, we improve the implementation of upper programming abstractions. Experimental results show that relative to the previous version, the new DLinux can improve the performance of NPB workloads up to 2.33X and 1.76X on 8 and 16 processes, respectively. For CG on 8 processes, its runtime relative to MPICH2 decreases from 4.12X to 1.77X. SPMC–for efficient system-enforced deterministic parallelism, and prototyped the SPMC model and its software stack entirely in Linux user space, called DLinux. This paper summarizes the implementation policies and limitations in our previous DLinux. To reduce SPMC page fault overhead and suspend/resume overhead which severely degrade the performance of DLinux, we enhance the SPMC model with nonblocking test and direct read and write primitives. Based on the extended SPMC model, we improve the implementation of upper programming abstractions. Experimental results show that relative to the previous version, the new DLinux can improve the performance of NPB workloads up to 2.33X and 1.76X on 8 and 16 processes, respectively. For CG on 8 processes, its runtime relative to MPICH2 decreases from 4.12X to 1.77X.

许多并行程序旨在产生确定性的结果，但是不可预测的线程或进程交织可能导致微妙的错误和不确定性。我们提出了一种生产者-消费者虚拟内存——许多并行程序旨在产生确定性的结果，但不可预测的线程或进程交织可能导致微妙的错误和不确定性。我们提出了一个生产者-消费者虚拟内存- SPMC -用于高效的系统强制确定性并行，并在Linux用户空间中对SPMC模型及其软件堆栈进行了原型化，称为DLinux。本文总结了我们以前的DLinux的实现策略和限制。为了减少严重降低DLinux性能的SPMC页面故障开销和挂起/恢复开销，我们使用非阻塞测试和直接读写原语增强了SPMC模型。在扩展SPMC模型的基础上，改进了上层编程抽象的实现。实验结果表明，与之前的版本相比，新版本的DLinux在8个进程和16个进程上的NPB工作负载性能分别提高了2.33倍和1.76倍。对于8个进程的CG，其相对于MPICH2的运行时间从4.12X减少到1.77X。SPMC -高效的系统强制的确定性并行，并在Linux用户空间中对SPMC模型及其软件栈进行了原型化，称为DLinux。本文总结了我们以前的DLinux的实现策略和限制。为了减少严重降低DLinux性能的SPMC页面故障开销和挂起/恢复开销，我们使用非阻塞测试和直接读写原语增强了SPMC模型。在扩展SPMC模型的基础上，改进了上层编程抽象的实现。实验结果表明，与之前的版本相比，新版本的DLinux在8个进程和16个进程上的NPB工作负载性能分别提高了2.33倍和1.76倍。对于8个进程的CG，其相对于MPICH2的运行时间从4.12X减少到1.77X。

{"title":"Making User-Level VMM for Deterministic Parallelism Nonblocking and Efficient","authors":"Yu Zhang, Jiange Zhang, Qiliang Zhang","doi":"10.1109/PDCAT.2016.042","DOIUrl":"https://doi.org/10.1109/PDCAT.2016.042","url":null,"abstract":"Many parallel programs are intended to yield deterministic results, but unpredictable thread or process interleavings can lead to subtle bugs and nondeterminism. We proposed a producer-consumer virtual memory–Many parallel programs are intended to yield deterministic results, but unpredictable thread or process interleavings can lead to subtle bugs and nondeterminism. We proposed a producer-consumer virtual memory–SPMC–for efficient system-enforced deterministic parallelism, and prototyped the SPMC model and its software stack entirely in Linux user space, called DLinux. This paper summarizes the implementation policies and limitations in our previous DLinux. To reduce SPMC page fault overhead and suspend/resume overhead which severely degrade the performance of DLinux, we enhance the SPMC model with nonblocking test and direct read and write primitives. Based on the extended SPMC model, we improve the implementation of upper programming abstractions. Experimental results show that relative to the previous version, the new DLinux can improve the performance of NPB workloads up to 2.33X and 1.76X on 8 and 16 processes, respectively. For CG on 8 processes, its runtime relative to MPICH2 decreases from 4.12X to 1.77X. SPMC–for efficient system-enforced deterministic parallelism, and prototyped the SPMC model and its software stack entirely in Linux user space, called DLinux. This paper summarizes the implementation policies and limitations in our previous DLinux. To reduce SPMC page fault overhead and suspend/resume overhead which severely degrade the performance of DLinux, we enhance the SPMC model with nonblocking test and direct read and write primitives. Based on the extended SPMC model, we improve the implementation of upper programming abstractions. Experimental results show that relative to the previous version, the new DLinux can improve the performance of NPB workloads up to 2.33X and 1.76X on 8 and 16 processes, respectively. For CG on 8 processes, its runtime relative to MPICH2 decreases from 4.12X to 1.77X.","PeriodicalId":203925,"journal":{"name":"2016 17th International Conference on Parallel and Distributed Computing, Applications and Technologies (PDCAT)","volume":"67 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126304331","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Affine Correction Based Image Watermarking Robust to Geometric Attacks 基于仿射校正的图像水印对几何攻击的鲁棒性

2016 17th International Conference on Parallel and Distributed Computing, Applications and Technologies (PDCAT)

Pub Date : 2016-12-01 DOI: 10.1109/PDCAT.2016.046

Wuyong Zhang, Jianhua Chen, Rongshu Wang, Xiaolong Wang, Tian Meng

How to resist combined geometric attacks effectively while maintain a high embedding capacity is still a challenging task for the digital watermarking research. An affine correction based algorithm is proposed in this paper, which can resist combined geometric attacks and keep a higher watermark embedding capacity. The SURF algorithm and the RANSAC algorithm are used to extract, match and select feature points from the attacked image and the original image. Then, the least square algorithm is used to estimate the affine matrix of the geometric attacks according to the relationship between the matched feature points. The attacks are corrected based on the estimated affine matrix. A fine correction step is included to improve the precision of the watermark detection. To resist the cropping attacks, the watermark information is encoded with LT-coding. The encoded watermark is embedded in the DWT-DCT composite domain of the image. Experimental results show that the proposed algorithm not only has a high embedding capacity, but also is robust to many kinds of geometric attacks.

如何在有效抵抗组合几何攻击的同时保持较高的嵌入容量仍然是数字水印研究的一个具有挑战性的课题。本文提出了一种基于仿射校正的水印算法，该算法能够抵抗组合几何攻击并保持较高的水印嵌入容量。利用SURF算法和RANSAC算法从被攻击图像和原始图像中提取、匹配和选择特征点。然后，根据匹配的特征点之间的关系，利用最小二乘算法估计几何攻击的仿射矩阵;根据估计的仿射矩阵对攻击进行校正。为了提高水印检测的精度，算法中加入了精细校正步骤。为了抵抗裁剪攻击，水印信息采用lt编码进行编码。编码后的水印嵌入到图像的DWT-DCT复合域中。实验结果表明，该算法不仅具有较高的嵌入容量，而且对多种几何攻击具有较强的鲁棒性。

{"title":"Affine Correction Based Image Watermarking Robust to Geometric Attacks","authors":"Wuyong Zhang, Jianhua Chen, Rongshu Wang, Xiaolong Wang, Tian Meng","doi":"10.1109/PDCAT.2016.046","DOIUrl":"https://doi.org/10.1109/PDCAT.2016.046","url":null,"abstract":"How to resist combined geometric attacks effectively while maintain a high embedding capacity is still a challenging task for the digital watermarking research. An affine correction based algorithm is proposed in this paper, which can resist combined geometric attacks and keep a higher watermark embedding capacity. The SURF algorithm and the RANSAC algorithm are used to extract, match and select feature points from the attacked image and the original image. Then, the least square algorithm is used to estimate the affine matrix of the geometric attacks according to the relationship between the matched feature points. The attacks are corrected based on the estimated affine matrix. A fine correction step is included to improve the precision of the watermark detection. To resist the cropping attacks, the watermark information is encoded with LT-coding. The encoded watermark is embedded in the DWT-DCT composite domain of the image. Experimental results show that the proposed algorithm not only has a high embedding capacity, but also is robust to many kinds of geometric attacks.","PeriodicalId":203925,"journal":{"name":"2016 17th International Conference on Parallel and Distributed Computing, Applications and Technologies (PDCAT)","volume":"60 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122263365","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 5

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

2016 17th International Conference on Parallel and Distributed Computing, Applications and Technologies (PDCAT)

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀