2014 20th IEEE International Conference on Parallel and Distributed Systems (ICPADS)最新文献

英文中文

A wearable virtual coach for Marathon beginners 专为马拉松初学者设计的可穿戴虚拟教练

2014 20th IEEE International Conference on Parallel and Distributed Systems (ICPADS)

Pub Date : 2014-12-01 DOI: 10.1109/PADSW.2014.7097908

Jr-Jung Chen, Yi-Fan Chung, Chiu-Ping Chang, C. King, Cheng-Hsin Hsu

Marathon is very popular in recent years. However, finishing the game is no easy task, especially for beginners. Regular practices and training are needed. With the availability of wearable devices, it is possible to develop virtual coaches that monitor the progresses of individual runners closeup and guide them through tailor-made training schedules. Unfortunately, most existing wearable devices only record physiological signals of the runners and rely on off-line processing to provide feedbacks. In this paper, we present the design and development of an on-line virtual coach, which performs real-time tracking and analysis of the physiological status of the runner and suggests appropriate adjustments on the exercise intensity. The proposed virtual coach is a pure software solution and can work with any wearable device that monitors the heart rate and running speed of the runner. The main challenge of our system is to predict when the runner will reach the various running states and instruct the runner to adjust the speed just ahead of time so that her/his body can react in time to maintain the required training intensity. Experiments on real users show that our proposed algorithms can correctly predict the running states of the runners and help them to better maintain the required intensity to maximize the training effects.

马拉松近年来很受欢迎。然而，完成游戏并不是一件容易的事，尤其是对初学者来说。需要定期练习和培训。随着可穿戴设备的出现，有可能开发出虚拟教练，近距离监控单个跑步者的进步，并指导他们完成量身定制的训练计划。遗憾的是，现有的可穿戴设备大多只记录跑步者的生理信号，依靠离线处理来提供反馈。本文设计并开发了一种在线虚拟教练，可以实时跟踪和分析跑步者的生理状态，并提出适当的运动强度调整建议。拟议中的虚拟教练是一个纯软件解决方案，可以与任何监测跑步者心率和跑步速度的可穿戴设备一起工作。我们系统的主要挑战是预测跑步者何时会达到各种跑步状态，并指导跑步者提前调整速度，使她/他的身体能够及时做出反应，以保持所需的训练强度。在真实用户身上的实验表明，我们提出的算法可以正确预测跑步者的跑步状态，帮助他们更好地保持所需的强度，从而达到最大的训练效果。

{"title":"A wearable virtual coach for Marathon beginners","authors":"Jr-Jung Chen, Yi-Fan Chung, Chiu-Ping Chang, C. King, Cheng-Hsin Hsu","doi":"10.1109/PADSW.2014.7097908","DOIUrl":"https://doi.org/10.1109/PADSW.2014.7097908","url":null,"abstract":"Marathon is very popular in recent years. However, finishing the game is no easy task, especially for beginners. Regular practices and training are needed. With the availability of wearable devices, it is possible to develop virtual coaches that monitor the progresses of individual runners closeup and guide them through tailor-made training schedules. Unfortunately, most existing wearable devices only record physiological signals of the runners and rely on off-line processing to provide feedbacks. In this paper, we present the design and development of an on-line virtual coach, which performs real-time tracking and analysis of the physiological status of the runner and suggests appropriate adjustments on the exercise intensity. The proposed virtual coach is a pure software solution and can work with any wearable device that monitors the heart rate and running speed of the runner. The main challenge of our system is to predict when the runner will reach the various running states and instruct the runner to adjust the speed just ahead of time so that her/his body can react in time to maintain the required training intensity. Experiments on real users show that our proposed algorithms can correctly predict the running states of the runners and help them to better maintain the required intensity to maximize the training effects.","PeriodicalId":421740,"journal":{"name":"2014 20th IEEE International Conference on Parallel and Distributed Systems (ICPADS)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130160402","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 8

Reducing lock contention on multi-core platforms 减少多核平台上的锁争用

2014 20th IEEE International Conference on Parallel and Distributed Systems (ICPADS)

Pub Date : 2014-12-01 DOI: 10.1109/PADSW.2014.7097804

Haimiao Ding, Xiaofei Liao, Hai Jin, Xinqiao Lv, Rentong Guo

As multi-core platforms with hundreds or even more quantities of cores are popular, system optimization issues, including lock contentions, start to puzzle programmers who work on multi-core platforms. Locks are more convenient and clear than lock-free operations (for example, transactional memory) for multi-core programmers. However, lock contention has been recognized as a typical impediment to the performance of shared-memory parallel programs. This paper mainly discusses two important reasons that cause lock contention, including large critical sections and frequent lock requests. For current solutions, it is hard for programmers to find the locations of large critical sections and good scheme to reduce lock contentions on hot critical sections. This paper proposes FFlocker, a series of runtime solutions that reduce lock contention caused by the two issues. FFlocker includes a profiling algorithm to find the locations of large critical sections. Based on the profiling scheme, it binds the threads acquiring the same locks onto the same core. We evaluate our techniques with three benchmarks. The results show that FFlocker offers better performance than Function Flow and OpenMP.

随着拥有数百甚至更多核的多核平台的流行，系统优化问题(包括锁争用)开始困扰在多核平台上工作的程序员。对于多核程序员来说，锁比无锁操作(例如，事务性内存)更方便、更清晰。然而，锁争用已经被认为是影响共享内存并行程序性能的一个典型障碍。本文主要讨论了导致锁争用的两个重要原因，即大临界区和频繁的锁请求。对于目前的解决方案，程序员很难找到大临界区的位置和减少热临界区的锁争用的好方案。本文提出了一系列运行时解决方案FFlocker，以减少这两个问题引起的锁争用。FFlocker包括一个分析算法，用于查找大临界区域的位置。基于分析方案，它将获得相同锁的线程绑定到相同的核上。我们用三个基准来评估我们的技术。结果表明，FFlocker的性能优于Function Flow和OpenMP。

{"title":"Reducing lock contention on multi-core platforms","authors":"Haimiao Ding, Xiaofei Liao, Hai Jin, Xinqiao Lv, Rentong Guo","doi":"10.1109/PADSW.2014.7097804","DOIUrl":"https://doi.org/10.1109/PADSW.2014.7097804","url":null,"abstract":"As multi-core platforms with hundreds or even more quantities of cores are popular, system optimization issues, including lock contentions, start to puzzle programmers who work on multi-core platforms. Locks are more convenient and clear than lock-free operations (for example, transactional memory) for multi-core programmers. However, lock contention has been recognized as a typical impediment to the performance of shared-memory parallel programs. This paper mainly discusses two important reasons that cause lock contention, including large critical sections and frequent lock requests. For current solutions, it is hard for programmers to find the locations of large critical sections and good scheme to reduce lock contentions on hot critical sections. This paper proposes FFlocker, a series of runtime solutions that reduce lock contention caused by the two issues. FFlocker includes a profiling algorithm to find the locations of large critical sections. Based on the profiling scheme, it binds the threads acquiring the same locks onto the same core. We evaluate our techniques with three benchmarks. The results show that FFlocker offers better performance than Function Flow and OpenMP.","PeriodicalId":421740,"journal":{"name":"2014 20th IEEE International Conference on Parallel and Distributed Systems (ICPADS)","volume":"9 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126525681","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 2

A fully generalized over operator with applications to image composition in parallel visualization for big data science 一个完全广义的over算子及其在大数据科学并行可视化图像合成中的应用

2014 20th IEEE International Conference on Parallel and Distributed Systems (ICPADS)

Pub Date : 2014-12-01 DOI: 10.1109/PADSW.2014.7097854

Dongliang Chu, C. Wu, Zongmin Wang, Yongqiang Wang

The over operator is commonly used for α-blending in various visualization techniques. In the current form, it is a binary operator and must respect the restriction of order dependency, hence posing a significant performance limit. This paper proposes a fully generalized version of this operator. Compared with its predecessor, the fully generalized over operator is not only n-operator compatible but also any-order friendly. To demonstrate the advantages of the proposed operator, we apply it to the asynchronous and order-dependent image composition problem in parallel visualization for big data science and further parallelize it for performance improvement. We conduct theoretical analyses to establish the performance superiority of the proposed over operator in comparison with its original form, which is further validated by extensive experimental results in the context of real-life scientific visualization.

在各种可视化技术中，过算符通常用于α-混合。在目前的形式下，它是一个二进制运算符，必须遵守顺序依赖的限制，因此造成了很大的性能限制。本文提出了该算子的一个完全广义版本。与它的前身相比，完全广义上算子不仅兼容n算子，而且是任意阶友好的。为了证明所提出的算子的优势，我们将其应用于大数据科学并行可视化中的异步和顺序依赖的图像合成问题，并进一步并行化以提高性能。我们进行了理论分析，与原始形式相比，建立了所提出的over算子的性能优势，并通过实际科学可视化背景下的大量实验结果进一步验证了这一点。

引用次数: 1

Combination feature for image retrieval in the distributed datacenter 分布式数据中心图像检索的组合特性

2014 20th IEEE International Conference on Parallel and Distributed Systems (ICPADS)

Pub Date : 2014-12-01 DOI: 10.1109/PADSW.2014.7097871

Di Yang, J. Liao, Q. Qi, Jingyu Wang, Haifeng Sun, Shantao Jiang

Since the emergence of cloud datacenters provides an enormous amount of resources easily accessible to people, it is challenging to provide an efficient search framework in such a distributed environment. However, traditional search techniques only allow users to search images over exact-match keywords through a centralized index. These methods are insufficient to meet requirements of content based image retrieval (CBIR) and more powerful search frameworks are needed. In this paper, we present LCFIR, an effective image retrieval framework for fast content location in the distributed situation. It adopts the peer-to-peer paradigm and combines color and edge features. The basic idea is to construct multiple replicas of an image's index through exploiting the property of Locality Sensitive Hashing (LSH). Thus, the indexes of similar images are probabilistically gathered into the same node without the knowledge of any global information. The empirical results show that the system is able to yield high accuracy with load balancing, and only contacts a few number of the participating nodes.

由于云数据中心的出现为人们提供了大量易于访问的资源，因此在这样一个分布式环境中提供高效的搜索框架是一项挑战。然而，传统的搜索技术只允许用户通过集中索引在精确匹配的关键字上搜索图像。这些方法不足以满足基于内容的图像检索(CBIR)的要求，需要更强大的搜索框架。在本文中，我们提出了一种有效的图像检索框架LCFIR，用于在分布式情况下快速定位内容。它采用点对点模式，结合了颜色和边缘特征。其基本思想是通过利用局域敏感散列(Locality Sensitive hash, LSH)的特性来构建图像索引的多个副本。这样，在不知道任何全局信息的情况下，将相似图像的索引概率地聚集到同一节点。实验结果表明，在负载均衡的情况下，该系统能够产生较高的准确率，并且只接触少量的参与节点。

{"title":"Combination feature for image retrieval in the distributed datacenter","authors":"Di Yang, J. Liao, Q. Qi, Jingyu Wang, Haifeng Sun, Shantao Jiang","doi":"10.1109/PADSW.2014.7097871","DOIUrl":"https://doi.org/10.1109/PADSW.2014.7097871","url":null,"abstract":"Since the emergence of cloud datacenters provides an enormous amount of resources easily accessible to people, it is challenging to provide an efficient search framework in such a distributed environment. However, traditional search techniques only allow users to search images over exact-match keywords through a centralized index. These methods are insufficient to meet requirements of content based image retrieval (CBIR) and more powerful search frameworks are needed. In this paper, we present LCFIR, an effective image retrieval framework for fast content location in the distributed situation. It adopts the peer-to-peer paradigm and combines color and edge features. The basic idea is to construct multiple replicas of an image's index through exploiting the property of Locality Sensitive Hashing (LSH). Thus, the indexes of similar images are probabilistically gathered into the same node without the knowledge of any global information. The empirical results show that the system is able to yield high accuracy with load balancing, and only contacts a few number of the participating nodes.","PeriodicalId":421740,"journal":{"name":"2014 20th IEEE International Conference on Parallel and Distributed Systems (ICPADS)","volume":"36 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127558871","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

DLBS: Decentralized load balancing scheme for event-driven cloud frameworks DLBS:用于事件驱动云框架的分散负载平衡方案

2014 20th IEEE International Conference on Parallel and Distributed Systems (ICPADS)

Pub Date : 2014-12-01 DOI: 10.1109/PADSW.2014.7097896

Changlong Li, Xuehai Zhou, Mingming Sun, Kun Lu, Jinhong Zhou, Hang Zhuang, Dong Dai

With the development of cloud computing, more and more applications are moving to a distributed fashion to solve problems. These applications usually contain complex iterative or incremental procedures and have a more urgent requirement on low-latency. Thus many event-driven cloud frameworks are proposed. To optimize this kind of frameworks, an efficient strategy to minimize the execution time by redistributing work- loads is needed. Nowadays, load balance is a critical issue for the efficient operation of cloud platforms and many centralized schemes have already been proposed. However, few of them have been designed to support event-driven frameworks. Besides, as the cluster size and volume of tasks increases, centralized scheme will lead to a bottleneck of master node. In this paper, we demonstrate a decentralized load balancing scheme named DLBS for event-driven cloud frameworks and present two technologies to optimize it. In our design, schedulers are placed in every node for independently load-monitoring, autonomous decision-making and parallel task-scheduling. With the help of DLBS, master frees from the burden and tasks are executed with lower latency. We analyze the excellence of DLBS theoretically and proof it through simulation. At last, we implement and deploy it on a 64-machine cluster and demonstrate that it performs within 20% of an ideal scheme, which are consistent with simulation results.

随着云计算的发展，越来越多的应用程序正在转向分布式方式来解决问题。这些应用程序通常包含复杂的迭代或增量过程，并且对低延迟有更迫切的要求。因此，人们提出了许多事件驱动的云框架。为了优化这类框架，需要一种有效的策略，通过重新分配工作负载来最小化执行时间。目前，负载均衡是云平台高效运行的关键问题，已经提出了许多集中的方案。然而，它们中很少被设计成支持事件驱动的框架。此外，随着集群规模和任务量的增加，集中式方案会导致主节点的瓶颈。在本文中，我们展示了一种名为DLBS的分散负载平衡方案，用于事件驱动的云框架，并提出了两种技术来优化它。在我们的设计中，调度程序被放置在每个节点中，用于独立的负载监控、自主决策和并行任务调度。在DLBS的帮助下，master摆脱了负担，以更低的延迟执行任务。从理论上分析了DLBS的优越性，并通过仿真验证了其优越性。最后，我们在一个64机集群上进行了实现和部署，结果表明，该方案的性能在理想方案的20%以内，与仿真结果一致。

{"title":"DLBS: Decentralized load balancing scheme for event-driven cloud frameworks","authors":"Changlong Li, Xuehai Zhou, Mingming Sun, Kun Lu, Jinhong Zhou, Hang Zhuang, Dong Dai","doi":"10.1109/PADSW.2014.7097896","DOIUrl":"https://doi.org/10.1109/PADSW.2014.7097896","url":null,"abstract":"With the development of cloud computing, more and more applications are moving to a distributed fashion to solve problems. These applications usually contain complex iterative or incremental procedures and have a more urgent requirement on low-latency. Thus many event-driven cloud frameworks are proposed. To optimize this kind of frameworks, an efficient strategy to minimize the execution time by redistributing work- loads is needed. Nowadays, load balance is a critical issue for the efficient operation of cloud platforms and many centralized schemes have already been proposed. However, few of them have been designed to support event-driven frameworks. Besides, as the cluster size and volume of tasks increases, centralized scheme will lead to a bottleneck of master node. In this paper, we demonstrate a decentralized load balancing scheme named DLBS for event-driven cloud frameworks and present two technologies to optimize it. In our design, schedulers are placed in every node for independently load-monitoring, autonomous decision-making and parallel task-scheduling. With the help of DLBS, master frees from the burden and tasks are executed with lower latency. We analyze the excellence of DLBS theoretically and proof it through simulation. At last, we implement and deploy it on a 64-machine cluster and demonstrate that it performs within 20% of an ideal scheme, which are consistent with simulation results.","PeriodicalId":421740,"journal":{"name":"2014 20th IEEE International Conference on Parallel and Distributed Systems (ICPADS)","volume":"39 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115149752","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Performance evaluation of OpenDaylight SDN controller OpenDaylight SDN控制器的性能评估

2014 20th IEEE International Conference on Parallel and Distributed Systems (ICPADS)

Pub Date : 2014-12-01 DOI: 10.1109/PADSW.2014.7097868

Zuhran Khan Khattak, M. Awais, Adnan Iqbal

Enourmous amount of data has resulted into large data centres. Virtual machines and virtual networks are an integral part of large data centres. As a result, software defined network controllers have emerged as viable solution to manage such networks. The performance analysis of network controllers is generally carried out through benchmarking. Although several benchmarking studies exist, recently launched OpenDaylight SDN Controller is not considered in any benchmarking study yet. In this paper, we present initial results of benchmarking of Open-Daylight SDN Controller with well known Floodlight controller. We present latency and throughput results of OpenDaylight SDN Controller and Floodlight under different scenarios. We find that OpenDaylight SDN Controller has low average responses as compared to Floodlight. We also note that the standard benchmarking tool - Cbench - has no support for real traffic patterns in a data centre, since data centre traffic is considerably complex. In addition to benchmarking of OpenDaylight SDN Controller, we propose modifications in Cbench to accommodate models of real traffic in data centres. We also discuss our initial implementation.

海量的数据形成了大型的数据中心。虚拟机和虚拟网络是大型数据中心的组成部分。因此，软件定义的网络控制器已成为管理此类网络的可行解决方案。网络控制器的性能分析一般通过基准测试进行。虽然存在一些基准研究，但最近推出的OpenDaylight SDN控制器尚未被考虑在任何基准研究中。在本文中，我们介绍了Open-Daylight SDN控制器与众所周知的泛光灯控制器的基准测试的初步结果。我们给出了OpenDaylight SDN控制器和泛光灯在不同场景下的延迟和吞吐量结果。我们发现OpenDaylight SDN控制器与泛光灯相比具有较低的平均响应。我们还注意到，标准基准测试工具—Cbench—不支持数据中心中的实际流量模式，因为数据中心流量相当复杂。除了对OpenDaylight SDN控制器进行基准测试外，我们还建议在chbench中进行修改，以适应数据中心的真实流量模型。我们还讨论了我们的初始实现。

{"title":"Performance evaluation of OpenDaylight SDN controller","authors":"Zuhran Khan Khattak, M. Awais, Adnan Iqbal","doi":"10.1109/PADSW.2014.7097868","DOIUrl":"https://doi.org/10.1109/PADSW.2014.7097868","url":null,"abstract":"Enourmous amount of data has resulted into large data centres. Virtual machines and virtual networks are an integral part of large data centres. As a result, software defined network controllers have emerged as viable solution to manage such networks. The performance analysis of network controllers is generally carried out through benchmarking. Although several benchmarking studies exist, recently launched OpenDaylight SDN Controller is not considered in any benchmarking study yet. In this paper, we present initial results of benchmarking of Open-Daylight SDN Controller with well known Floodlight controller. We present latency and throughput results of OpenDaylight SDN Controller and Floodlight under different scenarios. We find that OpenDaylight SDN Controller has low average responses as compared to Floodlight. We also note that the standard benchmarking tool - Cbench - has no support for real traffic patterns in a data centre, since data centre traffic is considerably complex. In addition to benchmarking of OpenDaylight SDN Controller, we propose modifications in Cbench to accommodate models of real traffic in data centres. We also discuss our initial implementation.","PeriodicalId":421740,"journal":{"name":"2014 20th IEEE International Conference on Parallel and Distributed Systems (ICPADS)","volume":"14 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127587796","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 102

A victim and rescuer communication model in collapsed buildings/structures 倒塌建筑物/构筑物中的受害者与救援者沟通模型

2014 20th IEEE International Conference on Parallel and Distributed Systems (ICPADS)

Pub Date : 2014-12-01 DOI: 10.1109/PADSW.2014.7097887

Tanapat Anusas-Amornkul

In a disaster hit area, infrastructure networks may be disconnected and main communications between victims and rescuers are not available. An interesting question is that how many people are trapped in a collapsed building and how close they are to a nearby rescuer's position. A smart-phone is widely used today and it normally has a Wi-Fi function to connect to the Internet. The phone can be used for saving victims' lives. In this paper, a communication model is proposed for a victim to communicate with others and also a rescuer to communicate with victims. State diagrams of the victim and rescuer models are proposed along with pseudo codes for the model operation. An example communication scenario is presented to demonstrate how the model works.

在受灾地区，基础设施网络可能会中断，受害者和救援人员之间的主要通信也无法使用。一个有趣的问题是，有多少人被困在倒塌的建筑物里，他们离附近的救援人员有多近。智能手机如今被广泛使用，它通常具有Wi-Fi功能来连接互联网。这款手机可以用来挽救受害者的生命。本文提出了受害者与他人沟通的沟通模式，以及救援者与受害者沟通的沟通模式。提出了受害者和救援者模型的状态图以及模型运行的伪代码。给出了一个示例通信场景来演示该模型是如何工作的。

引用次数: 4

Session-based fault-tolerant design patterns 基于会话的容错设计模式

2014 20th IEEE International Conference on Parallel and Distributed Systems (ICPADS)

Pub Date : 2014-12-01 DOI: 10.1109/PADSW.2014.7097875

N. Ivaki, Filipe Araújo, F. Barros

Despite offering reliability against dropped and reordered packets, the widely adopted Transmission Control Protocol (TCP) provides nearly no recovery options for longterm network outages. When the network fails, developers must rollback the application to some coherent state on their own, using error-prone solutions. Overcoming this limitation is, therefore, a deeply investigated and challenging problem. Existing solutions range from transport-layer to application-layer protocols, including additions to TCP, usually transparent to the application. None of these solutions is perfect, because they all impact TCP's simplicity, performance or ubiquity, if not all. To avoid these shortcomings, we contain TCP connection crashes inside a single session layer exposed as a sockets interface. Based on this interface, we create a blocking and a non-blocking fault-tolerant design pattern. We explore the blocking design in an open source File Transfer Protocol (FTP) server and perform a thorough evaluation of performance, complexity and overhead of both designs. Our results show that using one of the patterns to tolerate TCP connection crashes, in new or existing applications, involves a very limited effort and negligible penalties.

尽管广泛采用的传输控制协议(TCP)提供了针对丢弃和重新排序数据包的可靠性，但它几乎没有为长期网络中断提供恢复选项。当网络出现故障时，开发人员必须使用容易出错的解决方案，自己将应用程序回滚到某种一致状态。因此，克服这一限制是一个深入研究和具有挑战性的问题。现有的解决方案包括从传输层到应用层协议，包括对TCP的添加，通常对应用程序是透明的。这些解决方案都不是完美的，因为它们都会影响TCP的简单性、性能或普遍性，如果不是全部的话。为了避免这些缺点，我们将TCP连接崩溃包含在作为套接字接口公开的单个会话层中。基于该接口，我们创建了一个阻塞和一个非阻塞容错设计模式。我们探索了开源文件传输协议(FTP)服务器中的阻塞设计，并对两种设计的性能、复杂性和开销进行了全面的评估。我们的结果表明，在新的或现有的应用程序中，使用其中一种模式来容忍TCP连接崩溃所涉及的努力非常有限，并且可以忽略不计。

{"title":"Session-based fault-tolerant design patterns","authors":"N. Ivaki, Filipe Araújo, F. Barros","doi":"10.1109/PADSW.2014.7097875","DOIUrl":"https://doi.org/10.1109/PADSW.2014.7097875","url":null,"abstract":"Despite offering reliability against dropped and reordered packets, the widely adopted Transmission Control Protocol (TCP) provides nearly no recovery options for longterm network outages. When the network fails, developers must rollback the application to some coherent state on their own, using error-prone solutions. Overcoming this limitation is, therefore, a deeply investigated and challenging problem. Existing solutions range from transport-layer to application-layer protocols, including additions to TCP, usually transparent to the application. None of these solutions is perfect, because they all impact TCP's simplicity, performance or ubiquity, if not all. To avoid these shortcomings, we contain TCP connection crashes inside a single session layer exposed as a sockets interface. Based on this interface, we create a blocking and a non-blocking fault-tolerant design pattern. We explore the blocking design in an open source File Transfer Protocol (FTP) server and perform a thorough evaluation of performance, complexity and overhead of both designs. Our results show that using one of the patterns to tolerate TCP connection crashes, in new or existing applications, involves a very limited effort and negligible penalties.","PeriodicalId":421740,"journal":{"name":"2014 20th IEEE International Conference on Parallel and Distributed Systems (ICPADS)","volume":"50 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115883838","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 8

Physically based parallel ray tracer for the Metropolis light transport algorithm on the Tianhe-2 supercomputer 天河2号超级计算机上Metropolis光传输算法的物理并行光线追踪器

2014 20th IEEE International Conference on Parallel and Distributed Systems (ICPADS)

Pub Date : 2014-12-01 DOI: 10.1109/PADSW.2014.7097840

Changmao Wu, Yunquan Zhang, Congli Yang, Yutong Lu

Developing an efficient and highly scalable ray tracer for the Metropolis light transport algorithm is becoming increasingly important as the request for photorealistic images becomes a common trend. Although the Metropolis light transport algorithm has produced some of the most realistic images to date, it usually takes a great amount of time to render an image. The development of an efficient and highly scalable ray tracer for the Metropolis light transport algorithm is hard due in large part to the irregular memory access patterns, the imbalanced workload of light-carrying paths and the complicated mathematical model and complex physical processes. In this paper, we present a highly scalable physically based parallel ray tracer for the Metropolis light transport algorithm. Firstly, we present the idea of snapshot and sub-snapshot, then propose a novel assignment partitioning algorithm for compute nodes and CPU cores since the demand-driven assignment partitioning algorithms don't work. Secondly, we propose a physically based parallel ray racing framework for the Metropolis light transport algorithm, which is based on a master-worker architecture. Finally, we discuss the issue of granularity of the assignment partitioning and some optimization strategies for improving overall performance, then a hybrid scheduling strategy combining a static and dynamic scheduling strategy is described. Experiments show that our physically based ray tracer almost reaches linear speedup by using 26,400 CPU cores on the Tianhe-2 supercomputer. Our ray tracer is more efficient and highly scalable.

随着对逼真图像的要求成为一种普遍趋势，为Metropolis光传输算法开发一种高效且高度可扩展的光线追踪器变得越来越重要。尽管Metropolis光传输算法已经产生了一些迄今为止最逼真的图像，但渲染图像通常需要花费大量时间。对于Metropolis光传输算法，由于内存访问模式不规范、光传输路径负载不平衡以及复杂的数学模型和物理过程，开发高效、高可扩展性的光线追踪器是一项艰巨的任务。在本文中，我们提出了一个高度可扩展的基于物理的平行光线追踪器，用于Metropolis光传输算法。首先，我们提出了快照和子快照的思想，然后针对需求驱动的分配分区算法不能工作的情况，提出了一种新的计算节点和CPU内核分配分区算法。其次，我们提出了一种基于主工架构的Metropolis光传输算法的物理并行光线竞速框架。最后，讨论了分配分区的粒度问题和提高整体性能的优化策略，并提出了一种静态和动态混合调度策略。实验表明，在天河二号超级计算机上使用26400个CPU内核，我们的物理光线追踪器几乎达到了线性加速。我们的光线追踪器更高效，可高度扩展。

{"title":"Physically based parallel ray tracer for the Metropolis light transport algorithm on the Tianhe-2 supercomputer","authors":"Changmao Wu, Yunquan Zhang, Congli Yang, Yutong Lu","doi":"10.1109/PADSW.2014.7097840","DOIUrl":"https://doi.org/10.1109/PADSW.2014.7097840","url":null,"abstract":"Developing an efficient and highly scalable ray tracer for the Metropolis light transport algorithm is becoming increasingly important as the request for photorealistic images becomes a common trend. Although the Metropolis light transport algorithm has produced some of the most realistic images to date, it usually takes a great amount of time to render an image. The development of an efficient and highly scalable ray tracer for the Metropolis light transport algorithm is hard due in large part to the irregular memory access patterns, the imbalanced workload of light-carrying paths and the complicated mathematical model and complex physical processes. In this paper, we present a highly scalable physically based parallel ray tracer for the Metropolis light transport algorithm. Firstly, we present the idea of snapshot and sub-snapshot, then propose a novel assignment partitioning algorithm for compute nodes and CPU cores since the demand-driven assignment partitioning algorithms don't work. Secondly, we propose a physically based parallel ray racing framework for the Metropolis light transport algorithm, which is based on a master-worker architecture. Finally, we discuss the issue of granularity of the assignment partitioning and some optimization strategies for improving overall performance, then a hybrid scheduling strategy combining a static and dynamic scheduling strategy is described. Experiments show that our physically based ray tracer almost reaches linear speedup by using 26,400 CPU cores on the Tianhe-2 supercomputer. Our ray tracer is more efficient and highly scalable.","PeriodicalId":421740,"journal":{"name":"2014 20th IEEE International Conference on Parallel and Distributed Systems (ICPADS)","volume":"22 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123001694","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 2

LDSN: Localization scheme for double-head maritime Sensor Networks LDSN:双头海事传感器网络定位方案

2014 20th IEEE International Conference on Parallel and Distributed Systems (ICPADS)

Pub Date : 2014-12-01 DOI: 10.1109/PADSW.2014.7097813

Hanjiang Luo, Kaishun Wu, Jiang Xiao, Zhongwen Guo

Ocean covers nearly 71% of our planet's surface, yet 95% of the ocean remains unexplored by human being, and wireless sensor networks are envisioned to perform monitoring tasks over the large portion of our world. However, deploying wireless sensor networks on the sea poses many challenges and for maritime surveillance security applications we may need to deploy sensors both on the sea surface and underwater for three-dimensional detection. In this paper, we propose a hybrid ocean sensor networks called Double-head maritime Sensor Networks (DSNs), which combine the advantages of wireless sensor networks and underwater acoustic sensor networks. By leveraging the unique characteristics of DSNs, we design a localization scheme LDSN which is consisted of two algorithms SML and FLA. We first use SML to localize moored anchor nodes as seed nodes. After the underwater sensor networks have been localized, the floating double-head nodes can figure out its instant position via FLA algorithm. We evaluate the scheme by simulations and the results show that the scheme can achieve a high localization accuracy.

海洋覆盖了地球表面近71%的面积，但95%的海洋仍未被人类开发，无线传感器网络有望在我们世界的大部分地区执行监测任务。然而，在海上部署无线传感器网络带来了许多挑战，对于海上监视安全应用，我们可能需要在海面和水下部署传感器进行三维探测。本文提出了一种结合无线传感器网络和水声传感器网络优点的混合型海洋传感器网络——双头海洋传感器网络(DSNs)。利用dsn的独特特性，设计了一种由SML和FLA两种算法组成的定位方案LDSN。我们首先使用SML将锚节点定位为种子节点。水下传感器网络定位完成后，浮动双头节点可通过FLA算法计算出其瞬间位置。仿真结果表明，该方案具有较高的定位精度。

{"title":"LDSN: Localization scheme for double-head maritime Sensor Networks","authors":"Hanjiang Luo, Kaishun Wu, Jiang Xiao, Zhongwen Guo","doi":"10.1109/PADSW.2014.7097813","DOIUrl":"https://doi.org/10.1109/PADSW.2014.7097813","url":null,"abstract":"Ocean covers nearly 71% of our planet's surface, yet 95% of the ocean remains unexplored by human being, and wireless sensor networks are envisioned to perform monitoring tasks over the large portion of our world. However, deploying wireless sensor networks on the sea poses many challenges and for maritime surveillance security applications we may need to deploy sensors both on the sea surface and underwater for three-dimensional detection. In this paper, we propose a hybrid ocean sensor networks called Double-head maritime Sensor Networks (DSNs), which combine the advantages of wireless sensor networks and underwater acoustic sensor networks. By leveraging the unique characteristics of DSNs, we design a localization scheme LDSN which is consisted of two algorithms SML and FLA. We first use SML to localize moored anchor nodes as seed nodes. After the underwater sensor networks have been localized, the floating double-head nodes can figure out its instant position via FLA algorithm. We evaluate the scheme by simulations and the results show that the scheme can achieve a high localization accuracy.","PeriodicalId":421740,"journal":{"name":"2014 20th IEEE International Conference on Parallel and Distributed Systems (ICPADS)","volume":"114 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124171601","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 4

首页上一页

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

2014 20th IEEE International Conference on Parallel and Distributed Systems (ICPADS)

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀