International Journal of Parallel Emergent and Distributed Systems最新文献

英文中文

Modelling distributive computation by selective machines 用选择性机器模拟分布计算

IF 1.1 Q4 COMPUTER SCIENCE, THEORY & METHODS

International Journal of Parallel Emergent and Distributed Systems

Pub Date : 2021-09-03 DOI: 10.1080/17445760.2021.1934837

M. Burgin

In this paper, classes of automata that perform distributed computations with unconventional interaction are described and studied. These automata are called selective machines and they are more powerful than Turing machines while their high computing and recognising power can be achieved exclusively by interaction when a system of recursive algorithms (automata) becomes super-recursive due to their interaction. Computations of selective machines are described by selective algorithms, which are super-recursive allowing computations of functions that are incomputable by Turing machines. Examples of selective algorithms are grammars with prohibition, correction grammars and grammars with exclusion. The study of selective machines and selective algorithms is based on the axiomatic theory of algorithms, in which the results are obtained in the general situation of axiomatically defined classes of automata and algorithms. Then these results are specified for many concrete classes of automata and algorithms, such as finite automata or Turing machines, by checking the necessary axioms.

本文描述和研究了具有非常规交互作用的分布式计算自动机。这些自动机被称为选择性机器，它们比图灵机更强大，而当递归算法(自动机)系统由于相互作用而变得超递归时，它们的高计算和识别能力只能通过相互作用来实现。选择机的计算由选择算法来描述，选择算法是超递归的，允许计算图灵机无法计算的函数。选择性算法的例子有禁止语法、纠正语法和排除语法。选择机器和选择算法的研究是基于算法的公理化理论，其结果是在公理化定义的自动机和算法类的一般情况下得到的。然后，通过检查必要的公理，这些结果被指定为许多具体的自动机和算法，如有限自动机或图灵机。

引用次数: 0

Enhancing parallelism of distributed algorithms with the actor model and a smart data movement technique 利用参与者模型和智能数据移动技术增强分布式算法的并行性

IF 1.1 Q4 COMPUTER SCIENCE, THEORY & METHODS

International Journal of Parallel Emergent and Distributed Systems

Pub Date : 2021-08-31 DOI: 10.1080/17445760.2021.1971665

A. Doroshenko, E. Tulika, O. Yatsenko

ABSTRACT The centralised orchestration technique is often a bad idea for massive parallelism applications if we want to achieve a scalable solution. In this paper for this purpose, the choreography approach is undertaken and some adaptive methods and software tools of distributed implementation are proposed to enhance computation parallelism applied to the optimisation of a class of block-recursive algorithms. A new formal model of distribution and coordination of the tasks in a computing cluster as asynchronous reactive processes with message-passing represented with an actor model and choreography of actors is developed. Also, a new scheme of data placement in a multiprocessor cluster based on prioritisation of block-recursive operations is developed to reduce idling time, data movement time. Adaptive adjustment of the data placement in a cluster at run time to account for current cluster load is developed and an auto-tuning of the actor placement in a cluster based on previous statistics for optimisation is implemented. The experiments show that the choreography of actors allows to remove the central coordinating element, to avoid hard dependencies between cluster nodes, and to achieve a better degree of the parallel applications’ scalability. GRAPHICAL ABSTRACT

如果我们想要实现可扩展的解决方案，集中式编排技术对于大规模并行应用来说通常是一个坏主意。为此，本文采用了编排方法，并提出了一些分布式实现的自适应方法和软件工具，以增强应用于优化一类块递归算法的计算并行性。提出了计算集群中任务分配和协调的一种新的形式化模型，该模型作为异步响应过程，其消息传递用参与者模型和参与者编排表示。此外，还提出了一种基于块递归操作优先级的多处理器集群数据放置新方案，以减少空闲时间和数据移动时间。开发了在运行时对集群中的数据位置进行自适应调整以考虑当前集群负载，并基于先前的优化统计实现了集群中参与者位置的自动调优。实验表明，参与者编排可以去除中心协调元素，避免集群节点之间的硬依赖，实现更好的并行应用可扩展性。图形抽象

{"title":"Enhancing parallelism of distributed algorithms with the actor model and a smart data movement technique","authors":"A. Doroshenko, E. Tulika, O. Yatsenko","doi":"10.1080/17445760.2021.1971665","DOIUrl":"https://doi.org/10.1080/17445760.2021.1971665","url":null,"abstract":"ABSTRACT The centralised orchestration technique is often a bad idea for massive parallelism applications if we want to achieve a scalable solution. In this paper for this purpose, the choreography approach is undertaken and some adaptive methods and software tools of distributed implementation are proposed to enhance computation parallelism applied to the optimisation of a class of block-recursive algorithms. A new formal model of distribution and coordination of the tasks in a computing cluster as asynchronous reactive processes with message-passing represented with an actor model and choreography of actors is developed. Also, a new scheme of data placement in a multiprocessor cluster based on prioritisation of block-recursive operations is developed to reduce idling time, data movement time. Adaptive adjustment of the data placement in a cluster at run time to account for current cluster load is developed and an auto-tuning of the actor placement in a cluster based on previous statistics for optimisation is implemented. The experiments show that the choreography of actors allows to remove the central coordinating element, to avoid hard dependencies between cluster nodes, and to achieve a better degree of the parallel applications’ scalability. GRAPHICAL ABSTRACT","PeriodicalId":45411,"journal":{"name":"International Journal of Parallel Emergent and Distributed Systems","volume":"36 1","pages":"565 - 578"},"PeriodicalIF":1.1,"publicationDate":"2021-08-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"48451020","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 1

Influence of the multi-resolution technique on tomographic reconstruction in ultrasound tomography 超声层析成像中多分辨率技术对断层重建的影响

IF 1.1 Q4 COMPUTER SCIENCE, THEORY & METHODS

International Journal of Parallel Emergent and Distributed Systems

Pub Date : 2021-08-22 DOI: 10.1080/17445760.2021.1967350

L. T. Theu, Q. Tran, Vijender Kumar Solanki, Tatiana R. Shemeleva, Duc-Tan Tran

The greatest advantage of scattering theory-based ultrasound tomography (UT) is its ability to investigate small structures. DBIM is the Distorted Born Iterative Method. The nearest neighbour interpolation method is used to enhance the reconstruction performance and reduce the reconstruction time. The raw (N 1 × N 1) and dense (N 2 × N 2) meshed integration areas are reconstructed in NN 1 and NN 2 iterations, respectively. However, choosing the best value of NN 1 to get the highest performance was not mentioned in previous works. If it is not well chosen, the reconstruction quality is even worse than that when using no interpolation. This study proposes a method to enhance the UT reconstruction by using the nearest neighbour interpolation (MR-DBIM). The corresponding algorithms are specified by the graphical concurrent programming language of Sleptsov nets. Some significant results are (1) the MR-DBIM is only meaningful when (i.e. sparse scattering domain); (2) the best performance is obtained in the DBIM when Nt = Nr , but in the MR-DBIM when Nr = 2Nt ; (3) the well-investigated value of NN 1 is 2 when and is 3 when . GRAPHICAL ABSTRACT

基于散射理论的超声层析成像(UT)的最大优点是它能够研究小结构。DBIM是一种畸变Born迭代法。采用最近邻插值法提高了重建性能，缩短了重建时间。原始(N 1 × N 1)和密集(N 2 × N 2)网格积分区域分别在NN 1和NN 2迭代中重构。然而，选择NN 1的最佳值来获得最高的性能在之前的作品中并没有提到。如果选择不好，重建质量甚至比不使用插值时还要差。本研究提出了一种利用最近邻插值(MR-DBIM)增强UT重建的方法。相应的算法由Sleptsov网络的图形并行编程语言指定。一些重要的结果是:(1)MR-DBIM只有在(即稀疏散射域)时才有意义;(2)当Nt = Nr时，DBIM的性能最好，而当Nr = 2Nt时，MR-DBIM的性能最好;(3)当，nnn 1的值为2，当，nnn 1的值为3。图形抽象

{"title":"Influence of the multi-resolution technique on tomographic reconstruction in ultrasound tomography","authors":"L. T. Theu, Q. Tran, Vijender Kumar Solanki, Tatiana R. Shemeleva, Duc-Tan Tran","doi":"10.1080/17445760.2021.1967350","DOIUrl":"https://doi.org/10.1080/17445760.2021.1967350","url":null,"abstract":"The greatest advantage of scattering theory-based ultrasound tomography (UT) is its ability to investigate small structures. DBIM is the Distorted Born Iterative Method. The nearest neighbour interpolation method is used to enhance the reconstruction performance and reduce the reconstruction time. The raw (N 1 × N 1) and dense (N 2 × N 2) meshed integration areas are reconstructed in NN 1 and NN 2 iterations, respectively. However, choosing the best value of NN 1 to get the highest performance was not mentioned in previous works. If it is not well chosen, the reconstruction quality is even worse than that when using no interpolation. This study proposes a method to enhance the UT reconstruction by using the nearest neighbour interpolation (MR-DBIM). The corresponding algorithms are specified by the graphical concurrent programming language of Sleptsov nets. Some significant results are (1) the MR-DBIM is only meaningful when (i.e. sparse scattering domain); (2) the best performance is obtained in the DBIM when Nt = Nr , but in the MR-DBIM when Nr = 2Nt ; (3) the well-investigated value of NN 1 is 2 when and is 3 when . GRAPHICAL ABSTRACT","PeriodicalId":45411,"journal":{"name":"International Journal of Parallel Emergent and Distributed Systems","volume":"36 1","pages":"579 - 593"},"PeriodicalIF":1.1,"publicationDate":"2021-08-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"45990088","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 2

Parallel algorithm development and testing using Petri-object simulation 并行算法开发与Petri-object仿真测试

IF 1.1 Q4 COMPUTER SCIENCE, THEORY & METHODS

International Journal of Parallel Emergent and Distributed Systems

Pub Date : 2021-07-23 DOI: 10.1080/17445760.2021.1955113

I. V. Stetsenko, A. A. Pavlov, Oleksandra Dyfuchyna

Parallel algorithms are problematic to develop because of the negative influence of synchronisation, complicated behaviour of threads’ capturing computing resources. Experimental results show performance time’s strong dependence on algorithm parameters, such as the number of subtasks and the complexity of each task. The optimal value of subtask complexity is revealed for the particular algorithm. It is the same for different complexity of the parallelised task (with the same computing resource). To guarantee algorithm speed-up it is important to have a method for investigating the efficiency of parallel algorithm before its implementation on specified computing resources. Stochastic Petri net potentially could be a high accuracy tool for investigating the efficiency of a parallel algorithm. However, a huge number of elements are needed to compose a model of non-trivial algorithm that limits the application of this tool in practice. Petri-object simulation method allows replication of Petri nets with specified parameters and model creation of a list of linked Petri-objects. Basic templates for the model creation of a multithreaded algorithm are developed. Applying these templates, the model of the parallel discrete event simulation algorithm is developed and investigated. By the model results, the algorithm parameters providing the least performance time can be determined.

由于同步的负面影响、线程捕获计算资源的复杂行为，并行算法的开发存在问题。实验结果表明，性能时间对子任务数和每个任务的复杂度等算法参数有很强的依赖性。揭示了特定算法的子任务复杂度的最优值。对于并行任务的不同复杂性(使用相同的计算资源)，这是相同的。为了保证算法的加速，在并行算法在指定的计算资源上实现之前，有一种方法来研究并行算法的效率是很重要的。随机Petri网有可能成为研究并行算法效率的高精度工具。然而，需要大量的元素来组成一个非平凡算法的模型，这限制了该工具在实践中的应用。Petri对象模拟方法允许复制Petri网与指定的参数和模型创建链Petri对象的列表。开发了多线程算法模型创建的基本模板。应用这些模板，建立并研究了并行离散事件仿真算法的模型。根据模型结果，可以确定提供最小性能时间的算法参数。

{"title":"Parallel algorithm development and testing using Petri-object simulation","authors":"I. V. Stetsenko, A. A. Pavlov, Oleksandra Dyfuchyna","doi":"10.1080/17445760.2021.1955113","DOIUrl":"https://doi.org/10.1080/17445760.2021.1955113","url":null,"abstract":"Parallel algorithms are problematic to develop because of the negative influence of synchronisation, complicated behaviour of threads’ capturing computing resources. Experimental results show performance time’s strong dependence on algorithm parameters, such as the number of subtasks and the complexity of each task. The optimal value of subtask complexity is revealed for the particular algorithm. It is the same for different complexity of the parallelised task (with the same computing resource). To guarantee algorithm speed-up it is important to have a method for investigating the efficiency of parallel algorithm before its implementation on specified computing resources. Stochastic Petri net potentially could be a high accuracy tool for investigating the efficiency of a parallel algorithm. However, a huge number of elements are needed to compose a model of non-trivial algorithm that limits the application of this tool in practice. Petri-object simulation method allows replication of Petri nets with specified parameters and model creation of a list of linked Petri-objects. Basic templates for the model creation of a multithreaded algorithm are developed. Applying these templates, the model of the parallel discrete event simulation algorithm is developed and investigated. By the model results, the algorithm parameters providing the least performance time can be determined.","PeriodicalId":45411,"journal":{"name":"International Journal of Parallel Emergent and Distributed Systems","volume":"36 1","pages":"549 - 564"},"PeriodicalIF":1.1,"publicationDate":"2021-07-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1080/17445760.2021.1955113","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"42215247","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Spatial specification of hypertorus interconnect by infinite and reenterable coloured Petri nets 基于无限可重入有色Petri网的超拓扑互连空间规范

IF 1.1 Q4 COMPUTER SCIENCE, THEORY & METHODS

International Journal of Parallel Emergent and Distributed Systems

Pub Date : 2021-07-20 DOI: 10.1080/17445760.2021.1952580

D. Zaitsev, T. Shmeleva, B. Pröll

Multidimensional torus interconnect finds wide application in modern exascale computing. For models design in high-performance computing, grid and cloud computing, and also systems biology, two basic ways of specifying spatial structures with Petri nets are considered – an infinite Petri net specified by a parametric expression (PE) and a reenterable coloured Petri net (CPN). The paper studies a composition of hypertorus grid models in the form of a PE and a reenterable CPN, their mutual transformations, and unfolding into a place/transition net; the parameters are the number of dimensions and the size of grid. A grid is composed via connection of neighbouring cells by dedicated transitions modelling channels. Reenterable model peculiarities are explained on step-by-step simulation examples. The rules of mutual transformations of Petri net spatial specifications are specified. Comparative investigation of two mentioned forms of spatial specifications is implemented, including analysis techniques and tools. CPNs are convenient for the state space analysis. The main advantage of PEs is the ability to obtain linear invariants and other structural constructs of Petri nets, for instance, siphons and traps, in parametric form that allows us to draw conclusions on Petri net properties for any values of parameters.

多维环面互连在现代exascale计算中有着广泛的应用。对于高性能计算、网格和云计算以及系统生物学中的模型设计，考虑了用Petri网指定空间结构的两种基本方法——由参数表达式（PE）指定的无限Petri网和可重入有色Petri网（CPN）。本文研究了PE和可重入CPN形式的超拓扑网格模型的组成、它们的相互转换以及展开为位置/过渡网；参数是尺寸的数量和网格的大小。网格是通过专用的过渡建模通道通过相邻单元的连接组成的。在逐步模拟的例子中解释了可重现模型的特性。给出了Petri网空间规范相互转换的规则。对上述两种形式的空间规范进行了比较研究，包括分析技术和工具。CPN便于进行状态空间分析。PE的主要优点是能够以参数形式获得线性不变量和Petri网的其他结构构造，例如虹吸管和陷阱，这使我们能够对任何参数值的Petri网性质得出结论。

{"title":"Spatial specification of hypertorus interconnect by infinite and reenterable coloured Petri nets","authors":"D. Zaitsev, T. Shmeleva, B. Pröll","doi":"10.1080/17445760.2021.1952580","DOIUrl":"https://doi.org/10.1080/17445760.2021.1952580","url":null,"abstract":"Multidimensional torus interconnect finds wide application in modern exascale computing. For models design in high-performance computing, grid and cloud computing, and also systems biology, two basic ways of specifying spatial structures with Petri nets are considered – an infinite Petri net specified by a parametric expression (PE) and a reenterable coloured Petri net (CPN). The paper studies a composition of hypertorus grid models in the form of a PE and a reenterable CPN, their mutual transformations, and unfolding into a place/transition net; the parameters are the number of dimensions and the size of grid. A grid is composed via connection of neighbouring cells by dedicated transitions modelling channels. Reenterable model peculiarities are explained on step-by-step simulation examples. The rules of mutual transformations of Petri net spatial specifications are specified. Comparative investigation of two mentioned forms of spatial specifications is implemented, including analysis techniques and tools. CPNs are convenient for the state space analysis. The main advantage of PEs is the ability to obtain linear invariants and other structural constructs of Petri nets, for instance, siphons and traps, in parametric form that allows us to draw conclusions on Petri net properties for any values of parameters.","PeriodicalId":45411,"journal":{"name":"International Journal of Parallel Emergent and Distributed Systems","volume":"37 1","pages":"1 - 21"},"PeriodicalIF":1.1,"publicationDate":"2021-07-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1080/17445760.2021.1952580","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"43572163","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 3

Cyclic connectivity of the data center network 数据中心网络的循环连通性

IF 1.1 Q4 COMPUTER SCIENCE, THEORY & METHODS

International Journal of Parallel Emergent and Distributed Systems

Pub Date : 2021-07-14 DOI: 10.1080/17445760.2021.1952579

Hongzhou Zhu, J. Meng

Let G be a connected graph, F be a subset of , S be a subset of . The cyclic vertex connectivity of G, denoted by , is the minimum cardinality of F such that G−F is disconnected and at least two of its components contain cycles. The cyclic edge connectivity of G, denoted by , is the minimum cardinality of S such that G−S is disconnected and at least two of its components contain cycles. Let denote the data center network. In this paper, we obtain the following results: for ; for , ; for ; for ; for , .

设G是连通图，F是的子集，S是的子集。G的循环顶点连通性，记作，是F的最小基数，使得G−F是不连通的，并且它的至少两个分量包含环。G的循环边连通性，记作，是S的最小基数，使得G−S是不连通的，并且它的至少两个分量包含循环。表示数据中心网络。本文得到如下结果:为;为，;为;为;对于，。

引用次数: 0

Realisation of the double sweep method by using a Sleptsov net 用Sleptsov网络实现双扫描方法

IF 1.1 Q4 COMPUTER SCIENCE, THEORY & METHODS

International Journal of Parallel Emergent and Distributed Systems

Pub Date : 2021-06-27 DOI: 10.1080/17445760.2021.1945054

A. Kostikov, N. D. Zaitsev, O. Subotin

The present article is focused on the application of Sleptsov net (SN) for solving system of linear algebraic equations with a tridiagonal matrix. SN, which implements the double sweep algorithm for solving a system of linear equations with a tridiagonal matrix, is constructed. The work of the constructed network is considered when solving a specific example. The number of operations carried out by the net in performing computations has been calculated and the performance of the parallel double sweep method implemented by the net is compared with the sequential double sweep method. We also consider representation of integer and real numbers by a SN and implementation of basic arithmetic operations over these numbers because the implementation of the algorithm requires floating point operations. GRAPHICAL ABSTRACT

本文主要研究Sleptsov网（SN）在求解具有三对角矩阵的线性代数方程组中的应用。构造了SN，它实现了求解具有三对角矩阵的线性方程组的双扫描算法。在解决一个具体的例子时，会考虑所构建的网络的工作。计算了网络在执行计算时执行的操作次数，并将网络实现的并行双扫描方法的性能与顺序双扫描方法进行了比较。我们还考虑通过SN表示整数和实数，以及对这些数字进行基本算术运算的实现，因为算法的实现需要浮点运算。图形摘要

引用次数: 2

Deep learning on Sleptsov nets Sleptsov网络上的深度学习

IF 1.1 Q4 COMPUTER SCIENCE, THEORY & METHODS

International Journal of Parallel Emergent and Distributed Systems

Pub Date : 2021-06-27 DOI: 10.1080/17445760.2021.1945055

T. Shmeleva, J. Owsinski, A. A. Lawan

Sleptsov nets are applied as a uniform language to specify models of unconventional computations and artificial intelligence systems. A technique for specification of neural networks, including multidimensional and multilayer networks of deep learning approach, using Sleptsov nets, is shown; the ways of specifying basic activation functions by Sleptsov net are discussed, the threshold and sigmoid functions implemented. A methodology of training neural networks is presented with the loss function minimisation, based on a run of a pair of interacting Sleptsov nets, the first net implementing the neural network based on data flow approach, while the second net solves the optimisation task by adjusting the weights of the first net by the gradient descend method. The optimising net uses the earlier developed technology of programming in Sleptsov nets with reverse control flow and the subnet call technique. Real numbers and arrays are represented as markings of a single place of a Sleptsov net. Hyperperformance is achieved because of the possibility of implementing mass parallel computations.

Sleptsov网作为一种统一语言被应用于指定非常规计算和人工智能系统的模型。展示了一种使用Sleptsov网络的神经网络规范技术，包括深度学习方法的多维和多层网络；讨论了Sleptsov网络指定基本激活函数的方法，实现了阈值函数和sigmoid函数。基于一对相互作用的Sleptsov网络的运行，提出了一种具有损失函数最小化的神经网络训练方法，第一个网络基于数据流方法实现神经网络，而第二个网络通过梯度下降方法调整第一个网络的权重来解决优化任务。优化网络使用了Sleptsov网络中早期开发的具有反向控制流的编程技术和子网调用技术。实数和数组表示为Sleptsov网络的单个位置的标记。实现高性能是因为有可能实现大规模并行计算。

引用次数: 1

Choice of parallelism: multi-GPU driven pipeline for huge academic backbone network 并行性的选择:多gpu驱动的管道用于庞大的学术骨干网络

IF 1.1 Q4 COMPUTER SCIENCE, THEORY & METHODS

International Journal of Parallel Emergent and Distributed Systems

Pub Date : 2021-06-24 DOI: 10.1080/17445760.2021.1941009

R. Ando, Y. Kadobayashi, H. Takakura

Science Information Network (SINET) is a Japanese academic backbone network for more than 800 research institutions and universities. In this paper, we present a multi-GPU-driven pipeline for handling huge session data of SINET. Our pipeline consists of ELK stack, multi-GPU server, and Splunk. A multi-GPU server is responsible for two procedures: discrimination and histogramming. Discrimination is dividing session data into ingoing/outgoing with subnet mask calculation and network address matching. Histogramming is grouping ingoing/outgoing session data into bins with map-reduce. In our architecture, we use GPU for the acceleration of ingress/egress discrimination of session data. Also, we use a tiling design pattern for building a two-stage map-reduce of CPU and GPU. Our multi-GPU-driven pipeline has succeeded in processing huge workloads of about 1.2–1.6 billion session streams (500–650 GB) within 24 hours. GRAPHICAL ABSTRACT

科学信息网(SINET)是日本800多所研究机构和大学的学术骨干网络。在本文中，我们提出了一个多gpu驱动的管道来处理SINET的海量会话数据。我们的流水线由ELK堆栈、多gpu服务器和Splunk组成。一个多gpu服务器负责两个程序:判别和直方图。判别是通过子网掩码计算和网络地址匹配将会话数据划分为入/出。直方图是使用map-reduce将入/出会话数据分组到bin中。在我们的架构中，我们使用GPU来加速会话数据的入口/出口识别。此外，我们使用平铺设计模式来构建CPU和GPU的两阶段映射缩减。我们的多gpu驱动管道已经成功地在24小时内处理了大约12 - 16亿个会话流(500-650 GB)的巨大工作负载。图形抽象

引用次数: 2

Byzantine-tolerant uniform node sampling service in large-scale networks 大规模网络中的拜占庭容忍统一节点采样服务

IF 1.1 Q4 COMPUTER SCIENCE, THEORY & METHODS

International Journal of Parallel Emergent and Distributed Systems

Pub Date : 2021-06-20 DOI: 10.1080/17445760.2021.1939873

E. Anceaume, Yann Busnel, B. Sericola

We consider the problem of achieving uniform node sampling in large scale systems in presence of Byzantine nodes. This service offers a single simple primitive that returns, upon invocation, the identifier of a random node that belongs to the system. We first propose an omniscient strategy that processes on the fly an unbounded and arbitrarily biased input stream made of node identifiers exchanged within the system, and outputs a stream that preserves the uniformity property (same probability to appear in the sample). We show that this property holds despite any arbitrary bias introduced by the adversary. We then propose a strategy that is capable of approximating the omniscient strategy without requiring any prior knowledge on the composition of the input stream. We show through both theoretical analysis and extensive simulations that this strategy accurately approximates the omniscient one. We evaluate the resilience of the strategy by studying two representative attacks. We quantify the minimum number of identifiers that Byzantine nodes must insert in the input stream to prevent uniformity. Finally, we propose a new construction in series that allows to both increase the accuracy of a single sketch and decrease the time to converge to a uniform output stream.

我们考虑了在拜占庭节点存在的情况下，在大规模系统中实现统一节点采样的问题。该服务提供了一个简单的原语，该原语在调用时返回属于系统的随机节点的标识符。我们首先提出了一种全知策略，该策略动态处理由系统内交换的节点标识符组成的无边界和任意偏置的输入流，并输出保持一致性特性的流（出现在样本中的概率相同）。我们证明，尽管对手引入了任何任意的偏见，这种性质仍然成立。然后，我们提出了一种能够近似全知策略的策略，而不需要任何关于输入流组成的先验知识。我们通过理论分析和广泛的模拟表明，这种策略准确地接近于无所不知的策略。我们通过研究两种具有代表性的攻击来评估该策略的弹性。我们量化拜占庭节点必须在输入流中插入的标识符的最小数量，以防止一致性。最后，我们提出了一种新的串联结构，既可以提高单个草图的精度，又可以减少收敛到均匀输出流的时间。

{"title":"Byzantine-tolerant uniform node sampling service in large-scale networks","authors":"E. Anceaume, Yann Busnel, B. Sericola","doi":"10.1080/17445760.2021.1939873","DOIUrl":"https://doi.org/10.1080/17445760.2021.1939873","url":null,"abstract":"We consider the problem of achieving uniform node sampling in large scale systems in presence of Byzantine nodes. This service offers a single simple primitive that returns, upon invocation, the identifier of a random node that belongs to the system. We first propose an omniscient strategy that processes on the fly an unbounded and arbitrarily biased input stream made of node identifiers exchanged within the system, and outputs a stream that preserves the uniformity property (same probability to appear in the sample). We show that this property holds despite any arbitrary bias introduced by the adversary. We then propose a strategy that is capable of approximating the omniscient strategy without requiring any prior knowledge on the composition of the input stream. We show through both theoretical analysis and extensive simulations that this strategy accurately approximates the omniscient one. We evaluate the resilience of the strategy by studying two representative attacks. We quantify the minimum number of identifiers that Byzantine nodes must insert in the input stream to prevent uniformity. Finally, we propose a new construction in series that allows to both increase the accuracy of a single sketch and decrease the time to converge to a uniform output stream.","PeriodicalId":45411,"journal":{"name":"International Journal of Parallel Emergent and Distributed Systems","volume":"36 1","pages":"412 - 439"},"PeriodicalIF":1.1,"publicationDate":"2021-06-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1080/17445760.2021.1939873","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"48862274","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 1

首页上一页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

International Journal of Parallel Emergent and Distributed Systems

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀