Machine Learning to Design an Auto-tuning System for the Best Compressed Format Detection for Parallel Sparse Computations
Pub Date: 2021-11-16, DOI: 10.1142/s0129626421500195
O. Hamdi-Larbi, Ichrak Mehrez, T. Dufaud
Many applications in scientific computing process very large sparse matrices on parallel architectures. The work presented in this paper is part of a project whose general aim is to develop an auto-tuning system for selecting the best matrix compression format in the context of high-performance computing. The target smart system can automatically select the best compression format for a given sparse matrix, a numerical method processing this matrix, a parallel programming model and a target architecture. This paper describes the design and implementation of the proposed concept. We consider a case study consisting of a numerical method reduced to the sparse matrix-vector product (SpMV), a set of compression formats, the data-parallel programming model, and a distributed multi-core platform as the target architecture. This study allows us to extract a set of important novel metrics and parameters relative to the considered programming model. Our metrics are used as input to a machine-learning algorithm that predicts the best matrix compression format. An experimental study targeting a distributed multi-core platform and processing random and real-world matrices shows that our system can improve the accuracy of the machine-learning prediction by up to 7% on average.
{"title":"Machine Learning to Design an Auto-tuning System for the Best Compressed Format Detection for Parallel Sparse Computations","authors":"O. Hamdi-Larbi, Ichrak Mehrez, T. Dufaud","doi":"10.1142/s0129626421500195","DOIUrl":"https://doi.org/10.1142/s0129626421500195","url":null,"abstract":"Many applications in scientific computing process very large sparse matrices on parallel architectures. The presented work in this paper is a part of a project where our general aim is to develop an auto-tuner system for the selection of the best matrix compression format in the context of high-performance computing. The target smart system can automatically select the best compression format for a given sparse matrix, a numerical method processing this matrix, a parallel programming model and a target architecture. Hence, this paper describes the design and implementation of the proposed concept. We consider a case study consisting of a numerical method reduced to the sparse matrix vector product (SpMV), some compression formats, the data parallel as a programming model and, a distributed multi-core platform as a target architecture. This study allows extracting a set of important novel metrics and parameters which are relative to the considered programming model. Our metrics are used as input to a machine-learning algorithm to predict the best matrix compression format. An experimental study targeting a distributed multi-core platform and processing random and real-world matrices shows that our system can improve in average up to 7% the accuracy of the machine learning.","PeriodicalId":422436,"journal":{"name":"Parallel Process. Lett.","volume":"3 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-11-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124163348","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Beyond Rings: Gathering in 1-Interval Connected Graphs
Pub Date: 2021-11-11, DOI: 10.1142/s0129626421500201
O. Michail, P. Spirakis, Michail Theofilatos
We examine the problem of gathering [Formula: see text] agents (or multi-agent rendezvous) in dynamic graphs that may change in every round. We consider a variant of the [Formula: see text]-interval connectivity model [9] in which all instances (snapshots) are always connected spanning subgraphs of an underlying graph, not necessarily a clique. The agents are identical, are not equipped with explicit communication capabilities, and are initially arbitrarily positioned on the graph. The problem is for the agents to gather at the same node, which is not fixed in advance. We first show that the problem becomes impossible to solve if the underlying graph has a cycle. In light of this, we study a relaxed version of the problem, called weak gathering, in which the agents are allowed to gather either at the same node or at two adjacent nodes. Our goal is to characterize the class of 1-interval connected graphs and initial configurations in which the problem is solvable, both with and without homebases. On the negative side, we show that when the underlying graph contains a spanning bicyclic subgraph and satisfies an additional connectivity property, weak gathering is unsolvable; we therefore concentrate mainly on unicyclic graphs. As we show, in most instances of initial agent configurations the agents must meet on the cycle. This adds an additional difficulty to the problem, as they need to explore the graph and recognize the nodes that form the cycle. We provide a deterministic algorithm for the solvable cases of this problem that runs in [Formula: see text] rounds.
{"title":"Beyond Rings: Gathering in 1-Interval Connected Graphs","authors":"O. Michail, P. Spirakis, Michail Theofilatos","doi":"10.1142/s0129626421500201","DOIUrl":"https://doi.org/10.1142/s0129626421500201","url":null,"abstract":"We examine the problem of gathering [Formula: see text] agents (or multi-agent rendezvous) in dynamic graphs which may change in every round. We consider a variant of the [Formula: see text]-interval connectivity model [9] in which all instances (snapshots) are always connected spanning subgraphs of an underlying graph, not necessarily a clique. The agents are identical and not equipped with explicit communication capabilities, and are initially arbitrarily positioned on the graph. The problem is for the agents to gather at the same node, not fixed in advance. We first show that the problem becomes impossible to solve if the underlying graph has a cycle. In light of this, we study a relaxed version of this problem, called weak gathering, where the agents are allowed to gather either at the same node, or at two adjacent nodes. Our goal is to characterize the class of 1-interval connected graphs and initial configurations in which the problem is solvable, both with and without homebases. On the negative side we show that when the underlying graph contains a spanning bicyclic subgraph and satisfies an additional connectivity property, weak gathering is unsolvable, thus we concentrate mainly on unicyclic graphs. As we show, in most instances of initial agent configurations, the agents must meet on the cycle. This adds an additional difficulty to the problem, as they need to explore the graph and recognize the nodes that form the cycle. We provide a deterministic algorithm for the solvable cases of this problem that runs in [Formula: see text] number of rounds.","PeriodicalId":422436,"journal":{"name":"Parallel Process. Lett.","volume":"38 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-11-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132521463","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Reliability Evaluation of Bicube-Based Multiprocessor System under the g-Good-Neighbor Restriction
Pub Date: 2021-11-11, DOI: 10.1142/s0129626421500183
Jiafei Liu, Shuming Zhou, E. Cheng, Gaolin Chen, Min Li
Multiprocessor systems are commonly deployed for big data analysis because of the evolution of technologies such as cloud computing, IoT and social networks. Reliability evaluation is of significant importance for maintaining and improving the fault tolerance of multiprocessor systems, and system-level diagnosis is a primary strategy for identifying faulty processors in such systems. In this paper, we first determine the [Formula: see text]-good-neighbor connectivity of the [Formula: see text]-dimensional Bicube-based multiprocessor system [Formula: see text], a novel variant of the hypercube. We then establish the [Formula: see text]-good-neighbor diagnosability of the Bicube-based multiprocessor system [Formula: see text] under the PMC and MM* models.
{"title":"Reliability Evaluation of Bicube-Based Multiprocessor System under the g-Good-Neighbor Restriction","authors":"Jiafei Liu, Shuming Zhou, E. Cheng, Gaolin Chen, Min Li","doi":"10.1142/s0129626421500183","DOIUrl":"https://doi.org/10.1142/s0129626421500183","url":null,"abstract":"Multiprocessor systems are commonly deployed for big data analysis because of evolution in technologies such as cloud computing, IoT, social network and so on. Reliability evaluation is of significant importance for maintenance and improvement of fault tolerance for multiprocessor systems, and system-level diagnosis is a primary strategy to identify the faulty processors in the systems. In this paper, we first determine the [Formula: see text]-good-neighbor connectivity of the [Formula: see text]-dimensional Bicube-based multiprocessor system [Formula: see text], a novel variant of hypercube. Besides, we establish the [Formula: see text]-good-neighbor diagnosability of the Bicube-based multiprocessor system [Formula: see text] under the PMC and MM* models.","PeriodicalId":422436,"journal":{"name":"Parallel Process. Lett.","volume":"355 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-11-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122997331","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Abnormal Quantum State Search Based on Parallel Phase Comparison
Pub Date: 2021-11-09, DOI: 10.1142/s0129626421500225
Guanlei Xu, Xiaogang Xu, Xiaotong Wang
We discuss the problem of filtering out abnormal states from a large number of quantum states. For this type of problem with [Formula: see text] items to be searched, both traditional search by enumeration and the classical Grover search algorithm have complexity of about [Formula: see text]. In this letter, a novel quantum search scheme with exponential speedup is proposed for abnormal states. First, a new comprehensive quantum operator is designed to extract, with probability 1 and complexity [Formula: see text], the superposition state containing all abnormal states, whose number [Formula: see text] is unknown, via carefully designed parallel phase comparison. Then, each abnormal state is obtained from the [Formula: see text] abnormal states via [Formula: see text] measurements. Finally, a numerical example is given to show the efficiency of the proposed scheme.
{"title":"Abnormal Quantum State Search Based on Parallel Phase Comparison","authors":"Guanlei Xu, Xiaogang Xu, Xiaotong Wang","doi":"10.1142/s0129626421500225","DOIUrl":"https://doi.org/10.1142/s0129626421500225","url":null,"abstract":"We discuss the problem of filtering out abnormal states from a larger number of quantum states. For this type of problem with [Formula: see text] items to be searched, both the traditional search by enumeration and classical Grover search algorithm have the complexity about [Formula: see text]. In this letter a novel quantum search scheme with exponential speed up is proposed for abnormal states. First, a new comprehensive quantum operator is well-designed to extract the superposition state containing all abnormal states with unknown number [Formula: see text] with complexity [Formula: see text] in probability 1 via well-designed parallel phase comparison. Then, every abnormal state is achieved respectively from [Formula: see text] abnormal states via [Formula: see text] times’ measurement. Finally, a numerical example is given to show the efficiency of the proposed scheme.","PeriodicalId":422436,"journal":{"name":"Parallel Process. Lett.","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-11-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117023457","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Fault Detection Method of CNC Machine Tool Based on Wavelet Transform
Pub Date: 2021-10-20, DOI: 10.1142/s0129626421410012
Junying Liu
To overcome the low detection accuracy and long detection time of traditional fault detection methods for CNC machine tools, a new fault detection method for CNC machine tools based on the wavelet transform is proposed in this paper. To improve the effectiveness of running-fault detection, a wavelet transform method is used to extract features from the running-fault signals of CNC machine tools. Based on the feature extraction results, the convolution calculation of the continuous wavelet transform is used to complete fault detection according to the scale of the fault signal. The experimental results show that, compared with traditional fault detection methods, the detection accuracy and efficiency of this method are significantly better: the highest detection accuracy is 97%, and the lowest detection time is only 1.1 s.
{"title":"Fault Detection Method of CNC Machine Tool Based on Wavelet Transform","authors":"Junying Liu","doi":"10.1142/s0129626421410012","DOIUrl":"https://doi.org/10.1142/s0129626421410012","url":null,"abstract":"In order to overcome the problems of low detection accuracy and long detection time of traditional fault detection methods for CNC machine tools, a new fault detection method for CNC machine tools based on wavelet transform is proposed in this paper. In order to improve the effectiveness of running fault detection of CNC machine tools, a wavelet transform method is used to extract the features of the running fault signals of CNC machine tools. According to the feature extraction results, the convolution calculation of the continuous wavelet transform is used to complete the fault detection of CNC machine tool according to the scale result of fault signal. The experimental results show that, compared with traditional fault detection methods, the detection accuracy and efficiency of this method is significantly better: the highest detection accuracy is 97%, and the lowest detection time is only 1.1s.","PeriodicalId":422436,"journal":{"name":"Parallel Process. Lett.","volume":"83 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-10-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116241319","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Design and Implementation of Low Power and Area Efficient Architecture for High Performance ALU
Pub Date: 2021-10-19, DOI: 10.1142/s0129626421500171
U. Penchalaiah, V. S. Kumar
Digital Signal Processors (DSPs) have a ubiquitous presence in almost all civil and military signal processing applications, including mission-critical environments such as nuclear reactors and process control. The Arithmetic and Logic Unit (ALU), being the heart of any digital signal processor, plays a critical and decisive role in achieving the required parameter benchmarks and the overall efficiency and robustness of the processor. State-of-the-art research has shown successful traction with the performance requirements of critical Multiply-Accumulate (MAC) parameters, such as reduced power consumption, a small silicon footprint, and reduced delay with the associated design complexity. Judicious placement of its building blocks, namely the truncated multiplier and the half-sum carry generation/sum carry generation (HSCG-SCG) adder, in the architectural design of the ALU, together with the choice of adder and multiplier circuits, are the core decisions that determine the overall performance of the ALU. To overcome the drawbacks of existing designs and further improve performance, this work proposes a new architecture for the square-root (SQRT) carry-select adder (CSLA) using half-sum generation (HSG), half-carry generation (HCG), full-sum generation (FSG) and full-carry generation (FCG) blocks. The proposed design is an N-bit architecture, and comparative results are reported for 8-bit, 16-bit and 32-bit configurations. All designs are implemented in the Xilinx ISE environment, and the results show better area, power, and delay performance compared with state-of-the-art methods.
{"title":"Design and Implementation of Low Power and Area Efficient Architecture for High Performance ALU","authors":"U. Penchalaiah, V. S. Kumar","doi":"10.1142/s0129626421500171","DOIUrl":"https://doi.org/10.1142/s0129626421500171","url":null,"abstract":"Digital Signal Processors (DSP) have a ubiquitous presence in almost all civil and military signal processing applications, including mission critical environments like nuclear reactors, process control etc. Arithmetic and Logic units (ALU), being the heart of any digital signal processor, play critical and decisive roles in achieving the required parameter benchmarks and the overall efficiency and robustness of the digital signal processor. State of the art research has shown successful traction with the performance requirements of critical Multiply-Accumulate (MAC) parameters, like reduced power consumption, small electronic real estate footprint and reduction in delay with the associated design complexity. Judicious placement of its building blocks, namely, the truncated multiplier and half-sum carry generation-sum carry generation (HSCG-SCG) adder in the architectural design of ALU and the type of adder and multiplier circuits selected are the core decisions that decide the overall performance of the ALU. To overcome the drawback and to improve the performance further, this work proposes a new architecture for the square root (SQRT) carry select adder (CSLA) using half-sum generation (HSG), half-carry generation (HCG), full-sum generation (FSG) and full-carry generation (FCG) blocks. The proposed design contains N-bit architecture, and comparative results are considered for 8-bit, 16-bit and 32-bit combinations. All the designs are implemented in the Xilinx ISE environment and the results show that better area, power, and delay performance compared to the state of art methods.","PeriodicalId":422436,"journal":{"name":"Parallel Process. Lett.","volume":"2015 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-10-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125895329","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
An Efficient Non-Parametric Statistical Test for Assessing Some Treatment Methods of Clinical Data
Pub Date: 2021-09-24, DOI: 10.1142/s0129626421420019
Mahmoud Mansour, Mohamed Aboshady
The recent rapid spread of deadly epidemics has precipitated an urgent need to speed up the development of different treatments, as well as methods of evaluating their efficacy. The first step is the collection of data on the cure rate in groups of patients who have received different treatments. As most of the available data in these cases is random, it is the role of statisticians to provide efficient statistical tests to assess the treatment methods through the data. This research aims to provide a new, highly efficient statistical test that reaches the right decision with accurate results as quickly as possible, using parallel computing algorithms in Wolfram Mathematica.
{"title":"An Efficient Non-Parametric Statistical Test for Assessing Some Treatment Methods of Clinical Data","authors":"Mahmoud Mansour, Mohamed Aboshady","doi":"10.1142/s0129626421420019","DOIUrl":"https://doi.org/10.1142/s0129626421420019","url":null,"abstract":"The recent rapid spread of deadly epidemics have precipitated an urgent need to speed up the development of different treatments, as well as methods of evaluating their efficacy. The first step towards this is the collection of data relating to the cure rate in groups of patients who have had different treatments applied to them. As most of the available data in these cases is random, it is now the role of statisticians to provide efficient statistical tests to assess the treatment methods through the data. This research aims to provide a new statistical test with high efficiency to reach the right decision with accurate results as quickly as possible using parallel computing algorithms through Wolfram Mathematica software.","PeriodicalId":422436,"journal":{"name":"Parallel Process. Lett.","volume":"9 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-09-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128043291","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Parallel Network Analysis and Communities Detection (PANC) Pipeline for the Analysis and Visualization of COVID-19 Data
Pub Date: 2021-09-22, DOI: 10.1142/s0129626421420020
Giuseppe Agapito, Marianna Milano, M. Cannataro
A new coronavirus, causing a severe acute respiratory syndrome (COVID-19), emerged in Wuhan, China, in December 2019. The epidemic rapidly spread across the world, becoming a pandemic that, as of today, has affected more than 70 million people and caused over 2 million deaths. To better understand the evolution of the spread of the COVID-19 pandemic, we developed PANC (Parallel Network Analysis and Communities Detection), a new parallel preprocessing methodology for network-based analysis and community detection on Italian COVID-19 data. The goal of the methodology is to analyze a set of homogeneous datasets (i.e. COVID-19 data for several regions) using a statistical test to find similar/dissimilar behaviours, map the similarity information onto a graph, and then use a community detection algorithm to visualize and analyze the initial dataset. The methodology comprises the following steps: (i) a parallel method to build similarity matrices that represent regions that are similar or dissimilar with respect to the data; (ii) an effective workload-balancing function to improve performance; (iii) the mapping of similarity matrices into networks where nodes represent Italian regions and edges represent similarity relationships; (iv) the discovery and visualization of communities of regions that show similar behaviour. The methodology is general and can be applied to worldwide COVID-19 data, as well as to any dataset in tabular or matrix format. To estimate scalability with increasing workloads, we analyzed three synthetic COVID-19 datasets of 90.0[Formula: see text]MB, 180.0[Formula: see text]MB, and 360.0[Formula: see text]MB. Experiments show that the amount of data that can be analyzed in a given amount of time increases almost linearly with the number of available computing resources. For community detection, we employed the real dataset.
{"title":"Parallel Network Analysis and Communities Detection (PANC) Pipeline for the Analysis and Visualization of COVID-19 Data","authors":"Giuseppe Agapito, Marianna Milano, M. Cannataro","doi":"10.1142/s0129626421420020","DOIUrl":"https://doi.org/10.1142/s0129626421420020","url":null,"abstract":"A new coronavirus, causing a severe acute respiratory syndrome (COVID-19), was started at Wuhan, China, in December 2019. The epidemic has rapidly spread across the world becoming a pandemic that, as of today, has affected more than 70 million people causing over 2 million deaths. To better understand the evolution of spread of the COVID-19 pandemic, we developed PANC (Parallel Network Analysis and Communities Detection), a new parallel preprocessing methodology for network-based analysis and communities detection on Italian COVID-19 data. The goal of the methodology is to analyze set of homogeneous datasets (i.e. COVID-19 data in several regions) using a statistical test to find similar/dissimilar behaviours, mapping such similarity information on a graph and then using community detection algorithm to visualize and analyze the initial dataset. The methodology includes the following steps: (i) a parallel methodology to build similarity matrices that represent similar or dissimilar regions with respect to data; (ii) an effective workload balancing function to improve performance; (iii) the mapping of similarity matrices into networks where nodes represent Italian regions, and edges represent similarity relationships; (iv) the discovering and visualization of communities of regions that show similar behaviour. The methodology is general and can be applied to world-wide data about COVID-19, as well as to all types of data sets in tabular and matrix format. To estimate the scalability with increasing workloads, we analyzed three synthetic COVID-19 datasets with the size of 90.0[Formula: see text]MB, 180.0[Formula: see text]MB, and 360.0[Formula: see text]MB. Experiments was performed on showing the amount of data that can be analyzed in a given amount of time increases almost linearly with the number of computing resources available. Instead, to perform communities detection, we employed the real data set.","PeriodicalId":422436,"journal":{"name":"Parallel Process. Lett.","volume":"81 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-09-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133715858","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
On the 3-Extra Connectivity of Enhanced Hypercubes
Pub Date: 2021-09-16, DOI: 10.1142/s012962642150016x
Liyang Zhai, Liqiong Xu, Shanshan Yin
Reliability evaluation of interconnection networks is of significant importance to their design and maintenance. The extra connectivity is an important parameter for the reliability evaluation of interconnection networks. Given a graph [Formula: see text] and a positive integer [Formula: see text], the [Formula: see text]-extra connectivity, denoted by [Formula: see text], is the minimum cardinality of a set of vertices in [Formula: see text], if one exists, whose deletion disconnects [Formula: see text] and leaves each remaining component with at least [Formula: see text] vertices. In this paper, we show that the 3-extra connectivity of the [Formula: see text]-enhanced hypercube is [Formula: see text] for [Formula: see text] and [Formula: see text]. Some previous results in [IEEE Trans. Comput. 63 (2014) 1594–1600] and [Theor. Comput. Sci. 799 (2019) 22–31] are extended.
{"title":"On the 3-Extra Connectivity of Enhanced Hypercubes","authors":"Liyang Zhai, Liqiong Xu, Shanshan Yin","doi":"10.1142/s012962642150016x","DOIUrl":"https://doi.org/10.1142/s012962642150016x","url":null,"abstract":"Reliability evaluation of interconnection networks is of significant importance to the design and maintenance of interconnection networks. The extra connectivity is an important parameter for the reliability evaluation of interconnection networks. Given a graph [Formula: see text] and a positive integer [Formula: see text], the [Formula: see text]-extra connectivity, denoted by [Formula: see text], is the minimum cardinality of a set of vertices in [Formula: see text], if exists, whose deletion disconnects [Formula: see text] and leaves each remaining component with at least [Formula: see text] vertices. In this paper, we show that the 3-extra connectivity of the [Formula: see text]-enhanced hypercube is [Formula: see text] for [Formula: see text] and [Formula: see text]. Some previous results in [IEEE Trans. Comput. 63 (2014) 1594–1600] and [Theor. Comput. Sci. 799 (2019) 22–31] are extended.","PeriodicalId":422436,"journal":{"name":"Parallel Process. Lett.","volume":"52 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-09-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126615963","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
OpenMP Implementation of Parallel Longest Common Subsequence Algorithm for Mathematical Expression Retrieval
Pub Date: 2021-06-01, DOI: 10.1142/S0129626421500079
Pavan Kumar Perepu
Given a mathematical expression in LaTeX or MathML format, a retrieval algorithm extracts similar expressions from a database. In our previous work, we used the Longest Common Subsequence (LCS) algorithm to match two expressions of lengths [Formula: see text] and [Formula: see text], which takes [Formula: see text] time. If there are [Formula: see text] database expressions, the total complexity is [Formula: see text], and an increase in [Formula: see text] also increases this complexity. In the present work, we propose to use a parallel LCS algorithm in our retrieval process. Parallel LCS has [Formula: see text] time complexity with [Formula: see text] processors, and the total complexity can be reduced to [Formula: see text]. For our experiments, an OpenMP-based implementation was used on an Intel [Formula: see text] processor with 4 cores. However, for smaller expressions the parallel version takes more time, as the implementation overhead dominates the algorithmic improvement. We therefore propose to use the parallel version selectively, only on larger expressions, in our retrieval algorithm to achieve better performance. We compared the sequential and parallel versions of our ME retrieval algorithm, and performance results are reported on a database of 829 mathematical expressions.
{"title":"OpenMP Implementation of Parallel Longest Common Subsequence Algorithm for Mathematical Expression Retrieval","authors":"Pavan Kumar Perepu","doi":"10.1142/S0129626421500079","DOIUrl":"https://doi.org/10.1142/S0129626421500079","url":null,"abstract":"Given a mathematical expression in LaTeX or MathML format, retrieval algorithm extracts similar expressions from a database. In our previous work, we have used Longest Common Subsequence (LCS) algorithm to match two expressions of lengths, [Formula: see text] and [Formula: see text], which takes [Formula: see text] time complexity. If there are [Formula: see text] database expressions, total complexity is [Formula: see text], and an increase in [Formula: see text] also increases this complexity. In the present work, we propose to use parallel LCS algorithm in our retrieval process. Parallel LCS has [Formula: see text] time complexity with [Formula: see text] processors and total complexity can be reduced to [Formula: see text]. For our experimentation, OpenMP based implementation has been used on Intel [Formula: see text] processor with 4 cores. However, for smaller expressions, parallel version takes more time as the implementation overhead dominates the algorithmic improvement. As such, we have proposed to use parallel version, selectively, only on larger expressions, in our retrieval algorithm to achieve better performance. We have compared the sequential and parallel versions of our ME retrieval algorithm, and the performance results have been reported on a database of 829 mathematical expressions.","PeriodicalId":422436,"journal":{"name":"Parallel Process. Lett.","volume":"19 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121700870","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}