{"title":"维度扩展蜻蜓互联网络的深入研究","authors":"Yaodong Wang, Yamin Li","doi":"10.1002/cpe.8286","DOIUrl":null,"url":null,"abstract":"<div>\n \n <p>Dragonfly topology is a commonly utilized design for interconnection networks in parallel and distributed systems. A classical dragonfly can be denoted as dragonfly(<span></span><math>\n <semantics>\n <mrow>\n <mi>k</mi>\n </mrow>\n <annotation>$$ k $$</annotation>\n </semantics></math>,<span></span><math>\n <semantics>\n <mrow>\n <mi>m</mi>\n </mrow>\n <annotation>$$ m $$</annotation>\n </semantics></math>,<span></span><math>\n <semantics>\n <mrow>\n <mi>l</mi>\n </mrow>\n <annotation>$$ l $$</annotation>\n </semantics></math>), where<span></span><math>\n <semantics>\n <mrow>\n <mi>m</mi>\n </mrow>\n <annotation>$$ m $$</annotation>\n </semantics></math> is the number of routers in a group,<span></span><math>\n <semantics>\n <mrow>\n <mi>l</mi>\n </mrow>\n <annotation>$$ l $$</annotation>\n </semantics></math> is the number of links per router connected to other groups, and<span></span><math>\n <semantics>\n <mrow>\n <mi>k</mi>\n </mrow>\n <annotation>$$ k $$</annotation>\n </semantics></math> is the number of links per router connected to compute nodes. Each router has other<span></span><math>\n <semantics>\n <mrow>\n <mi>m</mi>\n <mo>−</mo>\n <mn>1</mn>\n </mrow>\n <annotation>$$ m-1 $$</annotation>\n </semantics></math> links fully connected to other<span></span><math>\n <semantics>\n <mrow>\n <mi>m</mi>\n <mo>−</mo>\n <mn>1</mn>\n </mrow>\n <annotation>$$ m-1 $$</annotation>\n </semantics></math> routers within a group. Each group has<span></span><math>\n <semantics>\n <mrow>\n <mi>m</mi>\n <mi>l</mi>\n </mrow>\n <annotation>$$ ml $$</annotation>\n </semantics></math> links connected to other groups. The groups are also fully connected, therefore there are<span></span><math>\n <semantics>\n <mrow>\n <mi>m</mi>\n <mi>l</mi>\n <mo>+</mo>\n <mn>1</mn>\n </mrow>\n <annotation>$$ ml+1 $$</annotation>\n </semantics></math> groups in total. The router radix in a dragonfly(<span></span><math>\n <semantics>\n <mrow>\n <mi>k</mi>\n </mrow>\n <annotation>$$ k $$</annotation>\n </semantics></math>,<span></span><math>\n <semantics>\n <mrow>\n <mi>m</mi>\n </mrow>\n <annotation>$$ m $$</annotation>\n </semantics></math>,<span></span><math>\n <semantics>\n <mrow>\n <mi>l</mi>\n </mrow>\n <annotation>$$ l $$</annotation>\n </semantics></math>) is<span></span><math>\n <semantics>\n <mrow>\n <mi>l</mi>\n <mo>+</mo>\n <mi>k</mi>\n <mo>+</mo>\n <mi>m</mi>\n <mo>−</mo>\n <mn>1</mn>\n </mrow>\n <annotation>$$ l+k+m-1 $$</annotation>\n </semantics></math>. Building a large dragonfly system requires a large number of high-radix routers, increasing hardware costs. To reduce hardware costs, this paper proposes a more flexible topology called dimension-extended dragonfly (DED). Rather than routers in a group being fully connected, each router in a group is arranged in an<span></span><math>\n <semantics>\n <mrow>\n <mi>n</mi>\n </mrow>\n <annotation>$$ n $$</annotation>\n </semantics></math>-dimensional matrix, and routers of the same dimension are fully connected. We use<span></span><math>\n <semantics>\n <mrow>\n <mi>n</mi>\n </mrow>\n <annotation>$$ n $$</annotation>\n </semantics></math> to denote the dimension such that each group in the DED has<span></span><math>\n <semantics>\n <mrow>\n <msup>\n <mrow>\n <mi>m</mi>\n </mrow>\n <mrow>\n <mi>n</mi>\n </mrow>\n </msup>\n </mrow>\n <annotation>$$ {m}^n $$</annotation>\n </semantics></math> routers. This study comprehensively evaluates DED in terms of cost, performance, fault tolerance, and packet latency. The findings show that DED provides a more economical hardware solution compared to traditional Dragonfly and Cascade topologies, especially for<span></span><math>\n <semantics>\n <mrow>\n <mi>n</mi>\n <mo>≥</mo>\n <mn>3</mn>\n </mrow>\n <annotation>$$ n\\ge 3 $$</annotation>\n </semantics></math>. Beyond cost-efficiency, DED enhances system design flexibility. It offers diverse possibilities for system scaling through different combinations of diameter and radix, giving system architects more adaptable options. To further enhance the versatility of DED, three disjoint path routing algorithms are proposed and their fault tolerance is evaluated through simulation. The simulation results also show that the packet latency of DED is lower than dragonfly and cascade.</p>\n </div>","PeriodicalId":55214,"journal":{"name":"Concurrency and Computation-Practice & Experience","volume":"36 27","pages":""},"PeriodicalIF":1.5000,"publicationDate":"2024-09-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"An in-depth study of dimension-extended dragonfly interconnection network\",\"authors\":\"Yaodong Wang, Yamin Li\",\"doi\":\"10.1002/cpe.8286\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div>\\n \\n <p>Dragonfly topology is a commonly utilized design for interconnection networks in parallel and distributed systems. A classical dragonfly can be denoted as dragonfly(<span></span><math>\\n <semantics>\\n <mrow>\\n <mi>k</mi>\\n </mrow>\\n <annotation>$$ k $$</annotation>\\n </semantics></math>,<span></span><math>\\n <semantics>\\n <mrow>\\n <mi>m</mi>\\n </mrow>\\n <annotation>$$ m $$</annotation>\\n </semantics></math>,<span></span><math>\\n <semantics>\\n <mrow>\\n <mi>l</mi>\\n </mrow>\\n <annotation>$$ l $$</annotation>\\n </semantics></math>), where<span></span><math>\\n <semantics>\\n <mrow>\\n <mi>m</mi>\\n </mrow>\\n <annotation>$$ m $$</annotation>\\n </semantics></math> is the number of routers in a group,<span></span><math>\\n <semantics>\\n <mrow>\\n <mi>l</mi>\\n </mrow>\\n <annotation>$$ l $$</annotation>\\n </semantics></math> is the number of links per router connected to other groups, and<span></span><math>\\n <semantics>\\n <mrow>\\n <mi>k</mi>\\n </mrow>\\n <annotation>$$ k $$</annotation>\\n </semantics></math> is the number of links per router connected to compute nodes. Each router has other<span></span><math>\\n <semantics>\\n <mrow>\\n <mi>m</mi>\\n <mo>−</mo>\\n <mn>1</mn>\\n </mrow>\\n <annotation>$$ m-1 $$</annotation>\\n </semantics></math> links fully connected to other<span></span><math>\\n <semantics>\\n <mrow>\\n <mi>m</mi>\\n <mo>−</mo>\\n <mn>1</mn>\\n </mrow>\\n <annotation>$$ m-1 $$</annotation>\\n </semantics></math> routers within a group. Each group has<span></span><math>\\n <semantics>\\n <mrow>\\n <mi>m</mi>\\n <mi>l</mi>\\n </mrow>\\n <annotation>$$ ml $$</annotation>\\n </semantics></math> links connected to other groups. The groups are also fully connected, therefore there are<span></span><math>\\n <semantics>\\n <mrow>\\n <mi>m</mi>\\n <mi>l</mi>\\n <mo>+</mo>\\n <mn>1</mn>\\n </mrow>\\n <annotation>$$ ml+1 $$</annotation>\\n </semantics></math> groups in total. The router radix in a dragonfly(<span></span><math>\\n <semantics>\\n <mrow>\\n <mi>k</mi>\\n </mrow>\\n <annotation>$$ k $$</annotation>\\n </semantics></math>,<span></span><math>\\n <semantics>\\n <mrow>\\n <mi>m</mi>\\n </mrow>\\n <annotation>$$ m $$</annotation>\\n </semantics></math>,<span></span><math>\\n <semantics>\\n <mrow>\\n <mi>l</mi>\\n </mrow>\\n <annotation>$$ l $$</annotation>\\n </semantics></math>) is<span></span><math>\\n <semantics>\\n <mrow>\\n <mi>l</mi>\\n <mo>+</mo>\\n <mi>k</mi>\\n <mo>+</mo>\\n <mi>m</mi>\\n <mo>−</mo>\\n <mn>1</mn>\\n </mrow>\\n <annotation>$$ l+k+m-1 $$</annotation>\\n </semantics></math>. Building a large dragonfly system requires a large number of high-radix routers, increasing hardware costs. To reduce hardware costs, this paper proposes a more flexible topology called dimension-extended dragonfly (DED). Rather than routers in a group being fully connected, each router in a group is arranged in an<span></span><math>\\n <semantics>\\n <mrow>\\n <mi>n</mi>\\n </mrow>\\n <annotation>$$ n $$</annotation>\\n </semantics></math>-dimensional matrix, and routers of the same dimension are fully connected. We use<span></span><math>\\n <semantics>\\n <mrow>\\n <mi>n</mi>\\n </mrow>\\n <annotation>$$ n $$</annotation>\\n </semantics></math> to denote the dimension such that each group in the DED has<span></span><math>\\n <semantics>\\n <mrow>\\n <msup>\\n <mrow>\\n <mi>m</mi>\\n </mrow>\\n <mrow>\\n <mi>n</mi>\\n </mrow>\\n </msup>\\n </mrow>\\n <annotation>$$ {m}^n $$</annotation>\\n </semantics></math> routers. This study comprehensively evaluates DED in terms of cost, performance, fault tolerance, and packet latency. The findings show that DED provides a more economical hardware solution compared to traditional Dragonfly and Cascade topologies, especially for<span></span><math>\\n <semantics>\\n <mrow>\\n <mi>n</mi>\\n <mo>≥</mo>\\n <mn>3</mn>\\n </mrow>\\n <annotation>$$ n\\\\ge 3 $$</annotation>\\n </semantics></math>. Beyond cost-efficiency, DED enhances system design flexibility. It offers diverse possibilities for system scaling through different combinations of diameter and radix, giving system architects more adaptable options. To further enhance the versatility of DED, three disjoint path routing algorithms are proposed and their fault tolerance is evaluated through simulation. The simulation results also show that the packet latency of DED is lower than dragonfly and cascade.</p>\\n </div>\",\"PeriodicalId\":55214,\"journal\":{\"name\":\"Concurrency and Computation-Practice & Experience\",\"volume\":\"36 27\",\"pages\":\"\"},\"PeriodicalIF\":1.5000,\"publicationDate\":\"2024-09-27\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Concurrency and Computation-Practice & Experience\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://onlinelibrary.wiley.com/doi/10.1002/cpe.8286\",\"RegionNum\":4,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"COMPUTER SCIENCE, SOFTWARE ENGINEERING\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Concurrency and Computation-Practice & Experience","FirstCategoryId":"94","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/cpe.8286","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMPUTER SCIENCE, SOFTWARE ENGINEERING","Score":null,"Total":0}
引用次数: 0
摘要
蜻蜓拓扑是并行和分布式系统中常用的互连网络设计。经典的蜻蜓可以表示为 dragonfly( k $$ k $$ , m $$ m $$ , l $$ l $$),其中 m $$ m $$ 是组中路由器的数量,l $$ l $$ 是每个路由器连接到其他组的链接数,k $$ k $$ 是每个路由器连接到计算节点的链接数。每个路由器都有 m - 1 $$ m-1 $$ 个链接与组内其他 m - 1 $$ m-1 $$ 个路由器完全相连。每个组都有 m l $ m$ ml $ 链接与其他组相连。各组也是完全连接的,因此总共有 m l + 1 $ ml+1 $ $ 组。蜻蜓(k $$ k $$ , m $$ m $$ , l $$ l $$)的路由器半角为 l + k + m - 1 $$ l+k+m-1 $$。构建一个大型蜻蜓系统需要大量高分辨率路由器,从而增加了硬件成本。为了降低硬件成本,本文提出了一种更灵活的拓扑结构,称为维度扩展蜻蜓(DED)。一个组中的每个路由器不是完全连接的,而是排列在一个 n $$ n $$ 的维度矩阵中,相同维度的路由器是完全连接的。我们使用 n $$ n $$ 表示维度,这样 DED 中的每个组都有 m n $$ {m}^n $$ 路由器。本研究从成本、性能、容错和数据包延迟等方面对 DED 进行了全面评估。研究结果表明,与传统的 Dragonfly 和 Cascade 拓扑相比,DED 提供了更经济的硬件解决方案,尤其是在 n ≥ 3 $$ n\ge 3 $$ 的情况下。除了成本效益,DED 还增强了系统设计的灵活性。它通过直径和弧度的不同组合为系统扩展提供了多种可能性,为系统架构师提供了更多适应性更强的选择。为了进一步增强 DED 的多功能性,我们提出了三种不相交路径路由算法,并通过仿真评估了它们的容错性。仿真结果还表明,DED 的数据包延迟低于蜻蜓和级联。
An in-depth study of dimension-extended dragonfly interconnection network
Dragonfly topology is a commonly utilized design for interconnection networks in parallel and distributed systems. A classical dragonfly can be denoted as dragonfly(,,), where is the number of routers in a group, is the number of links per router connected to other groups, and is the number of links per router connected to compute nodes. Each router has other links fully connected to other routers within a group. Each group has links connected to other groups. The groups are also fully connected, therefore there are groups in total. The router radix in a dragonfly(,,) is. Building a large dragonfly system requires a large number of high-radix routers, increasing hardware costs. To reduce hardware costs, this paper proposes a more flexible topology called dimension-extended dragonfly (DED). Rather than routers in a group being fully connected, each router in a group is arranged in an-dimensional matrix, and routers of the same dimension are fully connected. We use to denote the dimension such that each group in the DED has routers. This study comprehensively evaluates DED in terms of cost, performance, fault tolerance, and packet latency. The findings show that DED provides a more economical hardware solution compared to traditional Dragonfly and Cascade topologies, especially for. Beyond cost-efficiency, DED enhances system design flexibility. It offers diverse possibilities for system scaling through different combinations of diameter and radix, giving system architects more adaptable options. To further enhance the versatility of DED, three disjoint path routing algorithms are proposed and their fault tolerance is evaluated through simulation. The simulation results also show that the packet latency of DED is lower than dragonfly and cascade.
期刊介绍:
Concurrency and Computation: Practice and Experience (CCPE) publishes high-quality, original research papers, and authoritative research review papers, in the overlapping fields of:
Parallel and distributed computing;
High-performance computing;
Computational and data science;
Artificial intelligence and machine learning;
Big data applications, algorithms, and systems;
Network science;
Ontologies and semantics;
Security and privacy;
Cloud/edge/fog computing;
Green computing; and
Quantum computing.