Sam F L Windels, Daniel Tello Velasco, Mikhail Rotkevich, Noël Malod-Dognin, Nataša Pržulj
{"title":"基于小图的双曲嵌入捕捉遗传网络中的进化动态","authors":"Sam F L Windels, Daniel Tello Velasco, Mikhail Rotkevich, Noël Malod-Dognin, Nataša Pržulj","doi":"10.1093/bioinformatics/btae650","DOIUrl":null,"url":null,"abstract":"<p><strong>Motivation: </strong>Spatial Analysis of Functional Enrichment (SAFE) is a popular tool for biologists to investigate the functional organization of biological networks via highly intuitive 2D functional maps. To create these maps, SAFE uses Spring embedding to project a given network into a 2D space in which nodes connected in the network are near each other in space. However, many biological networks are scale-free, containing highly connected hub nodes. Because Spring embedding fails to separate hub nodes, it provides uninformative embeddings that resemble a 'hairball'. In addition, Spring embedding only captures direct node connectivity in the network and does not consider higher-order node wiring patterns, which are best captured by graphlets, small, connected, nonisomorphic, induced subgraphs. The scale-free structure of biological networks is hypothesized to stem from an underlying low-dimensional hyperbolic geometry, which novel hyperbolic embedding methods try to uncover. These include coalescent embedding, which projects a network onto a 2D disk.</p><p><strong>Results: </strong>To better capture the functional organization of scale-free biological networks, whilst also going beyond simple direct connectivity patterns, we introduce Graphlet Coalescent (GraCoal) embedding, which embeds nodes nearby on a disk if they frequently co-occur on a given graphlet together. We use GraCoal to extend SAFE-based network analysis. Through SAFE-enabled enrichment analysis, we show that GraCoal outperforms graphlet-based Spring embedding in capturing the functional organization of the genetic interaction networks of fruit fly, budding yeast, fission yeast and Escherichia coli. We show that depending on the underlying graphlet, GraCoal embeddings capture different topology-function relationships. We show that triangle-based GraCoal embedding captures functional redundancies between paralogs.</p><p><strong>Availability and implementation: </strong>https://gitlab.bsc.es/swindels/gracoal_embedding.</p>","PeriodicalId":93899,"journal":{"name":"Bioinformatics (Oxford, England)","volume":" ","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11568109/pdf/","citationCount":"0","resultStr":"{\"title\":\"Graphlet-based hyperbolic embeddings capture evolutionary dynamics in genetic networks.\",\"authors\":\"Sam F L Windels, Daniel Tello Velasco, Mikhail Rotkevich, Noël Malod-Dognin, Nataša Pržulj\",\"doi\":\"10.1093/bioinformatics/btae650\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Motivation: </strong>Spatial Analysis of Functional Enrichment (SAFE) is a popular tool for biologists to investigate the functional organization of biological networks via highly intuitive 2D functional maps. To create these maps, SAFE uses Spring embedding to project a given network into a 2D space in which nodes connected in the network are near each other in space. However, many biological networks are scale-free, containing highly connected hub nodes. Because Spring embedding fails to separate hub nodes, it provides uninformative embeddings that resemble a 'hairball'. In addition, Spring embedding only captures direct node connectivity in the network and does not consider higher-order node wiring patterns, which are best captured by graphlets, small, connected, nonisomorphic, induced subgraphs. The scale-free structure of biological networks is hypothesized to stem from an underlying low-dimensional hyperbolic geometry, which novel hyperbolic embedding methods try to uncover. These include coalescent embedding, which projects a network onto a 2D disk.</p><p><strong>Results: </strong>To better capture the functional organization of scale-free biological networks, whilst also going beyond simple direct connectivity patterns, we introduce Graphlet Coalescent (GraCoal) embedding, which embeds nodes nearby on a disk if they frequently co-occur on a given graphlet together. We use GraCoal to extend SAFE-based network analysis. Through SAFE-enabled enrichment analysis, we show that GraCoal outperforms graphlet-based Spring embedding in capturing the functional organization of the genetic interaction networks of fruit fly, budding yeast, fission yeast and Escherichia coli. We show that depending on the underlying graphlet, GraCoal embeddings capture different topology-function relationships. We show that triangle-based GraCoal embedding captures functional redundancies between paralogs.</p><p><strong>Availability and implementation: </strong>https://gitlab.bsc.es/swindels/gracoal_embedding.</p>\",\"PeriodicalId\":93899,\"journal\":{\"name\":\"Bioinformatics (Oxford, England)\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11568109/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Bioinformatics (Oxford, England)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1093/bioinformatics/btae650\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Bioinformatics (Oxford, England)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1093/bioinformatics/btae650","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
摘要
动机功能富集空间分析(Space Analysis of Functional Enrichment,SAFE)是生物学家常用的一种工具,可通过高度直观的二维功能图研究生物网络的功能组织。为了绘制这些地图,SAFE 使用 Spring embedding 技术将给定网络投影到二维空间中,在这个空间中,网络中相互连接的节点在空间上彼此靠近。然而,许多生物网络是无标度的,包含高度连接的中心节点。由于 Spring embedding 无法分离枢纽节点,因此它提供的嵌入信息并不丰富,就像一个 "毛球"。此外,Spring embedding 只能捕捉网络中的直接节点连通性,并没有考虑高阶节点布线模式,而这种模式最好通过小图(小的、连通的、非同构的诱导子图)来捕捉。据推测,生物网络的无标度结构源于潜在的低维双曲几何,新型双曲嵌入方法试图揭示这种结构。这些方法包括凝聚嵌入法,它将网络投射到二维圆盘上:为了更好地捕捉无标度生物网络的功能组织,同时超越简单的直接连接模式,我们引入了小图聚合嵌入(GraCoal),如果节点经常共同出现在给定的小图上,就将它们嵌入到圆盘上。我们使用 GraCoal 扩展基于 SAFE 的网络分析。通过基于 SAFE 的富集分析,我们发现 GraCoal 在捕捉果蝇、芽生酵母、裂殖酵母和大肠杆菌的遗传相互作用网络的功能组织方面优于基于小图的 Spring 嵌入。我们发现,根据底层小图的不同,GraCoal 嵌入捕捉到的拓扑-功能关系也不同。我们表明,基于三角形的 GraCoal 嵌入能捕捉到旁系亲属之间的功能冗余。可用性:https://gitlab.bsc.es/swindels/gracoal_embedding.Supplementary 信息:补充数据可在 Bioinformatics online 上获取。
Graphlet-based hyperbolic embeddings capture evolutionary dynamics in genetic networks.
Motivation: Spatial Analysis of Functional Enrichment (SAFE) is a popular tool for biologists to investigate the functional organization of biological networks via highly intuitive 2D functional maps. To create these maps, SAFE uses Spring embedding to project a given network into a 2D space in which nodes connected in the network are near each other in space. However, many biological networks are scale-free, containing highly connected hub nodes. Because Spring embedding fails to separate hub nodes, it provides uninformative embeddings that resemble a 'hairball'. In addition, Spring embedding only captures direct node connectivity in the network and does not consider higher-order node wiring patterns, which are best captured by graphlets, small, connected, nonisomorphic, induced subgraphs. The scale-free structure of biological networks is hypothesized to stem from an underlying low-dimensional hyperbolic geometry, which novel hyperbolic embedding methods try to uncover. These include coalescent embedding, which projects a network onto a 2D disk.
Results: To better capture the functional organization of scale-free biological networks, whilst also going beyond simple direct connectivity patterns, we introduce Graphlet Coalescent (GraCoal) embedding, which embeds nodes nearby on a disk if they frequently co-occur on a given graphlet together. We use GraCoal to extend SAFE-based network analysis. Through SAFE-enabled enrichment analysis, we show that GraCoal outperforms graphlet-based Spring embedding in capturing the functional organization of the genetic interaction networks of fruit fly, budding yeast, fission yeast and Escherichia coli. We show that depending on the underlying graphlet, GraCoal embeddings capture different topology-function relationships. We show that triangle-based GraCoal embedding captures functional redundancies between paralogs.
Availability and implementation: https://gitlab.bsc.es/swindels/gracoal_embedding.