Multi-layer Bundling as a New Approach for Determining Multi-scale Correlations Within a High-Dimensional Dataset.

IF 2.2 4区 数学 Q2 BIOLOGY Bulletin of Mathematical Biology Pub Date : 2024-07-12 DOI:10.1007/s11538-024-01335-8
Mehran Fazli, Richard Bertram, Deborah A Striegel
{"title":"Multi-layer Bundling as a New Approach for Determining Multi-scale Correlations Within a High-Dimensional Dataset.","authors":"Mehran Fazli, Richard Bertram, Deborah A Striegel","doi":"10.1007/s11538-024-01335-8","DOIUrl":null,"url":null,"abstract":"<p><p>The growing complexity of biological data has spurred the development of innovative computational techniques to extract meaningful information and uncover hidden patterns within vast datasets. Biological networks, such as gene regulatory networks and protein-protein interaction networks, hold critical insights into biological features' connections and functions. Integrating and analyzing high-dimensional data, particularly in gene expression studies, stands prominent among the challenges in deciphering these networks. Clustering methods play a crucial role in addressing these challenges, with spectral clustering emerging as a potent unsupervised technique considering intrinsic geometric structures. However, spectral clustering's user-defined cluster number can lead to inconsistent and sometimes orthogonal clustering regimes. We propose the Multi-layer Bundling (MLB) method to address this limitation, combining multiple prominent clustering regimes to offer a comprehensive data view. We call the outcome clusters \"bundles\". This approach refines clustering outcomes, unravels hierarchical organization, and identifies bridge elements mediating communication between network components. By layering clustering results, MLB provides a global-to-local view of biological feature clusters enabling insights into intricate biological systems. Furthermore, the method enhances bundle network predictions by integrating the bundle co-cluster matrix with the affinity matrix. The versatility of MLB extends beyond biological networks, making it applicable to various domains where understanding complex relationships and patterns is needed.</p>","PeriodicalId":9372,"journal":{"name":"Bulletin of Mathematical Biology","volume":"86 9","pages":"105"},"PeriodicalIF":2.2000,"publicationDate":"2024-07-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11245441/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Bulletin of Mathematical Biology","FirstCategoryId":"100","ListUrlMain":"https://doi.org/10.1007/s11538-024-01335-8","RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"BIOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

The growing complexity of biological data has spurred the development of innovative computational techniques to extract meaningful information and uncover hidden patterns within vast datasets. Biological networks, such as gene regulatory networks and protein-protein interaction networks, hold critical insights into biological features' connections and functions. Integrating and analyzing high-dimensional data, particularly in gene expression studies, stands prominent among the challenges in deciphering these networks. Clustering methods play a crucial role in addressing these challenges, with spectral clustering emerging as a potent unsupervised technique considering intrinsic geometric structures. However, spectral clustering's user-defined cluster number can lead to inconsistent and sometimes orthogonal clustering regimes. We propose the Multi-layer Bundling (MLB) method to address this limitation, combining multiple prominent clustering regimes to offer a comprehensive data view. We call the outcome clusters "bundles". This approach refines clustering outcomes, unravels hierarchical organization, and identifies bridge elements mediating communication between network components. By layering clustering results, MLB provides a global-to-local view of biological feature clusters enabling insights into intricate biological systems. Furthermore, the method enhances bundle network predictions by integrating the bundle co-cluster matrix with the affinity matrix. The versatility of MLB extends beyond biological networks, making it applicable to various domains where understanding complex relationships and patterns is needed.

Abstract Image

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
多层捆绑是确定高维数据集内多尺度相关性的新方法
生物数据的复杂性不断增加,推动了创新计算技术的发展,以提取有意义的信息并揭示庞大数据集中隐藏的模式。生物网络,如基因调控网络和蛋白质-蛋白质相互作用网络,是了解生物特征的联系和功能的重要途径。整合和分析高维数据,尤其是基因表达研究中的高维数据,是破译这些网络所面临的突出挑战。聚类方法在应对这些挑战中发挥着至关重要的作用,其中光谱聚类是一种考虑到内在几何结构的有效无监督技术。然而,频谱聚类的用户自定义聚类数可能会导致不一致的聚类机制,有时甚至是正交的聚类机制。我们提出了多层捆绑(MLB)方法来解决这一局限性,将多种突出的聚类机制结合起来,提供全面的数据视图。我们将结果聚类称为 "捆绑"。这种方法可以完善聚类结果,揭示层级组织,并识别网络组件之间沟通的桥梁元素。通过对聚类结果进行分层,MLB 提供了生物特征集群的全局到局部视图,有助于深入了解错综复杂的生物系统。此外,该方法还通过整合束共簇矩阵和亲和矩阵来增强束网络预测。MLB 的多功能性超越了生物网络,使其适用于需要了解复杂关系和模式的各个领域。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
CiteScore
3.90
自引率
8.60%
发文量
123
审稿时长
7.5 months
期刊介绍: The Bulletin of Mathematical Biology, the official journal of the Society for Mathematical Biology, disseminates original research findings and other information relevant to the interface of biology and the mathematical sciences. Contributions should have relevance to both fields. In order to accommodate the broad scope of new developments, the journal accepts a variety of contributions, including: Original research articles focused on new biological insights gained with the help of tools from the mathematical sciences or new mathematical tools and methods with demonstrated applicability to biological investigations Research in mathematical biology education Reviews Commentaries Perspectives, and contributions that discuss issues important to the profession All contributions are peer-reviewed.
期刊最新文献
A Random Differential Equation Approach for Modeling the Growth of Microalgae in Photobioreactors. Pattern Formation in Agent-Based and PDE Models for Evolutionary Games with Payoff-Driven Motion. Mathematical Modeling of Lesion Pattern Formation in Dendritic Keratitis. Modeling and Simulation of the Role of Mass Testing in Controlling COVID-19. Inference of Genetic Networks from Pseudo Time Series of Single-cell Gene Expression Data using Modified Random Forests.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1