首页 > 最新文献

Journal of Computational Biology最新文献

英文 中文
Recursive Clustering of Cellular Diversity in scRNA-Seq Data. scRNA-Seq数据中细胞多样性的递归聚类。
IF 1.4 4区 生物学 Q4 BIOCHEMICAL RESEARCH METHODS Pub Date : 2025-04-01 Epub Date: 2025-03-28 DOI: 10.1089/cmb.2024.0625
Michael Squires, Peng Qiu

In scRNA-seq analysis, cell clusters are typically defined by a single round of feature extraction and clustering. This approach may miss phenotypic differences in cell types that are characterized by genes not sufficiently represented in the feature set derived using all cells, such as rare cell types. This work explores an alternative approach, where cell clusters are identified by recursively performing feature extraction and clustering on previously identified clusters, such that each subclustering step uses features that are more specific to distinguishing the higher-resolution subclusters. We benchmark this recursive approach against the conventional, nonrecursive clustering approach and demonstrate that the recursive method results in robust improvement in cell type detection on four scRNA-seq datasets across a wide range of clustering resolution parameters. We apply the recursive approach to cluster scRNA-seq data obtained from patients with Crohn's disease belonging to three clinical phenotypes and observe that recursive clustering captures phenotypic differences only visible at specific levels of granularity within an interpretable hierarchical framework while defining cell clusters within a gene expression feature space more specific to each cluster.

在scRNA-seq分析中,细胞簇通常通过一轮特征提取和聚类来定义。这种方法可能会错过细胞类型的表型差异,这些差异是由基因表征的,在使用所有细胞衍生的特征集中没有充分代表,例如罕见的细胞类型。这项工作探索了一种替代方法,其中通过递归地对先前识别的集群进行特征提取和聚类来识别细胞集群,这样每个子聚类步骤使用更具体的特征来区分更高分辨率的子集群。我们将这种递归方法与传统的非递归聚类方法进行了基准测试,并证明递归方法在广泛的聚类分辨率参数范围内对四个scRNA-seq数据集的细胞类型检测产生了鲁棒性改进。我们将递归方法应用于从克罗恩病患者中获得的属于三种临床表型的scRNA-seq数据,并观察到递归聚类仅在可解释的层次框架内的特定粒度水平上可见的表型差异,同时在基因表达特征空间中定义每个簇更具体的细胞簇。
{"title":"Recursive Clustering of Cellular Diversity in scRNA-Seq Data.","authors":"Michael Squires, Peng Qiu","doi":"10.1089/cmb.2024.0625","DOIUrl":"10.1089/cmb.2024.0625","url":null,"abstract":"<p><p>In scRNA-seq analysis, cell clusters are typically defined by a single round of feature extraction and clustering. This approach may miss phenotypic differences in cell types that are characterized by genes not sufficiently represented in the feature set derived using all cells, such as rare cell types. This work explores an alternative approach, where cell clusters are identified by recursively performing feature extraction and clustering on previously identified clusters, such that each subclustering step uses features that are more specific to distinguishing the higher-resolution subclusters. We benchmark this recursive approach against the conventional, nonrecursive clustering approach and demonstrate that the recursive method results in robust improvement in cell type detection on four scRNA-seq datasets across a wide range of clustering resolution parameters. We apply the recursive approach to cluster scRNA-seq data obtained from patients with Crohn's disease belonging to three clinical phenotypes and observe that recursive clustering captures phenotypic differences only visible at specific levels of granularity within an interpretable hierarchical framework while defining cell clusters within a gene expression feature space more specific to each cluster.</p>","PeriodicalId":15526,"journal":{"name":"Journal of Computational Biology","volume":" ","pages":"444-460"},"PeriodicalIF":1.4,"publicationDate":"2025-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143730225","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
VIPER: Virus Inhibition Via Peptide Engineering and Receptor Mimicry. VIPER:通过肽工程和受体模仿抑制病毒。
IF 1.4 4区 生物学 Q4 BIOCHEMICAL RESEARCH METHODS Pub Date : 2025-04-01 Epub Date: 2025-02-14 DOI: 10.1089/cmb.2024.0866
Anna Sophie Klingenberg, Dario Ghersi

A key step in most viral infections is the binding of a viral protein to a host receptor, leading to the virus entering the host cell. Disrupting this protein-protein interaction is an effective strategy for preventing infection and subsequent disease. Building on recent advances in computational tools for structural biology, we introduce Virus Inhibition via Peptide Engineering and Receptor Mimicry (VIPER), a novel approach for the automatic derivation and optimization of biomimetic decoy peptides that mimic binding sites of human proteins. VIPER leverages structural data from human-pathogen protein complexes, yielding peptides that can competitively inhibit viral entry by mimicking the natural receptor. We computationally validated VIPER using molecular dynamics simulations and showcased its applicability on three clinically relevant viruses, highlighting its potential to accelerate therapeutic development. With a focus on reproducibility and extensibility, VIPER can facilitate the rapid development of antiviral inhibitors by automating the design and optimization of biomimetic compounds.

大多数病毒感染的关键步骤是病毒蛋白与宿主受体结合,导致病毒进入宿主细胞。破坏这种蛋白质-蛋白质相互作用是预防感染和随后疾病的有效策略。基于结构生物学计算工具的最新进展,我们通过肽工程和受体模仿(VIPER)引入病毒抑制,这是一种自动衍生和优化模拟人类蛋白质结合位点的仿生诱饵肽的新方法。VIPER利用人类病原体蛋白复合物的结构数据,产生可以通过模仿天然受体竞争性地抑制病毒进入的肽。我们使用分子动力学模拟对VIPER进行了计算验证,并展示了其在三种临床相关病毒上的适用性,强调了其加速治疗开发的潜力。VIPER专注于可重复性和可扩展性,可以通过自动化设计和优化仿生化合物来促进抗病毒抑制剂的快速开发。
{"title":"VIPER: Virus Inhibition Via Peptide Engineering and Receptor Mimicry.","authors":"Anna Sophie Klingenberg, Dario Ghersi","doi":"10.1089/cmb.2024.0866","DOIUrl":"10.1089/cmb.2024.0866","url":null,"abstract":"<p><p>A key step in most viral infections is the binding of a viral protein to a host receptor, leading to the virus entering the host cell. Disrupting this protein-protein interaction is an effective strategy for preventing infection and subsequent disease. Building on recent advances in computational tools for structural biology, we introduce Virus Inhibition via Peptide Engineering and Receptor Mimicry (VIPER), a novel approach for the automatic derivation and optimization of biomimetic decoy peptides that mimic binding sites of human proteins. VIPER leverages structural data from human-pathogen protein complexes, yielding peptides that can competitively inhibit viral entry by mimicking the natural receptor. We computationally validated VIPER using molecular dynamics simulations and showcased its applicability on three clinically relevant viruses, highlighting its potential to accelerate therapeutic development. With a focus on reproducibility and extensibility, VIPER can facilitate the rapid development of antiviral inhibitors by automating the design and optimization of biomimetic compounds.</p>","PeriodicalId":15526,"journal":{"name":"Journal of Computational Biology","volume":" ","pages":"362-373"},"PeriodicalIF":1.4,"publicationDate":"2025-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143414411","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Joint Bayesian Model for Change-Points and Heteroskedasticity Applied to the Canadian Longitudinal Study on Aging. 变化点和异方差联合贝叶斯模型在加拿大老龄化纵向研究中的应用。
IF 1.4 4区 生物学 Q4 BIOCHEMICAL RESEARCH METHODS Pub Date : 2025-04-01 Epub Date: 2025-01-20 DOI: 10.1089/cmb.2024.0563
Joosung Min, Olga Vishnyakova, Angela Brooks-Wilson, Lloyd T Elliott

Maintaining homeostasis, the regulation of internal physiological parameters, is essential for health and well-being. Deviations from optimal levels, or 'sweet spots,' can lead to health deterioration and disease. Identifying biomarkers with sweet spots requires both change-point detection and variance effect analysis. Traditional approaches involve separate tests for change-points and heteroskedasticity, which can yield inaccurate results if model assumptions are violated. To address these challenges, we propose a unified approach: Bayesian Testing for Heteroskedasticity and Sweet Spots (BTHS). This framework integrates sampling-based parameter estimation and Bayes factor computation to enhance change-point detection, heteroskedasticity quantification, and testing in change-point regression settings, and extends previous Bayesian approaches. BTHS eliminates the need for separate analyses and provides detailed insights into both the magnitude and shape of heteroskedasticity, enabling robust identification of sweet spots without strong assumptions. We applied BTHS to blood elements from the Canadian Longitudinal Study on Aging identifying nine blood elements with significant sweet spot variance effects.

维持体内平衡,调节内部生理参数,对健康和幸福至关重要。偏离最佳水平或“最佳点”会导致健康恶化和疾病。识别具有最佳点的生物标志物需要变化点检测和方差效应分析。传统的方法包括对变化点和异方差的单独测试,如果模型假设被违反,可能会产生不准确的结果。为了解决这些挑战,我们提出了一种统一的方法:异方差和最佳点贝叶斯检验(BTHS)。该框架集成了基于采样的参数估计和贝叶斯因子计算,以增强变化点检测、异方差量化和变化点回归设置中的测试,并扩展了以前的贝叶斯方法。BTHS消除了单独分析的需要,并提供了对异方差的大小和形状的详细见解,可以在没有强假设的情况下可靠地识别最佳点。我们将BTHS应用于加拿大衰老纵向研究中的血液元素,确定了九种具有显著甜点方差效应的血液元素。
{"title":"A Joint Bayesian Model for Change-Points and Heteroskedasticity Applied to the Canadian Longitudinal Study on Aging.","authors":"Joosung Min, Olga Vishnyakova, Angela Brooks-Wilson, Lloyd T Elliott","doi":"10.1089/cmb.2024.0563","DOIUrl":"10.1089/cmb.2024.0563","url":null,"abstract":"<p><p>Maintaining homeostasis, the regulation of internal physiological parameters, is essential for health and well-being. Deviations from optimal levels, or 'sweet spots,' can lead to health deterioration and disease. Identifying biomarkers with sweet spots requires both change-point detection and variance effect analysis. Traditional approaches involve separate tests for change-points and heteroskedasticity, which can yield inaccurate results if model assumptions are violated. To address these challenges, we propose a unified approach: Bayesian Testing for Heteroskedasticity and Sweet Spots (BTHS). This framework integrates sampling-based parameter estimation and Bayes factor computation to enhance change-point detection, heteroskedasticity quantification, and testing in change-point regression settings, and extends previous Bayesian approaches. BTHS eliminates the need for separate analyses and provides detailed insights into both the magnitude and shape of heteroskedasticity, enabling robust identification of sweet spots without strong assumptions. We applied BTHS to blood elements from the Canadian Longitudinal Study on Aging identifying nine blood elements with significant sweet spot variance effects.</p>","PeriodicalId":15526,"journal":{"name":"Journal of Computational Biology","volume":" ","pages":"374-393"},"PeriodicalIF":1.4,"publicationDate":"2025-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143006258","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An Earth Mover's Distance-Based Self-Supervised Framework for Cellular Dynamic Grading in Live-Cell Imaging. 在活细胞成像中,基于距离的土工自监督框架用于细胞动态分级。
IF 1.4 4区 生物学 Q4 BIOCHEMICAL RESEARCH METHODS Pub Date : 2025-03-01 Epub Date: 2024-12-02 DOI: 10.1089/cmb.2024.0672
Fengqian Pang, Chunyue Lei, Hongfei Zhao, Zhiqiang Xing

Cellular appearance and its dynamics frequently serve as a proxy measurement of live-cell physiological properties. The computational analysis of cell properties is considered to be a significant endeavor in biological and biomedical research. Deep learning has garnered considerable success across various fields. In light of this, various neural networks have been developed to analyze live-cell microscopic videos and capture cellular dynamics with biological significance. Specifically, cellular dynamic grading (CDG) is the task that provides a predefined dynamic grade for a live-cell according to the speed of cellular deformation and intracellular movement. This task involves recording the morphological and cytoplasmic dynamics in live-cell microscopic videos. Similar to other medical image processing tasks, CDG faces challenges in collecting and annotating cellular videos. These deficiencies in medical data limit the performance of deep learning models. In this article, we propose a novel self-supervised framework to overcome these limitations for the CDG task. Our framework relies on the assumption that increasing or decreasing cell dynamic grades is consistent with accelerating or decelerating cell appearance change in videos, respectively. This consistency is subsequently incorporated as a constraint in the loss function for the self-supervised training strategy. Our framework is implemented by formulating a probability transition matrix based on the Earth Mover's Distance and imposing a loss constraint on the elements of this matrix. Experimental results demonstrate that our proposed framework enhances the model's ability to learn spatiotemporal dynamics. Furthermore, our framework outperforms the existing methods on our cell video database.

细胞外观及其动力学常常作为活细胞生理特性的代理测量。细胞特性的计算分析被认为是生物学和生物医学研究中的一项重要工作。深度学习在各个领域都取得了相当大的成功。鉴于此,已经开发了各种神经网络来分析活细胞微观视频并捕获具有生物学意义的细胞动力学。具体来说,细胞动态分级(CDG)是根据细胞变形和细胞内运动的速度为活细胞提供预定义的动态分级的任务。这项任务包括在活细胞显微录像中记录形态学和细胞质动力学。与其他医学图像处理任务类似,CDG在收集和注释蜂窝视频方面面临挑战。医疗数据中的这些缺陷限制了深度学习模型的性能。在本文中,我们提出了一种新的自监督框架来克服CDG任务的这些限制。我们的框架依赖于这样的假设,即增加或减少细胞动态等级分别与视频中加速或减速细胞外观变化一致。这种一致性随后作为约束纳入到自监督训练策略的损失函数中。我们的框架是通过制定一个基于地球移动距离的概率转移矩阵并对该矩阵的元素施加损失约束来实现的。实验结果表明,我们提出的框架增强了模型学习时空动态的能力。此外,我们的框架在我们的手机视频数据库上优于现有的方法。
{"title":"An Earth Mover's Distance-Based Self-Supervised Framework for Cellular Dynamic Grading in Live-Cell Imaging.","authors":"Fengqian Pang, Chunyue Lei, Hongfei Zhao, Zhiqiang Xing","doi":"10.1089/cmb.2024.0672","DOIUrl":"10.1089/cmb.2024.0672","url":null,"abstract":"<p><p>Cellular appearance and its dynamics frequently serve as a proxy measurement of live-cell physiological properties. The computational analysis of cell properties is considered to be a significant endeavor in biological and biomedical research. Deep learning has garnered considerable success across various fields. In light of this, various neural networks have been developed to analyze live-cell microscopic videos and capture cellular dynamics with biological significance. Specifically, cellular dynamic grading (CDG) is the task that provides a predefined dynamic grade for a live-cell according to the speed of cellular deformation and intracellular movement. This task involves recording the morphological and cytoplasmic dynamics in live-cell microscopic videos. Similar to other medical image processing tasks, CDG faces challenges in collecting and annotating cellular videos. These deficiencies in medical data limit the performance of deep learning models. In this article, we propose a novel self-supervised framework to overcome these limitations for the CDG task. Our framework relies on the assumption that increasing or decreasing cell dynamic grades is consistent with accelerating or decelerating cell appearance change in videos, respectively. This consistency is subsequently incorporated as a constraint in the loss function for the self-supervised training strategy. Our framework is implemented by formulating a probability transition matrix based on the Earth Mover's Distance and imposing a loss constraint on the elements of this matrix. Experimental results demonstrate that our proposed framework enhances the model's ability to learn spatiotemporal dynamics. Furthermore, our framework outperforms the existing methods on our cell video database.</p>","PeriodicalId":15526,"journal":{"name":"Journal of Computational Biology","volume":" ","pages":"274-297"},"PeriodicalIF":1.4,"publicationDate":"2025-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142769460","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Graph-Based Machine-Learning Approach Combined with Optical Measurements to Understand Beating Dynamics of Cardiomyocytes. 基于图的机器学习方法结合光学测量来理解心肌细胞的跳动动力学。
IF 1.4 4区 生物学 Q4 BIOCHEMICAL RESEARCH METHODS Pub Date : 2025-03-01 Epub Date: 2024-12-09 DOI: 10.1089/cmb.2024.0491
Ziqian Wu, Jiyoon Park, Paul R Steiner, Bo Zhu, John X J Zhang

The development of computational models for the prediction of cardiac cellular dynamics remains a challenge due to the lack of first-principled mathematical models. We develop a novel machine-learning approach hybridizing physics simulation and graph networks to deliver robust predictions of cardiomyocyte dynamics. Embedded with inductive physical priors, the proposed constraint-based interaction neural projection (CINP) algorithm can uncover hidden physical constraints from sparse image data on a small set of beating cardiac cells and provide robust predictions for heterogenous large-scale cell sets. We also implement an in vitro culture and imaging platform for cellular motion and calcium transient analysis to validate the model. We showcase our model's efficacy by predicting complex organoid cellular behaviors in both in silico and in vitro settings.

由于缺乏第一性原理的数学模型,用于预测心脏细胞动力学的计算模型的发展仍然是一个挑战。我们开发了一种新的机器学习方法,将物理模拟和图形网络相结合,以提供心肌细胞动力学的稳健预测。本文提出的基于约束的交互神经投影(CINP)算法嵌入归纳物理先验,可以从一小组跳动的心脏细胞的稀疏图像数据中发现隐藏的物理约束,并为异构大规模细胞集提供鲁棒预测。我们还实现了体外培养和成像平台,用于细胞运动和钙瞬态分析,以验证模型。我们通过在计算机和体外环境中预测复杂的类器官细胞行为来展示我们的模型的功效。
{"title":"A Graph-Based Machine-Learning Approach Combined with Optical Measurements to Understand Beating Dynamics of Cardiomyocytes.","authors":"Ziqian Wu, Jiyoon Park, Paul R Steiner, Bo Zhu, John X J Zhang","doi":"10.1089/cmb.2024.0491","DOIUrl":"10.1089/cmb.2024.0491","url":null,"abstract":"<p><p>The development of computational models for the prediction of cardiac cellular dynamics remains a challenge due to the lack of first-principled mathematical models. We develop a novel machine-learning approach hybridizing physics simulation and graph networks to deliver robust predictions of cardiomyocyte dynamics. Embedded with inductive physical priors, the proposed constraint-based interaction neural projection (CINP) algorithm can uncover hidden physical constraints from sparse image data on a small set of beating cardiac cells and provide robust predictions for heterogenous large-scale cell sets. We also implement an in vitro culture and imaging platform for cellular motion and calcium transient analysis to validate the model. We showcase our model's efficacy by predicting complex organoid cellular behaviors in both in silico and in vitro settings.</p>","PeriodicalId":15526,"journal":{"name":"Journal of Computational Biology","volume":" ","pages":"239-252"},"PeriodicalIF":1.4,"publicationDate":"2025-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142794605","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Fuzzy-Based Identification of Transition Cells to Infer Cell Trajectory for Single-Cell Transcriptomics. 基于模糊识别的过渡细胞推断单细胞转录组学的细胞轨迹。
IF 1.4 4区 生物学 Q4 BIOCHEMICAL RESEARCH METHODS Pub Date : 2025-03-01 Epub Date: 2024-12-13 DOI: 10.1089/cmb.2023.0432
Xiang Chen, Yibing Ma, Yongle Shi, Bai Zhang, Hanwen Wu, Jie Gao

With the continuous evolution of single-cell RNA sequencing technology, it has become feasible to reconstruct cell development processes using computational methods. Trajectory inference is a crucial downstream analytical task that provides valuable insights into understanding cell cycle and differentiation. During cell development, cells exhibit both stable and transition states, which makes it challenging to accurately identify these cells. To address this challenge, we propose a novel single-cell trajectory inference method using fuzzy clustering, named scFCTI. By introducing fuzzy clustering and quantifying cell uncertainty, scFCTI can identify transition cells within unstable cell states. Moreover, scFCTI can obtain refined cell classification by characterizing different cell stages, which gain more accurate single-cell trajectory reconstruction containing transition paths. To validate the effectiveness of scFCTI, we conduct experiments on five real datasets and four different structure simulation datasets, comparing them with several state-of-the-art trajectory inference methods. The results demonstrate that scFCTI outperforms these methods by successfully identifying unstable cell clusters and obtaining more accurate cell paths with transition states. Especially the experimental results demonstrate that scFCTI can reconstruct the cell trajectory more precisely.

随着单细胞RNA测序技术的不断发展,利用计算方法重构细胞发育过程已经成为可能。轨迹推断是一项至关重要的下游分析任务,为理解细胞周期和分化提供了有价值的见解。在细胞发育过程中,细胞表现出稳定和过渡状态,这使得准确鉴定这些细胞具有挑战性。为了解决这一挑战,我们提出了一种新的单细胞轨迹推理方法,使用模糊聚类,命名为scFCTI。通过引入模糊聚类和量化细胞不确定性,scFCTI可以识别不稳定状态下的过渡细胞。此外,scFCTI可以通过表征细胞的不同阶段来获得精细的细胞分类,从而获得更精确的包含过渡路径的单细胞轨迹重建。为了验证scFCTI的有效性,我们在5个真实数据集和4个不同的结构仿真数据集上进行了实验,并将其与几种最先进的轨迹推断方法进行了比较。结果表明,scFCTI通过成功识别不稳定的细胞簇和获得更准确的过渡状态细胞路径,优于这些方法。特别是实验结果表明,scFCTI可以更精确地重建细胞轨迹。
{"title":"Fuzzy-Based Identification of Transition Cells to Infer Cell Trajectory for Single-Cell Transcriptomics.","authors":"Xiang Chen, Yibing Ma, Yongle Shi, Bai Zhang, Hanwen Wu, Jie Gao","doi":"10.1089/cmb.2023.0432","DOIUrl":"10.1089/cmb.2023.0432","url":null,"abstract":"<p><p>With the continuous evolution of single-cell RNA sequencing technology, it has become feasible to reconstruct cell development processes using computational methods. Trajectory inference is a crucial downstream analytical task that provides valuable insights into understanding cell cycle and differentiation. During cell development, cells exhibit both stable and transition states, which makes it challenging to accurately identify these cells. To address this challenge, we propose a novel single-cell trajectory inference method using fuzzy clustering, named scFCTI. By introducing fuzzy clustering and quantifying cell uncertainty, scFCTI can identify transition cells within unstable cell states. Moreover, scFCTI can obtain refined cell classification by characterizing different cell stages, which gain more accurate single-cell trajectory reconstruction containing transition paths. To validate the effectiveness of scFCTI, we conduct experiments on five real datasets and four different structure simulation datasets, comparing them with several state-of-the-art trajectory inference methods. The results demonstrate that scFCTI outperforms these methods by successfully identifying unstable cell clusters and obtaining more accurate cell paths with transition states. Especially the experimental results demonstrate that scFCTI can reconstruct the cell trajectory more precisely.</p>","PeriodicalId":15526,"journal":{"name":"Journal of Computational Biology","volume":" ","pages":"253-273"},"PeriodicalIF":1.4,"publicationDate":"2025-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142818363","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Adaptive Arithmetic Coding-Based Encoding Method Toward High-Density DNA Storage. 基于自适应算术编码的编码方法,迈向高密度 DNA 存储。
IF 1.4 4区 生物学 Q4 BIOCHEMICAL RESEARCH METHODS Pub Date : 2025-03-01 Epub Date: 2024-11-15 DOI: 10.1089/cmb.2024.0697
Yingxin Hu, Yanjun Liu, Yuefei Yang

With the rapid advancement of big data and artificial intelligence technologies, the limitations inherent in traditional storage media for accommodating vast amounts of data have become increasingly evident. DNA storage is an innovative approach harnessing DNA and other biomolecules as storage mediums, endowed with superior characteristics including expansive capacity, remarkable density, minimal energy requirements, and unparalleled longevity. Central to the efficient DNA storage is the process of DNA coding, whereby digital information is converted into sequences of DNA bases. A novel encoding method based on adaptive arithmetic coding (AAC) has been introduced, delineating the encoding process into three distinct phases: compression, error correction, and mapping. Prediction by Partial Matching (PPM)-based AAC in the compression phase serves to compress data and enhance storage density. Subsequently, the error correction phase relies on octal Hamming code to rectify errors and safeguard data integrity. The mapping phase employs a "3-2 code" mapping relationship to ensure adherence to biochemical constraints. The proposed method was verified by encoding different formats of files such as text, pictures, and audio. The results indicated that the average coding density of bases can be up to 3.25 per nucleotide, the GC content (which includes guanine [G] and cytosine [C]) can be stabilized at 50% and the homopolymer length is restricted to no more than 2. Simulation experimental results corroborate the method's efficacy in preserving data integrity during both reading and writing operations, augmenting storage density, and exhibiting robust error correction capabilities.

随着大数据和人工智能技术的快速发展,传统存储介质在容纳海量数据方面的局限性日益明显。DNA 存储是一种利用 DNA 和其他生物大分子作为存储介质的创新方法,具有容量大、密度高、能耗低和寿命长等优越特性。高效 DNA 存储的核心是 DNA 编码过程,即把数字信息转换成 DNA 碱基序列的过程。一种基于自适应算术编码(AAC)的新型编码方法已经问世,它将编码过程划分为三个不同的阶段:压缩、纠错和映射。在压缩阶段,基于部分匹配预测(PPM)的自适应算术编码可压缩数据并提高存储密度。随后,纠错阶段依靠八进制汉明码来纠正错误并保护数据完整性。映射阶段采用 "3-2 码 "映射关系,以确保遵守生化约束。通过对文本、图片和音频等不同格式的文件进行编码,对所提出的方法进行了验证。模拟实验结果表明,该方法能在读写操作中保持数据的完整性,提高存储密度,并具有强大的纠错能力。
{"title":"Adaptive Arithmetic Coding-Based Encoding Method Toward High-Density DNA Storage.","authors":"Yingxin Hu, Yanjun Liu, Yuefei Yang","doi":"10.1089/cmb.2024.0697","DOIUrl":"10.1089/cmb.2024.0697","url":null,"abstract":"<p><p>With the rapid advancement of big data and artificial intelligence technologies, the limitations inherent in traditional storage media for accommodating vast amounts of data have become increasingly evident. DNA storage is an innovative approach harnessing DNA and other biomolecules as storage mediums, endowed with superior characteristics including expansive capacity, remarkable density, minimal energy requirements, and unparalleled longevity. Central to the efficient DNA storage is the process of DNA coding, whereby digital information is converted into sequences of DNA bases. A novel encoding method based on adaptive arithmetic coding (AAC) has been introduced, delineating the encoding process into three distinct phases: compression, error correction, and mapping. Prediction by Partial Matching (PPM)-based AAC in the compression phase serves to compress data and enhance storage density. Subsequently, the error correction phase relies on octal Hamming code to rectify errors and safeguard data integrity. The mapping phase employs a \"3-2 code\" mapping relationship to ensure adherence to biochemical constraints. The proposed method was verified by encoding different formats of files such as text, pictures, and audio. The results indicated that the average coding density of bases can be up to 3.25 per nucleotide, the GC content (which includes guanine [G] and cytosine [C]) can be stabilized at 50% and the homopolymer length is restricted to no more than 2. Simulation experimental results corroborate the method's efficacy in preserving data integrity during both reading and writing operations, augmenting storage density, and exhibiting robust error correction capabilities.</p>","PeriodicalId":15526,"journal":{"name":"Journal of Computational Biology","volume":" ","pages":"298-315"},"PeriodicalIF":1.4,"publicationDate":"2025-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142621431","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Drug Repurposing Using Hypergraph Embedding Based on Common Therapeutic Targets of a Drug. 基于药物共同治疗靶点的超图嵌入药物再利用。
IF 1.4 4区 生物学 Q4 BIOCHEMICAL RESEARCH METHODS Pub Date : 2025-03-01 Epub Date: 2024-12-09 DOI: 10.1089/cmb.2023.0427
Hanieh Abbasi, Amir Lakizadeh

Developing a new drug is a long and expensive process that typically takes 10-15 years and costs billions of dollars. This has led to an increasing interest in drug repositioning, which involves finding new therapeutic uses for existing drugs. Computational methods become an increasingly important tool for identifying associations between drugs and new diseases. Graph- and hypergraph-based approaches are a type of computational method that can be used to identify potential associations between drugs and new diseases. Here, we present a drug repurposing method based on hypergraph neural network for predicting drug-disease association in three stages. First, it constructs a heterogeneous graph that contains drug and disease nodes and links between them; in the second stage, it converts the heterogeneous simple graph to a hypergraph with only disease nodes. This is achieved by grouping diseases that use the same drug into a hyperedge. Indeed, all the diseases that are the common therapeutic goal of a drug are placed on a hyperedge. Finally, a graph neural network is used to predict drug-disease association based on the structure of the hypergraph. This model is more efficient than other methods because it uses a hypergraph to model relationships more effectively than graphs. Furthermore, it constructs the hypergraph using only a drug-disease association matrix, eliminating the need for extensive amounts of data. Experimental results show that the hypergraph-based approach effectively captures complex interrelationships between drugs and diseases, leading to improved accuracy of drug-disease association prediction compared to state-of-the-art methods.

开发一种新药是一个漫长而昂贵的过程,通常需要10-15年的时间,耗资数十亿美元。这导致了对药物重新定位的兴趣日益增加,这涉及到为现有药物寻找新的治疗用途。计算方法成为识别药物和新疾病之间联系的越来越重要的工具。基于图和超图的方法是一种可用于识别药物和新疾病之间潜在关联的计算方法。在此,我们提出了一种基于超图神经网络的药物再利用方法,用于预测药物-疾病关联的三个阶段。首先,它构建了一个包含药物和疾病节点以及它们之间的链接的异构图;在第二阶段,将异构简单图转换为只有疾病节点的超图。这是通过将使用相同药物的疾病分组到一个超边缘来实现的。事实上,所有作为药物共同治疗目标的疾病都被置于超边缘。最后,基于超图的结构,利用图神经网络对药物-疾病关联进行预测。此模型比其他方法更有效,因为它使用超图比图更有效地建模关系。此外,它仅使用药物-疾病关联矩阵构建超图,从而消除了对大量数据的需要。实验结果表明,基于超图的方法有效地捕获了药物和疾病之间复杂的相互关系,与最先进的方法相比,提高了药物-疾病关联预测的准确性。
{"title":"Drug Repurposing Using Hypergraph Embedding Based on Common Therapeutic Targets of a Drug.","authors":"Hanieh Abbasi, Amir Lakizadeh","doi":"10.1089/cmb.2023.0427","DOIUrl":"10.1089/cmb.2023.0427","url":null,"abstract":"<p><p>Developing a new drug is a long and expensive process that typically takes 10-15 years and costs billions of dollars. This has led to an increasing interest in drug repositioning, which involves finding new therapeutic uses for existing drugs. Computational methods become an increasingly important tool for identifying associations between drugs and new diseases. Graph- and hypergraph-based approaches are a type of computational method that can be used to identify potential associations between drugs and new diseases. Here, we present a drug repurposing method based on hypergraph neural network for predicting drug-disease association in three stages. First, it constructs a heterogeneous graph that contains drug and disease nodes and links between them; in the second stage, it converts the heterogeneous simple graph to a hypergraph with only disease nodes. This is achieved by grouping diseases that use the same drug into a hyperedge. Indeed, all the diseases that are the common therapeutic goal of a drug are placed on a hyperedge. Finally, a graph neural network is used to predict drug-disease association based on the structure of the hypergraph. This model is more efficient than other methods because it uses a hypergraph to model relationships more effectively than graphs. Furthermore, it constructs the hypergraph using only a drug-disease association matrix, eliminating the need for extensive amounts of data. Experimental results show that the hypergraph-based approach effectively captures complex interrelationships between drugs and diseases, leading to improved accuracy of drug-disease association prediction compared to state-of-the-art methods.</p>","PeriodicalId":15526,"journal":{"name":"Journal of Computational Biology","volume":" ","pages":"316-329"},"PeriodicalIF":1.4,"publicationDate":"2025-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142794721","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
DRGAT: Predicting Drug Responses Via Diffusion-Based Graph Attention Network. 基于扩散的图注意网络预测药物反应。
IF 1.4 4区 生物学 Q4 BIOCHEMICAL RESEARCH METHODS Pub Date : 2025-03-01 Epub Date: 2024-12-06 DOI: 10.1089/cmb.2024.0807
Emre Sefer

Accurately predicting drug response depending on a patient's genomic profile is critical for advancing personalized medicine. Deep learning approaches rise and especially the rise of graph neural networks leveraging large-scale omics datasets have been a key driver of research in this area. However, these biological datasets, which are typically high dimensional but have small sample sizes, present challenges such as overfitting and poor generalization in predictive models. As a complicating matter, gene expression (GE) data must capture complex inter-gene relationships, exacerbating these issues. In this article, we tackle these challenges by introducing a drug response prediction method, called drug response graph attention network (DRGAT), which combines a denoising diffusion implicit model for data augmentation with a recently introduced graph attention network (GAT) with high-order neighbor propagation (HO-GATs) prediction module. Our proposed approach achieved almost 5% improvement in the area under receiver operating characteristic curve compared with state-of-the-art models for the many studied drugs, indicating our method's reasonable generalization capabilities. Moreover, our experiments confirm the potential of diffusion-based generative models, a core component of our method, to mitigate the inherent limitations of omics datasets by effectively augmenting GE data.

根据患者的基因组谱准确预测药物反应对于推进个性化医疗至关重要。深度学习方法的兴起,特别是利用大规模组学数据集的图神经网络的兴起,已经成为该领域研究的关键驱动力。然而,这些生物数据集通常是高维的,但样本量小,在预测模型中存在过拟合和泛化差等挑战。作为一个复杂的问题,基因表达(GE)数据必须捕捉复杂的基因间关系,加剧了这些问题。在本文中,我们通过引入一种药物反应预测方法来解决这些挑战,称为药物反应图注意网络(DRGAT),该方法将用于数据增强的去噪扩散隐式模型与最近引入的具有高阶邻居传播(HO-GATs)预测模块的图注意网络(GAT)相结合。与许多研究药物的最先进模型相比,我们提出的方法在受试者工作特征曲线下的面积提高了近5%,表明我们的方法具有合理的泛化能力。此外,我们的实验证实了基于扩散的生成模型的潜力,这是我们方法的核心组成部分,可以通过有效地增强GE数据来减轻组学数据集的固有局限性。
{"title":"DRGAT: Predicting Drug Responses Via Diffusion-Based Graph Attention Network.","authors":"Emre Sefer","doi":"10.1089/cmb.2024.0807","DOIUrl":"10.1089/cmb.2024.0807","url":null,"abstract":"<p><p>Accurately predicting drug response depending on a patient's genomic profile is critical for advancing personalized medicine. Deep learning approaches rise and especially the rise of graph neural networks leveraging large-scale omics datasets have been a key driver of research in this area. However, these biological datasets, which are typically high dimensional but have small sample sizes, present challenges such as overfitting and poor generalization in predictive models. As a complicating matter, gene expression (GE) data must capture complex inter-gene relationships, exacerbating these issues. In this article, we tackle these challenges by introducing a drug response prediction method, called drug response graph attention network (DRGAT), which combines a denoising diffusion implicit model for data augmentation with a recently introduced graph attention network (GAT) with high-order neighbor propagation (HO-GATs) prediction module. Our proposed approach achieved almost 5% improvement in the area under receiver operating characteristic curve compared with state-of-the-art models for the many studied drugs, indicating our method's reasonable generalization capabilities. Moreover, our experiments confirm the potential of diffusion-based generative models, a core component of our method, to mitigate the inherent limitations of omics datasets by effectively augmenting GE data.</p>","PeriodicalId":15526,"journal":{"name":"Journal of Computational Biology","volume":" ","pages":"330-350"},"PeriodicalIF":1.4,"publicationDate":"2025-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142785889","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
CFINet: Cross-Modality MRI Feature Interaction Network for Pseudoprogression Prediction of Glioblastoma. CFINet:用于胶质母细胞瘤假性进展预测的跨模态磁共振成像特征交互网络
IF 1.4 4区 生物学 Q4 BIOCHEMICAL RESEARCH METHODS Pub Date : 2025-02-01 Epub Date: 2024-07-08 DOI: 10.1089/cmb.2024.0518
Ya Lv, Jin Liu, Xu Tian, Pei Yang, Yi Pan

Pseudoprogression (PSP) is a related reaction of glioblastoma treatment, and misdiagnosis can lead to unnecessary intervention. Magnetic resonance imaging (MRI) provides cross-modality images for PSP prediction studies. However, how to effectively use the complementary information between the cross-modality MRI to improve PSP prediction is still a challenging task. To address this challenge, we propose a cross-modality feature interaction network for PSP prediction. Firstly, we propose a triple-branch multi-scale module to extract low-order feature representations and a skip-connection multi-scale module to extract high-order feature representations. Then, a cross-modality interaction module based on attention mechanism is designed to make the complementary information between cross-modality MRI fully interact. Finally, the high-order cross-modality interaction information is fed into a multi-layer perceptron to achieve the PSP prediction task. We evaluate the proposed network on a private dataset with 52 subjects from Hunan Cancer Hospital and validate it on a private dataset with 30 subjects from Xiangya Hospital. The accuracy of our proposed network on the datasets is 0.954 and 0.929, respectively, which is better than most typical convolutional neural network and interaction methods.

假性进展(PSP)是胶质母细胞瘤治疗的相关反应,误诊可能导致不必要的干预。磁共振成像(MRI)可为PSP预测研究提供跨模态图像。然而,如何有效利用跨模态磁共振成像之间的互补信息来改善 PSP 预测仍是一项具有挑战性的任务。为了应对这一挑战,我们提出了一种用于 PSP 预测的跨模态特征交互网络。首先,我们提出了一个提取低阶特征表征的三分支多尺度模块和一个提取高阶特征表征的跳接多尺度模块。然后,设计了基于注意机制的跨模态交互模块,使跨模态磁共振成像之间的互补信息充分互动。最后,将高阶跨模态交互信息输入多层感知器,以完成 PSP 预测任务。我们在湖南省肿瘤医院 52 名受试者的私人数据集上评估了所提出的网络,并在湘雅医院 30 名受试者的私人数据集上进行了验证。我们提出的网络在这些数据集上的准确率分别为 0.954 和 0.929,优于大多数典型的卷积神经网络和交互方法。
{"title":"CFINet: Cross-Modality MRI Feature Interaction Network for Pseudoprogression Prediction of Glioblastoma.","authors":"Ya Lv, Jin Liu, Xu Tian, Pei Yang, Yi Pan","doi":"10.1089/cmb.2024.0518","DOIUrl":"10.1089/cmb.2024.0518","url":null,"abstract":"<p><p>Pseudoprogression (PSP) is a related reaction of glioblastoma treatment, and misdiagnosis can lead to unnecessary intervention. Magnetic resonance imaging (MRI) provides cross-modality images for PSP prediction studies. However, how to effectively use the complementary information between the cross-modality MRI to improve PSP prediction is still a challenging task. To address this challenge, we propose a cross-modality feature interaction network for PSP prediction. Firstly, we propose a triple-branch multi-scale module to extract low-order feature representations and a skip-connection multi-scale module to extract high-order feature representations. Then, a cross-modality interaction module based on attention mechanism is designed to make the complementary information between cross-modality MRI fully interact. Finally, the high-order cross-modality interaction information is fed into a multi-layer perceptron to achieve the PSP prediction task. We evaluate the proposed network on a private dataset with 52 subjects from Hunan Cancer Hospital and validate it on a private dataset with 30 subjects from Xiangya Hospital. The accuracy of our proposed network on the datasets is 0.954 and 0.929, respectively, which is better than most typical convolutional neural network and interaction methods.</p>","PeriodicalId":15526,"journal":{"name":"Journal of Computational Biology","volume":" ","pages":"212-224"},"PeriodicalIF":1.4,"publicationDate":"2025-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141554866","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Journal of Computational Biology
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1