首页 > 最新文献

Journal of Bioinformatics and Computational Biology最新文献

英文 中文
COLDLNA: Enhancing long-range node features extraction to improve robust generalization ability of drug-target binding affinity prediction in cold-start scenarios. COLDLNA:增强远程节点特征提取,提高冷启动场景下药物靶点结合亲和力预测的鲁棒泛化能力。
IF 0.7 4区 生物学 Q4 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2025-10-01 Epub Date: 2025-09-20 DOI: 10.1142/S0219720025500131
Ting Xu, Shaohua Jiang, Weibin Ding, Peng Wang

Recent advances in deep learning have driven significant progress in drug-target affinity (DTA) prediction. However, many models do not effectively utilize drug molecular graphs or capture long-range protein features, limiting their predictive accuracy. To address these limitations, a novel COLDLNA model is designed for robust DTA prediction. The model employs the Long-range Node Attention Module to refine drug structure representations, while leveraging the Convolutional Attention Module to elucidate critical binding sites by extracting pivotal long-range information from protein amino acid sequences. Compared with the baseline model GraphDTA, COLDLNA reduced the MSE by 12.2% and 11.5% on the Davis and KIBA datasets, respectively. Additionally, its strong generalization ability was further validated on the Human dataset, C. elegans dataset, and in cold-start scenarios.

深度学习的最新进展推动了药物靶标亲和力(DTA)预测的重大进展。然而,许多模型不能有效地利用药物分子图或捕获远程蛋白质特征,限制了它们的预测准确性。为了解决这些限制,设计了一种新的COLDLNA模型,用于稳健的DTA预测。该模型采用远程节点注意模块来细化药物结构表征,同时利用卷积注意模块通过从蛋白质氨基酸序列中提取关键的远程信息来阐明关键的结合位点。与基线模型GraphDTA相比,COLDLNA在Davis和KIBA数据集上的MSE分别降低了12.2%和11.5%。此外,在人类数据集、秀丽隐杆线虫数据集和冷启动场景下,进一步验证了其较强的泛化能力。
{"title":"COLDLNA: Enhancing long-range node features extraction to improve robust generalization ability of drug-target binding affinity prediction in cold-start scenarios.","authors":"Ting Xu, Shaohua Jiang, Weibin Ding, Peng Wang","doi":"10.1142/S0219720025500131","DOIUrl":"10.1142/S0219720025500131","url":null,"abstract":"<p><p>Recent advances in deep learning have driven significant progress in drug-target affinity (DTA) prediction. However, many models do not effectively utilize drug molecular graphs or capture long-range protein features, limiting their predictive accuracy. To address these limitations, a novel COLDLNA model is designed for robust DTA prediction. The model employs the Long-range Node Attention Module to refine drug structure representations, while leveraging the Convolutional Attention Module to elucidate critical binding sites by extracting pivotal long-range information from protein amino acid sequences. Compared with the baseline model GraphDTA, COLDLNA reduced the MSE by 12.2% and 11.5% on the Davis and KIBA datasets, respectively. Additionally, its strong generalization ability was further validated on the Human dataset, C. elegans dataset, and in cold-start scenarios.</p>","PeriodicalId":48910,"journal":{"name":"Journal of Bioinformatics and Computational Biology","volume":" ","pages":"2550013"},"PeriodicalIF":0.7,"publicationDate":"2025-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145114833","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Parameter estimation analysis of the glioblastoma immune model. 胶质母细胞瘤免疫模型参数估计分析。
IF 0.7 4区 生物学 Q4 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2025-08-01 Epub Date: 2025-07-26 DOI: 10.1142/S0219720025500088
Biao Liu, Mengru Shen, Meiling Zhao

In exploring optimal strategies for immunotherapy in glioblastoma (GBM), one of the main challenges is enhancing treatment response. To better understand the dynamics of tumor-immune interactions, one applied Bayesian methods to estimate the parameters of glioblastoma immune model by using experimental data. One compared the effects of using uniform prior distributions versus improved prior distributions, which were adjusted based on posterior information, during parameter estimation. In addition, a comparative analysis of the results obtained by using four Markov Chain Monte Carlo (MCMC) sampling algorithms which respectively are Metropolis, DEMetropolis, DEMetropolisZ and NUTS, were performed. The results showed that the improved prior distribution significantly enhanced the accuracy of the model parameter estimates, and reduced the variance of the posterior distribution, but increased computational time and resource demands. Furthermore, DEMetropolisZ provided such efficient sampling and narrower confidence intervals within a shorter time frame, which outperformed the others. In contrast, the efficiency and stability of the Metropolis method were relatively poor. Therefore, the importance of selecting appropriate prior distributions and sampling algorithms to improve both the accuracy and efficiency of model inference were studied. The study provides valuable insights for optimizing GBM immunotherapy strategies and serves as a reference for modeling and parameter estimation of complex biological systems.

在探索胶质母细胞瘤(GBM)免疫治疗的最佳策略时,主要挑战之一是提高治疗反应。为了更好地了解肿瘤-免疫相互作用的动力学,利用实验数据,应用贝叶斯方法估计胶质母细胞瘤免疫模型的参数。其中一项比较了均匀先验分布与改进先验分布的效果,后者在参数估计期间根据后验信息进行调整。此外,还对Metropolis、DEMetropolis、DEMetropolisZ和NUTS四种Markov Chain Monte Carlo (MCMC)采样算法的结果进行了对比分析。结果表明,改进后的先验分布显著提高了模型参数估计的准确性,减小了后验分布的方差,但增加了计算时间和资源需求。此外,DEMetropolisZ在更短的时间框架内提供了如此高效的采样和更窄的置信区间,优于其他方法。相比之下,Metropolis方法的效率和稳定性相对较差。因此,研究了选择合适的先验分布和采样算法对提高模型推理的准确性和效率的重要性。该研究为优化GBM免疫治疗策略提供了有价值的见解,并为复杂生物系统的建模和参数估计提供了参考。
{"title":"Parameter estimation analysis of the glioblastoma immune model.","authors":"Biao Liu, Mengru Shen, Meiling Zhao","doi":"10.1142/S0219720025500088","DOIUrl":"10.1142/S0219720025500088","url":null,"abstract":"<p><p>In exploring optimal strategies for immunotherapy in glioblastoma (GBM), one of the main challenges is enhancing treatment response. To better understand the dynamics of tumor-immune interactions, one applied Bayesian methods to estimate the parameters of glioblastoma immune model by using experimental data. One compared the effects of using uniform prior distributions versus improved prior distributions, which were adjusted based on posterior information, during parameter estimation. In addition, a comparative analysis of the results obtained by using four Markov Chain Monte Carlo (MCMC) sampling algorithms which respectively are Metropolis, DEMetropolis, DEMetropolisZ and NUTS, were performed. The results showed that the improved prior distribution significantly enhanced the accuracy of the model parameter estimates, and reduced the variance of the posterior distribution, but increased computational time and resource demands. Furthermore, DEMetropolisZ provided such efficient sampling and narrower confidence intervals within a shorter time frame, which outperformed the others. In contrast, the efficiency and stability of the Metropolis method were relatively poor. Therefore, the importance of selecting appropriate prior distributions and sampling algorithms to improve both the accuracy and efficiency of model inference were studied. The study provides valuable insights for optimizing GBM immunotherapy strategies and serves as a reference for modeling and parameter estimation of complex biological systems.</p>","PeriodicalId":48910,"journal":{"name":"Journal of Bioinformatics and Computational Biology","volume":"23 4","pages":"2550008"},"PeriodicalIF":0.7,"publicationDate":"2025-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144849461","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Metagenomic sequence classification based on local sensitive hashing and Bi-LSTM. 基于局部敏感哈希和Bi-LSTM的宏基因组序列分类。
IF 0.7 4区 生物学 Q4 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2025-08-01 DOI: 10.1142/S021972002550012X
Yan Qian, Lei Xiao, Yiding Zhou, Li Deng

Current metagenomic classification methods are limited by short k-mer lengths and database dependency, resulting in insufficient taxonomic resolution at the species and genus level. This study proposes the first method integrating Locality-Sensitive Hashing (LSH) and Bidirectional Long-Short Term Memory (Bi-LSTM) networks for metagenomic sequence classification. The approach reduces runtime reliance on reference databases by learning discriminative features directly from sequences, while supporting long k-mers. The method consists of three key steps: (1) k-mer representation via locality-sensitive hashing, (2) k-mer embedding implementation using the skip-gram model, (3) label assignment to embedded vectors, followed by training in a Bi-LSTM network. Experimental results demonstrate superior classification performance at the genus level compared to existing models. Future work will explore the application of this method in the rapid detection of clinical pathogens.

目前的宏基因组分类方法受k-mer长度短和数据库依赖性的限制,导致在种和属水平上的分类分辨率不足。本研究首次提出了结合位置敏感哈希(LSH)和双向长短期记忆(Bi-LSTM)网络进行宏基因组序列分类的方法。该方法通过直接从序列中学习判别特征来减少对参考数据库的运行依赖,同时支持长k-mers。该方法包括三个关键步骤:(1)通过位置敏感哈希表示k-mer,(2)使用skip-gram模型实现k-mer嵌入,(3)为嵌入向量分配标签,然后在Bi-LSTM网络中进行训练。实验结果表明,与现有模型相比,该模型在属水平上具有更好的分类性能。今后的工作将探索该方法在临床病原体快速检测中的应用。
{"title":"Metagenomic sequence classification based on local sensitive hashing and Bi-LSTM.","authors":"Yan Qian, Lei Xiao, Yiding Zhou, Li Deng","doi":"10.1142/S021972002550012X","DOIUrl":"10.1142/S021972002550012X","url":null,"abstract":"<p><p>Current metagenomic classification methods are limited by short <i>k</i>-mer lengths and database dependency, resulting in insufficient taxonomic resolution at the species and genus level. This study proposes the first method integrating Locality-Sensitive Hashing (LSH) and Bidirectional Long-Short Term Memory (Bi-LSTM) networks for metagenomic sequence classification. The approach reduces runtime reliance on reference databases by learning discriminative features directly from sequences, while supporting long <i>k</i>-mers. The method consists of three key steps: (1) <i>k</i>-mer representation via locality-sensitive hashing, (2) <i>k</i>-mer embedding implementation using the skip-gram model, (3) label assignment to embedded vectors, followed by training in a Bi-LSTM network. Experimental results demonstrate superior classification performance at the genus level compared to existing models. Future work will explore the application of this method in the rapid detection of clinical pathogens.</p>","PeriodicalId":48910,"journal":{"name":"Journal of Bioinformatics and Computational Biology","volume":"23 4","pages":"2550012"},"PeriodicalIF":0.7,"publicationDate":"2025-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144849459","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Visual-SELEX: A technology ensemble for evaluating aptamer structural similarity via 3D visual spatial conformational analysis. visual - selex:通过三维视觉空间构象分析评估适体结构相似性的技术集成。
IF 0.7 4区 生物学 Q4 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2025-08-01 DOI: 10.1142/S0219720025500106
Nijia Wang

To date, the study of single-stranded DNA (ssDNA) similarity has focused mainly on the similarity of bases in the same position in the nucleic acid sequence. However, focusing only on the similarity of base sequences has limitations. This similarity evaluation considers only the one-dimensional similarity of ssDNA and cannot fully capture the three-dimensional (3D) structural consistency of aptamers for nucleic acids with 3D structures. Therefore, it is necessary to develop a program that can quickly and accurately evaluate the 3D spatial consistency of ssDNA. To this end, we designed a Visual-SELEX rapid response program, which uses a screening ssDNA sequence set enriched in the DKK1 protein for analysis. The program directly generates a stable 3D structure of ssDNA through coarse-grained simulation and molecular dynamics (MD) simulation, converts the structure into a point cloud model, and then analyzes the similarity of the spatial structure of ssDNA through point cloud model alignment and superposition. The analysis results show that Visual-SELEX can accurately match ssDNAs with dissimilar base fragments but similar 3D spatial structures, providing richer 3D spatial similarity information than sequence similarity comparison alone.

迄今为止,对单链DNA (ssDNA)相似性的研究主要集中在核酸序列中相同位置碱基的相似性上。然而,仅关注碱基序列的相似性有其局限性。这种相似性评价只考虑了ssDNA的一维相似性,不能完全捕捉具有三维结构的核酸适体的三维(3D)结构一致性。因此,有必要开发一种能够快速准确评估ssDNA三维空间一致性的程序。为此,我们设计了Visual-SELEX快速反应程序,该程序使用富含DKK1蛋白的筛选ssDNA序列集进行分析。该程序通过粗粒度模拟和分子动力学(MD)模拟直接生成稳定的ssDNA三维结构,将该结构转化为点云模型,然后通过点云模型对齐和叠加分析ssDNA空间结构的相似性。分析结果表明,Visual-SELEX可以准确匹配具有不同碱基片段但三维空间结构相似的ssdna,提供比单独比较序列相似性更丰富的三维空间相似性信息。
{"title":"Visual-SELEX: A technology ensemble for evaluating aptamer structural similarity via 3D visual spatial conformational analysis.","authors":"Nijia Wang","doi":"10.1142/S0219720025500106","DOIUrl":"10.1142/S0219720025500106","url":null,"abstract":"<p><p>To date, the study of single-stranded DNA (ssDNA) similarity has focused mainly on the similarity of bases in the same position in the nucleic acid sequence. However, focusing only on the similarity of base sequences has limitations. This similarity evaluation considers only the one-dimensional similarity of ssDNA and cannot fully capture the three-dimensional (3D) structural consistency of aptamers for nucleic acids with 3D structures. Therefore, it is necessary to develop a program that can quickly and accurately evaluate the 3D spatial consistency of ssDNA. To this end, we designed a Visual-SELEX rapid response program, which uses a screening ssDNA sequence set enriched in the DKK1 protein for analysis. The program directly generates a stable 3D structure of ssDNA through coarse-grained simulation and molecular dynamics (MD) simulation, converts the structure into a point cloud model, and then analyzes the similarity of the spatial structure of ssDNA through point cloud model alignment and superposition. The analysis results show that Visual-SELEX can accurately match ssDNAs with dissimilar base fragments but similar 3D spatial structures, providing richer 3D spatial similarity information than sequence similarity comparison alone.</p>","PeriodicalId":48910,"journal":{"name":"Journal of Bioinformatics and Computational Biology","volume":"23 4","pages":"2550010"},"PeriodicalIF":0.7,"publicationDate":"2025-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144849462","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
NanoporeInspect: An interactive tool for evaluating nanopore sequencing quality and ligation efficiency. 一个评价纳米孔测序质量和连接效率的交互式工具。
IF 0.7 4区 生物学 Q4 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2025-08-01 DOI: 10.1142/S0219720025500118
Maria A Grigoryeva, Maria G Khrenova, Maria I Zvereva

In nanopore sequencing, especially in SELEX-based aptamer discovery, the correct ligation of artificial sequences (primers, adapters, barcodes) is crucial for library quality. Errors at this stage can lead to misidentification of sequences and loss of valuable information. Existing quality control tools lack focused capabilities to assess the positioning and prevalence of these artificial sequences. NanoporeInspect is a web-based tool designed to fill this gap by providing targeted insights into ligation efficacy and systematic biases within sequencing data. NanoporeInspect operates as a user-friendly, web-based platform that leverages a modern software stack with Flask, Celery and Redis to handle scalable and asynchronous task processing, and Plotly to deliver interactive visualizations. Evaluation of NanoporeInspect on various nanopore datasets demonstrated its effectiveness in discerning differences in ligation quality. Libraries with inefficient ligation showed irregular adapter and barcode distributions, indicating preparation issues, while high-quality libraries displayed uniform patterns, reflecting effective ligation.

在纳米孔测序中,特别是在基于selex的适体发现中,人工序列(引物、适配器、条形码)的正确连接对文库质量至关重要。这一阶段的错误可能导致序列的错误识别和宝贵信息的丢失。现有的质量控制工具缺乏集中的能力来评估这些人工序列的定位和流行程度。NanoporeInspect是一个基于网络的工具,旨在填补这一空白,提供有针对性的见解,结扎效果和测序数据的系统性偏差。NanoporeInspect是一个用户友好的、基于web的平台,它利用Flask、芹菜和Redis的现代软件堆栈来处理可扩展和异步任务处理,并使用Plotly来提供交互式可视化。在各种纳米孔数据集上对NanoporeInspect的评估证明了它在识别连接质量差异方面的有效性。连接效率不高的文库适配器和条形码分布不规则,说明存在准备问题;而连接效率高的文库适配器和条形码分布均匀,说明连接效率高。
{"title":"NanoporeInspect: An interactive tool for evaluating nanopore sequencing quality and ligation efficiency.","authors":"Maria A Grigoryeva, Maria G Khrenova, Maria I Zvereva","doi":"10.1142/S0219720025500118","DOIUrl":"10.1142/S0219720025500118","url":null,"abstract":"<p><p>In nanopore sequencing, especially in SELEX-based aptamer discovery, the correct ligation of artificial sequences (primers, adapters, barcodes) is crucial for library quality. Errors at this stage can lead to misidentification of sequences and loss of valuable information. Existing quality control tools lack focused capabilities to assess the positioning and prevalence of these artificial sequences. NanoporeInspect is a web-based tool designed to fill this gap by providing targeted insights into ligation efficacy and systematic biases within sequencing data. NanoporeInspect operates as a user-friendly, web-based platform that leverages a modern software stack with Flask, Celery and Redis to handle scalable and asynchronous task processing, and Plotly to deliver interactive visualizations. Evaluation of NanoporeInspect on various nanopore datasets demonstrated its effectiveness in discerning differences in ligation quality. Libraries with inefficient ligation showed irregular adapter and barcode distributions, indicating preparation issues, while high-quality libraries displayed uniform patterns, reflecting effective ligation.</p>","PeriodicalId":48910,"journal":{"name":"Journal of Bioinformatics and Computational Biology","volume":"23 4","pages":"2550011"},"PeriodicalIF":0.7,"publicationDate":"2025-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144849460","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Deep learning inference of miRNA expression from bulk and single-cell mRNA expression. 从大细胞和单细胞mRNA表达中深度学习推断miRNA表达。
IF 0.7 4区 生物学 Q4 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2025-06-01 DOI: 10.1142/S021972002550009X
Rony Chowdhury Ripan, Tasbiraha Athaya, Xiaoman Li, Haiyan Hu

Studying miRNA activity at the single-cell level presents a significant challenge due to the limitations of existing single-cell technologies in capturing miRNAs. To address this, we introduce two deep learning models: Cross-modality (CM) and single-modality (SM), both based on encoder-decoder architectures. These models predict miRNA expression at both bulk and single-cell levels using mRNA data. We evaluated the performance of CM and SM against the state-of-the-art miRSCAPE approach, using both bulk and single-cell datasets. Our results demonstrate that both CM and SM outperform miRSCAPE in accuracy. Furthermore, incorporating miRNA target information substantially enhanced performance compared to models that utilized all genes. These models provide powerful tools for predicting miRNA expression from single-cell mRNA data.

由于现有单细胞技术在捕获miRNA方面的局限性,在单细胞水平上研究miRNA活性提出了一个重大挑战。为了解决这个问题,我们引入了两种深度学习模型:跨模态(CM)和单模态(SM),它们都基于编码器-解码器架构。这些模型使用mRNA数据预测大细胞和单细胞水平上的miRNA表达。我们使用大容量和单细胞数据集,对比最先进的miRSCAPE方法,评估了CM和SM的性能。我们的结果表明,CM和SM在精度上都优于miRSCAPE。此外,与使用所有基因的模型相比,纳入miRNA目标信息大大提高了性能。这些模型为从单细胞mRNA数据预测miRNA表达提供了强大的工具。
{"title":"Deep learning inference of miRNA expression from bulk and single-cell mRNA expression.","authors":"Rony Chowdhury Ripan, Tasbiraha Athaya, Xiaoman Li, Haiyan Hu","doi":"10.1142/S021972002550009X","DOIUrl":"10.1142/S021972002550009X","url":null,"abstract":"<p><p>Studying miRNA activity at the single-cell level presents a significant challenge due to the limitations of existing single-cell technologies in capturing miRNAs. To address this, we introduce two deep learning models: Cross-modality (CM) and single-modality (SM), both based on encoder-decoder architectures. These models predict miRNA expression at both bulk and single-cell levels using mRNA data. We evaluated the performance of CM and SM against the state-of-the-art miRSCAPE approach, using both bulk and single-cell datasets. Our results demonstrate that both CM and SM outperform miRSCAPE in accuracy. Furthermore, incorporating miRNA target information substantially enhanced performance compared to models that utilized all genes. These models provide powerful tools for predicting miRNA expression from single-cell mRNA data.</p>","PeriodicalId":48910,"journal":{"name":"Journal of Bioinformatics and Computational Biology","volume":"23 3","pages":"2550009"},"PeriodicalIF":0.7,"publicationDate":"2025-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144734511","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An algorithm for peptide de novo sequencing from a group of SILAC labeled MS/MS spectra. 从一组SILAC标记的MS/MS光谱中进行肽从头测序的算法。
IF 0.7 4区 生物学 Q4 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2025-06-01 Epub Date: 2025-07-15 DOI: 10.1142/S0219720025500076
Fang Han, Kaizhong Zhang

Shotgun proteomics coupled with high-performance liquid chromatography and mass spectrometry has been instrumental in identifying proteins in complex mixtures. Effective computational approaches are required to automate the spectra interpretation process to handle the vast amount of data collected in a single Liquid Chromatography-Tandem Mass Spectrometry (LC-MS/MS) run. De novo sequencing from MS/MS has emerged as a vital technology for peptide sequencing in proteomics. To enhance the accuracy and practicality of de novo sequencing, previous algorithms have utilized multiple spectra to identify peptide sequences. Here, our study focuses on de novo sequencing of multiple tandem mass spectra of peptides with stable isotope labeling with amino acids in cell culture (SILAC) by incorporating different isotope-labeled amino acids into newly synthesized proteins. Multiple MS/MS spectra for the same peptide sequence are produced by the spectrometer after the SILAC samples undergo processing by LC-MS/MS shotgun proteomics. Taking into consideration the factors such as retention time and precursor ion mass, we aim to identify the peptide sequence with specific SILAC modifications and their locations. To do so, we propose de novo sequencing algorithms to compute the potential candidate peptide sequence by using similarity scores, followed by refinement algorithms to evaluate them. We also use real experimental data to test the algorithms.

霰弹枪蛋白质组学与高效液相色谱和质谱相结合,在鉴定复杂混合物中的蛋白质方面发挥了重要作用。需要有效的计算方法来自动化光谱解释过程,以处理在单次液相色谱-串联质谱(LC-MS/MS)运行中收集的大量数据。MS/MS从头测序已成为蛋白质组学中肽段测序的重要技术。为了提高从头测序的准确性和实用性,以前的算法利用多光谱来识别肽序列。在这里,我们的研究重点是通过将不同的同位素标记的氨基酸加入到新合成的蛋白质中,对具有稳定同位素标记的细胞培养氨基酸(SILAC)肽的多个串联质谱进行从头测序。SILAC样品经LC-MS/MS霰弹枪蛋白质组学处理后,谱仪可生成多个相同肽序列的MS/MS谱图。考虑到保留时间和前体离子质量等因素,我们的目标是确定具有特定SILAC修饰的肽序列及其位置。为此,我们提出了从头测序算法,通过使用相似性评分来计算潜在的候选肽序列,然后使用改进算法来评估它们。我们还用真实的实验数据对算法进行了验证。
{"title":"An algorithm for peptide de novo sequencing from a group of SILAC labeled MS/MS spectra.","authors":"Fang Han, Kaizhong Zhang","doi":"10.1142/S0219720025500076","DOIUrl":"10.1142/S0219720025500076","url":null,"abstract":"<p><p>Shotgun proteomics coupled with high-performance liquid chromatography and mass spectrometry has been instrumental in identifying proteins in complex mixtures. Effective computational approaches are required to automate the spectra interpretation process to handle the vast amount of data collected in a single Liquid Chromatography-Tandem Mass Spectrometry (LC-MS/MS) run. De novo sequencing from MS/MS has emerged as a vital technology for peptide sequencing in proteomics. To enhance the accuracy and practicality of de novo sequencing, previous algorithms have utilized multiple spectra to identify peptide sequences. Here, our study focuses on de novo sequencing of multiple tandem mass spectra of peptides with stable isotope labeling with amino acids in cell culture (SILAC) by incorporating different isotope-labeled amino acids into newly synthesized proteins. Multiple MS/MS spectra for the same peptide sequence are produced by the spectrometer after the SILAC samples undergo processing by LC-MS/MS shotgun proteomics. Taking into consideration the factors such as retention time and precursor ion mass, we aim to identify the peptide sequence with specific SILAC modifications and their locations. To do so, we propose de novo sequencing algorithms to compute the potential candidate peptide sequence by using similarity scores, followed by refinement algorithms to evaluate them. We also use real experimental data to test the algorithms.</p>","PeriodicalId":48910,"journal":{"name":"Journal of Bioinformatics and Computational Biology","volume":" ","pages":"2550007"},"PeriodicalIF":0.7,"publicationDate":"2025-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144568024","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A brief review and comparative analysis of RNA secondary structure prediction tools. RNA二级结构预测工具综述及比较分析。
IF 0.7 4区 生物学 Q4 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2025-06-01 Epub Date: 2025-07-21 DOI: 10.1142/S0219720025300011
Pranav Ballaney, Gourav Saha, Vaibhav Kulshrestha, Poojan Hasmukhray Thaker, Prakhar Hasija, Indrani Talukdar, Raviprasad Aduri

Ribonucleic acid (RNA) lies at the heart of the central dogma. It spans the breadth of biological functions, from information storage to gene regulation and catalysis. RNA molecules must attain specific structures to perform these functions, and their structures depend on their sequences. Predicting the structure of RNA has been a central problem in computational biology. Various methods have been developed for this purpose - while some consider the thermodynamics of folding, others abstract away the details behind neural networks (NN). This paper presents a brief overview of the existing tools for predicting RNA secondary structures from a given single RNA sequence. Furthermore, a comparative analysis of the different prediction software packages is also presented. Performance is analyzed by running each of the available software packages on a novel dataset developed using 3D crystal structures of RNA. Software packages considered include those that can predict pseudoknots along with those that cannot. Variation in software performance based on the length and type of RNA is described.

核糖核酸(RNA)位于中心法则的核心。它涵盖了从信息存储到基因调控和催化的生物学功能。RNA分子必须达到特定的结构来执行这些功能,而它们的结构取决于它们的序列。预测RNA的结构一直是计算生物学中的一个核心问题。为此目的开发了各种方法——有些考虑折叠的热力学,有些则抽象出神经网络(NN)背后的细节。本文介绍了从给定的单个RNA序列预测RNA二级结构的现有工具的简要概述。此外,还对不同预测软件包进行了对比分析。通过在使用RNA的3D晶体结构开发的新数据集上运行每个可用软件包来分析性能。考虑的软件包包括那些可以预测伪结的软件包和那些不能预测伪结的软件包。描述了基于RNA长度和类型的软件性能变化。
{"title":"A brief review and comparative analysis of RNA secondary structure prediction tools.","authors":"Pranav Ballaney, Gourav Saha, Vaibhav Kulshrestha, Poojan Hasmukhray Thaker, Prakhar Hasija, Indrani Talukdar, Raviprasad Aduri","doi":"10.1142/S0219720025300011","DOIUrl":"10.1142/S0219720025300011","url":null,"abstract":"<p><p>Ribonucleic acid (RNA) lies at the heart of the central dogma. It spans the breadth of biological functions, from information storage to gene regulation and catalysis. RNA molecules must attain specific structures to perform these functions, and their structures depend on their sequences. Predicting the structure of RNA has been a central problem in computational biology. Various methods have been developed for this purpose - while some consider the thermodynamics of folding, others abstract away the details behind neural networks (NN). This paper presents a brief overview of the existing tools for predicting RNA secondary structures from a given single RNA sequence. Furthermore, a comparative analysis of the different prediction software packages is also presented. Performance is analyzed by running each of the available software packages on a novel dataset developed using 3D crystal structures of RNA. Software packages considered include those that can predict pseudoknots along with those that cannot. Variation in software performance based on the length and type of RNA is described.</p>","PeriodicalId":48910,"journal":{"name":"Journal of Bioinformatics and Computational Biology","volume":" ","pages":"2530001"},"PeriodicalIF":0.7,"publicationDate":"2025-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144683501","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Fractal dimensionality of a coiled helical coil. 螺旋线圈的分形维数。
IF 0.7 4区 生物学 Q4 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2025-06-01 Epub Date: 2025-06-12 DOI: 10.1142/S0219720025710015
Subhash Kak

The helical coil is ubiquitous in biological and natural systems, and it is often the basic form in complex structures. This paper considers the question of its dimensionality, D, in biological information as the helical coil goes through recursive coiling as in DNA and RNA molecules in chromatin, in which the D-value is a function of the lengthening of the curve. It is shown that the dimensionality of coiled coils is virtually equal to e. Of the three forms of DNA, the dimensionality of the B-form is nearest to the optimal value, and this might be the reason why it is most common.

螺旋线圈在生物和自然系统中无处不在,它往往是复杂结构的基本形式。本文考虑了生物信息中的D维问题,当螺旋线圈像染色质中的DNA和RNA分子一样经过递归卷曲时,其中D值是曲线长度的函数。结果表明,盘绕线圈的维数实际上等于e。在三种形式的DNA中,b形式的维数最接近最佳值,这可能是它最常见的原因。
{"title":"Fractal dimensionality of a coiled helical coil.","authors":"Subhash Kak","doi":"10.1142/S0219720025710015","DOIUrl":"10.1142/S0219720025710015","url":null,"abstract":"<p><p>The helical coil is ubiquitous in biological and natural systems, and it is often the basic form in complex structures. This paper considers the question of its dimensionality, <i>D</i>, in biological information as the helical coil goes through recursive coiling as in DNA and RNA molecules in chromatin, in which the <i>D</i>-value is a function of the lengthening of the curve. It is shown that the dimensionality of coiled coils is virtually equal to <i>e</i>. Of the three forms of DNA, the dimensionality of the B-form is nearest to the optimal value, and this might be the reason why it is most common.</p>","PeriodicalId":48910,"journal":{"name":"Journal of Bioinformatics and Computational Biology","volume":" ","pages":"2571001"},"PeriodicalIF":0.7,"publicationDate":"2025-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144286929","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Computational modeling and dynamical analysis for B. subtilis competence genic regulation circuit with multiple time delays and external noisy regulation. 具有多时滞和外部噪声调控的枯草芽孢杆菌能力基因调控电路的计算建模与动力学分析。
IF 0.9 4区 生物学 Q4 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2025-04-01 Epub Date: 2025-06-05 DOI: 10.1142/S0219720025500052
Na Zhao, Haihong Liu, Fang Yan

Bacillus subtilis (B. subtilis), a bacterium known to enter a competent state spontaneously, has garnered significant attention due to its intricate internal regulatory mechanisms. This study proposes a six-dimensional continuous delay differential equation (DDE) model incorporating two-time delays and a stochastic model that accounts for noise, aimed at delving deeper into the dynamic behaviors of the B. subtilis competence gene regulation circuit. Our investigation reveals that time delays play a crucial role in inducing oscillatory behavior within the continuous DDE model. Analyzing the dynamics of multiple time delays proves to be more intricate than studying a single delay. Furthermore, certain parameter adjustments significantly influence the system's dynamic characteristics. The introduction of noise also triggers oscillations, with the irregular oscillation patterns closely aligning with real-world observations. Intriguingly, the effects of parameters and noise regulation undergo significant changes when time delays are jointly considered. This analysis offers a fresh perspective on understanding B. subtilis competence and provides essential theoretical support for subsequent experimental endeavors in this domain of biomathematics.

枯草芽孢杆菌(Bacillus subtilis,简称B. subtilis)是一种已知能自发进入能态的细菌,由于其复杂的内部调控机制而引起了人们的极大关注。为了深入研究枯草芽孢杆菌能力基因调控回路的动态行为,本研究提出了一个包含双时间延迟和考虑噪声的随机模型的六维连续延迟微分方程(DDE)模型。我们的研究表明,在连续DDE模型中,时间延迟在诱导振荡行为中起着至关重要的作用。分析多时滞的动力学比研究单个时滞要复杂得多。此外,某些参数的调整会显著影响系统的动态特性。噪声的引入也会引发振荡,这种不规则的振荡模式与现实世界的观测结果密切相关。有趣的是,当共同考虑时间延迟时,参数和噪声调节的效果会发生显着变化。该分析为了解枯草芽孢杆菌的能力提供了新的视角,并为该生物数学领域的后续实验工作提供了重要的理论支持。
{"title":"Computational modeling and dynamical analysis for B. subtilis competence genic regulation circuit with multiple time delays and external noisy regulation.","authors":"Na Zhao, Haihong Liu, Fang Yan","doi":"10.1142/S0219720025500052","DOIUrl":"10.1142/S0219720025500052","url":null,"abstract":"<p><p>Bacillus subtilis (B. subtilis), a bacterium known to enter a competent state spontaneously, has garnered significant attention due to its intricate internal regulatory mechanisms. This study proposes a six-dimensional continuous delay differential equation (DDE) model incorporating two-time delays and a stochastic model that accounts for noise, aimed at delving deeper into the dynamic behaviors of the B. subtilis competence gene regulation circuit. Our investigation reveals that time delays play a crucial role in inducing oscillatory behavior within the continuous DDE model. Analyzing the dynamics of multiple time delays proves to be more intricate than studying a single delay. Furthermore, certain parameter adjustments significantly influence the system's dynamic characteristics. The introduction of noise also triggers oscillations, with the irregular oscillation patterns closely aligning with real-world observations. Intriguingly, the effects of parameters and noise regulation undergo significant changes when time delays are jointly considered. This analysis offers a fresh perspective on understanding B. subtilis competence and provides essential theoretical support for subsequent experimental endeavors in this domain of biomathematics.</p>","PeriodicalId":48910,"journal":{"name":"Journal of Bioinformatics and Computational Biology","volume":"23 2","pages":"2550005"},"PeriodicalIF":0.9,"publicationDate":"2025-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144267689","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Journal of Bioinformatics and Computational Biology
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1