首页 > 最新文献

BMC Bioinformatics最新文献

英文 中文
Improving protein interaction prediction in GenPPi: a novel interaction sampling approach preserving network topology. 改进GenPPi中的蛋白质相互作用预测:一种保持网络拓扑结构的新型相互作用采样方法。
IF 3.3 3区 生物学 Q2 BIOCHEMICAL RESEARCH METHODS Pub Date : 2025-12-29 DOI: 10.1186/s12859-025-06325-8
Alisson Silva, Carlos Marquez, Iury Godoy, Lucas Silva, Matheus Prado, Murilo Beppler, Natanael Avila, Bruno Travençolo, Anderson R Santos

Background: Computational prediction of protein-protein interactions (PPIs) is crucial for understanding cell biology and drug development, offering an alternative to costly experimental methods. The original GenPPi software advanced ab initio PPI network prediction from bacterial genomes but was limited by its reliance on high sequence similarity. This work introduces GenPPi 1.5 to enhance these predictive capabilities.

Results: GenPPi 1.5 incorporates a Random Forest (RF) algorithm, trained on 60 biophysical features from amino acid propensity indices, to classify protein similarity even in low sequence identity scenarios (targeting >65% identity). To manage computational complexity from the increased interactions generated by the RF model, especially in extensive conserved phylogenetic profiles, we developed and integrated the Reduced Interaction Sampling (RIS) algorithm. RIS stochastically samples interactions within these profiles, optimizing performance for complete genome analysis. Extensive simulations across various configurations validated the methodology. RF integration significantly broadened GenPPi's predictive power; application to Buchnera aphidicola showed up to 62% overlap with STRING database interactions. Analysis of RIS demonstrated that while introducing some randomness, critical node identification remains robust, particularly for Top_N values ≥ 100, indicating minimal compromise to network integrity.

Conclusion: The combination of Machine Learning (RF) and the RIS algorithm in GenPPi 1.5 represents a significant advancement. It overcomes the high-similarity dependency of the previous version while efficiently handling complex genomes. GenPPi 1.5 provides a robust and scalable alignment-free PPI prediction solution, enabling users to train custom models tailored to specific genomic contexts. GenPPi is freely available on our website https://genppi.facom.ufu.br/ , its source code is hosted on GitHub https://github.com/santosardr/genppi , and it can be easily installed via the Python Package Index using the command pip install genppi-py.

背景:蛋白质-蛋白质相互作用(PPIs)的计算预测对于理解细胞生物学和药物开发至关重要,为昂贵的实验方法提供了一种替代方法。最初的GenPPi软件从细菌基因组中进行从头算PPI网络预测,但由于依赖于高序列相似性而受到限制。本工作引入了GenPPi 1.5来增强这些预测能力。结果:GenPPi 1.5采用随机森林(Random Forest, RF)算法,对来自氨基酸倾向指数的60种生物物理特征进行训练,即使在低序列同一性情况下(目标为>65%同一性),也能对蛋白质相似性进行分类。为了管理由RF模型产生的增加的相互作用带来的计算复杂性,特别是在广泛保守的系统发育剖面中,我们开发并集成了减少相互作用采样(RIS)算法。RIS随机取样这些谱中的相互作用,优化全基因组分析的性能。各种配置的大量模拟验证了该方法。射频集成显著提高了GenPPi的预测能力;应用程序显示,与STRING数据库交互的重叠高达62%。RIS的分析表明,虽然引入了一些随机性,但关键节点识别仍然是稳健的,特别是当Top_N值≥100时,这表明对网络完整性的损害最小。结论:GenPPi 1.5中机器学习(RF)与RIS算法的结合是一个显著的进步。它克服了先前版本的高度相似性依赖,同时有效地处理复杂的基因组。GenPPi 1.5提供了一个强大且可扩展的无对齐PPI预测解决方案,使用户能够根据特定的基因组背景训练定制模型。GenPPi可以在我们的网站https://genppi.facom.ufu.br/上免费获得,其源代码托管在GitHub https://github.com/santosardr/genppi上,并且可以使用pip install GenPPi -py命令通过Python Package Index轻松安装。
{"title":"Improving protein interaction prediction in GenPPi: a novel interaction sampling approach preserving network topology.","authors":"Alisson Silva, Carlos Marquez, Iury Godoy, Lucas Silva, Matheus Prado, Murilo Beppler, Natanael Avila, Bruno Travençolo, Anderson R Santos","doi":"10.1186/s12859-025-06325-8","DOIUrl":"10.1186/s12859-025-06325-8","url":null,"abstract":"<p><strong>Background: </strong>Computational prediction of protein-protein interactions (PPIs) is crucial for understanding cell biology and drug development, offering an alternative to costly experimental methods. The original GenPPi software advanced ab initio PPI network prediction from bacterial genomes but was limited by its reliance on high sequence similarity. This work introduces GenPPi 1.5 to enhance these predictive capabilities.</p><p><strong>Results: </strong>GenPPi 1.5 incorporates a Random Forest (RF) algorithm, trained on 60 biophysical features from amino acid propensity indices, to classify protein similarity even in low sequence identity scenarios (targeting >65% identity). To manage computational complexity from the increased interactions generated by the RF model, especially in extensive conserved phylogenetic profiles, we developed and integrated the Reduced Interaction Sampling (RIS) algorithm. RIS stochastically samples interactions within these profiles, optimizing performance for complete genome analysis. Extensive simulations across various configurations validated the methodology. RF integration significantly broadened GenPPi's predictive power; application to Buchnera aphidicola showed up to 62% overlap with STRING database interactions. Analysis of RIS demonstrated that while introducing some randomness, critical node identification remains robust, particularly for Top_N values ≥ 100, indicating minimal compromise to network integrity.</p><p><strong>Conclusion: </strong>The combination of Machine Learning (RF) and the RIS algorithm in GenPPi 1.5 represents a significant advancement. It overcomes the high-similarity dependency of the previous version while efficiently handling complex genomes. GenPPi 1.5 provides a robust and scalable alignment-free PPI prediction solution, enabling users to train custom models tailored to specific genomic contexts. GenPPi is freely available on our website https://genppi.facom.ufu.br/ , its source code is hosted on GitHub https://github.com/santosardr/genppi , and it can be easily installed via the Python Package Index using the command pip install genppi-py.</p>","PeriodicalId":8958,"journal":{"name":"BMC Bioinformatics","volume":"26 1","pages":"296"},"PeriodicalIF":3.3,"publicationDate":"2025-12-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12751606/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145854196","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Prodigy protein: Python package for zero-shot protein engineering using protein language models. Prodigy protein: Python包,用于使用蛋白质语言模型的零射击蛋白质工程。
IF 3.3 3区 生物学 Q2 BIOCHEMICAL RESEARCH METHODS Pub Date : 2025-12-29 DOI: 10.1186/s12859-025-06316-9
Matthew Massett, Adrian Carr

Background: Protein Language Models (PLMs) are emerging as powerful tools for designing human proteins, including antibodies. These models can predict the effects of mutations in a zero-shot setting-without requiring additional fine-tuning-and suggest plausible amino acid substitutions.

Results: We introduce Protein Diversification and Generation through Yielded Mutations (Prodigy) Protein which provides several DirectedEvolution classes that introduce amino acid substitutions in a stepwise manner. Each substitution is evaluated using one of two scoring strategies, and the most promising candidates are sampled accordingly. Users can customize the number of evolution steps, specify target regions within the protein sequence, and set score thresholds to filter out low-quality substitutions during the design process.

Conclusion: Protein Diversification and Generation through Yielded Mutations (Prodigy) Protein is a fast and flexible tool for in silico protein design. It introduces a consistent and efficient probabilistic framework that leverages any masked language modeling Protein Language Model (PLM) available via Hugging Face. Unlike existing tools, Prodigy Protein can integrate multiple PLMs to design protein variants-an approach not currently supported by other publicly available software.

背景:蛋白质语言模型(PLMs)正在成为设计人类蛋白质(包括抗体)的强大工具。这些模型可以在零突变的情况下预测突变的影响——不需要额外的微调——并建议合理的氨基酸替代。结果:我们通过产生的突变(Prodigy)蛋白引入了蛋白质多样化和生成,该蛋白提供了几个定向进化类,以逐步的方式引入氨基酸取代。每个替换都使用两种评分策略中的一种进行评估,并相应地对最有希望的候选对象进行抽样。用户可以自定义进化步骤的数量,在蛋白质序列中指定目标区域,并设置评分阈值,以过滤掉设计过程中低质量的替代。结论:Prodigy蛋白是一种快速、灵活的硅蛋白设计工具。它引入了一个一致和有效的概率框架,利用任何屏蔽语言建模蛋白质语言模型(PLM)通过拥抱脸可用。与现有的工具不同,Prodigy Protein可以集成多个plm来设计蛋白质变体,这是目前其他公开软件不支持的一种方法。
{"title":"Prodigy protein: Python package for zero-shot protein engineering using protein language models.","authors":"Matthew Massett, Adrian Carr","doi":"10.1186/s12859-025-06316-9","DOIUrl":"10.1186/s12859-025-06316-9","url":null,"abstract":"<p><strong>Background: </strong>Protein Language Models (PLMs) are emerging as powerful tools for designing human proteins, including antibodies. These models can predict the effects of mutations in a zero-shot setting-without requiring additional fine-tuning-and suggest plausible amino acid substitutions.</p><p><strong>Results: </strong>We introduce Protein Diversification and Generation through Yielded Mutations (Prodigy) Protein which provides several DirectedEvolution classes that introduce amino acid substitutions in a stepwise manner. Each substitution is evaluated using one of two scoring strategies, and the most promising candidates are sampled accordingly. Users can customize the number of evolution steps, specify target regions within the protein sequence, and set score thresholds to filter out low-quality substitutions during the design process.</p><p><strong>Conclusion: </strong>Protein Diversification and Generation through Yielded Mutations (Prodigy) Protein is a fast and flexible tool for in silico protein design. It introduces a consistent and efficient probabilistic framework that leverages any masked language modeling Protein Language Model (PLM) available via Hugging Face. Unlike existing tools, Prodigy Protein can integrate multiple PLMs to design protein variants-an approach not currently supported by other publicly available software.</p>","PeriodicalId":8958,"journal":{"name":"BMC Bioinformatics","volume":"26 1","pages":"298"},"PeriodicalIF":3.3,"publicationDate":"2025-12-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12751917/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145854235","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Robust subspace structure discovery for cell type identification in scRNA-seq data. 基于scRNA-seq数据的细胞类型识别的稳健子空间结构发现。
IF 3.3 3区 生物学 Q2 BIOCHEMICAL RESEARCH METHODS Pub Date : 2025-12-29 DOI: 10.1186/s12859-025-06317-8
Xianyong Zhou, Xindian Wei, Cheng Liu, Wenjun Shen, Ping Xuan, Si Wu, Hau-San Wong

Single-cell RNA sequencing (scRNA-seq) technology has transformed gene expression studies by enabling analysis at the individual cell level, offering unprecedented insights into cellular heterogeneity. A key challenge in scRNA-seq data analysis is cell type identification, which requires grouping cells with similar gene expression profiles using unsupervised clustering methods. However, the high dimensionality, inherent noise, and significant sparsity of scRNA-seq data present substantial obstacles to accurately determining relationships among cell samples. To address these challenges, we propose a novel deep subspace clustering approach for cell type identification that captures a more reliable subspace structure from scRNA-seq data. Our method leverages a robust self-representation learning framework to effectively characterize and learn the underlying cluster structure. This framework is optimized through an integrated strategy combining a structure-guided approach with an optimal transport algorithm, enhancing the robustness of the subspace clustering process. By mitigating the effects of noise and sparsity in scRNA-seq data, this approach enables more accurate cell clustering. Experimental results on 18 real scRNA-seq datasets demonstrate that our method outperforms several state-of-the-art clustering approaches tailored for scRNA-seq data, excelling in both accuracy and interpretability.

单细胞RNA测序(scRNA-seq)技术通过在单个细胞水平上进行分析,改变了基因表达研究,为细胞异质性提供了前所未有的见解。scRNA-seq数据分析的一个关键挑战是细胞类型鉴定,这需要使用无监督聚类方法对具有相似基因表达谱的细胞进行分组。然而,scRNA-seq数据的高维性、固有的噪声和显著的稀疏性给准确确定细胞样本之间的关系带来了实质性的障碍。为了解决这些挑战,我们提出了一种新的深度子空间聚类方法,用于细胞类型鉴定,从scRNA-seq数据中捕获更可靠的子空间结构。我们的方法利用一个鲁棒的自表示学习框架来有效地表征和学习底层集群结构。该框架通过结合结构导向方法和最优传输算法的集成策略进行优化,增强了子空间聚类过程的鲁棒性。通过减轻scRNA-seq数据中的噪声和稀疏性的影响,该方法可以实现更准确的细胞聚类。在18个真实的scRNA-seq数据集上的实验结果表明,我们的方法优于为scRNA-seq数据定制的几种最先进的聚类方法,在准确性和可解释性方面都表现出色。
{"title":"Robust subspace structure discovery for cell type identification in scRNA-seq data.","authors":"Xianyong Zhou, Xindian Wei, Cheng Liu, Wenjun Shen, Ping Xuan, Si Wu, Hau-San Wong","doi":"10.1186/s12859-025-06317-8","DOIUrl":"10.1186/s12859-025-06317-8","url":null,"abstract":"<p><p>Single-cell RNA sequencing (scRNA-seq) technology has transformed gene expression studies by enabling analysis at the individual cell level, offering unprecedented insights into cellular heterogeneity. A key challenge in scRNA-seq data analysis is cell type identification, which requires grouping cells with similar gene expression profiles using unsupervised clustering methods. However, the high dimensionality, inherent noise, and significant sparsity of scRNA-seq data present substantial obstacles to accurately determining relationships among cell samples. To address these challenges, we propose a novel deep subspace clustering approach for cell type identification that captures a more reliable subspace structure from scRNA-seq data. Our method leverages a robust self-representation learning framework to effectively characterize and learn the underlying cluster structure. This framework is optimized through an integrated strategy combining a structure-guided approach with an optimal transport algorithm, enhancing the robustness of the subspace clustering process. By mitigating the effects of noise and sparsity in scRNA-seq data, this approach enables more accurate cell clustering. Experimental results on 18 real scRNA-seq datasets demonstrate that our method outperforms several state-of-the-art clustering approaches tailored for scRNA-seq data, excelling in both accuracy and interpretability.</p>","PeriodicalId":8958,"journal":{"name":"BMC Bioinformatics","volume":"26 1","pages":"295"},"PeriodicalIF":3.3,"publicationDate":"2025-12-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12752283/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145854162","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
DriverSub-SVM: a machine learning approach for cancer subtype classification by integrating patient-specific and global driver genes. DriverSub-SVM:一种通过整合患者特异性和全局驱动基因进行癌症亚型分类的机器学习方法。
IF 3.3 3区 生物学 Q2 BIOCHEMICAL RESEARCH METHODS Pub Date : 2025-12-29 DOI: 10.1186/s12859-025-06318-7
Junrong Song, Yuanli Gong, Zhiming Song, Xinggui Xu, Kun Qian, Yingbo Liu

Background: Cancer's complexity and heterogeneity pose significant challenges for personalized treatment. Accurate classification of patients into molecular subtypes is critical for targeted therapy and improved outcomes. However, existing methods often fail to simultaneously capture inter-patient heterogeneity and shared molecular patterns in driver gene profiles.

Results: To address this limitation, we propose DriverSub-SVM, a novel framework for interpretable cancer subtype classification that integrates patient-specific and cohort-wide driver gene information. Our method first models the bidirectional influence between mutated and dysregulated genes via a random walk on a functional interaction network. It then applies Bayesian Personalized Ranking (BPR) to infer personalized driver gene rankings for each patient. These rankings are aggregated into a consensus driver gene set using the Condorcet. Subsequently, a One-Against-One Multiclass Support Vector Machine (OAO-MSVM) classifies patients based on their gene-level profiles. Evaluated on multiple TCGA datasets, DriverSub-SVM outperformed four state-of-the-art methods, achieving higher accuracy and identifying clinically relevant genes associated with prognosis and therapeutic response.

Conclusion: DriverSub-SVM offers an effective and interpretable approach for cancer subtype classification by bridging individual heterogeneity and population-level patterns. It enhances understanding of tumor biology and holds promise for precision oncology and biomarker discovery. The source code is available at https://github.com/sjunrong/DriverSub-SVM .

背景:癌症的复杂性和异质性对个性化治疗提出了重大挑战。准确地将患者分类为分子亚型对于靶向治疗和改善结果至关重要。然而,现有的方法往往不能同时捕获患者间异质性和驱动基因谱中的共享分子模式。结果:为了解决这一限制,我们提出了DriverSub-SVM,这是一个可解释的癌症亚型分类的新框架,整合了患者特异性和队列范围的驱动基因信息。我们的方法首先通过在功能相互作用网络上的随机漫步来模拟突变和失调基因之间的双向影响。然后应用贝叶斯个性化排名(BPR)来推断每个患者的个性化驱动基因排名。这些排名被汇总成一个共识驱动基因集使用孔多塞。随后,一对一多类支持向量机(OAO-MSVM)根据患者的基因水平谱对其进行分类。在多个TCGA数据集上进行评估,DriverSub-SVM优于四种最先进的方法,实现了更高的准确性,并识别出与预后和治疗反应相关的临床相关基因。结论:DriverSub-SVM通过连接个体异质性和人群水平模式,为癌症亚型分类提供了一种有效且可解释的方法。它提高了对肿瘤生物学的理解,并为精确肿瘤学和生物标志物的发现带来了希望。源代码可从https://github.com/sjunrong/DriverSub-SVM获得。
{"title":"DriverSub-SVM: a machine learning approach for cancer subtype classification by integrating patient-specific and global driver genes.","authors":"Junrong Song, Yuanli Gong, Zhiming Song, Xinggui Xu, Kun Qian, Yingbo Liu","doi":"10.1186/s12859-025-06318-7","DOIUrl":"10.1186/s12859-025-06318-7","url":null,"abstract":"<p><strong>Background: </strong>Cancer's complexity and heterogeneity pose significant challenges for personalized treatment. Accurate classification of patients into molecular subtypes is critical for targeted therapy and improved outcomes. However, existing methods often fail to simultaneously capture inter-patient heterogeneity and shared molecular patterns in driver gene profiles.</p><p><strong>Results: </strong>To address this limitation, we propose DriverSub-SVM, a novel framework for interpretable cancer subtype classification that integrates patient-specific and cohort-wide driver gene information. Our method first models the bidirectional influence between mutated and dysregulated genes via a random walk on a functional interaction network. It then applies Bayesian Personalized Ranking (BPR) to infer personalized driver gene rankings for each patient. These rankings are aggregated into a consensus driver gene set using the Condorcet. Subsequently, a One-Against-One Multiclass Support Vector Machine (OAO-MSVM) classifies patients based on their gene-level profiles. Evaluated on multiple TCGA datasets, DriverSub-SVM outperformed four state-of-the-art methods, achieving higher accuracy and identifying clinically relevant genes associated with prognosis and therapeutic response.</p><p><strong>Conclusion: </strong>DriverSub-SVM offers an effective and interpretable approach for cancer subtype classification by bridging individual heterogeneity and population-level patterns. It enhances understanding of tumor biology and holds promise for precision oncology and biomarker discovery. The source code is available at https://github.com/sjunrong/DriverSub-SVM .</p>","PeriodicalId":8958,"journal":{"name":"BMC Bioinformatics","volume":"26 1","pages":"297"},"PeriodicalIF":3.3,"publicationDate":"2025-12-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12751776/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145854201","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Batch optimization for balanced binary sequences and DNA sequences. 平衡二进制序列和DNA序列的批量优化。
IF 3.3 3区 生物学 Q2 BIOCHEMICAL RESEARCH METHODS Pub Date : 2025-12-29 DOI: 10.1186/s12859-025-06336-5
Yiming Ma

Background: DNA data storage offers exceptional density and longevity, but its practicality is hampered by the high cost and low throughput of de novo DNA synthesis. A key cost driver in array-based synthesis is the length of a common supersequence required to encode a batch of DNA strands.

Objective: This study aims to address this cost bottleneck by investigating the optimal batch partitioning of DNA sequences. Our goal is to minimize the total synthesis cost, which is defined as the sum of the lengths of the shortest common supersequences (SCS) across all batches.

Results: Given a large pool [Formula: see text] of balanced binary sequences, which is partitioned into k batches with almost equal size, we define the total cost of [Formula: see text] to be the sum of lengths of the shortest common supersequence (SCS) of all sequences in each batch. The central problem is to determine the minimum total cost of [Formula: see text], denoted by [Formula: see text], among all partitions into k batches.

Conclusions: When [Formula: see text] is the set of all balanced binary sequences of length 2n, we use combinatorial methods to obtain [Formula: see text] for any positive n, and [Formula: see text] for [Formula: see text] and large n with C a constant depending on k. Similarly, we get [Formula: see text] for [Formula: see text] and large n when [Formula: see text] is the set of all balanced DNA sequences of length 2n. Previously, the probabilistic model of this problem was studied by Makarychev et al. (IEEE Trans Inf Theory 68:7454-7470, 2022), where strings are unconstrained or without consecutive identical letters.

背景:DNA数据存储具有卓越的密度和寿命,但其实用性受到高成本和低通量从头DNA合成的阻碍。在基于阵列的合成中,一个关键的成本驱动因素是编码一批DNA链所需的共同超序列的长度。目的:本研究旨在通过研究DNA序列的最佳批量分配来解决这一成本瓶颈。我们的目标是最小化总合成成本,其定义为所有批次中最短共同超序列(SCS)长度的总和。结果:给定一个大的平衡二值序列池[公式:见文],它被划分为k个几乎相等大小的批次,我们定义[公式:见文]的总代价为每批次中所有序列的最短公共超序列(SCS)的长度之和。中心问题是确定[公式:见文]的最小总成本,用[公式:见文]表示,在所有分区中分成k批。结论:当[公式:见文]是长度为2n的所有平衡二值序列的集合时,对于任意正n,我们使用组合方法得到[公式:见文],对于[公式:见文]和大n,我们使用[公式:见文],并且C是一个依赖于k的常数。同样,当[公式:见文]是长度为2n的所有平衡DNA序列的集合时,我们得到[公式:见文]和大n。此前,Makarychev等人(IEEE Trans Inf Theory 68:7454-7470, 2022)研究了该问题的概率模型,其中字符串不受约束或没有连续相同的字母。
{"title":"Batch optimization for balanced binary sequences and DNA sequences.","authors":"Yiming Ma","doi":"10.1186/s12859-025-06336-5","DOIUrl":"10.1186/s12859-025-06336-5","url":null,"abstract":"<p><strong>Background: </strong>DNA data storage offers exceptional density and longevity, but its practicality is hampered by the high cost and low throughput of de novo DNA synthesis. A key cost driver in array-based synthesis is the length of a common supersequence required to encode a batch of DNA strands.</p><p><strong>Objective: </strong>This study aims to address this cost bottleneck by investigating the optimal batch partitioning of DNA sequences. Our goal is to minimize the total synthesis cost, which is defined as the sum of the lengths of the shortest common supersequences (SCS) across all batches.</p><p><strong>Results: </strong>Given a large pool [Formula: see text] of balanced binary sequences, which is partitioned into k batches with almost equal size, we define the total cost of [Formula: see text] to be the sum of lengths of the shortest common supersequence (SCS) of all sequences in each batch. The central problem is to determine the minimum total cost of [Formula: see text], denoted by [Formula: see text], among all partitions into k batches.</p><p><strong>Conclusions: </strong>When [Formula: see text] is the set of all balanced binary sequences of length 2n, we use combinatorial methods to obtain [Formula: see text] for any positive n, and [Formula: see text] for [Formula: see text] and large n with C a constant depending on k. Similarly, we get [Formula: see text] for [Formula: see text] and large n when [Formula: see text] is the set of all balanced DNA sequences of length 2n. Previously, the probabilistic model of this problem was studied by Makarychev et al. (IEEE Trans Inf Theory 68:7454-7470, 2022), where strings are unconstrained or without consecutive identical letters.</p>","PeriodicalId":8958,"journal":{"name":"BMC Bioinformatics","volume":"26 1","pages":"300"},"PeriodicalIF":3.3,"publicationDate":"2025-12-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12750718/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145854153","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
MGANSL: multi-network representation generating with generative adversarial network for synthetic lethality prediction. MGANSL:基于生成对抗网络的多网络表示生成方法。
IF 3.3 3区 生物学 Q2 BIOCHEMICAL RESEARCH METHODS Pub Date : 2025-12-29 DOI: 10.1186/s12859-025-06345-4
Jinxin Li, Xinguo Lu, Zihao Li, Xing Liu, Hongrui Liu, Jingjing Ruan

Background: Cancer is a complex disease that arises from the simultaneous mutations of multiple biological molecules. An effective therapeutic strategy is to exploit synthetic lethality (SL) by targeting the SL partner of cancer driver genes. Computational approaches have emerged as efficient complements to traditional methods. Although some methods integrate heterogeneous sources to learn multi-network representations, they often neglect consistent information shared across different networks and specific characteristic specific to individual network. Therefore, a comprehensive representation learning framework for capturing both multi-network consistency and network-specific information of gene pair is needed.

Results: We proposed a novel approach capturing Multi-network consistent and specific representation with Generative Adversarial Network for Synthetic Lethality prediction (MGANSL). MGANSL employs network-aligned and network-specific encoding modules to cooperatively learn comprehensive multi-network representations of gene pair. In particular, network-aligned encoding module can capture cross-modal consistent information via cross-network adversarial generation, and network-specific encoding module can capture single network specific information via intra-network adversarial generation.

Conclusions: Comprehensive experiments conducted on two human synthetic lethality datasets demonstrate the superiority of proposed method in SL prediction. Moreover, the novel predicted SL associations could aid in designing anti-cancer drugs and providing potential drug targets.

背景:癌症是由多种生物分子同时突变引起的复杂疾病。一种有效的治疗策略是通过靶向肿瘤驱动基因的SL伴侣来利用合成致死性(SL)。计算方法已经成为传统方法的有效补充。尽管一些方法集成了异构源来学习多网络表示,但它们往往忽略了不同网络之间共享的一致信息和单个网络特有的特征。因此,需要一个综合的表征学习框架来捕获基因对的多网络一致性和网络特异性信息。结果:我们提出了一种基于生成对抗网络的合成致命预测(MGANSL)的多网络一致性和特异性表征的新方法。MGANSL采用网络对齐和网络特定编码模块,协同学习基因对的综合多网络表示。其中,面向网络的编码模块通过跨网络对抗生成捕获跨模态一致信息,面向网络的编码模块通过网络内对抗生成捕获单个网络特定信息。结论:在两个人类合成致死数据集上进行的综合实验证明了本文方法在SL预测中的优越性。此外,新预测的SL关联可以帮助设计抗癌药物并提供潜在的药物靶点。
{"title":"MGANSL: multi-network representation generating with generative adversarial network for synthetic lethality prediction.","authors":"Jinxin Li, Xinguo Lu, Zihao Li, Xing Liu, Hongrui Liu, Jingjing Ruan","doi":"10.1186/s12859-025-06345-4","DOIUrl":"10.1186/s12859-025-06345-4","url":null,"abstract":"<p><strong>Background: </strong>Cancer is a complex disease that arises from the simultaneous mutations of multiple biological molecules. An effective therapeutic strategy is to exploit synthetic lethality (SL) by targeting the SL partner of cancer driver genes. Computational approaches have emerged as efficient complements to traditional methods. Although some methods integrate heterogeneous sources to learn multi-network representations, they often neglect consistent information shared across different networks and specific characteristic specific to individual network. Therefore, a comprehensive representation learning framework for capturing both multi-network consistency and network-specific information of gene pair is needed.</p><p><strong>Results: </strong>We proposed a novel approach capturing Multi-network consistent and specific representation with Generative Adversarial Network for Synthetic Lethality prediction (MGANSL). MGANSL employs network-aligned and network-specific encoding modules to cooperatively learn comprehensive multi-network representations of gene pair. In particular, network-aligned encoding module can capture cross-modal consistent information via cross-network adversarial generation, and network-specific encoding module can capture single network specific information via intra-network adversarial generation.</p><p><strong>Conclusions: </strong>Comprehensive experiments conducted on two human synthetic lethality datasets demonstrate the superiority of proposed method in SL prediction. Moreover, the novel predicted SL associations could aid in designing anti-cancer drugs and providing potential drug targets.</p>","PeriodicalId":8958,"journal":{"name":"BMC Bioinformatics","volume":" ","pages":"27"},"PeriodicalIF":3.3,"publicationDate":"2025-12-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12860022/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145854190","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Contrastive learning for cell division detection and tracking in live cell imaging data. 活细胞成像数据中细胞分裂检测和跟踪的对比学习。
IF 3.3 3区 生物学 Q2 BIOCHEMICAL RESEARCH METHODS Pub Date : 2025-12-27 DOI: 10.1186/s12859-025-06344-5
Daniel Zyss, Amritansh Sharma, Susana A Ribeiro, Claire E Repellin, Oliver Lai, Mary J C Ludlam, Thomas Walter, Amin Fehri
{"title":"Contrastive learning for cell division detection and tracking in live cell imaging data.","authors":"Daniel Zyss, Amritansh Sharma, Susana A Ribeiro, Claire E Repellin, Oliver Lai, Mary J C Ludlam, Thomas Walter, Amin Fehri","doi":"10.1186/s12859-025-06344-5","DOIUrl":"10.1186/s12859-025-06344-5","url":null,"abstract":"","PeriodicalId":8958,"journal":{"name":"BMC Bioinformatics","volume":" ","pages":"30"},"PeriodicalIF":3.3,"publicationDate":"2025-12-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12859858/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145846402","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
AR-CDT NET: a deep deformable convolutional network for gut microbiome-based disease classification. ar - cdtnet:用于肠道微生物群疾病分类的深度可变形卷积网络。
IF 3.3 3区 生物学 Q2 BIOCHEMICAL RESEARCH METHODS Pub Date : 2025-12-26 DOI: 10.1186/s12859-025-06357-0
Jiaye Li, Zijian Sun, Shuo Chai, Hangming Li, Yijun Wang, Jingkui Tian

Advances in metagenomic sequencing have increasingly implicated gut microbiome dysbiosis in numerous complex diseases, yet its application for precise differential diagnosis remains a major challenge. Existing computational approaches often show limited predictive performance and insufficient robustness when applied to large-scale, imbalanced microbiome datasets, and they typically lack mechanisms to effectively capture microbial community-level or functional guild interactions. To address these limitations, we developed AR-CDT Net, a novel deep learning framework that integrates a Multi-Scale Deformable Convolution (MS-DConv) module with a Channel-wise Dynamic Tanh (CD-Tanh) activation function to achieve more accurate and robust classification of host disease states. Evaluated on a large-scale cohort comprising over 8000 samples spanning eight disease phenotypes, AR-CDT Net demonstrated highly competitive within-cohort performance, outperforming nine representative models across the majority of classification tasks. Importantly, in a stringent cross-dataset generalization test, the model was trained on the highly imbalanced primary multi-disease cohort and validated on relatively balanced independent external cohorts. It achieved a statistically significant AUC of 0.7921 on the highly heterogeneous external T2D cohort, confirming that AR-CDT captures transferable biological signals rather than dataset-specific artifacts. Furthermore, by combining dimensionality reduction with SHAP-based interpretation of our One-vs-Rest (OvR) classifiers, AR-CDT disentangles disease-specific pathogenic signatures from the shared dysbiotic background among clinically distinct yet microbially similar diseases.

宏基因组测序的进展越来越多地涉及许多复杂疾病的肠道微生物群失调,但其在精确鉴别诊断中的应用仍然是一个主要挑战。当应用于大规模、不平衡的微生物组数据集时,现有的计算方法往往表现出有限的预测性能和不足的鲁棒性,并且它们通常缺乏有效捕获微生物群落水平或功能guild相互作用的机制。为了解决这些限制,我们开发了AR-CDT Net,这是一个新的深度学习框架,它集成了多尺度可变形卷积(MS-DConv)模块和通道动态Tanh (CD-Tanh)激活函数,以实现更准确和稳健的宿主疾病状态分类。在一个包含8000多个样本、跨越8种疾病表型的大规模队列中进行评估,AR-CDT Net在队列内表现出高度的竞争力,在大多数分类任务中优于9个代表性模型。重要的是,在严格的跨数据集泛化检验中,该模型在高度不平衡的原发性多疾病队列上进行了训练,并在相对平衡的独立外部队列上进行了验证。在高度异质性的外部T2D队列中,AUC达到了统计学意义上的0.7921,证实AR-CDT捕获的是可转移的生物信号,而不是数据集特定的伪像。此外,通过结合降维和基于shap的One-vs-Rest (OvR)分类器的解释,AR-CDT从临床不同但微生物相似的疾病中共享的生态失调背景中分离出疾病特异性致病特征。
{"title":"AR-CDT NET: a deep deformable convolutional network for gut microbiome-based disease classification.","authors":"Jiaye Li, Zijian Sun, Shuo Chai, Hangming Li, Yijun Wang, Jingkui Tian","doi":"10.1186/s12859-025-06357-0","DOIUrl":"10.1186/s12859-025-06357-0","url":null,"abstract":"<p><p>Advances in metagenomic sequencing have increasingly implicated gut microbiome dysbiosis in numerous complex diseases, yet its application for precise differential diagnosis remains a major challenge. Existing computational approaches often show limited predictive performance and insufficient robustness when applied to large-scale, imbalanced microbiome datasets, and they typically lack mechanisms to effectively capture microbial community-level or functional guild interactions. To address these limitations, we developed AR-CDT Net, a novel deep learning framework that integrates a Multi-Scale Deformable Convolution (MS-DConv) module with a Channel-wise Dynamic Tanh (CD-Tanh) activation function to achieve more accurate and robust classification of host disease states. Evaluated on a large-scale cohort comprising over 8000 samples spanning eight disease phenotypes, AR-CDT Net demonstrated highly competitive within-cohort performance, outperforming nine representative models across the majority of classification tasks. Importantly, in a stringent cross-dataset generalization test, the model was trained on the highly imbalanced primary multi-disease cohort and validated on relatively balanced independent external cohorts. It achieved a statistically significant AUC of 0.7921 on the highly heterogeneous external T2D cohort, confirming that AR-CDT captures transferable biological signals rather than dataset-specific artifacts. Furthermore, by combining dimensionality reduction with SHAP-based interpretation of our One-vs-Rest (OvR) classifiers, AR-CDT disentangles disease-specific pathogenic signatures from the shared dysbiotic background among clinically distinct yet microbially similar diseases.</p>","PeriodicalId":8958,"journal":{"name":"BMC Bioinformatics","volume":" ","pages":"23"},"PeriodicalIF":3.3,"publicationDate":"2025-12-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12849458/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145843427","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Statistical modelling of an outcome variable with integrated multi-omics. 综合多组学结果变量的统计建模。
IF 3.3 3区 生物学 Q2 BIOCHEMICAL RESEARCH METHODS Pub Date : 2025-12-24 DOI: 10.1186/s12859-025-06349-0
He Li, Zander Gu, Said El Bouhaddani, Jeanine Houwing-Duistermaat

Background: In studies that aim to model the relationship between an outcome variable and multiple omics datasets, it is often desirable to reduce the dimensionality of these datasets or to represent one omics dataset in terms of another. Several approaches exist for this purpose, including univariate methods such as polygenic scores, and multivariate methods. Multivariate approaches offer advantages by producing lower-dimensional integrative scores, capturing joint structures across datasets, and filtering out dataset-specific noise. In this paper, we describe one univariate and two multivariate methods, and evaluate their performance through simulations involving two correlated multivariate normally distributed omics datasets, as well as a combination of one multivariate normal and one fixed categorical dataset.

Results: We assess method performance using the root mean squared error (RMSE) when modelling the outcome variable as a function of the reduced omics representations. Multivariate methods generally perform well, particularly when a slightly higher number of components is used for integration. They outperform the univariate method in scenarios involving two normally distributed omics datasets and perform comparably in settings with one normal and one categorical dataset. In real data applications, including two metabolomics datasets from TwinsUK and a metabolomics-genetic dataset from ORCADES, all methods show similar performance in modelling body mass index.

Conclusions: Multivariate methods provide a valuable framework for summarizing multi-omics datasets into low-dimensional components suitable for outcome modelling. Even in the presence of non-normal data, these methods offer a promising alternative to high-dimensional univariate approaches.

背景:在旨在为结果变量和多个组学数据集之间的关系建模的研究中,通常需要降低这些数据集的维数,或者用另一个组学数据集表示一个组学数据集。有几种方法可以达到这个目的,包括单变量方法,如多基因评分和多变量方法。多变量方法通过产生低维综合分数、捕获数据集之间的联合结构以及过滤数据集特定的噪声来提供优势。在本文中,我们描述了一种单变量和两种多变量方法,并通过两个相关的多变量正态分布组学数据集,以及一个多变量正态和一个固定类别数据集的组合的模拟来评估它们的性能。结果:我们使用均方根误差(RMSE)来评估方法的性能,将结果变量建模为减少组学表征的函数。多元方法通常表现良好,特别是当用于集成的组件数量略高时。它们在涉及两个正态分布组学数据集的场景中优于单变量方法,并且在一个正态和一个分类数据集的设置中表现相当。在实际数据应用中,包括来自TwinsUK的两个代谢组学数据集和来自ORCADES的代谢组学-遗传数据集,所有方法在模拟体重指数方面都显示出相似的性能。结论:多变量方法为将多组学数据集汇总为适合结果建模的低维组件提供了一个有价值的框架。即使在存在非正态数据的情况下,这些方法也为高维单变量方法提供了一个有希望的替代方法。
{"title":"Statistical modelling of an outcome variable with integrated multi-omics.","authors":"He Li, Zander Gu, Said El Bouhaddani, Jeanine Houwing-Duistermaat","doi":"10.1186/s12859-025-06349-0","DOIUrl":"10.1186/s12859-025-06349-0","url":null,"abstract":"<p><strong>Background: </strong>In studies that aim to model the relationship between an outcome variable and multiple omics datasets, it is often desirable to reduce the dimensionality of these datasets or to represent one omics dataset in terms of another. Several approaches exist for this purpose, including univariate methods such as polygenic scores, and multivariate methods. Multivariate approaches offer advantages by producing lower-dimensional integrative scores, capturing joint structures across datasets, and filtering out dataset-specific noise. In this paper, we describe one univariate and two multivariate methods, and evaluate their performance through simulations involving two correlated multivariate normally distributed omics datasets, as well as a combination of one multivariate normal and one fixed categorical dataset.</p><p><strong>Results: </strong>We assess method performance using the root mean squared error (RMSE) when modelling the outcome variable as a function of the reduced omics representations. Multivariate methods generally perform well, particularly when a slightly higher number of components is used for integration. They outperform the univariate method in scenarios involving two normally distributed omics datasets and perform comparably in settings with one normal and one categorical dataset. In real data applications, including two metabolomics datasets from TwinsUK and a metabolomics-genetic dataset from ORCADES, all methods show similar performance in modelling body mass index.</p><p><strong>Conclusions: </strong>Multivariate methods provide a valuable framework for summarizing multi-omics datasets into low-dimensional components suitable for outcome modelling. Even in the presence of non-normal data, these methods offer a promising alternative to high-dimensional univariate approaches.</p>","PeriodicalId":8958,"journal":{"name":"BMC Bioinformatics","volume":" ","pages":"26"},"PeriodicalIF":3.3,"publicationDate":"2025-12-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12859906/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145826816","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
LONMF: a non-negative matrix factorization model based on graph Laplacian and optimal transmission for paired single-cell multi-omics data integration. LONMF:一种基于图拉普拉斯和最优传输的非负矩阵分解模型,用于配对单细胞多组学数据集成。
IF 3.3 3区 生物学 Q2 BIOCHEMICAL RESEARCH METHODS Pub Date : 2025-12-23 DOI: 10.1186/s12859-025-06301-2
Mengdi Nan, Qing Ren, Yuhan Fu, Xiang Chen, Guanpeng Qi, Liugen Wang, Jie Gao

The rapid development of single-cell sequencing technologies has provided a robust technical support for the efficient resolution of multiple levels of molecular information from a single-cell population. However, the data produced by these technologies often contain a lot of noise and differences in characteristics that make it difficult to integrate and analyze single-cell multi-omics data. In this study, there is a growing demand for methods to integrate single-cell multi-omics data, which is expected to enhance the ability to reveal cellular heterogeneity and provide new biological perspectives for a deeper understanding of cellular phenotypes by jointly analyzing multi-omics data. We propose LONMF, a non-negative matrix factorization algorithm combining graph Laplacian and optimal transmission to enhance clustering performance and interpretability. We apply LONMF to visualize and cluster multi-pair single-cell multi-omics data, including 10X-multi-group, CITE-seq, and TEA-multi-group seq, to facilitate marker characterization and gene ontology enrichment analysis and to provide rich biological insights for downstream analyses. Our comprehensive benchmarking demonstrates that LONMF exhibits comparable performance compared with the current state-of-the-art in cell clustering and outperforms other methods in terms of biological interpretability.

单细胞测序技术的快速发展为单细胞群体多层次分子信息的高效解析提供了强有力的技术支持。然而,这些技术产生的数据往往包含大量的噪声和特征差异,这给单细胞多组学数据的整合和分析带来了困难。在本研究中,对整合单细胞多组学数据的方法的需求日益增长,这有望通过联合分析多组学数据来增强揭示细胞异质性的能力,并为更深入地理解细胞表型提供新的生物学视角。为了提高聚类性能和可解释性,我们提出了一种结合图拉普拉斯和最优传输的非负矩阵分解算法LONMF。我们应用LONMF对多对单细胞多组学数据进行可视化和聚类,包括10x -多组、CITE-seq和tea -多组seq,以促进标记表征和基因本体富集分析,并为下游分析提供丰富的生物学见解。我们的综合基准测试表明,与当前最先进的细胞聚类相比,LONMF表现出相当的性能,并且在生物可解释性方面优于其他方法。
{"title":"LONMF: a non-negative matrix factorization model based on graph Laplacian and optimal transmission for paired single-cell multi-omics data integration.","authors":"Mengdi Nan, Qing Ren, Yuhan Fu, Xiang Chen, Guanpeng Qi, Liugen Wang, Jie Gao","doi":"10.1186/s12859-025-06301-2","DOIUrl":"10.1186/s12859-025-06301-2","url":null,"abstract":"<p><p>The rapid development of single-cell sequencing technologies has provided a robust technical support for the efficient resolution of multiple levels of molecular information from a single-cell population. However, the data produced by these technologies often contain a lot of noise and differences in characteristics that make it difficult to integrate and analyze single-cell multi-omics data. In this study, there is a growing demand for methods to integrate single-cell multi-omics data, which is expected to enhance the ability to reveal cellular heterogeneity and provide new biological perspectives for a deeper understanding of cellular phenotypes by jointly analyzing multi-omics data. We propose LONMF, a non-negative matrix factorization algorithm combining graph Laplacian and optimal transmission to enhance clustering performance and interpretability. We apply LONMF to visualize and cluster multi-pair single-cell multi-omics data, including 10X-multi-group, CITE-seq, and TEA-multi-group seq, to facilitate marker characterization and gene ontology enrichment analysis and to provide rich biological insights for downstream analyses. Our comprehensive benchmarking demonstrates that LONMF exhibits comparable performance compared with the current state-of-the-art in cell clustering and outperforms other methods in terms of biological interpretability.</p>","PeriodicalId":8958,"journal":{"name":"BMC Bioinformatics","volume":"26 1","pages":"294"},"PeriodicalIF":3.3,"publicationDate":"2025-12-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12729160/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145817499","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
BMC Bioinformatics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1