首页 > 最新文献

Briefings in bioinformatics最新文献

英文 中文
Towards molecular structure discovery from cryo-ET density volumes via modelling auxiliary semantic prototypes. 通过建模辅助语义原型从低温et密度体积中发现分子结构。
IF 6.8 2区 生物学 Q1 BIOCHEMICAL RESEARCH METHODS Pub Date : 2024-11-22 DOI: 10.1093/bib/bbae570
Ashwin Nair, Xingjian Li, Bhupendra Solanki, Souradeep Mukhopadhyay, Ankit Jha, Mostofa Rafid Uddin, Mainak Singha, Biplab Banerjee, Min Xu

Cryo-electron tomography (cryo-ET) is confronted with the intricate task of unveiling novel structures. General class discovery (GCD) seeks to identify new classes by learning a model that can pseudo-label unannotated (novel) instances solely using supervision from labeled (base) classes. While 2D GCD for image data has made strides, its 3D counterpart remains unexplored. Traditional methods encounter challenges due to model bias and limited feature transferability when clustering unlabeled 2D images into known and potentially novel categories based on labeled data. To address this limitation and extend GCD to 3D structures, we propose an innovative approach that harnesses a pretrained 2D transformer, enriched by an effective weight inflation strategy tailored for 3D adaptation, followed by a decoupled prototypical network. Incorporating the power of pretrained weight-inflated Transformers, we further integrate CLIP, a vision-language model to incorporate textual information. Our method synergizes a graph convolutional network with CLIP's frozen text encoder, preserving class neighborhood structure. In order to effectively represent unlabeled samples, we devise semantic distance distributions, by formulating a bipartite matching problem for category prototypes using a decoupled prototypical network. Empirical results unequivocally highlight our method's potential in unveiling hitherto unknown structures in cryo-ET. By bridging the gap between 2D GCD and the distinctive challenges of 3D cryo-ET data, our approach paves novel avenues for exploration and discovery in this domain.

低温电子断层扫描(cryo-ET)面临着揭示新结构的复杂任务。通用类发现(GCD)试图通过学习一个模型来识别新类,该模型可以仅使用标记(基)类的监督对未注释(新颖)实例进行伪标记。虽然用于图像数据的2D GCD已经取得了长足的进步,但其3D版本仍未被探索。传统的方法在将未标记的二维图像聚类到已知和潜在的新类别时,由于模型偏差和有限的特征可转移性而面临挑战。为了解决这一限制并将GCD扩展到3D结构,我们提出了一种创新的方法,利用预训练的2D变压器,通过为3D适应量身定制的有效权重膨胀策略进行充实,然后是解耦原型网络。结合预训练的加权膨胀变形金刚的力量,我们进一步整合视觉语言模型CLIP来整合文本信息。我们的方法将图形卷积网络与CLIP的冻结文本编码器协同,保留了类邻域结构。为了有效地表示未标记的样本,我们设计了语义距离分布,通过使用解耦原型网络为类别原型制定了一个二部匹配问题。实证结果毫不含糊地强调了我们的方法在揭示迄今未知的结构在冷冻et的潜力。通过弥合2D GCD和3D cryo-ET数据的独特挑战之间的差距,我们的方法为该领域的探索和发现铺平了新的道路。
{"title":"Towards molecular structure discovery from cryo-ET density volumes via modelling auxiliary semantic prototypes.","authors":"Ashwin Nair, Xingjian Li, Bhupendra Solanki, Souradeep Mukhopadhyay, Ankit Jha, Mostofa Rafid Uddin, Mainak Singha, Biplab Banerjee, Min Xu","doi":"10.1093/bib/bbae570","DOIUrl":"10.1093/bib/bbae570","url":null,"abstract":"<p><p>Cryo-electron tomography (cryo-ET) is confronted with the intricate task of unveiling novel structures. General class discovery (GCD) seeks to identify new classes by learning a model that can pseudo-label unannotated (novel) instances solely using supervision from labeled (base) classes. While 2D GCD for image data has made strides, its 3D counterpart remains unexplored. Traditional methods encounter challenges due to model bias and limited feature transferability when clustering unlabeled 2D images into known and potentially novel categories based on labeled data. To address this limitation and extend GCD to 3D structures, we propose an innovative approach that harnesses a pretrained 2D transformer, enriched by an effective weight inflation strategy tailored for 3D adaptation, followed by a decoupled prototypical network. Incorporating the power of pretrained weight-inflated Transformers, we further integrate CLIP, a vision-language model to incorporate textual information. Our method synergizes a graph convolutional network with CLIP's frozen text encoder, preserving class neighborhood structure. In order to effectively represent unlabeled samples, we devise semantic distance distributions, by formulating a bipartite matching problem for category prototypes using a decoupled prototypical network. Empirical results unequivocally highlight our method's potential in unveiling hitherto unknown structures in cryo-ET. By bridging the gap between 2D GCD and the distinctive challenges of 3D cryo-ET data, our approach paves novel avenues for exploration and discovery in this domain.</p>","PeriodicalId":9209,"journal":{"name":"Briefings in bioinformatics","volume":"26 1","pages":""},"PeriodicalIF":6.8,"publicationDate":"2024-11-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11790060/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143000412","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
STCGAN: a novel cycle-consistent generative adversarial network for spatial transcriptomics cellular deconvolution. STCGAN:一个新的周期一致的生成对抗网络的空间转录组细胞反褶积。
IF 6.8 2区 生物学 Q1 BIOCHEMICAL RESEARCH METHODS Pub Date : 2024-11-22 DOI: 10.1093/bib/bbae670
Bo Wang, Yahui Long, Yuting Bai, Jiawei Luo, Chee Keong Kwoh

Motivation: Spatial transcriptomics (ST) technologies have revolutionized our ability to map gene expression patterns within native tissue context, providing unprecedented insights into tissue architecture and cellular heterogeneity. However, accurately deconvolving cell-type compositions from ST spots remains challenging due to the sparse and averaged nature of ST data, which is essential for accurately depicting tissue architecture. While numerous computational methods have been developed for cell-type deconvolution and spatial distribution reconstruction, most fail to capture tissue complexity at the single-cell level, thereby limiting their applicability in practical scenarios.

Results: To this end, we propose a novel cycle-consistent generative adversarial network named STCGAN for cellular deconvolution in spatial transcriptomic. STCGAN first employs a cycle-consistent generative adversarial network (CGAN) to pre-train on ST data, ensuring that both the mapping from ST data to latent space and its reverse mapping are consistent, capturing complex spatial gene expression patterns and learning robust latent representations. Based on the learned representation, STCGAN then optimizes a trainable cell-to-spot mapping matrix to integrate scRNA-seq data with ST data, accurately estimating cellular composition within each capture spot and effectively reconstructing the spatial distribution of cells across the tissue. To further enhance deconvolution accuracy, we incorporate spatial-aware regularization that ensures accurate cellular distribution reconstruction within the spatial context. Benchmarking against seven state-of-the-art methods on five simulated and real datasets from various tissues, STCGAN consistently delivers superior cell-type deconvolution performance.

Availability: The code of STCGAN can be downloaded from https://github.com/cs-wangbo/STCGAN and all the mentioned datasets are available on Zenodo at https://zenodo.org/doi/10.5281/zenodo.10799113.

动机:空间转录组学(ST)技术彻底改变了我们在原生组织环境中绘制基因表达模式的能力,为组织结构和细胞异质性提供了前所未有的见解。然而,由于ST数据的稀疏和平均性质,从ST点准确地反卷积细胞类型组成仍然具有挑战性,这对于准确描绘组织结构至关重要。虽然已经开发了许多用于细胞型反褶积和空间分布重建的计算方法,但大多数计算方法无法在单细胞水平上捕获组织复杂性,从而限制了它们在实际场景中的适用性。结果:为此,我们提出了一种新的周期一致的生成对抗网络,称为STCGAN,用于空间转录组学的细胞反卷积。STCGAN首先采用循环一致生成对抗网络(CGAN)对ST数据进行预训练,确保ST数据到潜在空间的映射及其反向映射是一致的,捕获复杂的空间基因表达模式并学习鲁棒潜在表征。基于学习到的表示,STCGAN优化了一个可训练的细胞-点映射矩阵,将scRNA-seq数据与ST数据整合,准确估计每个捕获点内的细胞组成,并有效地重建细胞在组织中的空间分布。为了进一步提高反卷积精度,我们结合了空间感知正则化,以确保在空间背景下精确的细胞分布重建。通过对来自各种组织的五个模拟和真实数据集的七种最先进方法进行基准测试,STCGAN始终提供卓越的细胞型反褶积性能。可用性:STCGAN的代码可以从https://github.com/cs-wangbo/STCGAN下载,所有提到的数据集都可以在Zenodo上获得https://zenodo.org/doi/10.5281/zenodo.10799113。
{"title":"STCGAN: a novel cycle-consistent generative adversarial network for spatial transcriptomics cellular deconvolution.","authors":"Bo Wang, Yahui Long, Yuting Bai, Jiawei Luo, Chee Keong Kwoh","doi":"10.1093/bib/bbae670","DOIUrl":"10.1093/bib/bbae670","url":null,"abstract":"<p><strong>Motivation: </strong>Spatial transcriptomics (ST) technologies have revolutionized our ability to map gene expression patterns within native tissue context, providing unprecedented insights into tissue architecture and cellular heterogeneity. However, accurately deconvolving cell-type compositions from ST spots remains challenging due to the sparse and averaged nature of ST data, which is essential for accurately depicting tissue architecture. While numerous computational methods have been developed for cell-type deconvolution and spatial distribution reconstruction, most fail to capture tissue complexity at the single-cell level, thereby limiting their applicability in practical scenarios.</p><p><strong>Results: </strong>To this end, we propose a novel cycle-consistent generative adversarial network named STCGAN for cellular deconvolution in spatial transcriptomic. STCGAN first employs a cycle-consistent generative adversarial network (CGAN) to pre-train on ST data, ensuring that both the mapping from ST data to latent space and its reverse mapping are consistent, capturing complex spatial gene expression patterns and learning robust latent representations. Based on the learned representation, STCGAN then optimizes a trainable cell-to-spot mapping matrix to integrate scRNA-seq data with ST data, accurately estimating cellular composition within each capture spot and effectively reconstructing the spatial distribution of cells across the tissue. To further enhance deconvolution accuracy, we incorporate spatial-aware regularization that ensures accurate cellular distribution reconstruction within the spatial context. Benchmarking against seven state-of-the-art methods on five simulated and real datasets from various tissues, STCGAN consistently delivers superior cell-type deconvolution performance.</p><p><strong>Availability: </strong>The code of STCGAN can be downloaded from https://github.com/cs-wangbo/STCGAN and all the mentioned datasets are available on Zenodo at https://zenodo.org/doi/10.5281/zenodo.10799113.</p>","PeriodicalId":9209,"journal":{"name":"Briefings in bioinformatics","volume":"26 1","pages":""},"PeriodicalIF":6.8,"publicationDate":"2024-11-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11666287/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142880673","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Steering veridical large language model analyses by correcting and enriching generated database queries: first steps toward ChatGPT bioinformatics.
IF 6.8 2区 生物学 Q1 BIOCHEMICAL RESEARCH METHODS Pub Date : 2024-11-22 DOI: 10.1093/bib/bbaf045
Olivier Cinquin

Large language models (LLMs) leverage factual knowledge from pretraining. Yet this knowledge remains incomplete and sometimes challenging to retrieve-especially in scientific domains not extensively covered in pretraining datasets and where information is still evolving. Here, we focus on genomics and bioinformatics. We confirm and expand upon issues with plain ChatGPT functioning as a bioinformatics assistant. Poor data retrieval and hallucination lead ChatGPT to err, as do incorrect sequence manipulations. To address this, we propose a system basing LLM outputs on up-to-date, authoritative facts and facilitating LLM-guided data analysis. Specifically, we introduce NagGPT, a middleware tool to insert between LLMs and databases, designed to bridge gaps in LLM knowledge and usage of database application programming interfaces. NagGPT proxies LLM-generated database queries, with special handling of incorrect queries. It acts as a gatekeeper between query responses and the LLM prompt, redirecting large responses to files but providing a synthesized snippet and injecting comments to steer the LLM. A companion OpenAI custom GPT, Genomics Fetcher-Analyzer, connects ChatGPT with NagGPT. It steers ChatGPT to generate and run Python code, performing bioinformatics tasks on data dynamically retrieved from a dozen common genomics databases (e.g. NCBI, Ensembl, UniProt, WormBase, and FlyBase). We implement partial mitigations for encountered challenges: detrimental interactions between code generation style and data analysis, confusion between database identifiers, and hallucination of both data and actions taken. Our results identify avenues to augment ChatGPT as a bioinformatics assistant and, more broadly, to improve factual accuracy and instruction following of unmodified LLMs.

{"title":"Steering veridical large language model analyses by correcting and enriching generated database queries: first steps toward ChatGPT bioinformatics.","authors":"Olivier Cinquin","doi":"10.1093/bib/bbaf045","DOIUrl":"10.1093/bib/bbaf045","url":null,"abstract":"<p><p>Large language models (LLMs) leverage factual knowledge from pretraining. Yet this knowledge remains incomplete and sometimes challenging to retrieve-especially in scientific domains not extensively covered in pretraining datasets and where information is still evolving. Here, we focus on genomics and bioinformatics. We confirm and expand upon issues with plain ChatGPT functioning as a bioinformatics assistant. Poor data retrieval and hallucination lead ChatGPT to err, as do incorrect sequence manipulations. To address this, we propose a system basing LLM outputs on up-to-date, authoritative facts and facilitating LLM-guided data analysis. Specifically, we introduce NagGPT, a middleware tool to insert between LLMs and databases, designed to bridge gaps in LLM knowledge and usage of database application programming interfaces. NagGPT proxies LLM-generated database queries, with special handling of incorrect queries. It acts as a gatekeeper between query responses and the LLM prompt, redirecting large responses to files but providing a synthesized snippet and injecting comments to steer the LLM. A companion OpenAI custom GPT, Genomics Fetcher-Analyzer, connects ChatGPT with NagGPT. It steers ChatGPT to generate and run Python code, performing bioinformatics tasks on data dynamically retrieved from a dozen common genomics databases (e.g. NCBI, Ensembl, UniProt, WormBase, and FlyBase). We implement partial mitigations for encountered challenges: detrimental interactions between code generation style and data analysis, confusion between database identifiers, and hallucination of both data and actions taken. Our results identify avenues to augment ChatGPT as a bioinformatics assistant and, more broadly, to improve factual accuracy and instruction following of unmodified LLMs.</p>","PeriodicalId":9209,"journal":{"name":"Briefings in bioinformatics","volume":"26 1","pages":""},"PeriodicalIF":6.8,"publicationDate":"2024-11-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11798674/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143254680","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Inferring tumor purity using multi-omics data based on a uniform machine learning framework MoTP.
IF 6.8 2区 生物学 Q1 BIOCHEMICAL RESEARCH METHODS Pub Date : 2024-11-22 DOI: 10.1093/bib/bbaf056
Qiqi Lu, Zhixian Liu, Xiaosheng Wang

Existing algorithms for assessing tumor purity are limited to a single omics data, such as gene expression, somatic copy number variations, somatic mutations, and DNA methylation. Here we proposed the machine learning Multi-omics Tumor Purity prediction (MoTP) algorithm to estimate tumor purity based on multiple types of omics data. MoTP utilizes the Bayesian Regularized Neural Networks as the prediction algorithm, and Consensus Tumor Purity Estimates as labels. We trained MoTP using multi-omics data (mRNA, microRNA, long non-coding RNA, and DNA methylation) across 21 TCGA solid cancer types. By testing MoTP in TCGA validation sets, TCGA test sets, and eight datasets outside the TCGA cancer cohorts, we showed that although MoTP could achieve excellent performance in predicting tumor purity based on a single omics data type, the integration of multiple single omics data-based predictions can enhance the prediction performance. Moreover, we demonstrated the robustness of MoTP by testing it in datasets with Gaussian noise and feature missing. Benchmark analysis showed that MoTP outperformed most established tumor purity prediction algorithms, and that it required less running time and computational resource to fulfill the predictive task. Thus, MoTP would be an attractive option for computational tumor purity inference.

现有的肿瘤纯度评估算法仅限于单一的组学数据,如基因表达、体细胞拷贝数变异、体细胞突变和DNA甲基化。在这里,我们提出了机器学习多组学肿瘤纯度预测(MoTP)算法,以基于多种组学数据来估计肿瘤纯度。MoTP采用贝叶斯正则化神经网络作为预测算法,以共识肿瘤纯度估计值作为标签。我们使用 21 种 TCGA 实体癌类型的多组学数据(mRNA、microRNA、长非编码 RNA 和 DNA 甲基化)对 MoTP 进行了训练。通过在TCGA验证集、TCGA测试集和TCGA癌症队列之外的8个数据集中测试MoTP,我们发现尽管MoTP在基于单个组学数据类型预测肿瘤纯度方面可以取得优异的性能,但整合多个基于单个组学数据的预测可以提高预测性能。此外,我们还在具有高斯噪声和特征缺失的数据集上测试了MoTP的鲁棒性。基准分析表明,MoTP的性能优于大多数成熟的肿瘤纯度预测算法,而且它完成预测任务所需的运行时间和计算资源更少。因此,MoTP将是计算肿瘤纯度推断的一个有吸引力的选择。
{"title":"Inferring tumor purity using multi-omics data based on a uniform machine learning framework MoTP.","authors":"Qiqi Lu, Zhixian Liu, Xiaosheng Wang","doi":"10.1093/bib/bbaf056","DOIUrl":"10.1093/bib/bbaf056","url":null,"abstract":"<p><p>Existing algorithms for assessing tumor purity are limited to a single omics data, such as gene expression, somatic copy number variations, somatic mutations, and DNA methylation. Here we proposed the machine learning Multi-omics Tumor Purity prediction (MoTP) algorithm to estimate tumor purity based on multiple types of omics data. MoTP utilizes the Bayesian Regularized Neural Networks as the prediction algorithm, and Consensus Tumor Purity Estimates as labels. We trained MoTP using multi-omics data (mRNA, microRNA, long non-coding RNA, and DNA methylation) across 21 TCGA solid cancer types. By testing MoTP in TCGA validation sets, TCGA test sets, and eight datasets outside the TCGA cancer cohorts, we showed that although MoTP could achieve excellent performance in predicting tumor purity based on a single omics data type, the integration of multiple single omics data-based predictions can enhance the prediction performance. Moreover, we demonstrated the robustness of MoTP by testing it in datasets with Gaussian noise and feature missing. Benchmark analysis showed that MoTP outperformed most established tumor purity prediction algorithms, and that it required less running time and computational resource to fulfill the predictive task. Thus, MoTP would be an attractive option for computational tumor purity inference.</p>","PeriodicalId":9209,"journal":{"name":"Briefings in bioinformatics","volume":"26 1","pages":""},"PeriodicalIF":6.8,"publicationDate":"2024-11-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11826339/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143413486","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A large language model framework for literature-based disease-gene association prediction.
IF 6.8 2区 生物学 Q1 BIOCHEMICAL RESEARCH METHODS Pub Date : 2024-11-22 DOI: 10.1093/bib/bbaf070
Peng-Hsuan Li, Yih-Yun Sun, Hsueh-Fen Juan, Chien-Yu Chen, Huai-Kuang Tsai, Jia-Hsin Huang

With the exponential growth of biomedical literature, leveraging Large Language Models (LLMs) for automated medical knowledge understanding has become increasingly critical for advancing precision medicine. However, current approaches face significant challenges in reliability, verifiability, and scalability when extracting complex biological relationships from scientific literature using LLMs. To overcome the obstacles of LLM development in biomedical literature understating, we propose LORE, a novel unsupervised two-stage reading methodology with LLM that models literature as a knowledge graph of verifiable factual statements and, in turn, as semantic embeddings in Euclidean space. LORE captured essential gene pathogenicity information when applied to PubMed abstracts for large-scale understanding of disease-gene relationships. We demonstrated that modeling a latent pathogenic flow in the semantic embedding with supervision from the ClinVar database led to a 90% mean average precision in identifying relevant genes across 2097 diseases. This work provides a scalable and reproducible approach for leveraging LLMs in biomedical literature analysis, offering new opportunities for researchers to identify therapeutic targets efficiently.

{"title":"A large language model framework for literature-based disease-gene association prediction.","authors":"Peng-Hsuan Li, Yih-Yun Sun, Hsueh-Fen Juan, Chien-Yu Chen, Huai-Kuang Tsai, Jia-Hsin Huang","doi":"10.1093/bib/bbaf070","DOIUrl":"https://doi.org/10.1093/bib/bbaf070","url":null,"abstract":"<p><p>With the exponential growth of biomedical literature, leveraging Large Language Models (LLMs) for automated medical knowledge understanding has become increasingly critical for advancing precision medicine. However, current approaches face significant challenges in reliability, verifiability, and scalability when extracting complex biological relationships from scientific literature using LLMs. To overcome the obstacles of LLM development in biomedical literature understating, we propose LORE, a novel unsupervised two-stage reading methodology with LLM that models literature as a knowledge graph of verifiable factual statements and, in turn, as semantic embeddings in Euclidean space. LORE captured essential gene pathogenicity information when applied to PubMed abstracts for large-scale understanding of disease-gene relationships. We demonstrated that modeling a latent pathogenic flow in the semantic embedding with supervision from the ClinVar database led to a 90% mean average precision in identifying relevant genes across 2097 diseases. This work provides a scalable and reproducible approach for leveraging LLMs in biomedical literature analysis, offering new opportunities for researchers to identify therapeutic targets efficiently.</p>","PeriodicalId":9209,"journal":{"name":"Briefings in bioinformatics","volume":"26 1","pages":""},"PeriodicalIF":6.8,"publicationDate":"2024-11-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143490678","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A versatile pipeline to identify convergently lost ancestral conserved fragments associated with convergent evolution of vocal learning. 一种用于识别与发声学习趋同进化相关的祖先保守片段的多功能管道。
IF 6.8 2区 生物学 Q1 BIOCHEMICAL RESEARCH METHODS Pub Date : 2024-11-22 DOI: 10.1093/bib/bbae614
Xiaoyi Li, Kangli Zhu, Ying Zhen

Molecular convergence in convergently evolved lineages provides valuable insights into the shared genetic basis of converged phenotypes. However, most methods are limited to coding regions, overlooking the potential contribution of regulatory regions. We focused on the independently evolved vocal learning ability in multiple avian lineages, and developed a whole-genome-alignment-free approach to identify genome-wide Convergently Lost Ancestral Conserved fragments (CLACs) in these lineages, encompassing noncoding regions. We discovered 2711 CLACs that are overrepresented in noncoding regions. Proximal genes of these CLACs exhibit significant enrichment in neurological pathways, including glutamate receptor signaling pathway and axon guidance pathway. Moreover, their expression is highly enriched in brain tissues associated with speech formation. Notably, several have known functions in speech and language learning, including ROBO family, SLIT2, GRIN1, and GRIN2B. Additionally, we found significantly enriched motifs in noncoding CLACs, which match binding motifs of transcriptional factors involved in neurogenesis and gene expression regulation in brain. Furthermore, we discovered 19 candidate genes that harbor CLACs in both human and multiple avian vocal learning lineages, suggesting their potential contribution to the independent evolution of vocal learning in both birds and humans.

趋同进化种系的分子趋同为了解趋同表型的共同遗传基础提供了宝贵的见解。然而,大多数方法仅限于编码区,忽略了调控区的潜在贡献。我们重点研究了多个鸟类品系中独立进化的发声学习能力,并开发了一种无全基因组比对的方法来鉴定这些品系中的全基因组会聚丢失的祖先保守片段(CLACs),其中包括非编码区。我们发现了 2711 个在非编码区比例过高的 CLACs。这些CLACs的近端基因在神经通路中表现出显著的富集,包括谷氨酸受体信号通路和轴突导向通路。此外,它们在与语言形成相关的脑组织中的表达也高度富集。值得注意的是,其中一些基因在语音和语言学习中具有已知的功能,包括 ROBO 家族、SLIT2、GRIN1 和 GRIN2B。此外,我们还在非编码 CLAC 中发现了与参与脑部神经发生和基因表达调控的转录因子结合基序相匹配的显著富集基序。此外,我们还在人类和多种鸟类发声学习谱系中发现了 19 个携带 CLACs 的候选基因,这表明它们可能对鸟类和人类发声学习的独立进化做出了贡献。
{"title":"A versatile pipeline to identify convergently lost ancestral conserved fragments associated with convergent evolution of vocal learning.","authors":"Xiaoyi Li, Kangli Zhu, Ying Zhen","doi":"10.1093/bib/bbae614","DOIUrl":"10.1093/bib/bbae614","url":null,"abstract":"<p><p>Molecular convergence in convergently evolved lineages provides valuable insights into the shared genetic basis of converged phenotypes. However, most methods are limited to coding regions, overlooking the potential contribution of regulatory regions. We focused on the independently evolved vocal learning ability in multiple avian lineages, and developed a whole-genome-alignment-free approach to identify genome-wide Convergently Lost Ancestral Conserved fragments (CLACs) in these lineages, encompassing noncoding regions. We discovered 2711 CLACs that are overrepresented in noncoding regions. Proximal genes of these CLACs exhibit significant enrichment in neurological pathways, including glutamate receptor signaling pathway and axon guidance pathway. Moreover, their expression is highly enriched in brain tissues associated with speech formation. Notably, several have known functions in speech and language learning, including ROBO family, SLIT2, GRIN1, and GRIN2B. Additionally, we found significantly enriched motifs in noncoding CLACs, which match binding motifs of transcriptional factors involved in neurogenesis and gene expression regulation in brain. Furthermore, we discovered 19 candidate genes that harbor CLACs in both human and multiple avian vocal learning lineages, suggesting their potential contribution to the independent evolution of vocal learning in both birds and humans.</p>","PeriodicalId":9209,"journal":{"name":"Briefings in bioinformatics","volume":"26 1","pages":""},"PeriodicalIF":6.8,"publicationDate":"2024-11-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11586126/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142709168","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Cell-specific priors rescue differential gene expression in spatial spot-based technologies. 细胞特异性先验拯救基于空间点的技术中的差异基因表达。
IF 6.8 2区 生物学 Q1 BIOCHEMICAL RESEARCH METHODS Pub Date : 2024-11-22 DOI: 10.1093/bib/bbae621
Ornit Nahman, Timothy J Few-Cooper, Shai S Shen-Orr

Spatial transcriptomics (ST), a breakthrough technology, captures the complex structure and state of tissues through the spatial profiling of gene expression. A variety of ST technologies have now emerged, most prominently spot-based platforms such as Visium. Despite the widespread use of ST and its distinct data characteristics, the vast majority of studies continue to analyze ST data using algorithms originally designed for older technologies such as single-cell (SC) and bulk RNA-seq-particularly when identifying differentially expressed genes (DEGs). However, it remains unclear whether these algorithms are still valid or appropriate for ST data. Therefore, here, we sought to characterize the performance of these methods by constructing an in silico simulator of ST data with a controllable and known DEG ground truth. Surprisingly, our findings reveal little variation in the performance of classic DEG algorithms-all of which fail to accurately recapture known DEGs to significant levels. We further demonstrate that cellular heterogeneity within spots is a primary cause of this poor performance and propose a simple gene-selection scheme, based on prior knowledge of cell-type specificity, to overcome this. Notably, our approach outperforms existing data-driven methods designed specifically for ST data and offers improved DEG recovery and reliability rates. In summary, our work details a conceptual framework that can be used upstream, agnostically, of any DEG algorithm to improve the accuracy of ST analysis and any downstream findings.

空间转录组学(ST)是一项突破性技术,它通过对基因表达进行空间剖析来捕捉组织的复杂结构和状态。目前已经出现了多种空间转录组学技术,其中最突出的是基于斑点的平台,如 Visium。尽管 ST 技术得到了广泛应用,而且其数据特征明显,但绝大多数研究仍在使用最初为单细胞(SC)和批量 RNA-seq 等旧技术设计的算法分析 ST 数据,尤其是在识别差异表达基因(DEG)时。然而,这些算法是否仍然有效或适用于 ST 数据仍不清楚。因此,在这里,我们试图通过构建一个具有可控已知 DEG 基本真相的 ST 数据硅学模拟器来鉴定这些方法的性能。令人惊讶的是,我们的研究结果表明,经典 DEG 算法的性能差异很小--所有这些算法都无法准确地再现已知 DEGs 的显著水平。我们进一步证明,斑点内的细胞异质性是导致这种性能低下的主要原因,并根据细胞类型特异性的先验知识提出了一种简单的基因选择方案来克服这一问题。值得注意的是,我们的方法优于专为 ST 数据设计的现有数据驱动方法,并提高了 DEG 的恢复率和可靠性。总之,我们的工作详细介绍了一个概念框架,它可以在任何 DEG 算法的上游、不可知论中使用,以提高 ST 分析和任何下游研究结果的准确性。
{"title":"Cell-specific priors rescue differential gene expression in spatial spot-based technologies.","authors":"Ornit Nahman, Timothy J Few-Cooper, Shai S Shen-Orr","doi":"10.1093/bib/bbae621","DOIUrl":"10.1093/bib/bbae621","url":null,"abstract":"<p><p>Spatial transcriptomics (ST), a breakthrough technology, captures the complex structure and state of tissues through the spatial profiling of gene expression. A variety of ST technologies have now emerged, most prominently spot-based platforms such as Visium. Despite the widespread use of ST and its distinct data characteristics, the vast majority of studies continue to analyze ST data using algorithms originally designed for older technologies such as single-cell (SC) and bulk RNA-seq-particularly when identifying differentially expressed genes (DEGs). However, it remains unclear whether these algorithms are still valid or appropriate for ST data. Therefore, here, we sought to characterize the performance of these methods by constructing an in silico simulator of ST data with a controllable and known DEG ground truth. Surprisingly, our findings reveal little variation in the performance of classic DEG algorithms-all of which fail to accurately recapture known DEGs to significant levels. We further demonstrate that cellular heterogeneity within spots is a primary cause of this poor performance and propose a simple gene-selection scheme, based on prior knowledge of cell-type specificity, to overcome this. Notably, our approach outperforms existing data-driven methods designed specifically for ST data and offers improved DEG recovery and reliability rates. In summary, our work details a conceptual framework that can be used upstream, agnostically, of any DEG algorithm to improve the accuracy of ST analysis and any downstream findings.</p>","PeriodicalId":9209,"journal":{"name":"Briefings in bioinformatics","volume":"26 1","pages":""},"PeriodicalIF":6.8,"publicationDate":"2024-11-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11647270/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142827377","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
DeepHapNet: a haplotype assembly method based on RetNet and deep spectral clustering. DeepHapNet:基于RetNet和深度谱聚类的单倍型组装方法。
IF 6.8 2区 生物学 Q1 BIOCHEMICAL RESEARCH METHODS Pub Date : 2024-11-22 DOI: 10.1093/bib/bbae656
Junwei Luo, Jiaojiao Wang, Jingjing Wei, Chaokun Yan, Huimin Luo

Gene polymorphism originates from single-nucleotide polymorphisms (SNPs), and the analysis and study of SNPs are of great significance in the field of biogenetics. The haplotype, which consists of the sequence of SNP loci, carries more genetic information than a single SNP. Haplotype assembly plays a significant role in understanding gene function, diagnosing complex diseases, and pinpointing species genes. We propose a novel method, DeepHapNet, for haplotype assembly through the clustering of reads and learning correlations between read pairs. We employ a sequence model called Retentive Network (RetNet), which utilizes a multiscale retention mechanism to extract read features and learn the global relationships among them. Based on the feature representation of reads learned from the RetNet model, the clustering process of reads is implemented using the SpectralNet model, and, finally, haplotypes are constructed based on the read clusters. Experiments with simulated and real datasets show that the method performs well in the haplotype assembly problem of diploid and polyploid based on either long or short reads. The code implementation of DeepHapNet and the processing scripts for experimental data are publicly available at https://github.com/wjj6666/DeepHapNet.

基因多态性起源于单核苷酸多态性(single-nucleotide polymorphisms, SNPs),对其进行分析和研究在生物遗传学领域具有重要意义。单倍型由SNP位点序列组成,比单个SNP携带更多的遗传信息。单倍型组装在了解基因功能、诊断复杂疾病和精确定位物种基因方面发挥着重要作用。我们提出了一种新的方法,DeepHapNet,通过聚类读取和学习读取对之间的相关性来进行单倍型组装。我们采用了一种称为RetNet的序列模型,该模型利用多尺度保留机制提取读特征并学习它们之间的全局关系。在RetNet模型学习到的reads特征表示的基础上,利用SpectralNet模型实现reads聚类过程,最后基于读聚类构建单倍型。实验结果表明,该方法在二倍体和多倍体的长、短序列单倍型装配问题上都有较好的效果。DeepHapNet的代码实现和实验数据的处理脚本可在https://github.com/wjj6666/DeepHapNet上公开获取。
{"title":"DeepHapNet: a haplotype assembly method based on RetNet and deep spectral clustering.","authors":"Junwei Luo, Jiaojiao Wang, Jingjing Wei, Chaokun Yan, Huimin Luo","doi":"10.1093/bib/bbae656","DOIUrl":"10.1093/bib/bbae656","url":null,"abstract":"<p><p>Gene polymorphism originates from single-nucleotide polymorphisms (SNPs), and the analysis and study of SNPs are of great significance in the field of biogenetics. The haplotype, which consists of the sequence of SNP loci, carries more genetic information than a single SNP. Haplotype assembly plays a significant role in understanding gene function, diagnosing complex diseases, and pinpointing species genes. We propose a novel method, DeepHapNet, for haplotype assembly through the clustering of reads and learning correlations between read pairs. We employ a sequence model called Retentive Network (RetNet), which utilizes a multiscale retention mechanism to extract read features and learn the global relationships among them. Based on the feature representation of reads learned from the RetNet model, the clustering process of reads is implemented using the SpectralNet model, and, finally, haplotypes are constructed based on the read clusters. Experiments with simulated and real datasets show that the method performs well in the haplotype assembly problem of diploid and polyploid based on either long or short reads. The code implementation of DeepHapNet and the processing scripts for experimental data are publicly available at https://github.com/wjj6666/DeepHapNet.</p>","PeriodicalId":9209,"journal":{"name":"Briefings in bioinformatics","volume":"26 1","pages":""},"PeriodicalIF":6.8,"publicationDate":"2024-11-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11652615/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142845785","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Diffusion model assisted designing self-assembling collagen mimetic peptides as biocompatible materials. 扩散模型辅助设计自组装的模拟胶原肽作为生物相容性材料。
IF 6.8 2区 生物学 Q1 BIOCHEMICAL RESEARCH METHODS Pub Date : 2024-11-22 DOI: 10.1093/bib/bbae622
Xinglong Wang, Kangjie Xu, Lingling Ma, Ruoxi Sun, Kun Wang, Ruiyan Wang, Junli Zhang, Wenwen Tao, Kai Linghu, Shuyao Yu, Jingwen Zhou

Collagen self-assembly supports its mechanical function, but controlling collagen mimetic peptides (CMPs) to self-assemble into higher-order oligomers with numerous functions remains challenging due to the vast potential amino acid sequence space. Herein, we developed a diffusion model to learn features from different types of human collagens and generate CMPs; obtaining 66% of synthetic CMPs could self-assemble into triple helices. Triple-helical and untwisting states were probed by melting temperature (Tm); hence, we developed a model to predict collagen Tm, achieving a state-of-art Pearson's correlation (PC) of 0.95 by cross-validation and a PC of 0.8 for predicting Tm values of synthetic CMPs. Our chemically synthesized short CMPs and recombinantly expressed long CMPs could self-assemble, with the lowest requirement for hydrogel formation at a concentration of 0.08% (w/v). Five CMPs could promote osteoblast differentiation. Our results demonstrated the potential for using computer-aided methods to design functional self-assembling CMPs.

胶原自组装支持其机械功能,但由于潜在的氨基酸序列空间巨大,控制胶原模拟肽(CMPs)自组装成具有多种功能的高阶低聚物仍然具有挑战性。在此,我们开发了一个扩散模型来学习不同类型的人胶原的特征并生成cmp;66%的合成cmp可以自组装成三螺旋结构。用熔融温度(Tm)探测三螺旋态和解扭态;因此,我们开发了一个预测胶原蛋白Tm的模型,通过交叉验证实现了最先进的皮尔逊相关性(PC)为0.95,PC为0.8,用于预测合成cmp的Tm值。我们化学合成的短CMPs和重组表达的长CMPs都可以自组装,在0.08% (w/v)的浓度下形成水凝胶的要求最低。5种cmp均能促进成骨细胞分化。我们的结果证明了使用计算机辅助方法设计功能自组装cmp的潜力。
{"title":"Diffusion model assisted designing self-assembling collagen mimetic peptides as biocompatible materials.","authors":"Xinglong Wang, Kangjie Xu, Lingling Ma, Ruoxi Sun, Kun Wang, Ruiyan Wang, Junli Zhang, Wenwen Tao, Kai Linghu, Shuyao Yu, Jingwen Zhou","doi":"10.1093/bib/bbae622","DOIUrl":"10.1093/bib/bbae622","url":null,"abstract":"<p><p>Collagen self-assembly supports its mechanical function, but controlling collagen mimetic peptides (CMPs) to self-assemble into higher-order oligomers with numerous functions remains challenging due to the vast potential amino acid sequence space. Herein, we developed a diffusion model to learn features from different types of human collagens and generate CMPs; obtaining 66% of synthetic CMPs could self-assemble into triple helices. Triple-helical and untwisting states were probed by melting temperature (Tm); hence, we developed a model to predict collagen Tm, achieving a state-of-art Pearson's correlation (PC) of 0.95 by cross-validation and a PC of 0.8 for predicting Tm values of synthetic CMPs. Our chemically synthesized short CMPs and recombinantly expressed long CMPs could self-assemble, with the lowest requirement for hydrogel formation at a concentration of 0.08% (w/v). Five CMPs could promote osteoblast differentiation. Our results demonstrated the potential for using computer-aided methods to design functional self-assembling CMPs.</p>","PeriodicalId":9209,"journal":{"name":"Briefings in bioinformatics","volume":"26 1","pages":""},"PeriodicalIF":6.8,"publicationDate":"2024-11-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11650526/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142833838","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Comprehensive bioinformatics and machine learning analyses for breast cancer staging using TCGA dataset. 基于TCGA数据集的乳腺癌分期综合生物信息学和机器学习分析。
IF 6.8 2区 生物学 Q1 BIOCHEMICAL RESEARCH METHODS Pub Date : 2024-11-22 DOI: 10.1093/bib/bbae628
Saurav Chandra Das, Wahia Tasnim, Humayan Kabir Rana, Uzzal Kumar Acharjee, Md Manowarul Islam, Rabea Khatun

Breast cancer is an alarming global health concern, including a vast and varied set of illnesses with different molecular characteristics. The fusion of sophisticated computational methodologies with extensive biological datasets has emerged as an effective strategy for unravelling complex patterns in cancer oncology. This research delves into breast cancer staging, classification, and diagnosis by leveraging the comprehensive dataset provided by the The Cancer Genome Atlas (TCGA). By integrating advanced machine learning algorithms with bioinformatics analysis, it introduces a cutting-edge methodology for identifying complex molecular signatures associated with different subtypes and stages of breast cancer. This study utilizes TCGA gene expression data to detect and categorize breast cancer through the application of machine learning and systems biology techniques. Researchers identified differentially expressed genes in breast cancer and analyzed them using signaling pathways, protein-protein interactions, and regulatory networks to uncover potential therapeutic targets. The study also highlights the roles of specific proteins (MYH2, MYL1, MYL2, MYH7) and microRNAs (such as hsa-let-7d-5p) that are the potential biomarkers in cancer progression founded on several analyses. In terms of diagnostic accuracy for cancer staging, the random forest method achieved 97.19%, while the XGBoost algorithm attained 95.23%. Bioinformatics and machine learning meet in this study to find potential biomarkers that influence the progression of breast cancer. The combination of sophisticated analytical methods and extensive genomic datasets presents a promising path for expanding our understanding and enhancing clinical outcomes in identifying and categorizing this intricate illness.

乳腺癌是一个令人担忧的全球健康问题,包括一系列具有不同分子特征的种类繁多的疾病。复杂的计算方法与广泛的生物数据集的融合已经成为揭示癌症肿瘤学复杂模式的有效策略。本研究通过利用癌症基因组图谱(TCGA)提供的综合数据集,深入研究乳腺癌的分期、分类和诊断。通过将先进的机器学习算法与生物信息学分析相结合,它引入了一种前沿的方法来识别与不同亚型和阶段乳腺癌相关的复杂分子特征。本研究利用TCGA基因表达数据,通过应用机器学习和系统生物学技术对乳腺癌进行检测和分类。研究人员确定了乳腺癌中差异表达的基因,并利用信号通路、蛋白质相互作用和调节网络对其进行分析,以发现潜在的治疗靶点。该研究还强调了特定蛋白质(MYH2, MYL1, MYL2, MYH7)和microrna(如hsa-let-7d-5p)的作用,这些蛋白质是基于几项分析的癌症进展中的潜在生物标志物。在癌症分期的诊断准确率方面,随机森林方法达到97.19%,XGBoost算法达到95.23%。生物信息学和机器学习在这项研究中相遇,以寻找影响乳腺癌进展的潜在生物标志物。复杂的分析方法和广泛的基因组数据集的结合为扩大我们对这种复杂疾病的认识和提高临床结果提供了一条有希望的途径。
{"title":"Comprehensive bioinformatics and machine learning analyses for breast cancer staging using TCGA dataset.","authors":"Saurav Chandra Das, Wahia Tasnim, Humayan Kabir Rana, Uzzal Kumar Acharjee, Md Manowarul Islam, Rabea Khatun","doi":"10.1093/bib/bbae628","DOIUrl":"10.1093/bib/bbae628","url":null,"abstract":"<p><p>Breast cancer is an alarming global health concern, including a vast and varied set of illnesses with different molecular characteristics. The fusion of sophisticated computational methodologies with extensive biological datasets has emerged as an effective strategy for unravelling complex patterns in cancer oncology. This research delves into breast cancer staging, classification, and diagnosis by leveraging the comprehensive dataset provided by the The Cancer Genome Atlas (TCGA). By integrating advanced machine learning algorithms with bioinformatics analysis, it introduces a cutting-edge methodology for identifying complex molecular signatures associated with different subtypes and stages of breast cancer. This study utilizes TCGA gene expression data to detect and categorize breast cancer through the application of machine learning and systems biology techniques. Researchers identified differentially expressed genes in breast cancer and analyzed them using signaling pathways, protein-protein interactions, and regulatory networks to uncover potential therapeutic targets. The study also highlights the roles of specific proteins (MYH2, MYL1, MYL2, MYH7) and microRNAs (such as hsa-let-7d-5p) that are the potential biomarkers in cancer progression founded on several analyses. In terms of diagnostic accuracy for cancer staging, the random forest method achieved 97.19%, while the XGBoost algorithm attained 95.23%. Bioinformatics and machine learning meet in this study to find potential biomarkers that influence the progression of breast cancer. The combination of sophisticated analytical methods and extensive genomic datasets presents a promising path for expanding our understanding and enhancing clinical outcomes in identifying and categorizing this intricate illness.</p>","PeriodicalId":9209,"journal":{"name":"Briefings in bioinformatics","volume":"26 1","pages":""},"PeriodicalIF":6.8,"publicationDate":"2024-11-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11630003/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142827380","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Briefings in bioinformatics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1