首页 > 最新文献

Briefings in bioinformatics最新文献

英文 中文
Cross-ancestry information transfer framework improves protein abundance prediction and protein-trait association identification. 跨祖先信息传递框架改进了蛋白质丰度预测和蛋白质性状关联鉴定。
IF 7.7 2区 生物学 Q1 BIOCHEMICAL RESEARCH METHODS Pub Date : 2026-01-07 DOI: 10.1093/bib/bbaf707
Wenli Zhai, Lingyun Sun, Wenwei Fang, Yidan Dong, Chunxiao Cheng, Yuanjiao Liu, Yuan Zhou, Jiadong Ji, Lang Wu, An Pan, Eric R Gamazon, Xiong-Fei Pan, Dan Zhou

Genetics-informed proteome-wide association studies (PWASs) provide an effective way to uncover proteomic mechanisms underlying complex diseases. PWAS relies on an ancestry-matched reference panel to model the impact of genetically determined protein expression on phenotype. However, reference panels from underrepresented populations remain relatively limited. We developed a multi-ancestry framework to enhance protein prediction in these populations by integrating diverse information-sharing strategies into a Multi-Ancestry Best-performing Model (MABM). Results indicated that MABM increased the prediction performance with higher performance observed in both cross-validation and an external dataset. Leveraging the Biobank Japan, we identified three times as many significant PWAS associations using MABM as using Lasso model. Notably, 47.5% of the MABM specific associations were reproduced in independent East Asian datasets with concordant effect sizes. Furthermore, MABM enhanced decision-making in gene/protein prioritization for functional validation for complex traits by validating well-established associations and uncovering novel trait-related candidates. The benefits of MABM were further validated in additional ancestries and demonstrated in brain tissue-based PWAS, underscoring its broad applicability. Our findings close critical gaps in multi-omics research among underrepresented populations and facilitate trait-relevant protein discovery in underrepresented populations.

遗传信息蛋白质组关联研究(PWASs)为揭示复杂疾病背后的蛋白质组机制提供了一种有效的方法。PWAS依赖于一个祖先匹配的参考面板来模拟基因决定的蛋白质表达对表型的影响。然而,来自代表性不足人口的参考小组仍然相对有限。我们开发了一个多祖先框架,通过将不同的信息共享策略整合到多祖先最佳表现模型(MABM)中来增强这些人群的蛋白质预测。结果表明,MABM提高了预测性能,在交叉验证和外部数据集中都观察到更高的性能。利用日本生物银行,我们发现使用MABM的PWAS关联是使用Lasso模型的三倍。值得注意的是,47.5%的MABM特异性关联在具有一致效应量的独立东亚数据集中重现。此外,MABM通过验证已建立的关联和发现新的性状相关候选者,增强了复杂性状功能验证中基因/蛋白优先级的决策。MABM的益处在其他祖先中得到进一步验证,并在基于脑组织的PWAS中得到证实,强调了其广泛的适用性。我们的研究结果填补了代表性不足人群中多组学研究的关键空白,并促进了代表性不足人群中性状相关蛋白的发现。
{"title":"Cross-ancestry information transfer framework improves protein abundance prediction and protein-trait association identification.","authors":"Wenli Zhai, Lingyun Sun, Wenwei Fang, Yidan Dong, Chunxiao Cheng, Yuanjiao Liu, Yuan Zhou, Jiadong Ji, Lang Wu, An Pan, Eric R Gamazon, Xiong-Fei Pan, Dan Zhou","doi":"10.1093/bib/bbaf707","DOIUrl":"10.1093/bib/bbaf707","url":null,"abstract":"<p><p>Genetics-informed proteome-wide association studies (PWASs) provide an effective way to uncover proteomic mechanisms underlying complex diseases. PWAS relies on an ancestry-matched reference panel to model the impact of genetically determined protein expression on phenotype. However, reference panels from underrepresented populations remain relatively limited. We developed a multi-ancestry framework to enhance protein prediction in these populations by integrating diverse information-sharing strategies into a Multi-Ancestry Best-performing Model (MABM). Results indicated that MABM increased the prediction performance with higher performance observed in both cross-validation and an external dataset. Leveraging the Biobank Japan, we identified three times as many significant PWAS associations using MABM as using Lasso model. Notably, 47.5% of the MABM specific associations were reproduced in independent East Asian datasets with concordant effect sizes. Furthermore, MABM enhanced decision-making in gene/protein prioritization for functional validation for complex traits by validating well-established associations and uncovering novel trait-related candidates. The benefits of MABM were further validated in additional ancestries and demonstrated in brain tissue-based PWAS, underscoring its broad applicability. Our findings close critical gaps in multi-omics research among underrepresented populations and facilitate trait-relevant protein discovery in underrepresented populations.</p>","PeriodicalId":9209,"journal":{"name":"Briefings in bioinformatics","volume":"27 1","pages":""},"PeriodicalIF":7.7,"publicationDate":"2026-01-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12777707/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145917075","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
iceDP: identifying inter-chromatin engagement via density peaks clustering algorithm. iceDP:通过密度峰聚类算法识别染色质间接合。
IF 7.7 2区 生物学 Q1 BIOCHEMICAL RESEARCH METHODS Pub Date : 2026-01-07 DOI: 10.1093/bib/bbaf704
Ruhai Chen, Jiekai Chen, Lingling Shi, Jiangping He

Chromatin topological structure is critical for gene regulation. Hi-C based experiments have significantly advanced our understanding chromatin organization. Numerous computational tools have been developed to identify various structural levels of chromatin, ranging from compartments to loops. However, there remains a lack of specialized tools for identifying non-homologous inter-chromatin contacts (NHCCs), which play important roles in chromosome territories. In this study, we present iceDP, a tool that leverages the Density Peaks clustering algorithm to identify local high-density regions within inter-chromatin. These regions undergo two subsequent filtering steps to eliminate obvious false positives. When applied to three Hi-C datasets, iceDP accurately identified known NHCCs, including olfactory receptor genes in mature olfactory sensory neurons and Polycomb repressive complex-regulated developmental genes in mouse embryonic stem cells (mESCs). Notably, iceDP also uncovered previously unreported transcriptionally active NHCCs. Compared to diffHiC and FitHiC, iceDP exhibited superior performance with the highest positive rate. Moreover, iceDP is compatible with a wide range of chromatin conformation capture techniques, including in-situ Hi-C, Micro-C, HiChIP, and BL-HiC, demonstrating its versatility and utility.

染色质拓扑结构对基因调控至关重要。基于Hi-C的实验极大地促进了我们对染色质组织的理解。已经开发了许多计算工具来识别染色质的各种结构水平,从隔室到环。然而,仍然缺乏专门的工具来识别非同源染色质间接触(nhcc),它在染色体区域中起着重要作用。在这项研究中,我们提出了iceDP,一个利用密度峰聚类算法来识别染色质间局部高密度区域的工具。这些区域经过两个后续的过滤步骤,以消除明显的误报。当应用于三个high - c数据集时,iceDP准确地鉴定了已知的nhcc,包括成熟嗅觉感觉神经元中的嗅觉受体基因和小鼠胚胎干细胞(mESCs)中的Polycomb抑制复合物调节的发育基因。值得注意的是,iceDP还发现了以前未报道的转录活性nhcc。与diffHiC和FitHiC相比,iceDP表现出更好的性能,阳性率最高。此外,iceDP与广泛的染色质构象捕获技术兼容,包括原位Hi-C、Micro-C、HiChIP和bl - hc,显示了其通用性和实用性。
{"title":"iceDP: identifying inter-chromatin engagement via density peaks clustering algorithm.","authors":"Ruhai Chen, Jiekai Chen, Lingling Shi, Jiangping He","doi":"10.1093/bib/bbaf704","DOIUrl":"10.1093/bib/bbaf704","url":null,"abstract":"<p><p>Chromatin topological structure is critical for gene regulation. Hi-C based experiments have significantly advanced our understanding chromatin organization. Numerous computational tools have been developed to identify various structural levels of chromatin, ranging from compartments to loops. However, there remains a lack of specialized tools for identifying non-homologous inter-chromatin contacts (NHCCs), which play important roles in chromosome territories. In this study, we present iceDP, a tool that leverages the Density Peaks clustering algorithm to identify local high-density regions within inter-chromatin. These regions undergo two subsequent filtering steps to eliminate obvious false positives. When applied to three Hi-C datasets, iceDP accurately identified known NHCCs, including olfactory receptor genes in mature olfactory sensory neurons and Polycomb repressive complex-regulated developmental genes in mouse embryonic stem cells (mESCs). Notably, iceDP also uncovered previously unreported transcriptionally active NHCCs. Compared to diffHiC and FitHiC, iceDP exhibited superior performance with the highest positive rate. Moreover, iceDP is compatible with a wide range of chromatin conformation capture techniques, including in-situ Hi-C, Micro-C, HiChIP, and BL-HiC, demonstrating its versatility and utility.</p>","PeriodicalId":9209,"journal":{"name":"Briefings in bioinformatics","volume":"27 1","pages":""},"PeriodicalIF":7.7,"publicationDate":"2026-01-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12777978/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145917093","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
CircRM: profiling circular RNA modifications from nanopore direct RNA sequencing. CircRM:从纳米孔直接RNA测序分析环状RNA修饰。
IF 7.7 2区 生物学 Q1 BIOCHEMICAL RESEARCH METHODS Pub Date : 2026-01-07 DOI: 10.1093/bib/bbaf726
Jiayi Li, Shenglun Chen, Zhixing Wu, Haozhe Wang, Rong Xia, Jia Meng, Yuxin Zhang

Circular RNA (circRNA) represents a critical class of regulatory RNAs with distinctive structural and functional features. The functions of circRNAs are modulated by various RNA modifications. Here, we present CircRM, a nanopore direct RNA sequencing-based computational method for profiling RNA modifications in circRNAs at single-base and single-molecule resolution. By integrating circRNA detection, read-level modification detection, and quantitative assessment of methylation rates, CircRM identified 427 high-confidence circRNAs and enables systematic characterization of three major modifications, m5C (AUC = 0.855), m6A (AUC = 0.817) and m1A (AUC = 0.769). It revealed distinct modification patterns compared with linear RNAs, highlighting RNA-type-specific regulations. We also identified the key features of circRNA-specific modifications, such as the enrichment near the back-splice junctions. Cross-cell line analyses further demonstrated conserved and cell-type-specific modification patterns. Together, these findings reveal, at the computational level, a unique epitranscriptomic landscape associated with circRNAs and establish CircRM as a powerful tool for advancing the study of RNA modifications in circular RNA biology. CircRM is free accessible at: https://github.com/jiayiAnnie17/CircRM.

环状RNA (circRNA)是一类具有独特结构和功能特征的关键调控RNA。环状RNA的功能受到各种RNA修饰的调节。在这里,我们提出了CircRM,一种基于纳米孔直接RNA测序的计算方法,用于在单碱基和单分子分辨率下分析circRNAs中的RNA修饰。通过整合circRNA检测、读级修饰检测和甲基化率定量评估,CircRM鉴定出427个高置信度的circRNA,并能够系统表征三种主要修饰,m5C (AUC = 0.855)、m6A (AUC = 0.817)和m1A (AUC = 0.769)。与线性rna相比,它揭示了不同的修饰模式,突出了rna类型特异性调控。我们还确定了circrna特异性修饰的关键特征,例如后剪接连接处附近的富集。跨细胞系分析进一步证明了保守的和细胞类型特异性的修饰模式。总之,这些发现在计算水平上揭示了与环状RNA相关的独特的表转录组学景观,并使CircRM成为推进环状RNA生物学中RNA修饰研究的有力工具。CircRM可以免费访问:https://github.com/jiayiAnnie17/CircRM。
{"title":"CircRM: profiling circular RNA modifications from nanopore direct RNA sequencing.","authors":"Jiayi Li, Shenglun Chen, Zhixing Wu, Haozhe Wang, Rong Xia, Jia Meng, Yuxin Zhang","doi":"10.1093/bib/bbaf726","DOIUrl":"10.1093/bib/bbaf726","url":null,"abstract":"<p><p>Circular RNA (circRNA) represents a critical class of regulatory RNAs with distinctive structural and functional features. The functions of circRNAs are modulated by various RNA modifications. Here, we present CircRM, a nanopore direct RNA sequencing-based computational method for profiling RNA modifications in circRNAs at single-base and single-molecule resolution. By integrating circRNA detection, read-level modification detection, and quantitative assessment of methylation rates, CircRM identified 427 high-confidence circRNAs and enables systematic characterization of three major modifications, m5C (AUC = 0.855), m6A (AUC = 0.817) and m1A (AUC = 0.769). It revealed distinct modification patterns compared with linear RNAs, highlighting RNA-type-specific regulations. We also identified the key features of circRNA-specific modifications, such as the enrichment near the back-splice junctions. Cross-cell line analyses further demonstrated conserved and cell-type-specific modification patterns. Together, these findings reveal, at the computational level, a unique epitranscriptomic landscape associated with circRNAs and establish CircRM as a powerful tool for advancing the study of RNA modifications in circular RNA biology. CircRM is free accessible at: https://github.com/jiayiAnnie17/CircRM.</p>","PeriodicalId":9209,"journal":{"name":"Briefings in bioinformatics","volume":"27 1","pages":""},"PeriodicalIF":7.7,"publicationDate":"2026-01-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12798809/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145965377","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Probing genomic language models: Nucleotide Generative Pretrained Transformer and the role of pretraining in learned representations. 探索基因组语言模型:核苷酸生成预训练转换器和预训练在学习表征中的作用。
IF 7.7 2区 生物学 Q1 BIOCHEMICAL RESEARCH METHODS Pub Date : 2026-01-07 DOI: 10.1093/bib/bbag011
Shae M Mclaughlin, Daniel A Lim

The value and nature of the representations learned during the pretraining of genomic language models (gLMs) remain actively debated. We introduce Nucleotide Generative Pretrained Transformer (GPT), a decoder-only transformer with single-nucleotide tokenization, to dissect the role of pretraining. Through experiments varying repetitive element (RE) weights during pretraining (0.0-1.0), comparative finetuning against random initialization, linear probing of internal representations, and sparse autoencoder (SAE)-based interpretability, we evaluated the impact of pretraining and how REs in genomic data influence model learning. Models with moderate RE downweighting (0.5) consistently achieved optimal performance across seven genomic classification tasks, with pretrained models providing substantial performance gains over baselines. SAE feature annotation via sequence alignment revealed substantial RE-associated patterns in the pretrained model internal representations, suggesting that REs-which comprise 30%-60% of mammalian genomes-may dominate the pretraining objective. Our findings support the utility of pretraining and underscore the need for pretraining strategies that better accommodate repetitive sequences across the genome while also fostering the learning of less common but biologically important representations. This study highlights a key challenge for gLMs: ensuring that models broadly learn functional genomic syntax beyond simply recognizing ubiquitous repeats.

在基因组语言模型(gLMs)预训练期间学习到的表征的价值和性质仍然存在积极的争论。我们引入核苷酸生成预训练转换器(GPT),一种具有单核苷酸标记化的解码器转换器,来剖析预训练的作用。通过在预训练(0.0-1.0)期间改变重复元素(RE)权重的实验,对随机初始化的比较微调,内部表征的线性探测以及基于稀疏自编码器(SAE)的可解释性,我们评估了预训练的影响以及基因组数据中的REs如何影响模型学习。适度RE降权重(0.5)的模型在七个基因组分类任务中始终获得最佳性能,预训练模型在基线上提供了显著的性能提升。通过序列比对的SAE特征注释揭示了预训练模型内部表示中大量re相关模式,表明re(占哺乳动物基因组的30%-60%)可能主导预训练目标。我们的研究结果支持了预训练的效用,并强调了预训练策略的必要性,这种策略可以更好地适应基因组中的重复序列,同时也可以促进对不太常见但生物学上重要的表征的学习。这项研究强调了glm面临的一个关键挑战:确保模型广泛地学习功能基因组语法,而不仅仅是识别无处不在的重复序列。
{"title":"Probing genomic language models: Nucleotide Generative Pretrained Transformer and the role of pretraining in learned representations.","authors":"Shae M Mclaughlin, Daniel A Lim","doi":"10.1093/bib/bbag011","DOIUrl":"10.1093/bib/bbag011","url":null,"abstract":"<p><p>The value and nature of the representations learned during the pretraining of genomic language models (gLMs) remain actively debated. We introduce Nucleotide Generative Pretrained Transformer (GPT), a decoder-only transformer with single-nucleotide tokenization, to dissect the role of pretraining. Through experiments varying repetitive element (RE) weights during pretraining (0.0-1.0), comparative finetuning against random initialization, linear probing of internal representations, and sparse autoencoder (SAE)-based interpretability, we evaluated the impact of pretraining and how REs in genomic data influence model learning. Models with moderate RE downweighting (0.5) consistently achieved optimal performance across seven genomic classification tasks, with pretrained models providing substantial performance gains over baselines. SAE feature annotation via sequence alignment revealed substantial RE-associated patterns in the pretrained model internal representations, suggesting that REs-which comprise 30%-60% of mammalian genomes-may dominate the pretraining objective. Our findings support the utility of pretraining and underscore the need for pretraining strategies that better accommodate repetitive sequences across the genome while also fostering the learning of less common but biologically important representations. This study highlights a key challenge for gLMs: ensuring that models broadly learn functional genomic syntax beyond simply recognizing ubiquitous repeats.</p>","PeriodicalId":9209,"journal":{"name":"Briefings in bioinformatics","volume":"27 1","pages":""},"PeriodicalIF":7.7,"publicationDate":"2026-01-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12866925/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146112291","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Unveiling patterns: an exploration of machine learning techniques for unsupervised feature selection in single-cell data. 揭示模式:探索单细胞数据中无监督特征选择的机器学习技术。
IF 7.7 2区 生物学 Q1 BIOCHEMICAL RESEARCH METHODS Pub Date : 2026-01-07 DOI: 10.1093/bib/bbag006
Nandini Chatterjee, Aleksandr Taraskin, Hridya Divakaran, Natalia Jaeger, Victor Enriquez, Catherine C Hedrick, Ahmad Alimadadi

The rapid evolution of single-cell technologies has generated vast, multimodal datasets encompassing genomic, transcriptomic, proteomic, and spatial information. However, high dimensionality, noise, and computational costs pose significant challenges, often introducing bias through traditional feature selection methods, such as highly variable gene selection. Unsupervised machine learning (ML) provides a solution by identifying informative features without predefined labels, thereby minimizing bias and capturing complex patterns. This paper reviews a diverse array of unsupervised ML techniques tailored for single-cell data. These approaches could enhance downstream analyses, such as clustering, dimensionality reduction, visualization, and data denoising, and reveal biologically relevant gene modules. Despite their advantages, challenges such as data sparsity, parameter tuning, and scalability persist. Future directions include integrating multiomic data, incorporating domain-specific knowledge, and developing scalable and interpretable algorithms. By addressing these challenges, unsupervised ML-based feature selection promises to revolutionize single-cell data analysis, driving unbiased insights into cellular heterogeneity and advancing biological discovery.

单细胞技术的快速发展产生了大量的、多模式的数据集,包括基因组、转录组、蛋白质组和空间信息。然而,高维数、噪声和计算成本带来了重大挑战,通常会通过传统的特征选择方法(如高度可变的基因选择)引入偏差。无监督机器学习(ML)提供了一种解决方案,通过在没有预定义标签的情况下识别信息特征,从而最大限度地减少偏见并捕获复杂模式。本文回顾了为单细胞数据量身定制的各种无监督ML技术。这些方法可以增强下游分析,如聚类、降维、可视化和数据去噪,并揭示生物学相关的基因模块。尽管它们具有优势,但数据稀疏性、参数调优和可伸缩性等挑战仍然存在。未来的方向包括集成多组数据,结合特定领域的知识,以及开发可扩展和可解释的算法。通过解决这些挑战,基于无监督机器学习的特征选择有望彻底改变单细胞数据分析,推动对细胞异质性的公正见解,并推进生物学发现。
{"title":"Unveiling patterns: an exploration of machine learning techniques for unsupervised feature selection in single-cell data.","authors":"Nandini Chatterjee, Aleksandr Taraskin, Hridya Divakaran, Natalia Jaeger, Victor Enriquez, Catherine C Hedrick, Ahmad Alimadadi","doi":"10.1093/bib/bbag006","DOIUrl":"10.1093/bib/bbag006","url":null,"abstract":"<p><p>The rapid evolution of single-cell technologies has generated vast, multimodal datasets encompassing genomic, transcriptomic, proteomic, and spatial information. However, high dimensionality, noise, and computational costs pose significant challenges, often introducing bias through traditional feature selection methods, such as highly variable gene selection. Unsupervised machine learning (ML) provides a solution by identifying informative features without predefined labels, thereby minimizing bias and capturing complex patterns. This paper reviews a diverse array of unsupervised ML techniques tailored for single-cell data. These approaches could enhance downstream analyses, such as clustering, dimensionality reduction, visualization, and data denoising, and reveal biologically relevant gene modules. Despite their advantages, challenges such as data sparsity, parameter tuning, and scalability persist. Future directions include integrating multiomic data, incorporating domain-specific knowledge, and developing scalable and interpretable algorithms. By addressing these challenges, unsupervised ML-based feature selection promises to revolutionize single-cell data analysis, driving unbiased insights into cellular heterogeneity and advancing biological discovery.</p>","PeriodicalId":9209,"journal":{"name":"Briefings in bioinformatics","volume":"27 1","pages":""},"PeriodicalIF":7.7,"publicationDate":"2026-01-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12834302/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146050340","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Personalized gene expression prediction in the era of deep learning: a review. 深度学习时代的个性化基因表达预测:综述。
IF 7.7 2区 生物学 Q1 BIOCHEMICAL RESEARCH METHODS Pub Date : 2026-01-07 DOI: 10.1093/bib/bbag022
Viksar Dubey, Li Shen

Predicting gene expression from genomic sequences is a central goal in computational genomics. Recent advances have demonstrated that deep learning models trained on large-scale epigenomic datasets hold significant promise for this task. However, their success heavily depends on how they are applied: most models are trained exclusively on a reference genome, limiting their ability to capture individual-specific genetic variation. Consequently, while these models perform well on reference genomes, they often struggle when applied to personal genomic data. This review discusses recent efforts to overcome these limitations and explores methods aimed at improving the prediction of personalized gene expression. In particular, we compare the performance of deep learning models with traditional expression quantitative trait loci-based linear approaches, examining novel fine-tuning strategies, and highlighting the emergence of genomic language models. Across multiple studies, we find that deep learning models still face significant challenges in outperforming linear models for cross-individual gene expression prediction. Despite ongoing advances in model architecture and training methodology, accurately and robustly predicting personalized gene expression remains an open challenge in the field.

从基因组序列预测基因表达是计算基因组学的核心目标。最近的进展表明,在大规模表观基因组数据集上训练的深度学习模型在这项任务中具有重要的前景。然而,它们的成功在很大程度上取决于它们的应用方式:大多数模型都是专门针对参考基因组进行训练的,这限制了它们捕捉个体特异性遗传变异的能力。因此,虽然这些模型在参考基因组上表现良好,但在应用于个人基因组数据时往往表现不佳。这篇综述讨论了最近克服这些限制的努力,并探讨了旨在提高个性化基因表达预测的方法。特别地,我们比较了深度学习模型与传统的基于表达数量性状的基于位点的线性方法的性能,研究了新的微调策略,并强调了基因组语言模型的出现。在多项研究中,我们发现深度学习模型在跨个体基因表达预测方面仍然面临着重大挑战。尽管在模型架构和训练方法方面不断取得进展,但准确而稳健地预测个性化基因表达仍然是该领域的一个公开挑战。
{"title":"Personalized gene expression prediction in the era of deep learning: a review.","authors":"Viksar Dubey, Li Shen","doi":"10.1093/bib/bbag022","DOIUrl":"10.1093/bib/bbag022","url":null,"abstract":"<p><p>Predicting gene expression from genomic sequences is a central goal in computational genomics. Recent advances have demonstrated that deep learning models trained on large-scale epigenomic datasets hold significant promise for this task. However, their success heavily depends on how they are applied: most models are trained exclusively on a reference genome, limiting their ability to capture individual-specific genetic variation. Consequently, while these models perform well on reference genomes, they often struggle when applied to personal genomic data. This review discusses recent efforts to overcome these limitations and explores methods aimed at improving the prediction of personalized gene expression. In particular, we compare the performance of deep learning models with traditional expression quantitative trait loci-based linear approaches, examining novel fine-tuning strategies, and highlighting the emergence of genomic language models. Across multiple studies, we find that deep learning models still face significant challenges in outperforming linear models for cross-individual gene expression prediction. Despite ongoing advances in model architecture and training methodology, accurately and robustly predicting personalized gene expression remains an open challenge in the field.</p>","PeriodicalId":9209,"journal":{"name":"Briefings in bioinformatics","volume":"27 1","pages":""},"PeriodicalIF":7.7,"publicationDate":"2026-01-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12856953/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146084285","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
SGCRNA: spectral clustering-guided co-expression network analysis without scale-free constraints for multi-omic data. SGCRNA:多基因组数据无标度约束的谱聚类引导共表达网络分析。
IF 7.7 2区 生物学 Q1 BIOCHEMICAL RESEARCH METHODS Pub Date : 2026-01-07 DOI: 10.1093/bib/bbag021
Tatsunori Osone, Tomoka Takao, Shigeo Otake, Takeshi Takarada

Weighted gene co-expression network analysis (WGCNA) is among the most widely employed methods in bioinformatics. WGCNA enables the identification of gene clusters (modules) exhibiting correlated expression patterns, the association of these modules with traits, and the exploration of candidate biomarker genes by focusing on hub genes within the modules. WGCNA has been successfully applied in diverse biological contexts. However, conventional algorithms manifest three principal limitations: the assumption of scale-free topology, the requirement for parameter tuning, and the neglect of regression line slopes. These limitations are addressed by SGCRNA. SGCRNA provides Julia functions for the analysis of co-expression networks derived from various types of biological data, such as gene expression data. The Julia packages and their source code are freely available at https://github.com/C37H41N2O6/SGCRNAs.jl.

加权基因共表达网络分析(WGCNA)是生物信息学中应用最广泛的方法之一。WGCNA能够识别出表现出相关表达模式的基因簇(模块),这些模块与性状的关联,并通过关注模块内的中心基因来探索候选生物标志物基因。WGCNA已成功地应用于多种生物学背景。然而,传统的算法表现出三个主要的局限性:假设无标度拓扑,参数调整的要求,以及忽略回归线斜率。SGCRNA解决了这些限制。SGCRNA提供Julia功能,用于分析源自各种生物数据(如基因表达数据)的共表达网络。Julia包及其源代码可以在https://github.com/C37H41N2O6/SGCRNAs.jl上免费获得。
{"title":"SGCRNA: spectral clustering-guided co-expression network analysis without scale-free constraints for multi-omic data.","authors":"Tatsunori Osone, Tomoka Takao, Shigeo Otake, Takeshi Takarada","doi":"10.1093/bib/bbag021","DOIUrl":"10.1093/bib/bbag021","url":null,"abstract":"<p><p>Weighted gene co-expression network analysis (WGCNA) is among the most widely employed methods in bioinformatics. WGCNA enables the identification of gene clusters (modules) exhibiting correlated expression patterns, the association of these modules with traits, and the exploration of candidate biomarker genes by focusing on hub genes within the modules. WGCNA has been successfully applied in diverse biological contexts. However, conventional algorithms manifest three principal limitations: the assumption of scale-free topology, the requirement for parameter tuning, and the neglect of regression line slopes. These limitations are addressed by SGCRNA. SGCRNA provides Julia functions for the analysis of co-expression networks derived from various types of biological data, such as gene expression data. The Julia packages and their source code are freely available at https://github.com/C37H41N2O6/SGCRNAs.jl.</p>","PeriodicalId":9209,"journal":{"name":"Briefings in bioinformatics","volume":"27 1","pages":""},"PeriodicalIF":7.7,"publicationDate":"2026-01-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12856952/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146084364","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Decoding RNA triple helices: identification from sequence and secondary structure. 解码RNA三螺旋:从序列和二级结构鉴定。
IF 7.7 2区 生物学 Q1 BIOCHEMICAL RESEARCH METHODS Pub Date : 2026-01-07 DOI: 10.1093/bib/bbag009
Margherita A G Matarrese, Michela Quadrini, Nicole Luchetti, Federico Di Petta, Daniele Durante, Monica Ballarino, Letizia Chiodo, Luca Tesei

The discovery of long non-coding RNAs (lncRNA) has revealed additional layers of gene-expression control. Specific interactions of lncRNAs with DNA, RNAs, and RNA-binding proteins enable regulation in both cytoplasmic and nuclear compartments; e.g. a conserved triple-helix motif is essential for MALAT1 stability and oncogenic activity. Here, we present a secondary-structure-based framework to annotate and detect RNA triple helices. First, we extend the dot-bracket formalism with a third annotation line that encodes Hoogsteen contacts. Second, we introduce TripleMatcher, which searches for a triple-helix pattern, filters candidates by C1'-C1' distance thresholds, and merges overlaps into region-level zones. Using telomerase RNAs and RNA-stability elements with experimentally established triple helices (8 RNAs), TripleMatcher localized all annotated regions (structure-wise detection 8/8); geometric filtering removed most spurious candidates and improved precision (positive predictive value from 0.42 to 0.81) and overall accuracy (F$_{1}$ from 0.42 to 0.62) while maintaining sensitivity. Benchmarking eight predictors showed that pseudoknot-aware methods most reliably reproduce the local architecture required for detection, aligning secondary-structure quality with downstream triple-helix recovery. Applied prospectively, the framework identified candidate regions directly from predicted secondary structures and scaled to a screen of 4160 RNAs, where distance filtering reduced 150 990 (median per molecule: 108 [20-270]) raw candidates to 97 geometrically feasible regions across seven molecules, including human telomerase complexes. Together, the notation and TripleMatcher provide a concise route from secondary structure to a small, interpretable set of triple-helix candidates suitable for targeted experimental validation.

长链非编码rna (lncRNA)的发现揭示了基因表达控制的额外层面。lncRNAs与DNA、rna和rna结合蛋白的特异性相互作用使细胞质和核室的调控成为可能;例如,保守的三螺旋基序对MALAT1的稳定性和致癌活性至关重要。在这里,我们提出了一个基于二级结构的框架来注释和检测RNA三螺旋。首先,我们用第三个注释行扩展点括号形式,该注释行对Hoogsteen联系人进行编码。其次,我们引入了TripleMatcher,它搜索三螺旋模式,通过C1‘-C1’距离阈值过滤候选区域,并将重叠区域合并为区域级区域。使用端粒酶rna和具有实验建立的三螺旋(8个rna)的rna稳定元件,TripleMatcher定位了所有注释区域(结构检测8/8);几何滤波在保持灵敏度的同时,提高了精度(阳性预测值从0.42提高到0.81)和总体精度(F$_{1}$从0.42提高到0.62)。对8个预测指标进行基准测试表明,伪结感知方法最可靠地再现了检测所需的局部结构,使二级结构质量与下游三螺旋恢复保持一致。展望应用,该框架直接从预测的二级结构中确定候选区域,并缩放到4160个rna的筛选,其中距离过滤将150 990(每个分子中位数:108[20-270])原始候选区域减少到97个几何上可行的区域,跨越7个分子,包括人类端粒酶复合物。该符号和TripleMatcher一起提供了从二级结构到适合目标实验验证的小的、可解释的三螺旋候选序列的简明路线。
{"title":"Decoding RNA triple helices: identification from sequence and secondary structure.","authors":"Margherita A G Matarrese, Michela Quadrini, Nicole Luchetti, Federico Di Petta, Daniele Durante, Monica Ballarino, Letizia Chiodo, Luca Tesei","doi":"10.1093/bib/bbag009","DOIUrl":"10.1093/bib/bbag009","url":null,"abstract":"<p><p>The discovery of long non-coding RNAs (lncRNA) has revealed additional layers of gene-expression control. Specific interactions of lncRNAs with DNA, RNAs, and RNA-binding proteins enable regulation in both cytoplasmic and nuclear compartments; e.g. a conserved triple-helix motif is essential for MALAT1 stability and oncogenic activity. Here, we present a secondary-structure-based framework to annotate and detect RNA triple helices. First, we extend the dot-bracket formalism with a third annotation line that encodes Hoogsteen contacts. Second, we introduce TripleMatcher, which searches for a triple-helix pattern, filters candidates by C1'-C1' distance thresholds, and merges overlaps into region-level zones. Using telomerase RNAs and RNA-stability elements with experimentally established triple helices (8 RNAs), TripleMatcher localized all annotated regions (structure-wise detection 8/8); geometric filtering removed most spurious candidates and improved precision (positive predictive value from 0.42 to 0.81) and overall accuracy (F$_{1}$ from 0.42 to 0.62) while maintaining sensitivity. Benchmarking eight predictors showed that pseudoknot-aware methods most reliably reproduce the local architecture required for detection, aligning secondary-structure quality with downstream triple-helix recovery. Applied prospectively, the framework identified candidate regions directly from predicted secondary structures and scaled to a screen of 4160 RNAs, where distance filtering reduced 150 990 (median per molecule: 108 [20-270]) raw candidates to 97 geometrically feasible regions across seven molecules, including human telomerase complexes. Together, the notation and TripleMatcher provide a concise route from secondary structure to a small, interpretable set of triple-helix candidates suitable for targeted experimental validation.</p>","PeriodicalId":9209,"journal":{"name":"Briefings in bioinformatics","volume":"27 1","pages":""},"PeriodicalIF":7.7,"publicationDate":"2026-01-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12834306/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146050324","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Task-specific pre-training for molecular property prediction. 针对特定任务的分子性质预测预训练。
IF 7.7 2区 生物学 Q1 BIOCHEMICAL RESEARCH METHODS Pub Date : 2026-01-07 DOI: 10.1093/bib/bbag010
Wenbo Zhang, Yihui Wang, Jin Liu, Bowen Ke, Jiancheng Lv, Xianggen Liu

Molecular property prediction is a critical task in computational chemistry and drug discovery. While deep learning has advanced this field, the increasing complexity of models contrasts with the scarcity of labeled data, leading to severe overfitting and limited generalization. In this paper, we propose TasProp, a task-specific pre-training strategy for molecular property prediction, particularly for the scenarios with small labeled datasets. To learn a robust molecular representation, TasProp first projects both labeled and unlabeled data into a unified latent space. Then, we introduce a task-specific contrastive loss that aligns closely with the final prediction task and apply it to the labeled data. This contrastive loss encourages the model to learn more cohesive and distinguishable molecular representations corresponding to property categories, which in turn, enhances the model's performance on downstream property prediction tasks. Additionally, we propose a novel data augmentation method, accompanied by a theoretical analysis, to mitigate the challenge of labeled data scarcity. With the task-specific pre-training and augmented data, TasProp outperforms the state-of-the-art methods on many molecular property prediction tasks, including three publicly available datasets and two curated datasets related to anesthesiology. Furthermore, we provide an interactive web resource to facilitate model exploration and application, allowing users to easily predict the properties of input molecules online.

分子性质预测是计算化学和药物发现中的一项重要任务。虽然深度学习推动了这一领域的发展,但模型的日益复杂与标记数据的稀缺性形成鲜明对比,导致严重的过拟合和有限的泛化。在本文中,我们提出了TasProp,这是一种针对分子性质预测的任务特定预训练策略,特别适用于带有小标记数据集的场景。为了学习健壮的分子表示,TasProp首先将标记和未标记的数据投影到统一的潜在空间中。然后,我们引入与最终预测任务密切相关的特定任务的对比损失,并将其应用于标记数据。这种对比损失鼓励模型学习与属性类别相对应的更有凝聚力和可区分的分子表示,这反过来又增强了模型在下游属性预测任务上的性能。此外,我们提出了一种新的数据增强方法,并辅以理论分析,以减轻标记数据稀缺性的挑战。通过任务特定的预训练和增强数据,TasProp在许多分子特性预测任务上优于最先进的方法,包括三个公开可用的数据集和两个与麻醉学相关的策划数据集。此外,我们提供了一个交互式的网络资源,以促进模型的探索和应用,使用户可以轻松地在线预测输入分子的性质。
{"title":"Task-specific pre-training for molecular property prediction.","authors":"Wenbo Zhang, Yihui Wang, Jin Liu, Bowen Ke, Jiancheng Lv, Xianggen Liu","doi":"10.1093/bib/bbag010","DOIUrl":"10.1093/bib/bbag010","url":null,"abstract":"<p><p>Molecular property prediction is a critical task in computational chemistry and drug discovery. While deep learning has advanced this field, the increasing complexity of models contrasts with the scarcity of labeled data, leading to severe overfitting and limited generalization. In this paper, we propose TasProp, a task-specific pre-training strategy for molecular property prediction, particularly for the scenarios with small labeled datasets. To learn a robust molecular representation, TasProp first projects both labeled and unlabeled data into a unified latent space. Then, we introduce a task-specific contrastive loss that aligns closely with the final prediction task and apply it to the labeled data. This contrastive loss encourages the model to learn more cohesive and distinguishable molecular representations corresponding to property categories, which in turn, enhances the model's performance on downstream property prediction tasks. Additionally, we propose a novel data augmentation method, accompanied by a theoretical analysis, to mitigate the challenge of labeled data scarcity. With the task-specific pre-training and augmented data, TasProp outperforms the state-of-the-art methods on many molecular property prediction tasks, including three publicly available datasets and two curated datasets related to anesthesiology. Furthermore, we provide an interactive web resource to facilitate model exploration and application, allowing users to easily predict the properties of input molecules online.</p>","PeriodicalId":9209,"journal":{"name":"Briefings in bioinformatics","volume":"27 1","pages":""},"PeriodicalIF":7.7,"publicationDate":"2026-01-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12853129/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146084351","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Data-driven discovery of digital twins in biomedical research. 生物医学研究中数字孪生的数据驱动发现。
IF 7.7 2区 生物学 Q1 BIOCHEMICAL RESEARCH METHODS Pub Date : 2026-01-07 DOI: 10.1093/bib/bbaf722
Clémence Métayer, Annabelle Ballesta, Julien Martinelli

Recent technological advances have expanded the availability of high-throughput biological datasets, opening the way to the reliable design of digital twins of biomedical systems or patients. Such computational tools represent key chemical reaction networks driving perturbation or drug response and can profoundly guide drug discovery and personalized therapeutics. Yet, their development still depends on laborious data integration by the human modeler, so that automated approaches are critically needed. The successes of data-driven system discovery in Physics, rooted in clean datasets and well-defined governing laws, have fueled interest in applying similar techniques in Biology, which presents unique challenges. Here, we reviewed 177 methodologies for automatically inferring digital twins from biological time series, which mostly involved symbolic or sparse regression, and recapitulated them in a Shiny app. We evaluated algorithms according to eight biological and methodological challenges, associated with integrating noisy/incomplete data, multiple conditions, prior knowledge, latent variables, or dealing with high dimensionality, unobserved variable derivatives, candidate library design, and uncertainty quantification. Upon these criteria, sparse regression generally outperformed symbolic regression, particularly when using Bayesian frameworks. Next, deep learning and large language models further emerge as innovative tools to integrate prior knowledge, although their reliability and consistency need to be improved. While no single method addresses all challenges, we argue that progress in learning digital twins will come from hybrid and modular frameworks combining chemical reaction network-based mechanistic grounding, Bayesian uncertainty quantification, and the generative and knowledge integration capacities of deep learning. To support their development, we further highlight key components required for future benchmark development to evaluate methods across all challenges.

最近的技术进步扩大了高通量生物数据集的可用性,为生物医学系统或患者的数字双胞胎的可靠设计开辟了道路。这样的计算工具代表了驱动微扰或药物反应的关键化学反应网络,可以深刻地指导药物发现和个性化治疗。然而,它们的开发仍然依赖于人工建模者费力的数据集成,因此自动化方法是非常需要的。物理学中数据驱动系统发现的成功,植根于清晰的数据集和明确的管理规律,激发了对将类似技术应用于生物学的兴趣,这带来了独特的挑战。在这里,我们回顾了177种从生物时间序列中自动推断数字双胞胎的方法,这些方法大多涉及符号或稀疏回归,并在Shiny应用程序中进行了概述。我们根据8种生物学和方法挑战评估了算法,这些挑战与整合噪声/不完整数据、多条件、先验知识、潜在变量、或处理高维、未观察到的变量导数、候选库设计、不确定度量化。根据这些标准,稀疏回归通常优于符号回归,特别是在使用贝叶斯框架时。接下来,深度学习和大型语言模型将进一步成为整合先验知识的创新工具,尽管它们的可靠性和一致性有待提高。虽然没有一种方法可以解决所有挑战,但我们认为,学习数字双胞胎的进展将来自混合和模块化框架,这些框架结合了基于化学反应网络的机械基础、贝叶斯不确定性量化以及深度学习的生成和知识整合能力。为了支持它们的开发,我们进一步强调了未来基准开发所需的关键组件,以评估所有挑战的方法。
{"title":"Data-driven discovery of digital twins in biomedical research.","authors":"Clémence Métayer, Annabelle Ballesta, Julien Martinelli","doi":"10.1093/bib/bbaf722","DOIUrl":"10.1093/bib/bbaf722","url":null,"abstract":"<p><p>Recent technological advances have expanded the availability of high-throughput biological datasets, opening the way to the reliable design of digital twins of biomedical systems or patients. Such computational tools represent key chemical reaction networks driving perturbation or drug response and can profoundly guide drug discovery and personalized therapeutics. Yet, their development still depends on laborious data integration by the human modeler, so that automated approaches are critically needed. The successes of data-driven system discovery in Physics, rooted in clean datasets and well-defined governing laws, have fueled interest in applying similar techniques in Biology, which presents unique challenges. Here, we reviewed 177 methodologies for automatically inferring digital twins from biological time series, which mostly involved symbolic or sparse regression, and recapitulated them in a Shiny app. We evaluated algorithms according to eight biological and methodological challenges, associated with integrating noisy/incomplete data, multiple conditions, prior knowledge, latent variables, or dealing with high dimensionality, unobserved variable derivatives, candidate library design, and uncertainty quantification. Upon these criteria, sparse regression generally outperformed symbolic regression, particularly when using Bayesian frameworks. Next, deep learning and large language models further emerge as innovative tools to integrate prior knowledge, although their reliability and consistency need to be improved. While no single method addresses all challenges, we argue that progress in learning digital twins will come from hybrid and modular frameworks combining chemical reaction network-based mechanistic grounding, Bayesian uncertainty quantification, and the generative and knowledge integration capacities of deep learning. To support their development, we further highlight key components required for future benchmark development to evaluate methods across all challenges.</p>","PeriodicalId":9209,"journal":{"name":"Briefings in bioinformatics","volume":"27 1","pages":""},"PeriodicalIF":7.7,"publicationDate":"2026-01-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12890721/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146156168","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Briefings in bioinformatics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1