首页 > 最新文献

Bioinformatics (Oxford, England)最新文献

英文 中文
3D dendritic spines shape descriptors for efficient classification and morphology analysis in control and Alzheimer's disease modeling neurons. 用于控制和阿尔茨海默病建模神经元的有效分类和形态学分析的3D树突棘形状描述符。
IF 5.4 Pub Date : 2026-02-03 DOI: 10.1093/bioinformatics/btag025
Daria Smirnova, Anita Ustinova, Viacheslav Chukanov, Ekaterina Pchitskaya

Motivation: Dendritic spines, postsynaptic structures characterized by their complex shapes, provide the essential structural foundation for synaptic function. Their shape is dynamic, undergoing alterations in various conditions, notably during neurodegenerative disorders like Alzheimer's disease. The dramatically increasing prevalence of such diseases highlights an urgent need for effective treatments. A key strategy in developing these treatments involves evaluating how dendritic spine morphology responds to potential therapeutic compounds. Although a link between spine shape and function is recognized, its precise nature is still not fully elucidated. Consequently, advancing our understanding of dendritic spines in both health and disease necessitates the urgent development of more effective methods for assessing their morphology.

Results: This study introduces qualitatively new 3D dendritic shape descriptors based on spherical harmonics and Zernike moments and proposes a bases on them clustering approach for grouping dendritic spines with similar shapes applied to 3D polygonal spines meshes acquired from Z-stack dendrite images. By integrating these methods, we achieve improved differentiation between normal and pathological spines represented by the Alzheimer's disease in vitro model, offering a more precise representation of morphological diversity. Additionally, the proposed spherical harmonics approach enables dendritic spine reconstruction from vector-based shape representations, providing a novel tool for studying structural changes associated with neurodegeneration and possibilities for synthetic dendritic spines dataset generation.

Availability and implementation: The software used for experiments is public and available at https://github.com/Biomed-imaging-lab/SpineTool with the DOI: 10.5281/zenodo.17359066. Descriptors codebase is available at https://github.com/Biomed-imaging-lab/Spine-Shape-Descriptors with the DOI: 10.5281/zenodo.17302859.

动机:树突棘是突触后结构,其形状复杂,为突触功能提供了必要的结构基础。它们的形状是动态的,在各种情况下都会发生变化,尤其是在阿尔茨海默病等神经退行性疾病期间。这类疾病的发病率急剧上升,突出表明迫切需要有效的治疗方法。开发这些治疗的关键策略包括评估树突脊柱形态对潜在治疗化合物的反应。虽然脊柱形状和功能之间的联系是公认的,但其确切性质仍未完全阐明。因此,提高我们对树突棘在健康和疾病中的理解,迫切需要开发更有效的方法来评估它们的形态。结果:本文引入了一种基于球面谐波和泽尼克矩的定性三维树突形状描述符,并提出了一种基于球面谐波和泽尼克矩的聚类方法,用于对Z-stack树突图像获取的三维多边形树突网格中具有相似形状的树突棘进行分组。通过整合这些方法,我们实现了以阿尔茨海默病体外模型为代表的正常和病理脊柱的更好区分,提供了更精确的形态学多样性表征。此外,提出的球面谐波方法能够从基于矢量的形状表示中重建树突脊柱,为研究与神经变性相关的结构变化和合成树突脊柱数据集生成的可能性提供了一种新的工具。可用性:用于实验的软件是公开的,可以在https://github.com/Biomed-imaging-lab/SpineTool上获得,DOI: 10.5281/zenodo.17359066。描述符代码库可在https://github.com/Biomed-imaging-lab/Spine-Shape-Descriptors上获得,DOI: 10.5281/zenodo.17302859。补充信息:补充数据可在生物信息学在线获取。
{"title":"3D dendritic spines shape descriptors for efficient classification and morphology analysis in control and Alzheimer's disease modeling neurons.","authors":"Daria Smirnova, Anita Ustinova, Viacheslav Chukanov, Ekaterina Pchitskaya","doi":"10.1093/bioinformatics/btag025","DOIUrl":"10.1093/bioinformatics/btag025","url":null,"abstract":"<p><strong>Motivation: </strong>Dendritic spines, postsynaptic structures characterized by their complex shapes, provide the essential structural foundation for synaptic function. Their shape is dynamic, undergoing alterations in various conditions, notably during neurodegenerative disorders like Alzheimer's disease. The dramatically increasing prevalence of such diseases highlights an urgent need for effective treatments. A key strategy in developing these treatments involves evaluating how dendritic spine morphology responds to potential therapeutic compounds. Although a link between spine shape and function is recognized, its precise nature is still not fully elucidated. Consequently, advancing our understanding of dendritic spines in both health and disease necessitates the urgent development of more effective methods for assessing their morphology.</p><p><strong>Results: </strong>This study introduces qualitatively new 3D dendritic shape descriptors based on spherical harmonics and Zernike moments and proposes a bases on them clustering approach for grouping dendritic spines with similar shapes applied to 3D polygonal spines meshes acquired from Z-stack dendrite images. By integrating these methods, we achieve improved differentiation between normal and pathological spines represented by the Alzheimer's disease in vitro model, offering a more precise representation of morphological diversity. Additionally, the proposed spherical harmonics approach enables dendritic spine reconstruction from vector-based shape representations, providing a novel tool for studying structural changes associated with neurodegeneration and possibilities for synthetic dendritic spines dataset generation.</p><p><strong>Availability and implementation: </strong>The software used for experiments is public and available at https://github.com/Biomed-imaging-lab/SpineTool with the DOI: 10.5281/zenodo.17359066. Descriptors codebase is available at https://github.com/Biomed-imaging-lab/Spine-Shape-Descriptors with the DOI: 10.5281/zenodo.17302859.</p>","PeriodicalId":93899,"journal":{"name":"Bioinformatics (Oxford, England)","volume":" ","pages":""},"PeriodicalIF":5.4,"publicationDate":"2026-02-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146013884","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A case-based explainable graph neural network framework for mechanistic drug repositioning. 一种基于案例的可解释图神经网络框架用于机械药物重新定位。
IF 5.4 Pub Date : 2026-02-03 DOI: 10.1093/bioinformatics/btag008
Adriana Carolina Gonzalez-Cavazos, Roger Tu, Meghamala Sinha, Andrew I Su

Drug repositioning offers a cost-effective alternative to traditional drug development by identifying new uses for existing drugs. Recent advances leverage Graph Neural Networks (GNNs) to model complex biological data, showing promise in predicting novel drug-disease associations; however, these frameworks often lack explainability, a critical factor for validating predictions and understanding drug mechanisms. Here, we introduce Drug-Based Reasoning Explainer (DBR-X), an explainable GNN model that integrates a link-prediction module with a path-identification module to generate interpretable and faithful explanations. When benchmarked against other GNN-based link-prediction frameworks, DBR-X achieves superior performance in identifying known drug-disease associations, demonstrating higher accuracy across all evaluation metrics. The quality of DBR-X biological explanations was evaluated through multiple complementary approaches, including comparison with manually curated drug mechanisms, assessment of explanation faithfulness using deletion and insertion studies, and measurement of stability under graph perturbations. Together, these results show that DBR-X advances the state of the art in drug repositioning while providing multi-hop mechanistic explanations that can facilitate the translation of computational predictions into clinical applications. Availability and implementation: DBR-X package is freely accessible from online repository https://github.com/SuLab/DBR-X.

药物重新定位通过确定现有药物的新用途,为传统药物开发提供了一种具有成本效益的替代方案。最近的进展利用图神经网络(GNN)来模拟复杂的生物数据,在预测新的药物-疾病关联方面显示出希望。然而,这些框架往往缺乏可解释性,这是验证预测和理解药物机制的关键因素。本文介绍了基于药物的推理解释器(Drug-Based Reasoning Explainer, DBR-X),这是一种可解释的GNN模型,它结合了链接预测模块和路径识别模块,以生成可解释和可靠的解释。当与其他GNN链接预测框架进行基准比较时,DBR-X在识别已知药物-疾病关联方面表现优异,在所有评估指标中都显示出更高的准确性。通过多种方法评估DBR-X生物学解释的质量:与人工编制的药物机制进行比较,通过删除和插入研究评估解释的可信度,以及测量图扰动下的稳定性。总之,我们的模型不仅推进了最先进的药物重新定位预测,而且提供了多跳解释,可以加速将计算预测转化为临床应用。
{"title":"A case-based explainable graph neural network framework for mechanistic drug repositioning.","authors":"Adriana Carolina Gonzalez-Cavazos, Roger Tu, Meghamala Sinha, Andrew I Su","doi":"10.1093/bioinformatics/btag008","DOIUrl":"10.1093/bioinformatics/btag008","url":null,"abstract":"<p><p>Drug repositioning offers a cost-effective alternative to traditional drug development by identifying new uses for existing drugs. Recent advances leverage Graph Neural Networks (GNNs) to model complex biological data, showing promise in predicting novel drug-disease associations; however, these frameworks often lack explainability, a critical factor for validating predictions and understanding drug mechanisms. Here, we introduce Drug-Based Reasoning Explainer (DBR-X), an explainable GNN model that integrates a link-prediction module with a path-identification module to generate interpretable and faithful explanations. When benchmarked against other GNN-based link-prediction frameworks, DBR-X achieves superior performance in identifying known drug-disease associations, demonstrating higher accuracy across all evaluation metrics. The quality of DBR-X biological explanations was evaluated through multiple complementary approaches, including comparison with manually curated drug mechanisms, assessment of explanation faithfulness using deletion and insertion studies, and measurement of stability under graph perturbations. Together, these results show that DBR-X advances the state of the art in drug repositioning while providing multi-hop mechanistic explanations that can facilitate the translation of computational predictions into clinical applications. Availability and implementation: DBR-X package is freely accessible from online repository https://github.com/SuLab/DBR-X.</p>","PeriodicalId":93899,"journal":{"name":"Bioinformatics (Oxford, England)","volume":" ","pages":""},"PeriodicalIF":5.4,"publicationDate":"2026-02-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145986038","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Inference of marker genes of subtle cell state changes via iLR: iterative logistic regression. 通过iLR推断细胞状态微妙变化的标记基因:迭代逻辑回归。
IF 5.4 Pub Date : 2026-02-02 DOI: 10.1093/bioinformatics/btag051
Yingtong Liu, Aaron G Baugh, Evanthia T Roussos Torres, Adam L MacLean

Motivation: Differential expression and marker gene selection methods for single-cell RNA sequencing (scRNA-seq) data can struggle to identify small sets of informative genes, especially for subtle differences between cell states, as can be induced by disease or treatment.

Results: We present iterative logistic regression (iLR) for the identification of small sets of informative marker genes. iLR applied logistic regression iteratively with a Pareto front optimization to balance gene set size with classification performance. We benchmark iLR on in silico datasets, demonstrating comparable performance to the state-of-the-art at single-cell classification using only a fraction of the genes. We test iLR on its ability to distinguish neuronal cell subtypes in healthy vs. autism spectrum disorder patients and find that it achieves high accuracy with small sets of disease-relevant genes. We apply iLR to investigate immunotherapeutic effects in cell types from different tumor microenvironments and find that iLR infers informative genes that translate across organs and even species (mouse-to-human) comparison. We predicted via iLR that entinostat acts in part through the modulation of myeloid cell differentiation routes in the lung microenvironment. Overall, iLR provides means to infer interpretable transcriptional signatures from complex datasets with prognostic or therapeutic potential.

Availability and implementation: iLR is freely available at GitHub https://github.com/maclean-lab/iLR and Zenodo https://zenodo.org/records/17728797.

Supplementary information: Supplementary data are available at Bioinformaticss online.

动机:单细胞RNA测序(scRNA-seq)数据的差异表达和标记基因选择方法可能难以识别小组信息基因,特别是对于可能由疾病或治疗引起的细胞状态之间的细微差异。结果:我们提出了迭代逻辑回归(iLR)来鉴定小组信息标记基因。iLR采用逻辑回归迭代和Pareto前优化来平衡基因集大小和分类性能。我们在计算机数据集上对iLR进行基准测试,证明了仅使用一小部分基因在单细胞分类方面与最先进的性能相当。我们测试了iLR在健康和自闭症谱系障碍患者中区分神经元细胞亚型的能力,发现它在小组疾病相关基因上达到了很高的准确性。我们应用iLR来研究来自不同肿瘤微环境的细胞类型的免疫治疗效果,并发现iLR推断出跨器官甚至物种(小鼠到人类)比较翻译的信息基因。我们通过iLR预测,eninostat部分通过调节肺微环境中的骨髓细胞分化途径起作用。总的来说,iLR提供了从具有预后或治疗潜力的复杂数据集推断可解释的转录特征的方法。可用性和实施:iLR可在GitHub https://github.com/maclean-lab/iLR和Zenodo https://zenodo.org/records/17728797.Supplementary免费获得。信息:补充数据可在bioinformatics在线获取。
{"title":"Inference of marker genes of subtle cell state changes via iLR: iterative logistic regression.","authors":"Yingtong Liu, Aaron G Baugh, Evanthia T Roussos Torres, Adam L MacLean","doi":"10.1093/bioinformatics/btag051","DOIUrl":"https://doi.org/10.1093/bioinformatics/btag051","url":null,"abstract":"<p><strong>Motivation: </strong>Differential expression and marker gene selection methods for single-cell RNA sequencing (scRNA-seq) data can struggle to identify small sets of informative genes, especially for subtle differences between cell states, as can be induced by disease or treatment.</p><p><strong>Results: </strong>We present iterative logistic regression (iLR) for the identification of small sets of informative marker genes. iLR applied logistic regression iteratively with a Pareto front optimization to balance gene set size with classification performance. We benchmark iLR on in silico datasets, demonstrating comparable performance to the state-of-the-art at single-cell classification using only a fraction of the genes. We test iLR on its ability to distinguish neuronal cell subtypes in healthy vs. autism spectrum disorder patients and find that it achieves high accuracy with small sets of disease-relevant genes. We apply iLR to investigate immunotherapeutic effects in cell types from different tumor microenvironments and find that iLR infers informative genes that translate across organs and even species (mouse-to-human) comparison. We predicted via iLR that entinostat acts in part through the modulation of myeloid cell differentiation routes in the lung microenvironment. Overall, iLR provides means to infer interpretable transcriptional signatures from complex datasets with prognostic or therapeutic potential.</p><p><strong>Availability and implementation: </strong>iLR is freely available at GitHub https://github.com/maclean-lab/iLR and Zenodo https://zenodo.org/records/17728797.</p><p><strong>Supplementary information: </strong>Supplementary data are available at Bioinformaticss online.</p>","PeriodicalId":93899,"journal":{"name":"Bioinformatics (Oxford, England)","volume":" ","pages":""},"PeriodicalIF":5.4,"publicationDate":"2026-02-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146108740","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
PYRAMA: An open-source tool for advanced meta-analysis of genome wide association studies. 一个开源工具,用于基因组广泛关联研究的高级荟萃分析。
IF 5.4 Pub Date : 2026-02-02 DOI: 10.1093/bioinformatics/btag054
Georgios A Manios, Sophia Nteli, Panagiota I Kontou, Pantelis G Bagos

Motivation: Genome-wide association study (GWAS) meta-analysis tools are essential for integrating summary statistics across multiple cohorts, thereby increasing statistical power and validating genetic associations. Widely cited tools, such as METAL, PLINK, and GWAMA, have facilitated numerous significant discoveries in the field of GWAS. Nevertheless, these tools offer a limited set of meta-analysis methods and typically require users to have prior experience with command-line tools to be executed.

Results: We present here PYRAMA, an open-source tool which is designed for meta-analysis of genome wide association studies. This work introduces an easy-to-use software package that includes several meta-analysis methods that are absent in similar software packages. PYRAMA is faster compared to other tools, supports robust methods for analysis and meta-analysis, fixed-effects, random-effects and Bayesian meta-analysis and it is currently the only tool that supports meta-analysis with imputation of summary statistics. It is available both as a standalone tool and as a freely available web server.

Availability: https://github.com/pbagos/PYRAMA, https://doi.org/10.5281/zenodo.17830449.

Supplementary information: Supplementary data are available at Bioinformatics online.

动机:全基因组关联研究(GWAS)荟萃分析工具对于整合跨多个队列的汇总统计数据至关重要,从而提高统计能力并验证遗传关联。被广泛引用的工具,如METAL、PLINK和GWAMA,促进了GWAS领域的许多重大发现。然而,这些工具提供了一组有限的元分析方法,并且通常要求用户具有先前使用命令行工具的经验。结果:我们在这里提出PYRAMA,一个开源工具,设计用于基因组全关联研究的荟萃分析。这项工作介绍了一个易于使用的软件包,其中包括在类似软件包中不存在的几个元分析方法。与其他工具相比,PYRAMA更快,支持稳健的分析和元分析方法,固定效应,随机效应和贝叶斯元分析,它是目前唯一支持汇总统计输入的元分析的工具。它既可以作为独立工具,也可以作为免费的web服务器。可用性:https://github.com/pbagos/PYRAMA, https://doi.org/10.5281/zenodo.17830449.Supplementary信息:补充数据可在Bioinformatics在线获取。
{"title":"PYRAMA: An open-source tool for advanced meta-analysis of genome wide association studies.","authors":"Georgios A Manios, Sophia Nteli, Panagiota I Kontou, Pantelis G Bagos","doi":"10.1093/bioinformatics/btag054","DOIUrl":"https://doi.org/10.1093/bioinformatics/btag054","url":null,"abstract":"<p><strong>Motivation: </strong>Genome-wide association study (GWAS) meta-analysis tools are essential for integrating summary statistics across multiple cohorts, thereby increasing statistical power and validating genetic associations. Widely cited tools, such as METAL, PLINK, and GWAMA, have facilitated numerous significant discoveries in the field of GWAS. Nevertheless, these tools offer a limited set of meta-analysis methods and typically require users to have prior experience with command-line tools to be executed.</p><p><strong>Results: </strong>We present here PYRAMA, an open-source tool which is designed for meta-analysis of genome wide association studies. This work introduces an easy-to-use software package that includes several meta-analysis methods that are absent in similar software packages. PYRAMA is faster compared to other tools, supports robust methods for analysis and meta-analysis, fixed-effects, random-effects and Bayesian meta-analysis and it is currently the only tool that supports meta-analysis with imputation of summary statistics. It is available both as a standalone tool and as a freely available web server.</p><p><strong>Availability: </strong>https://github.com/pbagos/PYRAMA, https://doi.org/10.5281/zenodo.17830449.</p><p><strong>Supplementary information: </strong>Supplementary data are available at Bioinformatics online.</p>","PeriodicalId":93899,"journal":{"name":"Bioinformatics (Oxford, England)","volume":" ","pages":""},"PeriodicalIF":5.4,"publicationDate":"2026-02-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146108879","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Using semantic search to find publicly available gene-expression datasets. 使用语义搜索来查找公开可用的基因表达数据集。
IF 5.4 Pub Date : 2026-02-02 DOI: 10.1093/bioinformatics/btag053
Grace S Brown, James Wengler, Aaron Joyce S Fabelico, Abigail Muir, Anna Tubbs, Amanda Warren, Alexandra N Millett, Xinrui Xiang Yu, Paul Pavlidis, Sanja Rogic, Stephen R Piccolo

Motivation: Millions of high-throughput, molecular datasets have been shared in public repositories. Researchers can reuse such data to validate their own findings and explore novel questions. A frequent goal is to find multiple datasets that address similar research topics and to either combine them directly or integrate inferences from them. However, a major challenge is finding relevant datasets due to the vast number of candidates, inconsistencies in their descriptions, and a lack of semantic annotations. This challenge is first among the FAIR principles for scientific data. Here we focus on dataset discovery within Gene Expression Omnibus (GEO), a repository containing 100,000 s of data series. GEO supports queries based on keywords, ontology terms, and other annotations. However, reviewing these results is time-consuming and tedious, and it often misses relevant datasets.

Results: We hypothesized that language models could address this problem by summarizing dataset descriptions as numeric representations (embeddings). Assuming a researcher has previously found some relevant datasets, we evaluated the potential to find additional relevant datasets. For six human medical conditions, we used 30 models to generate embeddings for datasets that human curators had previously associated with the conditions and identified other datasets with the most similar descriptions. This approach was often, but not always, more effective than GEO's search engine. The top-performing models were trained on general corpora, used contrastive-learning strategies, and used relatively large embeddings. Our findings suggest that language models have the potential to improve dataset discovery, likely in combination with existing search tools.

Availability: Our analysis code and a Web-based tool that enables others to use our methodology are availabe from https://github.com/srp33/GEO_NLP and https://github.com/srp33/GEOfinder3.0, respectively.

Supplementary information: Supplementary data are available at Bioinformatics online.

动机:数以百万计的高通量分子数据集已经在公共存储库中共享。研究人员可以重复使用这些数据来验证他们自己的发现并探索新的问题。一个常见的目标是找到解决类似研究主题的多个数据集,并直接组合它们或整合它们的推断。然而,一个主要的挑战是找到相关的数据集,因为候选数据集数量庞大,描述不一致,缺乏语义注释。这一挑战是FAIR科学数据原则中的第一项。在这里,我们专注于基因表达Omnibus (GEO)中的数据集发现,这是一个包含100,000 s数据序列的存储库。GEO支持基于关键字、本体术语和其他注释的查询。然而,回顾这些结果既耗时又乏味,而且经常遗漏相关的数据集。结果:我们假设语言模型可以通过将数据集描述总结为数字表示(嵌入)来解决这个问题。假设研究人员之前已经发现了一些相关数据集,我们评估了发现其他相关数据集的潜力。对于六种人类医疗条件,我们使用30个模型为人类管理员先前与这些条件关联的数据集生成嵌入,并识别出具有最相似描述的其他数据集。这种方法通常比GEO的搜索引擎更有效,但并不总是如此。表现最好的模型在一般语料库上进行训练,使用对比学习策略,并使用相对较大的嵌入。我们的研究结果表明,语言模型有潜力改善数据集发现,可能与现有的搜索工具相结合。可用性:我们的分析代码和一个基于web的工具,使其他人能够使用我们的方法,分别可以从https://github.com/srp33/GEO_NLP和https://github.com/srp33/GEOfinder3.0获得。补充信息:补充数据可在生物信息学在线获取。
{"title":"Using semantic search to find publicly available gene-expression datasets.","authors":"Grace S Brown, James Wengler, Aaron Joyce S Fabelico, Abigail Muir, Anna Tubbs, Amanda Warren, Alexandra N Millett, Xinrui Xiang Yu, Paul Pavlidis, Sanja Rogic, Stephen R Piccolo","doi":"10.1093/bioinformatics/btag053","DOIUrl":"10.1093/bioinformatics/btag053","url":null,"abstract":"<p><strong>Motivation: </strong>Millions of high-throughput, molecular datasets have been shared in public repositories. Researchers can reuse such data to validate their own findings and explore novel questions. A frequent goal is to find multiple datasets that address similar research topics and to either combine them directly or integrate inferences from them. However, a major challenge is finding relevant datasets due to the vast number of candidates, inconsistencies in their descriptions, and a lack of semantic annotations. This challenge is first among the FAIR principles for scientific data. Here we focus on dataset discovery within Gene Expression Omnibus (GEO), a repository containing 100,000 s of data series. GEO supports queries based on keywords, ontology terms, and other annotations. However, reviewing these results is time-consuming and tedious, and it often misses relevant datasets.</p><p><strong>Results: </strong>We hypothesized that language models could address this problem by summarizing dataset descriptions as numeric representations (embeddings). Assuming a researcher has previously found some relevant datasets, we evaluated the potential to find additional relevant datasets. For six human medical conditions, we used 30 models to generate embeddings for datasets that human curators had previously associated with the conditions and identified other datasets with the most similar descriptions. This approach was often, but not always, more effective than GEO's search engine. The top-performing models were trained on general corpora, used contrastive-learning strategies, and used relatively large embeddings. Our findings suggest that language models have the potential to improve dataset discovery, likely in combination with existing search tools.</p><p><strong>Availability: </strong>Our analysis code and a Web-based tool that enables others to use our methodology are availabe from https://github.com/srp33/GEO_NLP and https://github.com/srp33/GEOfinder3.0, respectively.</p><p><strong>Supplementary information: </strong>Supplementary data are available at Bioinformatics online.</p>","PeriodicalId":93899,"journal":{"name":"Bioinformatics (Oxford, England)","volume":" ","pages":""},"PeriodicalIF":5.4,"publicationDate":"2026-02-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146108913","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A spectral dimension reduction technique that improves pattern detection in multivariate spatial data. 一种改进多变量空间数据模式检测的光谱降维技术。
IF 5.4 Pub Date : 2026-01-31 DOI: 10.1093/bioinformatics/btag052
David Köhler, Niklas Kleinenkuhnen, Kiarash Rastegar, Till Baar, Chrysa Nikopoulou, Vangelis Kondylis, Vlada Milchevskaya, Matthias Schmid, Peter Tessarz, Achim Tresch

Motivation: We introduce a statistical approach for pattern recognition in multivariate spatial transcriptomics data.

Results: Our algorithm constructs a projection of the data onto a low-dimensional feature space which is optimal in maximising Moran's I, a measure of spatial dependency. This projection mitigates non-spatial variation and outperforms principal components analysis for pre-processing. Patterns of spatially variable genes are well represented in this feature space, and their projection can be shown to be a denoising operation. Our framework does not require any parameter tuning, and it furthermore gives rise to a calibrated, powerful test of spatial gene expression.

Availability and implementation: The algorithm is implemented in the open source software R and is available at https://github.com/IMSBCompBio/SpaCo.

动机:我们介绍了一种用于多变量空间转录组学数据模式识别的统计方法。结果:我们的算法构建了一个低维特征空间的数据投影,这在最大化Moran's I(一种空间依赖性度量)方面是最优的。这种投影减轻了非空间变化,并且优于预处理的主成分分析。空间可变基因的模式在这个特征空间中得到很好的表示,它们的投影可以被证明是一个去噪操作。我们的框架不需要任何参数调整,而且它进一步产生了一个校准的,强大的空间基因表达测试。可用性和实现:该算法是在开源软件R中实现的,可以在https://github.com/IMSBCompBio/SpaCo上获得。
{"title":"A spectral dimension reduction technique that improves pattern detection in multivariate spatial data.","authors":"David Köhler, Niklas Kleinenkuhnen, Kiarash Rastegar, Till Baar, Chrysa Nikopoulou, Vangelis Kondylis, Vlada Milchevskaya, Matthias Schmid, Peter Tessarz, Achim Tresch","doi":"10.1093/bioinformatics/btag052","DOIUrl":"https://doi.org/10.1093/bioinformatics/btag052","url":null,"abstract":"<p><strong>Motivation: </strong>We introduce a statistical approach for pattern recognition in multivariate spatial transcriptomics data.</p><p><strong>Results: </strong>Our algorithm constructs a projection of the data onto a low-dimensional feature space which is optimal in maximising Moran's I, a measure of spatial dependency. This projection mitigates non-spatial variation and outperforms principal components analysis for pre-processing. Patterns of spatially variable genes are well represented in this feature space, and their projection can be shown to be a denoising operation. Our framework does not require any parameter tuning, and it furthermore gives rise to a calibrated, powerful test of spatial gene expression.</p><p><strong>Availability and implementation: </strong>The algorithm is implemented in the open source software R and is available at https://github.com/IMSBCompBio/SpaCo.</p>","PeriodicalId":93899,"journal":{"name":"Bioinformatics (Oxford, England)","volume":" ","pages":""},"PeriodicalIF":5.4,"publicationDate":"2026-01-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146097705","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
ProteoGyver: A Fast, User-Friendly Tool for Routine QC and Analysis of MS-based Proteomics Data. ProteoGyver:一个快速,用户友好的工具,用于常规QC和分析基于质谱的蛋白质组学数据。
IF 5.4 Pub Date : 2026-01-30 DOI: 10.1093/bioinformatics/btag050
Kari Salokas, Salla Keskitalo, Markku Varjosalo

Mass spectrometry-based proteomics generates increasingly large datasets requiring rapid quality control (QC) and preliminary analysis. Current software solutions often require specialized knowledge, limiting their routine use. We developed ProteoGyver (PG), an accessible, lightweight software solution designed for rapid QC and preliminary proteomics data analysis. PG provides automated QC metrics, intuitive graphical reports, and streamlined workflows for whole-proteome and interactomics datasets, significantly lowering the barrier to regular QC practices. The platform includes additional tools such as MS Inspector for longitudinal chromatogram inspection and Colocalizer for microscopy data. PG is easily deployed as a Docker container or standalone Python installation. PG is open-source and freely available in dockerhub and source code in github at github.com/varjolab/Proteogyver. Availability PG image and source code are available in github and dockerhub under LGPL-2.1.

基于质谱的蛋白质组学产生越来越大的数据集,需要快速的质量控制(QC)和初步分析。当前的软件解决方案通常需要专业知识,限制了它们的日常使用。我们开发了ProteoGyver (PG),这是一种易于使用的轻量级软件解决方案,专为快速QC和初步蛋白质组学数据分析而设计。PG提供自动化的QC指标,直观的图形报告,以及全蛋白质组和相互作用组数据集的简化工作流程,大大降低了常规QC实践的障碍。该平台包括其他工具,如用于纵向色谱检查的MS Inspector和用于显微镜数据的Colocalizer。PG很容易部署为Docker容器或独立的Python安装。PG是开源的,可以在dockerhub和github中免费获得,源代码在github.com/varjolab/Proteogyver。可用性PG映像和源代码可在LGPL-2.1下的github和dockerhub中获得。
{"title":"ProteoGyver: A Fast, User-Friendly Tool for Routine QC and Analysis of MS-based Proteomics Data.","authors":"Kari Salokas, Salla Keskitalo, Markku Varjosalo","doi":"10.1093/bioinformatics/btag050","DOIUrl":"https://doi.org/10.1093/bioinformatics/btag050","url":null,"abstract":"<p><p>Mass spectrometry-based proteomics generates increasingly large datasets requiring rapid quality control (QC) and preliminary analysis. Current software solutions often require specialized knowledge, limiting their routine use. We developed ProteoGyver (PG), an accessible, lightweight software solution designed for rapid QC and preliminary proteomics data analysis. PG provides automated QC metrics, intuitive graphical reports, and streamlined workflows for whole-proteome and interactomics datasets, significantly lowering the barrier to regular QC practices. The platform includes additional tools such as MS Inspector for longitudinal chromatogram inspection and Colocalizer for microscopy data. PG is easily deployed as a Docker container or standalone Python installation. PG is open-source and freely available in dockerhub and source code in github at github.com/varjolab/Proteogyver. Availability PG image and source code are available in github and dockerhub under LGPL-2.1.</p>","PeriodicalId":93899,"journal":{"name":"Bioinformatics (Oxford, England)","volume":" ","pages":""},"PeriodicalIF":5.4,"publicationDate":"2026-01-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146088319","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Bit-Reproducible Parallel Phylogenetic Tree Inference. 位可复制并行系统发育树推断。
IF 5.4 Pub Date : 2026-01-30 DOI: 10.1093/bioinformatics/btag044
Christoph Stelz, Lukas Hübner, Alexandros Stamatakis
<p><strong>Motivation: </strong>Phylogenetic trees describe the evolutionary history among biological species based on their genomic data. Maximum Likelihood (ML) based phylogenetic inference tools search for the tree and evolutionary model that best explain the observed genomic data. Given the independence of likelihood score calculations between different genomic sites, parallel computation is commonly deployed. This is followed by a parallel summation over the per-site scores to obtain the overall likelihood score of the tree. However, basic arithmetic operations on IEEE 754 floating-point numbers, such as addition and multiplication, inherently introduce rounding errors. Consequently, the order by which floating-point operations are executed affects the exact resulting likelihood value since these operations are not associative. Moreover, parallel reduction algorithms in numerical codes re-associate operations as a function of the core count and cluster network topology, inducing different round-off errors. These low-level deviations can cause heuristic searches to diverge and induce high-level result discrepancies (e.g., yield topologically distinct phylogenies). This effect has also been observed in multiple scientific fields beyond phylogenetics.</p><p><strong>Results: </strong>We observe that varying the degree of parallelism results in diverging phylogenetic tree searches (high level results) for over 31% out of 10 179 empirical datasets. More importantly, 8% of these diverging datasets yield trees that are statistically significantly worse than the best known ML tree for the dataset (AU-test, p<0.05). To alleviate this, we develop a variant of the widely used phylogenetic inference tool RAxML-NG, which does yield bit-reproducible results under varying core-counts, with a slowdown of only 0 to 12.7% (median 0.8%) on up to 768 cores. For this, we introduce the ReproRed reduction algorithm, which yields bit-identical results under varying core-counts, by maintaining a fixed operation order that is independent of the communication pattern. ReproRed is thus applicable to all associative reduction operations-in contrast to competitors, which are confined to summation. Our ReproRed reduction algorithm only exchanges the theoretical minimum number of messages, overlaps communication with computation, and utilizes fast base-cases for local reductions. ReproRed is able to all-reduce (via a subsequent broadcast) 4.1×106 operands across 48 to 768 cores in 19.7 to 48.61 μs, thereby exhibiting a slowdown of 13 to 93% over a non-reproducible all-reduce algorithm. ReproRed outperforms the state-of-the-art reproducible all-reduction algorithm ReproBLAS (offers summation only) beyond 10 000 elements per core. In summary, we re-assess non-reproducibility in parallel phylogenetic inference, present the first bit-reproducible parallel phylogenetic inference tool, as well as introduce a general algorithm and open-source code for conducting reproducible assoc
动机:系统发生树描述生物物种之间的进化历史是基于它们的基因组数据。基于最大似然(ML)的系统发育推断工具搜索最能解释所观察到的基因组数据的树和进化模型。考虑到不同基因组位点之间的似然评分计算的独立性,并行计算通常被部署。然后对每个站点的得分进行并行求和,以获得树的总体似然得分。然而,IEEE 754浮点数的基本算术运算,如加法和乘法,固有地引入舍入误差。因此,执行浮点运算的顺序会影响最终的似然值,因为这些运算不是关联的。此外,数字编码中的并行约简算法根据核心数和集群网络拓扑结构重新关联操作,从而产生不同的舍入误差。这些低水平的偏差可能导致启发式搜索出现分歧,并导致高水平的结果差异(例如,在拓扑结构上产生不同的系统发育)。在系统发育学以外的多个科学领域也观察到这种效应。结果:我们观察到,在10179个经验数据集中,不同的并行度导致超过31%的系统发育树搜索出现分歧(高水平结果)。更重要的是,这些分散的数据集中有8%产生的树在统计上明显差于最著名的数据集ML树(AU-test, pAvailability和implementation: erered: https://doi.org/10.5281/zenodo.15004918 (LGPL)-Reproducible RAxML-NG version https://doi.org/10.5281/zenodo.15017407 (GPL))。补充信息:https://doi.org/10.5281/zenodo.15524754.Funding:本项目由欧盟克劳斯·奇拉基金会资助,欧洲研究理事会(ERC) Horizon 2020研究与创新资助号882500,欧盟ERA主席(Horizon - widera -2022- talent -01: 2023-2028)计划资助号101087081 (comp - biodivi - gr)。作者感谢高斯超级计算中心e. V. (www.gauss-centre.eu)通过在莱布尼茨超级计算中心(www.lrz.de)的GCS超级计算机supermu - ng上提供计算时间来资助本项目。
{"title":"Bit-Reproducible Parallel Phylogenetic Tree Inference.","authors":"Christoph Stelz, Lukas Hübner, Alexandros Stamatakis","doi":"10.1093/bioinformatics/btag044","DOIUrl":"https://doi.org/10.1093/bioinformatics/btag044","url":null,"abstract":"&lt;p&gt;&lt;strong&gt;Motivation: &lt;/strong&gt;Phylogenetic trees describe the evolutionary history among biological species based on their genomic data. Maximum Likelihood (ML) based phylogenetic inference tools search for the tree and evolutionary model that best explain the observed genomic data. Given the independence of likelihood score calculations between different genomic sites, parallel computation is commonly deployed. This is followed by a parallel summation over the per-site scores to obtain the overall likelihood score of the tree. However, basic arithmetic operations on IEEE 754 floating-point numbers, such as addition and multiplication, inherently introduce rounding errors. Consequently, the order by which floating-point operations are executed affects the exact resulting likelihood value since these operations are not associative. Moreover, parallel reduction algorithms in numerical codes re-associate operations as a function of the core count and cluster network topology, inducing different round-off errors. These low-level deviations can cause heuristic searches to diverge and induce high-level result discrepancies (e.g., yield topologically distinct phylogenies). This effect has also been observed in multiple scientific fields beyond phylogenetics.&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Results: &lt;/strong&gt;We observe that varying the degree of parallelism results in diverging phylogenetic tree searches (high level results) for over 31% out of 10 179 empirical datasets. More importantly, 8% of these diverging datasets yield trees that are statistically significantly worse than the best known ML tree for the dataset (AU-test, p&lt;0.05). To alleviate this, we develop a variant of the widely used phylogenetic inference tool RAxML-NG, which does yield bit-reproducible results under varying core-counts, with a slowdown of only 0 to 12.7% (median 0.8%) on up to 768 cores. For this, we introduce the ReproRed reduction algorithm, which yields bit-identical results under varying core-counts, by maintaining a fixed operation order that is independent of the communication pattern. ReproRed is thus applicable to all associative reduction operations-in contrast to competitors, which are confined to summation. Our ReproRed reduction algorithm only exchanges the theoretical minimum number of messages, overlaps communication with computation, and utilizes fast base-cases for local reductions. ReproRed is able to all-reduce (via a subsequent broadcast) 4.1×106 operands across 48 to 768 cores in 19.7 to 48.61 μs, thereby exhibiting a slowdown of 13 to 93% over a non-reproducible all-reduce algorithm. ReproRed outperforms the state-of-the-art reproducible all-reduction algorithm ReproBLAS (offers summation only) beyond 10 000 elements per core. In summary, we re-assess non-reproducibility in parallel phylogenetic inference, present the first bit-reproducible parallel phylogenetic inference tool, as well as introduce a general algorithm and open-source code for conducting reproducible assoc","PeriodicalId":93899,"journal":{"name":"Bioinformatics (Oxford, England)","volume":" ","pages":""},"PeriodicalIF":5.4,"publicationDate":"2026-01-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146088299","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
DNAvi: Integration, statistics, and visualization of cell-free DNA fragment traces. DNAvi:整合,统计,和可视化的细胞游离DNA片段痕迹。
IF 5.4 Pub Date : 2026-01-29 DOI: 10.1093/bioinformatics/btag041
Anja Hess, Dominik Seelow, Helene Kretzmer

Summary: DNAvi is a Python-based tool for rapid grouped analysis and visualization of cell-free DNA fragment size profiles directly from electrophoresis data, overcoming the need for sequencing in basic fragmentomic screenings. It enables normalization, statistical comparison, and publication-ready plotting of multiple samples, supporting quality control and exploratory fragmentomics in clinical and research workflows.

Availability and implementation: DNAvi is implemented in Python and freely available on GitHub at https://github.com/anjahess/DNAvi under a GNU General Public License v3.0, along with source code, documentation, and examples. An archived version is available under https://doi.org/10.5281/zenodo.18097730.

Supplementary information: Supplementary data are available at Bioinformatics online.

摘要:DNAvi是一个基于python的工具,可直接从电泳数据中快速分组分析和可视化无细胞DNA片段大小谱,克服了在基本片段组学筛选中对测序的需求。它可以实现多个样本的规范化、统计比较和出版准备绘图,支持临床和研究工作流程中的质量控制和探索性片段组学。可用性和实现:DNAvi是用Python实现的,在GNU通用公共许可证v3.0下的GitHub (https://github.com/anjahess/DNAvi)上免费提供,以及源代码、文档和示例。存档版本可在https://doi.org/10.5281/zenodo.18097730.Supplementary信息中获得;补充数据可在Bioinformatics在线获得。
{"title":"DNAvi: Integration, statistics, and visualization of cell-free DNA fragment traces.","authors":"Anja Hess, Dominik Seelow, Helene Kretzmer","doi":"10.1093/bioinformatics/btag041","DOIUrl":"https://doi.org/10.1093/bioinformatics/btag041","url":null,"abstract":"<p><strong>Summary: </strong>DNAvi is a Python-based tool for rapid grouped analysis and visualization of cell-free DNA fragment size profiles directly from electrophoresis data, overcoming the need for sequencing in basic fragmentomic screenings. It enables normalization, statistical comparison, and publication-ready plotting of multiple samples, supporting quality control and exploratory fragmentomics in clinical and research workflows.</p><p><strong>Availability and implementation: </strong>DNAvi is implemented in Python and freely available on GitHub at https://github.com/anjahess/DNAvi under a GNU General Public License v3.0, along with source code, documentation, and examples. An archived version is available under https://doi.org/10.5281/zenodo.18097730.</p><p><strong>Supplementary information: </strong>Supplementary data are available at Bioinformatics online.</p>","PeriodicalId":93899,"journal":{"name":"Bioinformatics (Oxford, England)","volume":" ","pages":""},"PeriodicalIF":5.4,"publicationDate":"2026-01-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146088313","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
STransfer: A Transfer Learning-Enhanced Graph Convolutional Network for Clustering Spatial Transcriptomics Data. 迁移:用于聚类空间转录组学数据的迁移学习增强图卷积网络。
IF 5.4 Pub Date : 2026-01-27 DOI: 10.1093/bioinformatics/btag049
Chaojie Wang, Xin Yu

Motivation: Capturing spatial structure is fundamental to the analysis of spatial transcriptomics data. However, most existing methods focus on clustering within individual tissue slices and often ignore the high inter-slice similarity inherent in multi-slice datasets.

Results: To address this limitation, we propose STransfer, a novel transfer learning framework that combines graph convolutional networks (GCNs) with positive pointwise mutual information (PPMI) to model both local and global spatial dependencies. An attention-based module is introduced to fuse features from multiple graphs into unified node representations, facilitating the learning of low-dimensional embeddings that jointly encode gene expression and spatial context. By transferring knowledge from labeled slices to adjacent unlabeled ones, STransfer significantly enhances clustering accuracy while reducing manual annotation costs. Extensive experiments demonstrate that STransfer consistently outperforms state-of-the-art methods in both spatial modeling and cross-slice transfer performance.

Availability and implementation: The code for STransfer has been uploaded to GitHub: https://github.com/Saki-JSU/Publications/tree/main/STransfer.

Supplementary information: No supplementary information is available for this manuscript.

动机:获取空间结构是空间转录组学数据分析的基础。然而,大多数现有的方法都集中在单个组织切片内的聚类,往往忽略了多切片数据集固有的高切片间相似性。为了解决这一限制,我们提出了一种新的迁移学习框架,将图卷积网络(GCNs)与正点互信息(PPMI)结合起来,对局部和全局空间依赖关系进行建模。引入了一个基于注意力的模块,将多个图的特征融合到统一的节点表示中,促进了共同编码基因表达和空间上下文的低维嵌入的学习。通过将知识从标记的切片转移到相邻的未标记的切片,strtransfer显著提高了聚类精度,同时降低了人工标注成本。大量的实验表明,在空间建模和横切片传输性能方面,transfer始终优于最先进的方法。可用性和实现:strtransfer的代码已上传到GitHub: https://github.com/Saki-JSU/Publications/tree/main/STransfer.Supplementary information:本文没有补充信息。
{"title":"STransfer: A Transfer Learning-Enhanced Graph Convolutional Network for Clustering Spatial Transcriptomics Data.","authors":"Chaojie Wang, Xin Yu","doi":"10.1093/bioinformatics/btag049","DOIUrl":"https://doi.org/10.1093/bioinformatics/btag049","url":null,"abstract":"<p><strong>Motivation: </strong>Capturing spatial structure is fundamental to the analysis of spatial transcriptomics data. However, most existing methods focus on clustering within individual tissue slices and often ignore the high inter-slice similarity inherent in multi-slice datasets.</p><p><strong>Results: </strong>To address this limitation, we propose STransfer, a novel transfer learning framework that combines graph convolutional networks (GCNs) with positive pointwise mutual information (PPMI) to model both local and global spatial dependencies. An attention-based module is introduced to fuse features from multiple graphs into unified node representations, facilitating the learning of low-dimensional embeddings that jointly encode gene expression and spatial context. By transferring knowledge from labeled slices to adjacent unlabeled ones, STransfer significantly enhances clustering accuracy while reducing manual annotation costs. Extensive experiments demonstrate that STransfer consistently outperforms state-of-the-art methods in both spatial modeling and cross-slice transfer performance.</p><p><strong>Availability and implementation: </strong>The code for STransfer has been uploaded to GitHub: https://github.com/Saki-JSU/Publications/tree/main/STransfer.</p><p><strong>Supplementary information: </strong>No supplementary information is available for this manuscript.</p>","PeriodicalId":93899,"journal":{"name":"Bioinformatics (Oxford, England)","volume":" ","pages":""},"PeriodicalIF":5.4,"publicationDate":"2026-01-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146069500","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Bioinformatics (Oxford, England)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1