首页 > 最新文献

Bioinformatics advances最新文献

英文 中文
Desiderata for a biomedical knowledge network: opportunities, challenges and future directions. 对生物医学知识网络的渴望:机遇、挑战和未来方向。
IF 2.8 Q2 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2026-03-20 eCollection Date: 2026-01-01 DOI: 10.1093/bioadv/vbag036
Chunlei Wu, Hongfang Liu, Jason Flannick, Mark A Musen, Andrew I Su, Lawrence E Hunter, Thomas M Powers, Cathy H Wu

Motivation: Knowledge graphs (KGs), collectively as a knowledge network, have become critical tools for knowledge discovery in computable and explainable knowledge systems. Due to the semantic and structural complexities of biomedical data, these KGs need to enable dynamic reasoning over large evolving graphs and support fit-for-purpose abstraction. Crucially, this requires establishing standards, preserving provenance and enforcing policy constraints for actionable discovery.

Results: A recent meeting of leading scientists discussed the opportunities, challenges, and future directions of a biomedical knowledge network. Here we present six desiderata inspired by the meeting: (i) inference and reasoning in biomedical KGs need domain-centric approaches, (ii) harmonized and accessible standards are required for knowledge graph representation and metadata, (iii) robust validation of biomedical KGs needs multilayered, context-aware approaches that are both rigorous and scalable, (iv) the evolving and synergistic relationship between KGs and large language models is essential in empowering AI-driven biomedical discovery, (v) integrated development environments, public repositories, and governance frameworks are essential for secure and reproducible knowledge graph sharing, and (vi) robust validation, provenance, and ethical governance are critical for trustworthy biomedical KGs. Addressing these key issues will be essential to realize the promises of a biomedical knowledge network in advancing biomedicine.

动机:知识图作为一个知识网络,已经成为在可计算和可解释的知识系统中发现知识的关键工具。由于生物医学数据的语义和结构复杂性,这些kg需要在大型演化图上进行动态推理,并支持适合目的的抽象。至关重要的是,这需要建立标准,保存来源,并为可操作的发现执行政策约束。结果:最近的一次顶级科学家会议讨论了生物医学知识网络的机遇、挑战和未来方向。在此,我们提出受会议启发的六个愿望:(i)生物医学kg中的推理和推理需要以领域为中心的方法,(ii)知识图表示和元数据需要统一和可访问的标准,(iii)生物医学kg的稳健验证需要多层、上下文感知的方法,这些方法既严格又可扩展,(iv) kg和大型语言模型之间的不断发展和协同关系对于增强人工智能驱动的生物医学发现至关重要,(v)集成开发环境,公共知识库和治理框架对于安全和可复制的知识图谱共享至关重要,而(vi)稳健的验证、来源和伦理治理对于值得信赖的生物医学知识库至关重要。解决这些关键问题对于实现生物医学知识网络在推进生物医学方面的承诺至关重要。
{"title":"Desiderata for a biomedical knowledge network: opportunities, challenges and future directions.","authors":"Chunlei Wu, Hongfang Liu, Jason Flannick, Mark A Musen, Andrew I Su, Lawrence E Hunter, Thomas M Powers, Cathy H Wu","doi":"10.1093/bioadv/vbag036","DOIUrl":"https://doi.org/10.1093/bioadv/vbag036","url":null,"abstract":"<p><strong>Motivation: </strong>Knowledge graphs (KGs), collectively as a knowledge network, have become critical tools for knowledge discovery in computable and explainable knowledge systems. Due to the semantic and structural complexities of biomedical data, these KGs need to enable dynamic reasoning over large evolving graphs and support fit-for-purpose abstraction. Crucially, this requires establishing standards, preserving provenance and enforcing policy constraints for actionable discovery.</p><p><strong>Results: </strong>A recent meeting of leading scientists discussed the opportunities, challenges, and future directions of a biomedical knowledge network. Here we present six desiderata inspired by the meeting: (i) inference and reasoning in biomedical KGs need domain-centric approaches, (ii) harmonized and accessible standards are required for knowledge graph representation and metadata, (iii) robust validation of biomedical KGs needs multilayered, context-aware approaches that are both rigorous and scalable, (iv) the evolving and synergistic relationship between KGs and large language models is essential in empowering AI-driven biomedical discovery, (v) integrated development environments, public repositories, and governance frameworks are essential for secure and reproducible knowledge graph sharing, and (vi) robust validation, provenance, and ethical governance are critical for trustworthy biomedical KGs. Addressing these key issues will be essential to realize the promises of a biomedical knowledge network in advancing biomedicine.</p>","PeriodicalId":72368,"journal":{"name":"Bioinformatics advances","volume":"6 1","pages":"vbag036"},"PeriodicalIF":2.8,"publicationDate":"2026-03-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC13004217/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147500788","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Panalyze: automated virus pangenome variation graph construction, analysis and annotation. Panalyze:自动构建病毒泛基因组变异图,分析和注释。
IF 2.8 Q2 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2026-03-10 eCollection Date: 2026-01-01 DOI: 10.1093/bioadv/vbag071
Chandana Tennakoon, Thibaut Freville, Tim Downing

Motivation: Constructing and studying pangenome variation graphs (PVGs) supports new insights into viral genomic diversity. This is because such pangenomes are less prone to reference bias, which affects mutation detection. Interpreting the information arising from this is challenging, so automating these processes to allow exploratory investigations for PVG optimisation is essential. Moreover, existing methods do not scale well to the smaller virus genome sizes and to facilitate analysis in laptop environments. To address this, we developed an easily deployable pipeline to facilitate the rapid creation of virus PVGs that applies a broad range of analyses to these PVGs.

Results: We present Panalyze, a computationally scalable virus PVG construction, analysis and annotation tool implemented in NextFlow and containerised in Docker. Panalyze uses NextFlow to efficiently complete tasks across multiple compute nodes and in diverse computing environments. Panalyze can also operate on a single thread on a standard laptop, and analyse sequence lengths of any size. We illustrate how Panalyze works and the valuable outputs it can generate using a range of common viral pathogens.

Availability and implementation: Panalyze is released under a MIT open-source license, available on GitHub with documentation accessible at https://github.com/downingtim/Panalyze/.

动机:构建和研究泛基因组变异图(PVGs)为研究病毒基因组多样性提供了新的思路。这是因为这种泛基因组不太容易产生参考偏差,而参考偏差会影响突变检测。解释由此产生的信息是具有挑战性的,因此自动化这些过程以进行PVG优化的探索性调查是必不可少的。此外,现有的方法不能很好地扩展到较小的病毒基因组大小,也不能便于在笔记本电脑环境中进行分析。为了解决这个问题,我们开发了一个易于部署的管道,以促进病毒PVGs的快速创建,并对这些PVGs进行广泛的分析。结果:我们提出了Panalyze,一个计算可扩展的病毒PVG构建、分析和注释工具,在NextFlow中实现,并在Docker中容器化。Panalyze使用NextFlow在多个计算节点和不同计算环境中高效地完成任务。Panalyze还可以在标准笔记本电脑上的单个线程上运行,并分析任何大小的序列长度。我们说明了Panalyze是如何工作的,以及它可以使用一系列常见的病毒病原体产生的有价值的输出。可用性和实现:Panalyze是在MIT开源许可下发布的,可在GitHub上获得,文档可访问https://github.com/downingtim/Panalyze/。
{"title":"Panalyze: automated virus pangenome variation graph construction, analysis and annotation.","authors":"Chandana Tennakoon, Thibaut Freville, Tim Downing","doi":"10.1093/bioadv/vbag071","DOIUrl":"https://doi.org/10.1093/bioadv/vbag071","url":null,"abstract":"<p><strong>Motivation: </strong>Constructing and studying pangenome variation graphs (PVGs) supports new insights into viral genomic diversity. This is because such pangenomes are less prone to reference bias, which affects mutation detection. Interpreting the information arising from this is challenging, so automating these processes to allow exploratory investigations for PVG optimisation is essential. Moreover, existing methods do not scale well to the smaller virus genome sizes and to facilitate analysis in laptop environments. To address this, we developed an easily deployable pipeline to facilitate the rapid creation of virus PVGs that applies a broad range of analyses to these PVGs.</p><p><strong>Results: </strong>We present Panalyze, a computationally scalable virus PVG construction, analysis and annotation tool implemented in NextFlow and containerised in Docker. Panalyze uses NextFlow to efficiently complete tasks across multiple compute nodes and in diverse computing environments. Panalyze can also operate on a single thread on a standard laptop, and analyse sequence lengths of any size. We illustrate how Panalyze works and the valuable outputs it can generate using a range of common viral pathogens.</p><p><strong>Availability and implementation: </strong>Panalyze is released under a MIT open-source license, available on GitHub with documentation accessible at https://github.com/downingtim/Panalyze/.</p>","PeriodicalId":72368,"journal":{"name":"Bioinformatics advances","volume":"6 1","pages":"vbag071"},"PeriodicalIF":2.8,"publicationDate":"2026-03-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC13005692/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147500843","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Classification of driver and passenger mutations in different cancer types using deep neural networks. 利用深度神经网络对不同癌症类型的驱动和乘客突变进行分类。
IF 2.8 Q2 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2026-02-26 eCollection Date: 2026-01-01 DOI: 10.1093/bioadv/vbag068
Medha Pandey, Anoosha Paruchuri, M Michael Gromiha

Motivation: Cancer is driven by genetic changes, known as mutations, that lead to the uncontrolled division of cells. The functional significance of a vast number of these cancer somatic mutations is unknown, and it is one of the major challenges in cancer research. In this study, we performed an integrative analysis of 30 tumor types from PAN-cancer mutation data collected from the COSMIC database. We have analyzed a set of 61 364 missense mutations (57 535 drivers and 3829 passengers) from 682 cancer-causing genes and derived various important features from amino acid sequences, predicted AlphaFold structures, and amino acid contact networks. We observed that the motif-based preference, neighboring residue information, residue depth, and disorder regions around the site of mutation are important for the discrimination of drivers and passengers.

Results: We further developed cancer-specific computational models to discriminate cancer-causing and passenger mutations using deep learning, and the integration of AlphaFold predicted structure information improved the pathogenicity prediction of mutations. Our method achieved an average classification accuracy of 84.06% with 10-fold cross-validation.

Availability and implementation: The prediction server is available at https://web.iitm.ac.in/bioinfo2/PANDriver/index.html. We envisage that the AI-based prediction models would be an important tool to identify driver mutations and could extend the scope of precision medicine for cancer.

动机:癌症是由基因变化(即突变)引起的,这种变化会导致细胞分裂失控。大量这些癌症体细胞突变的功能意义尚不清楚,这是癌症研究的主要挑战之一。在这项研究中,我们对从COSMIC数据库收集的pan -癌症突变数据中的30种肿瘤类型进行了综合分析。我们分析了来自682个致癌基因的61 364个错义突变(57 535个驱动基因和3829个乘客基因),并从氨基酸序列中获得了各种重要特征,预测了AlphaFold结构和氨基酸接触网络。我们发现,基于基序的偏好、邻近残基信息、残基深度和突变位点周围的无序区域对司机和乘客的区分很重要。结果:我们利用深度学习进一步开发了癌症特异性计算模型来区分致癌突变和乘客突变,整合AlphaFold预测的结构信息提高了突变的致病性预测。通过10倍交叉验证,该方法的平均分类准确率为84.06%。可用性和实现:预测服务器可在https://web.iitm.ac.in/bioinfo2/PANDriver/index.html上获得。我们设想,基于人工智能的预测模型将成为识别驱动突变的重要工具,并可以扩展癌症精准医疗的范围。
{"title":"Classification of driver and passenger mutations in different cancer types using deep neural networks.","authors":"Medha Pandey, Anoosha Paruchuri, M Michael Gromiha","doi":"10.1093/bioadv/vbag068","DOIUrl":"https://doi.org/10.1093/bioadv/vbag068","url":null,"abstract":"<p><strong>Motivation: </strong>Cancer is driven by genetic changes, known as mutations, that lead to the uncontrolled division of cells. The functional significance of a vast number of these cancer somatic mutations is unknown, and it is one of the major challenges in cancer research. In this study, we performed an integrative analysis of 30 tumor types from PAN-cancer mutation data collected from the COSMIC database. We have analyzed a set of 61 364 missense mutations (57 535 drivers and 3829 passengers) from 682 cancer-causing genes and derived various important features from amino acid sequences, predicted AlphaFold structures, and amino acid contact networks. We observed that the motif-based preference, neighboring residue information, residue depth, and disorder regions around the site of mutation are important for the discrimination of drivers and passengers.</p><p><strong>Results: </strong>We further developed cancer-specific computational models to discriminate cancer-causing and passenger mutations using deep learning, and the integration of AlphaFold predicted structure information improved the pathogenicity prediction of mutations. Our method achieved an average classification accuracy of 84.06% with 10-fold cross-validation.</p><p><strong>Availability and implementation: </strong>The prediction server is available at https://web.iitm.ac.in/bioinfo2/PANDriver/index.html. We envisage that the AI-based prediction models would be an important tool to identify driver mutations and could extend the scope of precision medicine for cancer.</p>","PeriodicalId":72368,"journal":{"name":"Bioinformatics advances","volume":"6 1","pages":"vbag068"},"PeriodicalIF":2.8,"publicationDate":"2026-02-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12989160/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147470123","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
AxioSAFE: an accessible, semi-automatic filtering tool for the curation of genotyping datasets. AxioSAFE:一个可访问的半自动过滤工具,用于管理基因分型数据集。
IF 2.8 Q2 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2026-02-19 eCollection Date: 2026-01-01 DOI: 10.1093/bioadv/vbag062
Lorenzo Spina, Nicholas P Howard, Stijn Vanderzande, Giorgio Tumino, Michela Troggio, Eric van de Weg, Diego Micheletti, Luca Bianco

Motivation: Genotyping datasets generated via the Thermo Fisher Axiom® array are generally big, as they comprise tens of thousands of markers and hundreds of individuals, and currently, no automatic data curation pipelines are available for this kind of data. This leaves researchers with only time-consuming manual analysis as the current standard for processing these complex genotyping datasets. There is a clear need for a more efficient, streamlined approach to handle the specific quality control challenges inherent in this platform.

Results: AxioSAFE (Axiom SNP Assessment and Filtering Engine) is a semi-automatic computer tool for the curation of single nucleotide polymorphism (SNP) genotyping datasets generated via Thermo Fisher Axiom® array experiments. AxioSAFE provides an alternative methodology to cover a set of data curation operations, including steps such as a ploidy check, SNP filtering, Mendelian error analysis, and phasing. AxioSAFE identifies major occurrences of problematic SNPs and samples, including those not caught by the Axiom array default QC filters. Further functionality is included to let the user review identified problematic SNP classes.

Availability and implementation: AxioSAFE is a Python program that can be either used via the command line interface or through a graphical user interface (GUI) and is provided as a Docker container available on DockerHub at https://hub.docker.com/r/lzspin/axiosafe, which includes all required libraries, software, and a tutorial dataset. The source code and documentation are available at https://bitbucket.org/lzspin/axiosafe/. The apple dataset used for the development of AxioSAFE is available at DOI: https://doi.org/10.5281/zenodo.18034024.

动机:通过赛默飞世尔Axiom®阵列生成的基因分型数据集通常很大,因为它们包含数万个标记和数百个个体,目前还没有可用于此类数据的自动数据管理管道。这使得研究人员只有耗时的手工分析作为处理这些复杂的基因分型数据集的当前标准。显然需要一种更有效、更精简的方法来处理这个平台中固有的特定质量控制挑战。AxioSAFE (Axiom SNP评估和过滤引擎)是一种半自动计算机工具,用于管理通过赛默飞世尔Axiom®阵列实验生成的单核苷酸多态性(SNP)基因分型数据集。AxioSAFE提供了一种替代方法来涵盖一组数据管理操作,包括倍性检查、SNP过滤、孟德尔错误分析和分阶段等步骤。AxioSAFE识别有问题的snp和样本的主要出现情况,包括那些未被Axiom阵列默认QC过滤器捕获的snp和样本。它还包含了进一步的功能,可以让用户查看已确定的有问题的SNP类。可用性和实现:AxioSAFE是一个Python程序,既可以通过命令行界面使用,也可以通过图形用户界面(GUI)使用,它作为Docker容器提供,可在DockerHub (https://hub.docker.com/r/lzspin/axiosafe)上获得,其中包括所有必需的库、软件和教程数据集。源代码和文档可从https://bitbucket.org/lzspin/axiosafe/获得。用于开发AxioSAFE的apple数据集可在DOI: https://doi.org/10.5281/zenodo.18034024获得。
{"title":"AxioSAFE: an accessible, semi-automatic filtering tool for the curation of genotyping datasets.","authors":"Lorenzo Spina, Nicholas P Howard, Stijn Vanderzande, Giorgio Tumino, Michela Troggio, Eric van de Weg, Diego Micheletti, Luca Bianco","doi":"10.1093/bioadv/vbag062","DOIUrl":"10.1093/bioadv/vbag062","url":null,"abstract":"<p><strong>Motivation: </strong>Genotyping datasets generated via the Thermo Fisher Axiom<sup>®</sup> array are generally big, as they comprise tens of thousands of markers and hundreds of individuals, and currently, no automatic data curation pipelines are available for this kind of data. This leaves researchers with only time-consuming manual analysis as the current standard for processing these complex genotyping datasets. There is a clear need for a more efficient, streamlined approach to handle the specific quality control challenges inherent in this platform.</p><p><strong>Results: </strong>AxioSAFE (Axiom SNP Assessment and Filtering Engine) is a semi-automatic computer tool for the curation of single nucleotide polymorphism (SNP) genotyping datasets generated via Thermo Fisher Axiom<sup>®</sup> array experiments. AxioSAFE provides an alternative methodology to cover a set of data curation operations, including steps such as a ploidy check, SNP filtering, Mendelian error analysis, and phasing. AxioSAFE identifies major occurrences of problematic SNPs and samples, including those not caught by the Axiom array default QC filters. Further functionality is included to let the user review identified problematic SNP classes.</p><p><strong>Availability and implementation: </strong>AxioSAFE is a Python program that can be either used via the command line interface or through a graphical user interface (GUI) and is provided as a Docker container available on DockerHub at https://hub.docker.com/r/lzspin/axiosafe, which includes all required libraries, software, and a tutorial dataset. The source code and documentation are available at https://bitbucket.org/lzspin/axiosafe/. The apple dataset used for the development of AxioSAFE is available at DOI: https://doi.org/10.5281/zenodo.18034024.</p>","PeriodicalId":72368,"journal":{"name":"Bioinformatics advances","volume":"6 1","pages":"vbag062"},"PeriodicalIF":2.8,"publicationDate":"2026-02-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12967218/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147379393","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Scaling the profile of life by function with SPIN. 使用SPIN按功能缩放生命概况。
IF 2.8 Q2 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2026-02-19 eCollection Date: 2026-01-01 DOI: 10.1093/bioadv/vbag064
Andrea Mancini, Vinh-Son Pho, Alessandro Bianchi, Gianluca Lombardi, Chujun Lyu, Alessandra Carbone

Motivations: Classifying hundreds of thousands of protein sequences by function remains a significant computational challenge. Building on the ProfileView method for identifying functional classes and subclasses, our goal is to achieve large-scale classification of proteins from extensive databases and ongoing high-throughput sequencing efforts, ultimately producing comprehensive sets of sequences that share the same function.

Results: By applying deep learning techniques, SPIN learns discriminative patterns in functionally related sequences, allowing the classification of hundreds of thousands of sequences into a defined number of functional classes. SPIN offers an effective compromise between small, family-specific protein language models (pLMs) and computational cost, with a time complexity linear in the number of sequences. It enables the identification of family-specific conserved residues, providing insight into the functional nuances of protein subclasses. By enhancing the scalability of protein function predictors, SPIN advances our understanding of protein functions and their evolutionary relationships.

Availability and implementation: The data and code that support the findings of this study are publicly available at https://gitlab.lcqb.upmc.fr/andrea.mancini/SPIN.

动机:根据功能对成千上万的蛋白质序列进行分类仍然是一个重大的计算挑战。基于ProfileView方法来识别功能类和亚类,我们的目标是从广泛的数据库和正在进行的高通量测序工作中实现大规模的蛋白质分类,最终产生具有相同功能的综合序列集。结果:通过应用深度学习技术,SPIN学习了功能相关序列中的判别模式,允许将数十万个序列分类为定义数量的功能类。SPIN在小的、家族特异性的蛋白质语言模型(pLMs)和计算成本之间提供了一种有效的折衷,其时间复杂度与序列数量呈线性关系。它能够识别家族特定的保守残基,提供洞察蛋白质亚类的功能细微差别。通过增强蛋白质功能预测因子的可扩展性,SPIN促进了我们对蛋白质功能及其进化关系的理解。可用性和实现:支持本研究结果的数据和代码可在https://gitlab.lcqb.upmc.fr/andrea.mancini/SPIN上公开获得。
{"title":"Scaling the profile of life by function with SPIN.","authors":"Andrea Mancini, Vinh-Son Pho, Alessandro Bianchi, Gianluca Lombardi, Chujun Lyu, Alessandra Carbone","doi":"10.1093/bioadv/vbag064","DOIUrl":"https://doi.org/10.1093/bioadv/vbag064","url":null,"abstract":"<p><strong>Motivations: </strong>Classifying hundreds of thousands of protein sequences by function remains a significant computational challenge. Building on the ProfileView method for identifying functional classes and subclasses, our goal is to achieve large-scale classification of proteins from extensive databases and ongoing high-throughput sequencing efforts, ultimately producing comprehensive sets of sequences that share the same function.</p><p><strong>Results: </strong>By applying deep learning techniques, SPIN learns discriminative patterns in functionally related sequences, allowing the classification of hundreds of thousands of sequences into a defined number of functional classes. SPIN offers an effective compromise between small, family-specific protein language models (pLMs) and computational cost, with a time complexity linear in the number of sequences. It enables the identification of family-specific conserved residues, providing insight into the functional nuances of protein subclasses. By enhancing the scalability of protein function predictors, SPIN advances our understanding of protein functions and their evolutionary relationships.</p><p><strong>Availability and implementation: </strong>The data and code that support the findings of this study are publicly available at https://gitlab.lcqb.upmc.fr/andrea.mancini/SPIN.</p>","PeriodicalId":72368,"journal":{"name":"Bioinformatics advances","volume":"6 1","pages":"vbag064"},"PeriodicalIF":2.8,"publicationDate":"2026-02-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12970593/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147438101","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
AssiST: convolutional neural network for analysis of antibiotic susceptibility testing. 辅助:卷积神经网络用于抗生素药敏试验分析。
IF 2.8 Q2 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2026-02-18 eCollection Date: 2026-01-01 DOI: 10.1093/bioadv/vbag063
Carmen Li, Sydney Schock, Abigail Costa, Amir Mitchell

Summary: Antibiotic susceptibility testing (AST) is routinely used to evaluate microbial responses to antimicrobials. We present AssiST, a convolutional neural network (CNN) pipeline that classifies microbial growth in scanned 96-well broth microdilution plates to infer drug susceptibility at scale. AssiST accommodates diverse growth morphologies and supports a user-configurable mapping from phenotype to susceptibility calls, enabling flexible use across microorganism species, media types, and drugs. AssiST allows labs to convert flatbed-scanner images into reproducible drug sensitivity readouts with a standard personal computer.

Availability and implementation: AssiST is distributed as a MATLAB library and is freely available for non-commercial use. Code, documentation, and training/inference instructions are available at https://github.com/Mitchell-SysBio/AssiST/. We also provide pre-trained models and a library of sample images. The software accepts image files from standard flatbed scanners. We commit to maintaining the repository for at least 2 years post-publication.

摘要:抗生素敏感性试验(AST)通常用于评估微生物对抗菌素的反应。我们提出了AssiST,一种卷积神经网络(CNN)管道,可对扫描的96孔肉汤微稀释板中的微生物生长进行分类,以大规模推断药物敏感性。AssiST适应不同的生长形态,并支持从表型到敏感性呼叫的用户可配置映射,从而实现跨微生物物种,介质类型和药物的灵活使用。AssiST允许实验室用标准的个人电脑将平板扫描仪图像转换为可重复的药敏读数。可用性和实现:AssiST作为MATLAB库发布,可免费用于非商业用途。代码、文档和训练/推理说明可在https://github.com/Mitchell-SysBio/AssiST/上获得。我们还提供了预训练模型和样本图像库。该软件接受来自标准平板扫描仪的图像文件。我们承诺在发布后至少维护存储库2年。
{"title":"AssiST: convolutional neural network for analysis of antibiotic susceptibility testing.","authors":"Carmen Li, Sydney Schock, Abigail Costa, Amir Mitchell","doi":"10.1093/bioadv/vbag063","DOIUrl":"10.1093/bioadv/vbag063","url":null,"abstract":"<p><strong>Summary: </strong>Antibiotic susceptibility testing (AST) is routinely used to evaluate microbial responses to antimicrobials. We present AssiST, a convolutional neural network (CNN) pipeline that classifies microbial growth in scanned 96-well broth microdilution plates to infer drug susceptibility at scale. AssiST accommodates diverse growth morphologies and supports a user-configurable mapping from phenotype to susceptibility calls, enabling flexible use across microorganism species, media types, and drugs. AssiST allows labs to convert flatbed-scanner images into reproducible drug sensitivity readouts with a standard personal computer.</p><p><strong>Availability and implementation: </strong>AssiST is distributed as a MATLAB library and is freely available for non-commercial use. Code, documentation, and training/inference instructions are available at https://github.com/Mitchell-SysBio/AssiST/. We also provide pre-trained models and a library of sample images. The software accepts image files from standard flatbed scanners. We commit to maintaining the repository for at least 2 years post-publication.</p>","PeriodicalId":72368,"journal":{"name":"Bioinformatics advances","volume":"6 1","pages":"vbag063"},"PeriodicalIF":2.8,"publicationDate":"2026-02-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12952218/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147349799","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
SLE-diseaseome: a comprehensive meta-collection of systemic lupus erythematosus relevant functional pathways. sle -疾病组:系统性红斑狼疮相关功能通路的综合meta集合。
IF 2.8 Q2 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2026-02-18 eCollection Date: 2026-01-01 DOI: 10.1093/bioadv/vbag061
Daniel Toro-Domínguez, Chang Wang, Iván Ellson-Lancho, Jordi Martorell-Marugán, Pedro Carmona-Sáez, Marta E Alarcón-Riquelme, Frédéric Baribaud

Motivation: Systemic lupus erythematosus patients exhibit a broad clinical spectrum of manifestations and suffer from high rates of treatment failure. These can be attributed to disease heterogeneity due to differentially dysregulated pathways. Precision medicine considering the individualized molecular disease driving mechanisms is a promising strategy to address challenges imposed by disease heterogeneity. Available patient blood transcriptome data coupled with pathway-based single-sample scoring approaches have been extensively employed to reveal molecular footprints of disease states and progression as well as delineate population heterogeneity. However, systemic understanding of pathways involved in disease pathogenesis remains lacking.

Results: We created a SLE-diseaseome, an integrative multi-cohort collection of disease-relevant functional gene sets. This resource contains a comprehensive collection of disease-specific gene signatures combining knowledge from several pathway databases and signature sources robustly defined by integrating multiple studies. It offers reliable and extensive reference signatures in a disease-specific manner for functional interpretation of molecular data from clinical studies.

Availability and implementation: The code used to run the pipeline and the R object containing the SLE-diseaseome collection are available at https://github.com/dtordom/SLEDiseaseome.

动机:系统性红斑狼疮患者表现出广泛的临床表现,并遭受高失败率的治疗。这些可归因于由于差异失调通路导致的疾病异质性。考虑个体化分子疾病驱动机制的精准医学是解决疾病异质性挑战的一种有前景的策略。现有的患者血液转录组数据与基于通路的单样本评分方法已被广泛用于揭示疾病状态和进展的分子足迹以及描述人群异质性。然而,系统的了解途径参与疾病的发病机制仍然缺乏。结果:我们创建了一个ssi疾病组,一个与疾病相关的功能基因集的综合多队列集合。该资源包含疾病特异性基因签名的综合收集,结合了来自多个途径数据库和签名源的知识,这些签名源通过整合多个研究得到了强有力的定义。它为临床研究分子数据的功能解释提供了疾病特异性的可靠和广泛的参考签名。可用性和实现:用于运行管道和包含sle -disease - ome集合的R对象的代码可从https://github.com/dtordom/SLEDiseaseome获得。
{"title":"SLE-diseaseome: a comprehensive meta-collection of systemic lupus erythematosus relevant functional pathways.","authors":"Daniel Toro-Domínguez, Chang Wang, Iván Ellson-Lancho, Jordi Martorell-Marugán, Pedro Carmona-Sáez, Marta E Alarcón-Riquelme, Frédéric Baribaud","doi":"10.1093/bioadv/vbag061","DOIUrl":"https://doi.org/10.1093/bioadv/vbag061","url":null,"abstract":"<p><strong>Motivation: </strong>Systemic lupus erythematosus patients exhibit a broad clinical spectrum of manifestations and suffer from high rates of treatment failure. These can be attributed to disease heterogeneity due to differentially dysregulated pathways. Precision medicine considering the individualized molecular disease driving mechanisms is a promising strategy to address challenges imposed by disease heterogeneity. Available patient blood transcriptome data coupled with pathway-based single-sample scoring approaches have been extensively employed to reveal molecular footprints of disease states and progression as well as delineate population heterogeneity. However, systemic understanding of pathways involved in disease pathogenesis remains lacking.</p><p><strong>Results: </strong>We created a SLE-diseaseome, an integrative multi-cohort collection of disease-relevant functional gene sets. This resource contains a comprehensive collection of disease-specific gene signatures combining knowledge from several pathway databases and signature sources robustly defined by integrating multiple studies. It offers reliable and extensive reference signatures in a disease-specific manner for functional interpretation of molecular data from clinical studies.</p><p><strong>Availability and implementation: </strong>The code used to run the pipeline and the R object containing the SLE-diseaseome collection are available at https://github.com/dtordom/SLEDiseaseome.</p>","PeriodicalId":72368,"journal":{"name":"Bioinformatics advances","volume":"6 1","pages":"vbag061"},"PeriodicalIF":2.8,"publicationDate":"2026-02-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12989159/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147470092","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
HERMES: an open-source mining tool for open-access literature. HERMES:一个开放获取文献的开源挖掘工具。
IF 2.8 Q2 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2026-02-17 eCollection Date: 2026-01-01 DOI: 10.1093/bioadv/vbag058
Julien Charest, Katarina Priselac, Georg H Reischer, Andreas H Farnleitner, Robert L Mach, Astrid R Mach-Aigner

Motivation: The exponential growth of open-access scientific literature presents researchers with unprecedented opportunities but also poses a significant challenge: how to efficiently identify and prioritize relevant publications in a transparent and customizable manner. Existing search engines index large volumes of biomedical literature but rarely provide user-defined ranking options, reproducibility, or integration of domain-specific criteria. This gap is particularly limiting for specialized fields, where nuanced keyword combinations, literature recency, and contextual interpretation are critical.

Results: We present HERMES, an open-source literature mining tool for targeted retrieval and ranking of full-text open-access publications from PubMed Central (PMC). HERMES employs a composite scoring algorithm that integrates keyword frequency, citation counts, and publication age to prioritize publications. It further supports summarization, biomedical entity recognition, and PDF report generation. An intuitive graphical user interface (GUI) allows researchers without programming expertise to perform complex literature mining tasks, while multithreaded processing ensures efficiency for large-scale queries. HERMES provides a reproducible and adaptable framework for literature discovery, empowering researchers to rapidly identify relevant literature and promoting transparency and community-driven extension.

Availability and implementation: HERMES (version 1.2) is implemented in Python (3.11). The source code is freely available on GitHub at https://github.com/julien-charest/hermes and is distributed under the GPL-3 license.

动机:开放获取科学文献的指数级增长为研究人员提供了前所未有的机会,但也提出了重大挑战:如何以透明和可定制的方式有效地识别和优先考虑相关出版物。现有的搜索引擎索引了大量的生物医学文献,但很少提供用户定义的排名选项、可重复性或特定领域标准的集成。这种差距在专业领域尤其有限,在这些领域,微妙的关键字组合、文献近代性和上下文解释至关重要。结果:我们提出了一个开源文献挖掘工具HERMES,用于有针对性地检索PubMed Central (PMC)的全文开放获取出版物并对其进行排名。HERMES采用了一种综合评分算法,该算法集成了关键词频率、引用次数和出版时间,从而对出版物进行优先排序。它进一步支持摘要、生物医学实体识别和PDF报告生成。直观的图形用户界面(GUI)允许没有编程专业知识的研究人员执行复杂的文献挖掘任务,而多线程处理确保了大规模查询的效率。HERMES为文献发现提供了一个可复制和适应性强的框架,使研究人员能够快速识别相关文献,并促进透明度和社区驱动的扩展。可用性和实现:HERMES(版本1.2)在Python(3.11)中实现。源代码可以在GitHub (https://github.com/julien-charest/hermes)上免费获得,并根据GPL-3许可发布。
{"title":"HERMES: an open-source mining tool for open-access literature.","authors":"Julien Charest, Katarina Priselac, Georg H Reischer, Andreas H Farnleitner, Robert L Mach, Astrid R Mach-Aigner","doi":"10.1093/bioadv/vbag058","DOIUrl":"https://doi.org/10.1093/bioadv/vbag058","url":null,"abstract":"<p><strong>Motivation: </strong>The exponential growth of open-access scientific literature presents researchers with unprecedented opportunities but also poses a significant challenge: how to efficiently identify and prioritize relevant publications in a transparent and customizable manner. Existing search engines index large volumes of biomedical literature but rarely provide user-defined ranking options, reproducibility, or integration of domain-specific criteria. This gap is particularly limiting for specialized fields, where nuanced keyword combinations, literature recency, and contextual interpretation are critical.</p><p><strong>Results: </strong>We present HERMES, an open-source literature mining tool for targeted retrieval and ranking of full-text open-access publications from PubMed Central (PMC). HERMES employs a composite scoring algorithm that integrates keyword frequency, citation counts, and publication age to prioritize publications. It further supports summarization, biomedical entity recognition, and PDF report generation. An intuitive graphical user interface (GUI) allows researchers without programming expertise to perform complex literature mining tasks, while multithreaded processing ensures efficiency for large-scale queries. HERMES provides a reproducible and adaptable framework for literature discovery, empowering researchers to rapidly identify relevant literature and promoting transparency and community-driven extension.</p><p><strong>Availability and implementation: </strong>HERMES (version 1.2) is implemented in Python (3.11). The source code is freely available on GitHub at https://github.com/julien-charest/hermes and is distributed under the GPL-3 license.</p>","PeriodicalId":72368,"journal":{"name":"Bioinformatics advances","volume":"6 1","pages":"vbag058"},"PeriodicalIF":2.8,"publicationDate":"2026-02-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12952204/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147349773","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Stratified signaling network remodeling of kinase-transcription factors' interactions in Parkinson's disease. 帕金森病中激酶-转录因子相互作用的分层信号网络重构。
IF 2.8 Q2 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2026-02-17 eCollection Date: 2026-01-01 DOI: 10.1093/bioadv/vbag059
Xiaoyan Zhou, Luca Parisi, Sicen Liu, Ziqi Cheng, Hanwen Liang, Mansour Youseffi, Farideh Javid, Renfei Ma

Motivation: Understanding how signaling networks differ across molecular subgroups of Parkinson's disease (PD) is essential for gaining further mechanistic insights and advancing therapeutic development for the disease. This study introduces an integrative, stratified computational framework to characterize subgroup-specific changes in kinase-transcription factors' (TFs) interactions using transcriptomic profiles.

Results: Differential expression analysis was leveraged to identify kinases with altered expression across various PD subgroups, while transcription factor activity inferred by multi-sample Virtual Inference of Protein-activity by Enriched Regulon revealed dysregulated transcription relative to controls. Phosphorylation data from SIGNOR 4.0 enabled the construction of kinase-TF subnetworks, which were analysed via pathway enrichment to reveal affected biological pathways. Comparative analyses and modeling revealed both shared and distinct signaling features among PD stratified subgroups. A recurring pattern across multiple groups involved STAT family-specific activation downstream of receptor and non-receptor tyrosine kinases, consistently with a conserved inflammatory and pro-survival signaling axis. In contrast, PD_LRRK2 showed selective involvement of immune-metabolic pathways, including AMPK to HNF4A and PAK5 to NF- κ B, while PD_GBA and prodromal cohorts were characterized by stress and apoptosis-related mechanisms involving MAPK10 (JNK3), TP53, and hormone receptor pathways (AR and ESR1). Overall, this novel stratified computational framework integrates gene expression, infers subtle TF activity, identifies differentially expressed kinases, and leverages mechanistic interaction data to unveil signaling heterogeneity in PD. Identifying regulators and subgroup-specific network features provides opportunities to inform, influence, and enable the unveiling of novel biomarkers and develop more effective and proactive precision therapeutics.

Availability and implementation: Source code is available at https://github.com/xyzhou218/Kin_TF_net.

动机:了解帕金森氏病(PD)分子亚群之间信号网络的差异对于获得进一步的机制见解和推进该疾病的治疗开发至关重要。本研究引入了一个综合的、分层的计算框架,利用转录组谱来表征激酶-转录因子(tf)相互作用的亚群特异性变化。结果:利用差异表达分析来鉴定不同PD亚组中表达改变的激酶,而通过多样本富集调节蛋白活性虚拟推断推断的转录因子活性显示相对于对照组的转录失调。SIGNOR 4.0的磷酸化数据可以构建激酶- tf子网络,通过途径富集分析这些子网络以揭示受影响的生物学途径。对比分析和建模揭示了PD分层亚组之间共享和不同的信号特征。在多个组中重复出现的模式涉及STAT家族特异性激活下游的受体和非受体酪氨酸激酶,与保守的炎症和促生存信号轴一致。相比之下,PD_LRRK2显示选择性参与免疫代谢途径,包括AMPK到HNF4A和PAK5到NF- κ B,而PD_GBA和前体期队列的特征是应激和凋亡相关机制,包括MAPK10 (JNK3)、TP53和激素受体途径(AR和ESR1)。总的来说,这种新的分层计算框架整合了基因表达,推断了微妙的TF活性,识别了差异表达的激酶,并利用机制相互作用数据揭示了PD的信号异质性。识别调节因子和亚群特定网络特征提供了信息、影响和揭示新型生物标志物的机会,并开发出更有效、更主动的精确治疗方法。可用性和实现:源代码可从https://github.com/xyzhou218/Kin_TF_net获得。
{"title":"Stratified signaling network remodeling of kinase-transcription factors' interactions in Parkinson's disease.","authors":"Xiaoyan Zhou, Luca Parisi, Sicen Liu, Ziqi Cheng, Hanwen Liang, Mansour Youseffi, Farideh Javid, Renfei Ma","doi":"10.1093/bioadv/vbag059","DOIUrl":"https://doi.org/10.1093/bioadv/vbag059","url":null,"abstract":"<p><strong>Motivation: </strong>Understanding how signaling networks differ across molecular subgroups of Parkinson's disease (PD) is essential for gaining further mechanistic insights and advancing therapeutic development for the disease. This study introduces an integrative, stratified computational framework to characterize subgroup-specific changes in kinase-transcription factors' (TFs) interactions using transcriptomic profiles.</p><p><strong>Results: </strong>Differential expression analysis was leveraged to identify kinases with altered expression across various PD subgroups, while transcription factor activity inferred by multi-sample Virtual Inference of Protein-activity by Enriched Regulon revealed dysregulated transcription relative to controls. Phosphorylation data from SIGNOR 4.0 enabled the construction of kinase-TF subnetworks, which were analysed via pathway enrichment to reveal affected biological pathways. Comparative analyses and modeling revealed both shared and distinct signaling features among PD stratified subgroups. A recurring pattern across multiple groups involved STAT family-specific activation downstream of receptor and non-receptor tyrosine kinases, consistently with a conserved inflammatory and pro-survival signaling axis. In contrast, PD_LRRK2 showed selective involvement of immune-metabolic pathways, including AMPK to HNF4A and PAK5 to NF- <math><mi>κ</mi></math> B, while PD_GBA and prodromal cohorts were characterized by stress and apoptosis-related mechanisms involving MAPK10 (JNK3), TP53, and hormone receptor pathways (AR and ESR1). Overall, this novel stratified computational framework integrates gene expression, infers subtle TF activity, identifies differentially expressed kinases, and leverages mechanistic interaction data to unveil signaling heterogeneity in PD. Identifying regulators and subgroup-specific network features provides opportunities to inform, influence, and enable the unveiling of novel biomarkers and develop more effective and proactive precision therapeutics.</p><p><strong>Availability and implementation: </strong>Source code is available at https://github.com/xyzhou218/Kin_TF_net.</p>","PeriodicalId":72368,"journal":{"name":"Bioinformatics advances","volume":"6 1","pages":"vbag059"},"PeriodicalIF":2.8,"publicationDate":"2026-02-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12955839/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147357791","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
OmniCorr: an R-package for visualizing putative host-microbiome interactions using multi-omics data. OmniCorr:使用多组学数据可视化假定宿主-微生物组相互作用的r包。
IF 2.8 Q2 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2026-02-17 eCollection Date: 2026-01-01 DOI: 10.1093/bioadv/vbag057
Shashank Gupta, Veronica Quarato, Wanxin Lai, Carl M Kobel, Velma T E Aho, Arturo Vera-Ponce de León, Sabina Leanti La Rosa, Simen R Sandve, Phillip B Pope, Torgeir R Hvidsten

Holo-omics leverages omics datasets to explore the interactions between hosts and their associated microbiomes. Although the generation of omics data from matching host and microbiome samples is steadily increasing, there remains a scarcity of computational tools capable of integrating and visualizing this data to facilitate the prediction and interpretation of host-microbiome interactions. We present OmniCorr, an R package designed to: (i) manage the complexity of omics data by clustering co-varying features (e.g. genes, proteins, and metabolites) into modules, (ii) visualize correlations of these modules across different omics layers, host-microbiome interfaces, and metadata, and (iii) identify statistically significant associations indicative of putative host-microbiome interactions. OmniCorr's utility is demonstrated using datasets from two systems: (i) Atlantic salmon, integrating host transcriptomics with metagenomics and metatranscriptomics to explore dietary impacts, and (ii) cattle, combining host proteomics with metaproteomics to investigate methane emission variability. Availability and implementation: OmniCorr is freely available at https://github.com/shashank-KU/OmniCorr.

全息组学利用组学数据集来探索宿主及其相关微生物组之间的相互作用。尽管从匹配宿主和微生物组样本中产生的组学数据正在稳步增加,但仍然缺乏能够整合和可视化这些数据以促进宿主-微生物组相互作用的预测和解释的计算工具。我们提出了OmniCorr,这是一个R软件包,旨在:(i)通过将共同变化的特征(如基因、蛋白质和代谢物)聚类到模块中来管理组学数据的复杂性,(ii)可视化这些模块在不同组学层、宿主-微生物组界面和元数据之间的相关性,以及(iii)确定表明假定的宿主-微生物组相互作用的统计上显着的关联。OmniCorr的效用是通过两个系统的数据集来证明的:(i)大西洋鲑鱼,将宿主转录组学与元基因组学和元转录组学相结合,以探索饮食的影响;(ii)牛,将宿主蛋白质组学与宏蛋白质组学相结合,以研究甲烷排放的变异性。可用性和实现:OmniCorr可在https://github.com/shashank-KU/OmniCorr免费获得。
{"title":"OmniCorr: an R-package for visualizing putative host-microbiome interactions using multi-omics data.","authors":"Shashank Gupta, Veronica Quarato, Wanxin Lai, Carl M Kobel, Velma T E Aho, Arturo Vera-Ponce de León, Sabina Leanti La Rosa, Simen R Sandve, Phillip B Pope, Torgeir R Hvidsten","doi":"10.1093/bioadv/vbag057","DOIUrl":"10.1093/bioadv/vbag057","url":null,"abstract":"<p><p>Holo-omics leverages omics datasets to explore the interactions between hosts and their associated microbiomes. Although the generation of omics data from matching host and microbiome samples is steadily increasing, there remains a scarcity of computational tools capable of integrating and visualizing this data to facilitate the prediction and interpretation of host-microbiome interactions. We present <b>OmniCorr</b>, an R package designed to: (i) manage the complexity of omics data by clustering co-varying features (e.g. genes, proteins, and metabolites) into modules, (ii) visualize correlations of these modules across different omics layers, host-microbiome interfaces, and metadata, and (iii) identify statistically significant associations indicative of putative host-microbiome interactions. OmniCorr's utility is demonstrated using datasets from two systems: (i) Atlantic salmon, integrating host transcriptomics with metagenomics and metatranscriptomics to explore dietary impacts, and (ii) cattle, combining host proteomics with metaproteomics to investigate methane emission variability. <i>Availability and implementation</i>: OmniCorr is freely available at https://github.com/shashank-KU/OmniCorr.</p>","PeriodicalId":72368,"journal":{"name":"Bioinformatics advances","volume":"6 1","pages":"vbag057"},"PeriodicalIF":2.8,"publicationDate":"2026-02-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12961270/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147379767","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Bioinformatics advances
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1