首页 > 最新文献

Bioinformatics (Oxford, England)最新文献

英文 中文
Gene count estimation with pytximport enables reproducible analysis of bulk RNA sequencing data in Python. 利用 pytximport 估算基因数量,可在 Python 中对大量 RNA 测序数据进行可重现的分析。
Pub Date : 2024-11-20 DOI: 10.1093/bioinformatics/btae700
Malte Kuehl, Milagros N Wong, Nicola Wanner, Stefan Bonn, Victor G Puelles

Summary: Transcript quantification tools efficiently map bulk RNA sequencing reads to reference transcriptomes. However, their output consists of transcript count estimates that are subject to multiple biases and cannot be readily used with existing differential gene expression analysis tools in Python.Here we present pytximport, a Python implementation of the tximport R package that supports a variety of input formats, different modes of bias correction, inferential replicates, gene-level summarization of transcript counts, transcript-level exports, transcript-to-gene mapping generation and optional filtering of transcripts by biotype. pytximport is part of the scverse ecosystem of open-source Python software packages for omics analyses and includes both a Python as well as a command-line interface.With pytximport, we propose a bulk RNA sequencing analysis workflow based on Bioconda and scverse ecosystem packages, ensuring reproducible analyses through Snakemake rules. We apply this pipeline to a publicly available RNA-sequencing dataset, demonstrating how pytximport enables the creation of Python-centric workflows capable of providing insights into transcriptomic alterations.

Availability: pytximport is licensed under the GNU General Public License version 3. The source code is available at https://github.com/complextissue/pytximport and via Zenodo with DOI: 10.5281/zenodo.13907917. A related Snakemake workflow is available through GitHub at https://github.com/complextissue/snakemake-bulk-rna-seq-workflow and Zenodo with DOI: 10.5281/zenodo.12713811. Documentation and a vignette for new users are available at: https://pytximport.readthedocs.io.

Supplementary information: Supplementary Material is available at Bioinformatics online.

摘要:转录本定量工具能有效地将大量 RNA 测序读数映射到参考转录组。我们在此介绍 pytximport,它是 tximport R 软件包的 Python 实现,支持多种输入格式、不同的偏差校正模式、推理重复、转录本计数的基因级汇总、转录本级导出、转录本到基因映射生成以及可选的生物型转录本过滤。pytximport是用于omics分析的开源Python软件包scverse生态系统的一部分,包括一个Python和一个命令行界面。通过pytximport,我们提出了一个基于Bioconda和scverse生态系统软件包的批量RNA测序分析工作流,通过Snakemake规则确保分析的可重复性。我们将这一流程应用于一个公开的 RNA 测序数据集,展示了 pytximport 如何创建以 Python 为中心的工作流,从而深入了解转录组的变化。源代码可从 https://github.com/complextissue/pytximport 获取,也可通过 Zenodo 获取,DOI:10.5281/zenodo.13907917。相关的 Snakemake 工作流程可通过 GitHub https://github.com/complextissue/snakemake-bulk-rna-seq-workflow 和 Zenodo 获取,DOI:10.5281/zenodo.12713811。文档和面向新用户的小节可在以下网址获取:https://pytximport.readthedocs.io.Supplementary information:补充材料可在 Bioinformatics online 上获取。
{"title":"Gene count estimation with pytximport enables reproducible analysis of bulk RNA sequencing data in Python.","authors":"Malte Kuehl, Milagros N Wong, Nicola Wanner, Stefan Bonn, Victor G Puelles","doi":"10.1093/bioinformatics/btae700","DOIUrl":"https://doi.org/10.1093/bioinformatics/btae700","url":null,"abstract":"<p><strong>Summary: </strong>Transcript quantification tools efficiently map bulk RNA sequencing reads to reference transcriptomes. However, their output consists of transcript count estimates that are subject to multiple biases and cannot be readily used with existing differential gene expression analysis tools in Python.Here we present pytximport, a Python implementation of the tximport R package that supports a variety of input formats, different modes of bias correction, inferential replicates, gene-level summarization of transcript counts, transcript-level exports, transcript-to-gene mapping generation and optional filtering of transcripts by biotype. pytximport is part of the scverse ecosystem of open-source Python software packages for omics analyses and includes both a Python as well as a command-line interface.With pytximport, we propose a bulk RNA sequencing analysis workflow based on Bioconda and scverse ecosystem packages, ensuring reproducible analyses through Snakemake rules. We apply this pipeline to a publicly available RNA-sequencing dataset, demonstrating how pytximport enables the creation of Python-centric workflows capable of providing insights into transcriptomic alterations.</p><p><strong>Availability: </strong>pytximport is licensed under the GNU General Public License version 3. The source code is available at https://github.com/complextissue/pytximport and via Zenodo with DOI: 10.5281/zenodo.13907917. A related Snakemake workflow is available through GitHub at https://github.com/complextissue/snakemake-bulk-rna-seq-workflow and Zenodo with DOI: 10.5281/zenodo.12713811. Documentation and a vignette for new users are available at: https://pytximport.readthedocs.io.</p><p><strong>Supplementary information: </strong>Supplementary Material is available at Bioinformatics online.</p>","PeriodicalId":93899,"journal":{"name":"Bioinformatics (Oxford, England)","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-11-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142683861","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Pf-HaploAtlas: an interactive web app for spatiotemporal analysis of P. falciparum genes. Pf-HaploAtlas:用于恶性疟原虫基因时空分析的交互式网络应用程序。
Pub Date : 2024-11-20 DOI: 10.1093/bioinformatics/btae673
Chiyun Lee, Eyyüb S Ünlü, Nina F D White, Jacob Almagro-Garcia, Cristina Ariani, Richard D Pearson

Motivation: Monitoring the genomic evolution of Plasmodium falciparum-the most widespread and deadliest of the human-infecting malaria species-is critical for making decisions in response to changes in drug resistance, diagnostic test failures, and vaccine effectiveness. The MalariaGEN data resources are the world's largest whole genome sequencing databases for Plasmodium parasites. The size and complexity of such data is a barrier to many potential end users in both public health and academic research. A user-friendly method for accessing and exploring data on the genetic variation of P. falciparum would greatly enable efforts in studying and controlling malaria.

Results: We developed Pf-HaploAtlas, a web application enabling exploratory data analysis of genomic variation without requiring advanced technical expertise. The app provides analysis-ready data catalogues and visualisations of amino acid haplotypes for all 5,102 core P. falciparum genes. Pf-HaploAtlas facilitates comprehensive spatial and temporal exploration of genes and variants of interest by using data from 16,203 samples, from 33 countries, and spread between the years 1984 and 2018. The scope of Pf-HaploAtlas will expand with each new MalariaGEN Plasmodium data release.

Availability: Pf-HaploAtlas is available online for public use at https://apps.malariagen.net/pf-haploatlas, which allows users to download the underlying amino acid haplotype data for further analyses, and its source code is freely available on GitHub under the MIT licence at https://github.com/malariagen/pf-haploatlas.

动因:恶性疟原虫是人类感染疟疾种类中分布最广、最致命的一种,监测恶性疟原虫的基因组进化对于针对耐药性、诊断检测失败和疫苗有效性的变化做出决策至关重要。MalariaGEN 数据资源是世界上最大的疟原虫全基因组测序数据库。这些数据的规模和复杂性阻碍了公共卫生和学术研究领域的许多潜在最终用户。如果能有一种用户友好型方法来访问和探索恶性疟原虫基因变异数据,将极大地促进疟疾研究和控制工作:结果:我们开发了 Pf-HaploAtlas,这是一款网络应用程序,无需高级技术知识即可对基因组变异进行探索性数据分析。该应用程序为恶性疟原虫的所有 5102 个核心基因提供了分析就绪的数据目录和氨基酸单倍型可视化。Pf-HaploAtlas 利用来自 33 个国家的 16203 个样本的数据,从 1984 年到 2018 年对相关基因和变异进行了全面的时空探索。Pf-HaploAtlas 的范围将随着 MalariaGEN 疟原虫数据的发布而扩大:Pf-HaploAtlas 可在 https://apps.malariagen.net/pf-haploatlas 上供公众在线使用,用户可以下载底层氨基酸单倍型数据以进行进一步分析,其源代码可在 GitHub 上以 MIT 许可免费获取,网址为 https://github.com/malariagen/pf-haploatlas。
{"title":"Pf-HaploAtlas: an interactive web app for spatiotemporal analysis of P. falciparum genes.","authors":"Chiyun Lee, Eyyüb S Ünlü, Nina F D White, Jacob Almagro-Garcia, Cristina Ariani, Richard D Pearson","doi":"10.1093/bioinformatics/btae673","DOIUrl":"https://doi.org/10.1093/bioinformatics/btae673","url":null,"abstract":"<p><strong>Motivation: </strong>Monitoring the genomic evolution of Plasmodium falciparum-the most widespread and deadliest of the human-infecting malaria species-is critical for making decisions in response to changes in drug resistance, diagnostic test failures, and vaccine effectiveness. The MalariaGEN data resources are the world's largest whole genome sequencing databases for Plasmodium parasites. The size and complexity of such data is a barrier to many potential end users in both public health and academic research. A user-friendly method for accessing and exploring data on the genetic variation of P. falciparum would greatly enable efforts in studying and controlling malaria.</p><p><strong>Results: </strong>We developed Pf-HaploAtlas, a web application enabling exploratory data analysis of genomic variation without requiring advanced technical expertise. The app provides analysis-ready data catalogues and visualisations of amino acid haplotypes for all 5,102 core P. falciparum genes. Pf-HaploAtlas facilitates comprehensive spatial and temporal exploration of genes and variants of interest by using data from 16,203 samples, from 33 countries, and spread between the years 1984 and 2018. The scope of Pf-HaploAtlas will expand with each new MalariaGEN Plasmodium data release.</p><p><strong>Availability: </strong>Pf-HaploAtlas is available online for public use at https://apps.malariagen.net/pf-haploatlas, which allows users to download the underlying amino acid haplotype data for further analyses, and its source code is freely available on GitHub under the MIT licence at https://github.com/malariagen/pf-haploatlas.</p>","PeriodicalId":93899,"journal":{"name":"Bioinformatics (Oxford, England)","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-11-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142683864","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
MMOSurv: Meta-learning for few-shot survival analysis with multi-omics data. MMOSurv:利用多组学数据的元学习(Meta-learning for few-shot survival analysis with multi-omics data)。
Pub Date : 2024-11-19 DOI: 10.1093/bioinformatics/btae684
Gang Wen, Limin Li

Motivation: High-throughput techniques have produced a large amount of high-dimensional multi-omics data, which makes it promising to predict patient survival outcomes more accurately. Recent work has showed the superiority of multi-omics data in survival analysis. However, it remains challenging to integrate multi-omics data to solve few-shot survival prediction problem, with only a few available training samples, especially for rare cancers.

Results: In this work, we propose a meta-learning framework for multi-omics few-shot survival analysis, namely MMOSurv, which enables to learn an effective multi-omics survival prediction model from a very few training samples of a specific cancer type, with the meta-knowledge across tasks from relevant cancer types. By assuming a deep Cox survival model with multiple omics, MMOSurv first learns an adaptable parameter initialization for the multi-omics survival model from abundant data of relevant cancers, and then adapts the parameters quickly and efficiently for the target cancer task with a very few training samples. Our experiments on eleven cancer types in TCGA datasets show that, compared to single-omics meta-learning methods, MMOSurv can better utilize the meta-information of similarities and relationships between different omics data from relevant cancer datasets to improve survival prediction of the target cancer with a very few multi-omics training samples. Furthermore, MMOSurv achieves better prediction performance than other state-of-the-art strategies such as multi-task learning and pre-training.

Availability and implementation: MMOSurv is freely available at https://github.com/LiminLi-xjtu/MMOSurv.

Supplementary information: Supplementary data are available at Bioinformatics online.

动机高通量技术产生了大量的高维多组学数据,因此有望更准确地预测患者的生存结果。最近的研究表明,多组学数据在生存分析中具有优越性。然而,在只有少量可用训练样本的情况下,特别是对于罕见癌症,整合多组学数据以解决少次生存预测问题仍具有挑战性:在这项工作中,我们提出了一种用于多组学少次生存分析的元学习框架,即 MMOSurv,它能利用相关癌症类型任务中的元知识,从特定癌症类型的极少数训练样本中学习有效的多组学生存预测模型。MMOSurv 假设有一个包含多个组学的深度考克斯生存模型,它首先从相关癌症的大量数据中为多组学生存模型学习一个可调整的参数初始化,然后针对目标癌症任务,用极少的训练样本快速有效地调整参数。我们在 TCGA 数据集中 11 种癌症类型的实验表明,与单一组学元学习方法相比,MMOSurv 能更好地利用相关癌症数据集中不同组学数据之间相似性和关系的元信息,以极少的多组学训练样本提高目标癌症的生存预测能力。此外,与多任务学习和预训练等其他最先进的策略相比,MMOSurv 的预测效果更好:MMOSurv 可在 https://github.com/LiminLi-xjtu/MMOSurv.Supplementary 网站上免费获取:补充数据可在 Bioinformatics online 上获取。
{"title":"MMOSurv: Meta-learning for few-shot survival analysis with multi-omics data.","authors":"Gang Wen, Limin Li","doi":"10.1093/bioinformatics/btae684","DOIUrl":"https://doi.org/10.1093/bioinformatics/btae684","url":null,"abstract":"<p><strong>Motivation: </strong>High-throughput techniques have produced a large amount of high-dimensional multi-omics data, which makes it promising to predict patient survival outcomes more accurately. Recent work has showed the superiority of multi-omics data in survival analysis. However, it remains challenging to integrate multi-omics data to solve few-shot survival prediction problem, with only a few available training samples, especially for rare cancers.</p><p><strong>Results: </strong>In this work, we propose a meta-learning framework for multi-omics few-shot survival analysis, namely MMOSurv, which enables to learn an effective multi-omics survival prediction model from a very few training samples of a specific cancer type, with the meta-knowledge across tasks from relevant cancer types. By assuming a deep Cox survival model with multiple omics, MMOSurv first learns an adaptable parameter initialization for the multi-omics survival model from abundant data of relevant cancers, and then adapts the parameters quickly and efficiently for the target cancer task with a very few training samples. Our experiments on eleven cancer types in TCGA datasets show that, compared to single-omics meta-learning methods, MMOSurv can better utilize the meta-information of similarities and relationships between different omics data from relevant cancer datasets to improve survival prediction of the target cancer with a very few multi-omics training samples. Furthermore, MMOSurv achieves better prediction performance than other state-of-the-art strategies such as multi-task learning and pre-training.</p><p><strong>Availability and implementation: </strong>MMOSurv is freely available at https://github.com/LiminLi-xjtu/MMOSurv.</p><p><strong>Supplementary information: </strong>Supplementary data are available at Bioinformatics online.</p>","PeriodicalId":93899,"journal":{"name":"Bioinformatics (Oxford, England)","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-11-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142678071","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
DrugRepPT: a deep pre-training and fine-tuning framework for drug repositioning based on drug's expression perturbation and treatment effectiveness. DrugRepPT:基于药物表达扰动和治疗效果的药物重新定位深度预训练和微调框架。
Pub Date : 2024-11-19 DOI: 10.1093/bioinformatics/btae692
Shuyue Fan, Kuo Yang, Kezhi Lu, Xin Dong, Xianan Li, Qiang Zhu, Shao Li, Jianyang Zeng, Xuezhong Zhou

Motivation: Drug repositioning, identifying novel indications for approved drugs, is a cost-effective strategy in drug discovery. Despite numerous proposed drug repositioning models, integrating network-based features, differential gene expression, and chemical structures for high-performance drug repositioning remains challenging.

Results: We propose a comprehensive deep pre-training and fine-tuning framework for drug repositioning, termed DrugRepPT. Initially, we design a graph pre-training module employing model-augmented contrastive learning on a vast drug-disease heterogeneous graph to capture nuanced interactions and expression perturbations after intervention. Subsequently, we introduce a fine-tuning module leveraging a graph residual-like convolution network to elucidate intricate interactions between diseases and drugs. Moreover, a Bayesian multi-loss approach is introduced to balance the existence and effectiveness of drug treatment effectively. Extensive experiments showcase the efficacy of our framework, with DrugRepPT exhibiting remarkable performance improvements compared to SOTA baseline methods (Improvement 106.13% on Hit@1 and 54.45% on mean reciprocal rank). The reliability of predicted results is further validated through two case studies, ie, gastritis and fatty liver, via literature validation, network medicine analysis, and docking screening.

Availability and implementation: The code and results are available at https://github.com/2020MEAI/DrugRepPT.

Supplementary information: Supplementary data are available at Bioinformatics online.

动机药物重新定位,即确定已批准药物的新适应症,是药物发现过程中一种具有成本效益的策略。尽管提出了许多药物重新定位模型,但整合基于网络的特征、差异基因表达和化学结构以实现高性能的药物重新定位仍具有挑战性:我们为药物重新定位提出了一个全面的深度预训练和微调框架,称为 DrugRepPT。首先,我们设计了一个图预训练模块,在庞大的药物-疾病异构图上采用模型增强对比学习,捕捉细微的相互作用和干预后的表达扰动。随后,我们引入了一个微调模块,利用类似图残差的卷积网络来阐明疾病与药物之间错综复杂的相互作用。此外,我们还引入了贝叶斯多损失方法,以有效平衡药物治疗的存在性和有效性。广泛的实验证明了我们框架的有效性,与 SOTA 基线方法相比,DrugRepPT 的性能有了显著提高(Hit@1 提高了 106.13%,平均倒数等级提高了 54.45%)。通过文献验证、网络医学分析和对接筛选,两个案例研究(即胃炎和脂肪肝)进一步验证了预测结果的可靠性:代码和结果见 https://github.com/2020MEAI/DrugRepPT.Supplementary 信息:补充数据可在 Bioinformatics online 上获取。
{"title":"DrugRepPT: a deep pre-training and fine-tuning framework for drug repositioning based on drug's expression perturbation and treatment effectiveness.","authors":"Shuyue Fan, Kuo Yang, Kezhi Lu, Xin Dong, Xianan Li, Qiang Zhu, Shao Li, Jianyang Zeng, Xuezhong Zhou","doi":"10.1093/bioinformatics/btae692","DOIUrl":"https://doi.org/10.1093/bioinformatics/btae692","url":null,"abstract":"<p><strong>Motivation: </strong>Drug repositioning, identifying novel indications for approved drugs, is a cost-effective strategy in drug discovery. Despite numerous proposed drug repositioning models, integrating network-based features, differential gene expression, and chemical structures for high-performance drug repositioning remains challenging.</p><p><strong>Results: </strong>We propose a comprehensive deep pre-training and fine-tuning framework for drug repositioning, termed DrugRepPT. Initially, we design a graph pre-training module employing model-augmented contrastive learning on a vast drug-disease heterogeneous graph to capture nuanced interactions and expression perturbations after intervention. Subsequently, we introduce a fine-tuning module leveraging a graph residual-like convolution network to elucidate intricate interactions between diseases and drugs. Moreover, a Bayesian multi-loss approach is introduced to balance the existence and effectiveness of drug treatment effectively. Extensive experiments showcase the efficacy of our framework, with DrugRepPT exhibiting remarkable performance improvements compared to SOTA baseline methods (Improvement 106.13% on Hit@1 and 54.45% on mean reciprocal rank). The reliability of predicted results is further validated through two case studies, ie, gastritis and fatty liver, via literature validation, network medicine analysis, and docking screening.</p><p><strong>Availability and implementation: </strong>The code and results are available at https://github.com/2020MEAI/DrugRepPT.</p><p><strong>Supplementary information: </strong>Supplementary data are available at Bioinformatics online.</p>","PeriodicalId":93899,"journal":{"name":"Bioinformatics (Oxford, England)","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-11-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142678057","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
PhosX: data-driven kinase activity inference from phosphoproteomics experiments. PhosX:从磷酸蛋白组学实验中推断数据驱动的激酶活性。
Pub Date : 2024-11-19 DOI: 10.1093/bioinformatics/btae697
Alessandro Lussana, Sophia Müller-Dott, Julio Saez-Rodriguez, Evangelia Petsalaki

Summary: The inference of kinase activity from phosphoproteomics data can point to causal mechanisms driving signalling processes and potential drug targets. Identifying the kinases whose change in activity explains the observed phosphorylation profiles, however, remains challenging, and constrained by the manually curated knowledge of kinase-substrate associations. Recently, experimentally determined substrate sequence specificities of human kinases have become available, but robust methods to exploit this new data for kinase activity inference are still missing. We present PhosX, a method to estimate differential kinase activity from phosphoproteomics data that combines state-of-the art statistics in enrichment analysis with kinases' substrate sequence specificity information. Using a large phosphoproteomics dataset with known differentially regulated kinases we show that our method identifies upregulated and downregulated kinases by only relying on the input phosphopeptides' sequences and intensity changes. We find that PhosX outperforms the currently available approach for the same task, and performs better or similarly to state-of-the-art methods that rely on previously known kinase-substrate associations. We therefore recommend its use for data-driven kinase activity inference.

Availability and implementation: PhosX is implemented in Python, open-source under the Apache-2.0 licence, and distributed on the Python Package Index. The code is available on GitHub (https://github.com/alussana/phosx).

Supplementary information: Supplementary data are available at Bioinformatics online.

摘要:从磷酸化蛋白质组学数据中推断激酶活性可以指出驱动信号过程和潜在药物靶点的因果机制。然而,要确定哪些激酶的活性变化可以解释观察到的磷酸化图谱,仍然具有挑战性,并且受到人工整理的激酶-底物关联知识的限制。最近,通过实验确定的人类激酶底物序列特异性已经可用,但利用这些新数据进行激酶活性推断的可靠方法仍然缺乏。我们介绍的 PhosX 是一种从磷酸蛋白组学数据中估算不同激酶活性的方法,它将富集分析中最先进的统计方法与激酶底物序列特异性信息相结合。通过使用一个包含已知差异调控激酶的大型磷酸蛋白组学数据集,我们发现我们的方法仅依靠输入磷酸肽的序列和强度变化就能识别上调和下调的激酶。我们发现,PhosX 的表现优于目前可用于相同任务的方法,其性能也优于或类似于依赖先前已知激酶-底物关联的先进方法。因此,我们推荐将其用于数据驱动的激酶活性推断:PhosX用Python实现,在Apache-2.0许可下开源,并发布在Python软件包索引上。代码可在 GitHub (https://github.com/alussana/phosx) 上获取。补充信息:补充数据可在 Bioinformatics online 上获取。
{"title":"PhosX: data-driven kinase activity inference from phosphoproteomics experiments.","authors":"Alessandro Lussana, Sophia Müller-Dott, Julio Saez-Rodriguez, Evangelia Petsalaki","doi":"10.1093/bioinformatics/btae697","DOIUrl":"https://doi.org/10.1093/bioinformatics/btae697","url":null,"abstract":"<p><strong>Summary: </strong>The inference of kinase activity from phosphoproteomics data can point to causal mechanisms driving signalling processes and potential drug targets. Identifying the kinases whose change in activity explains the observed phosphorylation profiles, however, remains challenging, and constrained by the manually curated knowledge of kinase-substrate associations. Recently, experimentally determined substrate sequence specificities of human kinases have become available, but robust methods to exploit this new data for kinase activity inference are still missing. We present PhosX, a method to estimate differential kinase activity from phosphoproteomics data that combines state-of-the art statistics in enrichment analysis with kinases' substrate sequence specificity information. Using a large phosphoproteomics dataset with known differentially regulated kinases we show that our method identifies upregulated and downregulated kinases by only relying on the input phosphopeptides' sequences and intensity changes. We find that PhosX outperforms the currently available approach for the same task, and performs better or similarly to state-of-the-art methods that rely on previously known kinase-substrate associations. We therefore recommend its use for data-driven kinase activity inference.</p><p><strong>Availability and implementation: </strong>PhosX is implemented in Python, open-source under the Apache-2.0 licence, and distributed on the Python Package Index. The code is available on GitHub (https://github.com/alussana/phosx).</p><p><strong>Supplementary information: </strong>Supplementary data are available at Bioinformatics online.</p>","PeriodicalId":93899,"journal":{"name":"Bioinformatics (Oxford, England)","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-11-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142677173","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Mutual information for detecting multi-class biomarkers when integrating multiple bulk or single-cell transcriptomic studies. 在整合多个批量或单细胞转录组研究时检测多类生物标记物的互信息。
Pub Date : 2024-11-19 DOI: 10.1093/bioinformatics/btae696
Jian Zou, Zheqi Li, Neil Carleton, Steffi Oesterreich, Adrian V Lee, George C Tseng

Motivation: Biomarker detection plays a pivotal role in biomedical research. Integrating omics studies from multiple cohorts can enhance statistical power, accuracy and robustness of the detection results. However, existing methods for horizontally combining omics studies are mostly designed for two-class scenarios (e.g., cases versus controls) and are not directly applicable for studies with multi-class design (e.g., samples from multiple disease subtypes, treatments, tissues, or cell types).

Results: We propose a statistical framework, namely Mutual Information Concordance Analysis (MICA), to detect biomarkers with concordant multi-class expression pattern across multiple omics studies from an information theoretic perspective. Our approach first detects biomarkers with concordant multi-class patterns across partial or all of the omics studies using a global test by mutual information. A post hoc analysis is then performed for each detected biomarkers and identify studies with concordant pattern. Extensive simulations demonstrate improved accuracy and successful false discovery rate control of MICA compared to an existing MCC method. The method is then applied to two practical scenarios: four tissues of mouse metabolism-related transcriptomic studies, and three sources of estrogen treatment expression profiles. Detected biomarkers by MICA show intriguing biological insights and functional annotations. Additionally, we implemented MICA for single-cell RNA-Seq data for tumor progression biomarkers, highlighting critical roles of ribosomal function in the tumor microenvironment of triple-negative breast cancer and underscoring the potential of MICA for detecting novel therapeutic targets.

Availability: The source code is available on Figshare at https://doi.org/10.6084/m9.figshare.27635436. Additionally, the R package can be installed directly from GitHub at https://github.com/jianzou75/MICA.

Supplementary information: Supplementary data are available at Bioinformatics online.

动机生物标记物检测在生物医学研究中起着举足轻重的作用。整合来自多个队列的 omics 研究可以提高检测结果的统计能力、准确性和稳健性。然而,现有的横向联合 omics 研究方法大多是针对两类情况(如病例与对照)设计的,并不能直接适用于多类设计的研究(如来自多种疾病亚型、治疗方法、组织或细胞类型的样本):我们提出了一个统计框架,即互信息一致性分析(MICA),从信息论的角度来检测在多项omics研究中具有多类一致表达模式的生物标记物。我们的方法首先通过互信息进行全局检验,检测在部分或全部 omics 研究中具有多类一致表达模式的生物标记物。然后对每个检测到的生物标记物进行事后分析,找出具有一致模式的研究。大量的仿真证明,与现有的 MCC 方法相比,MICA 的准确性得到了提高,并成功地控制了误诊率。该方法随后被应用于两种实际情况:四种组织的小鼠代谢相关转录组学研究和三种来源的雌激素治疗表达谱。通过 MICA 检测到的生物标记物显示出令人感兴趣的生物学见解和功能注释。此外,我们还对单细胞 RNA-Seq 数据中的肿瘤进展生物标记物实施了 MICA,突出了核糖体功能在三阴性乳腺癌肿瘤微环境中的关键作用,并强调了 MICA 在检测新型治疗靶点方面的潜力:源代码可在 Figshare 网站 https://doi.org/10.6084/m9.figshare.27635436 上获取。此外,R软件包可直接从GitHub安装,网址为 https://github.com/jianzou75/MICA.Supplementary:补充数据可在 Bioinformatics online 上获取。
{"title":"Mutual information for detecting multi-class biomarkers when integrating multiple bulk or single-cell transcriptomic studies.","authors":"Jian Zou, Zheqi Li, Neil Carleton, Steffi Oesterreich, Adrian V Lee, George C Tseng","doi":"10.1093/bioinformatics/btae696","DOIUrl":"https://doi.org/10.1093/bioinformatics/btae696","url":null,"abstract":"<p><strong>Motivation: </strong>Biomarker detection plays a pivotal role in biomedical research. Integrating omics studies from multiple cohorts can enhance statistical power, accuracy and robustness of the detection results. However, existing methods for horizontally combining omics studies are mostly designed for two-class scenarios (e.g., cases versus controls) and are not directly applicable for studies with multi-class design (e.g., samples from multiple disease subtypes, treatments, tissues, or cell types).</p><p><strong>Results: </strong>We propose a statistical framework, namely Mutual Information Concordance Analysis (MICA), to detect biomarkers with concordant multi-class expression pattern across multiple omics studies from an information theoretic perspective. Our approach first detects biomarkers with concordant multi-class patterns across partial or all of the omics studies using a global test by mutual information. A post hoc analysis is then performed for each detected biomarkers and identify studies with concordant pattern. Extensive simulations demonstrate improved accuracy and successful false discovery rate control of MICA compared to an existing MCC method. The method is then applied to two practical scenarios: four tissues of mouse metabolism-related transcriptomic studies, and three sources of estrogen treatment expression profiles. Detected biomarkers by MICA show intriguing biological insights and functional annotations. Additionally, we implemented MICA for single-cell RNA-Seq data for tumor progression biomarkers, highlighting critical roles of ribosomal function in the tumor microenvironment of triple-negative breast cancer and underscoring the potential of MICA for detecting novel therapeutic targets.</p><p><strong>Availability: </strong>The source code is available on Figshare at https://doi.org/10.6084/m9.figshare.27635436. Additionally, the R package can be installed directly from GitHub at https://github.com/jianzou75/MICA.</p><p><strong>Supplementary information: </strong>Supplementary data are available at Bioinformatics online.</p>","PeriodicalId":93899,"journal":{"name":"Bioinformatics (Oxford, England)","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-11-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142676862","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Micro-DeMix: a mixture beta-multinomial model for investigating the heterogeneity of the stool microbiome compositions. Micro-DeMix:用于研究粪便微生物组组成异质性的β-多项式混合模型。
Pub Date : 2024-11-19 DOI: 10.1093/bioinformatics/btae667
Ruoqian Liu, Yue Wang, Dan Cheng

Motivation: Extensive research has uncovered the critical role of the human gut microbiome in various aspects of health, including metabolism, nutrition, physiology, and immune function. Fecal microbiota is often used as a proxy for understanding the gut microbiome, but it represents an aggregate view, overlooking spatial variations across different gastrointestinal (GI) locations. Emerging studies with spatial microbiome data collected from specific GI regions offer a unique opportunity to better understand the spatial composition of the stool microbiome.

Results: We introduce Micro-DeMix, a mixture beta-multinomial model that deconvolutes the fecal microbiome at the compositional level by integrating stool samples with spatial microbiome data. Micro-DeMix facilitates the comparison of microbial compositions across different GI regions within the stool microbiome through a hypothesis-testing framework. We demonstrate the effectiveness and efficiency of Micro-DeMix using multiple simulated data sets and the Inflammatory Bowel Disease (IBD) data from the NIH Integrative Human Microbiome Project.

Availability and implementation: The R package is available at https://github.com/liuruoqian/MicroDemix.

Supplementary information: Supplementary data are available at Bioinformatics online.

研究动机广泛的研究揭示了人类肠道微生物群在新陈代谢、营养、生理和免疫功能等各方面健康中的关键作用。粪便微生物群通常被用作了解肠道微生物群的替代物,但它代表了一种总体观点,忽略了不同胃肠道(GI)部位的空间变化。从特定胃肠道区域收集空间微生物组数据的新兴研究为更好地了解粪便微生物组的空间组成提供了一个独特的机会:结果:我们引入了 Micro-DeMix,这是一个混合 beta 多叉模型,通过整合粪便样本和空间微生物组数据,在组成水平上解卷粪便微生物组。Micro-DeMix 通过一个假设检验框架,有助于比较粪便微生物组中不同消化道区域的微生物组成。我们使用多个模拟数据集和来自美国国立卫生研究院人类微生物组整合项目的炎症性肠病(IBD)数据展示了 Micro-DeMix 的有效性和效率:R 软件包可从 https://github.com/liuruoqian/MicroDemix.Supplementary 信息中获取:补充数据可在 Bioinformatics online 上获取。
{"title":"Micro-DeMix: a mixture beta-multinomial model for investigating the heterogeneity of the stool microbiome compositions.","authors":"Ruoqian Liu, Yue Wang, Dan Cheng","doi":"10.1093/bioinformatics/btae667","DOIUrl":"https://doi.org/10.1093/bioinformatics/btae667","url":null,"abstract":"<p><strong>Motivation: </strong>Extensive research has uncovered the critical role of the human gut microbiome in various aspects of health, including metabolism, nutrition, physiology, and immune function. Fecal microbiota is often used as a proxy for understanding the gut microbiome, but it represents an aggregate view, overlooking spatial variations across different gastrointestinal (GI) locations. Emerging studies with spatial microbiome data collected from specific GI regions offer a unique opportunity to better understand the spatial composition of the stool microbiome.</p><p><strong>Results: </strong>We introduce Micro-DeMix, a mixture beta-multinomial model that deconvolutes the fecal microbiome at the compositional level by integrating stool samples with spatial microbiome data. Micro-DeMix facilitates the comparison of microbial compositions across different GI regions within the stool microbiome through a hypothesis-testing framework. We demonstrate the effectiveness and efficiency of Micro-DeMix using multiple simulated data sets and the Inflammatory Bowel Disease (IBD) data from the NIH Integrative Human Microbiome Project.</p><p><strong>Availability and implementation: </strong>The R package is available at https://github.com/liuruoqian/MicroDemix.</p><p><strong>Supplementary information: </strong>Supplementary data are available at Bioinformatics online.</p>","PeriodicalId":93899,"journal":{"name":"Bioinformatics (Oxford, England)","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-11-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142678058","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
DeepDR: a deep learning library for drug response prediction. DeepDR:用于药物反应预测的深度学习库。
Pub Date : 2024-11-18 DOI: 10.1093/bioinformatics/btae688
Zhengxiang Jiang, Pengyong Li

Summary: Accurate drug response prediction is critical to advancing precision medicine and drug discovery. Recent advances in deep learning (DL) have shown promise in predicting drug response; however, the lack of convenient tools to support such modeling limits their widespread application. To address this, we introduce DeepDR, the first DL library specifically developed for drug response prediction. DeepDR simplifies the process by automating drug and cell featurization, model construction, training, and inference, all achievable with brief programming. The library incorporates three types of drug features along with nine drug encoders, four types of cell features along with nine cell encoders, and two fusion modules, enabling the implementation of up to 135 DL models for drug response prediction. We also explored benchmarking performance with DeepDR, and the optimal models are available on a user-friendly visual interface.

Availability and implementation: DeepDR can be installed from PyPI (https://pypi.org/project/deepdr). The source code and experimental data are available on GitHub (https://github.com/user15632/DeepDR).

Supplementary information: Supplementary data are available at Bioinformatics online.

摘要:准确的药物反应预测对于推进精准医疗和药物发现至关重要。深度学习(DL)的最新进展显示了预测药物反应的前景;然而,由于缺乏支持此类建模的便捷工具,限制了其广泛应用。为了解决这个问题,我们推出了 DeepDR,这是第一个专门为药物反应预测开发的深度学习库。DeepDR 通过自动进行药物和细胞特征描述、模型构建、训练和推理来简化流程,所有这些都可以通过简短的编程实现。该库包含三种类型的药物特征和九种药物编码器、四种类型的细胞特征和九种细胞编码器以及两个融合模块,可实现多达 135 个用于药物反应预测的 DL 模型。我们还利用 DeepDR 探索了基准性能,并在用户友好的可视化界面上提供了最佳模型:DeepDR 可从 PyPI (https://pypi.org/project/deepdr) 安装。源代码和实验数据可从 GitHub(https://github.com/user15632/DeepDR)获取:补充数据可在 Bioinformatics online 上获取。
{"title":"DeepDR: a deep learning library for drug response prediction.","authors":"Zhengxiang Jiang, Pengyong Li","doi":"10.1093/bioinformatics/btae688","DOIUrl":"10.1093/bioinformatics/btae688","url":null,"abstract":"<p><strong>Summary: </strong>Accurate drug response prediction is critical to advancing precision medicine and drug discovery. Recent advances in deep learning (DL) have shown promise in predicting drug response; however, the lack of convenient tools to support such modeling limits their widespread application. To address this, we introduce DeepDR, the first DL library specifically developed for drug response prediction. DeepDR simplifies the process by automating drug and cell featurization, model construction, training, and inference, all achievable with brief programming. The library incorporates three types of drug features along with nine drug encoders, four types of cell features along with nine cell encoders, and two fusion modules, enabling the implementation of up to 135 DL models for drug response prediction. We also explored benchmarking performance with DeepDR, and the optimal models are available on a user-friendly visual interface.</p><p><strong>Availability and implementation: </strong>DeepDR can be installed from PyPI (https://pypi.org/project/deepdr). The source code and experimental data are available on GitHub (https://github.com/user15632/DeepDR).</p><p><strong>Supplementary information: </strong>Supplementary data are available at Bioinformatics online.</p>","PeriodicalId":93899,"journal":{"name":"Bioinformatics (Oxford, England)","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-11-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142670096","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Detecting transposable elements in long read genomes using sTELLeR. 利用 sTELLeR 检测长读数基因组中的转座元件。
Pub Date : 2024-11-18 DOI: 10.1093/bioinformatics/btae686
Kristine Bilgrav Saether, Jesper Eisfeldt

Motivation: Repeat elements such as transposable elements (TE), are highly repetitive DNA sequences that compose around 50% of the genome. TEs such as Alu, SVA, HERV and L1 elements can cause disease through disrupting genes, causing frameshift mutations or altering splicing patters. These are elements challenging to characterize using short-read genome sequencing (srGS), due to its read length and TEs repetitive nature. Long read genome sequencing (lrGS) enables bridging of TEs, allowing increased resolution across repetitive DNA sequences. lrGS therefore present an opportunity for improved TE detection and analysis, not only from a research perspective, but also for future clinical detection. When choosing a lrGS TE caller, parameters such as runtime, CPU hours, sensitivity, precision and compatibility with inclusion into pipelines are crucial for efficient detection.

Results: We therefore developed sTELLeR, (s) Transposable ELement in Long (e) Read, for accurate, fast and effective TE detection. Particularly, sTELLeR exhibit higher precision and sensitivity for calling of Alu elements than similar tools. The caller is 5-48x as fast and uses <2% of the CPU hours compared to competitive callers. The caller is haplotype aware and output results in a VCF file, enabling compatibility with other variant callers and downstream analysis.

Availability: sTELLeR is a python-based tool and is available at https://github.com/kristinebilgrav/sTELLeR. Altogether, we show that sTELLeR is a fast, sensitive and precise caller for detection of TE elements, and can easily be implemented into variant calling workflows.

Supplementary information: Supplementary data are available at Bioinformatics online.

动机可转座元件(TE)等重复元件是高度重复的 DNA 序列,约占基因组的 50%。Alu、SVA、HERV 和 L1 等可转座元件可通过破坏基因、导致换框突变或改变剪接模式而致病。由于短读数基因组测序(srGS)的读数长度和TEs的重复性,这些元素的特征描述具有挑战性。因此,长读数基因组测序(lrGS)为改进 TE 检测和分析提供了机会,不仅从研究角度来看是如此,在未来的临床检测中也是如此。在选择 lrGS TE 调用器时,运行时间、CPU 小时数、灵敏度、精确度以及与纳入管道的兼容性等参数对于高效检测至关重要:因此,我们开发了 sTELLeR(s) Transposable ELement in Long (e) Read,用于准确、快速、有效地检测 TE。特别是,与同类工具相比,sTELLeR 在调用 Alu 元素方面表现出更高的精度和灵敏度。调用速度是同类工具的5-48倍,可用性:sTELLeR是一个基于python的工具,可在https://github.com/kristinebilgrav/sTELLeR。总之,我们证明了 sTELLeR 是一种快速、灵敏、精确的 TE 元素检测调用工具,可以很容易地应用到变异调用工作流中:补充数据可在 Bioinformatics online 上获取。
{"title":"Detecting transposable elements in long read genomes using sTELLeR.","authors":"Kristine Bilgrav Saether, Jesper Eisfeldt","doi":"10.1093/bioinformatics/btae686","DOIUrl":"10.1093/bioinformatics/btae686","url":null,"abstract":"<p><strong>Motivation: </strong>Repeat elements such as transposable elements (TE), are highly repetitive DNA sequences that compose around 50% of the genome. TEs such as Alu, SVA, HERV and L1 elements can cause disease through disrupting genes, causing frameshift mutations or altering splicing patters. These are elements challenging to characterize using short-read genome sequencing (srGS), due to its read length and TEs repetitive nature. Long read genome sequencing (lrGS) enables bridging of TEs, allowing increased resolution across repetitive DNA sequences. lrGS therefore present an opportunity for improved TE detection and analysis, not only from a research perspective, but also for future clinical detection. When choosing a lrGS TE caller, parameters such as runtime, CPU hours, sensitivity, precision and compatibility with inclusion into pipelines are crucial for efficient detection.</p><p><strong>Results: </strong>We therefore developed sTELLeR, (s) Transposable ELement in Long (e) Read, for accurate, fast and effective TE detection. Particularly, sTELLeR exhibit higher precision and sensitivity for calling of Alu elements than similar tools. The caller is 5-48x as fast and uses <2% of the CPU hours compared to competitive callers. The caller is haplotype aware and output results in a VCF file, enabling compatibility with other variant callers and downstream analysis.</p><p><strong>Availability: </strong>sTELLeR is a python-based tool and is available at https://github.com/kristinebilgrav/sTELLeR. Altogether, we show that sTELLeR is a fast, sensitive and precise caller for detection of TE elements, and can easily be implemented into variant calling workflows.</p><p><strong>Supplementary information: </strong>Supplementary data are available at Bioinformatics online.</p>","PeriodicalId":93899,"journal":{"name":"Bioinformatics (Oxford, England)","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-11-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142670098","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
APAtizer: a tool for alternative polyadenylation analysis of RNA-Seq data. APAtizer:RNA-Seq 数据的替代多腺苷酸化分析工具。
Pub Date : 2024-11-18 DOI: 10.1093/bioinformatics/btae689
Bruno Sousa, Maria Bessa, Filipa L de Mendonça, Pedro G Ferreira, Alexandra Moreira, Isabel Pereira-Castro

Summary: APAtizer is a tool designed to analyze alternative polyadenylation events on RNA-sequencing data. The tool handles different file formats, including BAM, htseq and DaPars bedGraph files. It provides a user-friendly interface that allows users to generate informative visualizations, including Volcano plots, heatmaps and gene lists. These outputs allow the user to retrieve useful biological insights such as the occurrence of polyadenylation events when comparing two biological conditions. Additionally, it can perform differential gene expression, gene ontology analysis, visualization of Venn diagram intersections and correlation analysis.

Availability and implementation: Source code and example case studies for APAtizer are available at https://github.com/GeneRegulationi3S/APAtizer/.

摘要:APAtizer 是一款用于分析 RNA 序列数据中替代多腺苷酸化事件的工具。该工具可处理不同的文件格式,包括 BAM、htseq 和 DaPars bedGraph 文件。它提供了一个用户友好界面,允许用户生成信息丰富的可视化结果,包括火山图、热图和基因列表。通过这些输出结果,用户可以获得有用的生物学见解,例如在比较两种生物条件时,多聚腺苷酸化事件的发生情况。此外,它还能进行差异基因表达、基因本体分析、维恩图交叉点可视化和相关性分析:APAtizer的源代码和示例研究可在https://github.com/GeneRegulationi3S/APAtizer/。
{"title":"APAtizer: a tool for alternative polyadenylation analysis of RNA-Seq data.","authors":"Bruno Sousa, Maria Bessa, Filipa L de Mendonça, Pedro G Ferreira, Alexandra Moreira, Isabel Pereira-Castro","doi":"10.1093/bioinformatics/btae689","DOIUrl":"10.1093/bioinformatics/btae689","url":null,"abstract":"<p><strong>Summary: </strong>APAtizer is a tool designed to analyze alternative polyadenylation events on RNA-sequencing data. The tool handles different file formats, including BAM, htseq and DaPars bedGraph files. It provides a user-friendly interface that allows users to generate informative visualizations, including Volcano plots, heatmaps and gene lists. These outputs allow the user to retrieve useful biological insights such as the occurrence of polyadenylation events when comparing two biological conditions. Additionally, it can perform differential gene expression, gene ontology analysis, visualization of Venn diagram intersections and correlation analysis.</p><p><strong>Availability and implementation: </strong>Source code and example case studies for APAtizer are available at https://github.com/GeneRegulationi3S/APAtizer/.</p>","PeriodicalId":93899,"journal":{"name":"Bioinformatics (Oxford, England)","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-11-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142670092","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Bioinformatics (Oxford, England)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1