Bioinformatics advances最新文献

英文中文

SeqLengthPlot v2.0: an all-in-one, easy-to-use tool for visualizing and retrieving sequence lengths from FASTA files. SeqLengthPlot v2.0：一款一体化的易用工具，用于从 FASTA 文件中可视化和检索序列长度。

IF 2.4 Q2 MATHEMATICAL & COMPUTATIONAL BIOLOGY

Bioinformatics advances

Pub Date : 2024-11-20 eCollection Date: 2025-01-01 DOI: 10.1093/bioadv/vbae183

Dany Domínguez-Pérez, Guillermin Agüero-Chapin, Serena Leone, Maria Vittoria Modica

Motivation: Accurate sequence length profiling is essential in bioinformatics, particularly in genomics and proteomics. Existing tools like SeqKit and the Trinity toolkit provide basic sequence statistics but often fall short in offering comprehensive analytics and plotting options. For instance, SeqKit is a very complete and fast tool for sequence analysis, delivering useful metrics (e.g. number of sequences, average, minimum, and maximum lengths) and can return sequences either shorter or longer (but not both at once) for a given length. Similarly, Trinity's Perl-based scripts provide detailed contig length distributions (e.g. N50, median, and average lengths) but do not include the total number of sequences or offer graphical representations of the data.

Results: Given that key sequence analysis tasks are often distributed across multiple tools, we introduce SeqLengthPlot v2.0, an all-in-one, easy-to-use Python-based tool. Through a simple command-line interface, this straightforward tool enables users to split input FASTA files (nucleotide and protein) into two distinct files based on a customizable sequence length cutoff. It also automatically retrieves the resulting FASTA files, generates length distribution plots, and provides comprehensive statistical summaries.

Availability and implementation: SeqLengthPlot_v2.0.2 can be accessed at https://github.com/danydguezperez/SeqLengthPlot/releases/tag/v2.0.2.

动机：准确的序列长度分析在生物信息学，特别是基因组学和蛋白质组学中是必不可少的。现有的工具，如SeqKit和Trinity工具包，提供基本的序列统计，但往往不能提供全面的分析和绘图选项。例如，SeqKit是一个非常完整和快速的序列分析工具，提供有用的指标（例如序列数量，平均，最小和最大长度），并且可以返回给定长度的序列或短或长（但不是同时返回）。类似地，Trinity的基于perl的脚本提供了详细的组长度分布（例如N50、中位数和平均长度），但不包括序列的总数或提供数据的图形表示。结果：考虑到关键序列分析任务通常分布在多个工具之间，我们介绍了SeqLengthPlot v2.0，这是一个集所有功能于一体、易于使用的基于python的工具。通过一个简单的命令行界面，这个简单的工具使用户能够根据可定制的序列长度截断将输入FASTA文件（核苷酸和蛋白质）拆分为两个不同的文件。它还自动检索结果FASTA文件，生成长度分布图，并提供全面的统计摘要。可用性和实现：可通过https://github.com/danydguezperez/SeqLengthPlot/releases/tag/v2.0.2访问SeqLengthPlot_v2.0.2。

{"title":"SeqLengthPlot v2.0: an all-in-one, easy-to-use tool for visualizing and retrieving sequence lengths from FASTA files.","authors":"Dany Domínguez-Pérez, Guillermin Agüero-Chapin, Serena Leone, Maria Vittoria Modica","doi":"10.1093/bioadv/vbae183","DOIUrl":"10.1093/bioadv/vbae183","url":null,"abstract":"Motivation: Accurate sequence length profiling is essential in bioinformatics, particularly in genomics and proteomics. Existing tools like SeqKit and the Trinity toolkit provide basic sequence statistics but often fall short in offering comprehensive analytics and plotting options. For instance, SeqKit is a very complete and fast tool for sequence analysis, delivering useful metrics (e.g. number of sequences, average, minimum, and maximum lengths) and can return sequences either shorter or longer (but not both at once) for a given length. Similarly, Trinity's Perl-based scripts provide detailed contig length distributions (e.g. N50, median, and average lengths) but do not include the total number of sequences or offer graphical representations of the data.Results: Given that key sequence analysis tasks are often distributed across multiple tools, we introduce SeqLengthPlot v2.0, an all-in-one, easy-to-use Python-based tool. Through a simple command-line interface, this straightforward tool enables users to split input FASTA files (nucleotide and protein) into two distinct files based on a customizable sequence length cutoff. It also automatically retrieves the resulting FASTA files, generates length distribution plots, and provides comprehensive statistical summaries.Availability and implementation: SeqLengthPlot_v2.0.2 can be accessed at https://github.com/danydguezperez/SeqLengthPlot/releases/tag/v2.0.2.","PeriodicalId":72368,"journal":{"name":"Bioinformatics advances","volume":"5 1","pages":"vbae183"},"PeriodicalIF":2.4,"publicationDate":"2024-11-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11671033/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142904226","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

KTED: a comprehensive web-based database for transposable elements in the Korean genome. 韩国基因组转座因子的综合网络数据库。

IF 2.4 Q2 MATHEMATICAL & COMPUTATIONAL BIOLOGY

Bioinformatics advances

Pub Date : 2024-11-19 eCollection Date: 2024-01-01 DOI: 10.1093/bioadv/vbae179

Jin-Ok Lee, Sejoon Lee, Dongyoon Lee, Taeyeon Hwang, Soobok Joe, Jin Ok Yang, Jibin Jeong, Jung Hun Ohn, Jee Hyun Kim

Summary: Transposable elements (TEs), commonly referred to as "mobile elements," constitute DNA segments capable of relocating within a genome. Initially disregarded as "junk DNA" devoid of specific functionality, it has become evident that TEs have diverse influences on an organism's biology and health. The impact of these elements varies according to their location, classification, and their effects on specific genes or regulatory components. Despite their significant roles, a paucity of resources concerning TEs in population-scale genome sequencing remains. Herein, we analyze whole-genome sequencing data sourced from the Korean Genome and Epidemiology Study, encompassing 2500 Korean individuals. To facilitate convenient data access and observation, we developed a web-based database, KTED. Additionally, we scrutinized the differential distributions of TEs across five distinct common disease groups: dyslipidemia, hypertension, diabetes, thyroid disease, and cancer.

Availability and implementation: https://snubh.shinyapps.io/KTED.

摘要：转座元件（te），通常被称为“可移动元件”，构成能够在基因组内重新定位的DNA片段。最初被认为是缺乏特定功能的“垃圾DNA”，但很明显，te对生物体的生物学和健康有多种影响。这些元素的影响根据它们的位置、分类以及它们对特定基因或调控成分的影响而变化。尽管它们具有重要的作用，但在种群规模的基因组测序中，关于te的资源仍然缺乏。在此，我们分析了来自韩国基因组和流行病学研究的全基因组测序数据，其中包括2500名韩国人。为了方便查阅和观察数据，我们开发了一个基于网络的数据库KTED。此外，我们仔细检查了TEs在五种不同常见疾病组中的差异分布：血脂异常、高血压、糖尿病、甲状腺疾病和癌症。可用性和实现：https://snubh.shinyapps.io/KTED。

{"title":"KTED: a comprehensive web-based database for transposable elements in the Korean genome.","authors":"Jin-Ok Lee, Sejoon Lee, Dongyoon Lee, Taeyeon Hwang, Soobok Joe, Jin Ok Yang, Jibin Jeong, Jung Hun Ohn, Jee Hyun Kim","doi":"10.1093/bioadv/vbae179","DOIUrl":"10.1093/bioadv/vbae179","url":null,"abstract":"Summary: Transposable elements (TEs), commonly referred to as \"mobile elements,\" constitute DNA segments capable of relocating within a genome. Initially disregarded as \"junk DNA\" devoid of specific functionality, it has become evident that TEs have diverse influences on an organism's biology and health. The impact of these elements varies according to their location, classification, and their effects on specific genes or regulatory components. Despite their significant roles, a paucity of resources concerning TEs in population-scale genome sequencing remains. Herein, we analyze whole-genome sequencing data sourced from the Korean Genome and Epidemiology Study, encompassing 2500 Korean individuals. To facilitate convenient data access and observation, we developed a web-based database, KTED. Additionally, we scrutinized the differential distributions of TEs across five distinct common disease groups: dyslipidemia, hypertension, diabetes, thyroid disease, and cancer.Availability and implementation: https://snubh.shinyapps.io/KTED.","PeriodicalId":72368,"journal":{"name":"Bioinformatics advances","volume":"4 1","pages":"vbae179"},"PeriodicalIF":2.4,"publicationDate":"2024-11-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11652267/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142857185","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

The ISCB competency framework v. 3: a revised and extended standard for bioinformatics education and training. ISCB能力框架v. 3：修订和扩展的生物信息学教育和培训标准。

IF 2.4 Q2 MATHEMATICAL & COMPUTATIONAL BIOLOGY

Bioinformatics advances

Pub Date : 2024-11-18 eCollection Date: 2024-01-01 DOI: 10.1093/bioadv/vbae166

Cath Brooksbank, Michelle D Brazas, Nicola Mulder, Russell Schwartz, Verena Ras, Sarah L Morgan, Marta Lloret Llinares, Patricia Carvajal López, Lee Larcombe, Amel Ghouila, Tom Hancocks, Venkata Satagopam, Javier De Las Rivas, Gaston Mazandu, Bruno Gaeta

Motivation: Developing competency in the broad area of bioinformatics is challenging globally, owing to the breadth of the field and the diversity of its audiences for education and training. Course design can be facilitated by the use of a competency framework-a set of competency requirements that define the knowledge, skills and attitudes needed by individuals in (or aspiring to be in) a particular profession or role. These competency requirements can help to define curricula as they can inform both the content and level to which competency needs to be developed. The International Society for Computational Biology (ISCB) developed a list of bioinformatics competencies in 2014, and these have undergone several rounds of improvement. In consultation with a broad bioinformatics training community, these have now been further refined and extended to include knowledge skills and attitudes, and mappings to previous and other existing competency frameworks.

Results: Here, we present version 3 of the ISCB competency framework. We describe how it was developed and how to access it, as well as providing some examples of how it has been used.

Availability and implementation: The framework is openly accessible at https://competency.ebi.ac.uk/framework/iscb/3.0/competencies.

动机：由于生物信息学领域的广泛性及其教育和培训对象的多样性，在全球范围内培养生物信息学领域的能力具有挑战性。能力框架是一套能力要求，它规定了从事（或希望从事）特定职业或角色的个人所需的知识、技能和态度。这些能力要求可以帮助确定课程，因为它们可以告知需要培养的能力的内容和水平。国际计算生物学会（ISCB）于2014年制定了生物信息学能力清单，并对其进行了多轮改进。经过与生物信息学培训界的广泛磋商，这些能力现在得到了进一步完善和扩展，包括知识技能和态度，以及与以前和其他现有能力框架的映射：结果：在此，我们介绍 ISCB 能力框架的第三版。我们介绍了该框架的开发过程和使用方法，并提供了一些使用实例：该框架可通过 https://competency.ebi.ac.uk/framework/iscb/3.0/competencies 公开获取。

{"title":"The ISCB competency framework v. 3: a revised and extended standard for bioinformatics education and training.","authors":"Cath Brooksbank, Michelle D Brazas, Nicola Mulder, Russell Schwartz, Verena Ras, Sarah L Morgan, Marta Lloret Llinares, Patricia Carvajal López, Lee Larcombe, Amel Ghouila, Tom Hancocks, Venkata Satagopam, Javier De Las Rivas, Gaston Mazandu, Bruno Gaeta","doi":"10.1093/bioadv/vbae166","DOIUrl":"10.1093/bioadv/vbae166","url":null,"abstract":"Motivation: Developing competency in the broad area of bioinformatics is challenging globally, owing to the breadth of the field and the diversity of its audiences for education and training. Course design can be facilitated by the use of a competency framework-a set of competency requirements that define the knowledge, skills and attitudes needed by individuals in (or aspiring to be in) a particular profession or role. These competency requirements can help to define curricula as they can inform both the content and level to which competency needs to be developed. The International Society for Computational Biology (ISCB) developed a list of bioinformatics competencies in 2014, and these have undergone several rounds of improvement. In consultation with a broad bioinformatics training community, these have now been further refined and extended to include knowledge skills and attitudes, and mappings to previous and other existing competency frameworks.Results: Here, we present version 3 of the ISCB competency framework. We describe how it was developed and how to access it, as well as providing some examples of how it has been used.Availability and implementation: The framework is openly accessible at https://competency.ebi.ac.uk/framework/iscb/3.0/competencies.","PeriodicalId":72368,"journal":{"name":"Bioinformatics advances","volume":"4 1","pages":"vbae166"},"PeriodicalIF":2.4,"publicationDate":"2024-11-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11646570/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142831057","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Flexible fitting of AlphaFold2-predicted models to cryo-EM density maps using elastic network models: a methodical affirmation.

IF 2.4 Q2 MATHEMATICAL & COMPUTATIONAL BIOLOGY

Bioinformatics advances

Pub Date : 2024-11-18 eCollection Date: 2025-01-01 DOI: 10.1093/bioadv/vbae181

Maytha Alshammari, Jing He, Willy Wriggers

Motivation: This study investigates the flexible refinement of AlphaFold2 models against corresponding cryo-electron microscopy (cryo-EM) maps using normal modes derived from elastic network models (ENMs) as basis functions for displacement. AlphaFold2 generally predicts highly accurate structures, but 18 of the 137 models of isolated chains exhibit a TM-score below 0.80. We achieved a significant improvement in four of these deviating structures and used them to systematically optimize the parameters of the ENM motion model.

Results: We successfully refined four AlphaFold2 models with notable discrepancies: lipid-preserved respiratory supercomplex (TM-score increased from 0.52 to 0.69), flagellar L-ring protein (TM-score increased from 0.53 to 0.64), cation diffusion facilitator YiiP (TM-score increased from 0.76 to 0.83), and Sulfolobus islandicus pilus (TM-score increased from 0.77 to 0.85). We explored the effect of three different mode ranges (modes 1-9, 7-9, and 1-12), masked or box-cropped density maps, numerical optimization methods, and two similarity measures (Pearson correlation and inner product). The best results were achieved for the widest mode range (modes 1-12), masked maps, inner product, and local Powell optimization. These optimal parameters were implemented in the flexible fitting utility elforge.py in version 1.4 of our Python-based ModeHunter package.

Availability and implementation: https://modehunter.biomachina.org.

{"title":"Flexible fitting of AlphaFold2-predicted models to cryo-EM density maps using elastic network models: a methodical affirmation.","authors":"Maytha Alshammari, Jing He, Willy Wriggers","doi":"10.1093/bioadv/vbae181","DOIUrl":"10.1093/bioadv/vbae181","url":null,"abstract":"Motivation: This study investigates the flexible refinement of AlphaFold2 models against corresponding cryo-electron microscopy (cryo-EM) maps using normal modes derived from elastic network models (ENMs) as basis functions for displacement. AlphaFold2 generally predicts highly accurate structures, but 18 of the 137 models of isolated chains exhibit a TM-score below 0.80. We achieved a significant improvement in four of these deviating structures and used them to systematically optimize the parameters of the ENM motion model.Results: We successfully refined four AlphaFold2 models with notable discrepancies: lipid-preserved respiratory supercomplex (TM-score increased from 0.52 to 0.69), flagellar L-ring protein (TM-score increased from 0.53 to 0.64), cation diffusion facilitator YiiP (TM-score increased from 0.76 to 0.83), and Sulfolobus islandicus pilus (TM-score increased from 0.77 to 0.85). We explored the effect of three different mode ranges (modes 1-9, 7-9, and 1-12), masked or box-cropped density maps, numerical optimization methods, and two similarity measures (Pearson correlation and inner product). The best results were achieved for the widest mode range (modes 1-12), masked maps, inner product, and local Powell optimization. These optimal parameters were implemented in the flexible fitting utility elforge.py in version 1.4 of our Python-based ModeHunter package.Availability and implementation: https://modehunter.biomachina.org.","PeriodicalId":72368,"journal":{"name":"Bioinformatics advances","volume":"5 1","pages":"vbae181"},"PeriodicalIF":2.4,"publicationDate":"2024-11-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11783307/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143082478","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

MeTEor: an R Shiny app for exploring longitudinal metabolomics data. MeTEor：一个用于探索纵向代谢组学数据的R Shiny应用程序。

IF 2.4 Q2 MATHEMATICAL & COMPUTATIONAL BIOLOGY

Bioinformatics advances

Pub Date : 2024-11-14 eCollection Date: 2024-01-01 DOI: 10.1093/bioadv/vbae178

Gordon Grabert, Daniel Dehncke, Tushar More, Markus List, Anke R M Kraft, Markus Cornberg, Karsten Hiller, Tim Kacprowski

Motivation: The availability of longitudinal omics data is increasing in metabolomics research. Viewing metabolomics data over time provides detailed insight into biological processes and fosters understanding of how systems react over time. However, the analysis of longitudinal metabolomics data poses various challenges, both in terms of statistical evaluation and visualization.

Results: To make explorative analysis of longitudinal data readily available to researchers without formal background in computer science and programming, we present MEtabolite Trajectory ExplORer (MeTEor). MeTEor is an R Shiny app providing a comprehensive set of statistical analysis methods. To demonstrate the capabilities of MeTEor, we replicated the analysis of metabolomics data from a previously published study on COVID-19 patients.

Availability and implementation: MeTEor is available as an R package and as a Docker image. Source code and instructions for setting up the app can be found on GitHub (https://github.com/scibiome/meteor). The Docker image is available at Docker Hub (https://hub.docker.com/r/gordomics/meteor). MeTEor has been tested on Microsoft Windows, Unix/Linux, and macOS.

动机：在代谢组学研究中，纵向组学数据的可用性正在增加。随着时间的推移查看代谢组学数据可以详细了解生物过程，并促进对系统如何随时间反应的理解。然而，纵向代谢组学数据的分析在统计评估和可视化方面都面临着各种挑战。结果：为了使没有计算机科学和编程背景的研究人员能够很容易地对纵向数据进行探索性分析，我们提出了代谢物轨迹探索者（MeTEor）。MeTEor是一个R Shiny的应用程序，提供了一套全面的统计分析方法。为了证明MeTEor的能力，我们复制了之前发表的一项关于COVID-19患者的研究的代谢组学数据分析。可用性和实现：MeTEor可以作为R包和Docker镜像使用。可以在GitHub （https://github.com/scibiome/meteor）上找到设置应用程序的源代码和说明。Docker镜像可以在Docker Hub （https://hub.docker.com/r/gordomics/meteor）上获得。MeTEor已经在Microsoft Windows、Unix/Linux和macOS上进行了测试。

{"title":"MeTEor: an R Shiny app for exploring longitudinal metabolomics data.","authors":"Gordon Grabert, Daniel Dehncke, Tushar More, Markus List, Anke R M Kraft, Markus Cornberg, Karsten Hiller, Tim Kacprowski","doi":"10.1093/bioadv/vbae178","DOIUrl":"10.1093/bioadv/vbae178","url":null,"abstract":"Motivation: The availability of longitudinal omics data is increasing in metabolomics research. Viewing metabolomics data over time provides detailed insight into biological processes and fosters understanding of how systems react over time. However, the analysis of longitudinal metabolomics data poses various challenges, both in terms of statistical evaluation and visualization.Results: To make explorative analysis of longitudinal data readily available to researchers without formal background in computer science and programming, we present MEtabolite Trajectory ExplORer (MeTEor). MeTEor is an R Shiny app providing a comprehensive set of statistical analysis methods. To demonstrate the capabilities of MeTEor, we replicated the analysis of metabolomics data from a previously published study on COVID-19 patients.Availability and implementation: MeTEor is available as an R package and as a Docker image. Source code and instructions for setting up the app can be found on GitHub (https://github.com/scibiome/meteor). The Docker image is available at Docker Hub (https://hub.docker.com/r/gordomics/meteor). MeTEor has been tested on Microsoft Windows, Unix/Linux, and macOS.","PeriodicalId":72368,"journal":{"name":"Bioinformatics advances","volume":"4 1","pages":"vbae178"},"PeriodicalIF":2.4,"publicationDate":"2024-11-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11631383/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142808674","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

MultiOmicsIntegrator: a nextflow pipeline for integrated omics analyses. MultiOmicsIntegrator：一个用于综合 omics 分析的 nextflow 管道。

IF 2.4 Q2 MATHEMATICAL & COMPUTATIONAL BIOLOGY

Bioinformatics advances

Pub Date : 2024-11-14 eCollection Date: 2024-01-01 DOI: 10.1093/bioadv/vbae175

Bianka Alexandra Pasat, Eleftherios Pilalis, Katarzyna Mnich, Afshin Samali, Aristotelis Chatziioannou, Adrienne M Gorman

Motivation: Analysis of gene and isoform expression levels is becoming critical for the detailed understanding of biochemical mechanisms. In addition, integrating RNA-seq data with other omics data types, such as proteomics and metabolomics, provides a strong approach for consolidating our understanding of biological processes across various organizational tiers, thus promoting the identification of potential therapeutic targets.

Results: We present our pipeline, called MultiOmicsIntegrator (MOI), an inclusive pipeline for comprehensive omics analyses. MOI represents a unified approach that performs in-depth individual analyses of diverse omics. Specifically, exhaustive analysis of RNA-seq data at the level of genes, isoforms of genes, as well as miRNA is offered, coupled with functional annotation and structure prediction of these transcripts. Additionally, proteomics and metabolomics data are supported providing a holistic view of biological systems. Finally, MOI has tools to integrate simultaneously multiple and diverse omics datasets, with both data- and function-driven approaches, fostering a deeper understanding of intricate biological interactions.

Availability and implementation: MOI and ReadTheDocs.

动机基因和同工酶表达水平的分析对于详细了解生化机制至关重要。此外，RNA-seq 数据与蛋白质组学和代谢组学等其他全息数据类型的整合，为巩固我们对不同组织层级的生物过程的理解提供了强有力的方法，从而促进了潜在治疗靶点的确定：我们介绍了名为 "MultiOmicsIntegrator (MOI) "的管道，它是一种用于综合全局组学分析的包容性管道。MOI 是一种统一的方法，可对不同的 omics 进行深入的单独分析。具体来说，MOI 可在基因、基因同工酶和 miRNA 水平上对 RNA-seq 数据进行详尽分析，并对这些转录本进行功能注释和结构预测。此外，还支持蛋白质组学和代谢组学数据，提供生物系统的整体视图。最后，MOI 还提供了同时整合多个不同的 omics 数据集的工具，采用数据和功能驱动的方法，加深对错综复杂的生物相互作用的理解：MOI 和 ReadTheDocs。

{"title":"MultiOmicsIntegrator: a nextflow pipeline for integrated omics analyses.","authors":"Bianka Alexandra Pasat, Eleftherios Pilalis, Katarzyna Mnich, Afshin Samali, Aristotelis Chatziioannou, Adrienne M Gorman","doi":"10.1093/bioadv/vbae175","DOIUrl":"10.1093/bioadv/vbae175","url":null,"abstract":"Motivation: Analysis of gene and isoform expression levels is becoming critical for the detailed understanding of biochemical mechanisms. In addition, integrating RNA-seq data with other omics data types, such as proteomics and metabolomics, provides a strong approach for consolidating our understanding of biological processes across various organizational tiers, thus promoting the identification of potential therapeutic targets.Results: We present our pipeline, called MultiOmicsIntegrator (MOI), an inclusive pipeline for comprehensive omics analyses. MOI represents a unified approach that performs in-depth individual analyses of diverse omics. Specifically, exhaustive analysis of RNA-seq data at the level of genes, isoforms of genes, as well as miRNA is offered, coupled with functional annotation and structure prediction of these transcripts. Additionally, proteomics and metabolomics data are supported providing a holistic view of biological systems. Finally, MOI has tools to integrate simultaneously multiple and diverse omics datasets, with both data- and function-driven approaches, fostering a deeper understanding of intricate biological interactions.Availability and implementation: MOI and ReadTheDocs.","PeriodicalId":72368,"journal":{"name":"Bioinformatics advances","volume":"4 1","pages":"vbae175"},"PeriodicalIF":2.4,"publicationDate":"2024-11-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11576358/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142683388","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

mxfda: a comprehensive toolkit for functional data analysis of single-cell spatial data. mxfda：用于单细胞空间数据功能数据分析的综合工具包。

IF 2.4 Q2 MATHEMATICAL & COMPUTATIONAL BIOLOGY

Bioinformatics advances

Pub Date : 2024-11-13 eCollection Date: 2024-01-01 DOI: 10.1093/bioadv/vbae155

Julia Wrobel, Alex C Soupir, Mitchell T Hayes, Lauren C Peres, Thao Vu, Andrew Leroux, Brooke L Fridley

Summary: Technologies that produce spatial single-cell (SC) data have revolutionized the study of tissue microstructures and promise to advance personalized treatment of cancer by revealing new insights about the tumor microenvironment. Functional data analysis (FDA) is an ideal analytic framework for connecting cell spatial relationships to patient outcomes, but can be challenging to implement. To address this need, we present mxfda, an R package for end-to-end analysis of SC spatial data using FDA. mxfda implements a suite of methods to facilitate spatial analysis of SC imaging data using FDA techniques.

Availability and implementation: The mxfda R package is freely available at https://cran.r-project.org/package=mxfda and has detailed documentation, including four vignettes, available at http://juliawrobel.com/mxfda/.

摘要：产生空间单细胞（SC）数据的技术彻底改变了对组织微结构的研究，并有望通过揭示肿瘤微环境的新见解推进癌症的个性化治疗。功能数据分析（FDA）是将细胞空间关系与患者预后联系起来的理想分析框架，但实施起来却很困难。为了满足这一需求，我们推出了 mxfda，这是一个使用 FDA 对 SC 空间数据进行端到端分析的 R 软件包。mxfda 实现了一套方法，便于使用 FDA 技术对 SC 成像数据进行空间分析：mxfda R 软件包可从 https://cran.r-project.org/package=mxfda 免费获取，其详细文档（包括四个小节）可从 http://juliawrobel.com/mxfda/ 获取。

引用次数: 0

Conditional flux balance analysis toolbox for python: application to research metabolism in cyclic environments. python 条件通量平衡分析工具箱：循环环境中新陈代谢研究的应用。

IF 2.4 Q2 MATHEMATICAL & COMPUTATIONAL BIOLOGY

Bioinformatics advances

Pub Date : 2024-11-13 eCollection Date: 2024-01-01 DOI: 10.1093/bioadv/vbae174

Timothy Páez-Watson, Ricardo Hernández Medina, Loek Vellekoop, Mark C M van Loosdrecht, S Aljoscha Wahl

Summary: We present py_cFBA, a Python-based toolbox for conditional flux balance analysis (cFBA). Our toolbox allows for an easy implementation of cFBA models using a well-documented and modular approach and supports the generation of Systems Biology Markup Language models. The toolbox is designed to be user-friendly, versatile, and freely available to non-commercial users, serving as a valuable resource for researchers predicting metabolic behaviour with resource allocation in dynamic-cyclic environments.

Availability and implementation: Extensive documentation, installation steps, tutorials, and examples are available at https://tp-watson-python-cfba.readthedocs.io/en/. The py_cFBA python package is available at https://pypi.org/project/py-cfba/.

摘要：我们介绍了基于 Python 的条件通量平衡分析（cFBA）工具箱 py_cFBA。我们的工具箱采用文档齐全的模块化方法，可轻松实现条件通量平衡分析模型，并支持生成系统生物学标记语言模型。该工具箱设计为用户友好型、多功能型，可免费提供给非商业用户，是研究人员预测动态循环环境中资源分配的代谢行为的宝贵资源：广泛的文档、安装步骤、教程和示例可从 https://tp-watson-python-cfba.readthedocs.io/en/ 获取。py_cFBA python 软件包可从 https://pypi.org/project/py-cfba/ 获取。

引用次数: 0

Introducing GWAStic: a user-friendly, cross-platform solution for genome-wide association studies and genomic prediction. 介绍 GWAStic：全基因组关联研究和基因组预测的用户友好型跨平台解决方案。

IF 2.4 Q2 MATHEMATICAL & COMPUTATIONAL BIOLOGY

Bioinformatics advances

Pub Date : 2024-11-12 eCollection Date: 2024-01-01 DOI: 10.1093/bioadv/vbae177

Stefanie Lück, Uwe Scholz, Dimitar Douchkov

Motivation: Advances in genomics have created an insistent need for accessible tools that simplify complex genetic data analysis, enabling researchers across fields to harness the power of genome-wide association studies and genomic prediction. GWAStic was developed to bridge this gap, providing an intuitive platform that combines artificial intelligence with traditional statistical methods, making sophisticated genomic analysis accessible without requiring deep expertise in statistical software.

Results: We present GWAStic, an intuitive, cross-platform desktop application designed to streamline genome-wide association studies and genomic prediction for biological and medical researchers. With a user-friendly graphical interface, GWAStic integrates machine learning and traditional statistical approaches to support genetic analysis. The application accepts inputs from standard text-based Variant Call Formats and PLINK binary files, generating clear graphical outputs, including Manhattan plots, quantile-quantile plots, and genomic prediction correlation plots to enhance data visualization and analysis.

Availability and implementation: Project page: https://github.com/snowformatics/gwastic_desktop; GWAStic documentation: https://snowformatics.gitbook.io/product-docs; PyPI: https://pypi.org/project/gwastic-desktop/.

动机随着基因组学的发展，人们亟需能够简化复杂基因数据分析的工具，使各领域的研究人员能够利用全基因组关联研究和基因组预测的力量。GWAStic 就是为了弥补这一差距而开发的，它提供了一个将人工智能与传统统计方法相结合的直观平台，使复杂的基因组分析变得易学易用，而无需深厚的统计软件专业知识：我们介绍的 GWAStic 是一款直观、跨平台的桌面应用程序，旨在为生物和医学研究人员简化全基因组关联研究和基因组预测。GWAStic 采用用户友好的图形界面，整合了机器学习和传统统计方法，为遗传分析提供支持。该应用程序接受基于标准文本的变异调用格式和 PLINK 二进制文件的输入，生成清晰的图形输出，包括曼哈顿图、量纲-量纲图和基因组预测相关图，以加强数据的可视化和分析：项目页面：https://github.com/snowformatics/gwastic_desktop；GWAStic 文档：https://snowformatics.gitbook.io/product-docs；PyPI：https://pypi.org/project/gwastic-desktop/。

{"title":"Introducing GWAStic: a user-friendly, cross-platform solution for genome-wide association studies and genomic prediction.","authors":"Stefanie Lück, Uwe Scholz, Dimitar Douchkov","doi":"10.1093/bioadv/vbae177","DOIUrl":"10.1093/bioadv/vbae177","url":null,"abstract":"Motivation: Advances in genomics have created an insistent need for accessible tools that simplify complex genetic data analysis, enabling researchers across fields to harness the power of genome-wide association studies and genomic prediction. GWAStic was developed to bridge this gap, providing an intuitive platform that combines artificial intelligence with traditional statistical methods, making sophisticated genomic analysis accessible without requiring deep expertise in statistical software.Results: We present GWAStic, an intuitive, cross-platform desktop application designed to streamline genome-wide association studies and genomic prediction for biological and medical researchers. With a user-friendly graphical interface, GWAStic integrates machine learning and traditional statistical approaches to support genetic analysis. The application accepts inputs from standard text-based Variant Call Formats and PLINK binary files, generating clear graphical outputs, including Manhattan plots, quantile-quantile plots, and genomic prediction correlation plots to enhance data visualization and analysis.Availability and implementation: Project page: https://github.com/snowformatics/gwastic_desktop; GWAStic documentation: https://snowformatics.gitbook.io/product-docs; PyPI: https://pypi.org/project/gwastic-desktop/.","PeriodicalId":72368,"journal":{"name":"Bioinformatics advances","volume":"4 1","pages":"vbae177"},"PeriodicalIF":2.4,"publicationDate":"2024-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11643344/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142831010","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

LUKB: preparing local UK Biobank data for analysis. LUKB：准备用于分析的英国生物库本地数据。

IF 2.4 Q2 MATHEMATICAL & COMPUTATIONAL BIOLOGY

Bioinformatics advances

Pub Date : 2024-11-09 eCollection Date: 2024-01-01 DOI: 10.1093/bioadv/vbae176

Xiangnan Li, Yaqi Huang, Shuming Wang, Meng Hao, Yi Li, Hui Zhang, Zixin Hu

Motivation: The UK Biobank data holds immense potential for human health research. However, the complex data preparation and interpretation processes often act as barriers for researchers, diverting them from their core research questions.

Results: We developed LUKB, an R Shiny-based web tool that simplifies UK Biobank data preparation by automating these preprocessing tasks. LUKB reduces preprocessing time and integrates functions for initial data exploration, allowing researchers to dedicate more time to their scientific endeavors. Detailed deployment and usage can be found in the Supplementary Data.

Availability and implementation: LUKB is freely available at https://github.com/HaiGenBuShang/LUKB.

动机英国生物库数据为人类健康研究提供了巨大潜力。然而，复杂的数据准备和解释过程往往成为研究人员的障碍，使他们偏离核心研究问题：我们开发了基于 R Shiny 的网络工具 LUKB，通过自动完成这些预处理任务来简化英国生物库数据的准备工作。LUKB 减少了预处理时间，并集成了用于初始数据探索的功能，使研究人员能够将更多时间投入到科学研究中。详细的部署和使用方法见补充数据：LUKB 可在 https://github.com/HaiGenBuShang/LUKB 免费获取。

引用次数: 0

首页上一页

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

Bioinformatics advances

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀