Pub Date : 2024-11-20eCollection Date: 2025-01-01DOI: 10.1093/bioadv/vbae183
Dany Domínguez-Pérez, Guillermin Agüero-Chapin, Serena Leone, Maria Vittoria Modica
Motivation: Accurate sequence length profiling is essential in bioinformatics, particularly in genomics and proteomics. Existing tools like SeqKit and the Trinity toolkit provide basic sequence statistics but often fall short in offering comprehensive analytics and plotting options. For instance, SeqKit is a very complete and fast tool for sequence analysis, delivering useful metrics (e.g. number of sequences, average, minimum, and maximum lengths) and can return sequences either shorter or longer (but not both at once) for a given length. Similarly, Trinity's Perl-based scripts provide detailed contig length distributions (e.g. N50, median, and average lengths) but do not include the total number of sequences or offer graphical representations of the data.
Results: Given that key sequence analysis tasks are often distributed across multiple tools, we introduce SeqLengthPlot v2.0, an all-in-one, easy-to-use Python-based tool. Through a simple command-line interface, this straightforward tool enables users to split input FASTA files (nucleotide and protein) into two distinct files based on a customizable sequence length cutoff. It also automatically retrieves the resulting FASTA files, generates length distribution plots, and provides comprehensive statistical summaries.
Availability and implementation: SeqLengthPlot_v2.0.2 can be accessed at https://github.com/danydguezperez/SeqLengthPlot/releases/tag/v2.0.2.
{"title":"SeqLengthPlot v2.0: an all-in-one, easy-to-use tool for visualizing and retrieving sequence lengths from FASTA files.","authors":"Dany Domínguez-Pérez, Guillermin Agüero-Chapin, Serena Leone, Maria Vittoria Modica","doi":"10.1093/bioadv/vbae183","DOIUrl":"10.1093/bioadv/vbae183","url":null,"abstract":"<p><strong>Motivation: </strong>Accurate sequence length profiling is essential in bioinformatics, particularly in genomics and proteomics. Existing tools like SeqKit and the Trinity toolkit provide basic sequence statistics but often fall short in offering comprehensive analytics and plotting options. For instance, SeqKit is a very complete and fast tool for sequence analysis, delivering useful metrics (e.g. number of sequences, average, minimum, and maximum lengths) and can return sequences either shorter or longer (but not both at once) for a given length. Similarly, Trinity's Perl-based scripts provide detailed contig length distributions (e.g. N50, median, and average lengths) but do not include the total number of sequences or offer graphical representations of the data.</p><p><strong>Results: </strong>Given that key sequence analysis tasks are often distributed across multiple tools, we introduce <b>SeqLengthPlot v2.0</b>, an all-in-one, easy-to-use Python-based tool. Through a simple command-line interface, this straightforward tool enables users to split input FASTA files (nucleotide and protein) into two distinct files based on a customizable sequence length cutoff. It also automatically retrieves the resulting FASTA files, generates length distribution plots, and provides comprehensive statistical summaries.</p><p><strong>Availability and implementation: </strong>SeqLengthPlot_v2.0.2 can be accessed at https://github.com/danydguezperez/SeqLengthPlot/releases/tag/v2.0.2.</p>","PeriodicalId":72368,"journal":{"name":"Bioinformatics advances","volume":"5 1","pages":"vbae183"},"PeriodicalIF":2.4,"publicationDate":"2024-11-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11671033/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142904226","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-11-19eCollection Date: 2024-01-01DOI: 10.1093/bioadv/vbae179
Jin-Ok Lee, Sejoon Lee, Dongyoon Lee, Taeyeon Hwang, Soobok Joe, Jin Ok Yang, Jibin Jeong, Jung Hun Ohn, Jee Hyun Kim
Summary: Transposable elements (TEs), commonly referred to as "mobile elements," constitute DNA segments capable of relocating within a genome. Initially disregarded as "junk DNA" devoid of specific functionality, it has become evident that TEs have diverse influences on an organism's biology and health. The impact of these elements varies according to their location, classification, and their effects on specific genes or regulatory components. Despite their significant roles, a paucity of resources concerning TEs in population-scale genome sequencing remains. Herein, we analyze whole-genome sequencing data sourced from the Korean Genome and Epidemiology Study, encompassing 2500 Korean individuals. To facilitate convenient data access and observation, we developed a web-based database, KTED. Additionally, we scrutinized the differential distributions of TEs across five distinct common disease groups: dyslipidemia, hypertension, diabetes, thyroid disease, and cancer.
Availability and implementation: https://snubh.shinyapps.io/KTED.
{"title":"KTED: a comprehensive web-based database for transposable elements in the Korean genome.","authors":"Jin-Ok Lee, Sejoon Lee, Dongyoon Lee, Taeyeon Hwang, Soobok Joe, Jin Ok Yang, Jibin Jeong, Jung Hun Ohn, Jee Hyun Kim","doi":"10.1093/bioadv/vbae179","DOIUrl":"10.1093/bioadv/vbae179","url":null,"abstract":"<p><strong>Summary: </strong>Transposable elements (TEs), commonly referred to as \"mobile elements,\" constitute DNA segments capable of relocating within a genome. Initially disregarded as \"junk DNA\" devoid of specific functionality, it has become evident that TEs have diverse influences on an organism's biology and health. The impact of these elements varies according to their location, classification, and their effects on specific genes or regulatory components. Despite their significant roles, a paucity of resources concerning TEs in population-scale genome sequencing remains. Herein, we analyze whole-genome sequencing data sourced from the Korean Genome and Epidemiology Study, encompassing 2500 Korean individuals. To facilitate convenient data access and observation, we developed a web-based database, KTED. Additionally, we scrutinized the differential distributions of TEs across five distinct common disease groups: dyslipidemia, hypertension, diabetes, thyroid disease, and cancer.</p><p><strong>Availability and implementation: </strong>https://snubh.shinyapps.io/KTED.</p>","PeriodicalId":72368,"journal":{"name":"Bioinformatics advances","volume":"4 1","pages":"vbae179"},"PeriodicalIF":2.4,"publicationDate":"2024-11-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11652267/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142857185","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-11-18eCollection Date: 2024-01-01DOI: 10.1093/bioadv/vbae166
Cath Brooksbank, Michelle D Brazas, Nicola Mulder, Russell Schwartz, Verena Ras, Sarah L Morgan, Marta Lloret Llinares, Patricia Carvajal López, Lee Larcombe, Amel Ghouila, Tom Hancocks, Venkata Satagopam, Javier De Las Rivas, Gaston Mazandu, Bruno Gaeta
Motivation: Developing competency in the broad area of bioinformatics is challenging globally, owing to the breadth of the field and the diversity of its audiences for education and training. Course design can be facilitated by the use of a competency framework-a set of competency requirements that define the knowledge, skills and attitudes needed by individuals in (or aspiring to be in) a particular profession or role. These competency requirements can help to define curricula as they can inform both the content and level to which competency needs to be developed. The International Society for Computational Biology (ISCB) developed a list of bioinformatics competencies in 2014, and these have undergone several rounds of improvement. In consultation with a broad bioinformatics training community, these have now been further refined and extended to include knowledge skills and attitudes, and mappings to previous and other existing competency frameworks.
Results: Here, we present version 3 of the ISCB competency framework. We describe how it was developed and how to access it, as well as providing some examples of how it has been used.
Availability and implementation: The framework is openly accessible at https://competency.ebi.ac.uk/framework/iscb/3.0/competencies.
{"title":"The ISCB competency framework v. 3: a revised and extended standard for bioinformatics education and training.","authors":"Cath Brooksbank, Michelle D Brazas, Nicola Mulder, Russell Schwartz, Verena Ras, Sarah L Morgan, Marta Lloret Llinares, Patricia Carvajal López, Lee Larcombe, Amel Ghouila, Tom Hancocks, Venkata Satagopam, Javier De Las Rivas, Gaston Mazandu, Bruno Gaeta","doi":"10.1093/bioadv/vbae166","DOIUrl":"10.1093/bioadv/vbae166","url":null,"abstract":"<p><strong>Motivation: </strong>Developing competency in the broad area of bioinformatics is challenging globally, owing to the breadth of the field and the diversity of its audiences for education and training. Course design can be facilitated by the use of a competency framework-a set of competency requirements that define the knowledge, skills and attitudes needed by individuals in (or aspiring to be in) a particular profession or role. These competency requirements can help to define curricula as they can inform both the content and level to which competency needs to be developed. The International Society for Computational Biology (ISCB) developed a list of bioinformatics competencies in 2014, and these have undergone several rounds of improvement. In consultation with a broad bioinformatics training community, these have now been further refined and extended to include knowledge skills and attitudes, and mappings to previous and other existing competency frameworks.</p><p><strong>Results: </strong>Here, we present version 3 of the ISCB competency framework. We describe how it was developed and how to access it, as well as providing some examples of how it has been used.</p><p><strong>Availability and implementation: </strong>The framework is openly accessible at https://competency.ebi.ac.uk/framework/iscb/3.0/competencies.</p>","PeriodicalId":72368,"journal":{"name":"Bioinformatics advances","volume":"4 1","pages":"vbae166"},"PeriodicalIF":2.4,"publicationDate":"2024-11-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11646570/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142831057","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-11-18eCollection Date: 2025-01-01DOI: 10.1093/bioadv/vbae181
Maytha Alshammari, Jing He, Willy Wriggers
Motivation: This study investigates the flexible refinement of AlphaFold2 models against corresponding cryo-electron microscopy (cryo-EM) maps using normal modes derived from elastic network models (ENMs) as basis functions for displacement. AlphaFold2 generally predicts highly accurate structures, but 18 of the 137 models of isolated chains exhibit a TM-score below 0.80. We achieved a significant improvement in four of these deviating structures and used them to systematically optimize the parameters of the ENM motion model.
Results: We successfully refined four AlphaFold2 models with notable discrepancies: lipid-preserved respiratory supercomplex (TM-score increased from 0.52 to 0.69), flagellar L-ring protein (TM-score increased from 0.53 to 0.64), cation diffusion facilitator YiiP (TM-score increased from 0.76 to 0.83), and Sulfolobus islandicus pilus (TM-score increased from 0.77 to 0.85). We explored the effect of three different mode ranges (modes 1-9, 7-9, and 1-12), masked or box-cropped density maps, numerical optimization methods, and two similarity measures (Pearson correlation and inner product). The best results were achieved for the widest mode range (modes 1-12), masked maps, inner product, and local Powell optimization. These optimal parameters were implemented in the flexible fitting utility elforge.py in version 1.4 of our Python-based ModeHunter package.
Availability and implementation: https://modehunter.biomachina.org.
{"title":"Flexible fitting of AlphaFold2-predicted models to cryo-EM density maps using elastic network models: a methodical affirmation.","authors":"Maytha Alshammari, Jing He, Willy Wriggers","doi":"10.1093/bioadv/vbae181","DOIUrl":"10.1093/bioadv/vbae181","url":null,"abstract":"<p><strong>Motivation: </strong>This study investigates the flexible refinement of AlphaFold2 models against corresponding cryo-electron microscopy (cryo-EM) maps using normal modes derived from elastic network models (ENMs) as basis functions for displacement. AlphaFold2 generally predicts highly accurate structures, but 18 of the 137 models of isolated chains exhibit a TM-score below 0.80. We achieved a significant improvement in four of these deviating structures and used them to systematically optimize the parameters of the ENM motion model.</p><p><strong>Results: </strong>We successfully refined four AlphaFold2 models with notable discrepancies: lipid-preserved respiratory supercomplex (TM-score increased from 0.52 to 0.69), flagellar L-ring protein (TM-score increased from 0.53 to 0.64), cation diffusion facilitator YiiP (TM-score increased from 0.76 to 0.83), and <i>Sulfolobus islandicus</i> pilus (TM-score increased from 0.77 to 0.85). We explored the effect of three different mode ranges (modes 1-9, 7-9, and 1-12), masked or box-cropped density maps, numerical optimization methods, and two similarity measures (Pearson correlation and inner product). The best results were achieved for the widest mode range (modes 1-12), masked maps, inner product, and local Powell optimization. These optimal parameters were implemented in the flexible fitting utility elforge.py in version 1.4 of our Python-based ModeHunter package.</p><p><strong>Availability and implementation: </strong>https://modehunter.biomachina.org.</p>","PeriodicalId":72368,"journal":{"name":"Bioinformatics advances","volume":"5 1","pages":"vbae181"},"PeriodicalIF":2.4,"publicationDate":"2024-11-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11783307/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143082478","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-11-14eCollection Date: 2024-01-01DOI: 10.1093/bioadv/vbae178
Gordon Grabert, Daniel Dehncke, Tushar More, Markus List, Anke R M Kraft, Markus Cornberg, Karsten Hiller, Tim Kacprowski
Motivation: The availability of longitudinal omics data is increasing in metabolomics research. Viewing metabolomics data over time provides detailed insight into biological processes and fosters understanding of how systems react over time. However, the analysis of longitudinal metabolomics data poses various challenges, both in terms of statistical evaluation and visualization.
Results: To make explorative analysis of longitudinal data readily available to researchers without formal background in computer science and programming, we present MEtabolite Trajectory ExplORer (MeTEor). MeTEor is an R Shiny app providing a comprehensive set of statistical analysis methods. To demonstrate the capabilities of MeTEor, we replicated the analysis of metabolomics data from a previously published study on COVID-19 patients.
Availability and implementation: MeTEor is available as an R package and as a Docker image. Source code and instructions for setting up the app can be found on GitHub (https://github.com/scibiome/meteor). The Docker image is available at Docker Hub (https://hub.docker.com/r/gordomics/meteor). MeTEor has been tested on Microsoft Windows, Unix/Linux, and macOS.
{"title":"MeTEor: an R Shiny app for exploring longitudinal metabolomics data.","authors":"Gordon Grabert, Daniel Dehncke, Tushar More, Markus List, Anke R M Kraft, Markus Cornberg, Karsten Hiller, Tim Kacprowski","doi":"10.1093/bioadv/vbae178","DOIUrl":"10.1093/bioadv/vbae178","url":null,"abstract":"<p><strong>Motivation: </strong>The availability of longitudinal omics data is increasing in metabolomics research. Viewing metabolomics data over time provides detailed insight into biological processes and fosters understanding of how systems react over time. However, the analysis of longitudinal metabolomics data poses various challenges, both in terms of statistical evaluation and visualization.</p><p><strong>Results: </strong>To make explorative analysis of longitudinal data readily available to researchers without formal background in computer science and programming, we present MEtabolite Trajectory ExplORer (MeTEor). MeTEor is an R Shiny app providing a comprehensive set of statistical analysis methods. To demonstrate the capabilities of MeTEor, we replicated the analysis of metabolomics data from a previously published study on COVID-19 patients.</p><p><strong>Availability and implementation: </strong>MeTEor is available as an R package and as a Docker image. Source code and instructions for setting up the app can be found on GitHub (https://github.com/scibiome/meteor). The Docker image is available at Docker Hub (https://hub.docker.com/r/gordomics/meteor). MeTEor has been tested on Microsoft Windows, Unix/Linux, and macOS.</p>","PeriodicalId":72368,"journal":{"name":"Bioinformatics advances","volume":"4 1","pages":"vbae178"},"PeriodicalIF":2.4,"publicationDate":"2024-11-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11631383/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142808674","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-11-14eCollection Date: 2024-01-01DOI: 10.1093/bioadv/vbae175
Bianka Alexandra Pasat, Eleftherios Pilalis, Katarzyna Mnich, Afshin Samali, Aristotelis Chatziioannou, Adrienne M Gorman
Motivation: Analysis of gene and isoform expression levels is becoming critical for the detailed understanding of biochemical mechanisms. In addition, integrating RNA-seq data with other omics data types, such as proteomics and metabolomics, provides a strong approach for consolidating our understanding of biological processes across various organizational tiers, thus promoting the identification of potential therapeutic targets.
Results: We present our pipeline, called MultiOmicsIntegrator (MOI), an inclusive pipeline for comprehensive omics analyses. MOI represents a unified approach that performs in-depth individual analyses of diverse omics. Specifically, exhaustive analysis of RNA-seq data at the level of genes, isoforms of genes, as well as miRNA is offered, coupled with functional annotation and structure prediction of these transcripts. Additionally, proteomics and metabolomics data are supported providing a holistic view of biological systems. Finally, MOI has tools to integrate simultaneously multiple and diverse omics datasets, with both data- and function-driven approaches, fostering a deeper understanding of intricate biological interactions.
Availability and implementation: MOI and ReadTheDocs.
{"title":"MultiOmicsIntegrator: a nextflow pipeline for integrated omics analyses.","authors":"Bianka Alexandra Pasat, Eleftherios Pilalis, Katarzyna Mnich, Afshin Samali, Aristotelis Chatziioannou, Adrienne M Gorman","doi":"10.1093/bioadv/vbae175","DOIUrl":"10.1093/bioadv/vbae175","url":null,"abstract":"<p><strong>Motivation: </strong>Analysis of gene and isoform expression levels is becoming critical for the detailed understanding of biochemical mechanisms. In addition, integrating RNA-seq data with other omics data types, such as proteomics and metabolomics, provides a strong approach for consolidating our understanding of biological processes across various organizational tiers, thus promoting the identification of potential therapeutic targets.</p><p><strong>Results: </strong>We present our pipeline, called MultiOmicsIntegrator (MOI), an inclusive pipeline for comprehensive omics analyses. MOI represents a unified approach that performs in-depth individual analyses of diverse omics. Specifically, exhaustive analysis of RNA-seq data at the level of genes, isoforms of genes, as well as miRNA is offered, coupled with functional annotation and structure prediction of these transcripts. Additionally, proteomics and metabolomics data are supported providing a holistic view of biological systems. Finally, MOI has tools to integrate simultaneously multiple and diverse omics datasets, with both data- and function-driven approaches, fostering a deeper understanding of intricate biological interactions.</p><p><strong>Availability and implementation: </strong>MOI and ReadTheDocs.</p>","PeriodicalId":72368,"journal":{"name":"Bioinformatics advances","volume":"4 1","pages":"vbae175"},"PeriodicalIF":2.4,"publicationDate":"2024-11-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11576358/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142683388","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-11-13eCollection Date: 2024-01-01DOI: 10.1093/bioadv/vbae155
Julia Wrobel, Alex C Soupir, Mitchell T Hayes, Lauren C Peres, Thao Vu, Andrew Leroux, Brooke L Fridley
Summary: Technologies that produce spatial single-cell (SC) data have revolutionized the study of tissue microstructures and promise to advance personalized treatment of cancer by revealing new insights about the tumor microenvironment. Functional data analysis (FDA) is an ideal analytic framework for connecting cell spatial relationships to patient outcomes, but can be challenging to implement. To address this need, we present mxfda, an R package for end-to-end analysis of SC spatial data using FDA. mxfda implements a suite of methods to facilitate spatial analysis of SC imaging data using FDA techniques.
Availability and implementation: The mxfda R package is freely available at https://cran.r-project.org/package=mxfda and has detailed documentation, including four vignettes, available at http://juliawrobel.com/mxfda/.
摘要:产生空间单细胞(SC)数据的技术彻底改变了对组织微结构的研究,并有望通过揭示肿瘤微环境的新见解推进癌症的个性化治疗。功能数据分析(FDA)是将细胞空间关系与患者预后联系起来的理想分析框架,但实施起来却很困难。为了满足这一需求,我们推出了 mxfda,这是一个使用 FDA 对 SC 空间数据进行端到端分析的 R 软件包。mxfda 实现了一套方法,便于使用 FDA 技术对 SC 成像数据进行空间分析:mxfda R 软件包可从 https://cran.r-project.org/package=mxfda 免费获取,其详细文档(包括四个小节)可从 http://juliawrobel.com/mxfda/ 获取。
{"title":"mxfda: a comprehensive toolkit for functional data analysis of single-cell spatial data.","authors":"Julia Wrobel, Alex C Soupir, Mitchell T Hayes, Lauren C Peres, Thao Vu, Andrew Leroux, Brooke L Fridley","doi":"10.1093/bioadv/vbae155","DOIUrl":"10.1093/bioadv/vbae155","url":null,"abstract":"<p><strong>Summary: </strong>Technologies that produce spatial single-cell (SC) data have revolutionized the study of tissue microstructures and promise to advance personalized treatment of cancer by revealing new insights about the tumor microenvironment. Functional data analysis (FDA) is an ideal analytic framework for connecting cell spatial relationships to patient outcomes, but can be challenging to implement. To address this need, we present mxfda, an R package for end-to-end analysis of SC spatial data using FDA. mxfda implements a suite of methods to facilitate spatial analysis of SC imaging data using FDA techniques.</p><p><strong>Availability and implementation: </strong>The mxfda R package is freely available at https://cran.r-project.org/package=mxfda and has detailed documentation, including four vignettes, available at http://juliawrobel.com/mxfda/.</p>","PeriodicalId":72368,"journal":{"name":"Bioinformatics advances","volume":"4 1","pages":"vbae155"},"PeriodicalIF":2.4,"publicationDate":"2024-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11568348/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142649354","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-11-13eCollection Date: 2024-01-01DOI: 10.1093/bioadv/vbae174
Timothy Páez-Watson, Ricardo Hernández Medina, Loek Vellekoop, Mark C M van Loosdrecht, S Aljoscha Wahl
Summary: We present py_cFBA, a Python-based toolbox for conditional flux balance analysis (cFBA). Our toolbox allows for an easy implementation of cFBA models using a well-documented and modular approach and supports the generation of Systems Biology Markup Language models. The toolbox is designed to be user-friendly, versatile, and freely available to non-commercial users, serving as a valuable resource for researchers predicting metabolic behaviour with resource allocation in dynamic-cyclic environments.
Availability and implementation: Extensive documentation, installation steps, tutorials, and examples are available at https://tp-watson-python-cfba.readthedocs.io/en/. The py_cFBA python package is available at https://pypi.org/project/py-cfba/.
{"title":"Conditional flux balance analysis toolbox for python: application to research metabolism in cyclic environments.","authors":"Timothy Páez-Watson, Ricardo Hernández Medina, Loek Vellekoop, Mark C M van Loosdrecht, S Aljoscha Wahl","doi":"10.1093/bioadv/vbae174","DOIUrl":"10.1093/bioadv/vbae174","url":null,"abstract":"<p><strong>Summary: </strong>We present py_cFBA, a Python-based toolbox for conditional flux balance analysis (cFBA). Our toolbox allows for an easy implementation of cFBA models using a well-documented and modular approach and supports the generation of Systems Biology Markup Language models. The toolbox is designed to be user-friendly, versatile, and freely available to non-commercial users, serving as a valuable resource for researchers predicting metabolic behaviour with resource allocation in dynamic-cyclic environments.</p><p><strong>Availability and implementation: </strong>Extensive documentation, installation steps, tutorials, and examples are available at https://tp-watson-python-cfba.readthedocs.io/en/. The py_cFBA python package is available at https://pypi.org/project/py-cfba/.</p>","PeriodicalId":72368,"journal":{"name":"Bioinformatics advances","volume":"4 1","pages":"vbae174"},"PeriodicalIF":2.4,"publicationDate":"2024-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11593493/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142735127","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-11-12eCollection Date: 2024-01-01DOI: 10.1093/bioadv/vbae177
Stefanie Lück, Uwe Scholz, Dimitar Douchkov
Motivation: Advances in genomics have created an insistent need for accessible tools that simplify complex genetic data analysis, enabling researchers across fields to harness the power of genome-wide association studies and genomic prediction. GWAStic was developed to bridge this gap, providing an intuitive platform that combines artificial intelligence with traditional statistical methods, making sophisticated genomic analysis accessible without requiring deep expertise in statistical software.
Results: We present GWAStic, an intuitive, cross-platform desktop application designed to streamline genome-wide association studies and genomic prediction for biological and medical researchers. With a user-friendly graphical interface, GWAStic integrates machine learning and traditional statistical approaches to support genetic analysis. The application accepts inputs from standard text-based Variant Call Formats and PLINK binary files, generating clear graphical outputs, including Manhattan plots, quantile-quantile plots, and genomic prediction correlation plots to enhance data visualization and analysis.
{"title":"Introducing GWAStic: a user-friendly, cross-platform solution for genome-wide association studies and genomic prediction.","authors":"Stefanie Lück, Uwe Scholz, Dimitar Douchkov","doi":"10.1093/bioadv/vbae177","DOIUrl":"10.1093/bioadv/vbae177","url":null,"abstract":"<p><strong>Motivation: </strong>Advances in genomics have created an insistent need for accessible tools that simplify complex genetic data analysis, enabling researchers across fields to harness the power of genome-wide association studies and genomic prediction. GWAStic was developed to bridge this gap, providing an intuitive platform that combines artificial intelligence with traditional statistical methods, making sophisticated genomic analysis accessible without requiring deep expertise in statistical software.</p><p><strong>Results: </strong>We present GWAStic, an intuitive, cross-platform desktop application designed to streamline genome-wide association studies and genomic prediction for biological and medical researchers. With a user-friendly graphical interface, GWAStic integrates machine learning and traditional statistical approaches to support genetic analysis. The application accepts inputs from standard text-based Variant Call Formats and PLINK binary files, generating clear graphical outputs, including Manhattan plots, quantile-quantile plots, and genomic prediction correlation plots to enhance data visualization and analysis.</p><p><strong>Availability and implementation: </strong>Project page: https://github.com/snowformatics/gwastic_desktop; GWAStic documentation: https://snowformatics.gitbook.io/product-docs; PyPI: https://pypi.org/project/gwastic-desktop/.</p>","PeriodicalId":72368,"journal":{"name":"Bioinformatics advances","volume":"4 1","pages":"vbae177"},"PeriodicalIF":2.4,"publicationDate":"2024-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11643344/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142831010","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-11-09eCollection Date: 2024-01-01DOI: 10.1093/bioadv/vbae176
Xiangnan Li, Yaqi Huang, Shuming Wang, Meng Hao, Yi Li, Hui Zhang, Zixin Hu
Motivation: The UK Biobank data holds immense potential for human health research. However, the complex data preparation and interpretation processes often act as barriers for researchers, diverting them from their core research questions.
Results: We developed LUKB, an R Shiny-based web tool that simplifies UK Biobank data preparation by automating these preprocessing tasks. LUKB reduces preprocessing time and integrates functions for initial data exploration, allowing researchers to dedicate more time to their scientific endeavors. Detailed deployment and usage can be found in the Supplementary Data.
Availability and implementation: LUKB is freely available at https://github.com/HaiGenBuShang/LUKB.
动机英国生物库数据为人类健康研究提供了巨大潜力。然而,复杂的数据准备和解释过程往往成为研究人员的障碍,使他们偏离核心研究问题:我们开发了基于 R Shiny 的网络工具 LUKB,通过自动完成这些预处理任务来简化英国生物库数据的准备工作。LUKB 减少了预处理时间,并集成了用于初始数据探索的功能,使研究人员能够将更多时间投入到科学研究中。详细的部署和使用方法见补充数据:LUKB 可在 https://github.com/HaiGenBuShang/LUKB 免费获取。
{"title":"LUKB: preparing local UK Biobank data for analysis.","authors":"Xiangnan Li, Yaqi Huang, Shuming Wang, Meng Hao, Yi Li, Hui Zhang, Zixin Hu","doi":"10.1093/bioadv/vbae176","DOIUrl":"10.1093/bioadv/vbae176","url":null,"abstract":"<p><strong>Motivation: </strong>The UK Biobank data holds immense potential for human health research. However, the complex data preparation and interpretation processes often act as barriers for researchers, diverting them from their core research questions.</p><p><strong>Results: </strong>We developed LUKB, an R Shiny-based web tool that simplifies UK Biobank data preparation by automating these preprocessing tasks. LUKB reduces preprocessing time and integrates functions for initial data exploration, allowing researchers to dedicate more time to their scientific endeavors. Detailed deployment and usage can be found in the Supplementary Data.</p><p><strong>Availability and implementation: </strong>LUKB is freely available at https://github.com/HaiGenBuShang/LUKB.</p>","PeriodicalId":72368,"journal":{"name":"Bioinformatics advances","volume":"4 1","pages":"vbae176"},"PeriodicalIF":2.4,"publicationDate":"2024-11-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11580680/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142689856","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}