Pub Date : 2024-07-24eCollection Date: 2024-09-01DOI: 10.1515/jib-2024-0022
Stefan Paul Feyer, Bruno Pinaud, Karsten Klein, Etienne Lein, Falk Schreiber
Animal behaviour is often modelled as networks, where, for example, the nodes are individuals of a group and the edges represent behaviour within this group. Different types of behaviours or behavioural categories are then modelled as different yet connected networks which form a multilayer network. Recent developments show the potential and benefit of multilayer networks for animal behaviour research as well as the potential benefit of stereoscopic 3D immersive environments for the interactive visualisation, exploration and analysis of animal behaviour multilayer networks. However, so far animal behaviour research is mainly supported by libraries or software on 2D desktops. Here, we explore the domain-specific requirements for (stereoscopic) 3D environments. Based on those requirements, we provide a proof of concept to visualise, explore and analyse animal behaviour multilayer networks in immersive environments.
{"title":"Exploring animal behaviour multilayer networks in immersive environments - a conceptual framework.","authors":"Stefan Paul Feyer, Bruno Pinaud, Karsten Klein, Etienne Lein, Falk Schreiber","doi":"10.1515/jib-2024-0022","DOIUrl":"10.1515/jib-2024-0022","url":null,"abstract":"<p><p>Animal behaviour is often modelled as networks, where, for example, the nodes are individuals of a group and the edges represent behaviour within this group. Different types of behaviours or behavioural categories are then modelled as different yet connected networks which form a multilayer network. Recent developments show the potential and benefit of multilayer networks for animal behaviour research as well as the potential benefit of stereoscopic 3D immersive environments for the interactive visualisation, exploration and analysis of animal behaviour multilayer networks. However, so far animal behaviour research is mainly supported by libraries or software on 2D desktops. Here, we explore the domain-specific requirements for (stereoscopic) 3D environments. Based on those requirements, we provide a proof of concept to visualise, explore and analyse animal behaviour multilayer networks in immersive environments.</p>","PeriodicalId":53625,"journal":{"name":"Journal of Integrative Bioinformatics","volume":" ","pages":""},"PeriodicalIF":1.5,"publicationDate":"2024-07-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11602229/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141762604","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-07-24eCollection Date: 2024-06-01DOI: 10.1515/jib-2023-0051
Daniel Glez-Peña, Hugo López-Fernández, Pedro Duque, Cristina P Vieira, Jorge Vieira
When inferring the evolution of a gene/gene family, it is advisable to use all available coding sequences (CDS) from as many species genomes as possible in order to infer and date all gene duplications and losses. Nowadays, this means using hundreds or even thousands of CDSs, which makes the inferred phylogenetic trees difficult to visualize and interpret. Therefore, it is useful to have an automated way of collapsing large phylogenetic trees according to a taxonomic term decided by the user (family, class, or order, for instance), in order to highlight the minimal set of sequences that should be used to recapitulate the full history of the gene/gene family being studied at that taxonomic level, that can be refined using additional software. Here we present the Phylogenetic Tree Collapser (PTC) program (https://github.com/pegi3s/phylogenetic-tree-collapser), a flexible tool for automated tree collapsing using taxonomic information, that can be easily used by researchers without a background in informatics, since it only requires the installation of Docker, Podman or Singularity. The utility of PTC is demonstrated by addressing the evolution of the ascorbic acid synthesis pathway in insects. A Docker image is available at Docker Hub (https://hub.docker.com/r/pegi3s/phylogenetic-tree-collapser) with PTC installed and ready-to-run.
{"title":"Inferences on the evolution of the ascorbic acid synthesis pathway in insects using Phylogenetic Tree Collapser (PTC), a tool for the automated collapsing of phylogenetic trees using taxonomic information.","authors":"Daniel Glez-Peña, Hugo López-Fernández, Pedro Duque, Cristina P Vieira, Jorge Vieira","doi":"10.1515/jib-2023-0051","DOIUrl":"10.1515/jib-2023-0051","url":null,"abstract":"<p><p>When inferring the evolution of a gene/gene family, it is advisable to use all available coding sequences (CDS) from as many species genomes as possible in order to infer and date all gene duplications and losses. Nowadays, this means using hundreds or even thousands of CDSs, which makes the inferred phylogenetic trees difficult to visualize and interpret. Therefore, it is useful to have an automated way of collapsing large phylogenetic trees according to a taxonomic term decided by the user (family, class, or order, for instance), in order to highlight the minimal set of sequences that should be used to recapitulate the full history of the gene/gene family being studied at that taxonomic level, that can be refined using additional software. Here we present the Phylogenetic Tree Collapser (PTC) program (https://github.com/pegi3s/phylogenetic-tree-collapser), a flexible tool for automated tree collapsing using taxonomic information, that can be easily used by researchers without a background in informatics, since it only requires the installation of Docker, Podman or Singularity. The utility of PTC is demonstrated by addressing the evolution of the ascorbic acid synthesis pathway in insects. A Docker image is available at Docker Hub (https://hub.docker.com/r/pegi3s/phylogenetic-tree-collapser) with PTC installed and ready-to-run.</p>","PeriodicalId":53625,"journal":{"name":"Journal of Integrative Bioinformatics","volume":" ","pages":""},"PeriodicalIF":1.5,"publicationDate":"2024-07-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11377030/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141762605","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-07-22eCollection Date: 2024-03-01DOI: 10.1515/jib-2024-0015
Martin Golebiewski, Gary Bader, Padraig Gleeson, Thomas E Gorochowski, Sarah M Keating, Matthias König, Chris J Myers, David P Nickerson, Björn Sommer, Dagmar Waltemath, Falk Schreiber
{"title":"Specifications of standards in systems and synthetic biology: status, developments, and tools in 2024.","authors":"Martin Golebiewski, Gary Bader, Padraig Gleeson, Thomas E Gorochowski, Sarah M Keating, Matthias König, Chris J Myers, David P Nickerson, Björn Sommer, Dagmar Waltemath, Falk Schreiber","doi":"10.1515/jib-2024-0015","DOIUrl":"10.1515/jib-2024-0015","url":null,"abstract":"","PeriodicalId":53625,"journal":{"name":"Journal of Integrative Bioinformatics","volume":" ","pages":""},"PeriodicalIF":1.5,"publicationDate":"2024-07-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11293897/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141725016","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-07-15eCollection Date: 2024-09-01DOI: 10.1515/jib-2024-0020
Valentin Wesp, Lukas Scholz, Janine M Ziermann-Canabarro, Stefan Schuster, Heiko Stark
Collagens are structural proteins that are predominantly found in the extracellular matrix of multicellular animals, where they are mainly responsible for the stability and structural integrity of various tissues. All collagens contain polypeptide strands (α-chains). There are several types of collagens, some of which differ significantly in form, function, and tissue specificity. Because of their importance in clinical research, they are grouped into subdivisions, the so-called collagen families, and their sequences are often analysed. However, problems arise with highly homologous sequence segments. To increase the accuracy of collagen classification and prediction of their functions, the structure of these collagens and their expression in different tissues could result in a better focus on sequence segments of interest. Here, we analyse collagen families with different levels of conservation. As a result, clusters with high interconnectivity can be found, such as the fibrillar collagens, the COL4 network-forming collagens, and the COL9 FACITs. Furthermore, a large cluster between network-forming, FACIT, and COL28a1 α-chains is formed with COL6a3 as a major hub node. The formation of clusters also signifies, why it is important to always analyse the α-chains and why structural changes can have a wide range of effects on the body.
{"title":"Constructing networks for comparison of collagen types.","authors":"Valentin Wesp, Lukas Scholz, Janine M Ziermann-Canabarro, Stefan Schuster, Heiko Stark","doi":"10.1515/jib-2024-0020","DOIUrl":"10.1515/jib-2024-0020","url":null,"abstract":"<p><p>Collagens are structural proteins that are predominantly found in the extracellular matrix of multicellular animals, where they are mainly responsible for the stability and structural integrity of various tissues. All collagens contain polypeptide strands (α-chains). There are several types of collagens, some of which differ significantly in form, function, and tissue specificity. Because of their importance in clinical research, they are grouped into subdivisions, the so-called collagen families, and their sequences are often analysed. However, problems arise with highly homologous sequence segments. To increase the accuracy of collagen classification and prediction of their functions, the structure of these collagens and their expression in different tissues could result in a better focus on sequence segments of interest. Here, we analyse collagen families with different levels of conservation. As a result, clusters with high interconnectivity can be found, such as the fibrillar collagens, the COL4 network-forming collagens, and the COL9 FACITs. Furthermore, a large cluster between network-forming, FACIT, and COL28a1 α-chains is formed with COL6a3 as a major hub node. The formation of clusters also signifies, why it is important to always analyse the α-chains and why structural changes can have a wide range of effects on the body.</p>","PeriodicalId":53625,"journal":{"name":"Journal of Integrative Bioinformatics","volume":" ","pages":""},"PeriodicalIF":1.5,"publicationDate":"2024-07-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11602231/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141602166","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-07-15eCollection Date: 2024-06-01DOI: 10.1515/jib-2023-0042
Daniel Martins, Maryam Abbasi, Conceição Egas, Joel P Arrais
This study delves into the intricate genetic and clinical aspects of Schizophrenia, a complex mental disorder with uncertain etiology. Deep Learning (DL) holds promise for analyzing large genomic datasets to uncover new risk factors. However, based on reports of non-negligible misdiagnosis rates for SCZ, case-control cohorts may contain outlying genetic profiles, hindering compelling performances of classification models. The research employed a case-control dataset sourced from the Swedish populace. A gene-annotation-based DL architecture was developed and employed in two stages. First, the model was trained on the entire dataset to highlight differences between cases and controls. Then, samples likely to be misclassified were excluded, and the model was retrained on the refined dataset for performance evaluation. The results indicate that SCZ prevalence and misdiagnosis rates can affect case-control cohorts, potentially compromising future studies reliant on such datasets. However, by detecting and filtering outliers, the study demonstrates the feasibility of adapting DL methodologies to large-scale biological problems, producing results more aligned with existing heritability estimates for SCZ. This approach not only advances the comprehension of the genetic background of SCZ but also opens doors for adapting DL techniques in complex research for precision medicine in mental health.
{"title":"Detecting outliers in case-control cohorts for improving deep learning networks on Schizophrenia prediction.","authors":"Daniel Martins, Maryam Abbasi, Conceição Egas, Joel P Arrais","doi":"10.1515/jib-2023-0042","DOIUrl":"10.1515/jib-2023-0042","url":null,"abstract":"<p><p>This study delves into the intricate genetic and clinical aspects of Schizophrenia, a complex mental disorder with uncertain etiology. Deep Learning (DL) holds promise for analyzing large genomic datasets to uncover new risk factors. However, based on reports of non-negligible misdiagnosis rates for SCZ, case-control cohorts may contain outlying genetic profiles, hindering compelling performances of classification models. The research employed a case-control dataset sourced from the Swedish populace. A gene-annotation-based DL architecture was developed and employed in two stages. First, the model was trained on the entire dataset to highlight differences between cases and controls. Then, samples likely to be misclassified were excluded, and the model was retrained on the refined dataset for performance evaluation. The results indicate that SCZ prevalence and misdiagnosis rates can affect case-control cohorts, potentially compromising future studies reliant on such datasets. However, by detecting and filtering outliers, the study demonstrates the feasibility of adapting DL methodologies to large-scale biological problems, producing results more aligned with existing heritability estimates for SCZ. This approach not only advances the comprehension of the genetic background of SCZ but also opens doors for adapting DL techniques in complex research for precision medicine in mental health.</p>","PeriodicalId":53625,"journal":{"name":"Journal of Integrative Bioinformatics","volume":" ","pages":""},"PeriodicalIF":1.5,"publicationDate":"2024-07-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11377398/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141617597","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-07-12eCollection Date: 2024-09-01DOI: 10.1515/jib-2024-0023
Niklas Gröne, Benjamin Grüneisen, Karsten Klein, Bernard de Bono, Tobias Czauderna, Falk Schreiber
We present a method for the layout of anatomical structures and blood vessels based on information from the Foundational Model of Anatomy (FMA). Our approach integrates a novel vascular layout into the hierarchical treemap representation of anatomy as used in ApiNATOMY. Our method aims to improve the comprehension of complex anatomical and vascular data by providing readable visual representations. The effectiveness of our method is demonstrated through a prototype developed in VANTED, showing potential for application in research, education, and clinical settings.
{"title":"Layout of anatomical structures and blood vessels based on the foundational model of anatomy.","authors":"Niklas Gröne, Benjamin Grüneisen, Karsten Klein, Bernard de Bono, Tobias Czauderna, Falk Schreiber","doi":"10.1515/jib-2024-0023","DOIUrl":"10.1515/jib-2024-0023","url":null,"abstract":"<p><p>We present a method for the layout of anatomical structures and blood vessels based on information from the Foundational Model of Anatomy (FMA). Our approach integrates a novel vascular layout into the hierarchical treemap representation of anatomy as used in ApiNATOMY. Our method aims to improve the comprehension of complex anatomical and vascular data by providing readable visual representations. The effectiveness of our method is demonstrated through a prototype developed in VANTED, showing potential for application in research, education, and clinical settings.</p>","PeriodicalId":53625,"journal":{"name":"Journal of Integrative Bioinformatics","volume":" ","pages":""},"PeriodicalIF":1.5,"publicationDate":"2024-07-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11602228/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141602167","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-07-04eCollection Date: 2024-06-01DOI: 10.1515/jib-2023-0043
Vicente Machaca, Valeria Goyzueta, María Graciel Cruz, Erika Sejje, Luz Marina Pilco, Julio López, Yván Túpac
Cancer immunology offers a new alternative to traditional cancer treatments, such as radiotherapy and chemotherapy. One notable alternative is the development of personalized vaccines based on cancer neoantigens. Moreover, Transformers are considered a revolutionary development in artificial intelligence with a significant impact on natural language processing (NLP) tasks and have been utilized in proteomics studies in recent years. In this context, we conducted a systematic literature review to investigate how Transformers are applied in each stage of the neoantigen detection process. Additionally, we mapped current pipelines and examined the results of clinical trials involving cancer vaccines.
{"title":"Transformers meets neoantigen detection: a systematic literature review.","authors":"Vicente Machaca, Valeria Goyzueta, María Graciel Cruz, Erika Sejje, Luz Marina Pilco, Julio López, Yván Túpac","doi":"10.1515/jib-2023-0043","DOIUrl":"10.1515/jib-2023-0043","url":null,"abstract":"<p><p>Cancer immunology offers a new alternative to traditional cancer treatments, such as radiotherapy and chemotherapy. One notable alternative is the development of personalized vaccines based on cancer neoantigens. Moreover, Transformers are considered a revolutionary development in artificial intelligence with a significant impact on natural language processing (NLP) tasks and have been utilized in proteomics studies in recent years. In this context, we conducted a systematic literature review to investigate how Transformers are applied in each stage of the neoantigen detection process. Additionally, we mapped current pipelines and examined the results of clinical trials involving cancer vaccines.</p>","PeriodicalId":53625,"journal":{"name":"Journal of Integrative Bioinformatics","volume":" ","pages":""},"PeriodicalIF":1.5,"publicationDate":"2024-07-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11377031/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141499640","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-06-11eCollection Date: 2024-03-01DOI: 10.1515/jib-2024-0002
Bartholomew E Jardine, Lucian P Smith, Herbert M Sauro
We describe a web-based tool, MakeSBML (https://sys-bio.github.io/makesbml/), that provides an installation-free application for creating, editing, and searching the Biomodels repository for SBML-based models. MakeSBML is a client-based web application that translates models expressed in human-readable Antimony to the System Biology Markup Language (SBML) and vice-versa. Since MakeSBML is a web-based application it requires no installation on the user's part. Currently, MakeSBML is hosted on a GitHub page where the client-based design makes it trivial to move to other hosts. This model for software deployment also reduces maintenance costs since an active server is not required. The SBML modeling language is often used in systems biology research to describe complex biochemical networks and makes reproducing models much easier. However, SBML is designed to be computer-readable, not human-readable. We therefore employ the human-readable Antimony language to make it easy to create and edit SBML models.
{"title":"MakeSBML: a tool for converting between Antimony and SBML.","authors":"Bartholomew E Jardine, Lucian P Smith, Herbert M Sauro","doi":"10.1515/jib-2024-0002","DOIUrl":"10.1515/jib-2024-0002","url":null,"abstract":"<p><p>We describe a web-based tool, MakeSBML (https://sys-bio.github.io/makesbml/), that provides an installation-free application for creating, editing, and searching the Biomodels repository for SBML-based models. MakeSBML is a client-based web application that translates models expressed in human-readable Antimony to the System Biology Markup Language (SBML) and vice-versa. Since MakeSBML is a web-based application it requires no installation on the user's part. Currently, MakeSBML is hosted on a GitHub page where the client-based design makes it trivial to move to other hosts. This model for software deployment also reduces maintenance costs since an active server is not required. The SBML modeling language is often used in systems biology research to describe complex biochemical networks and makes reproducing models much easier. However, SBML is designed to be computer-readable, not human-readable. We therefore employ the human-readable Antimony language to make it easy to create and edit SBML models.</p>","PeriodicalId":53625,"journal":{"name":"Journal of Integrative Bioinformatics","volume":" ","pages":""},"PeriodicalIF":1.5,"publicationDate":"2024-06-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11294058/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141302067","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-05-28eCollection Date: 2024-03-01DOI: 10.1515/jib-2024-0003
Paul F Lang, Anand Jain, Christopher Rackauckas
Julia is a general purpose programming language that was designed for simplifying and accelerating numerical analysis and computational science. In particular the Scientific Machine Learning (SciML) ecosystem of Julia packages includes frameworks for high-performance symbolic-numeric computations. It allows users to automatically enhance high-level descriptions of their models with symbolic preprocessing and automatic sparsification and parallelization of computations. This enables performant solution of differential equations, efficient parameter estimation and methodologies for automated model discovery with neural differential equations and sparse identification of nonlinear dynamics. To give the systems biology community easy access to SciML, we developed SBMLToolkit.jl. SBMLToolkit.jl imports dynamic SBML models into the SciML ecosystem to accelerate model simulation and fitting of kinetic parameters. By providing computational systems biologists with easy access to the open-source Julia ecosystevnm, we hope to catalyze the development of further Julia tools in this domain and the growth of the Julia bioscience community. SBMLToolkit.jl is freely available under the MIT license. The source code is available at https://github.com/SciML/SBMLToolkit.jl.
Julia 是一种通用编程语言,旨在简化和加速数值分析和计算科学。特别是 Julia 软件包的科学机器学习(SciML)生态系统包括高性能符号数值计算框架。它允许用户通过符号预处理、自动稀疏化和并行化计算,自动增强模型的高级描述。这样就能实现微分方程的高效求解、高效参数估计以及利用神经微分方程和非线性动力学稀疏识别自动发现模型的方法。为了让系统生物学界能方便地使用 SciML,我们开发了 SBMLToolkit.jl。SBMLToolkit.jl 将动态 SBML 模型导入 SciML 生态系统,以加速模型模拟和动力学参数拟合。我们希望通过为计算系统生物学家提供对开源 Julia 生态系统的便捷访问,促进该领域更多 Julia 工具的开发和 Julia 生物科学社区的发展。SBMLToolkit.jl 在 MIT 许可下免费提供。源代码可从 https://github.com/SciML/SBMLToolkit.jl 获取。
{"title":"SBMLToolkit.jl: a Julia package for importing SBML into the SciML ecosystem.","authors":"Paul F Lang, Anand Jain, Christopher Rackauckas","doi":"10.1515/jib-2024-0003","DOIUrl":"10.1515/jib-2024-0003","url":null,"abstract":"<p><p>Julia is a general purpose programming language that was designed for simplifying and accelerating numerical analysis and computational science. In particular the Scientific Machine Learning (SciML) ecosystem of Julia packages includes frameworks for high-performance symbolic-numeric computations. It allows users to automatically enhance high-level descriptions of their models with symbolic preprocessing and automatic sparsification and parallelization of computations. This enables performant solution of differential equations, efficient parameter estimation and methodologies for automated model discovery with neural differential equations and sparse identification of nonlinear dynamics. To give the systems biology community easy access to SciML, we developed SBMLToolkit.jl. SBMLToolkit.jl imports dynamic SBML models into the SciML ecosystem to accelerate model simulation and fitting of kinetic parameters. By providing computational systems biologists with easy access to the open-source Julia ecosystevnm, we hope to catalyze the development of further Julia tools in this domain and the growth of the Julia bioscience community. SBMLToolkit.jl is freely available under the MIT license. The source code is available at https://github.com/SciML/SBMLToolkit.jl.</p>","PeriodicalId":53625,"journal":{"name":"Journal of Integrative Bioinformatics","volume":" ","pages":""},"PeriodicalIF":1.5,"publicationDate":"2024-05-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11294517/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141158826","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-05-27eCollection Date: 2024-06-01DOI: 10.1515/jib-2023-0041
Paloma Tejera-Nevado, Emilio Serrano, Ana González-Herrero, Rodrigo Bermejo, Alejandro Rodríguez-González
Protein structure determination has made progress with the aid of deep learning models, enabling the prediction of protein folding from protein sequences. However, obtaining accurate predictions becomes essential in certain cases where the protein structure remains undescribed. This is particularly challenging when dealing with rare, diverse structures and complex sample preparation. Different metrics assess prediction reliability and offer insights into result strength, providing a comprehensive understanding of protein structure by combining different models. In a previous study, two proteins named ARM58 and ARM56 were investigated. These proteins contain four domains of unknown function and are present in Leishmania spp. ARM refers to an antimony resistance marker. The study's main objective is to assess the accuracy of the model's predictions, thereby providing insights into the complexities and supporting metrics underlying these findings. The analysis also extends to the comparison of predictions obtained from other species and organisms. Notably, one of these proteins shares an ortholog with Trypanosoma cruzi and Trypanosoma brucei, leading further significance to our analysis. This attempt underscored the importance of evaluating the diverse outputs from deep learning models, facilitating comparisons across different organisms and proteins. This becomes particularly pertinent in cases where no previous structural information is available.
在深度学习模型的帮助下,蛋白质结构测定取得了进展,能够根据蛋白质序列预测蛋白质折叠。然而,在某些蛋白质结构仍未被描述的情况下,获得准确的预测变得至关重要。在处理罕见、多样的结构和复杂的样品制备时,这尤其具有挑战性。不同的指标可以评估预测的可靠性并深入了解预测结果的强度,通过结合不同的模型提供对蛋白质结构的全面了解。在之前的一项研究中,对名为 ARM58 和 ARM56 的两种蛋白质进行了研究。这两个蛋白含有四个功能未知的结构域,存在于利什曼原虫中。 ARM 指的是抗锑标记。研究的主要目的是评估模型预测的准确性,从而深入了解这些发现背后的复杂性和支持性指标。分析还扩展到了与其他物种和生物的预测结果进行比较。值得注意的是,其中一个蛋白质与克鲁斯锥虫和布氏锥虫有一个同源物,这为我们的分析带来了进一步的意义。这一尝试强调了评估深度学习模型不同输出结果的重要性,有助于在不同生物体和蛋白质之间进行比较。在没有先前结构信息的情况下,这一点尤为重要。
{"title":"Unlocking the power of AI models: exploring protein folding prediction through comparative analysis.","authors":"Paloma Tejera-Nevado, Emilio Serrano, Ana González-Herrero, Rodrigo Bermejo, Alejandro Rodríguez-González","doi":"10.1515/jib-2023-0041","DOIUrl":"10.1515/jib-2023-0041","url":null,"abstract":"<p><p>Protein structure determination has made progress with the aid of deep learning models, enabling the prediction of protein folding from protein sequences. However, obtaining accurate predictions becomes essential in certain cases where the protein structure remains undescribed. This is particularly challenging when dealing with rare, diverse structures and complex sample preparation. Different metrics assess prediction reliability and offer insights into result strength, providing a comprehensive understanding of protein structure by combining different models. In a previous study, two proteins named ARM58 and ARM56 were investigated. These proteins contain four domains of unknown function and are present in <i>Leishmania</i> spp. ARM refers to an antimony resistance marker. The study's main objective is to assess the accuracy of the model's predictions, thereby providing insights into the complexities and supporting metrics underlying these findings. The analysis also extends to the comparison of predictions obtained from other species and organisms. Notably, one of these proteins shares an ortholog with <i>Trypanosoma cruzi</i> and <i>Trypanosoma brucei</i>, leading further significance to our analysis. This attempt underscored the importance of evaluating the diverse outputs from deep learning models, facilitating comparisons across different organisms and proteins. This becomes particularly pertinent in cases where no previous structural information is available.</p>","PeriodicalId":53625,"journal":{"name":"Journal of Integrative Bioinformatics","volume":" ","pages":""},"PeriodicalIF":1.5,"publicationDate":"2024-05-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11377126/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141155977","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}