Biclustering is a non-supervised data mining technique used to analyze gene expression data, it consists to classify subgroups of genes that have similar behavior under subgroups of conditions. The classified genes can have independent behavior under other subgroups of conditions. Discovering such co-expressed genes, called biclusters, can be helpful to find specific biological features such as gene interactions under different circumstances. Compared to clustering, biclustering has two main characteristics: bi-dimensionality which means grouping both genes and conditions simultaneously and overlapping which means allowing genes to be in more than one bicluster at the same time. Biclustering algorithms, which continue to be developed at a constant pace, give as output a large number of overlapping biclusters. Visualizing groups of biclusters is still a non-trivial task due to their overlapping. In this paper, we present the most interesting techniques to visualize groups of biclusters and evaluate them.
{"title":"An evaluation study of biclusters visualization techniques of gene expression data.","authors":"Haithem Aouabed, Mourad Elloumi, Rodrigo Santamaría","doi":"10.1515/jib-2021-0019","DOIUrl":"https://doi.org/10.1515/jib-2021-0019","url":null,"abstract":"<p><p><i>Biclustering</i> is a non-supervised data mining technique used to analyze gene expression data, it consists to classify subgroups of genes that have similar behavior under subgroups of conditions. The classified genes can have independent behavior under other subgroups of conditions. Discovering such co-expressed genes, called <i>biclusters</i>, can be helpful to find specific biological features such as gene interactions under different circumstances. Compared to clustering, biclustering has two main characteristics: <i>bi-dimensionality</i> which means grouping both genes and conditions simultaneously and <i>overlapping</i> which means allowing genes to be in more than one bicluster at the same time. Biclustering algorithms, which continue to be developed at a constant pace, give as output a large number of overlapping biclusters. Visualizing groups of biclusters is still a non-trivial task due to their overlapping. In this paper, we present the most interesting techniques to visualize groups of biclusters and evaluate them.</p>","PeriodicalId":53625,"journal":{"name":"Journal of Integrative Bioinformatics","volume":null,"pages":null},"PeriodicalIF":1.9,"publicationDate":"2021-10-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8709740/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"39558960","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Falk Schreiber, Padraig Gleeson, Martin Golebiewski, Thomas E Gorochowski, Michael Hucka, Sarah M Keating, Matthias König, Chris J Myers, David P Nickerson, Björn Sommer, Dagmar Waltemath
This special issue of the Journal of Integrative Bioinformatics contains updated specifications of COMBINE standards in systems and synthetic biology. The 2021 special issue presents four updates of standards: Synthetic Biology Open Language Visual Version 2.3, Synthetic Biology Open Language Visual Version 3.0, Simulation Experiment Description Markup Language Level 1 Version 4, and OMEX Metadata specification Version 1.2. This document can also be consulted to identify the latest specifications of all COMBINE standards.
{"title":"Specifications of standards in systems and synthetic biology: status and developments in 2021.","authors":"Falk Schreiber, Padraig Gleeson, Martin Golebiewski, Thomas E Gorochowski, Michael Hucka, Sarah M Keating, Matthias König, Chris J Myers, David P Nickerson, Björn Sommer, Dagmar Waltemath","doi":"10.1515/jib-2021-0026","DOIUrl":"10.1515/jib-2021-0026","url":null,"abstract":"<p><p>This special issue of the <i>Journal of Integrative Bioinformatics</i> contains updated specifications of COMBINE standards in systems and synthetic biology. The 2021 special issue presents four updates of standards: Synthetic Biology Open Language Visual Version 2.3, Synthetic Biology Open Language Visual Version 3.0, Simulation Experiment Description Markup Language Level 1 Version 4, and OMEX Metadata specification Version 1.2. This document can also be consulted to identify the latest specifications of all COMBINE standards.</p>","PeriodicalId":53625,"journal":{"name":"Journal of Integrative Bioinformatics","volume":null,"pages":null},"PeriodicalIF":1.5,"publicationDate":"2021-10-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8573232/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"39540482","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Hasan Baig, Pedro Fontanarossa, James McLaughlin, James Scott-Brown, Prashant Vaidyanathan, Thomas Gorochowski, Goksel Misirli, Jacob Beal, Chris Myers
People who engineer biological organisms often find it useful to draw diagrams in order to communicate both the structure of the nucleic acid sequences that they are engineering and the functional relationships between sequence features and other molecular species. Some typical practices and conventions have begun to emerge for such diagrams. SBOL Visual aims to organize and systematize such conventions in order to produce a coherent language for expressing the structure and function of genetic designs. This document details version 3.0 of SBOL Visual, a new major revision of the standard. The major difference between SBOL Visual 3 and SBOL Visual 2 is that diagrams and glyphs are defined with respect to the SBOL 3 data model rather than the SBOL 2 data model. A byproduct of this change is that the use of dashed undirected lines for subsystem mappings has been removed, pending future determination on how to represent general SBOL 3 constraints; in the interim, this annotation can still be used as an annotation. Finally, deprecated material has been removed from collection of glyphs: the deprecated "insulator" glyph and "macromolecule" alternative glyphs have been removed, as have the deprecated BioPAX alternatives to SBO terms.
{"title":"Synthetic biology open language visual (SBOL visual) version 3.0.","authors":"Hasan Baig, Pedro Fontanarossa, James McLaughlin, James Scott-Brown, Prashant Vaidyanathan, Thomas Gorochowski, Goksel Misirli, Jacob Beal, Chris Myers","doi":"10.1515/jib-2021-0013","DOIUrl":"https://doi.org/10.1515/jib-2021-0013","url":null,"abstract":"<p><p>People who engineer biological organisms often find it useful to draw diagrams in order to communicate both the structure of the nucleic acid sequences that they are engineering and the functional relationships between sequence features and other molecular species. Some typical practices and conventions have begun to emerge for such diagrams. SBOL Visual aims to organize and systematize such conventions in order to produce a coherent language for expressing the structure and function of genetic designs. This document details version 3.0 of SBOL Visual, a new major revision of the standard. The major difference between SBOL Visual 3 and SBOL Visual 2 is that diagrams and glyphs are defined with respect to the SBOL 3 data model rather than the SBOL 2 data model. A byproduct of this change is that the use of dashed undirected lines for subsystem mappings has been removed, pending future determination on how to represent general SBOL 3 constraints; in the interim, this annotation can still be used as an annotation. Finally, deprecated material has been removed from collection of glyphs: the deprecated \"insulator\" glyph and \"macromolecule\" alternative glyphs have been removed, as have the deprecated BioPAX alternatives to SBO terms.</p>","PeriodicalId":53625,"journal":{"name":"Journal of Integrative Bioinformatics","volume":null,"pages":null},"PeriodicalIF":1.9,"publicationDate":"2021-10-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8560346/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"39532713","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
John H Gennari, Matthias König, Goksel Misirli, Maxwell L Neal, David P Nickerson, Dagmar Waltemath
A standardized approach to annotating computational biomedical models and their associated files can facilitate model reuse and reproducibility among research groups, enhance search and retrieval of models and data, and enable semantic comparisons between models. Motivated by these potential benefits and guided by consensus across the COmputational Modeling in BIology NEtwork (COMBINE) community, we have developed a specification for encoding annotations in Open Modeling and EXchange (OMEX)-formatted archives. This document details version 1.2 of the specification, which builds on version 1.0 published last year in this journal. In particular, this version includes a set of initial model-level annotations (whereas v 1.0 described exclusively annotations at a smaller scale). Additionally, this version uses best practices for namespaces, and introduces omex-library.org as a common root for all annotations. Distributing modeling projects within an OMEX archive is a best practice established by COMBINE, and the OMEX metadata specification presented here provides a harmonized, community-driven approach for annotating a variety of standardized model representations. This specification acts as a technical guideline for developing software tools that can support this standard, and thereby encourages broad advances in model reuse, discovery, and semantic analyses.
{"title":"OMEX metadata specification (version 1.2).","authors":"John H Gennari, Matthias König, Goksel Misirli, Maxwell L Neal, David P Nickerson, Dagmar Waltemath","doi":"10.1515/jib-2021-0020","DOIUrl":"https://doi.org/10.1515/jib-2021-0020","url":null,"abstract":"<p><p>A standardized approach to annotating computational biomedical models and their associated files can facilitate model reuse and reproducibility among research groups, enhance search and retrieval of models and data, and enable semantic comparisons between models. Motivated by these potential benefits and guided by consensus across the COmputational Modeling in BIology NEtwork (COMBINE) community, we have developed a specification for encoding annotations in Open Modeling and EXchange (OMEX)-formatted archives. This document details version 1.2 of the specification, which builds on version 1.0 published last year in this journal. In particular, this version includes a set of initial model-level annotations (whereas v 1.0 described exclusively annotations at a smaller scale). Additionally, this version uses best practices for namespaces, and introduces omex-library.org as a common root for all annotations. Distributing modeling projects within an OMEX archive is a best practice established by COMBINE, and the OMEX metadata specification presented here provides a harmonized, community-driven approach for annotating a variety of standardized model representations. This specification acts as a technical guideline for developing software tools that can support this standard, and thereby encourages broad advances in model reuse, discovery, and semantic analyses.</p>","PeriodicalId":53625,"journal":{"name":"Journal of Integrative Bioinformatics","volume":null,"pages":null},"PeriodicalIF":1.9,"publicationDate":"2021-10-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8560343/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"39534730","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Lucian P Smith, Frank T Bergmann, Alan Garny, Tomáš Helikar, Jonathan Karr, David Nickerson, Herbert Sauro, Dagmar Waltemath, Matthias König
Computational simulation experiments increasingly inform modern biological research, and bring with them the need to provide ways to annotate, archive, share and reproduce the experiments performed. These simulations increasingly require extensive collaboration among modelers, experimentalists, and engineers. The Minimum Information About a Simulation Experiment (MIASE) guidelines outline the information needed to share simulation experiments. SED-ML is a computer-readable format for the information outlined by MIASE, created as a community project and supported by many investigators and software tools. The first versions of SED-ML focused on deterministic and stochastic simulations of models. Level 1 Version 4 of SED-ML substantially expands these capabilities to cover additional types of models, model languages, parameter estimations, simulations and analyses of models, and analyses and visualizations of simulation results. To facilitate consistent practices across the community, Level 1 Version 4 also more clearly describes the use of SED-ML constructs, and includes numerous concrete validation rules. SED-ML is supported by a growing ecosystem of investigators, model languages, and software tools, including eight languages for constraint-based, kinetic, qualitative, rule-based, and spatial models, over 20 simulation tools, visual editors, model repositories, and validators. Additional information about SED-ML is available at https://sed-ml.org/.
计算模拟实验越来越多地为现代生物学研究提供信息,并带来了提供注释、存档、共享和复制实验的方法的需求。这些模拟越来越需要建模者、实验家和工程师之间的广泛合作。关于模拟实验的最小信息(MIASE)指南概述了共享模拟实验所需的信息。SED-ML是MIASE概述的信息的计算机可读格式,作为一个社区项目创建,并得到许多研究人员和软件工具的支持。第一个版本的SED-ML侧重于模型的确定性和随机模拟。Level 1版本4的SED-ML实质上扩展了这些功能,以涵盖其他类型的模型、模型语言、参数估计、模型的仿真和分析,以及仿真结果的分析和可视化。为了促进整个社区的一致实践,Level 1 Version 4还更清楚地描述了SED-ML结构的使用,并包含了许多具体的验证规则。SED-ML由不断增长的研究人员、模型语言和软件工具的生态系统支持,包括八种基于约束的、动态的、定性的、基于规则的和空间模型的语言,超过20种仿真工具、可视化编辑器、模型存储库和验证器。有关SED-ML的更多信息,请访问https://sed-ml.org/。
{"title":"The simulation experiment description markup language (SED-ML): language specification for level 1 version 4.","authors":"Lucian P Smith, Frank T Bergmann, Alan Garny, Tomáš Helikar, Jonathan Karr, David Nickerson, Herbert Sauro, Dagmar Waltemath, Matthias König","doi":"10.1515/jib-2021-0021","DOIUrl":"https://doi.org/10.1515/jib-2021-0021","url":null,"abstract":"<p><p>Computational simulation experiments increasingly inform modern biological research, and bring with them the need to provide ways to annotate, archive, share and reproduce the experiments performed. These simulations increasingly require extensive collaboration among modelers, experimentalists, and engineers. The Minimum Information About a Simulation Experiment (MIASE) guidelines outline the information needed to share simulation experiments. SED-ML is a computer-readable format for the information outlined by MIASE, created as a community project and supported by many investigators and software tools. The first versions of SED-ML focused on deterministic and stochastic simulations of models. Level 1 Version 4 of SED-ML substantially expands these capabilities to cover additional types of models, model languages, parameter estimations, simulations and analyses of models, and analyses and visualizations of simulation results. To facilitate consistent practices across the community, Level 1 Version 4 also more clearly describes the use of SED-ML constructs, and includes numerous concrete validation rules. SED-ML is supported by a growing ecosystem of investigators, model languages, and software tools, including eight languages for constraint-based, kinetic, qualitative, rule-based, and spatial models, over 20 simulation tools, visual editors, model repositories, and validators. Additional information about SED-ML is available at https://sed-ml.org/.</p>","PeriodicalId":53625,"journal":{"name":"Journal of Integrative Bioinformatics","volume":null,"pages":null},"PeriodicalIF":1.9,"publicationDate":"2021-10-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8560344/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"40324428","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Reyes-González Juan Pablo, Díaz-Peregrino Roberto, Soto-Ulloa Victor, Galvan-Remigio Isabel, Castillo Paul, Ogando-Rivas Elizabeth
In the last decades big data has facilitating and improving our daily duties in the medical research and clinical fields; the strategy to get to this point is understanding how to organize and analyze the data in order to accomplish the final goal that is improving healthcare system, in terms of cost and benefits, quality of life and outcome patient. The main objective of this review is to illustrate the state-of-art of big data in healthcare, its features and architecture. We also would like to demonstrate the different application and principal mechanisms of big data in the latest technologies known as blockchain and artificial intelligence, recognizing their benefits and limitations. Perhaps, medical education and digital anatomy are unexplored fields that might be profitable to investigate as we are proposing. The healthcare system can be revolutionized using these different technologies. Thus, we are explaining the basis of these systems focused to the medical arena in order to encourage medical doctors, nurses, biotechnologies and other healthcare professions to be involved and create a more efficient and efficacy system.
{"title":"Big data in the healthcare system: a synergy with artificial intelligence and blockchain technology.","authors":"Reyes-González Juan Pablo, Díaz-Peregrino Roberto, Soto-Ulloa Victor, Galvan-Remigio Isabel, Castillo Paul, Ogando-Rivas Elizabeth","doi":"10.1515/jib-2020-0035","DOIUrl":"https://doi.org/10.1515/jib-2020-0035","url":null,"abstract":"<p><p>In the last decades big data has facilitating and improving our daily duties in the medical research and clinical fields; the strategy to get to this point is understanding how to organize and analyze the data in order to accomplish the final goal that is improving healthcare system, in terms of cost and benefits, quality of life and outcome patient. The main objective of this review is to illustrate the state-of-art of big data in healthcare, its features and architecture. We also would like to demonstrate the different application and principal mechanisms of big data in the latest technologies known as blockchain and artificial intelligence, recognizing their benefits and limitations. Perhaps, medical education and digital anatomy are unexplored fields that might be profitable to investigate as we are proposing. The healthcare system can be revolutionized using these different technologies. Thus, we are explaining the basis of these systems focused to the medical arena in order to encourage medical doctors, nurses, biotechnologies and other healthcare professions to be involved and create a more efficient and efficacy system.</p>","PeriodicalId":53625,"journal":{"name":"Journal of Integrative Bioinformatics","volume":null,"pages":null},"PeriodicalIF":1.9,"publicationDate":"2021-08-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9135137/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"39326687","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Mei Yen Man, Mohd Saberi Mohamad, Yee Wen Choon, Mohd Arfian Ismail
Microorganisms commonly produce many high-demand industrial products like fuels, food, vitamins, and other chemicals. Microbial strains are the strains of microorganisms, which can be optimized to improve their technological properties through metabolic engineering. Metabolic engineering is the process of overcoming cellular regulation in order to achieve a desired product or to generate a new product that the host cells do not usually need to produce. The prediction of genetic manipulations such as gene knockout is part of metabolic engineering. Gene knockout can be used to optimize the microbial strains, such as to maximize the production rate of chemicals of interest. Metabolic and genetic engineering is important in producing the chemicals of interest as, without them, the product yields of many microorganisms are normally low. As a result, the aim of this paper is to propose a combination of the Bat algorithm and the minimization of metabolic adjustment (BATMOMA) to predict which genes to knock out in order to increase the succinate and lactate production rates in Escherichia coli (E. coli).
{"title":"<i>In silico</i> gene knockout prediction using a hybrid of Bat algorithm and minimization of metabolic adjustment.","authors":"Mei Yen Man, Mohd Saberi Mohamad, Yee Wen Choon, Mohd Arfian Ismail","doi":"10.1515/jib-2020-0037","DOIUrl":"https://doi.org/10.1515/jib-2020-0037","url":null,"abstract":"<p><p>Microorganisms commonly produce many high-demand industrial products like fuels, food, vitamins, and other chemicals. Microbial strains are the strains of microorganisms, which can be optimized to improve their technological properties through metabolic engineering. Metabolic engineering is the process of overcoming cellular regulation in order to achieve a desired product or to generate a new product that the host cells do not usually need to produce. The prediction of genetic manipulations such as gene knockout is part of metabolic engineering. Gene knockout can be used to optimize the microbial strains, such as to maximize the production rate of chemicals of interest. Metabolic and genetic engineering is important in producing the chemicals of interest as, without them, the product yields of many microorganisms are normally low. As a result, the aim of this paper is to propose a combination of the Bat algorithm and the minimization of metabolic adjustment (BATMOMA) to predict which genes to knock out in order to increase the succinate and lactate production rates in <i>Escherichia coli</i> (<i>E. coli</i>).</p>","PeriodicalId":53625,"journal":{"name":"Journal of Integrative Bioinformatics","volume":null,"pages":null},"PeriodicalIF":1.9,"publicationDate":"2021-08-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8573224/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"39277083","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
The Protein Data Bank (PDB) today contains more than 174,000 entries with the 3-dimensional structures of biological macromolecules. Using the rich resources of this repository, it is possible identifying subsets with specific, interesting properties for different applications. Our research group prepared an automatically updated list of amyloid- and probably amyloidogenic molecules, the PDB_Amyloid collection, which is freely available at the address http://pitgroup.org/amyloid. This resource applies exclusively the geometric properties of the steric structures for identifying amyloids. In the present contribution, we analyze the starting (i.e., prefix) subsequences of the characteristic, parallel beta-sheets of the structures in the PDB_Amyloid collection, and identify further appearances of these length-5 prefix subsequences in the whole PDB data set. We have identified this way numerous proteins, whose normal or irregular functions involve amyloid formation, structural misfolding, or anti-coagulant properties, simply by containing these prefixes: including the T-cell receptor (TCR), bound with the major histocompatibility complexes MHC-1 and MHC-2; the p53 tumor suppressor protein; a mycobacterial RNA polymerase transcription initialization complex; the human bridging integrator protein BIN-1; and the tick anti-coagulant peptide TAP.
{"title":"On the border of the amyloidogenic sequences: prefix analysis of the parallel beta sheets in the PDB_Amyloid collection.","authors":"Kristóf Takács, Vince Grolmusz","doi":"10.1515/jib-2020-0043","DOIUrl":"https://doi.org/10.1515/jib-2020-0043","url":null,"abstract":"<p><p>The Protein Data Bank (PDB) today contains more than 174,000 entries with the 3-dimensional structures of biological macromolecules. Using the rich resources of this repository, it is possible identifying subsets with specific, interesting properties for different applications. Our research group prepared an automatically updated list of amyloid- and probably amyloidogenic molecules, the PDB_Amyloid collection, which is freely available at the address http://pitgroup.org/amyloid. This resource applies exclusively the geometric properties of the steric structures for identifying amyloids. In the present contribution, we analyze the starting (i.e., prefix) subsequences of the characteristic, parallel beta-sheets of the structures in the PDB_Amyloid collection, and identify further appearances of these length-5 prefix subsequences in the whole PDB data set. We have identified this way numerous proteins, whose normal or irregular functions involve amyloid formation, structural misfolding, or anti-coagulant properties, simply by containing these prefixes: including the T-cell receptor (TCR), bound with the major histocompatibility complexes MHC-1 and MHC-2; the p53 tumor suppressor protein; a mycobacterial RNA polymerase transcription initialization complex; the human bridging integrator protein BIN-1; and the tick anti-coagulant peptide TAP.</p>","PeriodicalId":53625,"journal":{"name":"Journal of Integrative Bioinformatics","volume":null,"pages":null},"PeriodicalIF":1.9,"publicationDate":"2021-07-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1515/jib-2020-0043","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"39217075","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Hasan Baig, Pedro Fontanarossa, Vishwesh Kulkarni, James McLaughlin, Prashant Vaidyanathan, Bryan Bartley, Shyam Bhakta, Swapnil Bhatia, Mike Bissell, Kevin Clancy, Robert Sidney Cox, Angel Goñi Moreno, Thomas Gorochowski, Raik Grunberg, Jihwan Lee, Augustin Luna, Curtis Madsen, Goksel Misirli, Tramy Nguyen, Nicolas Le Novere, Zachary Palchick, Matthew Pocock, Nicholas Roehner, Herbert Sauro, James Scott-Brown, John T Sexton, Guy-Bart Stan, Jeffrey J Tabor, Logan Terry, Marta Vazquez Vilar, Christopher A Voigt, Anil Wipat, David Zong, Zach Zundel, Jacob Beal, Chris Myers
People who are engineering biological organisms often find it useful to communicate in diagrams, both about the structure of the nucleic acid sequences that they are engineering and about the functional relationships between sequence features and other molecular species. Some typical practices and conventions have begun to emerge for such diagrams. The Synthetic Biology Open Language Visual (SBOL Visual) has been developed as a standard for organizing and systematizing such conventions in order to produce a coherent language for expressing the structure and function of genetic designs. This document details version 2.3 of SBOL Visual, which builds on the prior SBOL Visual 2.2 in several ways. First, the specification now includes higher-level "interactions with interactions," such as an inducer molecule stimulating a repression interaction. Second, binding with a nucleic acid backbone can be shown by overlapping glyphs, as with other molecular complexes. Finally, a new "unspecified interaction" glyph is added for visualizing interactions whose nature is unknown, the "insulator" glyph is deprecated in favor of a new "inert DNA spacer" glyph, and the polypeptide region glyph is recommended for showing 2A sequences.
{"title":"Synthetic biology open language visual (SBOL Visual) version 2.3.","authors":"Hasan Baig, Pedro Fontanarossa, Vishwesh Kulkarni, James McLaughlin, Prashant Vaidyanathan, Bryan Bartley, Shyam Bhakta, Swapnil Bhatia, Mike Bissell, Kevin Clancy, Robert Sidney Cox, Angel Goñi Moreno, Thomas Gorochowski, Raik Grunberg, Jihwan Lee, Augustin Luna, Curtis Madsen, Goksel Misirli, Tramy Nguyen, Nicolas Le Novere, Zachary Palchick, Matthew Pocock, Nicholas Roehner, Herbert Sauro, James Scott-Brown, John T Sexton, Guy-Bart Stan, Jeffrey J Tabor, Logan Terry, Marta Vazquez Vilar, Christopher A Voigt, Anil Wipat, David Zong, Zach Zundel, Jacob Beal, Chris Myers","doi":"10.1515/jib-2020-0045","DOIUrl":"https://doi.org/10.1515/jib-2020-0045","url":null,"abstract":"<p><p>People who are engineering biological organisms often find it useful to communicate in diagrams, both about the structure of the nucleic acid sequences that they are engineering and about the functional relationships between sequence features and other molecular species. Some typical practices and conventions have begun to emerge for such diagrams. The Synthetic Biology Open Language Visual (SBOL Visual) has been developed as a standard for organizing and systematizing such conventions in order to produce a coherent language for expressing the structure and function of genetic designs. This document details version 2.3 of SBOL Visual, which builds on the prior SBOL Visual 2.2 in several ways. First, the specification now includes higher-level \"interactions with interactions,\" such as an inducer molecule stimulating a repression interaction. Second, binding with a nucleic acid backbone can be shown by overlapping glyphs, as with other molecular complexes. Finally, a new \"unspecified interaction\" glyph is added for visualizing interactions whose nature is unknown, the \"insulator\" glyph is deprecated in favor of a new \"inert DNA spacer\" glyph, and the polypeptide region glyph is recommended for showing 2A sequences.</p>","PeriodicalId":53625,"journal":{"name":"Journal of Integrative Bioinformatics","volume":null,"pages":null},"PeriodicalIF":1.9,"publicationDate":"2021-06-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1515/jib-2020-0045","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"39071280","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Some species of cover crops produce phenolic compounds with allelopathic potential. The use of math, statistical and computational tools to analyze data obtained with spectrophotometry can assist in the chemical profile discrimination to choose which species and cultivation are the best for weed management purposes. The aim of this study was to perform exploratory and discriminant analysis using R package specmine on the phenolic profile of Secale cereale L., Avena strigosa L. and Raphanus sativus L. shoots obtained by UV-vis scanning spectrophotometry. Plants were collected at 60, 80 and 100 days after sowing and at 15 and 30 days after rolling in experiment in Brazil. Exploratory and discriminant analysis, namely principal component analysis, hierarchical clustering analysis, t-test, fold-change, analysis of variance and supervised machine learning analysis were performed. Results showed a stronger tendency to cluster phenolic profiles according to plant species rather than crop management system, period of sampling or plant phenologic stage. PCA analysis showed a strong distinction of S. cereale L. and A. strigosa L. 30 days after rolling. Due to the fast analysis and friendly use, the R package specmine can be recommended as a supporting tool to exploratory and discriminatory analysis of multivariate data.
某些种类的覆盖作物会产生具有等位潜力的酚类化合物。使用数学、统计和计算工具分析分光光度法获得的数据,有助于对化学特征进行判别,从而选择最适合杂草管理的品种和种植方式。本研究的目的是使用 R 软件包 specmine,对紫外可见扫描分光光度法获得的山苍子(Secale cereale L.)、燕麦(Avena strigosa L.)和油菜(Raphanus sativus L.)嫩枝的酚类特征进行探索性分析和判别分析。在巴西的实验中,分别在播种后 60 天、80 天和 100 天,以及滚动后 15 天和 30 天采集植物。进行了探索性分析和判别分析,即主成分分析、层次聚类分析、t 检验、折变分析、方差分析和监督机器学习分析。结果表明,根据植物种类而不是作物管理系统、采样时期或植物物候期对酚类特征进行聚类的趋势更强。PCA 分析表明,轧制 30 天后的 S. cereale L. 和 A. strigosa L. 有很强的区别。由于 R 软件包 specmine 分析速度快、使用方便,建议将其作为多变量数据探索性和鉴别性分析的辅助工具。
{"title":"Exploratory and discriminant analysis of plant phenolic profiles obtained by UV-vis scanning spectroscopy.","authors":"Monique Souza, Jucinei José Comin, Rodolfo Moresco, Marcelo Maraschin, Claudinei Kurtz, Paulo Emílio Lovato, Cledimar Rogério Lourenzi, Fernanda Kokowicz Pilatti, Arcângelo Loss, Shirley Kuhnen","doi":"10.1515/jib-2019-0056","DOIUrl":"10.1515/jib-2019-0056","url":null,"abstract":"<p><p>Some species of cover crops produce phenolic compounds with allelopathic potential. The use of math, statistical and computational tools to analyze data obtained with spectrophotometry can assist in the chemical profile discrimination to choose which species and cultivation are the best for weed management purposes. The aim of this study was to perform exploratory and discriminant analysis using R package specmine on the phenolic profile of <i>Secale cereale</i> L., <i>Avena strigosa</i> L. and <i>Raphanus sativus</i> L. shoots obtained by UV-vis scanning spectrophotometry. Plants were collected at 60, 80 and 100 days after sowing and at 15 and 30 days after rolling in experiment in Brazil. Exploratory and discriminant analysis, namely principal component analysis, hierarchical clustering analysis, <i>t</i>-test, fold-change, analysis of variance and supervised machine learning analysis were performed. Results showed a stronger tendency to cluster phenolic profiles according to plant species rather than crop management system, period of sampling or plant phenologic stage. PCA analysis showed a strong distinction of <i>S. cereale</i> L. and <i>A. strigosa</i> L. 30 days after rolling. Due to the fast analysis and friendly use, the R package specmine can be recommended as a supporting tool to exploratory and discriminatory analysis of multivariate data.</p>","PeriodicalId":53625,"journal":{"name":"Journal of Integrative Bioinformatics","volume":null,"pages":null},"PeriodicalIF":1.9,"publicationDate":"2021-06-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8573236/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"38992260","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}