首页 > 最新文献

Journal of Integrative Bioinformatics最新文献

英文 中文
Unlocking the power of AI models: exploring protein folding prediction through comparative analysis. 释放人工智能模型的力量:通过比较分析探索蛋白质折叠预测。
IF 1.5 Q3 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2024-05-27 eCollection Date: 2024-06-01 DOI: 10.1515/jib-2023-0041
Paloma Tejera-Nevado, Emilio Serrano, Ana González-Herrero, Rodrigo Bermejo, Alejandro Rodríguez-González

Protein structure determination has made progress with the aid of deep learning models, enabling the prediction of protein folding from protein sequences. However, obtaining accurate predictions becomes essential in certain cases where the protein structure remains undescribed. This is particularly challenging when dealing with rare, diverse structures and complex sample preparation. Different metrics assess prediction reliability and offer insights into result strength, providing a comprehensive understanding of protein structure by combining different models. In a previous study, two proteins named ARM58 and ARM56 were investigated. These proteins contain four domains of unknown function and are present in Leishmania spp. ARM refers to an antimony resistance marker. The study's main objective is to assess the accuracy of the model's predictions, thereby providing insights into the complexities and supporting metrics underlying these findings. The analysis also extends to the comparison of predictions obtained from other species and organisms. Notably, one of these proteins shares an ortholog with Trypanosoma cruzi and Trypanosoma brucei, leading further significance to our analysis. This attempt underscored the importance of evaluating the diverse outputs from deep learning models, facilitating comparisons across different organisms and proteins. This becomes particularly pertinent in cases where no previous structural information is available.

在深度学习模型的帮助下,蛋白质结构测定取得了进展,能够根据蛋白质序列预测蛋白质折叠。然而,在某些蛋白质结构仍未被描述的情况下,获得准确的预测变得至关重要。在处理罕见、多样的结构和复杂的样品制备时,这尤其具有挑战性。不同的指标可以评估预测的可靠性并深入了解预测结果的强度,通过结合不同的模型提供对蛋白质结构的全面了解。在之前的一项研究中,对名为 ARM58 和 ARM56 的两种蛋白质进行了研究。这两个蛋白含有四个功能未知的结构域,存在于利什曼原虫中。 ARM 指的是抗锑标记。研究的主要目的是评估模型预测的准确性,从而深入了解这些发现背后的复杂性和支持性指标。分析还扩展到了与其他物种和生物的预测结果进行比较。值得注意的是,其中一个蛋白质与克鲁斯锥虫和布氏锥虫有一个同源物,这为我们的分析带来了进一步的意义。这一尝试强调了评估深度学习模型不同输出结果的重要性,有助于在不同生物体和蛋白质之间进行比较。在没有先前结构信息的情况下,这一点尤为重要。
{"title":"Unlocking the power of AI models: exploring protein folding prediction through comparative analysis.","authors":"Paloma Tejera-Nevado, Emilio Serrano, Ana González-Herrero, Rodrigo Bermejo, Alejandro Rodríguez-González","doi":"10.1515/jib-2023-0041","DOIUrl":"10.1515/jib-2023-0041","url":null,"abstract":"<p><p>Protein structure determination has made progress with the aid of deep learning models, enabling the prediction of protein folding from protein sequences. However, obtaining accurate predictions becomes essential in certain cases where the protein structure remains undescribed. This is particularly challenging when dealing with rare, diverse structures and complex sample preparation. Different metrics assess prediction reliability and offer insights into result strength, providing a comprehensive understanding of protein structure by combining different models. In a previous study, two proteins named ARM58 and ARM56 were investigated. These proteins contain four domains of unknown function and are present in <i>Leishmania</i> spp. ARM refers to an antimony resistance marker. The study's main objective is to assess the accuracy of the model's predictions, thereby providing insights into the complexities and supporting metrics underlying these findings. The analysis also extends to the comparison of predictions obtained from other species and organisms. Notably, one of these proteins shares an ortholog with <i>Trypanosoma cruzi</i> and <i>Trypanosoma brucei</i>, leading further significance to our analysis. This attempt underscored the importance of evaluating the diverse outputs from deep learning models, facilitating comparisons across different organisms and proteins. This becomes particularly pertinent in cases where no previous structural information is available.</p>","PeriodicalId":53625,"journal":{"name":"Journal of Integrative Bioinformatics","volume":null,"pages":null},"PeriodicalIF":1.5,"publicationDate":"2024-05-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141155977","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The simulation experiment description markup language (SED-ML): language specification for level 1 version 5. 模拟实验描述标记语言(SED-ML):第 1 级第 5 版语言规范。
IF 1.9 Q1 Medicine Pub Date : 2024-04-15 DOI: 10.1515/jib-2024-0008
Lucian P. Smith, Frank T. Bergmann, Alan Garny, Tomáš Helikar, Jonathan Karr, D. Nickerson, Herbert M. Sauro, Dagmar Waltemath, Matthias König
Modern biological research is increasingly informed by computational simulation experiments, which necessitate the development of methods for annotating, archiving, sharing, and reproducing the conducted experiments. These simulations increasingly require extensive collaboration among modelers, experimentalists, and engineers. The Minimum Information About a Simulation Experiment (MIASE) guidelines outline the information needed to share simulation experiments. SED-ML is a computer-readable format for the information outlined by MIASE, created as a community project and supported by many investigators and software tools. Level 1 Version 5 of SED-ML expands the ability of modelers to define simulations in SED-ML using the Kinetic Simulation Algorithm Onotoloy (KiSAO). While it was possible in Version 4 to define a simulation entirely using KiSAO, Version 5 now allows users to define tasks, model changes, ranges, and outputs using the ontology as well. SED-ML is supported by a growing ecosystem of investigators, model languages, and software tools, including various languages for constraint-based, kinetic, qualitative, rule-based, and spatial models, and many simulation tools, visual editors, model repositories, and validators. Additional information about SED-ML is available at https://sed-ml.org/.
现代生物学研究越来越多地借助于计算模拟实验,这就需要开发注释、归档、共享和复制所进行实验的方法。这些模拟实验越来越需要建模人员、实验人员和工程师之间的广泛合作。模拟实验最低限度信息(MIASE)指南概述了共享模拟实验所需的信息。SED-ML 是 MIASE 概述的信息的计算机可读格式,作为一个社区项目创建,并得到许多研究人员和软件工具的支持。SED-ML 第 5 版扩展了建模者在 SED-ML 中使用动力学模拟算法 Onotoloy (KiSAO) 定义模拟的能力。在第 4 版中,用户可以完全使用 KiSAO 来定义模拟,而第 5 版现在也允许用户使用本体来定义任务、模型变化、范围和输出。SED-ML 得到了研究人员、模型语言和软件工具等日益壮大的生态系统的支持,包括各种基于约束、动力学、定性、基于规则和空间模型的语言,以及许多仿真工具、可视化编辑器、模型库和验证器。有关 SED-ML 的更多信息,请访问 https://sed-ml.org/。
{"title":"The simulation experiment description markup language (SED-ML): language specification for level 1 version 5.","authors":"Lucian P. Smith, Frank T. Bergmann, Alan Garny, Tomáš Helikar, Jonathan Karr, D. Nickerson, Herbert M. Sauro, Dagmar Waltemath, Matthias König","doi":"10.1515/jib-2024-0008","DOIUrl":"https://doi.org/10.1515/jib-2024-0008","url":null,"abstract":"Modern biological research is increasingly informed by computational simulation experiments, which necessitate the development of methods for annotating, archiving, sharing, and reproducing the conducted experiments. These simulations increasingly require extensive collaboration among modelers, experimentalists, and engineers. The Minimum Information About a Simulation Experiment (MIASE) guidelines outline the information needed to share simulation experiments. SED-ML is a computer-readable format for the information outlined by MIASE, created as a community project and supported by many investigators and software tools. Level 1 Version 5 of SED-ML expands the ability of modelers to define simulations in SED-ML using the Kinetic Simulation Algorithm Onotoloy (KiSAO). While it was possible in Version 4 to define a simulation entirely using KiSAO, Version 5 now allows users to define tasks, model changes, ranges, and outputs using the ontology as well. SED-ML is supported by a growing ecosystem of investigators, model languages, and software tools, including various languages for constraint-based, kinetic, qualitative, rule-based, and spatial models, and many simulation tools, visual editors, model repositories, and validators. Additional information about SED-ML is available at https://sed-ml.org/.","PeriodicalId":53625,"journal":{"name":"Journal of Integrative Bioinformatics","volume":null,"pages":null},"PeriodicalIF":1.9,"publicationDate":"2024-04-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140703567","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Auto-phylo v2 and auto-phylo-pipeliner: building advanced, flexible, and reusable pipelines for phylogenetic inferences, estimation of variability levels and identification of positively selected amino acid sites. Auto-phylo v2 和 aut-phylo-pipeliner:构建先进、灵活和可重复使用的管道,用于系统发育推断、变异性水平估计和正选氨基酸位点识别。
IF 1.5 Q3 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2024-03-27 eCollection Date: 2024-06-01 DOI: 10.1515/jib-2023-0046
Hugo López-Fernández, Miguel Pinto, Cristina P Vieira, Pedro Duque, Miguel Reboiro-Jato, Jorge Vieira

The vast amount of genome sequence data that is available, and that is predicted to drastically increase in the near future, can only be efficiently dealt with by building automated pipelines. Indeed, the Earth Biogenome Project will produce high-quality reference genome sequences for all 1.8 million named living eukaryote species, providing unprecedented insight into the evolution of genes and gene families, and thus on biological issues. Here, new modules for gene annotation, further BLAST search algorithms, further multiple sequence alignment methods, the adding of reference sequences, further tree rooting methods, the estimation of rates of synonymous and nonsynonymous substitutions, and the identification of positively selected amino acid sites, have been added to auto-phylo (version 2), a recently developed software to address biological problems using phylogenetic inferences. Additionally, we present auto-phylo-pipeliner, a graphical user interface application that further facilitates the creation and running of auto-phylo pipelines. Inferences on S-RNase specificity, are critical for both cross-based breeding and for the establishment of pollination requirements. Therefore, as a test case, we develop an auto-phylo pipeline to identify amino acid sites under positive selection, that are, in principle, those determining S-RNase specificity, starting from both non-annotated Prunus genomes and sequences available in public databases.

现有的基因组序列数据量巨大,而且预计在不久的将来还会急剧增加,只有建立自动化管道才能有效处理这些数据。事实上,地球生物基因组计划(Earth Biogenome Project)将为所有 180 万个已命名的真核生物物种提供高质量的参考基因组序列,为基因和基因家族的进化,进而为生物问题提供前所未有的洞察力。auto-phylo(第 2 版)是最近开发的一款利用系统发育推论解决生物学问题的软件,在这里,我们为它添加了新的模块,包括基因注释、进一步的 BLAST 搜索算法、进一步的多序列比对方法、参考序列的添加、进一步的树根方法、同义和非同义替换率的估计以及正选氨基酸位点的鉴定。此外,我们还介绍了auto-phylo-pipeliner,这是一个图形用户界面应用程序,可进一步方便auto-phylo管道的创建和运行。S-RNase特异性推断对于杂交育种和确定授粉要求都至关重要。因此,作为一个测试案例,我们从未注明的梅花基因组和公共数据库中的序列入手,开发了一个自动植物基因组分析管道,以确定正选择的氨基酸位点,这些位点原则上就是决定 S-RNase 特异性的氨基酸位点。
{"title":"Auto-phylo v2 and auto-phylo-pipeliner: building advanced, flexible, and reusable pipelines for phylogenetic inferences, estimation of variability levels and identification of positively selected amino acid sites.","authors":"Hugo López-Fernández, Miguel Pinto, Cristina P Vieira, Pedro Duque, Miguel Reboiro-Jato, Jorge Vieira","doi":"10.1515/jib-2023-0046","DOIUrl":"10.1515/jib-2023-0046","url":null,"abstract":"<p><p>The vast amount of genome sequence data that is available, and that is predicted to drastically increase in the near future, can only be efficiently dealt with by building automated pipelines. Indeed, the Earth Biogenome Project will produce high-quality reference genome sequences for all 1.8 million named living eukaryote species, providing unprecedented insight into the evolution of genes and gene families, and thus on biological issues. Here, new modules for gene annotation, further BLAST search algorithms, further multiple sequence alignment methods, the adding of reference sequences, further tree rooting methods, the estimation of rates of synonymous and nonsynonymous substitutions, and the identification of positively selected amino acid sites, have been added to auto-phylo (version 2), a recently developed software to address biological problems using phylogenetic inferences. Additionally, we present auto-phylo-pipeliner, a graphical user interface application that further facilitates the creation and running of auto-phylo pipelines. Inferences on <i>S-RNase</i> specificity, are critical for both cross-based breeding and for the establishment of pollination requirements. Therefore, as a test case, we develop an auto-phylo pipeline to identify amino acid sites under positive selection, that are, in principle, those determining <i>S-RNase</i> specificity, starting from both non-annotated <i>Prunus</i> genomes and sequences available in public databases.</p>","PeriodicalId":53625,"journal":{"name":"Journal of Integrative Bioinformatics","volume":null,"pages":null},"PeriodicalIF":1.5,"publicationDate":"2024-03-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140289644","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
MetaLo: metabolic analysis of Logical models extracted from molecular interaction maps. MetaLo:从分子相互作用图中提取的逻辑模型代谢分析。
IF 1.5 Q3 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2024-02-06 eCollection Date: 2024-03-01 DOI: 10.1515/jib-2023-0048
Sahar Aghakhani, Anna Niarakis, Sylvain Soliman

Molecular interaction maps (MIMs) are static graphical representations depicting complex biochemical networks that can be formalized using one of the Systems Biology Graphical Notation languages. Regardless of their extensive coverage of various biological processes, they are limited in terms of dynamic insights. However, MIMs can serve as templates for developing dynamic computational models. We present MetaLo, an open-source Python package that enables the coupling of Boolean models inferred from process description MIMs with generic core metabolic networks. MetaLo provides a framework to study the impact of signaling cascades, gene regulation processes, and metabolic flux distribution of central energy production pathways. MetaLo computes the Boolean model's asynchronous asymptotic behavior, through the identification of trap-spaces, and extracts metabolic constraints to contextualize the generic metabolic network. MetaLo is able to handle large-scale Boolean models and genome-scale metabolic models without requiring kinetic information or manual tuning. The framework behind MetaLo enables in depth analysis of the regulatory model, and may allow tackling a lack of omics data in poorly addressed biological fields to contextualize generic metabolic networks along with improper automatic reconstructions of cell- and/or disease-specific metabolic networks. MetaLo is available at https://pypi.org/project/metalo/ under the terms of the GNU General Public License v3.

分子相互作用图(MIM)是描述复杂生化网络的静态图形表示法,可使用系统生物学图形符号语言之一进行形式化。尽管它们广泛覆盖了各种生物过程,但在动态洞察方面却很有限。然而,MIM 可以作为开发动态计算模型的模板。我们介绍的 MetaLo 是一个开源 Python 软件包,它能将从过程描述 MIMs 中推断出的布尔模型与通用核心代谢网络相耦合。MetaLo 提供了一个框架,用于研究信号级联、基因调控过程和中心能量生产途径的代谢通量分布的影响。MetaLo 通过识别陷阱空间来计算布尔模型的异步渐进行为,并提取代谢约束条件,从而将通用代谢网络背景化。MetaLo 能够处理大规模布尔模型和基因组规模的代谢模型,而无需动力学信息或人工调整。MetaLo 背后的框架可对调控模型进行深入分析,并可解决生物领域中缺乏 omics 数据的问题,从而将通用代谢网络与细胞和/或疾病特定代谢网络的不当自动重建结合起来。MetaLo 根据 GNU 通用公共许可证 v3 条款发布于 https://pypi.org/project/metalo/。
{"title":"MetaLo: metabolic analysis of Logical models extracted from molecular interaction maps.","authors":"Sahar Aghakhani, Anna Niarakis, Sylvain Soliman","doi":"10.1515/jib-2023-0048","DOIUrl":"10.1515/jib-2023-0048","url":null,"abstract":"<p><p>Molecular interaction maps (MIMs) are static graphical representations depicting complex biochemical networks that can be formalized using one of the Systems Biology Graphical Notation languages. Regardless of their extensive coverage of various biological processes, they are limited in terms of dynamic insights. However, MIMs can serve as templates for developing dynamic computational models. We present MetaLo, an open-source Python package that enables the coupling of Boolean models inferred from process description MIMs with generic core metabolic networks. MetaLo provides a framework to study the impact of signaling cascades, gene regulation processes, and metabolic flux distribution of central energy production pathways. MetaLo computes the Boolean model's asynchronous asymptotic behavior, through the identification of trap-spaces, and extracts metabolic constraints to contextualize the generic metabolic network. MetaLo is able to handle large-scale Boolean models and genome-scale metabolic models without requiring kinetic information or manual tuning. The framework behind MetaLo enables in depth analysis of the regulatory model, and may allow tackling a lack of omics data in poorly addressed biological fields to contextualize generic metabolic networks along with improper automatic reconstructions of cell- and/or disease-specific metabolic networks. MetaLo is available at https://pypi.org/project/metalo/ under the terms of the GNU General Public License v3.</p>","PeriodicalId":53625,"journal":{"name":"Journal of Integrative Bioinformatics","volume":null,"pages":null},"PeriodicalIF":1.5,"publicationDate":"2024-02-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139693479","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
PharmoCo: a graph-based visualization of pharmacogenomic plausibility check reports for clinical decision support systems. PharmoCo:用于临床决策支持系统的基于图形的药物基因组可信性检查报告可视化。
IF 1.9 Q1 Medicine Pub Date : 2023-12-28 eCollection Date: 2023-12-01 DOI: 10.1515/jib-2023-0026
Lena Raupach, Cassandra Königs

The first approaches in recent years for the integration of pharmacogenomic plausibility checks into clinical practice show both a promising improvement in the drug therapy safety, but also difficulties in application. One of the difficulties is the meaningful interpretation of the text-based results by the medical practitioner. We propose here as an appropriate and sensible solution to avoid misunderstandings and to include evidence-based, pharmacogenomic recommendations in prescriptions, which should be the graph-based visualization of the reports. This allows for a plausible interpretation and relate complex, even contradictory guidelines. The improved overview over the pharmacogenomics (PGx) guidelines using the graphical visualization makes the medical practitioner's choice of dose and medication more patient-specific, improves the treatment outcome and thus, increases the drug therapy safety.

近年来,将药物基因组学合理性检查纳入临床实践的首批方法表明,药物治疗安全性有望得到改善,但在应用方面也存在困难。困难之一是如何让医生对基于文本的结果进行有意义的解释。为了避免误解,并在处方中加入以证据为基础的药物基因组学建议,我们在此提出了一个适当而合理的解决方案,那就是将报告图表化。这样就能做出合理的解释,并将复杂甚至相互矛盾的指南联系起来。利用图形可视化技术改进对药物基因组学(PGx)指南的概述,可使医生在选择剂量和药物时更加针对患者的具体情况,改善治疗效果,从而提高药物治疗的安全性。
{"title":"PharmoCo: a graph-based visualization of pharmacogenomic plausibility check reports for clinical decision support systems.","authors":"Lena Raupach, Cassandra Königs","doi":"10.1515/jib-2023-0026","DOIUrl":"10.1515/jib-2023-0026","url":null,"abstract":"<p><p>The first approaches in recent years for the integration of pharmacogenomic plausibility checks into clinical practice show both a promising improvement in the drug therapy safety, but also difficulties in application. One of the difficulties is the meaningful interpretation of the text-based results by the medical practitioner. We propose here as an appropriate and sensible solution to avoid misunderstandings and to include evidence-based, pharmacogenomic recommendations in prescriptions, which should be the graph-based visualization of the reports. This allows for a plausible interpretation and relate complex, even contradictory guidelines. The improved overview over the pharmacogenomics (PGx) guidelines using the graphical visualization makes the medical practitioner's choice of dose and medication more patient-specific, improves the treatment outcome and thus, increases the drug therapy safety.</p>","PeriodicalId":53625,"journal":{"name":"Journal of Integrative Bioinformatics","volume":null,"pages":null},"PeriodicalIF":1.9,"publicationDate":"2023-12-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10777363/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139049773","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Gene-network based analysis of human placental trophoblast subtypes identifies critical genes as potential targets of therapeutic drugs. 基于基因网络的人类胎盘滋养细胞亚型分析确定了作为治疗药物潜在靶点的关键基因。
IF 1.9 Q1 Medicine Pub Date : 2023-12-22 eCollection Date: 2023-12-01 DOI: 10.1515/jib-2023-0011
Andreas Ian Lackner, Jürgen Pollheimer, Paulina Latos, Martin Knöfler, Sandra Haider

During early pregnancy, extravillous trophoblasts (EVTs) play a crucial role in modifying the maternal uterine environment. Failures in EVT lineage formation and differentiation can lead to pregnancy complications such as preeclampsia, fetal growth restriction, and pregnancy loss. Despite recent advances, our knowledge on molecular and external factors that control and affect EVT development remains incomplete. Using trophoblast organoid in vitro models, we recently discovered that coordinated manipulation of the transforming growth factor beta (TGFβ) signaling is essential for EVT development. To further investigate gene networks involved in EVT function and development, we performed weighted gene co-expression network analysis (WGCNA) on our RNA-Seq data. We identified 10 modules with a median module membership of over 0.8 and sizes ranging from 1005 (M1) to 72 (M27) network genes associated with TGFβ activation status or in vitro culturing, the latter being indicative for yet undiscovered factors that shape the EVT phenotypes. Lastly, we hypothesized that certain therapeutic drugs might unintentionally interfere with placentation by affecting EVT-specific gene expression. We used the STRING database to map correlations and the Drug-Gene Interaction database to identify drug targets. Our comprehensive dataset of drug-gene interactions provides insights into potential risks associated with certain drugs in early gestation.

妊娠早期,胚胎滋养层外细胞(EVT)在改变母体子宫环境方面发挥着至关重要的作用。EVT品系形成和分化的失败可导致妊娠并发症,如子痫前期、胎儿生长受限和妊娠失败。尽管最近取得了一些进展,但我们对控制和影响EVT发育的分子和外部因素的了解仍不全面。最近,我们利用滋养细胞类器官体外模型发现,协调操纵转化生长因子β(TGFβ)信号传导对EVT的发育至关重要。为了进一步研究参与EVT功能和发育的基因网络,我们对RNA-Seq数据进行了加权基因共表达网络分析(WGCNA)。我们发现了10个模块,模块成员中位数超过0.8,规模从1005个(M1)到72个(M27)不等,这些网络基因与TGFβ激活状态或体外培养相关,后者表明EVT表型的形成因素尚未被发现。最后,我们假设某些治疗药物可能会通过影响EVT特异性基因的表达而无意中干扰胎盘的形成。我们利用 STRING 数据库绘制相关性图谱,并利用药物基因相互作用数据库确定药物靶点。我们的药物基因相互作用综合数据集让我们深入了解了某些药物在妊娠早期的潜在风险。
{"title":"Gene-network based analysis of human placental trophoblast subtypes identifies critical genes as potential targets of therapeutic drugs.","authors":"Andreas Ian Lackner, Jürgen Pollheimer, Paulina Latos, Martin Knöfler, Sandra Haider","doi":"10.1515/jib-2023-0011","DOIUrl":"10.1515/jib-2023-0011","url":null,"abstract":"<p><p>During early pregnancy, extravillous trophoblasts (EVTs) play a crucial role in modifying the maternal uterine environment. Failures in EVT lineage formation and differentiation can lead to pregnancy complications such as preeclampsia, fetal growth restriction, and pregnancy loss. Despite recent advances, our knowledge on molecular and external factors that control and affect EVT development remains incomplete. Using trophoblast organoid <i>in vitro</i> models, we recently discovered that coordinated manipulation of the transforming growth factor beta (TGFβ) signaling is essential for EVT development. To further investigate gene networks involved in EVT function and development, we performed weighted gene co-expression network analysis (WGCNA) on our RNA-Seq data. We identified 10 modules with a median module membership of over 0.8 and sizes ranging from 1005 (M1) to 72 (M27) network genes associated with TGFβ activation status or <i>in vitro</i> culturing, the latter being indicative for yet undiscovered factors that shape the EVT phenotypes. Lastly, we hypothesized that certain therapeutic drugs might unintentionally interfere with placentation by affecting EVT-specific gene expression. We used the STRING database to map correlations and the Drug-Gene Interaction database to identify drug targets. Our comprehensive dataset of drug-gene interactions provides insights into potential risks associated with certain drugs in early gestation.</p>","PeriodicalId":53625,"journal":{"name":"Journal of Integrative Bioinformatics","volume":null,"pages":null},"PeriodicalIF":1.9,"publicationDate":"2023-12-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10777358/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138832989","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An overview of machine learning and deep learning techniques for predicting epileptic seizures. 预测癫痫发作的机器学习和深度学习技术概述。
IF 1.9 Q1 Medicine Pub Date : 2023-12-15 eCollection Date: 2023-12-01 DOI: 10.1515/jib-2023-0002
Marco Zurdo-Tabernero, Ángel Canal-Alonso, Fernando de la Prieta, Sara Rodríguez, Javier Prieto, Juan Manuel Corchado

Epilepsy is a neurological disorder (the third most common, following stroke and migraines). A key aspect of its diagnosis is the presence of seizures that occur without a known cause and the potential for new seizures to occur. Machine learning has shown potential as a cost-effective alternative for rapid diagnosis. In this study, we review the current state of machine learning in the detection and prediction of epileptic seizures. The objective of this study is to portray the existing machine learning methods for seizure prediction. Internet bibliographical searches were conducted to identify relevant literature on the topic. Through cross-referencing from key articles, additional references were obtained to provide a comprehensive overview of the techniques. As the aim of this paper aims is not a pure bibliographical review of the subject, the publications here cited have been selected among many others based on their number of citations. To implement accurate diagnostic and treatment tools, it is necessary to achieve a balance between prediction time, sensitivity, and specificity. This balance can be achieved using deep learning algorithms. The best performance and results are often achieved by combining multiple techniques and features, but this approach can also increase computational requirements.

癫痫是一种神经系统疾病(继中风和偏头痛之后的第三大常见疾病)。其诊断的一个关键方面是在没有已知病因的情况下出现癫痫发作,以及可能出现新的癫痫发作。机器学习已显示出作为一种经济有效的快速诊断替代方法的潜力。在本研究中,我们回顾了机器学习在检测和预测癫痫发作方面的现状。本研究的目的是描绘用于癫痫发作预测的现有机器学习方法。我们在互联网上进行了文献检索,以确定该主题的相关文献。通过对主要文章的交叉引用,获得了更多参考文献,以提供有关技术的全面概述。由于本文的目的不是对该主题进行纯粹的文献综述,因此本文引用的出版物是根据引用次数从众多出版物中挑选出来的。要实施准确的诊断和治疗工具,必须在预测时间、灵敏度和特异性之间取得平衡。这种平衡可以通过深度学习算法来实现。结合多种技术和特征往往能达到最佳性能和效果,但这种方法也会增加计算要求。
{"title":"An overview of machine learning and deep learning techniques for predicting epileptic seizures.","authors":"Marco Zurdo-Tabernero, Ángel Canal-Alonso, Fernando de la Prieta, Sara Rodríguez, Javier Prieto, Juan Manuel Corchado","doi":"10.1515/jib-2023-0002","DOIUrl":"10.1515/jib-2023-0002","url":null,"abstract":"<p><p>Epilepsy is a neurological disorder (the third most common, following stroke and migraines). A key aspect of its diagnosis is the presence of seizures that occur without a known cause and the potential for new seizures to occur. Machine learning has shown potential as a cost-effective alternative for rapid diagnosis. In this study, we review the current state of machine learning in the detection and prediction of epileptic seizures. The objective of this study is to portray the existing machine learning methods for seizure prediction. Internet bibliographical searches were conducted to identify relevant literature on the topic. Through cross-referencing from key articles, additional references were obtained to provide a comprehensive overview of the techniques. As the aim of this paper aims is not a pure bibliographical review of the subject, the publications here cited have been selected among many others based on their number of citations. To implement accurate diagnostic and treatment tools, it is necessary to achieve a balance between prediction time, sensitivity, and specificity. This balance can be achieved using deep learning algorithms. The best performance and results are often achieved by combining multiple techniques and features, but this approach can also increase computational requirements.</p>","PeriodicalId":53625,"journal":{"name":"Journal of Integrative Bioinformatics","volume":null,"pages":null},"PeriodicalIF":1.9,"publicationDate":"2023-12-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10777364/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138805520","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
SAMNA: accurate alignment of multiple biological networks based on simulated annealing. SAMNA:基于模拟退火的多生物网络精确配准。
IF 1.9 Q1 Medicine Pub Date : 2023-12-14 eCollection Date: 2023-12-01 DOI: 10.1515/jib-2023-0006
Jing Chen, Zixiang Wang, Jia Huang

Proteins are important parts of the biological structures and encode a lot of biological information. Protein-protein interaction network alignment is a model for analyzing proteins that helps discover conserved functions between organisms and predict unknown functions. In particular, multi-network alignment aims at finding the mapping relationship among multiple network nodes, so as to transfer the knowledge across species. However, with the increasing complexity of PPI networks, how to perform network alignment more accurately and efficiently is a new challenge. This paper proposes a new global network alignment algorithm called Simulated Annealing Multiple Network Alignment (SAMNA), using both network topology and sequence homology information. To generate the alignment, SAMNA first generates cross-network candidate clusters by a clustering algorithm on a k-partite similarity graph constructed with sequence similarity information, and then selects candidate cluster nodes as alignment results and optimizes them using an improved simulated annealing algorithm. Finally, the SAMNA algorithm was experimented on synthetic and real-world network datasets, and the results showed that SAMNA outperformed the state-of-the-art algorithm in biological performance.

蛋白质是生物结构的重要组成部分,并编码大量生物信息。蛋白质-蛋白质相互作用网络配准是一种分析蛋白质的模型,有助于发现生物体之间的保守功能和预测未知功能。其中,多网络配准旨在找到多个网络节点之间的映射关系,从而实现跨物种知识传递。然而,随着 PPI 网络的日益复杂,如何更准确、更高效地进行网络配准是一个新的挑战。本文提出了一种新的全局网络配准算法--模拟退火多重网络配准(SAMNA),同时使用网络拓扑和序列同源性信息。为了生成对齐结果,SAMNA 首先在利用序列相似性信息构建的 k-partite 相似性图上通过聚类算法生成跨网络候选簇,然后选择候选簇节点作为对齐结果,并利用改进的模拟退火算法对其进行优化。最后,SAMNA 算法在合成和实际网络数据集上进行了实验,结果表明 SAMNA 在生物学性能上优于最先进的算法。
{"title":"SAMNA: accurate alignment of multiple biological networks based on simulated annealing.","authors":"Jing Chen, Zixiang Wang, Jia Huang","doi":"10.1515/jib-2023-0006","DOIUrl":"10.1515/jib-2023-0006","url":null,"abstract":"<p><p>Proteins are important parts of the biological structures and encode a lot of biological information. Protein-protein interaction network alignment is a model for analyzing proteins that helps discover conserved functions between organisms and predict unknown functions. In particular, multi-network alignment aims at finding the mapping relationship among multiple network nodes, so as to transfer the knowledge across species. However, with the increasing complexity of PPI networks, how to perform network alignment more accurately and efficiently is a new challenge. This paper proposes a new global network alignment algorithm called Simulated Annealing Multiple Network Alignment (SAMNA), using both network topology and sequence homology information. To generate the alignment, SAMNA first generates cross-network candidate clusters by a clustering algorithm on a <i>k</i>-partite similarity graph constructed with sequence similarity information, and then selects candidate cluster nodes as alignment results and optimizes them using an improved simulated annealing algorithm. Finally, the SAMNA algorithm was experimented on synthetic and real-world network datasets, and the results showed that SAMNA outperformed the state-of-the-art algorithm in biological performance.</p>","PeriodicalId":53625,"journal":{"name":"Journal of Integrative Bioinformatics","volume":null,"pages":null},"PeriodicalIF":1.9,"publicationDate":"2023-12-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10777366/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138805553","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Application of Artificial Intelligence or machine learning in risk sharing agreements for pharmacotherapy risk management. 在药物治疗风险管理的风险分担协议中应用人工智能或机器学习。
IF 1.9 Q1 Medicine Pub Date : 2023-12-12 eCollection Date: 2023-09-01 DOI: 10.1515/jib-2023-0014
Grigory A Oborotov, Konstantin A Koshechkin, Yuriy L Orlov

Applications of Artificial Intelligence in medical informatics solutions risk sharing have social value. At a time of ever-increasing cost for the provision of medicines to citizens, there is a need to restrain the growth of health care costs. The search for computer technologies to stop or slow down the growth of costs acquires a new very important and significant meaning. We discussed the two information technologies in pharmacotherapy and the possibility of combining and sharing them, namely the combination of risk-sharing agreements and Machine Learning, which was made possible by the development of Artificial Intelligence (AI). Neural networks could be used to predict the outcome to reduce the risk factors for treatment. AI-based data processing automation technologies could be also used for risk-sharing agreements automation.

人工智能在医疗信息解决方案中的应用具有风险分担的社会价值。在为公民提供药品的成本不断增加的今天,有必要抑制医疗成本的增长。寻求计算机技术来阻止或减缓成本的增长就有了新的重要意义。我们讨论了药物疗法中的两种信息技术以及将其结合和共享的可能性,即风险分担协议与机器学习的结合,人工智能(AI)的发展使机器学习成为可能。神经网络可用于预测结果,以减少治疗的风险因素。基于人工智能的数据处理自动化技术也可用于风险分担协议自动化。
{"title":"Application of Artificial Intelligence or machine learning in risk sharing agreements for pharmacotherapy risk management.","authors":"Grigory A Oborotov, Konstantin A Koshechkin, Yuriy L Orlov","doi":"10.1515/jib-2023-0014","DOIUrl":"10.1515/jib-2023-0014","url":null,"abstract":"<p><p>Applications of Artificial Intelligence in medical informatics solutions risk sharing have social value. At a time of ever-increasing cost for the provision of medicines to citizens, there is a need to restrain the growth of health care costs. The search for computer technologies to stop or slow down the growth of costs acquires a new very important and significant meaning. We discussed the two information technologies in pharmacotherapy and the possibility of combining and sharing them, namely the combination of risk-sharing agreements and Machine Learning, which was made possible by the development of Artificial Intelligence (AI). Neural networks could be used to predict the outcome to reduce the risk factors for treatment. AI-based data processing automation technologies could be also used for risk-sharing agreements automation.</p>","PeriodicalId":53625,"journal":{"name":"Journal of Integrative Bioinformatics","volume":null,"pages":null},"PeriodicalIF":1.9,"publicationDate":"2023-12-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10757074/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138805521","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
TidyGEO: preparing analysis-ready datasets from Gene Expression Omnibus. TidyGEO:从基因表达Omnibus准备分析就绪的数据集。
IF 1.5 Q3 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2023-12-05 eCollection Date: 2024-03-01 DOI: 10.1515/jib-2023-0021
Avery Mecham, Ashlie Stephenson, Badi I Quinteros, Grace S Brown, Stephen R Piccolo

TidyGEO is a Web-based tool for downloading, tidying, and reformatting data series from Gene Expression Omnibus (GEO). As a freely accessible repository with data from over 6 million biological samples across more than 4000 organisms, GEO provides diverse opportunities for secondary research. Although scientists may find assay data relevant to a given research question, most analyses require sample-level annotations. In GEO, such annotations are stored alongside assay data in delimited, text-based files. However, the structure and semantics of the annotations vary widely from one series to another, and many annotations are not useful for analysis purposes. Thus, every GEO series must be tidied before it is analyzed. Manual approaches may be used, but these are error prone and take time away from other research tasks. Custom computer scripts can be written, but many scientists lack the computational expertise to create such scripts. To address these challenges, we created TidyGEO, which supports essential data-cleaning tasks for sample-level annotations, such as selecting informative columns, renaming columns, splitting or merging columns, standardizing data values, and filtering samples. Additionally, users can integrate annotations with assay data, restructure assay data, and generate code that enables others to reproduce these steps.

TidyGEO是一个基于web的工具,用于下载、整理和重新格式化基因表达Omnibus (GEO)的数据系列。GEO是一个可免费访问的数据库,拥有4000多种生物的600多万个生物样本的数据,为二次研究提供了多种机会。虽然科学家可能会发现与给定研究问题相关的分析数据,但大多数分析需要样本级别的注释。在GEO中,这些注释与分析数据一起存储在分隔的基于文本的文件中。然而,注解的结构和语义在不同的系列之间差别很大,许多注解对于分析目的是没有用处的。因此,每一个GEO序列在分析之前都必须进行整理。可以使用手工方法,但这些方法容易出错,并且会占用其他研究任务的时间。可以编写自定义计算机脚本,但许多科学家缺乏创建此类脚本的计算专业知识。为了应对这些挑战,我们创建了TidyGEO,它支持样本级注释的基本数据清理任务,例如选择信息列、重命名列、拆分或合并列、标准化数据值和过滤样本。此外,用户可以将注释与分析数据集成,重构分析数据,并生成代码,使其他人能够重现这些步骤。
{"title":"TidyGEO: preparing analysis-ready datasets from Gene Expression Omnibus.","authors":"Avery Mecham, Ashlie Stephenson, Badi I Quinteros, Grace S Brown, Stephen R Piccolo","doi":"10.1515/jib-2023-0021","DOIUrl":"10.1515/jib-2023-0021","url":null,"abstract":"<p><p>TidyGEO is a Web-based tool for downloading, tidying, and reformatting data series from Gene Expression Omnibus (GEO). As a freely accessible repository with data from over 6 million biological samples across more than 4000 organisms, GEO provides diverse opportunities for secondary research. Although scientists may find assay data relevant to a given research question, most analyses require sample-level annotations. In GEO, such annotations are stored alongside assay data in delimited, text-based files. However, the structure and semantics of the annotations vary widely from one series to another, and many annotations are not useful for analysis purposes. Thus, every GEO series must be tidied before it is analyzed. Manual approaches may be used, but these are error prone and take time away from other research tasks. Custom computer scripts can be written, but many scientists lack the computational expertise to create such scripts. To address these challenges, we created TidyGEO, which supports essential data-cleaning tasks for sample-level annotations, such as selecting informative columns, renaming columns, splitting or merging columns, standardizing data values, and filtering samples. Additionally, users can integrate annotations with assay data, restructure assay data, and generate code that enables others to reproduce these steps.</p>","PeriodicalId":53625,"journal":{"name":"Journal of Integrative Bioinformatics","volume":null,"pages":null},"PeriodicalIF":1.5,"publicationDate":"2023-12-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138479290","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Journal of Integrative Bioinformatics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1