首页 > 最新文献

Journal of Integrative Bioinformatics最新文献

英文 中文
Systems biology graphical notation: process description language level 1 version 2.1. 系统生物学图形符号:过程描述语言1级2.1版。
IF 1.8 Q3 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2026-01-23 DOI: 10.1515/jib-2025-0018
Hasan Balci, Adrien Rougny, Rupert Overall, Irina Balaur, Michael L Blinov, Hanna Borlinghaus, Emek Demir, Andreas Dräger, Robin Haw, Alexander Mazein, Huaiyu Mi, Stuart Moodie, Falk Schreiber, Anatoly Sorokin, Vasundra Touré, Alice Villéger, Tobias Czauderna, Ugur Dogrusoz, Augustin Luna

The Systems Biology Graphical Notation (SBGN) is an international community effort to standardize the visualization of pathways and networks, making them accessible to scientists from diverse fields while facilitating efficient and ac-curate knowledge exchange among research communities, industry, and other stakeholders in systems biology. SBGN consists of three complementary languages - Entity Relationship (ER), Activity Flow (AF), and Process Description (PD) - each addressing biological and biochemical systems at varying levels of detail. PD, closely aligned with the metabolic and regulatory pathways found in biological literature, books, and academic courses, provides well-defined semantics for precisely representing biological information. The PD language uses a graph structure to represent mechanistic and temporal relationships of biological interactions and transformations. It incorporates distinct node types, including entity pools (e.g., metabolites, proteins, genes, and complexes) and processes (e.g., reactions and associations), with edges representing the connections between nodes (e.g., consumption, production, stimulation, and inhibition). This document details Level 1 Version 2.1 of the PD specification, including several improvements over the previous version (Level 1 Version 2.0): 1) refinements to document structure and terminology, 2) clarifications and updates to specification content, and 3) updated figures and rules.

系统生物学图形符号(SBGN)是国际社会的一项努力,旨在标准化路径和网络的可视化,使来自不同领域的科学家能够访问它们,同时促进研究社区,行业和系统生物学其他利益相关者之间有效和准确的知识交流。SBGN由三种互补的语言组成——实体关系(ER)、活动流(AF)和过程描述(PD)——每一种语言都在不同的细节水平上处理生物和生化系统。PD与生物学文献、书籍和学术课程中发现的代谢和调节途径密切相关,为精确表示生物信息提供了定义良好的语义。PD语言使用图形结构来表示生物相互作用和转换的机制和时间关系。它包含不同的节点类型,包括实体池(例如,代谢物,蛋白质,基因和复合物)和过程(例如,反应和关联),边缘表示节点之间的连接(例如,消耗,生产,刺激和抑制)。本文档详细介绍了PD规范的第1级版本2.1,包括对前一版本(第1级版本2.0)的一些改进:1)对文档结构和术语的改进,2)对规范内容的澄清和更新,以及3)更新的图形和规则。
{"title":"Systems biology graphical notation: process description language level 1 version 2.1.","authors":"Hasan Balci, Adrien Rougny, Rupert Overall, Irina Balaur, Michael L Blinov, Hanna Borlinghaus, Emek Demir, Andreas Dräger, Robin Haw, Alexander Mazein, Huaiyu Mi, Stuart Moodie, Falk Schreiber, Anatoly Sorokin, Vasundra Touré, Alice Villéger, Tobias Czauderna, Ugur Dogrusoz, Augustin Luna","doi":"10.1515/jib-2025-0018","DOIUrl":"https://doi.org/10.1515/jib-2025-0018","url":null,"abstract":"<p><p>The Systems Biology Graphical Notation (SBGN) is an international community effort to standardize the visualization of pathways and networks, making them accessible to scientists from diverse fields while facilitating efficient and ac-curate knowledge exchange among research communities, industry, and other stakeholders in systems biology. SBGN consists of three complementary languages - Entity Relationship (ER), Activity Flow (AF), and Process Description (PD) - each addressing biological and biochemical systems at varying levels of detail. PD, closely aligned with the metabolic and regulatory pathways found in biological literature, books, and academic courses, provides well-defined semantics for precisely representing biological information. The PD language uses a graph structure to represent mechanistic and temporal relationships of biological interactions and transformations. It incorporates distinct node types, including entity pools (e.g., metabolites, proteins, genes, and complexes) and processes (e.g., reactions and associations), with edges representing the connections between nodes (e.g., consumption, production, stimulation, and inhibition). This document details Level 1 Version 2.1 of the PD specification, including several improvements over the previous version (Level 1 Version 2.0): 1) refinements to document structure and terminology, 2) clarifications and updates to specification content, and 3) updated figures and rules.</p>","PeriodicalId":53625,"journal":{"name":"Journal of Integrative Bioinformatics","volume":" ","pages":""},"PeriodicalIF":1.8,"publicationDate":"2026-01-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146031556","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Functional and evolutionary characteristics of human genes encoding cell surface receptors involved in the regulation of appetite. 参与食欲调节的细胞表面受体编码基因的功能和进化特征。
IF 1.8 Q3 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2026-01-08 DOI: 10.1515/jib-2025-0023
Elena Ignatieva, Sergey Lashin, Roman Ivanov, Valentin Suslov, Angelina Mikhailova, Nikolay Kolchanov

Appetite is an instinct that has been formed through evolution. Appetite promotes normal growth and development in humans. However, under conditions of food abundance, appetite can become excessive, posing significant health risks. In this study we have identified 80 human genes whose orthologs regulated food intake in model animal species. More than 80 % of these genes encode G-protein-coupled receptors and 29 % were found to be involved in developmental processes. Using phylostratigraphic age index (PAI), which specifies the evolutionary age of a gene, we found that this set of 80 genes contains an increased proportion of genes with the same phylostratigraphic age (PAI = 6, the stage of Vertebrata divergence) indicating the coordinated evolution of this group of genes. Using divergence index (DI), which indicates the type of selection to which the gene is subjected, we observed significant enrichment for genes with DI ≤ 0.25, i.e., those that are subject to strong stabilizing selection. The subgroup of genes having DI ≤ 0.25 included 45 genes and was enriched with genes that are associated with developmental processes. This finding supports the hypothesis that developmental disturbances generally impose strong constraints on viability due to purifying selection.

食欲是一种通过进化形成的本能。食欲促进人类的正常生长和发育。然而,在食物充足的情况下,胃口可能会过大,对健康构成重大威胁。在这项研究中,我们已经确定了80个人类基因,其同源物调节模式动物物种的食物摄入。超过80% %的这些基因编码g蛋白偶联受体,29 %被发现参与发育过程。利用系统地层年龄指数(PAI)来确定基因的进化年龄,我们发现这80个基因中具有相同系统地层年龄(PAI = 6,脊椎动物分化阶段)的基因比例增加,表明这组基因的进化是协调的。利用表明基因所受选择类型的差异指数(DI),我们观察到DI≤0.25的基因显著富集,即那些受到强稳定选择的基因。DI≤0.25的基因亚群包括45个基因,并且富含与发育过程相关的基因。这一发现支持了一种假设,即由于净化选择,发育障碍通常对生存能力施加了强烈的限制。
{"title":"Functional and evolutionary characteristics of human genes encoding cell surface receptors involved in the regulation of appetite.","authors":"Elena Ignatieva, Sergey Lashin, Roman Ivanov, Valentin Suslov, Angelina Mikhailova, Nikolay Kolchanov","doi":"10.1515/jib-2025-0023","DOIUrl":"10.1515/jib-2025-0023","url":null,"abstract":"<p><p>Appetite is an instinct that has been formed through evolution. Appetite promotes normal growth and development in humans. However, under conditions of food abundance, appetite can become excessive, posing significant health risks. In this study we have identified 80 human genes whose orthologs regulated food intake in model animal species. More than 80 % of these genes encode G-protein-coupled receptors and 29 % were found to be involved in developmental processes. Using phylostratigraphic age index (PAI), which specifies the evolutionary age of a gene, we found that this set of 80 genes contains an increased proportion of genes with the same phylostratigraphic age (PAI = 6, the stage of Vertebrata divergence) indicating the coordinated evolution of this group of genes. Using divergence index (DI), which indicates the type of selection to which the gene is subjected, we observed significant enrichment for genes with DI ≤ 0.25, i.e., those that are subject to strong stabilizing selection. The subgroup of genes having DI ≤ 0.25 included 45 genes and was enriched with genes that are associated with developmental processes. This finding supports the hypothesis that developmental disturbances generally impose strong constraints on viability due to purifying selection.</p>","PeriodicalId":53625,"journal":{"name":"Journal of Integrative Bioinformatics","volume":" ","pages":""},"PeriodicalIF":1.8,"publicationDate":"2026-01-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145913868","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Streamlining feature elaboration and statistics analysis in metabolomics: the GetFeatistics R-package. 简化代谢组学中的特征阐述和统计分析:getfeatatistics r包。
IF 1.8 Q3 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2025-12-24 DOI: 10.1515/jib-2025-0047
Gianfranco Frigerio

Metabolomics studies require complex data processing pipelines to ensure data quality and extract meaningful biological insights. GetFeatistics is an R-package developed to streamline the elaboration and statistical analysis of metabolomics data. For targeted analyses, the package enables calibration curve-based quantification with different data weighting options. For untargeted studies, it includes dedicated functions to import feature tables from tools like patRoon and MS-DIAL, assign annotation confidence levels, and filter features based on pooled quality control (QC) criteria, including options for group-specific pooled QCs. The package also provides functions for univariate and multivariate statistical analyses, notably streamlined regression modelling with fixed effects, mixed-effects models for longitudinal data, and Tobit regression for censoring values exceeding the limits of detection. Output tables are concise and informative, facilitating interpretation and reporting, while output visualisations are fully customisable via the ggplot grammar. Additional functionalities include automated retrieval of chemical properties from PubChem, ontology classification via ClassyFire, and pathway enrichment analysis using the FELLA package. GetFeatistics is publicly available on GitHub, with comprehensive documentation and a step-by-step vignette. By integrating key steps of the metabolomics workflow, the package aims to facilitate both exploratory studies and large-scale epidemiological applications in metabolomics research.

代谢组学研究需要复杂的数据处理管道,以确保数据质量并提取有意义的生物学见解。GetFeatistics是一个r软件包,用于简化代谢组学数据的阐述和统计分析。对于有针对性的分析,该软件包可以使用不同的数据加权选项进行基于校准曲线的量化。对于非目标研究,它包括从诸如patron和MS-DIAL之类的工具中导入特征表的专用功能,分配注释置信水平,并基于池质量控制(QC)标准过滤特征,包括特定于组的池QC选项。该软件包还提供了单变量和多变量统计分析的功能,特别是具有固定效应的流线型回归模型,纵向数据的混合效应模型,以及用于审查超出检测范围的值的Tobit回归。输出表简洁且信息丰富,便于解释和报告,而输出可视化可通过ggplot语法完全自定义。其他功能包括从PubChem自动检索化学性质,通过ClassyFire进行本体分类,以及使用FELLA软件包进行途径富集分析。GetFeatistics在GitHub上是公开的,有全面的文档和一步一步的小插图。通过整合代谢组学工作流程的关键步骤,该软件包旨在促进代谢组学研究中的探索性研究和大规模流行病学应用。
{"title":"Streamlining feature elaboration and statistics analysis in metabolomics: the GetFeatistics R-package.","authors":"Gianfranco Frigerio","doi":"10.1515/jib-2025-0047","DOIUrl":"https://doi.org/10.1515/jib-2025-0047","url":null,"abstract":"<p><p>Metabolomics studies require complex data processing pipelines to ensure data quality and extract meaningful biological insights. GetFeatistics is an R-package developed to streamline the elaboration and statistical analysis of metabolomics data. For targeted analyses, the package enables calibration curve-based quantification with different data weighting options. For untargeted studies, it includes dedicated functions to import feature tables from tools like patRoon and MS-DIAL, assign annotation confidence levels, and filter features based on pooled quality control (QC) criteria, including options for group-specific pooled QCs. The package also provides functions for univariate and multivariate statistical analyses, notably streamlined regression modelling with fixed effects, mixed-effects models for longitudinal data, and Tobit regression for censoring values exceeding the limits of detection. Output tables are concise and informative, facilitating interpretation and reporting, while output visualisations are fully customisable via the ggplot grammar. Additional functionalities include automated retrieval of chemical properties from PubChem, ontology classification via ClassyFire, and pathway enrichment analysis using the FELLA package. GetFeatistics is publicly available on GitHub, with comprehensive documentation and a step-by-step vignette. By integrating key steps of the metabolomics workflow, the package aims to facilitate both exploratory studies and large-scale epidemiological applications in metabolomics research.</p>","PeriodicalId":53625,"journal":{"name":"Journal of Integrative Bioinformatics","volume":" ","pages":""},"PeriodicalIF":1.8,"publicationDate":"2025-12-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145812266","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Colon cancer survival prediction from gland shapes within histology slides using deep learning. 利用深度学习从组织学切片中的腺体形状预测结肠癌存活。
IF 1.8 Q3 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2025-07-14 eCollection Date: 2025-06-01 DOI: 10.1515/jib-2024-0052
Rawan Gedeon, Atulya Nagar

This study investigates the application of deep learning techniques for segmenting glands in histopathological images of colorectal cancer. We trained two convolutional neural network models, U-Net and DCAN, on a combination of the GlaS and CRAG datasets to enhance generalization across diverse histological appearances, selecting DCAN for its superior accuracy in delineating gland boundaries. The goal was to achieve robust gland segmentation applicable to whole slide images (WSIs) from The Cancer Genome Atlas (TCGA). Using the segmented glands, we extracted patient-level morphological features and used them to predict survival outcomes. A Cox proportional hazards model was trained on these features and achieved a high concordance index, indicating strong predictive performance. Patients were then stratified into high- and low-risk groups, with significant differences in survival distributions (log-rank p-value: 0.01317). In addition, we benchmarked our models against state-of-the-art gland segmentation methods on GlaS and CRAG, highlighting the trade-off between domain-specific accuracy and cross-dataset robustness.

本研究探讨了深度学习技术在结直肠癌组织病理图像中分割腺体的应用。我们在GlaS和CRAG数据集的组合上训练了两个卷积神经网络模型U-Net和DCAN,以增强对不同组织学外观的泛化,选择DCAN是因为它在描绘腺体边界方面具有卓越的准确性。目标是实现适用于来自癌症基因组图谱(TCGA)的整个幻灯片图像(WSIs)的稳健腺体分割。通过分割腺体,我们提取了患者水平的形态学特征,并用它们来预测生存结果。根据这些特征训练了Cox比例风险模型,并获得了较高的一致性指数,表明具有较强的预测性能。然后将患者分为高危组和低危组,生存分布有显著差异(log-rank p值:0.01317)。此外,我们将我们的模型与GlaS和CRAG上最先进的腺体分割方法进行了基准测试,强调了特定领域准确性和跨数据集鲁棒性之间的权衡。
{"title":"Colon cancer survival prediction from gland shapes within histology slides using deep learning.","authors":"Rawan Gedeon, Atulya Nagar","doi":"10.1515/jib-2024-0052","DOIUrl":"10.1515/jib-2024-0052","url":null,"abstract":"<p><p>This study investigates the application of deep learning techniques for segmenting glands in histopathological images of colorectal cancer. We trained two convolutional neural network models, U-Net and DCAN, on a combination of the GlaS and CRAG datasets to enhance generalization across diverse histological appearances, selecting DCAN for its superior accuracy in delineating gland boundaries. The goal was to achieve robust gland segmentation applicable to whole slide images (WSIs) from The Cancer Genome Atlas (TCGA). Using the segmented glands, we extracted patient-level morphological features and used them to predict survival outcomes. A Cox proportional hazards model was trained on these features and achieved a high concordance index, indicating strong predictive performance. Patients were then stratified into high- and low-risk groups, with significant differences in survival distributions (log-rank <i>p</i>-value: 0.01317). In addition, we benchmarked our models against state-of-the-art gland segmentation methods on GlaS and CRAG, highlighting the trade-off between domain-specific accuracy and cross-dataset robustness.</p>","PeriodicalId":53625,"journal":{"name":"Journal of Integrative Bioinformatics","volume":" ","pages":""},"PeriodicalIF":1.8,"publicationDate":"2025-07-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12569585/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144621178","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Editorial - 20 years Journal of Integrative Bioinformatics. 编辑- 20 年整合生物信息学杂志。
IF 1.8 Q3 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2025-07-09 eCollection Date: 2025-03-01 DOI: 10.1515/jib-2025-0034
Ralf Hofestädt
{"title":"Editorial - 20 years Journal of Integrative Bioinformatics.","authors":"Ralf Hofestädt","doi":"10.1515/jib-2025-0034","DOIUrl":"10.1515/jib-2025-0034","url":null,"abstract":"","PeriodicalId":53625,"journal":{"name":"Journal of Integrative Bioinformatics","volume":" ","pages":""},"PeriodicalIF":1.8,"publicationDate":"2025-07-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12327197/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144585616","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Sustainable software development in science - insights from 20 years of Vanted. 科学中的可持续软件开发-来自20 年Vanted的见解。
IF 1.8 Q3 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2025-07-01 eCollection Date: 2025-03-01 DOI: 10.1515/jib-2025-0007
Falk Schreiber, Tobias Czauderna, Dimitar Garkov, Niklas Gröne, Karsten Klein, Matthias Lange, Uwe Scholz, Björn Sommer

Sustainable software development requires the software to remain accessible and maintainable over long time. This is particularly challenging in a scientific context. For example, fewer than one third of tools and platforms for biological network representation, analysis, and visualisation have been available and supported over a period of 15 years. One of those tools is Vanted, which has been developed and actively supported over the past 20 years. In this work, we discuss sustainable software development in science and investigate which software tools for biological network representation, analysis, and visualisation are maintained over a period of at least 15 years. With Vanted as a case study, we highlight five key insights that we consider crucial for sustainable, long-term software development and software maintenance in science.

可持续的软件开发要求软件在很长一段时间内保持可访问性和可维护性。这在科学背景下尤其具有挑战性。例如,在过去的15年里,只有不到三分之一的生物网络表示、分析和可视化工具和平台是可用的,并且得到了支持。其中一个工具是Vanted,它在过去20年里得到了开发和积极支持。在这项工作中,我们讨论了科学中的可持续软件开发,并调查了哪些用于生物网络表示、分析和可视化的软件工具在至少15年的时间内得到了维护。以Vanted为例,我们强调了我们认为对科学中可持续的、长期的软件开发和软件维护至关重要的五个关键见解。
{"title":"Sustainable software development in science - insights from 20 years of Vanted.","authors":"Falk Schreiber, Tobias Czauderna, Dimitar Garkov, Niklas Gröne, Karsten Klein, Matthias Lange, Uwe Scholz, Björn Sommer","doi":"10.1515/jib-2025-0007","DOIUrl":"10.1515/jib-2025-0007","url":null,"abstract":"<p><p>Sustainable software development requires the software to remain accessible and maintainable over long time. This is particularly challenging in a scientific context. For example, fewer than one third of tools and platforms for biological network representation, analysis, and visualisation have been available and supported over a period of 15 years. One of those tools is Vanted, which has been developed and actively supported over the past 20 years. In this work, we discuss sustainable software development in science and investigate which software tools for biological network representation, analysis, and visualisation are maintained over a period of at least 15 years. With Vanted as a case study, we highlight five key insights that we consider crucial for sustainable, long-term software development and software maintenance in science.</p>","PeriodicalId":53625,"journal":{"name":"Journal of Integrative Bioinformatics","volume":" ","pages":""},"PeriodicalIF":1.8,"publicationDate":"2025-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12327203/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144531041","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Metagenome and metabolome study on inhaled corticosteroids in asthma patients with side effects. 哮喘患者吸入皮质类固醇副作用的宏基因组和代谢组研究。
IF 1.5 Q3 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2025-06-24 DOI: 10.1515/jib-2024-0062
Igor Goryanin, Anatoly Sorokin, Meder Seitov, Berik Emilov, Muktarbek Iskakov, Irina Goryanin, Batyr Osmonov

This study investigates the gut microbiome and metabolome of asthma patients treated with inhaled corticosteroids (ICS), some of whom experience adverse side effects. We analyzed stool samples from 24 participants, divided into three cohorts: asthma patients with side effects, those without, and healthy controls. Using next-generation sequencing and LC-MS/MS metabolomics, we identified significant differences in bacterial species and metabolites. Multi-Omics Factor Analysis (MOFA) and Global Sensitivity Analysis-Partial Rank Correlation Coefficient (GSA-PRCC) provided insights into key contributors to side effects, such as tryptophan depletion and altered linolenate and glucose-1-phosphate levels. The study proposes dietary or probiotic interventions to mitigate side effects. Despite the limited sample size, these findings provide a basis for personalized asthma management approaches. Further studies are required to confirm initial fundings.

本研究调查了吸入皮质类固醇(ICS)治疗的哮喘患者的肠道微生物组和代谢组,其中一些患者出现了不良副作用。我们分析了24名参与者的粪便样本,将其分为三组:有副作用的哮喘患者、没有副作用的哮喘患者和健康对照组。通过下一代测序和LC-MS/MS代谢组学,我们发现了细菌种类和代谢物的显著差异。多组学因素分析(MOFA)和全局敏感性分析-部分秩相关系数(GSA-PRCC)提供了对副作用的关键影响因素的见解,例如色氨酸消耗和亚麻酸和葡萄糖-1-磷酸水平的改变。该研究建议通过饮食或益生菌干预来减轻副作用。尽管样本量有限,但这些发现为个性化哮喘管理方法提供了基础。需要进一步的研究来确认初始资金。
{"title":"Metagenome and metabolome study on inhaled corticosteroids in asthma patients with side effects.","authors":"Igor Goryanin, Anatoly Sorokin, Meder Seitov, Berik Emilov, Muktarbek Iskakov, Irina Goryanin, Batyr Osmonov","doi":"10.1515/jib-2024-0062","DOIUrl":"https://doi.org/10.1515/jib-2024-0062","url":null,"abstract":"<p><p>This study investigates the gut microbiome and metabolome of asthma patients treated with inhaled corticosteroids (ICS), some of whom experience adverse side effects. We analyzed stool samples from 24 participants, divided into three cohorts: asthma patients with side effects, those without, and healthy controls. Using next-generation sequencing and LC-MS/MS metabolomics, we identified significant differences in bacterial species and metabolites. Multi-Omics Factor Analysis (MOFA) and Global Sensitivity Analysis-Partial Rank Correlation Coefficient (GSA-PRCC) provided insights into key contributors to side effects, such as tryptophan depletion and altered linolenate and glucose-1-phosphate levels. The study proposes dietary or probiotic interventions to mitigate side effects. Despite the limited sample size, these findings provide a basis for personalized asthma management approaches. Further studies are required to confirm initial fundings.</p>","PeriodicalId":53625,"journal":{"name":"Journal of Integrative Bioinformatics","volume":" ","pages":""},"PeriodicalIF":1.5,"publicationDate":"2025-06-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144477807","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Leveraging transformers for semi-supervised pathogenicity prediction with soft labels. 利用变压器进行软标签的半监督致病性预测。
IF 1.8 Q3 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2025-06-23 eCollection Date: 2025-06-01 DOI: 10.1515/jib-2024-0047
Pablo Enrique Guillem, Marco Zurdo-Tabernero, Noelia Egido Iglesias, Ángel Canal-Alonso, Liliana Durón Figueroa, Guillermo Hernández, Angélica González-Arrieta, Fernando de la Prieta

The rapid advancement of Next-Generation Sequencing (NGS) technologies has revolutionized the field of genomics, producing large volumes of data that necessitate sophisticated analytical techniques. This paper introduces a Deep Learning model designed to predict the pathogenicity of genetic variants, a vital component in advancing personalized medicine. The model is trained on a dataset derived from the analysis of NGS outputs, containing a combination of well-defined and ambiguous genetic variants. By employing a semi-supervised learning approach, the model efficiently utilizes both confidently labeled and less certain data. At the core of the methodology is the Feature Tokenizer Transformer architecture, which processes both numerical and categorical genomic information. The preprocessing pipeline includes key steps such as data imputation, scaling, and encoding to ensure high data quality. The results highlight the model's impressive accuracy, particularly in detecting confidently labeled variants, while also addressing the impact of its predictions on less certain (soft-labeled) data.

新一代测序(NGS)技术的快速发展彻底改变了基因组学领域,产生了大量数据,需要复杂的分析技术。本文介绍了一个深度学习模型,旨在预测遗传变异的致病性,这是推进个性化医疗的重要组成部分。该模型在NGS输出分析得出的数据集上进行训练,该数据集包含定义良好和模糊的遗传变异的组合。通过采用半监督学习方法,该模型有效地利用了自信标记和不太确定的数据。该方法的核心是Feature Tokenizer Transformer架构,它处理数值和分类基因组信息。预处理流程包括数据输入、缩放和编码等关键步骤,以确保高数据质量。结果突出了该模型令人印象深刻的准确性,特别是在检测自信标记的变体时,同时也解决了其预测对不太确定(软标记)数据的影响。
{"title":"Leveraging transformers for semi-supervised pathogenicity prediction with soft labels.","authors":"Pablo Enrique Guillem, Marco Zurdo-Tabernero, Noelia Egido Iglesias, Ángel Canal-Alonso, Liliana Durón Figueroa, Guillermo Hernández, Angélica González-Arrieta, Fernando de la Prieta","doi":"10.1515/jib-2024-0047","DOIUrl":"10.1515/jib-2024-0047","url":null,"abstract":"<p><p>The rapid advancement of Next-Generation Sequencing (NGS) technologies has revolutionized the field of genomics, producing large volumes of data that necessitate sophisticated analytical techniques. This paper introduces a Deep Learning model designed to predict the pathogenicity of genetic variants, a vital component in advancing personalized medicine. The model is trained on a dataset derived from the analysis of NGS outputs, containing a combination of well-defined and ambiguous genetic variants. By employing a semi-supervised learning approach, the model efficiently utilizes both confidently labeled and less certain data. At the core of the methodology is the Feature Tokenizer Transformer architecture, which processes both numerical and categorical genomic information. The preprocessing pipeline includes key steps such as data imputation, scaling, and encoding to ensure high data quality. The results highlight the model's impressive accuracy, particularly in detecting confidently labeled variants, while also addressing the impact of its predictions on less certain (soft-labeled) data.</p>","PeriodicalId":53625,"journal":{"name":"Journal of Integrative Bioinformatics","volume":" ","pages":""},"PeriodicalIF":1.8,"publicationDate":"2025-06-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12569578/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144334418","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Petri net modeling and simulation of post-transcriptional regulatory networks of human embryonic stem cell (hESC) differentiation to cardiomyocytes. 人胚胎干细胞(hESC)向心肌细胞分化的转录后调控网络的Petri网建模和模拟。
IF 1.8 Q3 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2025-06-23 eCollection Date: 2025-03-01 DOI: 10.1515/jib-2024-0037
Aruana F F Hansel-Fröse, Christoph Brinkrolf, Marcel Friedrichs, Bruno Dallagiovanna, Lucia Spangenberg

Stem cells are capable of self-renewal and differentiation into various cell types, showing significant potential for cellular therapies and regenerative medicine, particularly in cardiovascular diseases. The differentiation to cardiomyocytes replicates the embryonic heart development, potentially supporting cardiac regeneration. Cardiomyogenesis is controlled by complex post-transcriptional regulation that affects the construction of gene regulatory networks (GRNs), such as: alternative polyadenylation (APA), length changes in untranslated regulatory regions (3'UTRs), and microRNA (miRNA) regulation. To deepen our understanding of the cardiomyogenesis process, we have modeled a GRN for each day of cardiomyocyte differentiation. Then, each GRN was automatically transformed by four transformation rules to a Petri net and simulated using the software VANESA. The Petri nets highlighted the relationship between genes and alternative isoforms, emphasizing the inhibition of miRNA on APA isoforms with varying 3'UTR lengths. Moreover, in silico simulation of miRNA knockout enabled the visualization of the consequential effects on isoform expression. Our Petri net models provide a resourceful tool and holistic perspective to investigate the functional orchestra of transcript regulation that differentiate hESCs to cardiomyocytes. Additionally, the models can be adapted to investigate post-transcriptional GRN in other biological contexts.

干细胞能够自我更新并分化成各种细胞类型,在细胞治疗和再生医学方面显示出巨大的潜力,特别是在心血管疾病方面。向心肌细胞的分化复制了胚胎心脏的发育,可能支持心脏再生。心肌发生受复杂的转录后调控控制,影响基因调控网络(grn)的构建,如:选择性聚腺苷化(APA)、非翻译调控区域(3'UTRs)的长度变化和microRNA (miRNA)调控。为了加深我们对心肌形成过程的理解,我们为心肌细胞分化的每一天建立了一个GRN模型。然后,通过4条变换规则将每个GRN自动变换为Petri网,并利用VANESA软件进行仿真。Petri网强调了基因与备选亚型之间的关系,强调了miRNA对不同3'UTR长度的APA亚型的抑制作用。此外,miRNA敲除的计算机模拟能够可视化对异构体表达的相应影响。我们的Petri网模型提供了一个丰富的工具和整体的视角来研究将hESCs分化为心肌细胞的转录调控的功能组合。此外,该模型可用于研究其他生物学背景下的转录后GRN。
{"title":"Petri net modeling and simulation of post-transcriptional regulatory networks of human embryonic stem cell (hESC) differentiation to cardiomyocytes.","authors":"Aruana F F Hansel-Fröse, Christoph Brinkrolf, Marcel Friedrichs, Bruno Dallagiovanna, Lucia Spangenberg","doi":"10.1515/jib-2024-0037","DOIUrl":"10.1515/jib-2024-0037","url":null,"abstract":"<p><p>Stem cells are capable of self-renewal and differentiation into various cell types, showing significant potential for cellular therapies and regenerative medicine, particularly in cardiovascular diseases. The differentiation to cardiomyocytes replicates the embryonic heart development, potentially supporting cardiac regeneration. Cardiomyogenesis is controlled by complex post-transcriptional regulation that affects the construction of gene regulatory networks (GRNs), such as: alternative polyadenylation (APA), length changes in untranslated regulatory regions (3'UTRs), and microRNA (miRNA) regulation. To deepen our understanding of the cardiomyogenesis process, we have modeled a GRN for each day of cardiomyocyte differentiation. Then, each GRN was automatically transformed by four transformation rules to a Petri net and simulated using the software VANESA. The Petri nets highlighted the relationship between genes and alternative isoforms, emphasizing the inhibition of miRNA on APA isoforms with varying 3'UTR lengths. Moreover, <i>in silico</i> simulation of miRNA knockout enabled the visualization of the consequential effects on isoform expression. Our Petri net models provide a resourceful tool and holistic perspective to investigate the functional orchestra of transcript regulation that differentiate hESCs to cardiomyocytes. Additionally, the models can be adapted to investigate post-transcriptional GRN in other biological contexts.</p>","PeriodicalId":53625,"journal":{"name":"Journal of Integrative Bioinformatics","volume":" ","pages":""},"PeriodicalIF":1.8,"publicationDate":"2025-06-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12327202/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144334419","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Integrating AI and genomics: predictive CNN models for schizophrenia phenotypes. 整合人工智能和基因组学:预测精神分裂症表型的CNN模型。
IF 1.8 Q3 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2025-06-18 eCollection Date: 2025-06-01 DOI: 10.1515/jib-2024-0057
Guilherme Henriques, Maryam Abbasi, Daniel Martins, Joel P Arrais

This study explores the use of deep learning to analyze genetic data and predict phenotypic traits associated with schizophrenia, a complex psychiatric disorder with a strong hereditary component yet incomplete genetic characterization. We applied Convolutional Neural Networks models to a large-scale case-control exome sequencing dataset from the Swedish population to identify genetic patterns linked to schizophrenia. To enhance model performance and reduce overfitting, we employed advanced optimization techniques, including dropout layers, learning rate scheduling, batch normalization, and early stopping. Following systematic refinements in data preprocessing, model architecture, and hyperparameter tuning, the final model achieved an accuracy of 80 %. These results demonstrate the potential of deep learning approaches to uncover intricate genotype-phenotype relationships and support their future integration into precision medicine and genetic diagnostics for psychiatric disorders such as schizophrenia.

本研究探索了使用深度学习来分析遗传数据并预测与精神分裂症相关的表型特征,精神分裂症是一种复杂的精神疾病,具有强烈的遗传成分,但遗传特征不完整。我们将卷积神经网络模型应用于来自瑞典人群的大规模病例对照外显子组测序数据集,以确定与精神分裂症相关的遗传模式。为了提高模型性能并减少过拟合,我们采用了先进的优化技术,包括辍学层、学习率调度、批处理归一化和早期停止。经过数据预处理、模型架构和超参数调优的系统改进,最终模型的精度达到了80% %。这些结果证明了深度学习方法在揭示复杂的基因型-表型关系方面的潜力,并支持它们未来整合到精神分裂症等精神疾病的精准医学和基因诊断中。
{"title":"Integrating AI and genomics: predictive CNN models for schizophrenia phenotypes.","authors":"Guilherme Henriques, Maryam Abbasi, Daniel Martins, Joel P Arrais","doi":"10.1515/jib-2024-0057","DOIUrl":"10.1515/jib-2024-0057","url":null,"abstract":"<p><p>This study explores the use of deep learning to analyze genetic data and predict phenotypic traits associated with schizophrenia, a complex psychiatric disorder with a strong hereditary component yet incomplete genetic characterization. We applied Convolutional Neural Networks models to a large-scale case-control exome sequencing dataset from the Swedish population to identify genetic patterns linked to schizophrenia. To enhance model performance and reduce overfitting, we employed advanced optimization techniques, including dropout layers, learning rate scheduling, batch normalization, and early stopping. Following systematic refinements in data preprocessing, model architecture, and hyperparameter tuning, the final model achieved an accuracy of 80 %. These results demonstrate the potential of deep learning approaches to uncover intricate genotype-phenotype relationships and support their future integration into precision medicine and genetic diagnostics for psychiatric disorders such as schizophrenia.</p>","PeriodicalId":53625,"journal":{"name":"Journal of Integrative Bioinformatics","volume":" ","pages":""},"PeriodicalIF":1.8,"publicationDate":"2025-06-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12569582/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144310814","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Journal of Integrative Bioinformatics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1