BMC Bioinformatics最新文献_第3页

QuAPPro: an R shiny app for quantification and alignment of polysome profiles. QuAPPro：一个R闪亮的应用程序，用于量化和对准多聚体配置文件。

IF 3.3 3区生物学 Q2 BIOCHEMICAL RESEARCH METHODS

BMC Bioinformatics

Pub Date : 2026-01-22 DOI: 10.1186/s12859-026-06379-2

Chiara Schiller, Matthias Lemmer, Sonja Reitter, Janina A Lehmann, Kai Fenzl, Johanna Schott

Background: Polysome profiling is a widespread technique to study mRNA translation. After separation of cellular particles by ultracentrifugation on a sucrose-density gradient, a UV absorbance profile is recorded during elution, which mostly reflects RNA content and shows distinct peaks for ribosomal subunits, monosomes and polysomes with increasing number of ribosomes. This profile can be used to assess global translational activity, or to reveal changes in ribosome biogenesis and translation elongation. In addition, it is also possible to measure the association of fluorescently tagged proteins with ribosomal subunits or polysomes. Alignment and quantification of polysome profiles usually relies on spreadsheet programs, custom R/Python scripts or commercial software.

Results: With QuAPPro, we present the first interactive web app that allows quantification and alignment of polysome profiles, independently of the device or software that was used to generate the profiles. QuAPPro was written in R, with a graphical user interface implemented in R shiny. It supports interactive visualization and analysis of polysome profiles, including profile smoothing, baseline selection, alignment along a defined point on the x-axis, quantification of profile subsections and deconvolution for resolving individual peaks. Fluorescence profiles can be aligned and quantified in parallel. Finally, quantification results can be summarized and visualized as bar plots. Every interactive plot can be exported directly in a publication-ready format.

Conclusions: This user-friendly tool does not only speed up the analysis of polysome profiles but also facilitates reproducibility and documentation of the process, without the need for programming abilities or commercial software.

背景：多体分析是一种广泛应用于mRNA翻译研究的技术。在蔗糖-密度梯度上进行超离心分离细胞颗粒后，在洗脱过程中记录紫外吸收谱，紫外吸收谱主要反映RNA含量，随着核糖体数量的增加，核糖体亚基、单体和多体的峰值明显。该剖面可用于评估整体翻译活性，或揭示核糖体生物发生和翻译伸长的变化。此外，还可以测量荧光标记蛋白与核糖体亚基或多体的关联。聚合体配置文件的对齐和量化通常依赖于电子表格程序、自定义R/Python脚本或商业软件。结果：通过QuAPPro，我们提出了第一个交互式web应用程序，可以独立于用于生成配置文件的设备或软件，对多聚体配置文件进行量化和校准。QuAPPro是用R语言编写的，图形用户界面用R语言实现。它支持多体剖面的交互式可视化和分析，包括剖面平滑、基线选择、沿着x轴上定义的点对齐、剖面子剖面的量化以及用于解析单个峰的反褶积。荧光谱可以平行排列和定量。最后，量化结果可以总结和可视化为条形图。每个互动情节都可以直接导出为出版物格式。结论：这个用户友好的工具不仅加快了对多聚体谱的分析，而且促进了过程的再现性和文档化，而不需要编程能力或商业软件。

{"title":"QuAPPro: an R shiny app for quantification and alignment of polysome profiles.","authors":"Chiara Schiller, Matthias Lemmer, Sonja Reitter, Janina A Lehmann, Kai Fenzl, Johanna Schott","doi":"10.1186/s12859-026-06379-2","DOIUrl":"10.1186/s12859-026-06379-2","url":null,"abstract":"Background: Polysome profiling is a widespread technique to study mRNA translation. After separation of cellular particles by ultracentrifugation on a sucrose-density gradient, a UV absorbance profile is recorded during elution, which mostly reflects RNA content and shows distinct peaks for ribosomal subunits, monosomes and polysomes with increasing number of ribosomes. This profile can be used to assess global translational activity, or to reveal changes in ribosome biogenesis and translation elongation. In addition, it is also possible to measure the association of fluorescently tagged proteins with ribosomal subunits or polysomes. Alignment and quantification of polysome profiles usually relies on spreadsheet programs, custom R/Python scripts or commercial software.Results: With QuAPPro, we present the first interactive web app that allows quantification and alignment of polysome profiles, independently of the device or software that was used to generate the profiles. QuAPPro was written in R, with a graphical user interface implemented in R shiny. It supports interactive visualization and analysis of polysome profiles, including profile smoothing, baseline selection, alignment along a defined point on the x-axis, quantification of profile subsections and deconvolution for resolving individual peaks. Fluorescence profiles can be aligned and quantified in parallel. Finally, quantification results can be summarized and visualized as bar plots. Every interactive plot can be exported directly in a publication-ready format.Conclusions: This user-friendly tool does not only speed up the analysis of polysome profiles but also facilitates reproducibility and documentation of the process, without the need for programming abilities or commercial software.","PeriodicalId":8958,"journal":{"name":"BMC Bioinformatics","volume":" ","pages":"22"},"PeriodicalIF":3.3,"publicationDate":"2026-01-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12849056/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146028149","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

CRESCENT, a comprehensive RNA-Seq expression, splicing, and coding/non-coding element network tool. CRESCENT是一个全面的RNA-Seq表达、剪接和编码/非编码元件网络工具。

IF 3.3 3区生物学 Q2 BIOCHEMICAL RESEARCH METHODS

BMC Bioinformatics

Pub Date : 2026-01-21 DOI: 10.1186/s12859-026-06368-5

Gilles Sireta, Gwendal Cueff, Vincent Darbot, Marie Lefebvre, Simon Amiard, Aline V Probst, Christophe Tatout

Background: Traditional short-read RNA-Seq analysis pipelines predominantly focus on protein-coding genes, often overlooking other genomic sequences such as transposable elements (TEs) and non-coding RNA dynamics and do not usually investigate splicing events or transcript usage. To fully capture the complexity of the transcriptome, and in particular transcriptomic regulation, it is crucial to adopt a comprehensive approach that integrates these diverse aspects, providing a more complete and nuanced understanding of expression dynamics in the studied organism.

Results: To address these limitations, we present CRESCENT (Comprehensive RNA-seq Expression, Splicing, and Coding/non-coding Element Network Tool), a Snakemake workflow capable of performing a fully automated and comprehensive RNA-Seq analysis. CRESCENT integrates multiple tools at each step of the workflow and enables analysis of differential expression, differential alternative splicing, differential transcript usage, and gene ontology-based functional enrichment for all three. The workflow takes advantage of multiple Snakemake wrappers to minimize required installations for the user, integrating the latest versions of popular bioinformatic tools. It can be run for a complete analysis or for only a specific part in accordance with the configuration file provided by the user. The CRESCENT workflow was validated, demonstrating the pipeline's reliability, as differentially expressed protein-coding genes, TEs and differential alternative splicing events were consistent with previously published datasets. Finally, benchmarking CRESCENT performance indicated that it can be run on a personal computer or a remote server, including a high-performance computing cluster, allowing a user to process small single-end sequencing on species possessing a small genome like Arabidopsis thaliana to very large paired-end sequencing on polyploid species like wheat.

Conclusion and availability: CRESCENT is a scalable solution for comprehensive transcriptomic profiling. It is freely available at https://github.com/gilless429/crescent.

背景：传统的短读RNA- seq分析管道主要关注蛋白质编码基因，经常忽略其他基因组序列，如转座元件（te）和非编码RNA动力学，通常不研究剪接事件或转录物的使用。为了充分了解转录组的复杂性，特别是转录组调控，采用综合的方法整合这些不同的方面是至关重要的，从而提供对所研究生物体中表达动态的更完整和细致的理解。结果：为了解决这些限制，我们提出了CRESCENT（综合RNA-seq表达，剪接和编码/非编码元件网络工具），这是一个能够执行全自动和全面RNA-seq分析的Snakemake工作流程。CRESCENT在工作流程的每一步都集成了多个工具，能够分析三者的差异表达、差异可选剪接、差异转录物使用和基于基因本体的功能富集。该工作流程利用了多个Snakemake包装器，最大限度地减少了用户所需的安装，集成了最新版本的流行生物信息学工具。它可以根据用户提供的配置文件运行完整的分析或仅针对特定的部分。CRESCENT工作流程得到了验证，证明了管道的可靠性，因为差异表达的蛋白质编码基因、te和差异可选剪接事件与先前发布的数据集一致。最后，对CRESCENT性能的基准测试表明，它可以在个人计算机或远程服务器上运行，包括高性能计算集群，允许用户处理具有小基因组的物种（如拟南芥）的小单端测序，以及对多倍体物种（如小麦）的大配对端测序。结论和可用性：CRESCENT是一种可扩展的全面转录组分析解决方案。它可以在https://github.com/gilless429/crescent上免费获得。

{"title":"CRESCENT, a comprehensive RNA-Seq expression, splicing, and coding/non-coding element network tool.","authors":"Gilles Sireta, Gwendal Cueff, Vincent Darbot, Marie Lefebvre, Simon Amiard, Aline V Probst, Christophe Tatout","doi":"10.1186/s12859-026-06368-5","DOIUrl":"https://doi.org/10.1186/s12859-026-06368-5","url":null,"abstract":"Background: Traditional short-read RNA-Seq analysis pipelines predominantly focus on protein-coding genes, often overlooking other genomic sequences such as transposable elements (TEs) and non-coding RNA dynamics and do not usually investigate splicing events or transcript usage. To fully capture the complexity of the transcriptome, and in particular transcriptomic regulation, it is crucial to adopt a comprehensive approach that integrates these diverse aspects, providing a more complete and nuanced understanding of expression dynamics in the studied organism.Results: To address these limitations, we present CRESCENT (Comprehensive RNA-seq Expression, Splicing, and Coding/non-coding Element Network Tool), a Snakemake workflow capable of performing a fully automated and comprehensive RNA-Seq analysis. CRESCENT integrates multiple tools at each step of the workflow and enables analysis of differential expression, differential alternative splicing, differential transcript usage, and gene ontology-based functional enrichment for all three. The workflow takes advantage of multiple Snakemake wrappers to minimize required installations for the user, integrating the latest versions of popular bioinformatic tools. It can be run for a complete analysis or for only a specific part in accordance with the configuration file provided by the user. The CRESCENT workflow was validated, demonstrating the pipeline's reliability, as differentially expressed protein-coding genes, TEs and differential alternative splicing events were consistent with previously published datasets. Finally, benchmarking CRESCENT performance indicated that it can be run on a personal computer or a remote server, including a high-performance computing cluster, allowing a user to process small single-end sequencing on species possessing a small genome like Arabidopsis thaliana to very large paired-end sequencing on polyploid species like wheat.Conclusion and availability: CRESCENT is a scalable solution for comprehensive transcriptomic profiling. It is freely available at https://github.com/gilless429/crescent.","PeriodicalId":8958,"journal":{"name":"BMC Bioinformatics","volume":" ","pages":""},"PeriodicalIF":3.3,"publicationDate":"2026-01-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146017181","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Beyond the clipboard: data collection with GridScore NEXT. 超越剪贴板：数据收集与GridScore NEXT。

IF 3.3 3区生物学 Q2 BIOCHEMICAL RESEARCH METHODS

BMC Bioinformatics

Pub Date : 2026-01-21 DOI: 10.1186/s12859-025-06352-5

Sebastian Raubach, Miriam Schreiber, Ruth Hamilton, Gaynor McKenzie, Susan McCallum, Benjamin Kilian, Alan Humphries, Loi Huu Nguyen, Tin Huynh Quang, Akanksha Singh, Shivali Sharma, Sarah Trinder, Manuel Feser, Paul D Shaw

Background: Accurate acquisition of phenotypic data is critical for cataloguing and utilising genetic variation in cultivated crops, landraces, and their wild relatives. The collection of phenotypic data using handwritten notes often introduces errors which can and should be avoided. Electronic data collection is crucial for ensuring error prevention and data standardisation and thus ensuring high-quality, reliable data.

Implementation: This paper describes the development of GridScore NEXT, a new plant phenotyping application that significantly advances the state of the art for collecting field trial data in plant genetics, pre-breeding and crop improvement research. Building on its predecessor, GridScore, the development of GridScore NEXT was driven by real life, in the field interactions with expert user groups across a number of crops. This iterative design methodology allowed the development and testing of new features. Collaborators from the 'Biodiversity for Opportunities, Livelihoods and Development' (BOLD) project, focusing on crops including rice, grasspea, and alfalfa, along with barley, potato, vegetable and blueberry teams, provided invaluable insights through training sessions and interviews and in the field use of the application.

Results: Key improvements to GridScore NEXT include enhanced data collection tools, supporting individual plant phenotyping within plots and enabling new data types such as GPS coordinates and image traits. GridScore NEXT provides customisable user defined validation rules to help prevent errors and incorporates barcode scanning for accurate, efficient data capture. The application offers an increased toolbox of data visualizations over its predecessor including heatmaps and statistical box plots, which aid in identifying potential data issues and understanding trial performance in the field. GridScore NEXT is cross-platform and can operate without an internet connection, making it ideal for field use in remote areas. Its adoption has led to standardisation of methods, significant error reduction, and the timely sharing of data, enabling quicker decision-making in pre-breeding and characterisation experiments. GridScore NEXT is available under an open-source (Apache 2.0) licence and freely available to all with no restrictions. It offers self-hosting options for enhanced data security and privacy. GridScore NEXT shows broad applicability across a diverse range of not only plant phenotyping experiments, but any experiment that requires the collection of accurate data.

背景：准确获取表型数据对于编目和利用栽培作物、地方品种及其野生近缘种的遗传变异至关重要。使用手写笔记收集表型数据经常会引入错误，这些错误可以并且应该避免。电子数据收集对于确保防止错误和数据标准化，从而确保高质量、可靠的数据至关重要。实现：本文描述了GridScore NEXT的开发，这是一款新的植物表型分析应用程序，它显著提高了收集植物遗传学、预育种和作物改良研究领域试验数据的技术水平。在其前身GridScore的基础上，GridScore NEXT的开发是由现实生活驱动的，在与许多作物的专家用户组的现场交互中。这种迭代设计方法允许开发和测试新功能。来自“生物多样性促进机遇、生计和发展”（BOLD）项目的合作者，重点关注包括水稻、草草和苜蓿在内的作物，以及大麦、土豆、蔬菜和蓝莓团队，通过培训课程、访谈以及应用程序的现场使用，提供了宝贵的见解。结果：GridScore NEXT的主要改进包括增强的数据收集工具，支持地块内单株植物表型，并启用新的数据类型，如GPS坐标和图像特征。GridScore NEXT提供可定制的用户定义验证规则，以帮助防止错误，并结合条形码扫描，以实现准确，高效的数据捕获。该应用程序比其前身提供了更多的数据可视化工具箱，包括热图和统计箱形图，这有助于识别潜在的数据问题，并了解现场的试验性能。GridScore NEXT是跨平台的，可以在没有互联网连接的情况下操作，使其成为偏远地区现场使用的理想选择。它的采用导致了方法的标准化，显著减少了错误，并及时共享数据，使育种前和表征实验的决策更快。GridScore NEXT在开源（Apache 2.0）许可下可用，并且免费提供给所有人，没有任何限制。它提供自托管选项，以增强数据安全性和隐私性。GridScore NEXT显示了广泛的适用性，不仅适用于植物表型实验，而且适用于任何需要收集准确数据的实验。

{"title":"Beyond the clipboard: data collection with GridScore NEXT.","authors":"Sebastian Raubach, Miriam Schreiber, Ruth Hamilton, Gaynor McKenzie, Susan McCallum, Benjamin Kilian, Alan Humphries, Loi Huu Nguyen, Tin Huynh Quang, Akanksha Singh, Shivali Sharma, Sarah Trinder, Manuel Feser, Paul D Shaw","doi":"10.1186/s12859-025-06352-5","DOIUrl":"https://doi.org/10.1186/s12859-025-06352-5","url":null,"abstract":"Background: Accurate acquisition of phenotypic data is critical for cataloguing and utilising genetic variation in cultivated crops, landraces, and their wild relatives. The collection of phenotypic data using handwritten notes often introduces errors which can and should be avoided. Electronic data collection is crucial for ensuring error prevention and data standardisation and thus ensuring high-quality, reliable data.Implementation: This paper describes the development of GridScore NEXT, a new plant phenotyping application that significantly advances the state of the art for collecting field trial data in plant genetics, pre-breeding and crop improvement research. Building on its predecessor, GridScore, the development of GridScore NEXT was driven by real life, in the field interactions with expert user groups across a number of crops. This iterative design methodology allowed the development and testing of new features. Collaborators from the 'Biodiversity for Opportunities, Livelihoods and Development' (BOLD) project, focusing on crops including rice, grasspea, and alfalfa, along with barley, potato, vegetable and blueberry teams, provided invaluable insights through training sessions and interviews and in the field use of the application.Results: Key improvements to GridScore NEXT include enhanced data collection tools, supporting individual plant phenotyping within plots and enabling new data types such as GPS coordinates and image traits. GridScore NEXT provides customisable user defined validation rules to help prevent errors and incorporates barcode scanning for accurate, efficient data capture. The application offers an increased toolbox of data visualizations over its predecessor including heatmaps and statistical box plots, which aid in identifying potential data issues and understanding trial performance in the field. GridScore NEXT is cross-platform and can operate without an internet connection, making it ideal for field use in remote areas. Its adoption has led to standardisation of methods, significant error reduction, and the timely sharing of data, enabling quicker decision-making in pre-breeding and characterisation experiments. GridScore NEXT is available under an open-source (Apache 2.0) licence and freely available to all with no restrictions. It offers self-hosting options for enhanced data security and privacy. GridScore NEXT shows broad applicability across a diverse range of not only plant phenotyping experiments, but any experiment that requires the collection of accurate data.","PeriodicalId":8958,"journal":{"name":"BMC Bioinformatics","volume":" ","pages":""},"PeriodicalIF":3.3,"publicationDate":"2026-01-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146017147","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

BioMark: biomarker analysis tool. BioMark：生物标志物分析工具。

IF 3.3 3区生物学 Q2 BIOCHEMICAL RESEARCH METHODS

BMC Bioinformatics

Pub Date : 2026-01-20 DOI: 10.1186/s12859-025-06346-3

Mehmet Ali Balikci, Cyrille Mesue Njume, Ali Cakmak

Biomarkers play a pivotal role in disease diagnosis and prognosis by offering molecular insights into biological states. The rapid growth of high-throughput omics technologies has enabled the generation of large-scale biomarker datasets, yet analyzing these complex, high-dimensional data remains a major challenge-particularly for researchers lacking advanced computational expertise. While numerous tools exist for omics data analysis, many fall short in providing an integrated, user-friendly environment tailored specifically for biomarker discovery and interpretation. To address this gap, we present BioMark, a web-based platform designed to streamline biomarker analysis across diverse omics types. BioMark integrates robust statistical methods with widely used machine learning algorithms to support key workflows including statistical analysis, dimensionality reduction, classification, and subsequent model explanation. The platform emphasizes accessibility, offering intuitive visualizations and automated reporting to facilitate interpretation and dissemination of results. Notably, BioMark also offers a feature-ranking strategy that consolidates outputs from multiple analytical methods, enhancing the robustness of biomarker identification. By lowering the barrier to advanced biomarker analytics, BioMark empowers a broader range of researchers to uncover clinically relevant molecular signatures and accelerate translational research. Biomark is available online at https://bioinf.itu.edu.tr/biomark.

生物标志物通过提供对生物状态的分子洞察，在疾病诊断和预后中发挥着关键作用。高通量组学技术的快速发展使大规模生物标志物数据集的产生成为可能，然而，分析这些复杂的高维数据仍然是一个主要挑战，特别是对于缺乏高级计算专业知识的研究人员。虽然存在许多用于组学数据分析的工具，但许多工具在为生物标志物的发现和解释提供专门定制的集成的、用户友好的环境方面存在不足。为了解决这一差距，我们提出了一个基于网络的平台BioMark，旨在简化不同组学类型的生物标志物分析。BioMark将强大的统计方法与广泛使用的机器学习算法集成在一起，以支持关键的工作流程，包括统计分析、降维、分类和随后的模型解释。该平台强调可访问性，提供直观的可视化和自动报告，以促进结果的解释和传播。值得注意的是，BioMark还提供了一种特征排序策略，该策略整合了多种分析方法的输出，增强了生物标志物鉴定的稳健性。通过降低高级生物标志物分析的门槛，BioMark使更广泛的研究人员能够发现临床相关的分子特征，并加速转化研究。Biomark的网站是https://bioinf.itu.edu.tr/biomark。

{"title":"BioMark: biomarker analysis tool.","authors":"Mehmet Ali Balikci, Cyrille Mesue Njume, Ali Cakmak","doi":"10.1186/s12859-025-06346-3","DOIUrl":"https://doi.org/10.1186/s12859-025-06346-3","url":null,"abstract":"Biomarkers play a pivotal role in disease diagnosis and prognosis by offering molecular insights into biological states. The rapid growth of high-throughput omics technologies has enabled the generation of large-scale biomarker datasets, yet analyzing these complex, high-dimensional data remains a major challenge-particularly for researchers lacking advanced computational expertise. While numerous tools exist for omics data analysis, many fall short in providing an integrated, user-friendly environment tailored specifically for biomarker discovery and interpretation. To address this gap, we present BioMark, a web-based platform designed to streamline biomarker analysis across diverse omics types. BioMark integrates robust statistical methods with widely used machine learning algorithms to support key workflows including statistical analysis, dimensionality reduction, classification, and subsequent model explanation. The platform emphasizes accessibility, offering intuitive visualizations and automated reporting to facilitate interpretation and dissemination of results. Notably, BioMark also offers a feature-ranking strategy that consolidates outputs from multiple analytical methods, enhancing the robustness of biomarker identification. By lowering the barrier to advanced biomarker analytics, BioMark empowers a broader range of researchers to uncover clinically relevant molecular signatures and accelerate translational research. Biomark is available online at https://bioinf.itu.edu.tr/biomark.","PeriodicalId":8958,"journal":{"name":"BMC Bioinformatics","volume":" ","pages":""},"PeriodicalIF":3.3,"publicationDate":"2026-01-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146008491","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Benchmarking of methods to analyse data derived from GBS-MeDIP. 对GBS-MeDIP数据分析方法进行基准测试。

IF 3.3 3区生物学 Q2 BIOCHEMICAL RESEARCH METHODS

BMC Bioinformatics

Pub Date : 2026-01-19 DOI: 10.1186/s12859-025-06330-x

Violeta de Anca Prado, Fábio Pértille, Pedro Sá, Marta Gòdia, Joëlle Rüegg, Josep C Jimenez-Chillaron, Carlos Guerrero-Bosagna

引用次数: 0

Limblab: pipeline for 3D analysis and visualisation of limb bud gene expression. Limblab：用于肢体芽基因表达的三维分析和可视化的管道。

IF 3.3 3区生物学 Q2 BIOCHEMICAL RESEARCH METHODS

BMC Bioinformatics

Pub Date : 2026-01-12 DOI: 10.1186/s12859-025-06264-4

Laura Aviñó-Esteban, Heura Cardona-Blaya, Marco Musy, Antoni Matyjaszkiewicz, James Sharpe, Giovanni Dalmasso

Background: Although some aspects of limb development can be treated as a 2D problem, a true understanding of the morphogenesis and patterning requires 3D analysis. Since the data on gene expression patterns are largely static 3D image stacks, a major challenge is an efficient pipeline for staging each data-set, and then aligning and warping the data into a standard atlas for convenient visualisation.

Results: We present a novel bioinformatic pipeline tailored for 3D visualization and analysis of developing limb buds. The pipeline integrates key steps such as data acquisition, volume cleaning, surface extraction, staging, alignment, and advanced visualization techniques. Its modular design allows researchers to customize workflows while maintaining compatibility with tools such as Fiji and Vedo. The pipeline can be accessed at https://github.com/LauAvinyo/limblab .

Conclusions: The pipeline advances 3D gene expression analysis in limb development by integrating flexible tools for staging, alignment, and visualization. It is user-friendly, scalable to other samples, and optimized for research needs. Future updates will enhance customization and expand applicability to other species and developmental biology fields.

背景：虽然肢体发育的某些方面可以被视为二维问题，但要真正理解形态发生和模式需要三维分析。由于基因表达模式的数据主要是静态的3D图像堆栈，一个主要的挑战是如何有效地将每个数据集分级，然后将数据对齐和扭曲成一个标准的图谱，以方便可视化。结果：我们提出了一种新的生物信息学管道，专门用于发育中的肢体芽的三维可视化和分析。该管道集成了关键步骤，如数据采集、体积清洗、表面提取、分段、对齐和先进的可视化技术。它的模块化设计允许研究人员定制工作流程，同时保持与斐济和Vedo等工具的兼容性。该管道可以在https://github.com/LauAvinyo/limblab上访问。结论：该管道通过整合灵活的分期、排列和可视化工具，推进了肢体发育的3D基因表达分析。它对用户友好，可扩展到其他样本，并针对研究需求进行了优化。未来的更新将增强定制和扩展到其他物种和发育生物学领域的适用性。

{"title":"Limblab: pipeline for 3D analysis and visualisation of limb bud gene expression.","authors":"Laura Aviñó-Esteban, Heura Cardona-Blaya, Marco Musy, Antoni Matyjaszkiewicz, James Sharpe, Giovanni Dalmasso","doi":"10.1186/s12859-025-06264-4","DOIUrl":"10.1186/s12859-025-06264-4","url":null,"abstract":"Background: Although some aspects of limb development can be treated as a 2D problem, a true understanding of the morphogenesis and patterning requires 3D analysis. Since the data on gene expression patterns are largely static 3D image stacks, a major challenge is an efficient pipeline for staging each data-set, and then aligning and warping the data into a standard atlas for convenient visualisation.Results: We present a novel bioinformatic pipeline tailored for 3D visualization and analysis of developing limb buds. The pipeline integrates key steps such as data acquisition, volume cleaning, surface extraction, staging, alignment, and advanced visualization techniques. Its modular design allows researchers to customize workflows while maintaining compatibility with tools such as Fiji and Vedo. The pipeline can be accessed at https://github.com/LauAvinyo/limblab .Conclusions: The pipeline advances 3D gene expression analysis in limb development by integrating flexible tools for staging, alignment, and visualization. It is user-friendly, scalable to other samples, and optimized for research needs. Future updates will enhance customization and expand applicability to other species and developmental biology fields.","PeriodicalId":8958,"journal":{"name":"BMC Bioinformatics","volume":"27 1","pages":"6"},"PeriodicalIF":3.3,"publicationDate":"2026-01-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12794269/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145958550","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Comic: explainable drug repurposing via contrastive masking for interpretable connections. 漫画：可解释的药物重新利用通过对比掩蔽可解释的连接。

IF 3.3 3区生物学 Q2 BIOCHEMICAL RESEARCH METHODS

BMC Bioinformatics

Pub Date : 2026-01-09 DOI: 10.1186/s12859-025-06337-4

Naafey Aamer, Muhammad Nabeel Asim, Andreas Dengel

引用次数: 0

From 2D to 4D: a containerized workflow and browser to explore dynamic chromatin architecture. 从2D到4D：一个容器化的工作流程和浏览器，探索动态染色质架构。

IF 3.3 3区生物学 Q2 BIOCHEMICAL RESEARCH METHODS

BMC Bioinformatics

Pub Date : 2026-01-08 DOI: 10.1186/s12859-025-06361-4

David H Rogers, Cullen Roth, Cameron Tauxe, Jeannie T Lee, Christina R Steadman, Karissa Y Sanbonmatsu, Anna Lappala, Shawn R Starkenburg

引用次数: 0

Predicting the pathway involvement of metabolites annotated in the MetaCyc knowledgebase. 预测MetaCyc知识库中标注的代谢物的通路参与。

IF 3.3 3区生物学 Q2 BIOCHEMICAL RESEARCH METHODS

BMC Bioinformatics

Pub Date : 2026-01-07 DOI: 10.1186/s12859-025-06358-z

Erik D Huckvale, Hunter N B Moseley

Background: The associations of metabolites with biochemical pathways are highly useful information for interpreting molecular datasets generated in biological and biomedical research. However, such pathway annotations are sparse in most molecular datasets, limiting their utility for pathway level interpretation. To address these shortcomings, several past publications have presented machine learning models for predicting the pathway association of small biomolecule (metabolite and xenobiotic) using data from the Kyoto Encyclopedia of Genes and Genomes (KEGG). But other similar knowledgebases exist, for example MetaCyc, which has more compound entries and pathway definitions than KEGG.

Results: As a logical next step, we trained and evaluated multilayer perceptron models on compound entries and pathway annotations obtained from MetaCyc. From the models trained on this dataset, we observed a mean Matthews correlation coefficient (MCC) of 0.845 with 0.0101 standard deviation, compared to a mean MCC of 0.847 with 0.0098 standard deviation for the KEGG dataset. However, KEGG's 184 metabolic-only pathway predictions (out of 502 total pathways) have a mean MCC of 0.800 with 0.021 standard deviation. Since MetaCyc pathways are metabolic focused, the MetaCyc results represent over a 5.6% improvement in metabolic pathway prediction performance.

Conclusions: These performance results are pragmatically the same, demonstrating that in aggregate, the 4055 MetaCyc pathways can be effectively predicted at the current state-of-the-art performance level.

背景：代谢物与生化途径的关联对于解释生物学和生物医学研究中产生的分子数据集是非常有用的信息。然而，这种途径注释在大多数分子数据集中是稀疏的，限制了它们在途径级解释中的效用。为了解决这些缺点，过去的一些出版物已经提出了机器学习模型，用于使用京都基因和基因组百科全书（KEGG）的数据预测小生物分子（代谢物和异种生物）的途径关联。但是也存在其他类似的知识库，例如MetaCyc，它比KEGG拥有更多的复合条目和路径定义。结果：作为逻辑上的下一步，我们训练并评估了多层感知器模型，该模型基于从MetaCyc获得的复合条目和路径注释。从该数据集上训练的模型中，我们观察到马修斯相关系数（MCC）的平均值为0.845，标准差为0.0101，而KEGG数据集的平均MCC为0.847，标准差为0.0098。然而，KEGG的184个仅代谢途径预测（502个总途径）的平均MCC为0.800，标准差为0.021。由于MetaCyc途径以代谢为重点，因此MetaCyc结果在代谢途径预测性能方面提高了5.6%以上。结论：这些性能结果实际上是相同的，表明总的来说，4055个MetaCyc路径可以在当前最先进的性能水平上有效地预测。

{"title":"Predicting the pathway involvement of metabolites annotated in the MetaCyc knowledgebase.","authors":"Erik D Huckvale, Hunter N B Moseley","doi":"10.1186/s12859-025-06358-z","DOIUrl":"10.1186/s12859-025-06358-z","url":null,"abstract":"Background: The associations of metabolites with biochemical pathways are highly useful information for interpreting molecular datasets generated in biological and biomedical research. However, such pathway annotations are sparse in most molecular datasets, limiting their utility for pathway level interpretation. To address these shortcomings, several past publications have presented machine learning models for predicting the pathway association of small biomolecule (metabolite and xenobiotic) using data from the Kyoto Encyclopedia of Genes and Genomes (KEGG). But other similar knowledgebases exist, for example MetaCyc, which has more compound entries and pathway definitions than KEGG.Results: As a logical next step, we trained and evaluated multilayer perceptron models on compound entries and pathway annotations obtained from MetaCyc. From the models trained on this dataset, we observed a mean Matthews correlation coefficient (MCC) of 0.845 with 0.0101 standard deviation, compared to a mean MCC of 0.847 with 0.0098 standard deviation for the KEGG dataset. However, KEGG's 184 metabolic-only pathway predictions (out of 502 total pathways) have a mean MCC of 0.800 with 0.021 standard deviation. Since MetaCyc pathways are metabolic focused, the MetaCyc results represent over a 5.6% improvement in metabolic pathway prediction performance.Conclusions: These performance results are pragmatically the same, demonstrating that in aggregate, the 4055 MetaCyc pathways can be effectively predicted at the current state-of-the-art performance level.","PeriodicalId":8958,"journal":{"name":"BMC Bioinformatics","volume":" ","pages":"36"},"PeriodicalIF":3.3,"publicationDate":"2026-01-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12870939/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145910342","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Hgtsynergy: a transfer learning method for predicting anticancer synergistic drug combinations based on a drug-drug interaction heterogeneous graph. Hgtsynergy：一种基于药物-药物相互作用异构图预测抗癌协同药物组合的迁移学习方法。

IF 3.3 3区生物学 Q2 BIOCHEMICAL RESEARCH METHODS

BMC Bioinformatics

Pub Date : 2026-01-06 DOI: 10.1186/s12859-025-06360-5

Xiaowen Wang, Yanming Huang, Hongming Zhu, Dongsheng Mao, Xiaoli Zhu, Qin Liu

Background: Drug combination therapy often outperforms monotherapy in cancer treatment, but the vast number of available drugs makes manual screening for synergistic combinations costly. Computational methods, especially deep learning, can reduce the search space by predicting likely synergistic drug combinations. Recent studies have improved drug synergy prediction by modeling associations among different biological entities, but drug-drug interactions have not been fully leveraged in this scenario, which motivated the work presented in this paper.

Methods: This paper proposes a deep learning method named HGTSynergy to predict synergistic drug combinations, which employs a heterogeneous graph attention network and a tailored task to capture complex latent patterns in the drug network as prior knowledge. The learned knowledge is then transferred through a transfer learning framework to the downstream task of predicting drug synergy scores, effectively enhancing predictive performance.

Results: A five-fold nested cross-validation is employed to train HGTSynergy. In the synergy regression task, HGTSynergy outperforms seven deep learning methods, achieving a mean squared error of 222.83, root mean squared error of 14.91, and Pearson correlation coefficient of 0.75. For the synergy classification task, it also surpasses other methods with an area under the receiver operating characteristic curve of 0.90, area under the precision-recall curve of 0.63, accuracy of 0.94, precision of 0.72, and Cohen's Kappa of 0.52. The ablation study verifies that the heterogeneous graph attention network and the transfer learning framework both have a positive effect on prediction performance. Moreover, a series of analyses demonstrates that the proposed method exhibits strong generalization performance and interpretability. The case study further validates its consistency with prior research.

Conclusions: This study suggests that drug synergy prediction can be improved by comprehensively modeling diverse drug-drug interaction types and leveraging transfer learning to extract prior knowledge from them. The ability of HGTSynergy to discover new anticancer synergistic drug combinations outperforms other state-of-the-art methods. HGTSynergy promises to be a powerful tool to pre-screen anticancer synergistic drug combinations.

背景：在癌症治疗中，药物联合治疗通常优于单一治疗，但是大量可用的药物使得人工筛选协同联合治疗的成本很高。计算方法，特别是深度学习，可以通过预测可能的协同药物组合来减少搜索空间。最近的研究通过对不同生物实体之间的关联进行建模，改进了药物协同作用的预测，但在这种情况下，药物-药物相互作用尚未得到充分利用，这促使了本文的工作。方法：本文提出了一种名为HGTSynergy的深度学习方法来预测协同药物组合，该方法采用异构图注意网络和定制任务来捕获药物网络中的复杂潜在模式作为先验知识。然后通过迁移学习框架将学习到的知识转移到预测药物协同得分的下游任务中，有效地提高了预测性能。结果：采用五重嵌套交叉验证来训练HGTSynergy。在协同回归任务中，HGTSynergy优于7种深度学习方法，均方误差为222.83，均方根误差为14.91，Pearson相关系数为0.75。对于协同分类任务，它也优于其他方法，其接收者工作特征曲线下面积为0.90，精密度-召回率曲线下面积为0.63，准确度为0.94，精密度为0.72，科恩Kappa为0.52。实验验证了异构图注意网络和迁移学习框架对预测性能都有积极的影响。此外，一系列分析表明，该方法具有较强的泛化性能和可解释性。案例研究进一步验证了其与前人研究的一致性。结论：通过对多种药物-药物相互作用类型进行综合建模，并利用迁移学习从中提取先验知识，可以提高药物协同作用预测的准确性。HGTSynergy发现新的抗癌协同药物组合的能力优于其他最先进的方法。HGTSynergy有望成为预筛选抗癌协同药物组合的有力工具。

{"title":"Hgtsynergy: a transfer learning method for predicting anticancer synergistic drug combinations based on a drug-drug interaction heterogeneous graph.","authors":"Xiaowen Wang, Yanming Huang, Hongming Zhu, Dongsheng Mao, Xiaoli Zhu, Qin Liu","doi":"10.1186/s12859-025-06360-5","DOIUrl":"10.1186/s12859-025-06360-5","url":null,"abstract":"Background: Drug combination therapy often outperforms monotherapy in cancer treatment, but the vast number of available drugs makes manual screening for synergistic combinations costly. Computational methods, especially deep learning, can reduce the search space by predicting likely synergistic drug combinations. Recent studies have improved drug synergy prediction by modeling associations among different biological entities, but drug-drug interactions have not been fully leveraged in this scenario, which motivated the work presented in this paper.Methods: This paper proposes a deep learning method named HGTSynergy to predict synergistic drug combinations, which employs a heterogeneous graph attention network and a tailored task to capture complex latent patterns in the drug network as prior knowledge. The learned knowledge is then transferred through a transfer learning framework to the downstream task of predicting drug synergy scores, effectively enhancing predictive performance.Results: A five-fold nested cross-validation is employed to train HGTSynergy. In the synergy regression task, HGTSynergy outperforms seven deep learning methods, achieving a mean squared error of 222.83, root mean squared error of 14.91, and Pearson correlation coefficient of 0.75. For the synergy classification task, it also surpasses other methods with an area under the receiver operating characteristic curve of 0.90, area under the precision-recall curve of 0.63, accuracy of 0.94, precision of 0.72, and Cohen's Kappa of 0.52. The ablation study verifies that the heterogeneous graph attention network and the transfer learning framework both have a positive effect on prediction performance. Moreover, a series of analyses demonstrates that the proposed method exhibits strong generalization performance and interpretability. The case study further validates its consistency with prior research.Conclusions: This study suggests that drug synergy prediction can be improved by comprehensively modeling diverse drug-drug interaction types and leveraging transfer learning to extract prior knowledge from them. The ability of HGTSynergy to discover new anticancer synergistic drug combinations outperforms other state-of-the-art methods. HGTSynergy promises to be a powerful tool to pre-screen anticancer synergistic drug combinations.","PeriodicalId":8958,"journal":{"name":"BMC Bioinformatics","volume":" ","pages":"35"},"PeriodicalIF":3.3,"publicationDate":"2026-01-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12870253/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145910284","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0