首页 > 最新文献

Frontiers in bioinformatics最新文献

英文 中文
Editorial: Computational protein function prediction based on sequence and/or structural data. 编辑:基于序列和/或结构数据的计算蛋白质功能预测。
IF 3.9 Q2 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2025-10-31 eCollection Date: 2025-01-01 DOI: 10.3389/fbinf.2025.1705252
Yaan J Jang
{"title":"Editorial: Computational protein function prediction based on sequence and/or structural data.","authors":"Yaan J Jang","doi":"10.3389/fbinf.2025.1705252","DOIUrl":"10.3389/fbinf.2025.1705252","url":null,"abstract":"","PeriodicalId":73066,"journal":{"name":"Frontiers in bioinformatics","volume":"5 ","pages":"1705252"},"PeriodicalIF":3.9,"publicationDate":"2025-10-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12615499/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145544048","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Segmentation and modeling of large-scale microvascular networks: a survey. 大规模微血管网络的分割和建模:综述。
IF 3.9 Q2 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2025-10-31 eCollection Date: 2025-01-01 DOI: 10.3389/fbinf.2025.1645520
Helya Goharbavang, Artem T Ashitkov, Athira Pillai, Joshua D Wythe, Guoning Chen, David Mayerich

Recent advances in three-dimensional microscopy enable imaging of whole-organ microvascular networks in small animals. Since microvasculature plays a crucial role in tissue development and function, its structure may provide diagnostic biomarkers and insight into disease progression. However, the microscopy community currently lacks benchmarks for scalable algorithms to measure these potential biomarkers. While many algorithms exist for segmenting vessel-like structures and extracting their surface features and connectivity, they have not been thoroughly evaluated on modern gigavoxel-scale images. In this paper, we propose a comprehensive yet compact survey of available algorithms. We focus on essential features for microvascular analysis, including extracting vessel surfaces and the network's associated connectivity. We select a series of algorithms based on popularity and availability and provide a thorough quantitative analysis of their performance on datasets acquired using light sheet fluorescence microscopy (LSFM), knife-edge scanning microscopy (KESM), and X-ray microtomography (µ-CT).

三维显微镜的最新进展使小动物的全器官微血管网络成像成为可能。由于微血管在组织发育和功能中起着至关重要的作用,其结构可能提供诊断生物标志物和疾病进展的见解。然而,显微镜学界目前缺乏可扩展算法的基准来测量这些潜在的生物标志物。虽然存在许多用于分割类血管结构并提取其表面特征和连通性的算法,但它们尚未在现代千兆像素尺度图像上进行彻底评估。在本文中,我们提出了一个全面而紧凑的可用算法调查。我们专注于微血管分析的基本特征,包括提取血管表面和网络相关的连通性。我们根据受欢迎程度和可用性选择了一系列算法,并对它们在使用光片荧光显微镜(LSFM)、刀口扫描显微镜(KESM)和x射线微断层扫描(µ-CT)获得的数据集上的性能进行了全面的定量分析。
{"title":"Segmentation and modeling of large-scale microvascular networks: a survey.","authors":"Helya Goharbavang, Artem T Ashitkov, Athira Pillai, Joshua D Wythe, Guoning Chen, David Mayerich","doi":"10.3389/fbinf.2025.1645520","DOIUrl":"10.3389/fbinf.2025.1645520","url":null,"abstract":"<p><p>Recent advances in three-dimensional microscopy enable imaging of whole-organ microvascular networks in small animals. Since microvasculature plays a crucial role in tissue development and function, its structure may provide diagnostic biomarkers and insight into disease progression. However, the microscopy community currently lacks benchmarks for scalable algorithms to measure these potential biomarkers. While many algorithms exist for segmenting vessel-like structures and extracting their surface features and connectivity, they have not been thoroughly evaluated on modern gigavoxel-scale images. In this paper, we propose a comprehensive yet compact survey of available algorithms. We focus on essential features for microvascular analysis, including extracting vessel surfaces and the network's associated connectivity. We select a series of algorithms based on popularity and availability and provide a thorough quantitative analysis of their performance on datasets acquired using light sheet fluorescence microscopy (LSFM), knife-edge scanning microscopy (KESM), and X-ray microtomography (µ-CT).</p>","PeriodicalId":73066,"journal":{"name":"Frontiers in bioinformatics","volume":"5 ","pages":"1645520"},"PeriodicalIF":3.9,"publicationDate":"2025-10-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12616183/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145544065","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The importance of democratized resources in early-career training for bioimage analysts and bioimaging scientists. 民主化资源在早期职业培训对生物图像分析师和生物成像科学家的重要性。
IF 3.9 Q2 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2025-10-30 eCollection Date: 2025-01-01 DOI: 10.3389/fbinf.2025.1693343
Genevieve Laprade, Quinn Lee, Kristin L Gallik, Michael Nelson, Natalie Woo, Celina Terán Ramírez, Alexis Ricardo Becerril Cuevas, Kevin W Eliceiri, Corinne Esquibel

The fields of bioimaging and image analysis are rapidly expanding as new technologies transform biological questions into novel insights. While professionals of varying expertise are essential to achieving these advancements, early-career scientists-a prominent user group within the imaging community-are often assumed to have the prerequisite knowledge and ability to use these tools. This demographic, consisting of students, post-docs, and bioimage analysis trainees, is critical for the field to continue to evolve and flourish. However, obstacles such as geographic location, language barriers, insufficient funding or training, and instrument availability hinder access to resources and introduce significant knowledge gaps, especially for scientists in early-career stages. Democratized resources for bioimaging and analysis such as forums, community organizations, and publicly available datasets have been helpful in overcoming barriers to access for early-career scientists. Here, we discuss the current tools and resources available for early-career researchers, highlight their limitations from the learners' perspective, and propose strategies to better support this group. As bioimage analysis extends broadly into many scientific disciplines, we implore all members of this community, regardless of experience level, to empower next-generation scientists.

随着新技术将生物学问题转化为新的见解,生物成像和图像分析领域正在迅速扩大。虽然不同专业知识的专业人员对于实现这些进步至关重要,但早期职业科学家-成像社区中的重要用户群体-通常被认为具有使用这些工具的先决知识和能力。这一人口统计,包括学生,博士后和生物图像分析学员,是该领域继续发展和繁荣的关键。然而,地理位置、语言障碍、资金或培训不足以及仪器可用性等障碍阻碍了获取资源,并导致了重大的知识鸿沟,特别是对处于职业生涯早期阶段的科学家而言。生物成像和分析的民主化资源,如论坛、社区组织和公开可用的数据集,有助于克服早期职业科学家访问的障碍。在这里,我们讨论了目前可供早期职业研究人员使用的工具和资源,从学习者的角度强调了它们的局限性,并提出了更好地支持这一群体的策略。随着生物图像分析广泛地扩展到许多科学学科,我们恳请这个社区的所有成员,无论经验水平如何,都能赋予下一代科学家权力。
{"title":"The importance of democratized resources in early-career training for bioimage analysts and bioimaging scientists.","authors":"Genevieve Laprade, Quinn Lee, Kristin L Gallik, Michael Nelson, Natalie Woo, Celina Terán Ramírez, Alexis Ricardo Becerril Cuevas, Kevin W Eliceiri, Corinne Esquibel","doi":"10.3389/fbinf.2025.1693343","DOIUrl":"10.3389/fbinf.2025.1693343","url":null,"abstract":"<p><p>The fields of bioimaging and image analysis are rapidly expanding as new technologies transform biological questions into novel insights. While professionals of varying expertise are essential to achieving these advancements, early-career scientists-a prominent user group within the imaging community-are often assumed to have the prerequisite knowledge and ability to use these tools. This demographic, consisting of students, post-docs, and bioimage analysis trainees, is critical for the field to continue to evolve and flourish. However, obstacles such as geographic location, language barriers, insufficient funding or training, and instrument availability hinder access to resources and introduce significant knowledge gaps, especially for scientists in early-career stages. Democratized resources for bioimaging and analysis such as forums, community organizations, and publicly available datasets have been helpful in overcoming barriers to access for early-career scientists. Here, we discuss the current tools and resources available for early-career researchers, highlight their limitations from the learners' perspective, and propose strategies to better support this group. As bioimage analysis extends broadly into many scientific disciplines, we implore all members of this community, regardless of experience level, to empower next-generation scientists.</p>","PeriodicalId":73066,"journal":{"name":"Frontiers in bioinformatics","volume":"5 ","pages":"1693343"},"PeriodicalIF":3.9,"publicationDate":"2025-10-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12611831/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145544038","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Gene expression profile in colon cancer therapeutic resistance and its relationship with the tumor microenvironment. 结肠癌耐药基因表达谱及其与肿瘤微环境的关系
IF 3.9 Q2 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2025-10-29 eCollection Date: 2025-01-01 DOI: 10.3389/fbinf.2025.1674179
Priscila Galvão Doria, Gisele Vieira Rocha, Vanessa Dybal Bertoni, Roberto de Souza Batista Dos Santos, Mariana Araújo-Pereira, Clarissa Gurgel

Introduction: Colon cancer is a common disease, treated with few chemotherapeutic agents with similar treatment sequencing despite its heterogeneity. A significant proportion of patients are diagnosed with metastasis, and resistance to antineoplastic drugs is associated with disease progression and therapeutic failure. It is known that the tumor microenvironment plays an essential role in cancer progression, contributing to processes that may be associated with therapeutic resistance mechanisms in colon cancer. In this study, we aim to identify a gene expression signature and its relationship with immune cell infiltration in colon cancer, contributing to the identification of potential resistance biomarkers.

Methods: An in silico study was conducted using RNA-seq data from The Cancer Genome Atlas Program (TCGA) samples, subdivided into two groups (treatment-resistant and non-resistant), taking into account the molecular subgroups (CMS1, CMS2, CMS3, and CMS4). The following algorithms were used: i. Limma was applied to identify differentially expressed genes; ii. WGCNA was applied to construct co-expression networks; iii. CIBERSORT was applied to estimate the proportion of infiltrating immune cells; and iv. TIMER was applied to explore the relationship between core genes and immune cell content.

Results: Twenty differentially expressed genes (DEGs) were found, with 18 related to the group considered resistant to oncologic treatment and presenting poorer overall survival. T CD4 memory resting cells and M0 and M2 macrophages were found in more significant proportions in the analyzed samples and more infiltrated in the tumor microenvironment, the higher the expression of some of these resistance DEGs. Additionally, these genes correlate with biological aspects of neuronal differentiation, axogenesis, and synaptic transmission.

Conclusion: The gene expression signature suggests the presence of differentially expressed synaptic membrane genes, which may be involved in neuronal pathways that influence the tumor microenvironment, potentially serving as future biomarkers. Furthermore, the presence of M0 and M2 macrophages and T CD4 memory resting cells suggests a potential interaction that may play a role in therapeutic resistance.

导读:结肠癌是一种常见的疾病,尽管存在异质性,但治疗顺序相似的化疗药物很少。相当比例的患者被诊断为转移,对抗肿瘤药物的耐药性与疾病进展和治疗失败有关。众所周知,肿瘤微环境在癌症进展中起着至关重要的作用,有助于可能与结肠癌治疗耐药机制相关的过程。在本研究中,我们旨在鉴定结肠癌中一个基因表达特征及其与免疫细胞浸润的关系,有助于鉴定潜在的耐药生物标志物。方法:利用来自癌症基因组图谱计划(TCGA)样本的RNA-seq数据进行计算机研究,考虑到分子亚群(CMS1, CMS2, CMS3和CMS4),将样本细分为两组(治疗耐药和非耐药)。采用以下算法:i.利用Limma法鉴定差异表达基因;2。应用WGCNA构建共表达网络;3。采用CIBERSORT估计浸润免疫细胞比例;iv.应用TIMER方法探讨核心基因与免疫细胞含量的关系。结果:发现了20个差异表达基因(DEGs),其中18个与肿瘤治疗耐药和总生存率较差的组相关。T CD4记忆性静息细胞和M0、M2巨噬细胞在分析样本中所占比例越显著,在肿瘤微环境中浸润程度越高,部分耐药deg的表达越高。此外,这些基因与神经元分化、轴生和突触传递的生物学方面相关。结论:基因表达特征提示存在差异表达的突触膜基因,这些基因可能参与影响肿瘤微环境的神经通路,可能作为未来的生物标志物。此外,M0和M2巨噬细胞与T CD4记忆静息细胞的存在表明可能在治疗抵抗中发挥作用的潜在相互作用。
{"title":"Gene expression profile in colon cancer therapeutic resistance and its relationship with the tumor microenvironment.","authors":"Priscila Galvão Doria, Gisele Vieira Rocha, Vanessa Dybal Bertoni, Roberto de Souza Batista Dos Santos, Mariana Araújo-Pereira, Clarissa Gurgel","doi":"10.3389/fbinf.2025.1674179","DOIUrl":"10.3389/fbinf.2025.1674179","url":null,"abstract":"<p><strong>Introduction: </strong>Colon cancer is a common disease, treated with few chemotherapeutic agents with similar treatment sequencing despite its heterogeneity. A significant proportion of patients are diagnosed with metastasis, and resistance to antineoplastic drugs is associated with disease progression and therapeutic failure. It is known that the tumor microenvironment plays an essential role in cancer progression, contributing to processes that may be associated with therapeutic resistance mechanisms in colon cancer. In this study, we aim to identify a gene expression signature and its relationship with immune cell infiltration in colon cancer, contributing to the identification of potential resistance biomarkers.</p><p><strong>Methods: </strong>An <i>in silico</i> study was conducted using RNA-seq data from The Cancer Genome Atlas Program (TCGA) samples, subdivided into two groups (treatment-resistant and non-resistant), taking into account the molecular subgroups (CMS1, CMS2, CMS3, and CMS4). The following algorithms were used: i. <i>Limma</i> was applied to identify differentially expressed genes; ii. WGCNA was applied to construct co-expression networks; iii. CIBERSORT was applied to estimate the proportion of infiltrating immune cells; and iv. TIMER was applied to explore the relationship between core genes and immune cell content.</p><p><strong>Results: </strong>Twenty differentially expressed genes (DEGs) were found, with 18 related to the group considered resistant to oncologic treatment and presenting poorer overall survival. T CD4 memory resting cells and M0 and M2 macrophages were found in more significant proportions in the analyzed samples and more infiltrated in the tumor microenvironment, the higher the expression of some of these resistance DEGs. Additionally, these genes correlate with biological aspects of neuronal differentiation, axogenesis, and synaptic transmission.</p><p><strong>Conclusion: </strong>The gene expression signature suggests the presence of differentially expressed synaptic membrane genes, which may be involved in neuronal pathways that influence the tumor microenvironment, potentially serving as future biomarkers. Furthermore, the presence of M0 and M2 macrophages and T CD4 memory resting cells suggests a potential interaction that may play a role in therapeutic resistance.</p>","PeriodicalId":73066,"journal":{"name":"Frontiers in bioinformatics","volume":"5 ","pages":"1674179"},"PeriodicalIF":3.9,"publicationDate":"2025-10-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12604976/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145515086","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Why science needs art. 为什么科学需要艺术。
IF 3.9 Q2 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2025-10-24 eCollection Date: 2025-01-01 DOI: 10.3389/fbinf.2025.1708311
Giulia Ghisleni, Christian Stolte, Megan Gozzard, Lea Von Soosten, Antonia Bruno

This perspective paper examines the profound cognitive and methodological parallels between scientific and artistic research, challenging the traditional distinction between the two domains. While science and art use different languages, both emerge from the human drive for creativity and understanding. We argue that scientific inquiry, often presented as strictly objective and methodical, inherently shares with art the need for imagination, flexibility, and interpretative thinking. Drawing on neuroscience, education, design theory, and the visual arts, we highlight how artistic practices, particularly in the visual arts, can enhance scientific learning, innovation, and public engagement. We advocate integrating art into scientific training and research to foster a more creative and inclusive epistemology. Through examples in microbiology, education, and data visualization, we show how the arts can support deeper understanding, cross-disciplinary collaboration, and more effective science communication. Ultimately, we call for a shift toward a more integrated approach that embraces the complementary strengths of both art and science in advancing knowledge and societal impact.

这篇透视论文探讨了科学和艺术研究之间深刻的认知和方法上的相似之处,挑战了这两个领域之间的传统区别。虽然科学和艺术使用不同的语言,但它们都源于人类的创造力和理解力。我们认为,科学探究通常被认为是严格客观和有条理的,它本质上与艺术一样需要想象力、灵活性和解释性思维。利用神经科学、教育、设计理论和视觉艺术,我们强调艺术实践,特别是视觉艺术,如何促进科学学习、创新和公众参与。我们提倡将艺术融入科学训练和研究中,以培养更具创造性和包容性的认识论。通过微生物学、教育和数据可视化的例子,我们展示了艺术如何支持更深层次的理解、跨学科合作和更有效的科学交流。最终,我们呼吁转向一种更加综合的方法,在推进知识和社会影响方面拥抱艺术和科学的互补优势。
{"title":"Why science needs art.","authors":"Giulia Ghisleni, Christian Stolte, Megan Gozzard, Lea Von Soosten, Antonia Bruno","doi":"10.3389/fbinf.2025.1708311","DOIUrl":"10.3389/fbinf.2025.1708311","url":null,"abstract":"<p><p>This perspective paper examines the profound cognitive and methodological parallels between scientific and artistic research, challenging the traditional distinction between the two domains. While science and art use different languages, both emerge from the human drive for creativity and understanding. We argue that scientific inquiry, often presented as strictly objective and methodical, inherently shares with art the need for imagination, flexibility, and interpretative thinking. Drawing on neuroscience, education, design theory, and the visual arts, we highlight how artistic practices, particularly in the visual arts, can enhance scientific learning, innovation, and public engagement. We advocate integrating art into scientific training and research to foster a more creative and inclusive epistemology. Through examples in microbiology, education, and data visualization, we show how the arts can support deeper understanding, cross-disciplinary collaboration, and more effective science communication. Ultimately, we call for a shift toward a more integrated approach that embraces the complementary strengths of both art and science in advancing knowledge and societal impact.</p>","PeriodicalId":73066,"journal":{"name":"Frontiers in bioinformatics","volume":"5 ","pages":"1708311"},"PeriodicalIF":3.9,"publicationDate":"2025-10-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12592062/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145483967","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Unraveling the molecular basis of snake venom nerve growth factor: human TrkA recognition through molecular dynamics simulation and comparison with human nerve growth factor. 揭示蛇毒神经生长因子的分子基础:通过分子动力学模拟人类TrkA识别,并与人类神经生长因子进行比较。
IF 3.9 Q2 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2025-10-24 eCollection Date: 2025-01-01 DOI: 10.3389/fbinf.2025.1674791
Shrudhi Devi, Gurunathan Jayaraman

Introduction: Neurodegenerative diseases pose significant challenges owing to the limited number of effective therapies. Nerve growth factor (NGF) plays a crucial role in neuronal survival and differentiation through tropomyosin receptor kinase A (TrkA). Although snake venom NGF (sNGF) has been studied for its ability to activate TrkA, the binding modes and associated dynamics remain unclear compared to those of human NGF (hNGF). Herein, we explored the possibilities of NGFs from Daboia russelii and Naja naja as potential therapeutic alternatives to hNGF by comparing the structural similarities and conserved binding residues.

Methods: The active sites were identified through a literature review, molecular docking was performed using HADDOCK, and molecular dynamics simulation was performed to analyse the stabilities of the complexes; then, PRODIGY and molecular mechanics Poisson-Boltzmann surface area were used to determine the binding affinities.

Results: The different sNGFs exhibited stronger binding affinities and stabilities than hNGF, while principal component analysis and the free energy landscape indicated constrained conformational flexibilities suggestive of an adaptive mechanism in sNGF for effective receptor engagement. A network coevolutionary analysis was performed, which showed the pattern in which the amino acids were coevolved and conserved throughout the simulations.

Discussion: These findings indicate that NGFs from D. russelii and N. naja are promising therapeutic candidates for treating neurodegenerative disorders and warrant further in vivo validation.

导言:神经退行性疾病由于有效的治疗方法数量有限而构成重大挑战。神经生长因子(NGF)通过原肌球蛋白受体激酶a (TrkA)在神经元存活和分化中起着至关重要的作用。虽然已经研究了蛇毒NGF (sNGF)激活TrkA的能力,但与人NGF (hNGF)相比,其结合模式和相关动力学尚不清楚。在此,我们通过比较结构相似性和保守的结合残基,探讨了来自达伯亚russelii和Naja Naja的ngf作为hNGF潜在治疗替代品的可能性。方法:通过文献查阅确定活性位点,利用HADDOCK进行分子对接,并进行分子动力学模拟分析配合物的稳定性;然后利用PRODIGY和分子力学泊松-玻尔兹曼表面积来确定结合亲和力。结果:不同的sNGF表现出比hNGF更强的结合亲和力和稳定性,而主成分分析和自由能图表明sNGF具有约束的构象灵活性,这表明sNGF具有有效结合受体的自适应机制。进行了网络共同进化分析,显示了氨基酸在整个模拟过程中共同进化和保守的模式。讨论:这些发现表明,来自russelii和nnaja的ngf是治疗神经退行性疾病的有希望的治疗候选者,值得进一步的体内验证。
{"title":"Unraveling the molecular basis of snake venom nerve growth factor: human TrkA recognition through molecular dynamics simulation and comparison with human nerve growth factor.","authors":"Shrudhi Devi, Gurunathan Jayaraman","doi":"10.3389/fbinf.2025.1674791","DOIUrl":"10.3389/fbinf.2025.1674791","url":null,"abstract":"<p><strong>Introduction: </strong>Neurodegenerative diseases pose significant challenges owing to the limited number of effective therapies. Nerve growth factor (NGF) plays a crucial role in neuronal survival and differentiation through tropomyosin receptor kinase A (TrkA). Although snake venom NGF (sNGF) has been studied for its ability to activate TrkA, the binding modes and associated dynamics remain unclear compared to those of human NGF (hNGF). Herein, we explored the possibilities of NGFs from <i>Daboia russelii</i> and <i>Naja naja</i> as potential therapeutic alternatives to hNGF by comparing the structural similarities and conserved binding residues.</p><p><strong>Methods: </strong>The active sites were identified through a literature review, molecular docking was performed using HADDOCK, and molecular dynamics simulation was performed to analyse the stabilities of the complexes; then, PRODIGY and molecular mechanics Poisson-Boltzmann surface area were used to determine the binding affinities.</p><p><strong>Results: </strong>The different sNGFs exhibited stronger binding affinities and stabilities than hNGF, while principal component analysis and the free energy landscape indicated constrained conformational flexibilities suggestive of an adaptive mechanism in sNGF for effective receptor engagement. A network coevolutionary analysis was performed, which showed the pattern in which the amino acids were coevolved and conserved throughout the simulations.</p><p><strong>Discussion: </strong>These findings indicate that NGFs from <i>D. russelii</i> and <i>N. naja</i> are promising therapeutic candidates for treating neurodegenerative disorders and warrant further <i>in vivo</i> validation.</p>","PeriodicalId":73066,"journal":{"name":"Frontiers in bioinformatics","volume":"5 ","pages":"1674791"},"PeriodicalIF":3.9,"publicationDate":"2025-10-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12592128/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145483953","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Drug repositioning pipeline integrating community analysis in drug-drug similarity networks and automated ATC community labeling to foster molecular docking analysis. 药物再定位管道整合了药物相似网络中的社区分析和自动ATC社区标记,促进分子对接分析。
IF 3.9 Q2 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2025-10-23 eCollection Date: 2025-01-01 DOI: 10.3389/fbinf.2025.1666716
Daiana Colibăşanu, Vlad Groza, Maria Antonietta Occhiuzzi, Fedora Grande, Mihai Udrescu, Lucreția Udrescu

Introduction: Drug repositioning-finding new therapeutic uses for existing drugs-can dramatically reduce development time and cost, but requires efficient computational frameworks to generate and validate repositioning hypotheses. Network-based methods can uncover drug communities with shared pharmacological properties, while molecular docking offers mechanistic insights by predicting drug-target binding.

Methods: We introduce an end-to-end, fully automated pipeline that (1) constructs a tripartite drug-gene-disease network from DrugBank and DisGeNET, (2) projects it into a drug-drug similarity network for community detection, (3) labels communities via Anatomical Therapeutic Chemical (ATC) codes to generate repositioning hints and identify relevant targets, (4) validates hints through automated literature searches, and (5) prioritizes candidates via targeted molecular docking.

Results: After filtering for connectivity and size, 12 robust communities emerged from the initial 34 clusters. The pipeline correctly matched 53.4% of drugs to their ATC level 1 community label via database entries; literature validation confirmed an additional 20.2%, yielding 73.6% overall accuracy. The remaining 26.4% of drugs were flagged as repositioning candidates. To illustrate the advantages of our pipeline, molecular docking studies of chloramphenicol demonstrated stable binding and interaction profiles similar to those of known inhibitors, reinforcing its potential as an anticancer agent.

Conclusion: Our integrated pipeline effectively integrates network-based community analysis and automated ATC labeling with literature and docking analysis, narrowing the search space for in silico and experimental follow-up. The chloramphenicol example illustrates its utility for uncovering non-obvious repositioning opportunities. Future work will extend similarity definitions (e.g., to higher-order network motifs) and incorporate wet-lab validation of top candidates.

药物重新定位-为现有药物寻找新的治疗用途-可以显着减少开发时间和成本,但需要有效的计算框架来生成和验证重新定位假设。基于网络的方法可以发现具有共同药理特性的药物群落,而分子对接通过预测药物靶标结合提供了机制见解。方法:我们引入了一个端到端的全自动管道,该管道(1)从DrugBank和DisGeNET构建一个药物-基因-疾病的三要素网络,(2)将其投影到药物-药物相似网络中用于社区检测,(3)通过解剖治疗化学(ATC)代码标记社区以生成重新定位提示并识别相关靶点,(4)通过自动文献检索验证提示,(5)通过靶向分子对接确定候选对象的优先级。结果:在对连通性和规模进行筛选后,从最初的34个集群中产生了12个强大的社区。该管道通过数据库条目将53.4%的药物与ATC 1级社区标签正确匹配;文献验证证实了额外的20.2%,总体准确率为73.6%。其余26.4%的药物被标记为重新定位候选药物。为了说明我们的产品线的优势,氯霉素的分子对接研究显示出与已知抑制剂相似的稳定结合和相互作用特征,加强了其作为抗癌剂的潜力。结论:我们的集成管道有效地将基于网络的社区分析和自动ATC标记与文献和对接分析相结合,缩小了计算机和实验随访的搜索空间。氯霉素的例子说明了它在发现非明显的重新定位机会方面的效用。未来的工作将扩展相似性定义(例如,到高阶网络基序),并纳入顶级候选的湿实验室验证。
{"title":"Drug repositioning pipeline integrating community analysis in drug-drug similarity networks and automated ATC community labeling to foster molecular docking analysis.","authors":"Daiana Colibăşanu, Vlad Groza, Maria Antonietta Occhiuzzi, Fedora Grande, Mihai Udrescu, Lucreția Udrescu","doi":"10.3389/fbinf.2025.1666716","DOIUrl":"10.3389/fbinf.2025.1666716","url":null,"abstract":"<p><strong>Introduction: </strong>Drug repositioning-finding new therapeutic uses for existing drugs-can dramatically reduce development time and cost, but requires efficient computational frameworks to generate and validate repositioning hypotheses. Network-based methods can uncover drug communities with shared pharmacological properties, while molecular docking offers mechanistic insights by predicting drug-target binding.</p><p><strong>Methods: </strong>We introduce an end-to-end, fully automated pipeline that (1) constructs a tripartite drug-gene-disease network from DrugBank and DisGeNET, (2) projects it into a drug-drug similarity network for community detection, (3) labels communities <i>via</i> Anatomical Therapeutic Chemical (ATC) codes to generate repositioning hints and identify relevant targets, (4) validates hints through automated literature searches, and (5) prioritizes candidates <i>via</i> targeted molecular docking.</p><p><strong>Results: </strong>After filtering for connectivity and size, 12 robust communities emerged from the initial 34 clusters. The pipeline correctly matched 53.4% of drugs to their ATC level 1 community label <i>via</i> database entries; literature validation confirmed an additional 20.2%, yielding 73.6% overall accuracy. The remaining 26.4% of drugs were flagged as repositioning candidates. To illustrate the advantages of our pipeline, molecular docking studies of chloramphenicol demonstrated stable binding and interaction profiles similar to those of known inhibitors, reinforcing its potential as an anticancer agent.</p><p><strong>Conclusion: </strong>Our integrated pipeline effectively integrates network-based community analysis and automated ATC labeling with literature and docking analysis, narrowing the search space for <i>in silico</i> and experimental follow-up. The chloramphenicol example illustrates its utility for uncovering non-obvious repositioning opportunities. Future work will extend similarity definitions (e.g., to higher-order network motifs) and incorporate wet-lab validation of top candidates.</p>","PeriodicalId":73066,"journal":{"name":"Frontiers in bioinformatics","volume":"5 ","pages":"1666716"},"PeriodicalIF":3.9,"publicationDate":"2025-10-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12589059/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145483891","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Inferred global dense residue transition graphs from primary structure sequences enable protein interaction prediction via directed graph convolutional neural networks. 从一级结构序列中推断出的全局密集残基转移图可以通过有向图卷积神经网络进行蛋白质相互作用预测。
IF 3.9 Q2 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2025-10-22 eCollection Date: 2025-01-01 DOI: 10.3389/fbinf.2025.1651623
Islam Akef Ebeid, Haoteng Tang, Pengfei Gu

Introduction: Accurate prediction of protein-protein interactions (PPIs) is crucial for understanding cellular functions and advancing the development of drugs. While existing in-silico methods leverage direct sequence embeddings from Protein Language Models (PLMs) or apply Graph Neural Networks (GNNs) to 3D protein structures, the main focus of this study is to investigate less computationally intensive alternatives. This work introduces a novel framework for the downstream task of PPI prediction via link prediction.

Methods: We introduce a two-stage graph representation learning framework, ProtGram-DirectGCN. First, we developed ProtGram, a novel approach that models a protein's primary structure as a hierarchy of globally inferred n-gram graphs. In these graphs, residue transition probabilities, aggregated from a large sequence corpus, define the edge weights of a directed graph of paired residues. Second, we propose a custom directed graph convolutional neural network, DirectGCN, which features a unique convolutional layer that processes information through separate path-specific (incoming, outgoing, undirected) and shared transformations, combined via a learnable gating mechanism. DirectGCN is applied to the ProtGram graphs to learn residue-level embeddings, which are then pooled via an attention mechanism to generate protein-level embeddings for the prediction task.

Results: The efficacy of the DirectGCN model was first established on standard node classification benchmarks, where its performance is comparable to that of established methods on general datasets, while demonstrating specialization for complex, directed, and dense heterophilic graph structures. When applied to PPI prediction, the full ProtGram-DirectGCN framework achieves robust predictive power despite being trained on limited data.

Discussion: Our results suggest that a globally inferred, directed graph-based representation of sequence transitions offers a potent and computationally distinct alternative to resource-intensive PLMs for the task of PPI prediction. Future work will involve testing ProtGram-DirectGCN on a wider range of bioinformatics tasks.

蛋白-蛋白相互作用(PPIs)的准确预测对于理解细胞功能和推进药物开发至关重要。虽然现有的计算机方法利用蛋白质语言模型(PLMs)的直接序列嵌入或将图神经网络(gnn)应用于3D蛋白质结构,但本研究的主要重点是研究计算强度较低的替代方法。这项工作引入了一种新的框架,用于通过链路预测进行PPI预测的下游任务。方法:我们引入了一个两阶段图表示学习框架program - directgcn。首先,我们开发了ProtGram,这是一种新颖的方法,将蛋白质的初级结构建模为全局推断的n图层次结构。在这些图中,从一个大的序列语料库中聚集的残数转移概率定义了成对残数有向图的边权。其次,我们提出了一个自定义的有向图卷积神经网络DirectGCN,它具有独特的卷积层,通过单独的路径特定(传入,传出,无向)和共享转换处理信息,并通过可学习的门控机制组合在一起。将DirectGCN应用于program图来学习残差级嵌入,然后通过注意机制将残差级嵌入集合起来,生成用于预测任务的蛋白级嵌入。结果:DirectGCN模型的有效性首先是在标准节点分类基准上建立的,其性能与在一般数据集上建立的方法相当,同时显示出对复杂、有向和密集的异亲图结构的专门化。当应用于PPI预测时,尽管在有限的数据上进行了训练,但完整的program - directgcn框架仍具有强大的预测能力。讨论:我们的研究结果表明,序列转换的全局推断、有向图表示为资源密集型PLMs的PPI预测任务提供了一个强大的、计算上独特的替代方案。未来的工作将包括在更广泛的生物信息学任务中测试program - directgcn。
{"title":"Inferred global dense residue transition graphs from primary structure sequences enable protein interaction prediction via directed graph convolutional neural networks.","authors":"Islam Akef Ebeid, Haoteng Tang, Pengfei Gu","doi":"10.3389/fbinf.2025.1651623","DOIUrl":"10.3389/fbinf.2025.1651623","url":null,"abstract":"<p><strong>Introduction: </strong>Accurate prediction of protein-protein interactions (PPIs) is crucial for understanding cellular functions and advancing the development of drugs. While existing <i>in-silico</i> methods leverage direct sequence embeddings from Protein Language Models (PLMs) or apply Graph Neural Networks (GNNs) to 3D protein structures, the main focus of this study is to investigate less computationally intensive alternatives. This work introduces a novel framework for the downstream task of PPI prediction via link prediction.</p><p><strong>Methods: </strong>We introduce a two-stage graph representation learning framework, <i>ProtGram-DirectGCN</i>. First, we developed <i>ProtGram</i>, a novel approach that models a protein's primary structure as a hierarchy of globally inferred n-gram graphs. In these graphs, residue transition probabilities, aggregated from a large sequence corpus, define the edge weights of a directed graph of paired residues. Second, we propose a custom directed graph convolutional neural network, <i>DirectGCN</i>, which features a unique convolutional layer that processes information through separate path-specific (incoming, outgoing, undirected) and shared transformations, combined via a learnable gating mechanism. <i>DirectGCN</i> is applied to the <i>ProtGram</i> graphs to learn residue-level embeddings, which are then pooled via an attention mechanism to generate protein-level embeddings for the prediction task.</p><p><strong>Results: </strong>The efficacy of the <i>DirectGCN</i> model was first established on standard node classification benchmarks, where its performance is comparable to that of established methods on general datasets, while demonstrating specialization for complex, directed, and dense heterophilic graph structures. When applied to PPI prediction, the full <i>ProtGram-DirectGCN</i> framework achieves robust predictive power despite being trained on limited data.</p><p><strong>Discussion: </strong>Our results suggest that a globally inferred, directed graph-based representation of sequence transitions offers a potent and computationally distinct alternative to resource-intensive PLMs for the task of PPI prediction. Future work will involve testing <i>ProtGram-DirectGCN</i> on a wider range of bioinformatics tasks.</p>","PeriodicalId":73066,"journal":{"name":"Frontiers in bioinformatics","volume":"5 ","pages":"1651623"},"PeriodicalIF":3.9,"publicationDate":"2025-10-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12585958/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145460749","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Enhancing drug-target interaction prediction with graph representation learning and knowledge-based regularization. 用图表示学习和基于知识的正则化增强药物-靶标相互作用预测。
IF 3.9 Q2 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2025-10-21 eCollection Date: 2025-01-01 DOI: 10.3389/fbinf.2025.1649337
Qihuan Yao, Zhen Chen, Ye Cao, Huijing Hu

Introduction: Accurately predicting drug-target interactions (DTIs) is crucial for accelerating drug discovery and repurposing. Despite recent advances in deep learning-based methods, challenges remain in effectively capturing the complex relationships between drugs and targets while incorporating prior biological knowledge.

Methods: We introduce a novel framework that combines graph neural networks with knowledge integration for DTI prediction. Our approach learns representations from molecular structures and protein sequences through a customized graph-based message passing scheme. We integrate domain knowledge from biomedical ontologies and databases using a knowledge-based regularization strategy to infuse biological context into the learned representations.

Results: We evaluated our model on multiple benchmark datasets, achieving an average AUC of 0.98 and an average AUPR of 0.89, surpassing existing state-of-the-art methods by a considerable margin. Visualization of learned attention weights identified salient molecular substructures and protein motifs driving the predicted interactions, demonstrating model interpretability.

Discussion: We validated the practical utility by predicting novel DTIs for FDA-approved drugs and experimentally confirming a high proportion of predictions. Our framework offers a powerful and interpretable solution for DTI prediction with the potential to substantially accelerate the identification of new drug candidates and therapeutic targets.

准确预测药物-靶标相互作用(DTIs)是加速药物发现和再利用的关键。尽管基于深度学习的方法最近取得了进展,但在有效捕获药物和靶标之间的复杂关系同时结合先前的生物学知识方面仍然存在挑战。方法:提出了一种将图神经网络与知识集成相结合的DTI预测框架。我们的方法通过定制的基于图的消息传递方案从分子结构和蛋白质序列中学习表征。我们使用基于知识的正则化策略将生物医学本体和数据库中的领域知识集成到学习表征中。结果:我们在多个基准数据集上评估了我们的模型,平均AUC为0.98,平均AUPR为0.89,大大超过了现有的最先进的方法。学习到的注意权重的可视化识别了驱动预测相互作用的显著分子亚结构和蛋白质基序,证明了模型的可解释性。讨论:我们通过预测fda批准的药物的新型dti来验证实际效用,并通过实验证实了高比例的预测。我们的框架为DTI预测提供了一个强大且可解释的解决方案,有可能大大加速新的候选药物和治疗靶点的确定。
{"title":"Enhancing drug-target interaction prediction with graph representation learning and knowledge-based regularization.","authors":"Qihuan Yao, Zhen Chen, Ye Cao, Huijing Hu","doi":"10.3389/fbinf.2025.1649337","DOIUrl":"10.3389/fbinf.2025.1649337","url":null,"abstract":"<p><strong>Introduction: </strong>Accurately predicting drug-target interactions (DTIs) is crucial for accelerating drug discovery and repurposing. Despite recent advances in deep learning-based methods, challenges remain in effectively capturing the complex relationships between drugs and targets while incorporating prior biological knowledge.</p><p><strong>Methods: </strong>We introduce a novel framework that combines graph neural networks with knowledge integration for DTI prediction. Our approach learns representations from molecular structures and protein sequences through a customized graph-based message passing scheme. We integrate domain knowledge from biomedical ontologies and databases using a knowledge-based regularization strategy to infuse biological context into the learned representations.</p><p><strong>Results: </strong>We evaluated our model on multiple benchmark datasets, achieving an average AUC of 0.98 and an average AUPR of 0.89, surpassing existing state-of-the-art methods by a considerable margin. Visualization of learned attention weights identified salient molecular substructures and protein motifs driving the predicted interactions, demonstrating model interpretability.</p><p><strong>Discussion: </strong>We validated the practical utility by predicting novel DTIs for FDA-approved drugs and experimentally confirming a high proportion of predictions. Our framework offers a powerful and interpretable solution for DTI prediction with the potential to substantially accelerate the identification of new drug candidates and therapeutic targets.</p>","PeriodicalId":73066,"journal":{"name":"Frontiers in bioinformatics","volume":"5 ","pages":"1649337"},"PeriodicalIF":3.9,"publicationDate":"2025-10-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12583218/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145454181","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Fish isoallergens and variants: database compilation, in silico allergenicity prediction challenges, and epitope-based threshold optimization. 鱼类等过敏原和变异体:数据库编译,在硅过敏原预测挑战,和基于表位的阈值优化。
IF 3.9 Q2 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2025-10-20 eCollection Date: 2025-01-01 DOI: 10.3389/fbinf.2025.1669237
Vachiranee Limviphuvadh, Thimo Ruethers, Minh N Nguyen, Dean R Jerry, Benjamin P C Smith, Yulan Wang, Yansong Miao, Anand Kumar Andiappan, Andreas L Lopata, Sebastian Maurer-Stroh

Introduction: Fish is a major food allergy trigger with a complex variety of allergenic protein isoforms and vast species diversity exhibiting variable allergenicity. This is the first study to systematically compile fish isoallergen and variant entries associated with ingestion-related allergic reactions.

Methods: Entries were compiled from four major allergen databases: World Health Organization and International Union of Immunological Societies (WHO/IUIS), AllergenOnline, Comprehensive Protein Allergen Resource (COMPARE), and Allergome, including evidence from in vitro IgE-binding assays and complete amino acid sequences. Challenges in predicting the allergenicity of fish isoallergens and variants were evaluated, and the sensitivity of five widely used in silico tools (AllerCatPro 2.0, AlgPred 2.0, pLM4Alg, AllergenFP v.1.0, and AllerTop v.2.0) was assessed. Epitope mapping and phylogenetic analyses were performed for the major fish allergen parvalbumin, incorporating experimentally validated B-cell epitope data from the Immune Epitope Database (IEDB) and evolutionary relationships.

Results: A comprehensive dataset of 79 unique fish isoallergen and variant entries from 34 fish species was identified, with 25 entries common across all four databases. AllerCatPro 2.0 achieved the highest sensitivity (97.5%). A phylogenetic tree was constructed, integrating epitope data to optimize protein family-specific thresholds for differentiating allergenic from less/non-allergenic parvalbumins. A threshold of ≥4 IEDB-mapped epitopes allowing up to two mismatches captured 52 out of 54 parvalbumin sequences (96%) in the dataset, effectively distinguishing between parvalbumin classes.

Discussion: This study enhances understanding of fish allergy by systematically compiling fish isoallergens and variants and integrating B-cell epitope data. The optimized thresholds improve the performance of allergenicity prediction tools and can be applied to other protein families in future studies.

鱼类是一种主要的食物过敏触发器,具有多种复杂的致敏蛋白异构体和巨大的物种多样性,表现出不同的致敏性。这是第一个系统地汇编与摄入相关过敏反应相关的鱼类等过敏原和变异条目的研究。方法:从世界卫生组织和国际免疫学会联合会(WHO/IUIS)、AllergenOnline、综合蛋白过敏原资源(COMPARE)和Allergome四个主要过敏原数据库中收集条目,包括体外ige结合试验和完整氨基酸序列的证据。评估了预测鱼类等过敏原和变异体致敏性的挑战,并评估了五种广泛使用的硅工具(AllerCatPro 2.0、AlgPred 2.0、pLM4Alg、AllergenFP v.1.0和AllerTop v.2.0)的敏感性。结合免疫表位数据库(IEDB)中实验验证的b细胞表位数据和进化关系,对主要的鱼类过敏原小白蛋白进行了表位定位和系统发育分析。结果:鉴定了来自34种鱼类的79个独特的鱼类等过敏原和变体条目的综合数据集,其中25个条目在所有四个数据库中都是通用的。AllerCatPro 2.0的灵敏度最高(97.5%)。构建了一个系统发育树,整合表位数据来优化蛋白家族特异性阈值,以区分过敏性和非过敏性小白蛋白。≥4个iedb映射表位的阈值允许最多两个错配,捕获了数据集中54个细小蛋白序列中的52个(96%),有效区分了细小蛋白类别。讨论:本研究通过系统地汇编鱼类等过敏原和变异体以及整合b细胞表位数据,增强了对鱼类过敏的理解。优化后的阈值提高了过敏原预测工具的性能,可以在未来的研究中应用于其他蛋白质家族。
{"title":"Fish isoallergens and variants: database compilation, <i>in silico</i> allergenicity prediction challenges, and epitope-based threshold optimization.","authors":"Vachiranee Limviphuvadh, Thimo Ruethers, Minh N Nguyen, Dean R Jerry, Benjamin P C Smith, Yulan Wang, Yansong Miao, Anand Kumar Andiappan, Andreas L Lopata, Sebastian Maurer-Stroh","doi":"10.3389/fbinf.2025.1669237","DOIUrl":"10.3389/fbinf.2025.1669237","url":null,"abstract":"<p><strong>Introduction: </strong>Fish is a major food allergy trigger with a complex variety of allergenic protein isoforms and vast species diversity exhibiting variable allergenicity. This is the first study to systematically compile fish isoallergen and variant entries associated with ingestion-related allergic reactions.</p><p><strong>Methods: </strong>Entries were compiled from four major allergen databases: World Health Organization and International Union of Immunological Societies (WHO/IUIS), AllergenOnline, Comprehensive Protein Allergen Resource (COMPARE), and Allergome, including evidence from <i>in vitro</i> IgE-binding assays and complete amino acid sequences. Challenges in predicting the allergenicity of fish isoallergens and variants were evaluated, and the sensitivity of five widely used <i>in silico</i> tools (AllerCatPro 2.0, AlgPred 2.0, pLM4Alg, AllergenFP v.1.0, and AllerTop v.2.0) was assessed. Epitope mapping and phylogenetic analyses were performed for the major fish allergen parvalbumin, incorporating experimentally validated B-cell epitope data from the Immune Epitope Database (IEDB) and evolutionary relationships.</p><p><strong>Results: </strong>A comprehensive dataset of 79 unique fish isoallergen and variant entries from 34 fish species was identified, with 25 entries common across all four databases. AllerCatPro 2.0 achieved the highest sensitivity (97.5%). A phylogenetic tree was constructed, integrating epitope data to optimize protein family-specific thresholds for differentiating allergenic from less/non-allergenic parvalbumins. A threshold of ≥4 IEDB-mapped epitopes allowing up to two mismatches captured 52 out of 54 parvalbumin sequences (96%) in the dataset, effectively distinguishing between parvalbumin classes.</p><p><strong>Discussion: </strong>This study enhances understanding of fish allergy by systematically compiling fish isoallergens and variants and integrating B-cell epitope data. The optimized thresholds improve the performance of allergenicity prediction tools and can be applied to other protein families in future studies.</p>","PeriodicalId":73066,"journal":{"name":"Frontiers in bioinformatics","volume":"5 ","pages":"1669237"},"PeriodicalIF":3.9,"publicationDate":"2025-10-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12580176/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145446599","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Frontiers in bioinformatics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1