Journal of Integrative Bioinformatics最新文献_第3页

Immunoinformatics-guided design of a multiepitope peptide vaccine targeting the receptor-binding domain of SARS-CoV-2 spike glycoprotein: insights from Indonesian samples. 免疫信息学指导下针对SARS-CoV-2刺突糖蛋白受体结合域的多表位肽疫苗设计：来自印度尼西亚样本的见解

IF 1.5 Q3 MATHEMATICAL & COMPUTATIONAL BIOLOGY

Journal of Integrative Bioinformatics

Pub Date : 2025-01-14 DOI: 10.1515/jib-2024-0025

Irvan Faizal, Darrian Chandra, Tarwadi, Sabar Pambudi, Astutiati Nurhasanah, Rizky Priambodo, Muhammad Yusuf

The emergence of new variants of SARS-CoV-2, including Alpha, Beta, Gamma, Delta, Omicron variants, and XBB sub-variants, contributes to the number of coronavirus cases worldwide. SARS-CoV-2 is a positive RNA virus with a genome of 29.9 kb that encodes four structural proteins: spike glycoprotein (S), envelope glycoprotein (E), membrane glycoprotein (M), and nucleocapsid glycoprotein (N). These proteins are vital for viral activity, with the S protein facilitating attachment and membrane fusion through the receptor-binding domain (RBD) located in the S1 subunit. The RBD recognizes and binds to the human angiotensin-converting enzyme 2 (ACE-2) protein. An immunoinformatic-aided design of a peptide-based multiepitope vaccine candidate targeting the RBD glycoprotein is constructed from the SARS-CoV-2 sequence data base from various regions of Indonesia (Jakarta, West Java, and Bali). The results show that the RBD region of with accession ID EPI_ISL_15982641 from West Java had the highest antigenicity of 0.5904. This isolate is non-toxic and non-allergenic and shows a total of 18 LBL epitopes, 72 CTL epitopes, and 98 HTL epitopes. The epitope that has the best overall binding affinity was GCHNKCAY for MHC-I and GGCVFSYVGCHNKCAYWV for MHC-II which show a binding affinity of -13.6 and -15.5 (kcal/mol), respectively. Therefore, this study aims to design an epitope vaccine candidate based on samples from Indonesia that has good characteristics and may have the potential to stimulate an immune response against SARS-CoV-2.

SARS-CoV-2新变体的出现，包括Alpha、Beta、Gamma、Delta、Omicron变体和XBB亚变体，导致全球冠状病毒病例数量增加。SARS-CoV-2是一种RNA阳性病毒，基因组为29.9 kb，编码4种结构蛋白：刺突糖蛋白(S)、包膜糖蛋白(E)、膜糖蛋白(M)和核衣壳糖蛋白(N)。这些蛋白对病毒活性至关重要，其中S蛋白通过位于S1亚基的受体结合结构域（RBD）促进附着和膜融合。RBD识别并结合人血管紧张素转换酶2 （ACE-2）蛋白。利用来自印度尼西亚不同地区（雅加达、西爪哇和巴厘岛）的SARS-CoV-2序列数据库，构建了一种靶向RBD糖蛋白的肽基多表位候选疫苗的免疫信息学辅助设计。结果表明，来自西爪哇的RBD区编码为EPI_ISL_15982641，抗原性最高，为0.5904；该分离物无毒且无致敏性，共显示18个LBL表位，72个CTL表位和98个HTL表位。对MHC-I和MHC-II具有最佳结合亲和力的表位分别为GCHNKCAY和GGCVFSYVGCHNKCAYWV，两者的结合亲和力分别为-13.6和-15.5 kcal/mol。因此，本研究旨在基于来自印度尼西亚的样品设计一种具有良好特征的候选表位疫苗，该疫苗可能具有激发针对SARS-CoV-2的免疫应答的潜力。

{"title":"Immunoinformatics-guided design of a multiepitope peptide vaccine targeting the receptor-binding domain of SARS-CoV-2 spike glycoprotein: insights from Indonesian samples.","authors":"Irvan Faizal, Darrian Chandra, Tarwadi, Sabar Pambudi, Astutiati Nurhasanah, Rizky Priambodo, Muhammad Yusuf","doi":"10.1515/jib-2024-0025","DOIUrl":"https://doi.org/10.1515/jib-2024-0025","url":null,"abstract":"The emergence of new variants of SARS-CoV-2, including Alpha, Beta, Gamma, Delta, Omicron variants, and XBB sub-variants, contributes to the number of coronavirus cases worldwide. SARS-CoV-2 is a positive RNA virus with a genome of 29.9 kb that encodes four structural proteins: spike glycoprotein (S), envelope glycoprotein (E), membrane glycoprotein (M), and nucleocapsid glycoprotein (N). These proteins are vital for viral activity, with the S protein facilitating attachment and membrane fusion through the receptor-binding domain (RBD) located in the S1 subunit. The RBD recognizes and binds to the human angiotensin-converting enzyme 2 (ACE-2) protein. An immunoinformatic-aided design of a peptide-based multiepitope vaccine candidate targeting the RBD glycoprotein is constructed from the SARS-CoV-2 sequence data base from various regions of Indonesia (Jakarta, West Java, and Bali). The results show that the RBD region of with accession ID EPI_ISL_15982641 from West Java had the highest antigenicity of 0.5904. This isolate is non-toxic and non-allergenic and shows a total of 18 LBL epitopes, 72 CTL epitopes, and 98 HTL epitopes. The epitope that has the best overall binding affinity was GCHNKCAY for MHC-I and GGCVFSYVGCHNKCAYWV for MHC-II which show a binding affinity of -13.6 and -15.5 (kcal/mol), respectively. Therefore, this study aims to design an epitope vaccine candidate based on samples from Indonesia that has good characteristics and may have the potential to stimulate an immune response against SARS-CoV-2.","PeriodicalId":53625,"journal":{"name":"Journal of Integrative Bioinformatics","volume":" ","pages":""},"PeriodicalIF":1.5,"publicationDate":"2025-01-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142973272","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

MiRNA target enrichment analysis of co-expression network modules reveals important miRNAs and their roles in breast cancer progression. 共表达网络模块的MiRNA靶富集分析揭示了重要的MiRNA及其在乳腺癌进展中的作用。

IF 1.5 Q3 MATHEMATICAL & COMPUTATIONAL BIOLOGY

Journal of Integrative Bioinformatics

Pub Date : 2024-12-25 eCollection Date: 2024-12-01 DOI: 10.1515/jib-2022-0036

Mohammad Javad Bazyari, Seyed Hamid Aghaee-Bakhtiari

Breast cancer has the highest incidence and is the fifth cause of death in cancers. Progression is one of the important features of breast cancer which makes it a life-threatening cancer. MicroRNAs are small RNA molecules that have pivotal roles in the regulation of gene expression and they control different properties in breast cancer such as progression. Recently, systems biology offers novel approaches to study complicated biological systems like miRNAs to find their regulatory roles. One of these approaches is analysis of weighted co-expression network in which genes with similar expression patterns are considered as a single module. Because the genes in one module have similar expression, it is rational to think the same regulatory elements such as miRNAs control their expression. Herein, we use WGCNA to find important modules related to breast cancer progression and use hypergeometric test to perform miRNA target enrichment analysis and find important miRNAs. Also, we use negative correlation between miRNA expression and modules as the second filter to ensure choosing the right candidate miRNAs regarding to important modules. We found hsa-mir-23b, hsa-let-7b and hsa-mir-30a are important miRNAs in breast cancer and also investigated their roles in breast cancer progression.

乳腺癌发病率最高，是癌症的第五大死因。进展是乳腺癌的重要特征之一，使其成为一种危及生命的癌症。microrna是一种小的RNA分子，在调节基因表达方面起着关键作用，它们控制着乳腺癌的不同特性，比如进展。近年来，系统生物学提供了新的方法来研究复杂的生物系统，如mirna，以发现它们的调控作用。其中一种方法是分析加权共表达网络，其中具有相似表达模式的基因被视为单个模块。由于一个模块中的基因具有相似的表达，因此认为相同的调控元件（如miRNAs）控制它们的表达是合理的。本文利用WGCNA寻找与乳腺癌进展相关的重要模块，并利用hypergeometric test进行miRNA靶富集分析，发现重要的miRNA。此外，我们使用miRNA表达与模块之间的负相关作为第二过滤器，以确保选择正确的重要模块候选miRNA。我们发现hsa-mir-23b、hsa-let-7b和hsa-mir-30a是乳腺癌中重要的mirna，并研究了它们在乳腺癌进展中的作用。

{"title":"MiRNA target enrichment analysis of co-expression network modules reveals important miRNAs and their roles in breast cancer progression.","authors":"Mohammad Javad Bazyari, Seyed Hamid Aghaee-Bakhtiari","doi":"10.1515/jib-2022-0036","DOIUrl":"10.1515/jib-2022-0036","url":null,"abstract":"Breast cancer has the highest incidence and is the fifth cause of death in cancers. Progression is one of the important features of breast cancer which makes it a life-threatening cancer. MicroRNAs are small RNA molecules that have pivotal roles in the regulation of gene expression and they control different properties in breast cancer such as progression. Recently, systems biology offers novel approaches to study complicated biological systems like miRNAs to find their regulatory roles. One of these approaches is analysis of weighted co-expression network in which genes with similar expression patterns are considered as a single module. Because the genes in one module have similar expression, it is rational to think the same regulatory elements such as miRNAs control their expression. Herein, we use WGCNA to find important modules related to breast cancer progression and use hypergeometric test to perform miRNA target enrichment analysis and find important miRNAs. Also, we use negative correlation between miRNA expression and modules as the second filter to ensure choosing the right candidate miRNAs regarding to important modules. We found hsa-mir-23b, hsa-let-7b and hsa-mir-30a are important miRNAs in breast cancer and also investigated their roles in breast cancer progression.","PeriodicalId":53625,"journal":{"name":"Journal of Integrative Bioinformatics","volume":" ","pages":""},"PeriodicalIF":1.5,"publicationDate":"2024-12-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11698623/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142883636","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Exploring the therapeutic potential of Asparagus africanus in polycystic ovarian syndrome: a computational analysis. 探索非洲芦笋治疗多囊卵巢综合征的潜力：计算分析。

IF 1.5 Q3 MATHEMATICAL & COMPUTATIONAL BIOLOGY

Journal of Integrative Bioinformatics

Pub Date : 2024-12-12 eCollection Date: 2024-12-01 DOI: 10.1515/jib-2024-0019

Sania Riaz, Fatima Haider, Rizwan- Ur-Rehman, Aqsa Zafar

PCOS is a multifaceted condition characterized by ovarian abnormalities, metabolic disorders, anovulation, and hormonal imbalances. In response to the growing demand for treatments with fewer side effects, the exploration of herbal-origin drugs has gained prominence. Asparagus africanus, a traditional medicinal plant that exhibits anti-inflammatory, antioxidant, and anti-androgenic properties may have a cure for PCOS. The plant has rich biochemical profile prompted its exploration as a potential source for drug development. The aim of this study is to investigate the potential therapeutic efficacy of A. africanus in the management of PCOS through molecular docking studies with Luteinizing Hormone Receptor and Follicle-Stimulating Hormone Receptor proteins. The identified compounds underwent molecular docking against key proteins associated with PCOS, namely Luteinizing Hormone Receptor and Follicle-Stimulating Hormone Receptor. The results underscored the lead compound's superiority, demonstrating favorable pharmacokinetics, ADME characteristics, and strong molecular binding without any observed toxicity in comparison to standard drug. This study, by leveraging natural compounds sourced from A. africanus, provides valuable insights and advances towards developing more effective and safer treatments for PCOS. The findings contribute to the evolving landscape of PCOS therapeutics, emphasizing the potential of herbal-origin drugs in mitigating the complexities of this syndrome.

多囊卵巢综合征是一种多方面的疾病，以卵巢异常、代谢紊乱、无排卵和激素失衡为特征。由于对副作用更少的治疗方法的需求日益增长，对草药药物的探索已经得到了重视。非洲芦笋是一种传统的药用植物，具有抗炎、抗氧化和抗雄激素的特性，可能对多囊卵巢综合征有治疗作用。该植物具有丰富的生物化学特征，促使其作为药物开发的潜在来源进行探索。本研究旨在通过与促黄体生成素受体和促卵泡激素受体蛋白的分子对接研究，探讨非洲麻对PCOS的潜在治疗作用。鉴定的化合物与PCOS相关的关键蛋白，即促黄体生成素受体和促卵泡激素受体进行了分子对接。结果强调了先导化合物的优越性，与标准药物相比，显示出良好的药代动力学、ADME特性和强分子结合，没有观察到任何毒性。本研究利用来自非洲古树的天然化合物，为开发更有效、更安全的PCOS治疗方法提供了有价值的见解和进展。这些发现有助于多囊卵巢综合征治疗方法的发展，强调草药药物在减轻该综合征复杂性方面的潜力。

{"title":"Exploring the therapeutic potential of Asparagus africanus in polycystic ovarian syndrome: a computational analysis.","authors":"Sania Riaz, Fatima Haider, Rizwan- Ur-Rehman, Aqsa Zafar","doi":"10.1515/jib-2024-0019","DOIUrl":"10.1515/jib-2024-0019","url":null,"abstract":"PCOS is a multifaceted condition characterized by ovarian abnormalities, metabolic disorders, anovulation, and hormonal imbalances. In response to the growing demand for treatments with fewer side effects, the exploration of herbal-origin drugs has gained prominence. Asparagus africanus, a traditional medicinal plant that exhibits anti-inflammatory, antioxidant, and anti-androgenic properties may have a cure for PCOS. The plant has rich biochemical profile prompted its exploration as a potential source for drug development. The aim of this study is to investigate the potential therapeutic efficacy of A. africanus in the management of PCOS through molecular docking studies with Luteinizing Hormone Receptor and Follicle-Stimulating Hormone Receptor proteins. The identified compounds underwent molecular docking against key proteins associated with PCOS, namely Luteinizing Hormone Receptor and Follicle-Stimulating Hormone Receptor. The results underscored the lead compound's superiority, demonstrating favorable pharmacokinetics, ADME characteristics, and strong molecular binding without any observed toxicity in comparison to standard drug. This study, by leveraging natural compounds sourced from A. africanus, provides valuable insights and advances towards developing more effective and safer treatments for PCOS. The findings contribute to the evolving landscape of PCOS therapeutics, emphasizing the potential of herbal-origin drugs in mitigating the complexities of this syndrome.","PeriodicalId":53625,"journal":{"name":"Journal of Integrative Bioinformatics","volume":" ","pages":""},"PeriodicalIF":1.5,"publicationDate":"2024-12-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11698622/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142808595","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

TREMSUCS-TCGA - an integrated workflow for the identification of biomarkers for treatment success. TREMSUCS-TCGA -用于鉴定治疗成功的生物标志物的集成工作流程。

IF 1.5 Q3 MATHEMATICAL & COMPUTATIONAL BIOLOGY

Journal of Integrative Bioinformatics

Pub Date : 2024-12-11 eCollection Date: 2024-12-01 DOI: 10.1515/jib-2024-0031

Gabor Balogh, Natasha Jorge, Célia Dupain, Maud Kamal, Nicolas Servant, Christophe Le Tourneau, Peter F Stadler, Stephan H Bernhart

Many publicly available databases provide disease related data, that makes it possible to link genomic data to medical and meta-data. The cancer genome atlas (TCGA), for example, compiles tens of thousand of datasets covering a wide array of cancer types. Here we introduce an interactive and highly automatized TCGA-based workflow that links and analyses epigenomic and transcriptomic data with treatment and survival data in order to identify possible biomarkers that indicate treatment success. TREMSUCS-TCGA is flexible with respect to type of cancer and treatment and provides standard methods for differential expression analysis or DMR detection. Furthermore, it makes it possible to examine several cancer types together in a pan-cancer type approach. Parallelisation and reproducibility of all steps is ensured with the workflowmanagement system Snakemake. TREMSUCS-TCGA produces a comprehensive single report file which holds all relevant results in descriptive and tabular form that can be explored in an interactive manner. As a showcase application we describe a comprehensive analysis of the available data for the combination of patients with squamous cell carcinomas of head and neck, cervix and lung treated with cisplatin, carboplatin and the combination of carboplatin and paclitaxel. The best ranked biomarker candidates are discussed in the light of the existing literature, indicating plausible causal relationships to the relevant cancer entities.

许多可公开获得的数据库提供与疾病有关的数据，这使得将基因组数据与医疗和元数据联系起来成为可能。例如，癌症基因组图谱（TCGA）汇编了数以万计的数据集，涵盖了广泛的癌症类型。在这里，我们介绍了一个交互式和高度自动化的基于tcga的工作流程，将表观基因组和转录组数据与治疗和生存数据联系起来并进行分析，以确定可能指示治疗成功的生物标志物。TREMSUCS-TCGA在癌症类型和治疗方面具有灵活性，并为差异表达分析或DMR检测提供了标准方法。此外，它使得在泛癌症类型方法中一起检查几种癌症类型成为可能。所有步骤的并行性和可重复性都通过工作流管理系统Snakemake得到保证。TREMSUCS-TCGA生成一个全面的单一报告文件，其中以描述性和表格形式保存所有相关结果，可以以互动方式进行探索。作为一个示范应用，我们描述了对头颈部、宫颈和肺部鳞状细胞癌患者联合使用顺铂、卡铂以及卡铂和紫杉醇联合治疗的现有数据的综合分析。根据现有文献讨论了排名最高的生物标志物候选物，表明与相关癌症实体的合理因果关系。

{"title":"TREMSUCS-TCGA - an integrated workflow for the identification of biomarkers for treatment success.","authors":"Gabor Balogh, Natasha Jorge, Célia Dupain, Maud Kamal, Nicolas Servant, Christophe Le Tourneau, Peter F Stadler, Stephan H Bernhart","doi":"10.1515/jib-2024-0031","DOIUrl":"10.1515/jib-2024-0031","url":null,"abstract":"Many publicly available databases provide disease related data, that makes it possible to link genomic data to medical and meta-data. The cancer genome atlas (TCGA), for example, compiles tens of thousand of datasets covering a wide array of cancer types. Here we introduce an interactive and highly automatized TCGA-based workflow that links and analyses epigenomic and transcriptomic data with treatment and survival data in order to identify possible biomarkers that indicate treatment success. TREMSUCS-TCGA is flexible with respect to type of cancer and treatment and provides standard methods for differential expression analysis or DMR detection. Furthermore, it makes it possible to examine several cancer types together in a pan-cancer type approach. Parallelisation and reproducibility of all steps is ensured with the workflowmanagement system Snakemake. TREMSUCS-TCGA produces a comprehensive single report file which holds all relevant results in descriptive and tabular form that can be explored in an interactive manner. As a showcase application we describe a comprehensive analysis of the available data for the combination of patients with squamous cell carcinomas of head and neck, cervix and lung treated with cisplatin, carboplatin and the combination of carboplatin and paclitaxel. The best ranked biomarker candidates are discussed in the light of the existing literature, indicating plausible causal relationships to the relevant cancer entities.","PeriodicalId":53625,"journal":{"name":"Journal of Integrative Bioinformatics","volume":" ","pages":""},"PeriodicalIF":1.5,"publicationDate":"2024-12-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11698617/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142803005","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Ion channel classification through machine learning and protein language model embeddings. 通过机器学习和蛋白质语言模型嵌入进行离子通道分类。

IF 1.5 Q3 MATHEMATICAL & COMPUTATIONAL BIOLOGY

Journal of Integrative Bioinformatics

Pub Date : 2024-11-25 eCollection Date: 2024-12-01 DOI: 10.1515/jib-2023-0047

Hamed Ghazikhani, Gregory Butler

Ion channels are critical membrane proteins that regulate ion flux across cellular membranes, influencing numerous biological functions. The resource-intensive nature of traditional wet lab experiments for ion channel identification has led to an increasing emphasis on computational techniques. This study extends our previous work on protein language models for ion channel prediction, significantly advancing the methodology and performance. We employ a comprehensive array of machine learning algorithms, including k-Nearest Neighbors, Random Forest, Support Vector Machines, and Feed-Forward Neural Networks, alongside a novel Convolutional Neural Network (CNN) approach. These methods leverage fine-tuned embeddings from ProtBERT, ProtBERT-BFD, and MembraneBERT to differentiate ion channels from non-ion channels. Our empirical findings demonstrate that TooT-BERT-CNN-C, which combines features from ProtBERT-BFD and a CNN, substantially surpasses existing benchmarks. On our original dataset, it achieves a Matthews Correlation Coefficient (MCC) of 0.8584 and an accuracy of 98.35 %. More impressively, on a newly curated, larger dataset (DS-Cv2), it attains an MCC of 0.9492 and an ROC AUC of 0.9968 on the independent test set. These results not only highlight the power of integrating protein language models with deep learning for ion channel classification but also underscore the importance of using up-to-date, comprehensive datasets in bioinformatics tasks. Our approach represents a significant advancement in computational methods for ion channel identification, with potential implications for accelerating research in ion channel biology and aiding drug discovery efforts.

离子通道是调节跨细胞膜离子通量的关键膜蛋白，影响着多种生物功能。传统的湿实验室离子通道鉴定实验耗费大量资源，因此人们越来越重视计算技术。本研究扩展了我们之前在用于离子通道预测的蛋白质语言模型方面的工作，大大提高了方法和性能。我们采用了一系列全面的机器学习算法，包括 k-近邻、随机森林、支持向量机和前馈神经网络，以及一种新颖的卷积神经网络（CNN）方法。这些方法利用来自 ProtBERT、ProtBERT-BFD 和 MembraneBERT 的微调嵌入来区分离子通道和非离子通道。我们的实证研究结果表明，结合了 ProtBERT-BFD 和 CNN 特征的 TooT-BERT-CNN-C 大大超越了现有基准。在我们的原始数据集上，它的马修斯相关系数（MCC）达到了 0.8584，准确率为 98.35%。更令人印象深刻的是，在新开发的更大数据集（DS-Cv2）上，它的马修斯相关系数（MCC）达到了 0.9492，独立测试集的 ROC AUC 达到了 0.9968。这些结果不仅凸显了蛋白质语言模型与深度学习在离子通道分类中的整合能力，还强调了在生物信息学任务中使用最新、全面数据集的重要性。我们的方法代表了离子通道识别计算方法的重大进步，对加速离子通道生物学研究和帮助药物发现工作具有潜在意义。

{"title":"Ion channel classification through machine learning and protein language model embeddings.","authors":"Hamed Ghazikhani, Gregory Butler","doi":"10.1515/jib-2023-0047","DOIUrl":"10.1515/jib-2023-0047","url":null,"abstract":"Ion channels are critical membrane proteins that regulate ion flux across cellular membranes, influencing numerous biological functions. The resource-intensive nature of traditional wet lab experiments for ion channel identification has led to an increasing emphasis on computational techniques. This study extends our previous work on protein language models for ion channel prediction, significantly advancing the methodology and performance. We employ a comprehensive array of machine learning algorithms, including k-Nearest Neighbors, Random Forest, Support Vector Machines, and Feed-Forward Neural Networks, alongside a novel Convolutional Neural Network (CNN) approach. These methods leverage fine-tuned embeddings from ProtBERT, ProtBERT-BFD, and MembraneBERT to differentiate ion channels from non-ion channels. Our empirical findings demonstrate that TooT-BERT-CNN-C, which combines features from ProtBERT-BFD and a CNN, substantially surpasses existing benchmarks. On our original dataset, it achieves a Matthews Correlation Coefficient (MCC) of 0.8584 and an accuracy of 98.35 %. More impressively, on a newly curated, larger dataset (DS-Cv2), it attains an MCC of 0.9492 and an ROC AUC of 0.9968 on the independent test set. These results not only highlight the power of integrating protein language models with deep learning for ion channel classification but also underscore the importance of using up-to-date, comprehensive datasets in bioinformatics tasks. Our approach represents a significant advancement in computational methods for ion channel identification, with potential implications for accelerating research in ion channel biology and aiding drug discovery efforts.","PeriodicalId":53625,"journal":{"name":"Journal of Integrative Bioinformatics","volume":" ","pages":""},"PeriodicalIF":1.5,"publicationDate":"2024-11-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11698620/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142689725","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

A roadmap for a middleware as a federation service for integrative data retrieval of agricultural data. 作为联盟服务的中间件路线图，用于农业数据的综合数据检索。

IF 1.5 Q3 MATHEMATICAL & COMPUTATIONAL BIOLOGY

Journal of Integrative Bioinformatics

Pub Date : 2024-11-07 eCollection Date: 2024-09-01 DOI: 10.1515/jib-2024-0027

Jorge García Brizuela, Carsten Scharfenberg, Carmen Scheuner, Florian Hoedt, Patrick König, Angela Kranz, Antonia Leidel, Daniel Martini, Gabriel Schneider, Julian Schneider, Lea Sophie Singson, Harald von Waldow, Nils Wehrmeyer, Björn Usadel, Stephan Lesch, Xenia Specka, Matthias Lange, Daniel Arend

Agriculture is confronted with several challenges such as climate change, the loss of biodiversity and stagnating productivity. The massive increasing amount of data and new digital technologies promise to overcome them, but they necessitate careful data integration and data management to make them usable. The FAIRagro consortium is part of the National Research Data Infrastructure (NFDI) in Germany and will develop FAIR compliant infrastructure services for the agrosystems science community, which will be integrated in the existing research data infrastructure service landscape. Here we present the initial steps of designing and implementing the FAIRagro middleware infrastructure to connect existing data infrastructures. The middleware will feature services for the seamless data integration across diverse infrastructures. Data and metadata are streamlined for research in agrosystems science by downstream processing in the central FAIRagro Search and Inventory Portal and the data integration and analysis workflow system "SciWIn".

农业面临着气候变化、生物多样性丧失和生产力停滞不前等诸多挑战。不断增加的海量数据和新的数字技术有望克服这些挑战，但必须进行细致的数据整合和数据管理，才能使这些数据发挥作用。FAIRagro联盟是德国国家研究数据基础设施（NFDI）的一部分，将为农业系统科学界开发符合FAIR标准的基础设施服务，并将整合到现有的研究数据基础设施服务中。在此，我们将介绍设计和实施 FAIRagro 中间件基础设施以连接现有数据基础设施的初步步骤。该中间件将提供服务，以实现不同基础设施之间的无缝数据集成。通过在 FAIRagro 搜索和目录中央门户网站以及数据集成和分析工作流系统 "SciWIn "中进行下游处理，数据和元数据将被简化，以用于农业系统科学研究。

{"title":"A roadmap for a middleware as a federation service for integrative data retrieval of agricultural data.","authors":"Jorge García Brizuela, Carsten Scharfenberg, Carmen Scheuner, Florian Hoedt, Patrick König, Angela Kranz, Antonia Leidel, Daniel Martini, Gabriel Schneider, Julian Schneider, Lea Sophie Singson, Harald von Waldow, Nils Wehrmeyer, Björn Usadel, Stephan Lesch, Xenia Specka, Matthias Lange, Daniel Arend","doi":"10.1515/jib-2024-0027","DOIUrl":"10.1515/jib-2024-0027","url":null,"abstract":"Agriculture is confronted with several challenges such as climate change, the loss of biodiversity and stagnating productivity. The massive increasing amount of data and new digital technologies promise to overcome them, but they necessitate careful data integration and data management to make them usable. The FAIRagro consortium is part of the National Research Data Infrastructure (NFDI) in Germany and will develop FAIR compliant infrastructure services for the agrosystems science community, which will be integrated in the existing research data infrastructure service landscape. Here we present the initial steps of designing and implementing the FAIRagro middleware infrastructure to connect existing data infrastructures. The middleware will feature services for the seamless data integration across diverse infrastructures. Data and metadata are streamlined for research in agrosystems science by downstream processing in the central FAIRagro Search and Inventory Portal and the data integration and analysis workflow system \"SciWIn\".","PeriodicalId":53625,"journal":{"name":"Journal of Integrative Bioinformatics","volume":" ","pages":""},"PeriodicalIF":1.5,"publicationDate":"2024-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11602230/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142585141","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

International symposium on integrative bioinformatics 2024 - editorial. 2024 年综合生物信息学国际研讨会--社论。

IF 1.5 Q3 MATHEMATICAL & COMPUTATIONAL BIOLOGY

Journal of Integrative Bioinformatics

Pub Date : 2024-11-07 eCollection Date: 2024-09-01 DOI: 10.1515/jib-2024-0051

Can Türker, Christian Panse, Bjorn Sommer, Marcel Friedrichs, Ralf Hofestädt

Integrative Bioinformatics faces the challenge of integrating, aligning, modelling, and simulating data in a coherent fashion to gain deeper insights into complex biological systems. This special issue of the Journal of Integrative Bioinformatics consists of six articles accepted for the presentation at the "18th International Symposium on Integrative Bioinformatics" held in Zürich on September 12-13, 2024. In addition, the symposium featured five keynote talks which will be discussed here as well.

整合生物信息学面临的挑战是如何以协调一致的方式整合、排列、建模和模拟数据，从而深入了解复杂的生物系统。本期《整合生物信息学杂志》特刊收录了六篇被 2024 年 9 月 12-13 日在苏黎世举行的 "第 18 届整合生物信息学国际研讨会 "录用的文章。此外，研讨会还举行了五场主题演讲，本文也将对这些演讲进行讨论。

引用次数: 0

The potential of Mitragyna speciosa leaves as a natural source of antioxidants for disease prevention. Mitragyna speciosa 叶子作为天然抗氧化剂预防疾病的潜力。

IF 1.5 Q3 MATHEMATICAL & COMPUTATIONAL BIOLOGY

Journal of Integrative Bioinformatics

Pub Date : 2024-09-17 eCollection Date: 2024-12-01 DOI: 10.1515/jib-2023-0030

Ihsanul Arief, Gagus Ketut Sunnardianto, Syahrul Khairi, Wahyu Dita Saputri

Mitragyna speciosa is famous for its addictive effect. On the other hand, this plant has good potential as an antioxidant agent, and so far, it was not explicitly explained what the most contributing compound in the leaves to that activity is. This study has been conducted using several computational methods to determine which compounds are the most active in interacting with cytochrome P450, myeloperoxidase, and NADPH oxidase proteins. First, virtual screening was carried out based on molecular docking, followed by profiling the properties of adsorption, distribution, metabolism, excretion, and toxicity (ADMET); the second one is the molecular dynamics (MD) simulations for 100 ns. The virtual screening results showed that three compounds acted as inhibitors for each protein: (-)-epicatechin, sitogluside, and corynoxeine. The ADMET profiles of the three compounds exhibit good drug ability and toxicity. The trajectories study from MD simulations predicts that the complexes of these three compounds with their respective target proteins are stable. Furthermore, these compounds identified in this computational study can be a potential guide for future experiments aimed at assessing the antioxidant properties through in vitro testing.

Mitragyna speciosa 以其成瘾效果而闻名。另一方面，这种植物具有很好的抗氧化潜力，但到目前为止，还没有明确解释叶片中对这种活性贡献最大的化合物是什么。本研究采用了多种计算方法来确定哪些化合物与细胞色素 P450、髓过氧化物酶和 NADPH 氧化酶蛋白的相互作用最为活跃。首先，在分子对接的基础上进行了虚拟筛选，然后分析了吸附、分布、代谢、排泄和毒性（ADMET）的特性；其次是进行了 100 ns 的分子动力学（MD）模拟。虚拟筛选结果表明，(-)-表儿茶素、西妥苷和堇菜素这三种化合物对每种蛋白质都有抑制作用。这三种化合物的 ADMET 曲线显示出良好的药物能力和毒性。根据 MD 模拟的轨迹研究预测，这三种化合物与各自靶蛋白的复合物是稳定的。此外，这项计算研究中发现的这些化合物可以为今后通过体外测试评估抗氧化特性的实验提供潜在指导。

{"title":"The potential of Mitragyna speciosa leaves as a natural source of antioxidants for disease prevention.","authors":"Ihsanul Arief, Gagus Ketut Sunnardianto, Syahrul Khairi, Wahyu Dita Saputri","doi":"10.1515/jib-2023-0030","DOIUrl":"10.1515/jib-2023-0030","url":null,"abstract":"Mitragyna speciosa is famous for its addictive effect. On the other hand, this plant has good potential as an antioxidant agent, and so far, it was not explicitly explained what the most contributing compound in the leaves to that activity is. This study has been conducted using several computational methods to determine which compounds are the most active in interacting with cytochrome P450, myeloperoxidase, and NADPH oxidase proteins. First, virtual screening was carried out based on molecular docking, followed by profiling the properties of adsorption, distribution, metabolism, excretion, and toxicity (ADMET); the second one is the molecular dynamics (MD) simulations for 100 ns. The virtual screening results showed that three compounds acted as inhibitors for each protein: (-)-epicatechin, sitogluside, and corynoxeine. The ADMET profiles of the three compounds exhibit good drug ability and toxicity. The trajectories study from MD simulations predicts that the complexes of these three compounds with their respective target proteins are stable. Furthermore, these compounds identified in this computational study can be a potential guide for future experiments aimed at assessing the antioxidant properties through in vitro testing.","PeriodicalId":53625,"journal":{"name":"Journal of Integrative Bioinformatics","volume":" ","pages":""},"PeriodicalIF":1.5,"publicationDate":"2024-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11698621/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142300772","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

MCMVDRP: a multi-channel multi-view deep learning framework for cancer drug response prediction. MCMVDRP：用于癌症药物反应预测的多通道多视角深度学习框架。

IF 1.5 Q3 MATHEMATICAL & COMPUTATIONAL BIOLOGY

Journal of Integrative Bioinformatics

Pub Date : 2024-09-02 eCollection Date: 2024-09-01 DOI: 10.1515/jib-2024-0026

Xiangyu Li, Xiumin Shi, Yuxuan Li, Lu Wang

Drug therapy remains the primary approach to treating tumours. Variability among cancer patients, including variations in genomic profiles, often results in divergent therapeutic responses to analogous anti-cancer drug treatments within the same cohort of cancer patients. Hence, predicting the drug response by analysing the genomic profile characteristics of individual patients holds significant research importance. With the notable progress in machine learning and deep learning, many effective methods have emerged for predicting drug responses utilizing features from both drugs and cell lines. However, these methods are inadequate in capturing a sufficient number of features inherent to drugs. Consequently, we propose a representational approach for drugs that incorporates three distinct types of features: the molecular graph, the SMILE strings, and the molecular fingerprints. In this study, a novel deep learning model, named MCMVDRP, is introduced for the prediction of cancer drug responses. In our proposed model, an amalgamation of these extracted features is performed, followed by the utilization of fully connected layers to predict the drug response based on the IC50 values. Experimental results demonstrate that the presented model outperforms current state-of-the-art models in performance.

药物治疗仍然是治疗肿瘤的主要方法。癌症患者之间的差异，包括基因组特征的差异，往往导致同一批癌症患者对类似的抗癌药物治疗产生不同的治疗反应。因此，通过分析个体患者的基因组特征来预测药物反应具有重要的研究意义。随着机器学习和深度学习的显著进步，出现了许多利用药物和细胞系特征预测药物反应的有效方法。然而，这些方法不足以捕捉到足够数量的药物固有特征。因此，我们提出了一种药物表征方法，其中包含三种不同类型的特征：分子图、SMILE 字符串和分子指纹。在这项研究中，我们引入了一种名为 MCMVDRP 的新型深度学习模型，用于预测癌症药物反应。在我们提出的模型中，首先对这些提取的特征进行合并，然后利用全连接层根据 IC50 值预测药物反应。实验结果表明，所提出的模型在性能上优于目前最先进的模型。

{"title":"MCMVDRP: a multi-channel multi-view deep learning framework for cancer drug response prediction.","authors":"Xiangyu Li, Xiumin Shi, Yuxuan Li, Lu Wang","doi":"10.1515/jib-2024-0026","DOIUrl":"10.1515/jib-2024-0026","url":null,"abstract":"Drug therapy remains the primary approach to treating tumours. Variability among cancer patients, including variations in genomic profiles, often results in divergent therapeutic responses to analogous anti-cancer drug treatments within the same cohort of cancer patients. Hence, predicting the drug response by analysing the genomic profile characteristics of individual patients holds significant research importance. With the notable progress in machine learning and deep learning, many effective methods have emerged for predicting drug responses utilizing features from both drugs and cell lines. However, these methods are inadequate in capturing a sufficient number of features inherent to drugs. Consequently, we propose a representational approach for drugs that incorporates three distinct types of features: the molecular graph, the SMILE strings, and the molecular fingerprints. In this study, a novel deep learning model, named MCMVDRP, is introduced for the prediction of cancer drug responses. In our proposed model, an amalgamation of these extracted features is performed, followed by the utilization of fully connected layers to predict the drug response based on the IC50 values. Experimental results demonstrate that the presented model outperforms current state-of-the-art models in performance.","PeriodicalId":53625,"journal":{"name":"Journal of Integrative Bioinformatics","volume":" ","pages":""},"PeriodicalIF":1.5,"publicationDate":"2024-09-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11602226/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142141758","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Leonhard Med, a trusted research environment for processing sensitive research data. Leonhard Med，一个用于处理敏感研究数据的可信研究环境。

IF 1.5 Q3 MATHEMATICAL & COMPUTATIONAL BIOLOGY

Journal of Integrative Bioinformatics

Pub Date : 2024-08-02 eCollection Date: 2024-09-01 DOI: 10.1515/jib-2024-0021

Michal J Okoniewski, Anna Wiegand, Diana Coman Schmid, Christian Bolliger, Cristian Bovino, Mattia Belluco, Thomas Wüst, Olivier Byrde, Sergio Maffioletti, Bernd Rinn

This paper provides an overview of the development and operation of the Leonhard Med Trusted Research Environment (TRE) at ETH Zurich. Leonhard Med gives scientific researchers the ability to securely work on sensitive research data. We give an overview of the user perspective, the legal framework for processing sensitive data, design history, current status, and operations. Leonhard Med is an efficient, highly secure Trusted Research Environment for data processing, hosted at ETH Zurich and operated by the Scientific IT Services (SIS) of ETH. It provides a full stack of security controls that allow researchers to store, access, manage, and process sensitive data according to Swiss legislation and ETH Zurich Data Protection policies. In addition, Leonhard Med fulfills the BioMedIT Information Security Policies and is compatible with international data protection laws and therefore can be utilized within the scope of national and international collaboration research projects. Initially designed as a "bare-metal" High-Performance Computing (HPC) platform to achieve maximum performance, Leonhard Med was later re-designed as a virtualized, private cloud platform to offer more flexibility to its customers. Sensitive data can be analyzed in secure, segregated spaces called tenants. Technical and Organizational Measures (TOMs) are in place to assure the confidentiality, integrity, and availability of sensitive data. At the same time, Leonhard Med ensures broad access to cutting-edge research software, especially for the analysis of human -omics data and other personalized health applications.

本文概述了苏黎世联邦理工学院 Leonhard Med 可信研究环境（TRE）的开发和运行情况。Leonhard Med 为科研人员提供了安全处理敏感研究数据的能力。我们概述了用户视角、处理敏感数据的法律框架、设计历史、现状和运行情况。Leonhard Med 是一个用于数据处理的高效、高度安全的可信研究环境，由苏黎世联邦理工学院托管，并由苏黎世联邦理工学院的科学信息技术服务部（SIS）负责运营。它提供一整套安全控制措施，允许研究人员根据瑞士法律和苏黎世联邦理工学院数据保护政策存储、访问、管理和处理敏感数据。此外，Leonhard Med 还符合 BioMedIT 信息安全政策，并与国际数据保护法兼容，因此可在国家和国际合作研究项目范围内使用。Leonhard Med 最初设计为 "裸机 "高性能计算（HPC）平台，以实现最高性能，后来重新设计为虚拟化私有云平台，为客户提供更大的灵活性。敏感数据可在被称为租户的安全隔离空间内进行分析。技术和组织措施（TOM）确保敏感数据的保密性、完整性和可用性。与此同时，Leonhard Med 还确保广泛使用最先进的研究软件，尤其是用于分析人类组学数据和其他个性化健康应用的软件。

{"title":"Leonhard Med, a trusted research environment for processing sensitive research data.","authors":"Michal J Okoniewski, Anna Wiegand, Diana Coman Schmid, Christian Bolliger, Cristian Bovino, Mattia Belluco, Thomas Wüst, Olivier Byrde, Sergio Maffioletti, Bernd Rinn","doi":"10.1515/jib-2024-0021","DOIUrl":"10.1515/jib-2024-0021","url":null,"abstract":"This paper provides an overview of the development and operation of the Leonhard Med Trusted Research Environment (TRE) at ETH Zurich. Leonhard Med gives scientific researchers the ability to securely work on sensitive research data. We give an overview of the user perspective, the legal framework for processing sensitive data, design history, current status, and operations. Leonhard Med is an efficient, highly secure Trusted Research Environment for data processing, hosted at ETH Zurich and operated by the Scientific IT Services (SIS) of ETH. It provides a full stack of security controls that allow researchers to store, access, manage, and process sensitive data according to Swiss legislation and ETH Zurich Data Protection policies. In addition, Leonhard Med fulfills the BioMedIT Information Security Policies and is compatible with international data protection laws and therefore can be utilized within the scope of national and international collaboration research projects. Initially designed as a \"bare-metal\" High-Performance Computing (HPC) platform to achieve maximum performance, Leonhard Med was later re-designed as a virtualized, private cloud platform to offer more flexibility to its customers. Sensitive data can be analyzed in secure, segregated spaces called tenants. Technical and Organizational Measures (TOMs) are in place to assure the confidentiality, integrity, and availability of sensitive data. At the same time, Leonhard Med ensures broad access to cutting-edge research software, especially for the analysis of human -omics data and other personalized health applications.","PeriodicalId":53625,"journal":{"name":"Journal of Integrative Bioinformatics","volume":" ","pages":""},"PeriodicalIF":1.5,"publicationDate":"2024-08-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11602227/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141876714","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0