首页 > 最新文献

Frontiers in Bioinformatics最新文献

英文 中文
RIPS (rapid intuitive pathogen surveillance): a tool for surveillance of genome sequence data from foodborne bacterial pathogens RIPS(快速直观病原体监测):监测食源性细菌病原体基因组序列数据的工具
Pub Date : 2024-08-09 DOI: 10.3389/fbinf.2024.1415078
Tim Muruvanda, Hugh Rand, James Pettengill, A. Pightling
Monitoring data submitted to the National Center for Biotechnology Information’s Pathogen Detection whole-genome sequence database, which includes the foodborne bacterial pathogens Listeria monocytogenes, Salmonella enterica, and Escherichia coli, has proven effective for detecting emerging outbreaks. As part of the submission process, new sequence data are typed using a whole-genome multi-locus sequence typing scheme and clustered with sequences already in the database. Publicly available text files contain the results of these analyses. However, contextualizing and interpreting this information is complex. We present the Rapid Intuitive Pathogen Surveillance (RIPS) tool, which shows the results of the NCBI Rapid Reports, along with appropriate metadata, in a graphical, interactive dashboard. RIPS makes the information in the Rapid Reports useful for real-time surveillance of genome sequence databases.
事实证明,向美国国家生物技术信息中心病原体检测全基因组序列数据库提交的监测数据(包括食源性细菌病原体单核细胞增生李斯特菌、肠炎沙门氏菌和大肠埃希氏菌)可有效检测新出现的疫情。作为提交程序的一部分,新的序列数据将使用全基因组多焦点序列分型方案进行分型,并与数据库中已有的序列进行聚类。公开的文本文件包含这些分析的结果。然而,对这些信息进行背景分析和解释是非常复杂的。我们推出了快速直观病原体监测(RIPS)工具,它以图形、交互式仪表板的形式显示了 NCBI 快速报告的结果以及相应的元数据。RIPS 使快速报告中的信息有助于对基因组序列数据库进行实时监控。
{"title":"RIPS (rapid intuitive pathogen surveillance): a tool for surveillance of genome sequence data from foodborne bacterial pathogens","authors":"Tim Muruvanda, Hugh Rand, James Pettengill, A. Pightling","doi":"10.3389/fbinf.2024.1415078","DOIUrl":"https://doi.org/10.3389/fbinf.2024.1415078","url":null,"abstract":"Monitoring data submitted to the National Center for Biotechnology Information’s Pathogen Detection whole-genome sequence database, which includes the foodborne bacterial pathogens Listeria monocytogenes, Salmonella enterica, and Escherichia coli, has proven effective for detecting emerging outbreaks. As part of the submission process, new sequence data are typed using a whole-genome multi-locus sequence typing scheme and clustered with sequences already in the database. Publicly available text files contain the results of these analyses. However, contextualizing and interpreting this information is complex. We present the Rapid Intuitive Pathogen Surveillance (RIPS) tool, which shows the results of the NCBI Rapid Reports, along with appropriate metadata, in a graphical, interactive dashboard. RIPS makes the information in the Rapid Reports useful for real-time surveillance of genome sequence databases.","PeriodicalId":507586,"journal":{"name":"Frontiers in Bioinformatics","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-08-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141921242","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Editorial: Big data and artificial intelligence for genomics and therapeutics – Proceedings of the 19th Annual Meeting of the MidSouth Computational Biology and Bioinformatics Society (MCBIOS) 社论:用于基因组学和治疗学的大数据和人工智能--中南计算生物学和生物信息学学会(MCBIOS)第 19 届年会论文集
Pub Date : 2024-08-09 DOI: 10.3389/fbinf.2024.1470107
Huixiao Hong, Inimary Toby-Ogundeji, Robert J. Doerksen, Zhaohui Steve Qin
{"title":"Editorial: Big data and artificial intelligence for genomics and therapeutics – Proceedings of the 19th Annual Meeting of the MidSouth Computational Biology and Bioinformatics Society (MCBIOS)","authors":"Huixiao Hong, Inimary Toby-Ogundeji, Robert J. Doerksen, Zhaohui Steve Qin","doi":"10.3389/fbinf.2024.1470107","DOIUrl":"https://doi.org/10.3389/fbinf.2024.1470107","url":null,"abstract":"","PeriodicalId":507586,"journal":{"name":"Frontiers in Bioinformatics","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-08-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141923553","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
In silico studies of benzothiazole derivatives as potential inhibitors of Anopheles funestus and Anopheles gambiae trehalase 苯并噻唑衍生物作为疟原虫和冈比亚按蚊三卤酶潜在抑制剂的硅学研究
Pub Date : 2024-08-09 DOI: 10.3389/fbinf.2024.1428539
T. A. Ogunnupebi, G. Oduselu, O. F. Elebiju, O. Ajani, E. Adebiyi
In malaria management, insecticides play a crucial role in targeting disease vectors. Benzothiazole derivatives have also been reported to possess insecticidal properties, among several other properties they exhibit. The female Anopheles mosquito is responsible for transmitting the malaria parasite when infected. Anopheles gambiae (Ag) and Anopheles funestus (Af) are two of the most notable Anopheles species known to spread malaria in Nigeria. Trehalase is an enzyme that breaks down trehalose. Recent research has proposed it as a viable target for inhibition since it aids in flight and stress adaptation.This study aimed to investigate benzothiazole derivatives as potential inhibitors of trehalase of Anopheles funestus (AfTre) and Anopheles gambiae (AgTre) using toxicity profiling, molecular docking, and dynamic simulation for future insecticidal intervention. A total of 4,214 benzothiazole-based compounds were obtained from the PubChem database and subjected to screening against the 3D modelled structure of AfTre and AgTre. Compounds with some toxicity levels were optimised, and the obtained lead compounds were further investigated through molecular docking studies. Furthermore, the best hit was subjected to parameters such as RMSD, RMSF, SASA, Rg, and hydrogen bond to confirm its stability when in a complex with AfTre, and these parameters were compared to that of validamycin A (control ligand).The post-screening analysis showed binding affinities of −8.7 and −8.2 kcal/mol (compound 1), −8.2 and −7.4 kcal/mol (compound 2), compared to −6.3 and −5.1 kcal/mol (Validamycin A, a known inhibitor) against AfTre and AgTre, respectively. The molecular dynamics simulation showed that compound 1 (the best hit) had good stability when in complex with AfTre. These findings suggest that these best hits can serve as potential inhibitors for the development of novel insecticides in the control of malaria vectors.
在疟疾防治中,杀虫剂在针对病媒方面发挥着至关重要的作用。据报道,苯并噻唑衍生物除具有其他一些特性外,还具有杀虫特性。雌性按蚊在感染后负责传播疟疾寄生虫。冈比亚按蚊(Ag)和疟蚊(Af)是已知在尼日利亚传播疟疾的两个最显著的按蚊物种。三卤糖酶是一种分解三卤糖的酶。本研究旨在通过毒性分析、分子对接和动态模拟,研究苯并噻唑衍生物作为拟南芥(AfTre)和冈比亚按蚊(AgTre)三卤糖酶的潜在抑制剂,以用于未来的杀虫干预。研究人员从 PubChem 数据库中获取了 4,214 种苯并噻唑类化合物,并根据 AfTre 和 AgTre 的三维模型结构进行了筛选。对具有一定毒性水平的化合物进行了优化,并通过分子对接研究对获得的先导化合物进行了进一步研究。此外,对最佳化合物进行了 RMSD、RMSF、SASA、Rg 和氢键等参数测试,以确认其与 AfTre 复合物的稳定性,并将这些参数与有效霉素 A(对照配体)进行了比较。筛选后分析表明,化合物 1 与 AfTre 和 AgTre 的结合亲和力分别为 -8.7 和 -8.2 kcal/mol(化合物 1)、-8.2 和 -7.4 kcal/mol(化合物 2),而 Validamycin A(已知抑制剂)则分别为 -6.3 和 -5.1 kcal/mol。分子动力学模拟显示,化合物 1(最佳化合物)与 AfTre 复合物具有良好的稳定性。这些研究结果表明,这些最佳化合物可以作为潜在的抑制剂,用于开发新型杀虫剂,控制疟疾病媒。
{"title":"In silico studies of benzothiazole derivatives as potential inhibitors of Anopheles funestus and Anopheles gambiae trehalase","authors":"T. A. Ogunnupebi, G. Oduselu, O. F. Elebiju, O. Ajani, E. Adebiyi","doi":"10.3389/fbinf.2024.1428539","DOIUrl":"https://doi.org/10.3389/fbinf.2024.1428539","url":null,"abstract":"In malaria management, insecticides play a crucial role in targeting disease vectors. Benzothiazole derivatives have also been reported to possess insecticidal properties, among several other properties they exhibit. The female Anopheles mosquito is responsible for transmitting the malaria parasite when infected. Anopheles gambiae (Ag) and Anopheles funestus (Af) are two of the most notable Anopheles species known to spread malaria in Nigeria. Trehalase is an enzyme that breaks down trehalose. Recent research has proposed it as a viable target for inhibition since it aids in flight and stress adaptation.This study aimed to investigate benzothiazole derivatives as potential inhibitors of trehalase of Anopheles funestus (AfTre) and Anopheles gambiae (AgTre) using toxicity profiling, molecular docking, and dynamic simulation for future insecticidal intervention. A total of 4,214 benzothiazole-based compounds were obtained from the PubChem database and subjected to screening against the 3D modelled structure of AfTre and AgTre. Compounds with some toxicity levels were optimised, and the obtained lead compounds were further investigated through molecular docking studies. Furthermore, the best hit was subjected to parameters such as RMSD, RMSF, SASA, Rg, and hydrogen bond to confirm its stability when in a complex with AfTre, and these parameters were compared to that of validamycin A (control ligand).The post-screening analysis showed binding affinities of −8.7 and −8.2 kcal/mol (compound 1), −8.2 and −7.4 kcal/mol (compound 2), compared to −6.3 and −5.1 kcal/mol (Validamycin A, a known inhibitor) against AfTre and AgTre, respectively. The molecular dynamics simulation showed that compound 1 (the best hit) had good stability when in complex with AfTre. These findings suggest that these best hits can serve as potential inhibitors for the development of novel insecticides in the control of malaria vectors.","PeriodicalId":507586,"journal":{"name":"Frontiers in Bioinformatics","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-08-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141924896","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Editorial: Machine learning approaches to antimicrobials: discovery and resistance 社论:抗菌药的机器学习方法:发现与抗药性
Pub Date : 2024-08-09 DOI: 10.3389/fbinf.2024.1458237
S. Broschat, Shirley W. I. Siu, César de la Fuente-Nunez
{"title":"Editorial: Machine learning approaches to antimicrobials: discovery and resistance","authors":"S. Broschat, Shirley W. I. Siu, César de la Fuente-Nunez","doi":"10.3389/fbinf.2024.1458237","DOIUrl":"https://doi.org/10.3389/fbinf.2024.1458237","url":null,"abstract":"","PeriodicalId":507586,"journal":{"name":"Frontiers in Bioinformatics","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-08-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141921173","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Predictive identification and design of potent inhibitors targeting resistance-inducing candidate genes from E. coli whole-genome sequences 从大肠杆菌全基因组序列中预测性识别和设计针对抗性诱导候选基因的强效抑制剂
Pub Date : 2024-07-26 DOI: 10.3389/fbinf.2024.1411935
A. Aborode, Neeraj Kumar, Christopher Busayo Olowosoke, Tope Abraham Ibisanmi, Islamiyyah Ayoade, H. I. Umar, A. Jamiu, Basit Bolarinwa, Zainab Olapade, A. R. Idowu, Ibrahim O. Adelakun, I. A. Onifade, Benjamin Akangbe, Modesta Abacheng, O. O. Ikhimiukor, A. A. Awaji, R. Adesola
Introduction: This work utilizes predictive modeling in drug discovery to unravel potential candidate genes from Escherichia coli that are implicated in antimicrobial resistance; we subsequently target the gidB, MacB, and KatG genes with some compounds from plants with reported antibacterial potentials.Method: The resistance genes and plasmids were identified from 10 whole-genome sequence datasets of E. coli; forty two plant compounds were selected, and their 3D structures were retrieved and optimized for docking. The 3D crystal structures of KatG, MacB, and gidB were retrieved and prepared for molecular docking, molecular dynamics simulations, and ADMET profiling.Result: Hesperidin showed the least binding energy (kcal/mol) against KatG (−9.3), MacB (−10.7), and gidB (−6.7); additionally, good pharmacokinetic profiles and structure–dynamics integrity with their respective protein complexes were observed.Conclusion: Although these findings suggest hesperidin as a potential inhibitor against MacB, gidB, and KatG in E. coli, further validations through in vitro and in vivo experiments are needed. This research is expected to provide an alternative avenue for addressing existing antimicrobial resistances associated with E. coli’s MacB, gidB, and KatG.
简介:这项研究利用药物发现中的预测建模来揭示大肠杆菌中与抗菌药耐药性有关的潜在候选基因;随后,我们用一些据报道具有抗菌潜力的植物化合物来靶向 gidB、MacB 和 KatG 基因:方法:我们从 10 个大肠杆菌全基因组序列数据集中鉴定了耐药基因和质粒;选择了 42 种植物化合物,并检索和优化了它们的三维结构,以便进行对接。检索并制备了 KatG、MacB 和 gidB 的三维晶体结构,用于分子对接、分子动力学模拟和 ADMET 分析:结果:橙皮甙与 KatG(-9.3)、MacB(-10.7)和 gidB(-6.7)的结合能(kcal/mol)最小;此外,还观察到橙皮甙与各自蛋白质复合物的良好药代动力学特征和结构动力学完整性:尽管这些研究结果表明橙皮甙是一种潜在的大肠杆菌 MacB、gidB 和 KatG 抑制剂,但还需要通过体外和体内实验进一步验证。这项研究有望为解决与大肠杆菌的 MacB、gidB 和 KatG 相关的现有抗菌药耐药性问题提供另一种途径。
{"title":"Predictive identification and design of potent inhibitors targeting resistance-inducing candidate genes from E. coli whole-genome sequences","authors":"A. Aborode, Neeraj Kumar, Christopher Busayo Olowosoke, Tope Abraham Ibisanmi, Islamiyyah Ayoade, H. I. Umar, A. Jamiu, Basit Bolarinwa, Zainab Olapade, A. R. Idowu, Ibrahim O. Adelakun, I. A. Onifade, Benjamin Akangbe, Modesta Abacheng, O. O. Ikhimiukor, A. A. Awaji, R. Adesola","doi":"10.3389/fbinf.2024.1411935","DOIUrl":"https://doi.org/10.3389/fbinf.2024.1411935","url":null,"abstract":"Introduction: This work utilizes predictive modeling in drug discovery to unravel potential candidate genes from Escherichia coli that are implicated in antimicrobial resistance; we subsequently target the gidB, MacB, and KatG genes with some compounds from plants with reported antibacterial potentials.Method: The resistance genes and plasmids were identified from 10 whole-genome sequence datasets of E. coli; forty two plant compounds were selected, and their 3D structures were retrieved and optimized for docking. The 3D crystal structures of KatG, MacB, and gidB were retrieved and prepared for molecular docking, molecular dynamics simulations, and ADMET profiling.Result: Hesperidin showed the least binding energy (kcal/mol) against KatG (−9.3), MacB (−10.7), and gidB (−6.7); additionally, good pharmacokinetic profiles and structure–dynamics integrity with their respective protein complexes were observed.Conclusion: Although these findings suggest hesperidin as a potential inhibitor against MacB, gidB, and KatG in E. coli, further validations through in vitro and in vivo experiments are needed. This research is expected to provide an alternative avenue for addressing existing antimicrobial resistances associated with E. coli’s MacB, gidB, and KatG.","PeriodicalId":507586,"journal":{"name":"Frontiers in Bioinformatics","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-07-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141801132","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
CAEM-GBDT: a cancer subtype identifying method using multi-omics data and convolutional autoencoder network CAEM-GBDT:利用多组学数据和卷积自动编码器网络识别癌症亚型的方法
Pub Date : 2024-07-15 DOI: 10.3389/fbinf.2024.1403826
Jiquan Shen, Xuanhui Guo, Hanwen Bai, Junwei Luo
The identification of cancer subtypes plays a very important role in the field of medicine. Accurate identification of cancer subtypes is helpful for both cancer treatment and prognosis Currently, most methods for cancer subtype identification are based on single-omics data, such as gene expression data. However, multi-omics data can show various characteristics about cancer, which also can improve the accuracy of cancer subtype identification. Therefore, how to extract features from multi-omics data for cancer subtype identification is the main challenge currently faced by researchers. In this paper, we propose a cancer subtype identification method named CAEM-GBDT, which takes gene expression data, miRNA expression data, and DNA methylation data as input, and adopts convolutional autoencoder network to identify cancer subtypes. Through a convolutional encoder layer, the method performs feature extraction on the input data. Within the convolutional encoder layer, a convolutional self-attention module is embedded to recognize higher-level representations of the multi-omics data. The extracted high-level representations from the convolutional encoder are then concatenated with the input to the decoder. The GBDT (Gradient Boosting Decision Tree) is utilized for cancer subtype identification. In the experiments, we compare CAEM-GBDT with existing cancer subtype identifying methods. Experimental results demonstrate that the proposed CAEM-GBDT outperforms other methods. The source code is available from GitHub at https://github.com/gxh-1/CAEM-GBDT.git.
癌症亚型的识别在医学领域发挥着非常重要的作用。目前,大多数癌症亚型识别方法都是基于单组学数据,如基因表达数据。然而,多组学数据能显示癌症的各种特征,也能提高癌症亚型识别的准确性。因此,如何从多组学数据中提取癌症亚型识别的特征是目前研究人员面临的主要挑战。本文提出了一种名为 CAEM-GBDT 的癌症亚型识别方法,它以基因表达数据、miRNA 表达数据和 DNA 甲基化数据为输入,采用卷积自动编码器网络来识别癌症亚型。该方法通过卷积编码器层对输入数据进行特征提取。在卷积编码器层中,嵌入了一个卷积自注意模块,用于识别多组学数据的高层表征。然后,从卷积编码器中提取的高层表征与解码器的输入进行连接。梯度提升决策树(GBDT)用于癌症亚型识别。在实验中,我们将 CAEM-GBDT 与现有的癌症亚型识别方法进行了比较。实验结果表明,所提出的 CAEM-GBDT 优于其他方法。源代码可从 GitHub 上获取:https://github.com/gxh-1/CAEM-GBDT.git。
{"title":"CAEM-GBDT: a cancer subtype identifying method using multi-omics data and convolutional autoencoder network","authors":"Jiquan Shen, Xuanhui Guo, Hanwen Bai, Junwei Luo","doi":"10.3389/fbinf.2024.1403826","DOIUrl":"https://doi.org/10.3389/fbinf.2024.1403826","url":null,"abstract":"The identification of cancer subtypes plays a very important role in the field of medicine. Accurate identification of cancer subtypes is helpful for both cancer treatment and prognosis Currently, most methods for cancer subtype identification are based on single-omics data, such as gene expression data. However, multi-omics data can show various characteristics about cancer, which also can improve the accuracy of cancer subtype identification. Therefore, how to extract features from multi-omics data for cancer subtype identification is the main challenge currently faced by researchers. In this paper, we propose a cancer subtype identification method named CAEM-GBDT, which takes gene expression data, miRNA expression data, and DNA methylation data as input, and adopts convolutional autoencoder network to identify cancer subtypes. Through a convolutional encoder layer, the method performs feature extraction on the input data. Within the convolutional encoder layer, a convolutional self-attention module is embedded to recognize higher-level representations of the multi-omics data. The extracted high-level representations from the convolutional encoder are then concatenated with the input to the decoder. The GBDT (Gradient Boosting Decision Tree) is utilized for cancer subtype identification. In the experiments, we compare CAEM-GBDT with existing cancer subtype identifying methods. Experimental results demonstrate that the proposed CAEM-GBDT outperforms other methods. The source code is available from GitHub at https://github.com/gxh-1/CAEM-GBDT.git.","PeriodicalId":507586,"journal":{"name":"Frontiers in Bioinformatics","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-07-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141649028","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Network analysis of driver genes in human cancers 人类癌症驱动基因网络分析
Pub Date : 2024-07-08 DOI: 10.3389/fbinf.2024.1365200
S. S. Patil, Steven A. Roberts, A. Gebremedhin
Cancer is a heterogeneous disease that results from genetic alteration of cell cycle and proliferation controls. Identifying mutations that drive cancer, understanding cancer type specificities, and delineating how driver mutations interact with each other to establish disease is vital for identifying therapeutic vulnerabilities. Such cancer specific patterns and gene co-occurrences can be identified by studying tumor genome sequences, and networks have proven effective in uncovering relationships between sequences. We present two network-based approaches to identify driver gene patterns among tumor samples. The first approach relies on analysis using the Directed Weighted All Nearest Neighbors (DiWANN) model, which is a variant of sequence similarity network, and the second approach uses bipartite network analysis. A data reduction framework was implemented to extract the minimal relevant information for the sequence similarity network analysis, where a transformed reference sequence is generated for constructing the driver gene network. This data reduction process combined with the efficiency of the DiWANN network model, greatly lowered the computational cost (in terms of execution time and memory usage) of generating the networks enabling us to work at a much larger scale than previously possible. The DiWANN network helped us identify cancer types in which samples were more closely connected to each other suggesting they are less heterogeneous and potentially susceptible to a common drug. The bipartite network analysis provided insight into gene associations and co-occurrences. We identified genes that were broadly mutated in multiple cancer types and mutations exclusive to only a few. Additionally, weighted one-mode gene projections of the bipartite networks revealed a pattern of occurrence of driver genes in different cancers. Our study demonstrates that network-based approaches can be an effective tool in cancer genomics. The analysis identifies co-occurring and exclusive driver genes and mutations for specific cancer types, providing a better understanding of the driver genes that lead to tumor initiation and evolution.
癌症是一种异质性疾病,由细胞周期和增殖控制的基因改变引起。识别驱动癌症的基因突变、了解癌症类型的特异性以及界定驱动基因突变如何相互影响以导致疾病,对于识别治疗漏洞至关重要。这种癌症特异性模式和基因共现可以通过研究肿瘤基因组序列来确定,而网络已被证明能有效揭示序列之间的关系。我们提出了两种基于网络的方法来识别肿瘤样本中的驱动基因模式。第一种方法依赖于使用定向加权全近邻(DiWANN)模型进行分析,这是序列相似性网络的一种变体;第二种方法使用双方位网络分析。为了提取序列相似性网络分析所需的最小相关信息,实施了一个数据缩减框架,在此框架下生成了一个用于构建驱动基因网络的转换参考序列。这一数据缩减过程与 DiWANN 网络模型的效率相结合,大大降低了生成网络的计算成本(在执行时间和内存使用方面),使我们能够以比以前更大的规模开展工作。DiWANN 网络帮助我们确定了癌症类型,在这些癌症类型中,样本之间的联系更为紧密,这表明它们的异质性较低,有可能对共同的药物敏感。双向网络分析让我们深入了解了基因关联和共现。我们确定了在多种癌症类型中发生广泛突变的基因,以及仅在少数癌症类型中发生突变的基因。此外,双方格网络的加权单模式基因投影显示了驱动基因在不同癌症中的出现模式。我们的研究表明,基于网络的方法可以成为癌症基因组学的有效工具。该分析确定了特定癌症类型的共存和排他性驱动基因和突变,从而让人们更好地了解导致肿瘤发生和进化的驱动基因。
{"title":"Network analysis of driver genes in human cancers","authors":"S. S. Patil, Steven A. Roberts, A. Gebremedhin","doi":"10.3389/fbinf.2024.1365200","DOIUrl":"https://doi.org/10.3389/fbinf.2024.1365200","url":null,"abstract":"Cancer is a heterogeneous disease that results from genetic alteration of cell cycle and proliferation controls. Identifying mutations that drive cancer, understanding cancer type specificities, and delineating how driver mutations interact with each other to establish disease is vital for identifying therapeutic vulnerabilities. Such cancer specific patterns and gene co-occurrences can be identified by studying tumor genome sequences, and networks have proven effective in uncovering relationships between sequences. We present two network-based approaches to identify driver gene patterns among tumor samples. The first approach relies on analysis using the Directed Weighted All Nearest Neighbors (DiWANN) model, which is a variant of sequence similarity network, and the second approach uses bipartite network analysis. A data reduction framework was implemented to extract the minimal relevant information for the sequence similarity network analysis, where a transformed reference sequence is generated for constructing the driver gene network. This data reduction process combined with the efficiency of the DiWANN network model, greatly lowered the computational cost (in terms of execution time and memory usage) of generating the networks enabling us to work at a much larger scale than previously possible. The DiWANN network helped us identify cancer types in which samples were more closely connected to each other suggesting they are less heterogeneous and potentially susceptible to a common drug. The bipartite network analysis provided insight into gene associations and co-occurrences. We identified genes that were broadly mutated in multiple cancer types and mutations exclusive to only a few. Additionally, weighted one-mode gene projections of the bipartite networks revealed a pattern of occurrence of driver genes in different cancers. Our study demonstrates that network-based approaches can be an effective tool in cancer genomics. The analysis identifies co-occurring and exclusive driver genes and mutations for specific cancer types, providing a better understanding of the driver genes that lead to tumor initiation and evolution.","PeriodicalId":507586,"journal":{"name":"Frontiers in Bioinformatics","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-07-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141669653","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
PAGER-scFGA: unveiling cell functions and molecular mechanisms in cell trajectories through single-cell functional genomics analysis PAGER-scFGA:通过单细胞功能基因组学分析揭示细胞轨迹中的细胞功能和分子机制
Pub Date : 2024-04-16 DOI: 10.3389/fbinf.2024.1336135
Fengyuan Huang, Robert S. Welner, Jake Y. Chen, Zongliang Yue
Background: Understanding how cells and tissues respond to stress factors and perturbations during disease processes is crucial for developing effective prevention, diagnosis, and treatment strategies. Single-cell RNA sequencing (scRNA-seq) enables high-resolution identification of cells and exploration of cell heterogeneity, shedding light on cell differentiation/maturation and functional differences. Recent advancements in multimodal sequencing technologies have focused on improving access to cell-specific subgroups for functional genomics analysis. To facilitate the functional annotation of cell groups and characterization of molecular mechanisms underlying cell trajectories, we introduce the Pathways, Annotated Gene Lists, and Gene Signatures Electronic Repository for Single-Cell Functional Genomics Analysis (PAGER-scFGA).Results: We have developed PAGER-scFGA, which integrates cell functional annotations and gene-set enrichment analysis into popular single-cell analysis pipelines such as Scanpy. Using differentially expressed genes (DEGs) from pairwise cell clusters, PAGER-scFGA infers cell functions through the enrichment of potential cell-marker genesets. Moreover, PAGER-scFGA provides pathways, annotated gene lists, and gene signatures (PAGs) enriched in specific cell subsets with tissue compositions and continuous transitions along cell trajectories. Additionally, PAGER-scFGA enables the construction of a gene subcellular map based on DEGs and allows examination of the gene functional compartments (GFCs) underlying cell maturation/differentiation. In a real-world case study of mouse natural killer (mNK) cells, PAGER-scFGA revealed two major stages of natural killer (NK) cells and three trajectories from the precursor stage to NK T-like mature stage within blood, spleen, and bone marrow tissues. As the trajectories progress to later stages, the DEGs exhibit greater divergence and variability. However, the DEGs in different trajectories still interact within a network during NK cell maturation. Notably, PAGER-scFGA unveiled cell cytotoxicity, exocytosis, and the response to interleukin (IL) signaling pathways and associated network models during the progression from precursor NK cells to mature NK cells.Conclusion: PAGER-scFGA enables in-depth exploration of functional insights and presents a comprehensive knowledge map of gene networks and GFCs, which can be utilized for future studies and hypothesis generation. It is expected to become an indispensable tool for inferring cell functions and detecting molecular mechanisms within cell trajectories in single-cell studies. The web app (accessible at https://au-singlecell.streamlit.app/) is publicly available.
背景:了解细胞和组织如何应对疾病过程中的应激因素和干扰,对于制定有效的预防、诊断和治疗策略至关重要。单细胞 RNA 测序(scRNA-seq)能够高分辨率地识别细胞并探索细胞异质性,从而揭示细胞分化/成熟和功能差异。多模态测序技术的最新进展主要集中在改善细胞特异性亚群的获取,以进行功能基因组学分析。为了促进细胞群的功能注释和细胞轨迹的分子机制表征,我们引入了用于单细胞功能基因组学分析的通路、注释基因列表和基因特征电子资源库(PAGER-scFGA):我们开发了 PAGER-scFGA,它将细胞功能注释和基因组富集分析集成到了 Scanpy 等流行的单细胞分析管道中。PAGER-scFGA利用成对细胞簇的差异表达基因(DEGs),通过富集潜在的细胞标记基因组来推断细胞功能。此外,PAGER-scFGA 还提供了在特定细胞亚群中富集的通路、注释基因列表和基因特征 (PAG),以及组织组成和细胞轨迹的连续转换。此外,PAGER-scFGA 还能根据 DEGs 构建基因亚细胞图谱,并检查细胞成熟/分化所依赖的基因功能区(GFCs)。在对小鼠自然杀伤(mNK)细胞的实际案例研究中,PAGER-scFGA 揭示了自然杀伤(NK)细胞的两个主要阶段,以及在血液、脾脏和骨髓组织中从前体阶段到类似 NK T 的成熟阶段的三个轨迹。随着轨迹向后期发展,DEGs 表现出更大的差异和变异性。然而,在 NK 细胞成熟过程中,不同轨迹的 DEGs 仍会在一个网络中相互作用。值得注意的是,PAGER-scFGA揭示了从前体NK细胞到成熟NK细胞过程中的细胞毒性、外吞作用和对白细胞介素(IL)信号通路的反应以及相关网络模型:结论:PAGER-scFGA 能够深入探讨功能性见解,并呈现基因网络和 GFC 的全面知识图谱,可用于未来的研究和假设生成。它有望成为单细胞研究中推断细胞功能和检测细胞轨迹中分子机制的不可或缺的工具。该网络应用程序(可通过 https://au-singlecell.streamlit.app/ 访问)已公开发布。
{"title":"PAGER-scFGA: unveiling cell functions and molecular mechanisms in cell trajectories through single-cell functional genomics analysis","authors":"Fengyuan Huang, Robert S. Welner, Jake Y. Chen, Zongliang Yue","doi":"10.3389/fbinf.2024.1336135","DOIUrl":"https://doi.org/10.3389/fbinf.2024.1336135","url":null,"abstract":"Background: Understanding how cells and tissues respond to stress factors and perturbations during disease processes is crucial for developing effective prevention, diagnosis, and treatment strategies. Single-cell RNA sequencing (scRNA-seq) enables high-resolution identification of cells and exploration of cell heterogeneity, shedding light on cell differentiation/maturation and functional differences. Recent advancements in multimodal sequencing technologies have focused on improving access to cell-specific subgroups for functional genomics analysis. To facilitate the functional annotation of cell groups and characterization of molecular mechanisms underlying cell trajectories, we introduce the Pathways, Annotated Gene Lists, and Gene Signatures Electronic Repository for Single-Cell Functional Genomics Analysis (PAGER-scFGA).Results: We have developed PAGER-scFGA, which integrates cell functional annotations and gene-set enrichment analysis into popular single-cell analysis pipelines such as Scanpy. Using differentially expressed genes (DEGs) from pairwise cell clusters, PAGER-scFGA infers cell functions through the enrichment of potential cell-marker genesets. Moreover, PAGER-scFGA provides pathways, annotated gene lists, and gene signatures (PAGs) enriched in specific cell subsets with tissue compositions and continuous transitions along cell trajectories. Additionally, PAGER-scFGA enables the construction of a gene subcellular map based on DEGs and allows examination of the gene functional compartments (GFCs) underlying cell maturation/differentiation. In a real-world case study of mouse natural killer (mNK) cells, PAGER-scFGA revealed two major stages of natural killer (NK) cells and three trajectories from the precursor stage to NK T-like mature stage within blood, spleen, and bone marrow tissues. As the trajectories progress to later stages, the DEGs exhibit greater divergence and variability. However, the DEGs in different trajectories still interact within a network during NK cell maturation. Notably, PAGER-scFGA unveiled cell cytotoxicity, exocytosis, and the response to interleukin (IL) signaling pathways and associated network models during the progression from precursor NK cells to mature NK cells.Conclusion: PAGER-scFGA enables in-depth exploration of functional insights and presents a comprehensive knowledge map of gene networks and GFCs, which can be utilized for future studies and hypothesis generation. It is expected to become an indispensable tool for inferring cell functions and detecting molecular mechanisms within cell trajectories in single-cell studies. The web app (accessible at https://au-singlecell.streamlit.app/) is publicly available.","PeriodicalId":507586,"journal":{"name":"Frontiers in Bioinformatics","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-04-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140694896","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
From complex data to clear insights: visualizing molecular dynamics trajectories 从复杂数据到清晰见解:分子动力学轨迹可视化
Pub Date : 2024-04-11 DOI: 10.3389/fbinf.2024.1356659
Hayet Belghit, Mariano Spivak, Manuel Dauchez, Marc Baaden, J. Jonquet-Prevoteau
Advances in simulations, combined with technological developments in high-performance computing, have made it possible to produce a physically accurate dynamic representation of complex biological systems involving millions to billions of atoms over increasingly long simulation times. The analysis of these computed simulations is crucial, involving the interpretation of structural and dynamic data to gain insights into the underlying biological processes. However, this analysis becomes increasingly challenging due to the complexity of the generated systems with a large number of individual runs, ranging from hundreds to thousands of trajectories. This massive increase in raw simulation data creates additional processing and visualization challenges. Effective visualization techniques play a vital role in facilitating the analysis and interpretation of molecular dynamics simulations. In this paper, we focus mainly on the techniques and tools that can be used for visualization of molecular dynamics simulations, among which we highlight the few approaches used specifically for this purpose, discussing their advantages and limitations, and addressing the future challenges of molecular dynamics visualization.
模拟技术的进步,加上高性能计算技术的发展,使得在越来越长的模拟时间内,对涉及数百万到数十亿原子的复杂生物系统进行物理上精确的动态模拟成为可能。对这些计算模拟的分析至关重要,其中包括对结构和动态数据的解读,以深入了解潜在的生物过程。然而,由于生成系统的复杂性,以及大量的单个运行(从数百到数千个轨迹不等),这种分析变得越来越具有挑战性。原始模拟数据的大量增加带来了额外的处理和可视化挑战。有效的可视化技术在促进分子动力学模拟的分析和解释方面发挥着至关重要的作用。在本文中,我们主要关注可用于分子动力学模拟可视化的技术和工具,其中重点介绍了专门用于此目的的几种方法,讨论了它们的优势和局限性,并探讨了分子动力学可视化的未来挑战。
{"title":"From complex data to clear insights: visualizing molecular dynamics trajectories","authors":"Hayet Belghit, Mariano Spivak, Manuel Dauchez, Marc Baaden, J. Jonquet-Prevoteau","doi":"10.3389/fbinf.2024.1356659","DOIUrl":"https://doi.org/10.3389/fbinf.2024.1356659","url":null,"abstract":"Advances in simulations, combined with technological developments in high-performance computing, have made it possible to produce a physically accurate dynamic representation of complex biological systems involving millions to billions of atoms over increasingly long simulation times. The analysis of these computed simulations is crucial, involving the interpretation of structural and dynamic data to gain insights into the underlying biological processes. However, this analysis becomes increasingly challenging due to the complexity of the generated systems with a large number of individual runs, ranging from hundreds to thousands of trajectories. This massive increase in raw simulation data creates additional processing and visualization challenges. Effective visualization techniques play a vital role in facilitating the analysis and interpretation of molecular dynamics simulations. In this paper, we focus mainly on the techniques and tools that can be used for visualization of molecular dynamics simulations, among which we highlight the few approaches used specifically for this purpose, discussing their advantages and limitations, and addressing the future challenges of molecular dynamics visualization.","PeriodicalId":507586,"journal":{"name":"Frontiers in Bioinformatics","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-04-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140715799","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Prediction of polyspecificity from antibody sequence data by machine learning 通过机器学习从抗体序列数据中预测多特异性
Pub Date : 2024-04-08 DOI: 10.3389/fbinf.2023.1286883
Szabolcs Éliás, Clemens Wrzodek, Charlotte M. Deane, Alain C. Tissot, Stefan Klostermann, Francesca Ros
Antibodies are generated with great diversity in nature resulting in a set of molecules, each optimized to bind a specific target. Taking advantage of their diversity and specificity, antibodies make up for a large part of recently developed biologic drugs. For therapeutic use antibodies need to fulfill several criteria to be safe and efficient. Polyspecific antibodies can bind structurally unrelated molecules in addition to their main target, which can lead to side effects and decreased efficacy in a therapeutic setting, for example via reduction of effective drug levels. Therefore, we created a neural-network-based model to predict polyspecificity of antibodies using the heavy chain variable region sequence as input. We devised a strategy for enriching antibodies from an immunization campaign either for antigen-specific or polyspecific binding properties, followed by generation of a large sequencing data set for training and cross-validation of the model. We identified important physico-chemical features influencing polyspecificity by investigating the behaviour of this model. This work is a machine-learning-based approach to polyspecificity prediction and, besides increasing our understanding of polyspecificity, it might contribute to therapeutic antibody development.
抗体在自然界中的产生具有极大的多样性,从而产生了一系列分子,每种分子都经过优化,可与特定靶点结合。利用抗体的多样性和特异性,抗体在最近开发的生物药物中占了很大一部分。在治疗用途上,抗体需要满足几个标准才能安全有效。多特异性抗体除了能与主要靶点结合外,还能与结构上不相关的分子结合,这可能会导致副作用和治疗效果下降,例如降低有效药物水平。因此,我们创建了一个基于神经网络的模型,利用重链可变区序列作为输入来预测抗体的多特异性。我们设计了一种策略,从免疫活动中富集具有抗原特异性或多特异性结合特性的抗体,然后生成一个大型测序数据集,用于模型的训练和交叉验证。通过研究该模型的行为,我们确定了影响多特异性的重要物理化学特征。这项工作是一种基于机器学习的多特异性预测方法,除了增加我们对多特异性的了解,还可能有助于治疗性抗体的开发。
{"title":"Prediction of polyspecificity from antibody sequence data by machine learning","authors":"Szabolcs Éliás, Clemens Wrzodek, Charlotte M. Deane, Alain C. Tissot, Stefan Klostermann, Francesca Ros","doi":"10.3389/fbinf.2023.1286883","DOIUrl":"https://doi.org/10.3389/fbinf.2023.1286883","url":null,"abstract":"Antibodies are generated with great diversity in nature resulting in a set of molecules, each optimized to bind a specific target. Taking advantage of their diversity and specificity, antibodies make up for a large part of recently developed biologic drugs. For therapeutic use antibodies need to fulfill several criteria to be safe and efficient. Polyspecific antibodies can bind structurally unrelated molecules in addition to their main target, which can lead to side effects and decreased efficacy in a therapeutic setting, for example via reduction of effective drug levels. Therefore, we created a neural-network-based model to predict polyspecificity of antibodies using the heavy chain variable region sequence as input. We devised a strategy for enriching antibodies from an immunization campaign either for antigen-specific or polyspecific binding properties, followed by generation of a large sequencing data set for training and cross-validation of the model. We identified important physico-chemical features influencing polyspecificity by investigating the behaviour of this model. This work is a machine-learning-based approach to polyspecificity prediction and, besides increasing our understanding of polyspecificity, it might contribute to therapeutic antibody development.","PeriodicalId":507586,"journal":{"name":"Frontiers in Bioinformatics","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-04-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140731336","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Frontiers in Bioinformatics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1