首页 > 最新文献

Journal of bioinformatics and systems biology : Open access最新文献

英文 中文
CDKs Functional Analysis in Low Proliferating Early-Stage Pancreatic Ductal Adenocarcinoma. CDKs在低增殖早期胰腺导管腺癌中的功能分析。
Pub Date : 2023-01-01 Epub Date: 2023-08-16 DOI: 10.26502/jbsb.5107060
Shikai Zhu, Huining Yang, Lingling Liu, Zhilin Jiang, Juanjuan Ji, Xiao Wang, Lin Zhong, Fulin Liu, Xueliang Gao, Haizhen Wang, Yu Zhou

Pancreatic ductal adenocarcinoma (PDAC) is a highly devastating disease with a poor prognosis and growing incidence. In this study, we explored the potential roles of CDK1, CDK2, CDK4, and CDK6 in the progression of early-stage PDAC. Clinicopathologic and mRNA expression data and treatment information of 140 patients identified with stage I/II PDAC who underwent pancreaticoduodenectomy were obtained from the Cancer Genome Atlas data set. Our bioinformatic analysis showed that higher CDK1, CDK2, CDK4, or CDK6 expression was associated with a shorter median survival of the early-stage PDAC patients. Of note, in the low-proliferating pancreatic cancer group, CDKs expressions were significantly associated with proteins functioning in apoptosis, metastasis, immunity, or stemness. Among the low-proliferating PDAC, higher expression of CDK1 was associated with the shorter survival of patients, suggesting that CDK1 may regulate PDAC progression through cell cycle-independent mechanisms. Our experimental data showed that CDK1 knockdown/inhibition significantly suppressed the expression levels of AHR and POU5F1, two critical proteins functioning in cancer cell metastasis and stemness, in low-proliferating, but not in high-proliferating pancreatic cancer cells. In all, our study suggests that CDKs regulate PDAC progression not only through cell proliferation but also through apoptosis, metastasis, immunity, and stemness.

胰腺导管腺癌(PDAC)是一种极具破坏性的疾病,预后不良,发病率不断上升。在本研究中,我们探讨了CDK1、CDK2、CDK4和CDK6在早期PDAC进展中的潜在作用。从癌症基因组图谱数据集获得140名I/II期PDAC患者的临床病理和mRNA表达数据以及接受胰十二指肠切除术的治疗信息。我们的生物信息学分析显示,CDK1、CDK2、CDK4或CDK6的较高表达与早期PDAC患者的中位生存期较短有关。值得注意的是,在低增殖性胰腺癌症组中,CDKs的表达与细胞凋亡、转移、免疫或干燥的蛋白质功能显著相关。在低增殖PDAC中,CDK1的高表达与患者的较短生存期有关,这表明CDK1可能通过细胞周期无关的机制调节PDAC的进展。我们的实验数据显示,CDK1敲除/抑制显著抑制了AHR和POU5F1的表达水平,这两种关键蛋白在癌症细胞转移和干燥、低增殖但不在高增殖的癌症胰腺细胞中起作用。总之,我们的研究表明,CDKs不仅通过细胞增殖,还通过细胞凋亡、转移、免疫和干性来调节PDAC的进展。
{"title":"CDKs Functional Analysis in Low Proliferating Early-Stage Pancreatic Ductal Adenocarcinoma.","authors":"Shikai Zhu, Huining Yang, Lingling Liu, Zhilin Jiang, Juanjuan Ji, Xiao Wang, Lin Zhong, Fulin Liu, Xueliang Gao, Haizhen Wang, Yu Zhou","doi":"10.26502/jbsb.5107060","DOIUrl":"10.26502/jbsb.5107060","url":null,"abstract":"<p><p>Pancreatic ductal adenocarcinoma (PDAC) is a highly devastating disease with a poor prognosis and growing incidence. In this study, we explored the potential roles of CDK1, CDK2, CDK4, and CDK6 in the progression of early-stage PDAC. Clinicopathologic and mRNA expression data and treatment information of 140 patients identified with stage I/II PDAC who underwent pancreaticoduodenectomy were obtained from the Cancer Genome Atlas data set. Our bioinformatic analysis showed that higher CDK1, CDK2, CDK4, or CDK6 expression was associated with a shorter median survival of the early-stage PDAC patients. Of note, in the low-proliferating pancreatic cancer group, CDKs expressions were significantly associated with proteins functioning in apoptosis, metastasis, immunity, or stemness. Among the low-proliferating PDAC, higher expression of CDK1 was associated with the shorter survival of patients, suggesting that CDK1 may regulate PDAC progression through cell cycle-independent mechanisms. Our experimental data showed that CDK1 knockdown/inhibition significantly suppressed the expression levels of AHR and POU5F1, two critical proteins functioning in cancer cell metastasis and stemness, in low-proliferating, but not in high-proliferating pancreatic cancer cells. In all, our study suggests that CDKs regulate PDAC progression not only through cell proliferation but also through apoptosis, metastasis, immunity, and stemness.</p>","PeriodicalId":73617,"journal":{"name":"Journal of bioinformatics and systems biology : Open access","volume":"6 3","pages":"187-200"},"PeriodicalIF":0.0,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10516534/pdf/nihms-1926474.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"41156658","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
SMS: A Novel Approach for Bacterial Strain Analysis in Multiple Samples SMS:多样本细菌菌株分析的新方法
Pub Date : 2023-01-01 DOI: 10.26502/jbsb.5107065
Saidi Wang, Minerva Fatimae Ventolero, Haiyan Hu, Xiaoman Li
The analysis of the bacterial strains is important for understanding drug resistance. Despite the existence of dozens of computational tools for bacterial strain studies, most of them are for known bacterial strains. Almost all remaining tools are designed to analyze individual samples or local strain regions. With multiple shotgun metagenomic samples routinely generated in a project, it is necessary to create methods to infer novel bacterial strain genomes in multiple samples. To fill this gap, we developed a novel computational approach called SMS to de novo reconstruct bacterial Strain genomes in Multiple Samples. Tested on 702 simulated and 195 experimental datasets, SMS reliably identified the strain number, abundance, and polymorphisms. Compared with two existing approaches, SMS showed superior performance.
菌株的分析对了解耐药性具有重要意义。尽管存在数十种用于细菌菌株研究的计算工具,但其中大多数是针对已知菌株的。几乎所有剩下的工具都是用来分析单个样品或局部应变区域的。由于在一个项目中通常会生成多个shotgun宏基因组样本,因此有必要创建在多个样本中推断新菌株基因组的方法。为了填补这一空白,我们开发了一种称为SMS的新型计算方法来重新构建多个样本中的细菌菌株基因组。在702个模拟数据集和195个实验数据集上进行了测试,SMS可靠地识别了菌株数量、丰度和多态性。对比已有的两种方法,SMS的性能更优。
{"title":"SMS: A Novel Approach for Bacterial Strain Analysis in Multiple Samples","authors":"Saidi Wang, Minerva Fatimae Ventolero, Haiyan Hu, Xiaoman Li","doi":"10.26502/jbsb.5107065","DOIUrl":"https://doi.org/10.26502/jbsb.5107065","url":null,"abstract":"The analysis of the bacterial strains is important for understanding drug resistance. Despite the existence of dozens of computational tools for bacterial strain studies, most of them are for known bacterial strains. Almost all remaining tools are designed to analyze individual samples or local strain regions. With multiple shotgun metagenomic samples routinely generated in a project, it is necessary to create methods to infer novel bacterial strain genomes in multiple samples. To fill this gap, we developed a novel computational approach called SMS to de novo reconstruct bacterial Strain genomes in Multiple Samples. Tested on 702 simulated and 195 experimental datasets, SMS reliably identified the strain number, abundance, and polymorphisms. Compared with two existing approaches, SMS showed superior performance.","PeriodicalId":73617,"journal":{"name":"Journal of bioinformatics and systems biology : Open access","volume":"2 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"136260003","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Computational Pipeline to Control the Quality and Reduce Contamination in Single Retinal Ganglion Cells 控制单个视网膜神经节细胞质量和减少污染的计算管道
Pub Date : 2023-01-01 DOI: 10.26502/jbsb.5107061
Yeganeh Madadi, Hao Chen, Lu Lu, Monica M Jablonski, Robert W Williams, Siamak Yousefi
Single-cell transcriptome profiling has transformed our understanding of cellular heterogeneity. However, single-cell data with poor quality can impede proper identification of distinct cell populations and subsequent biological interpretations. In this study, we present a customized computational approach to control the quality and reduce contaminations in single-cell transcriptome profiling of retinal ganglion cells (RGCs). We leverage domain knowledge and statistical methods to effectively eliminate various sources of contaminants for identification of RGC types and subtypes. We show that our end-to-end computational pipeline improves the accuracy and reliability of single-cell transcriptome profiling of RGCs and enhances the biological interpretations. To show the effectiveness of our pipeline, we use 5,994 RGCs captured from retinas of mouse using Fluidigm technology as a benchmark dataset and compare with widely used quality control tools. Further, we introduce seven candidate F-RGC subtype markers that we identified after applying our introduced pipeline on the benchmark dataset. Our customized quality control pipeline could enable retinal single RGC probing with more granularity, leading to new insights into RGC-related visual diseases and development of therapeutic approaches.
单细胞转录组分析改变了我们对细胞异质性的理解。然而,单细胞数据质量差可能会阻碍不同细胞群的正确识别和随后的生物学解释。在这项研究中,我们提出了一种定制的计算方法来控制视网膜神经节细胞(RGCs)单细胞转录组分析的质量并减少污染。我们利用领域知识和统计方法有效地消除各种污染物来源,以确定研资局的类型和亚型。我们表明,我们的端到端计算管道提高了RGCs单细胞转录组分析的准确性和可靠性,并增强了生物学解释。为了证明我们的流水线的有效性,我们使用Fluidigm技术从小鼠视网膜捕获5994个rgc作为基准数据集,并与广泛使用的质量控制工具进行比较。此外,我们引入了七个候选的F-RGC亚型标记,这些标记是我们在基准数据集上应用我们引入的管道后确定的。我们定制的质量控制管道可以使视网膜单个RGC探测更细粒度,从而为RGC相关的视觉疾病和治疗方法的发展提供新的见解。
{"title":"A Computational Pipeline to Control the Quality and Reduce Contamination in Single Retinal Ganglion Cells","authors":"Yeganeh Madadi, Hao Chen, Lu Lu, Monica M Jablonski, Robert W Williams, Siamak Yousefi","doi":"10.26502/jbsb.5107061","DOIUrl":"https://doi.org/10.26502/jbsb.5107061","url":null,"abstract":"Single-cell transcriptome profiling has transformed our understanding of cellular heterogeneity. However, single-cell data with poor quality can impede proper identification of distinct cell populations and subsequent biological interpretations. In this study, we present a customized computational approach to control the quality and reduce contaminations in single-cell transcriptome profiling of retinal ganglion cells (RGCs). We leverage domain knowledge and statistical methods to effectively eliminate various sources of contaminants for identification of RGC types and subtypes. We show that our end-to-end computational pipeline improves the accuracy and reliability of single-cell transcriptome profiling of RGCs and enhances the biological interpretations. To show the effectiveness of our pipeline, we use 5,994 RGCs captured from retinas of mouse using Fluidigm technology as a benchmark dataset and compare with widely used quality control tools. Further, we introduce seven candidate F-RGC subtype markers that we identified after applying our introduced pipeline on the benchmark dataset. Our customized quality control pipeline could enable retinal single RGC probing with more granularity, leading to new insights into RGC-related visual diseases and development of therapeutic approaches.","PeriodicalId":73617,"journal":{"name":"Journal of bioinformatics and systems biology : Open access","volume":"1 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"69367526","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Conservation Genomics and Species Distribution Models Motivate Proactive and Collaborative Conservation in an Era of Rapid Change 保护基因组学和物种分布模型激励快速变化时代的主动和协作保护
Pub Date : 2023-01-01 DOI: 10.26502/jbsb.5107063
Maria Sagatelova, Rosa A Rodriguez- Pena, Thomas J Rodhouse, Jeffrey Lonneker, Kirk R Sherrill, Andrea D Wolfe
Small, fragmented plant populations with low genetic diversity are susceptible to deterministic and stochastic events that can affect long-term persistence of species. Penstemon lemhiensis Keck (Plantaginaceae) is a rare endemic with small, scattered populations across Idaho and Montana threatened by cumulative impacts of biological invasion, drought, and altered fire regimes. When contextualized by an understanding of rangewide distributions under different environmental change scenarios, conservation genetics can be leveraged to motivate proactive conservation action among collaborating stakeholder groups. We applied a genotypingby- sequencing (GBS) approach across eight populations and 93 individuals of P. lemhiensis. Genetic differentiation among populations followed an isolation-by-distance pattern and ranged from low to moderate (FST = 0.095-0.280). Values of inbreeding were low, and often negative (FIS = -0.039-0.032), indicating outbreeding within populations. Population structure analyses identified six ancestral populations and admixture across all individuals. We contextualized these findings by fitting bioclimatic niche models to past, present, and future climate regime scenarios. Habitat connectivity peaked mid-Holocene and nearly disappeared in the future scenario. Genetic analyses and species distribution models indicated that the species may experience drastic range contraction and accelerated isolation and inbreeding in future. We identified a core area in the Upper Big Hole Valley, Montana most likely to persist as suitable habitat. The National Park Service, Bureau of Land Management, and US Forest Service were identified as key stakeholders in that valley. We outline a proactive collaborative conservation strategy that aim to maintain wild P. lemhiensis populations.
遗传多样性低的小而分散的植物种群容易受到确定性和随机事件的影响,这些事件可能影响物种的长期持久性。车前草(Penstemon lemhiensis Keck)是一种罕见的地方性植物,在爱达荷州和蒙大拿州分布着小而分散的种群,受到生物入侵、干旱和改变的火灾制度的累积影响。在了解不同环境变化情景下的大范围分布的背景下,保护遗传学可以用来激励合作的利益相关者群体采取积极的保护行动。我们对8个种群和93个lemhiensis个体采用了基因分型测序(GBS)方法。群体间的遗传分化遵循距离隔离模式,FST = 0.095 ~ 0.280,范围从低到中等。近交值较低,且常为负值(FIS = -0.039 ~ 0.032),表明种群内有近交。种群结构分析确定了6个祖先种群和所有个体的混合种群。通过将生物气候生态位模型拟合到过去、现在和未来的气候情景中,我们将这些发现置于背景中。栖息地连通性在全新世中期达到峰值,在未来情景中几乎消失。遗传分析和物种分布模型表明,该物种未来可能经历剧烈的范围缩小和加速的隔离和近交。我们在蒙大拿州的上大洞谷确定了一个核心区,最有可能成为合适的栖息地。国家公园管理局、土地管理局和美国林务局被确定为该山谷的主要利益相关者。我们概述了一个积极主动的合作保护策略,旨在维持野生lemhiensis种群。
{"title":"Conservation Genomics and Species Distribution Models Motivate Proactive and Collaborative Conservation in an Era of Rapid Change","authors":"Maria Sagatelova, Rosa A Rodriguez- Pena, Thomas J Rodhouse, Jeffrey Lonneker, Kirk R Sherrill, Andrea D Wolfe","doi":"10.26502/jbsb.5107063","DOIUrl":"https://doi.org/10.26502/jbsb.5107063","url":null,"abstract":"Small, fragmented plant populations with low genetic diversity are susceptible to deterministic and stochastic events that can affect long-term persistence of species. Penstemon lemhiensis Keck (Plantaginaceae) is a rare endemic with small, scattered populations across Idaho and Montana threatened by cumulative impacts of biological invasion, drought, and altered fire regimes. When contextualized by an understanding of rangewide distributions under different environmental change scenarios, conservation genetics can be leveraged to motivate proactive conservation action among collaborating stakeholder groups. We applied a genotypingby- sequencing (GBS) approach across eight populations and 93 individuals of P. lemhiensis. Genetic differentiation among populations followed an isolation-by-distance pattern and ranged from low to moderate (FST = 0.095-0.280). Values of inbreeding were low, and often negative (FIS = -0.039-0.032), indicating outbreeding within populations. Population structure analyses identified six ancestral populations and admixture across all individuals. We contextualized these findings by fitting bioclimatic niche models to past, present, and future climate regime scenarios. Habitat connectivity peaked mid-Holocene and nearly disappeared in the future scenario. Genetic analyses and species distribution models indicated that the species may experience drastic range contraction and accelerated isolation and inbreeding in future. We identified a core area in the Upper Big Hole Valley, Montana most likely to persist as suitable habitat. The National Park Service, Bureau of Land Management, and US Forest Service were identified as key stakeholders in that valley. We outline a proactive collaborative conservation strategy that aim to maintain wild P. lemhiensis populations.","PeriodicalId":73617,"journal":{"name":"Journal of bioinformatics and systems biology : Open access","volume":"4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135952823","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Acyclovir Improves the Efficacy of Chemoradiation in Nasopharyngeal Cancer Containing the Epstein Barr Virus Genome 无环鸟苷提高含eb病毒基因组鼻咽癌放化疗的疗效
Pub Date : 2023-01-01 DOI: 10.26502/jbsb.5107064
Aditya Thandoni, Andrew Zloza, Devora Schiff, Malay Rao, Kwok-wai Lo, Bruce G Haffty, Sung Kim, Sachin R Jhawar
Nasopharyngeal carcinoma (NPC) is a malignancy endemic to East Asia and is caused by Epstein-Barr Virus (EBV)-mediated cancerous transformation of epithelial cells. The standard of care treatment for NPC involves radiation and chemotherapy. While treatment outcomes continue to improve, up to 50% of patients can be expected to recur by five years, and additional innovative treatment options are needed. We posit that a potential way to do this is by targeting the underlying cause of malignant transformation, namely EBV. One method by which EBV escapes immune surveillance is by undergoing latent phase replication, during which EBV expression of immunogenic proteins is reduced. However, chemoradiation is known to drive conversion of EBV from a latent to a lytic phase. This creates an opportunity for the targeting of EBV-infected cells utilizing antiviral drugs. Indeed, we found that combining acyclovir with cisplatin and radiation significantly decreases the viability of the EBV-infected C666- 1 cell line. Western blot quantification revealed a resultant increase of thymidine kinase (TK) and apoptosis-inducing mediators, cleaved PARP (cPARP) and phosphorylated ERK (pERK). These studies suggest that the addition of anti-viral drugs to frontline chemoradiation may improve outcomes in patients treated for EBV-related NPC and future in vivo and clinical studies are needed.
鼻咽癌(NPC)是东亚地区特有的恶性肿瘤,由eb病毒介导的上皮细胞癌变引起。鼻咽癌的标准护理治疗包括放疗和化疗。虽然治疗结果不断改善,但预计高达50%的患者在五年内可能会复发,因此需要更多的创新治疗方案。我们假设一种潜在的方法是通过靶向恶性转化的潜在原因,即EBV。EBV逃避免疫监视的一种方法是进行潜伏期复制,在此期间EBV免疫原性蛋白的表达减少。然而,已知放化疗可驱动EBV从潜伏期转化为裂解期。这为利用抗病毒药物靶向ebv感染细胞创造了机会。事实上,我们发现阿昔洛韦联合顺铂和放疗显著降低ebv感染的C666- 1细胞系的活力。Western blot结果显示,胸苷激酶(TK)和凋亡诱导介质,裂解PARP (cPARP)和磷酸化ERK (pERK)随之增加。这些研究表明,在一线放化疗中加入抗病毒药物可能会改善ebv相关鼻咽癌患者的治疗结果,未来需要进行体内和临床研究。
{"title":"Acyclovir Improves the Efficacy of Chemoradiation in Nasopharyngeal Cancer Containing the Epstein Barr Virus Genome","authors":"Aditya Thandoni, Andrew Zloza, Devora Schiff, Malay Rao, Kwok-wai Lo, Bruce G Haffty, Sung Kim, Sachin R Jhawar","doi":"10.26502/jbsb.5107064","DOIUrl":"https://doi.org/10.26502/jbsb.5107064","url":null,"abstract":"Nasopharyngeal carcinoma (NPC) is a malignancy endemic to East Asia and is caused by Epstein-Barr Virus (EBV)-mediated cancerous transformation of epithelial cells. The standard of care treatment for NPC involves radiation and chemotherapy. While treatment outcomes continue to improve, up to 50% of patients can be expected to recur by five years, and additional innovative treatment options are needed. We posit that a potential way to do this is by targeting the underlying cause of malignant transformation, namely EBV. One method by which EBV escapes immune surveillance is by undergoing latent phase replication, during which EBV expression of immunogenic proteins is reduced. However, chemoradiation is known to drive conversion of EBV from a latent to a lytic phase. This creates an opportunity for the targeting of EBV-infected cells utilizing antiviral drugs. Indeed, we found that combining acyclovir with cisplatin and radiation significantly decreases the viability of the EBV-infected C666- 1 cell line. Western blot quantification revealed a resultant increase of thymidine kinase (TK) and apoptosis-inducing mediators, cleaved PARP (cPARP) and phosphorylated ERK (pERK). These studies suggest that the addition of anti-viral drugs to frontline chemoradiation may improve outcomes in patients treated for EBV-related NPC and future in vivo and clinical studies are needed.","PeriodicalId":73617,"journal":{"name":"Journal of bioinformatics and systems biology : Open access","volume":"109 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135954929","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
NMF Clustering: Accessible NMF-based Clustering Utilizing GPU Acceleration. NMF 聚类:利用 GPU 加速的基于 NMF 的无障碍聚类。
Pub Date : 2023-01-01 Epub Date: 2023-12-21 DOI: 10.26502/jbsb.5107072
Ted Liefeld, Edwin Huang, Alexander T Wenzel, Kenneth Yoshimoto, Ashwyn K Sharma, Jason K Sicklick, Jill P Mesirov, Michael Reich

Non-negative Matrix Factorization (NMF) is an algorithm that can reduce high dimensional datasets of tens of thousands of genes to a handful of metagenes which are biologically easier to interpret. Application of NMF on gene expression data has been limited by its computationally intensive nature, which hinders its use on large datasets such as single-cell RNA sequencing (scRNA-seq) count matrices. We have implemented NMF based clustering to run on high performance GPU compute nodes using CuPy, a GPU backed python library, and the Message Passing Interface (MPI). This reduces the computation time by up to three orders of magnitude and makes the NMF Clustering analysis of large RNA-Seq and scRNA-seq datasets practical. We have made the method freely available through the GenePattern gateway, which provides free public access to hundreds of tools for the analysis and visualization of multiple 'omic data types. Its web-based interface gives easy access to these tools and allows the creation of multi-step analysis pipelines on high performance computing (HPC) clusters that enable reproducible in silico research for non-programmers.

非负矩阵因式分解(NMF)是一种算法,可将数以万计基因的高维数据集缩减为少量元基因,从而更容易从生物学角度进行解释。NMF 在基因表达数据上的应用一直受限于其计算密集的特性,这阻碍了它在单细胞 RNA 测序(scRNA-seq)计数矩阵等大型数据集上的应用。我们利用支持 GPU 的 Python 库 CuPy 和消息传递接口 (MPI),在高性能 GPU 计算节点上实现了基于 NMF 的聚类。这将计算时间缩短了三个数量级,使大型 RNA-Seq 和 scRNA-seq 数据集的 NMF 聚类分析成为现实。我们通过 GenePattern 网关免费提供该方法,该网关可免费向公众提供数百种工具,用于分析和可视化多种 "omic "数据类型。它基于网络的界面可以方便地访问这些工具,并允许在高性能计算(HPC)集群上创建多步骤分析管道,使非程序员也能进行可重复的硅学研究。
{"title":"NMF Clustering: Accessible NMF-based Clustering Utilizing GPU Acceleration.","authors":"Ted Liefeld, Edwin Huang, Alexander T Wenzel, Kenneth Yoshimoto, Ashwyn K Sharma, Jason K Sicklick, Jill P Mesirov, Michael Reich","doi":"10.26502/jbsb.5107072","DOIUrl":"10.26502/jbsb.5107072","url":null,"abstract":"<p><p>Non-negative Matrix Factorization (NMF) is an algorithm that can reduce high dimensional datasets of tens of thousands of genes to a handful of metagenes which are biologically easier to interpret. Application of NMF on gene expression data has been limited by its computationally intensive nature, which hinders its use on large datasets such as single-cell RNA sequencing (scRNA-seq) count matrices. We have implemented NMF based clustering to run on high performance GPU compute nodes using CuPy, a GPU backed python library, and the Message Passing Interface (MPI). This reduces the computation time by up to three orders of magnitude and makes the NMF Clustering analysis of large RNA-Seq and scRNA-seq datasets practical. We have made the method freely available through the GenePattern gateway, which provides free public access to hundreds of tools for the analysis and visualization of multiple 'omic data types. Its web-based interface gives easy access to these tools and allows the creation of multi-step analysis pipelines on high performance computing (HPC) clusters that enable reproducible <i>in silico</i> research for non-programmers.</p>","PeriodicalId":73617,"journal":{"name":"Journal of bioinformatics and systems biology : Open access","volume":"6 4","pages":"379-383"},"PeriodicalIF":0.0,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10883375/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139934481","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Using Machine Learning to identify microRNA biomarkers for predisposition to Huntington’s Disease 使用机器学习识别亨廷顿病易感性的microRNA生物标志物
Pub Date : 2023-01-01 DOI: 10.26502/jbsb.5107046
P. K, Sheridan C, Chandrasegaran S, Shanley Dp
{"title":"Using Machine Learning to identify microRNA biomarkers for predisposition to Huntington’s Disease","authors":"P. K, Sheridan C, Chandrasegaran S, Shanley Dp","doi":"10.26502/jbsb.5107046","DOIUrl":"https://doi.org/10.26502/jbsb.5107046","url":null,"abstract":"","PeriodicalId":73617,"journal":{"name":"Journal of bioinformatics and systems biology : Open access","volume":"1 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"69367931","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
LDBPR: Latest Database of Protein Research LDBPR:最新蛋白质研究数据库
Pub Date : 2022-01-01 DOI: 10.26502/jbsb.5107032
Shahid Ullah, Tianshun Gao, W. Rahman, F. Ullah, R. Jahan, Anees Ullah, Gulzar Ahmad, Muhammad Ijaz, Yihang Pan
With the vast and rapid growth of protein research data, a large number of databases are produced to annotate proteins. How to use these databases is becoming a crucial part of modern biology. Database research is usually the first step in the analysis of a new protein. The combined utilization of multiple databases could help researchers to understand the evolution, structure, and function of proteins. Therefore, a well comprehensive and large-scale resource integrated with most of databases is urgently desirable for systematic and precise studies of proteins. Here we designed a platform LDBPR with a collection of 564 latest scientific protein databases. It fully covered physical, chemical, and biological information of Protein sequence, structure, and model, domain, function, and protein‐ protein interactions. Furthermore, The LDBPR can be explored by three ways: (i) single database can be browsed by typing the name in the given search bar; (ii) all protein categories can be browsed by clicking on the name of the category; (iii) the image icon, could give all categorized protein databases on single click. Moreover, the programming languages including PHP, HTML, CSS, and MySQL were used to construct LDBPR for the protein scientific community that can be freely searched by clicking http://www.habdsk.org/ldbpr.php and will be updated timely.
随着蛋白质研究数据的大量快速增长,产生了大量对蛋白质进行注释的数据库。如何使用这些数据库正成为现代生物学的一个关键部分。数据库研究通常是分析新蛋白质的第一步。多种数据库的综合利用可以帮助研究人员了解蛋白质的进化、结构和功能。因此,迫切需要一个与大多数数据库相结合的全面、大规模的资源来系统、精确地研究蛋白质。在这里,我们设计了一个LDBPR平台,收集了564个最新的科学蛋白质数据库。它完全涵盖了蛋白质序列、结构和模型、结构域、功能和蛋白质-蛋白质相互作用的物理、化学和生物信息。此外,LDBPR可以通过三种方式进行探索:(i)可以通过在给定的搜索栏中输入名称来浏览单个数据库;(ii)所有蛋白质类别均可通过点击类别名称浏览;(iii)图像图标,一键显示所有分类的蛋白质数据库。并利用PHP、HTML、CSS、MySQL等编程语言构建蛋白质科学界LDBPR,点击http://www.habdsk.org/ldbpr.php可自由检索,并及时更新。
{"title":"LDBPR: Latest Database of Protein Research","authors":"Shahid Ullah, Tianshun Gao, W. Rahman, F. Ullah, R. Jahan, Anees Ullah, Gulzar Ahmad, Muhammad Ijaz, Yihang Pan","doi":"10.26502/jbsb.5107032","DOIUrl":"https://doi.org/10.26502/jbsb.5107032","url":null,"abstract":"With the vast and rapid growth of protein research data, a large number of databases are produced to annotate proteins. How to use these databases is becoming a crucial part of modern biology. Database research is usually the first step in the analysis of a new protein. The combined utilization of multiple databases could help researchers to understand the evolution, structure, and function of proteins. Therefore, a well comprehensive and large-scale resource integrated with most of databases is urgently desirable for systematic and precise studies of proteins. Here we designed a platform LDBPR with a collection of 564 latest scientific protein databases. It fully covered physical, chemical, and biological information of Protein sequence, structure, and model, domain, function, and protein‐ protein interactions. Furthermore, The LDBPR can be explored by three ways: (i) single database can be browsed by typing the name in the given search bar; (ii) all protein categories can be browsed by clicking on the name of the category; (iii) the image icon, could give all categorized protein databases on single click. Moreover, the programming languages including PHP, HTML, CSS, and MySQL were used to construct LDBPR for the protein scientific community that can be freely searched by clicking http://www.habdsk.org/ldbpr.php and will be updated timely.","PeriodicalId":73617,"journal":{"name":"Journal of bioinformatics and systems biology : Open access","volume":"1 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2022-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"69367509","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Transcriptome Dedifferentiation Observed in Animal Primary Cultures is Essential to Plant Reprogramming. 在动物原代培养中观察到的转录组去分化对植物重编程至关重要。
Pub Date : 2022-01-01 Epub Date: 2022-10-07 DOI: 10.26502/jbsb.5107039
Norichika Ogata

Tissue culture environment liberate cells from ordinary laws of multi-cellular organisms. This liberation enables cells several behaviors, such as growth, dedifferentiation, acquisition of pluripotency, immortalization and reprogramming. Each phenomenon is relating to each other and hardly to determine. Recently, dedifferentiation of animal cell was quantified as increasing liberality which is information entropy of transcriptome. The increasing liberality induced by tissue culture may reappear in plant cells too. Here we corroborated it. Measuring liberality during reprogramming of plant cells suggested that reprogramming is a combined phenomenon of dedifferentiation and re-differentiation.

组织培养环境将细胞从多细胞生物的一般规律中解放出来。这种释放使细胞能够进行多种行为,如生长、去分化、获得多能性、永生化和重编程。每种现象都是相互联系的,很难确定。近年来,动物细胞的去分化被量化为自由度的增加,即转录组的信息熵。组织培养诱导的自由度增加也可能在植物细胞中重现。这里我们证实了这一点。测量植物细胞重编程过程中的自由度表明,重编程是一个去分化和再分化的综合现象。
{"title":"Transcriptome Dedifferentiation Observed in Animal Primary Cultures is Essential to Plant Reprogramming.","authors":"Norichika Ogata","doi":"10.26502/jbsb.5107039","DOIUrl":"https://doi.org/10.26502/jbsb.5107039","url":null,"abstract":"<p><p>Tissue culture environment liberate cells from ordinary laws of multi-cellular organisms. This liberation enables cells several behaviors, such as growth, dedifferentiation, acquisition of pluripotency, immortalization and reprogramming. Each phenomenon is relating to each other and hardly to determine. Recently, dedifferentiation of animal cell was quantified as increasing liberality which is information entropy of transcriptome. The increasing liberality induced by tissue culture may reappear in plant cells too. Here we corroborated it. Measuring liberality during reprogramming of plant cells suggested that reprogramming is a combined phenomenon of dedifferentiation and re-differentiation.</p>","PeriodicalId":73617,"journal":{"name":"Journal of bioinformatics and systems biology : Open access","volume":" ","pages":"116-118"},"PeriodicalIF":0.0,"publicationDate":"2022-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9668052/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"40483821","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Computer-Aided Molecular Design of CCR2 - CCR5 Dual Antagonists for the Treatment of NASH CCR2 - CCR5双拮抗剂治疗NASH的计算机辅助分子设计
Pub Date : 2022-01-01 DOI: 10.26502/jbsb.5107035
S. Kumari, Elizabeth Sobhia M
Non-alcoholic fatty liver disease (NAFLD), one of the most common liver diseases, is caused by the disruption of hepatic lipid homeostasis by various metabolic disorders. The progression of NAFLD into Non-alcoholic steatohepatitis (NASH) is mediated by inflammatory chemokines, cytokines, mitochondrial dysfunction, and oxidative stress resulting in hepatocyte inflammation, ballooning, apoptosis, and activation of hepatic stellate cells (HSC). NASH can further lead to cirrhosis, hepatic carcinoma, and also it is predicted to be a major cause of liver transplantation over the next 10 years. Chemokine receptors are majorly involved in recruiting the monocytes in the liver where they are converted into pro-inflammatory macrophages, which further activate the hepatic stellate cell (HSCs) to promote their survival while activating collagen production and fibrogenesis. Thus, chemokines and their receptor play a vital role in the pathogenesis of NASH and can be a potential target for the treatment of NASH. Herein, in this study, we have carried out a structure-based design of CCR2 and CCR5 dual antagonists. We performed pharmacophore mapping studies followed by virtual screening of commercial database to obtain novel molecules which can potentially act as CCR2 and CCR5 dual antagonists. We also performed molecular docking studies of newly obtained hits molecules to see their interactions with both CCR2 and CCR5 receptors. Non-alcoholic By evaluating the chemical structures of the top five molecules, it was observed that all five molecules possess C2 symmetry. The docking results of the top five molecules showed that Thr284, Trp86, Tyr89, and Glu283 (in CCR5) and Asp283, Val37, Asn286, His202, and Gln288 (in CCR2) residues were involved in hydrogen bond interactions. The molecules also showed π-π stacking interaction with key residues Phe112, Tyr108, Phe109, Trp86, and Tyr89 (in CCR5) and HIE121, Trp98, Tyr120 (in CCR2). Additionally, in some molecule’s halogen bond was also observed. The residues which formed the halogen bond include Phe182, Thr195 (in CCR5), and Lys38 (in CCR2). The screened molecules showed the interactions with some key residues i.e., Phe112, Tyr108, Phe109 (in CCR5) and Trp98, Tyr120 (in CCR2) as same that of CVC interactions with CCR5 and CCR2 receptor.
非酒精性脂肪性肝病(Non-alcoholic fatty liver disease, NAFLD)是一种最常见的肝脏疾病,是由各种代谢紊乱破坏肝脏脂质稳态引起的。NAFLD向非酒精性脂肪性肝炎(NASH)的进展是由炎症趋化因子、细胞因子、线粒体功能障碍和氧化应激介导的,导致肝细胞炎症、气球化、凋亡和肝星状细胞(HSC)的激活。NASH可进一步导致肝硬化、肝癌,并且预计在未来10年将成为肝移植的主要原因。趋化因子受体主要参与募集肝脏单核细胞,并将其转化为促炎巨噬细胞,进而激活肝星状细胞(hsc),促进其存活,同时激活胶原蛋白的产生和纤维化。因此,趋化因子及其受体在NASH的发病机制中起着至关重要的作用,可能是NASH治疗的潜在靶点。因此,在本研究中,我们进行了基于结构的CCR2和CCR5双拮抗剂设计。我们进行了药效团定位研究,然后对商业数据库进行虚拟筛选,以获得可能作为CCR2和CCR5双重拮抗剂的新分子。我们还对新获得的hit分子进行了分子对接研究,以观察它们与CCR2和CCR5受体的相互作用。通过评价前5个分子的化学结构,发现5个分子均具有C2对称性。前5个分子的对接结果表明,Thr284、Trp86、Tyr89、Glu283 (CCR5中)和Asp283、Val37、Asn286、His202、Gln288 (CCR2中)残基参与了氢键相互作用。这些分子还与关键残基Phe112、Tyr108、Phe109、Trp86、Tyr89 (CCR5中)和HIE121、Trp98、Tyr120 (CCR2中)表现出π-π堆叠相互作用。此外,在一些分子中还观察到卤素键。形成卤素键的残基包括Phe182、Thr195 (CCR5中)和Lys38 (CCR2中)。筛选的分子与CVC与CCR5和CCR2受体的相互作用相同,与关键残基Phe112、Tyr108、Phe109 (CCR5中)和Trp98、Tyr120 (CCR2中)相互作用。
{"title":"Computer-Aided Molecular Design of CCR2 - CCR5 Dual Antagonists for the Treatment of NASH","authors":"S. Kumari, Elizabeth Sobhia M","doi":"10.26502/jbsb.5107035","DOIUrl":"https://doi.org/10.26502/jbsb.5107035","url":null,"abstract":"Non-alcoholic fatty liver disease (NAFLD), one of the most common liver diseases, is caused by the disruption of hepatic lipid homeostasis by various metabolic disorders. The progression of NAFLD into Non-alcoholic steatohepatitis (NASH) is mediated by inflammatory chemokines, cytokines, mitochondrial dysfunction, and oxidative stress resulting in hepatocyte inflammation, ballooning, apoptosis, and activation of hepatic stellate cells (HSC). NASH can further lead to cirrhosis, hepatic carcinoma, and also it is predicted to be a major cause of liver transplantation over the next 10 years. Chemokine receptors are majorly involved in recruiting the monocytes in the liver where they are converted into pro-inflammatory macrophages, which further activate the hepatic stellate cell (HSCs) to promote their survival while activating collagen production and fibrogenesis. Thus, chemokines and their receptor play a vital role in the pathogenesis of NASH and can be a potential target for the treatment of NASH. Herein, in this study, we have carried out a structure-based design of CCR2 and CCR5 dual antagonists. We performed pharmacophore mapping studies followed by virtual screening of commercial database to obtain novel molecules which can potentially act as CCR2 and CCR5 dual antagonists. We also performed molecular docking studies of newly obtained hits molecules to see their interactions with both CCR2 and CCR5 receptors. Non-alcoholic By evaluating the chemical structures of the top five molecules, it was observed that all five molecules possess C2 symmetry. The docking results of the top five molecules showed that Thr284, Trp86, Tyr89, and Glu283 (in CCR5) and Asp283, Val37, Asn286, His202, and Gln288 (in CCR2) residues were involved in hydrogen bond interactions. The molecules also showed π-π stacking interaction with key residues Phe112, Tyr108, Phe109, Trp86, and Tyr89 (in CCR5) and HIE121, Trp98, Tyr120 (in CCR2). Additionally, in some molecule’s halogen bond was also observed. The residues which formed the halogen bond include Phe182, Thr195 (in CCR5), and Lys38 (in CCR2). The screened molecules showed the interactions with some key residues i.e., Phe112, Tyr108, Phe109 (in CCR5) and Trp98, Tyr120 (in CCR2) as same that of CVC interactions with CCR5 and CCR2 receptor.","PeriodicalId":73617,"journal":{"name":"Journal of bioinformatics and systems biology : Open access","volume":"1 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2022-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"69367855","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Journal of bioinformatics and systems biology : Open access
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1