Advancements in medical imaging and artificial intelligence (AI) have revolutionized the field of cardiac diagnostics, providing accurate and efficient tools for assessing cardiac function. AI diagnostics claim to improve upon human-to-human variation, which is known to be significant. In practice, however, AI models for cardiac ultrasound are run on images acquired by human sonographers, whose quality and consistency may vary. Because echocardiography exhibits more acquisition variation than other medical imaging modalities, this variation may lead to out-of-distribution (OOD) data and unpredictable performance of AI tools. Recent advances in ultrasound technology have allowed the acquisition of both 3D and 2D data; however, 3D imaging has more limited temporal and spatial resolution and is still not routinely acquired. Because the training datasets used to develop AI algorithms are mostly built from 2D images, it is difficult to determine the impact of human variation on the real-world performance of AI tools. The objective of this project is to leverage 3D echocardiograms to simulate realistic human variation in image acquisition and better understand the OOD performance of a previously validated AI model. In doing so, we developed tools for interpreting 3D echo data and quantifiably recreating common variation in image acquisition between sonographers. We also developed a technique for finding good standard 2D views in 3D echo volumes. We found that the AI model we evaluated performed as expected when the view was good, but variations in acquisition position degraded its performance. Performance on far-from-ideal views was poor but still better than random, suggesting that some of the information being used permeates the whole volume rather than residing only in a quality view. Additionally, we found that variations in foreshortening did not produce the same errors that a human would make.
"Leveraging 3D Echocardiograms to Evaluate AI Model Performance in Predicting Cardiac Function on Out-of-Distribution Data." Grant Duffy, Kai Christensen, David Ouyang. Pacific Symposium on Biocomputing, 2024.
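As a rough illustration of how a 2D view can be resampled from a 3D echo volume at varying probe orientations, the sketch below slices a synthetic boolean volume along a plane rotated about one axis. The toy ellipsoid "anatomy", grid size, and nearest-neighbour sampling are all illustrative assumptions, not the authors' method:

```python
import math

# Toy 3D "echo volume": a boolean ellipsoid standing in for anatomy.
N = 32
volume = [[[((x - 16) / 10) ** 2 + ((y - 16) / 5) ** 2 + ((z - 16) / 8) ** 2 < 1
            for z in range(N)] for y in range(N)] for x in range(N)]

def extract_slice(vol, angle_deg, size=32):
    """Nearest-neighbour resampling of a plane rotated about the z-axis,
    mimicking a change in probe orientation."""
    a = math.radians(angle_deg)
    plane = []
    for u in range(size):
        row = []
        for v in range(size):
            # Rotate the in-plane coordinate u into volume (x, y) coordinates.
            x = int(16 + (u - 16) * math.cos(a))
            y = int(16 + (u - 16) * math.sin(a))
            inside = 0 <= x < N and 0 <= y < N and 0 <= v < N
            row.append(vol[x][y][v] if inside else False)
        plane.append(row)
    return plane

ideal = extract_slice(volume, 0)    # on-axis "standard" view
tilted = extract_slice(volume, 25)  # simulated acquisition variation
```

The two slices differ because the synthetic anatomy is not rotationally symmetric, which is exactly the property that lets a rotated acquisition produce a genuinely different 2D view of the same volume.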
Sajjad Fouladvand, Emma Pierson, Ivana Jankovic, David Ouyang, Jonathan H Chen, Roxana Daneshjou
Artificial Intelligence (AI) models are substantially enhancing our capability to analyze complex, multi-dimensional datasets. Generative AI and deep learning models have demonstrated significant advances in extracting knowledge from unstructured text, imaging, and structured, tabular data. These recent breakthroughs have inspired research in medicine, leading to the development of numerous tools for clinical decision support, monitoring, image interpretation, and triage. Nevertheless, comprehensive research is imperative to evaluate the potential impact and implications of AI systems in healthcare. At the 2024 Pacific Symposium on Biocomputing (PSB) session entitled "Artificial Intelligence in Clinical Medicine: Generative and Interactive Systems at the Human-Machine Interface", we spotlight research that develops and applies AI algorithms to solve real-world problems in healthcare.
"Session Introduction: Artificial Intelligence in Clinical Medicine: Generative and Interactive Systems at the Human-Machine Interface." Sajjad Fouladvand, Emma Pierson, Ivana Jankovic, David Ouyang, Jonathan H Chen, Roxana Daneshjou. Pacific Symposium on Biocomputing, 2024.
Yisu Yang, Aditi Sathe, Kurt Schilling, Niranjana Shashikumar, Elizabeth Moore, Logan Dumitrescu, Kimberly R Pechman, Bennett A Landman, Katherine A Gifford, Timothy J Hohman, Angela L Jefferson, Derek B Archer
The greatest known risk factor for Alzheimer's disease (AD) is age. While both normal aging and AD pathology involve structural changes in the brain, their trajectories of atrophy are not the same. Recent developments in artificial intelligence have encouraged studies to leverage neuroimaging-derived measures and deep learning approaches to predict brain age, which has shown promise as a sensitive biomarker in diagnosing and monitoring AD. However, prior efforts primarily involved structural magnetic resonance imaging and conventional diffusion MRI (dMRI) metrics without accounting for partial volume effects. To address this issue, we post-processed our dMRI scans with an advanced free-water (FW) correction technique to compute distinct FW-corrected fractional anisotropy (FA_FWcorr) and FW maps that allow for the separation of tissue from fluid in a scan. We built 3 densely connected neural networks from FW-corrected dMRI, T1-weighted MRI, and combined FW+T1 features, respectively, to predict brain age. We then investigated the relationship of actual age and predicted brain ages with cognition. We found that all models accurately predicted actual age in cognitively unimpaired (CU) controls (FW: r=0.66, p=1.62x10^-32; T1: r=0.61, p=1.45x10^-26; FW+T1: r=0.77, p=6.48x10^-50) and distinguished between CU and mild cognitive impairment participants (FW: p=0.006; T1: p=0.048; FW+T1: p=0.003), with FW+T1-derived age showing the best performance. Additionally, all predicted brain age models were significantly associated with cross-sectional cognition (memory, FW: β=-1.094, p=6.32x10^-7; T1: β=-1.331, p=6.52x10^-7; FW+T1: β=-1.476, p=2.53x10^-10; executive function, FW: β=-1.276, p=1.46x10^-9; T1: β=-1.337, p=2.52x10^-7; FW+T1: β=-1.850, p=3.85x10^-17) and longitudinal cognition (memory, FW: β=-0.091, p=4.62x10^-11; T1: β=-0.097, p=1.40x10^-8; FW+T1: β=-0.101, p=1.35x10^-11; executive function, FW: β=-0.125, p=1.20x10^-10; T1: β=-0.163, p=4.25x10^-12; FW+T1: β=-0.158, p=1.65x10^-14). Our findings provide evidence that both T1-weighted MRI and dMRI measures improve brain age prediction and support predicted brain age as a sensitive biomarker of cognition and cognitive decline.
"A deep neural network estimation of brain age is sensitive to cognitive impairment and decline." Yisu Yang, Aditi Sathe, Kurt Schilling, Niranjana Shashikumar, Elizabeth Moore, Logan Dumitrescu, Kimberly R Pechman, Bennett A Landman, Katherine A Gifford, Timothy J Hohman, Angela L Jefferson, Derek B Archer. Pacific Symposium on Biocomputing, 2024. Open-access PDF: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10764074/pdf/
Jason H Moore, Xi Li, Jui-Hsuan Chang, Nicholas P Tatonetti, Dan Theodorescu, Yong Chen, Folkert W Asselbergs, Mythreye Venkatesan, Zhiping Paul Wang
The concept of a digital twin came from the engineering, industrial, and manufacturing domains, where virtual objects or machines inform the design and development of real ones. This idea is appealing for precision medicine, where digital twins of patients could help inform healthcare decisions. We have developed a methodology for generating and using digital twins for clinical outcome prediction. We introduce a new approach that combines synthetic data and network science to create digital twins (i.e., SynTwin) for precision medicine. First, our approach estimates the distance between all subjects based on their available features. Second, the distances are used to construct a network with subjects as nodes and edges connecting subjects whose distance is less than the percolation threshold. Third, communities or cliques of subjects are defined. Fourth, a large population of synthetic patients is generated using a synthetic data generation algorithm that models the correlation structure of the data. Fifth, digital twins are selected from the synthetic patient population that lie within the given distance defining a subject community in the network. Finally, we compare and contrast community-based prediction of clinical endpoints using real subjects, digital twins, or both, within and outside of the community. Key to this approach are the digital twins defined using patient similarity: hypothetical unobserved patients with patterns similar to nearby real patients, as defined by network distance and community structure. We apply our SynTwin approach to predicting mortality in a population-based cancer registry (n=87,674) from the Surveillance, Epidemiology, and End Results (SEER) program of the National Cancer Institute (USA). Our results demonstrate that nearest-network-neighbor prediction of mortality in this study is significantly improved with digital twins (AUROC=0.864, 95% CI=0.857-0.872) over real data alone (AUROC=0.791, 95% CI=0.781-0.800). These results suggest that a network-based digital twin strategy using synthetic patients may add value to precision medicine efforts.
"SynTwin: A graph-based approach for predicting clinical outcomes using digital twins derived from synthetic patients." Jason H Moore, Xi Li, Jui-Hsuan Chang, Nicholas P Tatonetti, Dan Theodorescu, Yong Chen, Folkert W Asselbergs, Mythreye Venkatesan, Zhiping Paul Wang. Pacific Symposium on Biocomputing, 2024. Open-access PDF: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10827004/pdf/
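The first three steps of the SynTwin pipeline (pairwise distances, a threshold network, subject communities) can be sketched in a few lines. The toy feature vectors and the fixed cutoff below are illustrative stand-ins for the paper's clinical features and percolation threshold:

```python
import math
from itertools import combinations

# Hypothetical toy cohort: each subject is a small feature vector.
subjects = {
    "s1": (0.10, 0.20), "s2": (0.15, 0.25), "s3": (0.90, 0.80),
    "s4": (0.85, 0.75), "s5": (0.50, 0.50),
}

# Step 1: pairwise distances.  Step 2: connect subjects whose distance
# falls below a fixed cutoff (standing in for the percolation threshold).
threshold = 0.2
edges = [(u, v) for u, v in combinations(subjects, 2)
         if math.dist(subjects[u], subjects[v]) < threshold]

# Step 3: communities as connected components (tiny union-find).
parent = {s: s for s in subjects}

def find(x):
    while parent[x] != x:
        parent[x] = parent[parent[x]]  # path halving
        x = parent[x]
    return x

for u, v in edges:
    parent[find(u)] = find(v)

communities = {}
for s in subjects:
    communities.setdefault(find(s), set()).add(s)
```

With communities in hand, synthetic patients generated to match the cohort's correlation structure would then be assigned to communities by the same distance rule, yielding the digital twins used for prediction.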
Rachel A Hoffing, Aimee M Deaton, Aaron M Holleman, Lynne Krohn, Philip J LoGerfo, Mollie E Plekan, Sebastian Akle Serrano, Paul Nioi, Lucas D Ward
A single gene can produce multiple transcripts with distinct molecular functions. Rare-variant association tests often aggregate all coding variants across individual genes, without accounting for the variants' presence or consequence in the resulting transcript isoforms. To evaluate the utility of transcript-aware variant sets, rare predicted loss-of-function (pLOF) variants were aggregated for 17,035 protein-coding genes using 55,558 distinct transcript-specific variant sets. These sets were tested for association with 728 circulating proteins and 188 quantitative phenotypes across 406,921 individuals in the UK Biobank. The transcript-specific approach resulted in larger estimated effects of pLOF variants decreasing serum cis-protein levels compared to the gene-based approach (p_binom ≤ 2x10^-16). Additionally, 251 quantitative trait associations were identified as significant using the transcript-specific approach but not the gene-based approach, including PCSK5 transcript ENST00000376752 and standing height (transcript-specific statistic, P = 1.3x10^-16, effect = 0.7 SD decrease; gene-based statistic, P = 0.02, effect = 0.05 SD decrease) and LDLR transcript ENST00000252444 and apolipoprotein B (transcript-specific statistic, P = 5.7x10^-20, effect = 1.0 SD increase; gene-based statistic, P = 3.0x10^-4, effect = 0.2 SD increase). This approach demonstrates the importance of considering the effect of pLOFs on specific transcript isoforms when performing rare-variant association studies.
"Transcript-aware analysis of rare predicted loss-of-function variants in the UK Biobank elucidate new isoform-trait associations." Rachel A Hoffing, Aimee M Deaton, Aaron M Holleman, Lynne Krohn, Philip J LoGerfo, Mollie E Plekan, Sebastian Akle Serrano, Paul Nioi, Lucas D Ward. Pacific Symposium on Biocomputing, 2024.
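The contrast between gene-based and transcript-specific aggregation can be illustrated with a toy burden comparison. The variants, carrier sets, trait values, and the crude mean-difference "effect" below are all hypothetical, not the paper's data or test statistic:

```python
from statistics import mean

# Hypothetical toy example: rare pLOF variants annotated with the
# transcripts they disrupt, carrier sets, and a quantitative trait.
variants = {
    "v1": {"gene": "GENE1", "transcripts": {"T1", "T2"}},
    "v2": {"gene": "GENE1", "transcripts": {"T1"}},
    "v3": {"gene": "GENE1", "transcripts": {"T2"}},
}
carriers = {"v1": {"p1"}, "v2": {"p2"}, "v3": {"p3"}}
trait = {"p1": -1.2, "p2": -0.9, "p3": 0.1, "p4": 0.0, "p5": 0.2}

def burden_effect(variant_set):
    """Mean trait difference between carriers of any variant in the
    set and everyone else (a crude stand-in for a burden test)."""
    carrier_ids = set().union(*(carriers[v] for v in variant_set))
    non_carriers = set(trait) - carrier_ids
    return (mean(trait[p] for p in carrier_ids)
            - mean(trait[p] for p in non_carriers))

# Gene-based set: all pLOF variants in GENE1 pooled together.
gene_set = [v for v, a in variants.items() if a["gene"] == "GENE1"]
# Transcript-specific set: only variants affecting transcript T1.
t1_set = [v for v, a in variants.items() if "T1" in a["transcripts"]]

effects = {"gene": burden_effect(gene_set), "T1": burden_effect(t1_set)}
```

Here the T1-restricted set excludes v3 (which spares T1), so the carrier group is enriched for truly affected individuals and the estimated effect is larger in magnitude, mirroring the larger transcript-specific effects reported in the abstract.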
Ayush Jain, Marie-Laure Charpignon, Irene Y Chen, Anthony Philippakis, Ahmed Alaa
The drug development pipeline for a new compound can last 10-20 years and cost over $10 billion. Drug repurposing offers a more time- and cost-effective alternative. Computational approaches based on network graph representations, comprising a mixture of disease nodes and their interactions, have recently yielded new drug repurposing hypotheses, including suitable candidates for COVID-19. However, these interactomes remain aggregate by design and often lack disease specificity. This dilution of information may affect the relevance of drug node embeddings to a particular disease, the resulting drug-disease and drug-drug similarity scores, and therefore our ability to identify new targets or drug synergies. To address this problem, we propose constructing and learning disease-specific hypergraphs in which hyperedges encode biological pathways of various lengths. We use a modified node2vec algorithm to generate pathway embeddings. We evaluate our hypergraph's ability to find repurposing targets for an incurable but prevalent disease, Alzheimer's disease (AD), and compare our rank-ordered recommendations to those derived from a state-of-the-art knowledge graph, the multiscale interactome. Using our method, we successfully identified 7 promising repurposing candidates for AD that were ranked as unlikely repurposing targets by the multiscale interactome but for which the existing literature provides supporting evidence. Additionally, our drug repositioning suggestions are accompanied by explanations eliciting plausible biological pathways. In the future, we plan to scale our proposed method to 800+ diseases, combining single-disease hypergraphs into multi-disease hypergraphs to account for subpopulations with risk factors or to encode a given patient's comorbidities and formulate personalized repurposing recommendations. Supplementary materials and code: https://github.com/ayujain04/psb_supplement.
"Generating new drug repurposing hypotheses using disease-specific hypergraphs." Ayush Jain, Marie-Laure Charpignon, Irene Y Chen, Anthony Philippakis, Ahmed Alaa. Pacific Symposium on Biocomputing, 2024.
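One simple way to make hyperedges walkable, as a stand-in for the paper's modified node2vec, is clique expansion followed by random walks. The toy pathway hyperedges and the unbiased walk below are illustrative assumptions, not the authors' implementation:

```python
import random
from itertools import combinations

# Hypothetical disease-specific hypergraph: each hyperedge is a
# pathway linking drug, gene, and disease nodes.
hyperedges = [
    ("drugA", "gene1", "gene2"),
    ("gene2", "gene3", "diseaseX"),
    ("drugB", "gene3", "diseaseX"),
]

# Clique expansion: every pair inside a hyperedge becomes an ordinary
# edge, so standard random-walk embedding methods apply.
adj = {}
for he in hyperedges:
    for u, v in combinations(he, 2):
        adj.setdefault(u, set()).add(v)
        adj.setdefault(v, set()).add(u)

def random_walk(start, length, rng):
    """Unbiased walk; node2vec would bias steps with its p/q parameters."""
    walk = [start]
    for _ in range(length - 1):
        walk.append(rng.choice(sorted(adj[walk[-1]])))
    return walk

rng = random.Random(0)
walks = [random_walk(node, 5, rng) for node in sorted(adj) for _ in range(3)]
# A skip-gram model would consume these walk corpora to produce the
# node/pathway embeddings used for ranking repurposing candidates.
```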
Large Language Models (LLMs) are a type of artificial intelligence that has been revolutionizing various fields, including biomedicine. They can process and analyze large amounts of data, understand natural language, and generate new content, making them highly desirable in many biomedical applications and beyond. In this workshop, we aim to give attendees an in-depth understanding of the rise of LLMs in biomedicine and how they are being used to drive innovation and improve outcomes in the field, along with the associated challenges and pitfalls.
"LARGE LANGUAGE MODELS (LLMS) AND CHATGPT FOR BIOMEDICINE." Cecilia Arighi, Steven Brenner, Zhiyong Lu. Pacific Symposium on Biocomputing, 2024.
Digital health technologies such as wearable devices have transformed health data analytics, providing continuous, high-resolution functional data on various health metrics and thereby opening new avenues for innovative research. In this work, we introduce a new approach for generating causal hypotheses for a pair consisting of a continuous functional variable (e.g., physical activity recorded over time) and a binary scalar variable (e.g., a mobility condition indicator). Our method goes beyond traditional association-focused approaches and has the potential to reveal the underlying causal mechanism. We theoretically show that the proposed scalar-function causal model is identifiable from observational data alone. Our identifiability theory justifies a simple yet principled algorithm that discerns the causal relationship by comparing the likelihood functions of competing causal hypotheses. The robustness and applicability of our method are demonstrated through simulation studies and a real-world application using wearable device data from the National Health and Nutrition Examination Survey.
{"title":"Scalar-Function Causal Discovery for Generating Causal Hypotheses with Observational Wearable Device Data.","authors":"Valeriya Rogovchenko, Austin Sibu, Yang Ni","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>Digital health technologies such as wearable devices have transformed health data analytics, providing continuous, high-resolution functional data on various health metrics, thereby opening new avenues for innovative research. In this work, we introduce a new approach for generating causal hypotheses for a pair of a continuous functional variable (e.g., physical activities recorded over time) and a binary scalar variable (e.g., mobility condition indicator). Our method goes beyond traditional association-focused approaches and has the potential to reveal the underlying causal mechanism. We theoretically show that the proposed scalar-function causal model is identifiable with observational data alone. Our identifiability theory justifies the use of a simple yet principled algorithm to discern the causal relationship by comparing the likelihood functions of competing causal hypotheses. The robustness and applicability of our method are demonstrated through simulation studies and a real-world application using wearable device data from the National Health and Nutrition Examination Survey.</p>","PeriodicalId":34954,"journal":{"name":"Pacific Symposium on Biocomputing. 
Pacific Symposium on Biocomputing","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10764070/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139075201","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
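The likelihood-comparison idea in the abstract can be illustrated with a toy sketch. The specific model forms below (class-conditional Gaussian curves for one direction, a logistic model on a curve summary for the other) are our own illustrative simplifications, not the authors' scalar-function causal model:

```python
import numpy as np

def gaussian_loglik(resid):
    """Log-likelihood of residuals under an i.i.d. Gaussian with MLE variance."""
    n = resid.size
    sigma2 = max(float(np.mean(resid ** 2)), 1e-12)
    return -0.5 * n * (np.log(2 * np.pi * sigma2) + 1.0)

def bernoulli_loglik(y, p):
    p = np.clip(p, 1e-12, 1 - 1e-12)
    return float(np.sum(y * np.log(p) + (1 - y) * np.log(1 - p)))

def logistic_loglik(feature, y, steps=2000, lr=0.5):
    """Fit a one-feature logistic regression by gradient ascent; return its log-likelihood."""
    z = (feature - feature.mean()) / (feature.std() + 1e-12)
    w = b = 0.0
    for _ in range(steps):
        p = 1.0 / (1.0 + np.exp(-(w * z + b)))
        w += lr * float(np.mean((y - p) * z))
        b += lr * float(np.mean(y - p))
    p = 1.0 / (1.0 + np.exp(-(w * z + b)))
    return bernoulli_loglik(y, p)

def causal_direction(curves, y):
    """Compare joint log-likelihoods of the two competing causal hypotheses.

    curves: (n_subjects, n_timepoints) array of functional observations
    y:      (n_subjects,) binary array
    """
    # Hypothesis Y -> X: factorize p(Y) * p(X | Y), class-conditional curve means.
    ll_y_to_x = bernoulli_loglik(y, np.full(len(y), y.mean()))
    for label in (0, 1):
        grp = curves[y == label]
        if grp.size:
            ll_y_to_x += gaussian_loglik(grp - grp.mean(axis=0))
    # Hypothesis X -> Y: factorize p(X) * p(Y | X), logistic model on a curve summary.
    ll_x_to_y = gaussian_loglik(curves - curves.mean(axis=0))
    ll_x_to_y += logistic_loglik(curves.mean(axis=1), y)
    return "Y->X" if ll_y_to_x > ll_x_to_y else "X->Y"
```

In a purely linear-Gaussian scalar-scalar setting the two factorizations can be nearly likelihood-equivalent, which is why the paper's identifiability theory for the scalar-function pair is the substantive contribution; this sketch only shows the mechanics of scoring competing hypotheses.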
A Conversational Agent for Early Detection of Neurotoxic Effects of Medications through Automated Intensive Observation
Serguei Pakhomov, Jacob Solinsky, Martin Michalowski, Veronika Bachanova
We present a fully automated AI-based system for intensive monitoring of the cognitive symptoms of neurotoxicity that frequently appear as a result of immunotherapy for hematologic malignancies. Early manifestations of these symptoms are evident in the patient's speech in the form of mild aphasia and confusion, and can be detected and effectively treated prior to the onset of more serious and potentially life-threatening impairment. We have developed the Automated Neural Nursing Assistant (ANNA) system, designed to conduct a brief cognitive assessment several times per day over the telephone for 5-14 days following infusion of the immunotherapy medication. ANNA uses a conversational agent based on a large language model to elicit spontaneous speech in a semi-structured dialogue, followed by a series of brief language-based neurocognitive tests. In this paper we share ANNA's design and implementation and the results of a pilot functional evaluation study, and discuss the technical and logistical challenges facing the introduction of this type of technology in clinical practice. A large-scale clinical evaluation of ANNA will be conducted in an observational study of patients undergoing immunotherapy at the University of Minnesota Masonic Cancer Center starting in fall 2023.
Pacific Symposium on Biocomputing, 2024.
Creation of a Curated Database of Experimentally Determined Human Protein Structures for the Identification of Its Targetome
Armand Ovanessians, Carson Snow, Thomas Jennewein, Susanta Sarkar, Gil Speyer, Judith Klein-Seetharaman
Assembling an "integrated structural map of the human cell" at atomic resolution will require a complete set of all human protein structures available for interaction with other biomolecules - the human protein structure targetome - and a pipeline of automated tools that allow quantitative analysis of millions of protein-ligand interactions. Toward this goal, we describe the creation of a curated database of experimentally determined human protein structures. Starting with the sequences of 20,422 human proteins, we selected the most representative structure for each protein (where available) from the Protein Data Bank (PDB), ranking structures by coverage of the sequence by the structure, depth (the difference between the final and initial residue number of each chain), resolution, and the experimental method used to determine the structure. To enable expansion into an entire human targetome, we docked small-molecule ligands to our curated set of protein structures. Using design constraints derived from comparing structure assembly and ligand docking results obtained with challenging protein examples, we propose to combine this curated database of experimental structures with AlphaFold predictions and multi-domain assembly using DEMO2 in the future. To demonstrate the utility of our curated database in identifying the human protein structure targetome, we used docking with AutoDock Vina and created tools for automated analysis of the affinity and binding site locations of the thousands of protein-ligand prediction results. The resulting human targetome, which can be updated and expanded with an evolving curated database and increasing numbers of ligands, is a valuable addition to the growing toolkit of structural bioinformatics.
Pacific Symposium on Biocomputing, 2024.
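The structure-selection criteria described in the abstract (coverage, then depth, then resolution, then experimental method) amount to a lexicographic ranking, which can be sketched as follows. The field names and the method-preference ordering here are illustrative assumptions, not the paper's exact pipeline:

```python
from dataclasses import dataclass

# Illustrative method preference, best first; the paper's actual ordering may differ.
METHOD_RANK = {"X-RAY DIFFRACTION": 0, "ELECTRON MICROSCOPY": 1, "SOLUTION NMR": 2}

@dataclass
class Candidate:
    pdb_id: str
    coverage: float    # fraction of the protein sequence covered by the structure
    depth: int         # final minus initial residue number of the chain
    resolution: float  # in angstroms; lower is better
    method: str

def most_representative(candidates):
    """Lexicographic ranking: highest coverage, then greatest depth, then best
    (lowest) resolution, then preferred experimental method."""
    return min(
        candidates,
        key=lambda c: (-c.coverage, -c.depth, c.resolution,
                       METHOD_RANK.get(c.method, len(METHOD_RANK))),
    )
```

Negating coverage and depth inside the sort key lets a single `min` express "higher is better" for those fields while keeping "lower is better" for resolution and method rank.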