Pacific Symposium on Biocomputing. Pacific Symposium on Biocomputing最新文献

英文中文

Polygenic risk scores for cardiometabolic traits demonstrate importance of ancestry for predictive precision medicine. 心脏代谢特征的多基因风险评分显示了祖先对于预测性精准医疗的重要性。

Q2 Computer Science

Pacific Symposium on Biocomputing. Pacific Symposium on Biocomputing

Pub Date : 2024-01-01

Rachel L Kember, Shefali S Verma, Anurag Verma, Brenda Xiao, Anastasia Lucas, Colleen M Kripke, Renae Judy, Jinbo Chen, Scott M Damrauer, Daniel J Rader, Marylyn D Ritchie

Polygenic risk scores (PRS) have predominantly been derived from genome-wide association studies (GWAS) conducted in European ancestry (EUR) individuals. In this study, we present an in-depth evaluation of PRS based on multi-ancestry GWAS for five cardiometabolic phenotypes in the Penn Medicine BioBank (PMBB) followed by a phenome-wide association study (PheWAS). We examine the PRS performance across all individuals and separately in African ancestry (AFR) and EUR ancestry groups. For AFR individuals, PRS derived using the multi-ancestry LD panel showed a higher effect size for four out of five PRSs (DBP, SBP, T2D, and BMI) than those derived from the AFR LD panel. In contrast, for EUR individuals, the multi-ancestry LD panel PRS demonstrated a higher effect size for two out of five PRSs (SBP and T2D) compared to the EUR LD panel. These findings underscore the potential benefits of utilizing a multi-ancestry LD panel for PRS derivation in diverse genetic backgrounds and demonstrate overall robustness in all individuals. Our results also revealed significant associations between PRS and various phenotypic categories. For instance, CAD PRS was linked with 18 phenotypes in AFR and 82 in EUR, while T2D PRS correlated with 84 phenotypes in AFR and 78 in EUR. Notably, associations like hyperlipidemia, renal failure, atrial fibrillation, coronary atherosclerosis, obesity, and hypertension were observed across different PRSs in both AFR and EUR groups, with varying effect sizes and significance levels. However, in AFR individuals, the strength and number of PRS associations with other phenotypes were generally reduced compared to EUR individuals. Our study underscores the need for future research to prioritize 1) conducting GWAS in diverse ancestry groups and 2) creating a cosmopolitan PRS methodology that is universally applicable across all genetic backgrounds. Such advances will foster a more equitable and personalized approach to precision medicine.

多基因风险评分（PRS）主要来自对欧洲血统（EUR）个体进行的全基因组关联研究（GWAS）。在本研究中，我们对基于宾夕法尼亚医学生物库（PMBB）中五种心脏代谢表型的多血统 GWAS 的多基因风险评分进行了深入评估，随后又进行了全表型关联研究（PheWAS）。我们研究了所有个体的 PRS 性能，并分别研究了非洲血统 (AFR) 和欧洲血统群体的 PRS 性能。对于非洲裔个体，使用多血统 LD 面板得出的 PRS 在五个 PRS 中的四个（DBP、SBP、T2D 和 BMI）显示出比从非洲裔 LD 面板得出的 PRS 更高的效应大小。相比之下，对于欧洲人，多家系 LD 面板 PRS 在五个 PRS 中的两个（SBP 和 T2D）显示出比欧洲 LD 面板更高的效应大小。这些发现凸显了在不同遗传背景下利用多家系家系 LD 面板推导 PRS 的潜在益处，并证明了在所有个体中的整体稳健性。我们的研究结果还揭示了 PRS 与各种表型类别之间的重要关联。例如，CAD PRS 与 18 种表型（AFR）和 82 种表型（EUR）相关，而 T2D PRS 与 84 种表型（AFR）和 78 种表型（EUR）相关。值得注意的是，高脂血症、肾功能衰竭、心房颤动、冠状动脉粥样硬化、肥胖和高血压等症状在非洲裔美国人和欧洲裔美国人群体中的不同PRS中都存在关联，其效应大小和显著性水平各不相同。然而，与欧洲人相比，非洲裔美国人的 PRS 与其他表型相关的强度和数量普遍降低。我们的研究强调，未来的研究需要优先考虑：1）在不同的祖先群体中开展 GWAS；2）创建一种普遍适用于所有遗传背景的世界性 PRS 方法。这些进展将促进更公平、更个性化的精准医疗方法。

{"title":"Polygenic risk scores for cardiometabolic traits demonstrate importance of ancestry for predictive precision medicine.","authors":"Rachel L Kember, Shefali S Verma, Anurag Verma, Brenda Xiao, Anastasia Lucas, Colleen M Kripke, Renae Judy, Jinbo Chen, Scott M Damrauer, Daniel J Rader, Marylyn D Ritchie","doi":"","DOIUrl":"","url":null,"abstract":"Polygenic risk scores (PRS) have predominantly been derived from genome-wide association studies (GWAS) conducted in European ancestry (EUR) individuals. In this study, we present an in-depth evaluation of PRS based on multi-ancestry GWAS for five cardiometabolic phenotypes in the Penn Medicine BioBank (PMBB) followed by a phenome-wide association study (PheWAS). We examine the PRS performance across all individuals and separately in African ancestry (AFR) and EUR ancestry groups. For AFR individuals, PRS derived using the multi-ancestry LD panel showed a higher effect size for four out of five PRSs (DBP, SBP, T2D, and BMI) than those derived from the AFR LD panel. In contrast, for EUR individuals, the multi-ancestry LD panel PRS demonstrated a higher effect size for two out of five PRSs (SBP and T2D) compared to the EUR LD panel. These findings underscore the potential benefits of utilizing a multi-ancestry LD panel for PRS derivation in diverse genetic backgrounds and demonstrate overall robustness in all individuals. Our results also revealed significant associations between PRS and various phenotypic categories. For instance, CAD PRS was linked with 18 phenotypes in AFR and 82 in EUR, while T2D PRS correlated with 84 phenotypes in AFR and 78 in EUR. Notably, associations like hyperlipidemia, renal failure, atrial fibrillation, coronary atherosclerosis, obesity, and hypertension were observed across different PRSs in both AFR and EUR groups, with varying effect sizes and significance levels. However, in AFR individuals, the strength and number of PRS associations with other phenotypes were generally reduced compared to EUR individuals. Our study underscores the need for future research to prioritize 1) conducting GWAS in diverse ancestry groups and 2) creating a cosmopolitan PRS methodology that is universally applicable across all genetic backgrounds. Such advances will foster a more equitable and personalized approach to precision medicine.","PeriodicalId":34954,"journal":{"name":"Pacific Symposium on Biocomputing. Pacific Symposium on Biocomputing","volume":"29 ","pages":"611-626"},"PeriodicalIF":0.0,"publicationDate":"2024-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10947742/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139075195","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Statistical analysis of single-cell protein data. 单细胞蛋白质数据统计分析。

Q2 Computer Science

Pacific Symposium on Biocomputing. Pacific Symposium on Biocomputing

Pub Date : 2024-01-01

Brooke L Fridley, Simon Vandekar, Inna Chervoneva, Julia Wrobel, Siyuan Ma

Immune modulation is considered a hallmark of cancer initiation and progression, with immune cell density being consistently associated with clinical outcomes of individuals with cancer. Multiplex immunofluorescence (mIF) microscopy combined with automated image analysis is a novel and increasingly used technique that allows for the assessment and visualization of the tumor microenvironment (TME). Recently, application of this new technology to tissue microarrays (TMAs) or whole tissue sections from large cancer studies has been used to characterize different cell populations in the TME with enhanced reproducibility and accuracy. Generally, mIF data has been used to examine the presence and abundance of immune cells in the tumor and stroma compartments; however, this aggregate measure assumes uniform patterns of immune cells throughout the TME and overlooks spatial heterogeneity. Recently, the spatial contexture of the TME has been explored with a variety of statistical methods. In this PSB workshop, speakers will present some of the state-of-the-art statistical methods for assessing the TIME from mIF data.

免疫调节被认为是癌症发生和发展的一个标志，免疫细胞密度一直与癌症患者的临床预后相关。多重免疫荧光（mIF）显微镜与自动图像分析相结合，是一种新颖的、应用日益广泛的技术，可对肿瘤微环境（TME）进行评估和可视化。最近，将这项新技术应用于组织微阵列（TMA）或大型癌症研究的整个组织切片已被用来描述肿瘤微环境中不同细胞群的特征，并提高了可重复性和准确性。一般来说，mIF 数据被用来检测肿瘤和基质区免疫细胞的存在和丰度；然而，这种综合测量方法假定整个 TME 中免疫细胞的模式是一致的，而忽略了空间异质性。最近，人们利用各种统计方法对肿瘤组织间质的空间背景进行了探索。在本次 PSB 研讨会上，演讲者将介绍从 mIF 数据中评估 TIME 的一些最新统计方法。

引用次数: 0

Clinfo.ai: An Open-Source Retrieval-Augmented Large Language Model System for Answering Medical Questions using Scientific Literature. Clinfo.ai：利用科学文献回答医学问题的开源检索增强型大语言模型系统。

Q2 Computer Science

Pacific Symposium on Biocomputing. Pacific Symposium on Biocomputing

Pub Date : 2024-01-01

Alejandro Lozano, Scott L Fleming, Chia-Chun Chiang, Nigam Shah

The quickly-expanding nature of published medical literature makes it challenging for clinicians and researchers to keep up with and summarize recent, relevant findings in a timely manner. While several closed-source summarization tools based on large language models (LLMs) now exist, rigorous and systematic evaluations of their outputs are lacking. Furthermore, there is a paucity of high-quality datasets and appropriate benchmark tasks with which to evaluate these tools. We address these issues with four contributions: we release Clinfo.ai, an open-source WebApp that answers clinical questions based on dynamically retrieved scientific literature; we specify an information retrieval and abstractive summarization task to evaluate the performance of such retrieval-augmented LLM systems; we release a dataset of 200 questions and corresponding answers derived from published systematic reviews, which we name PubMed Retrieval and Synthesis (PubMedRS-200); and report benchmark results for Clinfo.ai and other publicly available OpenQA systems on PubMedRS-200.

已发表的医学文献数量迅速增加，这使得临床医生和研究人员及时了解和总结最新的相关研究成果变得十分困难。虽然目前已有几种基于大型语言模型（LLM）的闭源摘要工具，但对其输出结果缺乏严格而系统的评估。此外，用于评估这些工具的高质量数据集和适当的基准任务也非常缺乏。我们通过四项贡献来解决这些问题：我们发布了一个开源 WebApp Clinfo.ai，它可以根据动态检索到的科学文献回答临床问题；我们指定了一个信息检索和抽象摘要任务来评估此类检索增强型 LLM 系统的性能；我们发布了一个包含 200 个问题和相应答案的数据集，这些问题和答案来自已发表的系统性综述，我们将其命名为 PubMed Retrieval and Synthesis (PubMedRS-200)；我们还报告了 Clinfo.ai 和其他公开可用的 OpenQA 系统在 PubMedRS-200 上的基准结果。

引用次数: 0

Session Introduction: Digital health technology data in biocomputing: Research efforts and considerations for expanding access (PSB2024). 会议简介：生物计算中的数字健康技术数据：研究工作和扩大访问的考虑因素（PSB2024）。

Q2 Computer Science

Pacific Symposium on Biocomputing. Pacific Symposium on Biocomputing

Pub Date : 2024-01-01

Michelle Holko, Chris Lunt, Jessilyn Dunn

Data from digital health technologies (DHT), including wearable sensors like Apple Watch, Whoop, Oura Ring, and Fitbit, are increasingly being used in biomedical research. Research and development of DHT-related devices, platforms, and applications is happening rapidly and with significant private-sector involvement with new biotech companies and large tech companies (e.g. Google, Apple, Amazon, Uber) investing heavily in technologies to improve human health. Many academic institutions are building capabilities related to DHT research, often in cross-sector collaboration with technology companies and other organizations with the goal of generating clinically meaningful evidence to improve patient care, to identify users at an earlier stage of disease presentation, and to support health preservation and disease prevention. Large research consortia, cross-sector partnerships, and individual research labs are all represented in the current corpus of published studies. Some of the large research studies, like NIH's All of Us Research Program, make data sets from wearable sensors available to the research community, while the vast majority of data from wearable sensors and other DHTs are held by private sector organizations and are not readily available to the research community. As data are unlocked from the private sector and made available to the academic research community, there is an opportunity to develop innovative analytics and methods through expanded access. This is the second year for this Session which solicited research results leveraging digital health technologies, including wearable sensor data, describing novel analytical methods, and issues related to diversity, equity, inclusion (DEI) of the research, data, and the community of researchers working in this area. We particularly encouraged submissions describing opportunities for expanding and democratizing academic research using data from wearable sensors and related digital health technologies.

来自数字健康技术（DHT）的数据，包括 Apple Watch、Whoop、Oura Ring 和 Fitbit 等可穿戴传感器的数据，正越来越多地被用于生物医学研究。与数字健康技术相关的设备、平台和应用的研究与开发正在快速进行，新兴生物技术公司和大型科技公司（如谷歌、苹果、亚马逊、优步等）大量投资于改善人类健康的技术，私营部门也积极参与其中。许多学术机构正在建设与 DHT 研究相关的能力，通常是与技术公司和其他组织开展跨部门合作，目标是提供有临床意义的证据，以改善患者护理，在疾病的早期阶段识别用户，并支持健康保护和疾病预防。在目前已发表的研究成果中，大型研究联盟、跨部门合作以及单个研究实验室均有体现。一些大型研究，如美国国立卫生研究院的 "我们所有人研究计划"，向研究界提供了来自可穿戴传感器的数据集，而来自可穿戴传感器和其他 DHT 的绝大多数数据都由私营部门组织掌握，不能随时向研究界提供。随着数据从私营部门解锁并提供给学术研究界，有机会通过扩大访问范围来开发创新的分析方法和手段。今年是该会议举办的第二年，会议征集了利用数字健康技术（包括可穿戴传感器数据）的研究成果，介绍了新颖的分析方法，以及与该领域的研究、数据和研究人员群体的多样性、公平性和包容性（DEI）相关的问题。我们特别鼓励在提交的论文中描述利用可穿戴传感器和相关数字健康技术的数据扩大学术研究并使之民主化的机会。

{"title":"Session Introduction: Digital health technology data in biocomputing: Research efforts and considerations for expanding access (PSB2024).","authors":"Michelle Holko, Chris Lunt, Jessilyn Dunn","doi":"","DOIUrl":"","url":null,"abstract":"Data from digital health technologies (DHT), including wearable sensors like Apple Watch, Whoop, Oura Ring, and Fitbit, are increasingly being used in biomedical research. Research and development of DHT-related devices, platforms, and applications is happening rapidly and with significant private-sector involvement with new biotech companies and large tech companies (e.g. Google, Apple, Amazon, Uber) investing heavily in technologies to improve human health. Many academic institutions are building capabilities related to DHT research, often in cross-sector collaboration with technology companies and other organizations with the goal of generating clinically meaningful evidence to improve patient care, to identify users at an earlier stage of disease presentation, and to support health preservation and disease prevention. Large research consortia, cross-sector partnerships, and individual research labs are all represented in the current corpus of published studies. Some of the large research studies, like NIH's All of Us Research Program, make data sets from wearable sensors available to the research community, while the vast majority of data from wearable sensors and other DHTs are held by private sector organizations and are not readily available to the research community. As data are unlocked from the private sector and made available to the academic research community, there is an opportunity to develop innovative analytics and methods through expanded access. This is the second year for this Session which solicited research results leveraging digital health technologies, including wearable sensor data, describing novel analytical methods, and issues related to diversity, equity, inclusion (DEI) of the research, data, and the community of researchers working in this area. We particularly encouraged submissions describing opportunities for expanding and democratizing academic research using data from wearable sensors and related digital health technologies.","PeriodicalId":34954,"journal":{"name":"Pacific Symposium on Biocomputing. Pacific Symposium on Biocomputing","volume":"29 ","pages":"163-169"},"PeriodicalIF":0.0,"publicationDate":"2024-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139075203","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Risk prediction: Methods, Challenges, and Opportunities. 风险预测：方法、挑战和机遇。

Q2 Computer Science

Pacific Symposium on Biocomputing. Pacific Symposium on Biocomputing

Pub Date : 2024-01-01

Ruowang Li, Rui Duan, Lifang He, Jason H Moore

The following sections are included:Introduction to the workshopWorkshop Presenters.

包括以下部分：讲习班简介讲习班主讲人。

引用次数: 0

intCC: An efficient weighted integrative consensus clustering of multimodal data. intCC：多模态数据的高效加权综合共识聚类。

Q2 Computer Science

Pacific Symposium on Biocomputing. Pacific Symposium on Biocomputing

Pub Date : 2024-01-01

Can Huang, Pei Fen Kuan

High throughput profiling of multiomics data provides a valuable resource to better understand the complex human disease such as cancer and to potentially uncover new subtypes. Integrative clustering has emerged as a powerful unsupervised learning framework for subtype discovery. In this paper, we propose an efficient weighted integrative clustering called intCC by combining ensemble method, consensus clustering and kernel learning integrative clustering. We illustrate that intCC can accurately uncover the latent cluster structures via extensive simulation studies and a case study on the TCGA pan cancer datasets. An R package intCC implementing our proposed method is available at https://github.com/candsj/intCC.

多组学数据的高通量剖析为更好地了解癌症等复杂的人类疾病提供了宝贵的资源，并有可能发现新的亚型。整合聚类已成为发现亚型的一个强大的无监督学习框架。在本文中，我们结合了集合方法、共识聚类和核学习整合聚类，提出了一种高效的加权整合聚类，称为 intCC。我们通过大量的模拟研究和对 TCGA 泛癌症数据集的案例研究，说明 intCC 可以准确地发现潜在的聚类结构。实现我们提出的方法的 R 软件包 intCC 可在 https://github.com/candsj/intCC 上获取。

引用次数: 0

Practical Approaches to Enhancing Fairness, Social Responsibility and the Inclusion of Diverse Viewpoints in Biomedicine. 增强生物医学的公平性、社会责任感和多元观点包容性的实用方法。

Q2 Computer Science

Pacific Symposium on Biocomputing. Pacific Symposium on Biocomputing

Pub Date : 2024-01-01

Daphne O Martschenko, Nicole Martinez-Martin, Meghan Halley

The following sections are included:Workshop DescriptionLearning ObjectivesPresenter InformationAbout the Workshop OrganizersPresentationsSpeaker Presentations.

包括以下部分：研讨会简介学习目标主讲人信息关于研讨会组织者演讲人演讲。

引用次数: 0

Impact of Measurement Noise on Genetic Association Studies of Cardiac Function. 测量噪音对心功能遗传关联研究的影响。

Q2 Computer Science

Pacific Symposium on Biocomputing. Pacific Symposium on Biocomputing

Pub Date : 2024-01-01

Milos Vukadinovic, Gauri Renjith, Victoria Yuan, Alan Kwan, Susan C Cheng, Debiao Li, Shoa L Clarke, David Ouyang

Recent research has effectively used quantitative traits from imaging to boost the capabilities of genome-wide association studies (GWAS), providing further understanding of disease biology and various traits. However, it's important to note that phenotyping inherently carries measurement error and noise that could influence subsequent genetic analyses. The study focused on left ventricular ejection fraction (LVEF), a vital yet potentially inaccurate quantitative measurement, to investigate how imprecision in phenotype measurement affects genetic studies. Several methods of acquiring LVEF, along with simulating measurement noise, were assessed for their effects on ensuing genetic analyses. The results showed that by introducing just 7.9% of measurement noise, all genetic associations in an LVEF GWAS with almost forty thousand individuals could be eliminated. Moreover, a 1% increase in mean absolute error (MAE) in LVEF had an effect equivalent to a 10% reduction in the sample size of the cohort on the power of GWAS. Therefore, enhancing the accuracy of phenotyping is crucial to maximize the effectiveness of genome-wide association studies.

最近的研究有效地利用了成像的定量性状来提高全基因组关联研究（GWAS）的能力，从而进一步了解疾病生物学和各种性状。然而，值得注意的是，表型分析本身存在测量误差和噪声，可能会影响后续的遗传分析。这项研究以左心室射血分数（LVEF）为重点，研究表型测量的不精确性如何影响遗传研究。研究人员评估了几种获取 LVEF 的方法以及模拟测量噪音对后续遗传分析的影响。结果显示，只需引入 7.9% 的测量噪声，就能消除近四万人的 LVEF GWAS 中的所有遗传关联。此外，LVEF 平均绝对误差（MAE）每增加 1%，对 GWAS 功率的影响相当于队列样本量减少 10%。因此，提高表型分析的准确性对于最大限度地提高全基因组关联研究的效果至关重要。

{"title":"Impact of Measurement Noise on Genetic Association Studies of Cardiac Function.","authors":"Milos Vukadinovic, Gauri Renjith, Victoria Yuan, Alan Kwan, Susan C Cheng, Debiao Li, Shoa L Clarke, David Ouyang","doi":"","DOIUrl":"","url":null,"abstract":"Recent research has effectively used quantitative traits from imaging to boost the capabilities of genome-wide association studies (GWAS), providing further understanding of disease biology and various traits. However, it's important to note that phenotyping inherently carries measurement error and noise that could influence subsequent genetic analyses. The study focused on left ventricular ejection fraction (LVEF), a vital yet potentially inaccurate quantitative measurement, to investigate how imprecision in phenotype measurement affects genetic studies. Several methods of acquiring LVEF, along with simulating measurement noise, were assessed for their effects on ensuing genetic analyses. The results showed that by introducing just 7.9% of measurement noise, all genetic associations in an LVEF GWAS with almost forty thousand individuals could be eliminated. Moreover, a 1% increase in mean absolute error (MAE) in LVEF had an effect equivalent to a 10% reduction in the sample size of the cohort on the power of GWAS. Therefore, enhancing the accuracy of phenotyping is crucial to maximize the effectiveness of genome-wide association studies.","PeriodicalId":34954,"journal":{"name":"Pacific Symposium on Biocomputing. Pacific Symposium on Biocomputing","volume":"29 ","pages":"134-147"},"PeriodicalIF":0.0,"publicationDate":"2024-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139075171","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

LA-GEM: imputation of gene expression with incorporation of Local Ancestry. LA-GEM：结合当地血统的基因表达估算。

Q2 Computer Science

Pacific Symposium on Biocomputing. Pacific Symposium on Biocomputing

Pub Date : 2024-01-01

Mrinal Mishra, Layan Nahlawi, Yizhen Zhong, Tanima De, Guang Yang, Cristina Alarcon, Minoli A Perera

Gene imputation and TWAS have become a staple in the genomics medicine discovery space; helping to identify genes whose regulation effects may contribute to disease susceptibility. However, the cohorts on which these methods are built are overwhelmingly of European Ancestry. This means that the unique regulatory variation that exist in non-European populations, specifically African Ancestry populations, may not be included in the current models. Moreover, African Americans are an admixed population, with a mix of European and African segments within their genome. No gene imputation model thus far has incorporated the effect of local ancestry (LA) on gene expression imputation. As such, we created LA-GEM which was trained and tested on a cohort of 60 African American hepatocyte primary cultures. Uniquely, LA-GEM include local ancestry inference in its prediction of gene expression. We compared the performance of LA-GEM to PrediXcan trained the same dataset (with no inclusion of local ancestry) We were able to reliably predict the expression of 2559 genes (1326 in LA-GEM and 1236 in PrediXcan). Of these, 546 genes were unique to LA-GEM, including the CYP3A5 gene which is critical to drug metabolism. We conducted TWAS analysis on two African American clinical cohorts with pharmacogenomics phenotypic information to identity novel gene associations. In our IWPC warfarin cohort, we identified 17 transcriptome-wide significant hits. No gene reached are prespecified significance level in the clopidogrel cohort. We did see suggestive association with RAS3A to P2RY12 Reactivity Units (PRU), a clinical measure of response to anti-platelet therapy. This method demonstrated the need for the incorporation of LA into study in admixed populations.

基因归因和 TWAS 已成为基因组学医学发现领域的主要方法，有助于确定其调节作用可能导致疾病易感性的基因。然而，这些方法所依据的队列绝大多数是欧洲血统。这就意味着，非欧洲血统人群，特别是非洲血统人群中存在的独特调控变异可能不会被纳入当前的模型中。此外，非裔美国人是一个混血群体，他们的基因组中既有欧洲人的片段，也有非洲人的片段。迄今为止，还没有一个基因归因模型包含本地祖先（LA）对基因表达归因的影响。因此，我们创建了 LA-GEM，并在 60 个非裔美国人肝细胞原代培养物队列中进行了训练和测试。与众不同的是，LA-GEM 在预测基因表达时包含了本地祖先推断。我们将 LA-GEM 的性能与 PrediXcan 的性能进行了比较，后者训练了相同的数据集（不包含本地祖先）。我们能够可靠地预测 2559 个基因的表达（LA-GEM 预测了 1326 个，PrediXcan 预测了 1236 个）。其中，546 个基因是 LA-GEM 独有的，包括对药物代谢至关重要的 CYP3A5 基因。我们对两个具有药物基因组学表型信息的非裔美国人临床队列进行了 TWAS 分析，以确定新的基因关联。在我们的 IWPC 华法林队列中，我们发现了 17 个转录组范围内的重要基因。在氯吡格雷队列中，没有基因达到预设的显著性水平。我们确实发现了 RAS3A 与 P2RY12 反应单位 (PRU) 的提示性关联，P2RY12 反应单位是抗血小板治疗反应的临床指标。这种方法表明，有必要将 LA 纳入混血人群的研究中。

{"title":"LA-GEM: imputation of gene expression with incorporation of Local Ancestry.","authors":"Mrinal Mishra, Layan Nahlawi, Yizhen Zhong, Tanima De, Guang Yang, Cristina Alarcon, Minoli A Perera","doi":"","DOIUrl":"","url":null,"abstract":"Gene imputation and TWAS have become a staple in the genomics medicine discovery space; helping to identify genes whose regulation effects may contribute to disease susceptibility. However, the cohorts on which these methods are built are overwhelmingly of European Ancestry. This means that the unique regulatory variation that exist in non-European populations, specifically African Ancestry populations, may not be included in the current models. Moreover, African Americans are an admixed population, with a mix of European and African segments within their genome. No gene imputation model thus far has incorporated the effect of local ancestry (LA) on gene expression imputation. As such, we created LA-GEM which was trained and tested on a cohort of 60 African American hepatocyte primary cultures. Uniquely, LA-GEM include local ancestry inference in its prediction of gene expression. We compared the performance of LA-GEM to PrediXcan trained the same dataset (with no inclusion of local ancestry) We were able to reliably predict the expression of 2559 genes (1326 in LA-GEM and 1236 in PrediXcan). Of these, 546 genes were unique to LA-GEM, including the CYP3A5 gene which is critical to drug metabolism. We conducted TWAS analysis on two African American clinical cohorts with pharmacogenomics phenotypic information to identity novel gene associations. In our IWPC warfarin cohort, we identified 17 transcriptome-wide significant hits. No gene reached are prespecified significance level in the clopidogrel cohort. We did see suggestive association with RAS3A to P2RY12 Reactivity Units (PRU), a clinical measure of response to anti-platelet therapy. This method demonstrated the need for the incorporation of LA into study in admixed populations.","PeriodicalId":34954,"journal":{"name":"Pacific Symposium on Biocomputing. Pacific Symposium on Biocomputing","volume":"29 ","pages":"341-358"},"PeriodicalIF":0.0,"publicationDate":"2024-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10764069/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139075175","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Session Introduction: Drug-repurposing and discovery in the era of "big" real-world data: how the incorporation of observational data, genetics, and other -omic technologies can move us forward. 会议简介：大 "真实世界数据时代的药物再利用和发现：观察数据、遗传学和其他原子技术如何推动我们前进。

Q2 Computer Science

Pacific Symposium on Biocomputing. Pacific Symposium on Biocomputing

Pub Date : 2024-01-01

Megan M Shuey, Jacklyn N Hellwege, Nikhil Khankari, Marijana Vujkovic, Todd L Edwards

This PSB 2024 session discusses the many broad biological, computational, and statistical approaches currently being used for therapeutic drug target identification and repurposing of existing treatments. Drug repurposing efforts have the potential to dramatically improve the treatment landscape by more rapidly identifying drug targets and alternative strategies for untreated or poorly managed diseases. The overarching theme for this session is the use and integration of real-world data to identify drug-disease pairs with potential therapeutic use. These drug-disease pairs may be identified through genomic, proteomic, biomarkers, protein interaction analyses, electronic health records, and chemical profiling. Taken together, this session combines novel applications of methods and innovative modeling strategies with diverse real-world data to suggest new pharmaceutical treatments for human diseases.

本次 PSB 2024 会议将讨论目前用于治疗药物靶点识别和现有疗法再利用的许多广泛的生物、计算和统计方法。通过更快速地识别药物靶点和针对未治疗或治疗效果不佳疾病的替代策略，药物再利用工作有可能极大地改善治疗状况。本次会议的首要主题是使用和整合真实世界的数据，以确定具有潜在治疗用途的药物-疾病配对。这些药物-疾病配对可通过基因组、蛋白质组、生物标记物、蛋白质相互作用分析、电子健康记录和化学特征分析来确定。总之，本环节将新方法的应用和创新建模策略与各种真实世界数据相结合，为人类疾病提出新的药物治疗建议。

引用次数: 0

首页上一页

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

Pacific Symposium on Biocomputing. Pacific Symposium on Biocomputing

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀