首页 > 最新文献

Pacific Symposium on Biocomputing. Pacific Symposium on Biocomputing最新文献

英文 中文
FairPRS: adjusting for admixed populations in polygenic risk scores using invariant risk minimization. FairPRS:使用不变量风险最小化方法调整多基因风险评分中的混杂人群。
Diego Machado Reyes, Aritra Bose, Ehud Karavani, Laxmi Parida

Polygenic risk scores (PRS) are increasingly used to estimate the personal risk of a trait based on genetics. However, most genomic cohorts are of European populations, with a strong under-representation of non-European groups. Given that PRS poorly transport across racial groups, this has the potential to exacerbate health disparities if used in clinical care. Hence there is a need to generate PRS that perform comparably across ethnic groups. Borrowing from recent advancements in the domain adaption field of machine learning, we propose FairPRS - an Invariant Risk Minimization (IRM) approach for estimating fair PRS or debiasing a pre-computed PRS. We test our method on both a diverse set of synthetic data and real data from the UK Biobank. We show our method can create ancestry-invariant PRS distributions that are both racially unbiased and largely improve phenotype prediction. We hope that FairPRS will contribute to a fairer characterization of patients by genetics rather than by race.

多基因风险评分(PRS)越来越多地用于根据遗传学估算某种性状的个人风险。然而,大多数基因组队列都是欧洲人群,非欧洲人群的代表性严重不足。鉴于 PRS 在不同种族群体之间的迁移性较差,如果用于临床护理,有可能会加剧健康差异。因此,有必要生成跨种族群体具有可比性的 PRS。借鉴机器学习领域适应性方面的最新进展,我们提出了公平PRS--一种用于估计公平PRS或去除预先计算的PRS的不变风险最小化(IRM)方法。我们在一组不同的合成数据和英国生物库的真实数据上测试了我们的方法。结果表明,我们的方法可以创建种族无偏的祖先不变 PRS 分布,并在很大程度上改善表型预测。我们希望 FairPRS 将有助于根据遗传学而非种族对患者进行更公平的特征描述。
{"title":"FairPRS: adjusting for admixed populations in polygenic risk scores using invariant risk minimization.","authors":"Diego Machado Reyes, Aritra Bose, Ehud Karavani, Laxmi Parida","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>Polygenic risk scores (PRS) are increasingly used to estimate the personal risk of a trait based on genetics. However, most genomic cohorts are of European populations, with a strong under-representation of non-European groups. Given that PRS poorly transport across racial groups, this has the potential to exacerbate health disparities if used in clinical care. Hence there is a need to generate PRS that perform comparably across ethnic groups. Borrowing from recent advancements in the domain adaption field of machine learning, we propose FairPRS - an Invariant Risk Minimization (IRM) approach for estimating fair PRS or debiasing a pre-computed PRS. We test our method on both a diverse set of synthetic data and real data from the UK Biobank. We show our method can create ancestry-invariant PRS distributions that are both racially unbiased and largely improve phenotype prediction. We hope that FairPRS will contribute to a fairer characterization of patients by genetics rather than by race.</p>","PeriodicalId":34954,"journal":{"name":"Pacific Symposium on Biocomputing. Pacific Symposium on Biocomputing","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10804441/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9207311","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Session Introduction: Graph Representations and Algorithms in Biomedicine. 会议介绍:生物医学中的图形表示和算法。
Brianna Chrisman, Maya Varma, Sepideh Maleki, Maria Brbic, Cliff Joslyn, Marinka Zitnik

The following sections are included: Introduction, Understanding and Predicting Molecular Networks, Understanding and Predicting Molecular Networks, Making Use of Family Structure, Applying Traditional Graph Algorithms to Novel Tasks, Representing Uncertainty in Networks, Conclusion, References.

以下部分包括:介绍、理解和预测分子网络、理解和预测分子网络、利用族结构、将传统图算法应用于新任务、表示网络中的不确定性、结论、参考文献。
{"title":"Session Introduction: Graph Representations and Algorithms in Biomedicine.","authors":"Brianna Chrisman,&nbsp;Maya Varma,&nbsp;Sepideh Maleki,&nbsp;Maria Brbic,&nbsp;Cliff Joslyn,&nbsp;Marinka Zitnik","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>The following sections are included: Introduction, Understanding and Predicting Molecular Networks, Understanding and Predicting Molecular Networks, Making Use of Family Structure, Applying Traditional Graph Algorithms to Novel Tasks, Representing Uncertainty in Networks, Conclusion, References.</p>","PeriodicalId":34954,"journal":{"name":"Pacific Symposium on Biocomputing. Pacific Symposium on Biocomputing","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10441694","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Development and application of a computable genotype model in the GA4GH Variation Representation Specification. 在 GA4GH 变异表示规范中开发和应用可计算基因型模型。
Wesley Goar, Lawrence Babb, Srikar Chamala, Melissa Cline, Robert R Freimuth, Reece K Hart, Kori Kuzma, Jennifer Lee, Tristan Nelson, Andreas Prlić, Kevin Riehle, Anastasia Smith, Kathryn Stahl, Andrew D Yates, Heidi L Rehm, Alex H Wagner

As the diversity of genomic variation data increases with our growing understanding of the role of variation in health and disease, it is critical to develop standards for precise inter-system exchange of these data for research and clinical applications. The Global Alliance for Genomics and Health (GA4GH) Variation Representation Specification (VRS) meets this need through a technical terminology and information model for disambiguating and concisely representing variation concepts. Here we discuss the recent Genotype model in VRS, which may be used to represent the allelic composition of a genetic locus. We demonstrate the use of the Genotype model and the constituent Haplotype model for the precise and interoperable representation of pharmacogenomic diplotypes, HGVS variants, and VCF records using VRS and discuss how this can be leveraged to enable interoperable exchange and search operations between assayed variation and genomic knowledgebases.

随着我们对变异在健康和疾病中的作用的认识不断加深,基因组变异数据的多样性也随之增加,因此,为研究和临床应用制定系统间精确交换这些数据的标准至关重要。全球基因组学与健康联盟(GA4GH)的变异表示规范(VRS)通过技术术语和信息模型满足了这一需求,该模型可用于消除歧义并简明扼要地表示变异概念。在此,我们将讨论 VRS 中最新的基因型模型,该模型可用于表示遗传位点的等位基因组成。我们演示了如何利用基因型模型及其组成的单倍型模型,使用 VRS 精确、互操作地表示药物基因组的双倍型、HGVS 变异和 VCF 记录,并讨论了如何利用该模型在化验变异和基因组知识库之间实现互操作的交换和搜索操作。
{"title":"Development and application of a computable genotype model in the GA4GH Variation Representation Specification.","authors":"Wesley Goar, Lawrence Babb, Srikar Chamala, Melissa Cline, Robert R Freimuth, Reece K Hart, Kori Kuzma, Jennifer Lee, Tristan Nelson, Andreas Prlić, Kevin Riehle, Anastasia Smith, Kathryn Stahl, Andrew D Yates, Heidi L Rehm, Alex H Wagner","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>As the diversity of genomic variation data increases with our growing understanding of the role of variation in health and disease, it is critical to develop standards for precise inter-system exchange of these data for research and clinical applications. The Global Alliance for Genomics and Health (GA4GH) Variation Representation Specification (VRS) meets this need through a technical terminology and information model for disambiguating and concisely representing variation concepts. Here we discuss the recent Genotype model in VRS, which may be used to represent the allelic composition of a genetic locus. We demonstrate the use of the Genotype model and the constituent Haplotype model for the precise and interoperable representation of pharmacogenomic diplotypes, HGVS variants, and VCF records using VRS and discuss how this can be leveraged to enable interoperable exchange and search operations between assayed variation and genomic knowledgebases.</p>","PeriodicalId":34954,"journal":{"name":"Pacific Symposium on Biocomputing. Pacific Symposium on Biocomputing","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ftp.ncbi.nlm.nih.gov/pub/pmc/oa_pdf/d8/6e/nihms-1853002.PMC9782714.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9698793","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Prediction of Kinase-Substrate Associations Using The Functional Landscape of Kinases and Phosphorylation Sites. 利用激酶和磷酸化位点的功能图谱预测激酶与底物的联系
Marzieh Ayati, Serhan Yilmaz, Filipa Blasco Tavares Pereira Lopes, Mark Chance, Mehmet Koyuturk

Protein phosphorylation is a key post-translational modification that plays a central role in many cellular processes. With recent advances in biotechnology, thousands of phosphorylated sites can be identified and quantified in a given sample, enabling proteome-wide screening of cellular signaling. However, for most (> 90%) of the phosphorylation sites that are identified in these experiments, the kinase(s) that target these sites are unknown. To broadly utilize available structural, functional, evolutionary, and contextual information in predicting kinase-substrate associations (KSAs), we develop a network-based machine learning framework. Our framework integrates a multitude of data sources to characterize the landscape of functional relationships and associations among phosphosites and kinases. To construct a phosphosite-phosphosite association network, we use sequence similarity, shared biological pathways, co-evolution, co-occurrence, and co-phosphorylation of phosphosites across different biological states. To construct a kinase-kinase association network, we integrate protein-protein interactions, shared biological pathways, and membership in common kinase families. We use node embeddings computed from these heterogeneous networks to train machine learning models for predicting kinase-substrate associations. Our systematic computational experiments using the PhosphositePLUS database shows that the resulting algorithm, NetKSA, outperforms two state-of-the-art algorithms, including KinomeXplorer and LinkPhinder, in overall KSA prediction. By stratifying the ranking of kinases, NetKSA also enables annotation of phosphosites that are targeted by relatively less-studied kinases.Availability: The code and data are available at compbio.case.edu/NetKSA/.

蛋白质磷酸化是一种关键的翻译后修饰,在许多细胞过程中发挥着核心作用。随着生物技术的不断进步,数千个磷酸化位点可以在给定样本中被鉴定和量化,从而实现了对细胞信号的全蛋白质组筛选。然而,在这些实验中确定的大多数(> 90%)磷酸化位点,以这些位点为靶点的激酶都是未知的。为了广泛利用现有的结构、功能、进化和上下文信息来预测激酶-底物关联(KSA),我们开发了一个基于网络的机器学习框架。我们的框架整合了多种数据源,以描述磷酸化位点和激酶之间的功能关系和关联。为了构建磷酸化位点-磷酸化位点关联网络,我们使用了序列相似性、共享生物途径、共同进化、共同发生以及不同生物状态下磷酸化位点的共同磷酸化。为了构建激酶-激酶关联网络,我们整合了蛋白质-蛋白质相互作用、共享生物途径和共同激酶家族成员资格。我们利用从这些异构网络中计算出的节点嵌入来训练机器学习模型,以预测激酶与底物的关联。我们使用 PhosphositePLUS 数据库进行的系统计算实验表明,NetKSA 算法在整体 KSA 预测方面优于 KinomeXplorer 和 LinkPhinder 这两种最先进的算法。通过对激酶进行分层排序,NetKSA还能对研究相对较少的激酶靶向的磷酸位点进行注释:代码和数据可在compbio.case.edu/NetKSA/获取。
{"title":"Prediction of Kinase-Substrate Associations Using The Functional Landscape of Kinases and Phosphorylation Sites.","authors":"Marzieh Ayati, Serhan Yilmaz, Filipa Blasco Tavares Pereira Lopes, Mark Chance, Mehmet Koyuturk","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>Protein phosphorylation is a key post-translational modification that plays a central role in many cellular processes. With recent advances in biotechnology, thousands of phosphorylated sites can be identified and quantified in a given sample, enabling proteome-wide screening of cellular signaling. However, for most (> 90%) of the phosphorylation sites that are identified in these experiments, the kinase(s) that target these sites are unknown. To broadly utilize available structural, functional, evolutionary, and contextual information in predicting kinase-substrate associations (KSAs), we develop a network-based machine learning framework. Our framework integrates a multitude of data sources to characterize the landscape of functional relationships and associations among phosphosites and kinases. To construct a phosphosite-phosphosite association network, we use sequence similarity, shared biological pathways, co-evolution, co-occurrence, and co-phosphorylation of phosphosites across different biological states. To construct a kinase-kinase association network, we integrate protein-protein interactions, shared biological pathways, and membership in common kinase families. We use node embeddings computed from these heterogeneous networks to train machine learning models for predicting kinase-substrate associations. Our systematic computational experiments using the PhosphositePLUS database shows that the resulting algorithm, NetKSA, outperforms two state-of-the-art algorithms, including KinomeXplorer and LinkPhinder, in overall KSA prediction. By stratifying the ranking of kinases, NetKSA also enables annotation of phosphosites that are targeted by relatively less-studied kinases.Availability: The code and data are available at compbio.case.edu/NetKSA/.</p>","PeriodicalId":34954,"journal":{"name":"Pacific Symposium on Biocomputing. Pacific Symposium on Biocomputing","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ftp.ncbi.nlm.nih.gov/pub/pmc/oa_pdf/b4/25/nihms-1852984.PMC9782723.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10276758","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Accessing clinical-grade genomic classification data through the ClinGen Data Platform. 通过ClinGen数据平台访问临床级基因组分类数据。
Karen P Dalton, Heidi L Rehm, Matt W Wright, Mark E Mandell, Kilannin Krysiak, Lawrence Babb, Kevin Riehle, Tristan Nelson, Alex H Wagner

The Clinical Genome Resource (ClinGen) serves as an authoritative resource on the clinical relevance of genes and variants. In order to support our curation activities and to disseminate our findings to the community, we have developed a Data Platform of informatics resources backed by standardized data models. In this workshop we demonstrate our publicly available resources including curation interfaces, (Variant Curation Interface, CIViC), supporting infrastructure (Allele Registry, Genegraph), and data models (SEPIO, GA4GH VRS, VA).

临床基因组资源(ClinGen)是基因和变异临床相关性的权威资源。为了支持我们的管理活动并向社区传播我们的发现,我们开发了一个由标准化数据模型支持的信息学资源数据平台。在本次研讨会中,我们展示了我们公开可用的资源,包括管理接口(变种管理接口,CIViC),支持基础设施(等位基因注册,Genegraph)和数据模型(SEPIO, GA4GH VRS, VA)。
{"title":"Accessing clinical-grade genomic classification data through the ClinGen Data Platform.","authors":"Karen P Dalton,&nbsp;Heidi L Rehm,&nbsp;Matt W Wright,&nbsp;Mark E Mandell,&nbsp;Kilannin Krysiak,&nbsp;Lawrence Babb,&nbsp;Kevin Riehle,&nbsp;Tristan Nelson,&nbsp;Alex H Wagner","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>The Clinical Genome Resource (ClinGen) serves as an authoritative resource on the clinical relevance of genes and variants. In order to support our curation activities and to disseminate our findings to the community, we have developed a Data Platform of informatics resources backed by standardized data models. In this workshop we demonstrate our publicly available resources including curation interfaces, (Variant Curation Interface, CIViC), supporting infrastructure (Allele Registry, Genegraph), and data models (SEPIO, GA4GH VRS, VA).</p>","PeriodicalId":34954,"journal":{"name":"Pacific Symposium on Biocomputing. Pacific Symposium on Biocomputing","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ftp.ncbi.nlm.nih.gov/pub/pmc/oa_pdf/b0/9c/nihms-1891029.PMC10114895.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9684109","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Predictive modeling using shape statistics for interpretable and robust quality assurance of automated contours in radiation treatment planning. 利用形状统计进行预测建模,用于放射治疗计划中自动轮廓的可解释性和鲁棒性质量保证。
Zachary T Wooten, Cenji Yu, Laurence E Court, Christine B Peterson

Deep learning methods for image segmentation and contouring are gaining prominence as an automated approach for delineating anatomical structures in medical images during radiation treatment planning. These contours are used to guide radiotherapy treatment planning, so it is important that contouring errors are flagged before they are used for planning. This creates a need for effective quality assurance methods to enable the clinical use of automated contours in radiotherapy. We propose a novel method for contour quality assurance that requires only shape features, making it independent of the platform used to obtain the images. Our method uses a random forest classifier to identify low-quality contours. On a dataset of 312 kidney contours, our method achieved a cross-validated area under the curve of 0.937 in identifying unacceptable contours. We applied our method to an unlabeled validation dataset of 36 kidney contours. We flagged 6 contours which were then reviewed by a cervix contour specialist, who found that 4 of the 6 contours contained errors. We used Shapley values to characterize the specific shape features that contributed to each contour being flagged, providing a starting point for characterizing the source of the contouring error. These promising results suggest our method is feasible for quality assurance of automated radiotherapy contours.

图像分割和轮廓的深度学习方法作为在放射治疗计划期间描绘医学图像中的解剖结构的自动化方法正在获得突出地位。这些轮廓用于指导放射治疗计划,因此在用于计划之前标记轮廓错误是很重要的。这就需要有效的质量保证方法,以便在放射治疗中临床使用自动轮廓。我们提出了一种新的轮廓质量保证方法,它只需要形状特征,使其独立于用于获取图像的平台。我们的方法使用随机森林分类器来识别低质量的轮廓。在312个肾脏轮廓的数据集上,我们的方法在识别不可接受轮廓时获得了0.937的曲线下交叉验证面积。我们将我们的方法应用于36个肾脏轮廓的未标记验证数据集。我们标记了6个轮廓,然后由宫颈轮廓专家检查,他发现6个轮廓中有4个包含错误。我们使用Shapley值来描述导致每个轮廓被标记的特定形状特征,为描述轮廓误差的来源提供了一个起点。这些有希望的结果表明,我们的方法是可行的质量保证自动放疗轮廓线。
{"title":"Predictive modeling using shape statistics for interpretable and robust quality assurance of automated contours in radiation treatment planning.","authors":"Zachary T Wooten,&nbsp;Cenji Yu,&nbsp;Laurence E Court,&nbsp;Christine B Peterson","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>Deep learning methods for image segmentation and contouring are gaining prominence as an automated approach for delineating anatomical structures in medical images during radiation treatment planning. These contours are used to guide radiotherapy treatment planning, so it is important that contouring errors are flagged before they are used for planning. This creates a need for effective quality assurance methods to enable the clinical use of automated contours in radiotherapy. We propose a novel method for contour quality assurance that requires only shape features, making it independent of the platform used to obtain the images. Our method uses a random forest classifier to identify low-quality contours. On a dataset of 312 kidney contours, our method achieved a cross-validated area under the curve of 0.937 in identifying unacceptable contours. We applied our method to an unlabeled validation dataset of 36 kidney contours. We flagged 6 contours which were then reviewed by a cervix contour specialist, who found that 4 of the 6 contours contained errors. We used Shapley values to characterize the specific shape features that contributed to each contour being flagged, providing a starting point for characterizing the source of the contouring error. These promising results suggest our method is feasible for quality assurance of automated radiotherapy contours.</p>","PeriodicalId":34954,"journal":{"name":"Pacific Symposium on Biocomputing. Pacific Symposium on Biocomputing","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ftp.ncbi.nlm.nih.gov/pub/pmc/oa_pdf/97/ce/nihms-1888738.PMC10091357.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9287536","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
TRANS-OMIC KNOWLEDGE TRANSFER MODELING INFERS GUT MICROBIOME BIOMARKERS OF ANTI-TNF RESISTANCE IN ULCERATIVE COLITIS. 反组知识转移模型推断溃疡性结肠炎中抗肿瘤坏死因子耐药的肠道微生物组生物标志物。
Alan Trinh, Ran Ran, Douglas K Brubaker

A critical challenge in analyzing multi-omics data from clinical cohorts is the re-use of these valuable datasets to answer biological questions beyond the scope of the original study. Transfer Learning and Knowledge Transfer approaches are machine learning methods that leverage knowledge gained in one domain to solve a problem in another. Here, we address the challenge of developing Knowledge Transfer approaches to map trans-omic information from a multi-omic clinical cohort to another cohort in which a novel phenotype is measured. Our test case is that of predicting gut microbiome and gut metabolite biomarkers of resistance to anti-TNF therapy in Ulcerative Colitis patients. Three approaches are proposed for Trans-omic Knowledge Transfer, and the resulting performance and downstream inferred biomarkers are compared to identify efficacious methods. We find that multiple approaches reveal similar metabolite and microbial biomarkers of anti-TNF resistance and that these commonly implicated biomarkers can be validated in literature analysis. Overall, we demonstrate a promising approach to maximize the value of the investment in large clinical multi-omics studies by re-using these data to answer biological and clinical questions not posed in the original study.

分析来自临床队列的多组学数据的一个关键挑战是重新使用这些有价值的数据集来回答超出原始研究范围的生物学问题。迁移学习和知识迁移方法是利用在一个领域获得的知识来解决另一个领域的问题的机器学习方法。在这里,我们解决了开发知识转移方法的挑战,将跨基因组信息从多组临床队列映射到另一个测量新表型的队列。我们的测试案例是预测溃疡性结肠炎患者对抗肿瘤坏死因子治疗耐药的肠道微生物组和肠道代谢物生物标志物。本文提出了三种跨组知识转移的方法,并比较了结果的性能和下游推断的生物标志物,以确定有效的方法。我们发现多种方法揭示了抗tnf耐药的相似代谢物和微生物生物标志物,这些通常涉及的生物标志物可以在文献分析中得到验证。总的来说,我们展示了一种有希望的方法,通过重新使用这些数据来回答原始研究中未提出的生物学和临床问题,从而最大化大型临床多组学研究的投资价值。
{"title":"TRANS-OMIC KNOWLEDGE TRANSFER MODELING INFERS GUT MICROBIOME BIOMARKERS OF ANTI-TNF RESISTANCE IN ULCERATIVE COLITIS.","authors":"Alan Trinh,&nbsp;Ran Ran,&nbsp;Douglas K Brubaker","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>A critical challenge in analyzing multi-omics data from clinical cohorts is the re-use of these valuable datasets to answer biological questions beyond the scope of the original study. Transfer Learning and Knowledge Transfer approaches are machine learning methods that leverage knowledge gained in one domain to solve a problem in another. Here, we address the challenge of developing Knowledge Transfer approaches to map trans-omic information from a multi-omic clinical cohort to another cohort in which a novel phenotype is measured. Our test case is that of predicting gut microbiome and gut metabolite biomarkers of resistance to anti-TNF therapy in Ulcerative Colitis patients. Three approaches are proposed for Trans-omic Knowledge Transfer, and the resulting performance and downstream inferred biomarkers are compared to identify efficacious methods. We find that multiple approaches reveal similar metabolite and microbial biomarkers of anti-TNF resistance and that these commonly implicated biomarkers can be validated in literature analysis. Overall, we demonstrate a promising approach to maximize the value of the investment in large clinical multi-omics studies by re-using these data to answer biological and clinical questions not posed in the original study.</p>","PeriodicalId":34954,"journal":{"name":"Pacific Symposium on Biocomputing. Pacific Symposium on Biocomputing","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10443515","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Multi-treatment Effect Estimation from Biomedical Data. 基于生物医学数据的多处理效果估计。
Raquel Aoki, Yizhou Chen, Martin Ester

Several biomedical applications contain multiple treatments from which we want to estimate the causal effect on a given outcome. Most existing Causal Inference methods, however, focus on single treatments. In this work, we propose a neural network that adopts a multi-task learning approach to estimate the effect of multiple treatments. We validated M3E2 in three synthetic benchmark datasets that mimic biomedical datasets. Our analysis showed that our method makes more accurate estimations than existing baselines.

几种生物医学应用包含多种治疗方法,我们希望从中估计对给定结果的因果效应。然而,大多数现有的因果推理方法都集中在单一的处理上。在这项工作中,我们提出了一个采用多任务学习方法的神经网络来估计多种治疗的效果。我们在模拟生物医学数据集的三个合成基准数据集中验证了M3E2。我们的分析表明,我们的方法比现有的基线做出更准确的估计。
{"title":"Multi-treatment Effect Estimation from Biomedical Data.","authors":"Raquel Aoki,&nbsp;Yizhou Chen,&nbsp;Martin Ester","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>Several biomedical applications contain multiple treatments from which we want to estimate the causal effect on a given outcome. Most existing Causal Inference methods, however, focus on single treatments. In this work, we propose a neural network that adopts a multi-task learning approach to estimate the effect of multiple treatments. We validated M3E2 in three synthetic benchmark datasets that mimic biomedical datasets. Our analysis showed that our method makes more accurate estimations than existing baselines.</p>","PeriodicalId":34954,"journal":{"name":"Pacific Symposium on Biocomputing. Pacific Symposium on Biocomputing","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10443516","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Session Introduction: Precision Medicine: Using Artificial Intelligence to Improve Diagnostics and Healthcare. 会议介绍:精准医疗:利用人工智能改善诊断和医疗保健。
Michelle Whirl-Carrillo, Steven E Brenner, Jonathan H Chen, Dana C Crawford, Łukasz Kidziński, David Ouyang, Roxana Daneshjou

Precision medicine requires a deep understanding of complex biomedical and healthcare data, which is being generated at exponential rates and increasingly made available through public biobanks, electronic medical record systems and biomedical databases and knowledgebases. The complexity and sheer amount of data prohibit manual manipulation. Instead, the field depends on artificial intelligence approaches to parse, annotate, evaluate and interpret the data to enable applications to patient healthcare At the 2023 Pacific Symposium on Biocomputing (PSB) session entitled "Precision Medicine: Using Artificial Intelligence (AI) to improve diagnostics and healthcare", we spotlight research that develops and applies computational methodologies to solve biomedical problems.

精准医疗需要对复杂的生物医学和医疗保健数据有深刻的理解,这些数据正以指数级的速度产生,并越来越多地通过公共生物银行、电子病历系统、生物医学数据库和知识库提供。数据的复杂性和庞大的数量禁止人工操作。相反,该领域依赖于人工智能方法来解析、注释、评估和解释数据,以使应用于患者医疗保健。在2023年太平洋生物计算研讨会(PSB)题为“精准医学:使用人工智能(AI)改善诊断和医疗保健”的会议上,我们重点关注开发和应用计算方法来解决生物医学问题的研究。
{"title":"Session Introduction: Precision Medicine: Using Artificial Intelligence to Improve Diagnostics and Healthcare.","authors":"Michelle Whirl-Carrillo,&nbsp;Steven E Brenner,&nbsp;Jonathan H Chen,&nbsp;Dana C Crawford,&nbsp;Łukasz Kidziński,&nbsp;David Ouyang,&nbsp;Roxana Daneshjou","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>Precision medicine requires a deep understanding of complex biomedical and healthcare data, which is being generated at exponential rates and increasingly made available through public biobanks, electronic medical record systems and biomedical databases and knowledgebases. The complexity and sheer amount of data prohibit manual manipulation. Instead, the field depends on artificial intelligence approaches to parse, annotate, evaluate and interpret the data to enable applications to patient healthcare At the 2023 Pacific Symposium on Biocomputing (PSB) session entitled \"Precision Medicine: Using Artificial Intelligence (AI) to improve diagnostics and healthcare\", we spotlight research that develops and applies computational methodologies to solve biomedical problems.</p>","PeriodicalId":34954,"journal":{"name":"Pacific Symposium on Biocomputing. Pacific Symposium on Biocomputing","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10443517","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
HIGH-PERFORMANCE COMPUTING MEETS HIGH-PERFORMANCE MEDICINE. 高性能计算遇上高性能医学。
Anurag Verma, Jennifer Huffman, Ali Torkamani, Ravi Madduri

The following sections are included: Introduction, Background, and Motivation, Workshop Presenters, References.

包括以下部分:介绍,背景和动机,研讨会主持人,参考资料。
{"title":"HIGH-PERFORMANCE COMPUTING MEETS HIGH-PERFORMANCE MEDICINE.","authors":"Anurag Verma,&nbsp;Jennifer Huffman,&nbsp;Ali Torkamani,&nbsp;Ravi Madduri","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>The following sections are included: Introduction, Background, and Motivation, Workshop Presenters, References.</p>","PeriodicalId":34954,"journal":{"name":"Pacific Symposium on Biocomputing. Pacific Symposium on Biocomputing","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10443522","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Pacific Symposium on Biocomputing. Pacific Symposium on Biocomputing
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1