Optimizing design of genomics studies for clonal evolution analysis.

IF 2.4 Q2 MATHEMATICAL & COMPUTATIONAL BIOLOGY Bioinformatics advances Pub Date : 2024-12-02 eCollection Date: 2024-01-01 DOI:10.1093/bioadv/vbae193
Arjun Srivatsa, Russell Schwartz
{"title":"Optimizing design of genomics studies for clonal evolution analysis.","authors":"Arjun Srivatsa, Russell Schwartz","doi":"10.1093/bioadv/vbae193","DOIUrl":null,"url":null,"abstract":"<p><strong>Motivation: </strong>Genomic biotechnology has rapidly advanced, allowing for the inference and modification of genetic and epigenetic information at the single-cell level. While these tools hold enormous potential for basic and clinical research, they also raise difficult issues of how to design studies to deploy them most effectively. In designing a genomic study, a modern researcher might combine many sequencing modalities and sampling protocols, each with different utility, costs, and other tradeoffs. This is especially relevant for studies of somatic variation, which may involve highly heterogeneous cell populations whose differences can be probed <i>via</i> an extensive set of biotechnological tools. Efficiently deploying genomic technologies in this space will require principled ways to create study designs that recover desired genomic information while minimizing various measures of cost.</p><p><strong>Results: </strong>The central problem this paper attempts to address is how one might create an optimal study design for a genomic analysis, with particular focus on studies involving somatic variation that occur most often with application to cancer genomics. We pose the study design problem as a stochastic constrained nonlinear optimization problem. We introduce a Bayesian optimization framework that iteratively optimizes for an objective function using surrogate modeling combined with pattern and gradient search. We demonstrate our procedure on several test cases to derive resource and study design allocations optimized for various goals and criteria, demonstrating its ability to optimize study designs efficiently across diverse scenarios.</p><p><strong>Availability and implementation: </strong>https://github.com/CMUSchwartzLab/StudyDesignOptimization.</p>","PeriodicalId":72368,"journal":{"name":"Bioinformatics advances","volume":"4 1","pages":"vbae193"},"PeriodicalIF":2.4000,"publicationDate":"2024-12-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11645549/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Bioinformatics advances","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1093/bioadv/vbae193","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/1/1 0:00:00","PubModel":"eCollection","JCR":"Q2","JCRName":"MATHEMATICAL & COMPUTATIONAL BIOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

Motivation: Genomic biotechnology has rapidly advanced, allowing for the inference and modification of genetic and epigenetic information at the single-cell level. While these tools hold enormous potential for basic and clinical research, they also raise difficult issues of how to design studies to deploy them most effectively. In designing a genomic study, a modern researcher might combine many sequencing modalities and sampling protocols, each with different utility, costs, and other tradeoffs. This is especially relevant for studies of somatic variation, which may involve highly heterogeneous cell populations whose differences can be probed via an extensive set of biotechnological tools. Efficiently deploying genomic technologies in this space will require principled ways to create study designs that recover desired genomic information while minimizing various measures of cost.

Results: The central problem this paper attempts to address is how one might create an optimal study design for a genomic analysis, with particular focus on studies involving somatic variation that occur most often with application to cancer genomics. We pose the study design problem as a stochastic constrained nonlinear optimization problem. We introduce a Bayesian optimization framework that iteratively optimizes for an objective function using surrogate modeling combined with pattern and gradient search. We demonstrate our procedure on several test cases to derive resource and study design allocations optimized for various goals and criteria, demonstrating its ability to optimize study designs efficiently across diverse scenarios.

Availability and implementation: https://github.com/CMUSchwartzLab/StudyDesignOptimization.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
优化基因组学研究设计,促进克隆进化分析。
动因:基因组生物技术发展迅速,可以在单细胞水平上推断和修改基因和表观遗传信息。这些工具为基础和临床研究带来了巨大的潜力,但同时也提出了如何设计研究以最有效地利用这些工具的难题。在设计基因组研究时,现代研究人员可能会结合多种测序模式和取样方案,每种模式和方案都有不同的效用、成本和其他权衡因素。这一点与体细胞变异研究尤为相关,因为体细胞变异研究可能涉及高度异质性的细胞群,而这些细胞群的差异可以通过一系列广泛的生物技术工具进行探测。要在这一领域有效地部署基因组技术,就需要有原则性的研究设计方法,既能恢复所需的基因组信息,又能最大限度地降低各种成本:本文试图解决的核心问题是如何为基因组分析创建最佳研究设计,尤其关注涉及体细胞变异的研究,这种变异在癌症基因组学应用中最为常见。我们将研究设计问题视为一个随机约束非线性优化问题。我们介绍了一种贝叶斯优化框架,该框架利用代用模型结合模式和梯度搜索对目标函数进行迭代优化。我们在几个测试案例中演示了我们的程序,得出了针对各种目标和标准进行优化的资源和研究设计分配,证明了它在各种情况下高效优化研究设计的能力。可用性和实现:https://github.com/CMUSchwartzLab/StudyDesignOptimization。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
CiteScore
1.60
自引率
0.00%
发文量
0
期刊最新文献
Imputation for Lipidomics and Metabolomics (ImpLiMet): a web-based application for optimization and method selection for missing data imputation. estiMAge: development of a DNA methylation clock to estimate the methylation age of single cells. Integrative co-registration of elemental imaging and histopathology for enhanced spatial multimodal analysis of tissue sections through TRACE. Predicting CRISPR-Cas9 off-target effects in human primary cells using bidirectional LSTM with BERT embedding. Genal: a Python toolkit for genetic risk scoring and Mendelian randomization.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1