Andrei Kramer, Federica Milinanni, Pierre Nyquist, Alexandra Jauhiainen, Olivia Eriksson
{"title":"UQSA -- An R-Package for Uncertainty Quantification and Sensitivity Analysis for Biochemical Reaction Network Models","authors":"Andrei Kramer, Federica Milinanni, Pierre Nyquist, Alexandra Jauhiainen, Olivia Eriksson","doi":"arxiv-2308.05527","DOIUrl":null,"url":null,"abstract":"We present an R-package developed for modeling of biochemical reaction\nnetworks, uncertainty quantification (UQ) and sensitivity analysis (SA).\nEstimating parameters and quantifying their uncertainty (and resulting\nprediction uncertainty), is required for data-driven systems biology modeling.\nSampling methods need to be efficient when confronted with high-dimensional,\ncorrelated parameter distributions. We have developed the UQSA package to be\nfast for this problem class and work well with other tools for modelling. We\naim for simplicity, and part of that is our use of the SBtab format for the\nunified storage of model and data. Our tool-set is modular enough, that parts\ncan be replaced. We use intermediate formats that are not hidden from the user\nto make this feasible. UQ is performed through Markov chain Monte Carlo (MCMC)\nsampling in an Approximate Bayesian Computation (ABC) setting. This can be\nfollowed by a variance-decomposition based global sensitivity analysis. If\nneeded, complex parameter distributions can be described, evaluated, and\nsampled from, with the help of Vine-copulas that are available in R. This\napproach is especially useful when new experimental data become available, and\na previously calibrated model needs to be updated. Implementation: R is a high-level language and allows the use of\nsophisticated statistical methods. The ode solver we used is written in C\n(gsl_odeiv2, interface to R is ours). We use the SBtab tabular format for the\nmodel description, as well as the data and an event system to be able to model\ninputs frequently encountered in systems biology and neuroscience. The code has\nbeen tested on one node with 256 cores of a computing cluster, but smaller\nexamples are included in the repository that can be run on a laptop. Source code: https://github.com/icpm-kth/uqsa","PeriodicalId":501170,"journal":{"name":"arXiv - QuanBio - Subcellular Processes","volume":"58 39","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2023-08-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - QuanBio - Subcellular Processes","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2308.05527","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
We present an R-package developed for modeling of biochemical reaction
networks, uncertainty quantification (UQ) and sensitivity analysis (SA).
Estimating parameters and quantifying their uncertainty (and resulting
prediction uncertainty), is required for data-driven systems biology modeling.
Sampling methods need to be efficient when confronted with high-dimensional,
correlated parameter distributions. We have developed the UQSA package to be
fast for this problem class and work well with other tools for modelling. We
aim for simplicity, and part of that is our use of the SBtab format for the
unified storage of model and data. Our tool-set is modular enough, that parts
can be replaced. We use intermediate formats that are not hidden from the user
to make this feasible. UQ is performed through Markov chain Monte Carlo (MCMC)
sampling in an Approximate Bayesian Computation (ABC) setting. This can be
followed by a variance-decomposition based global sensitivity analysis. If
needed, complex parameter distributions can be described, evaluated, and
sampled from, with the help of Vine-copulas that are available in R. This
approach is especially useful when new experimental data become available, and
a previously calibrated model needs to be updated. Implementation: R is a high-level language and allows the use of
sophisticated statistical methods. The ode solver we used is written in C
(gsl_odeiv2, interface to R is ours). We use the SBtab tabular format for the
model description, as well as the data and an event system to be able to model
inputs frequently encountered in systems biology and neuroscience. The code has
been tested on one node with 256 cores of a computing cluster, but smaller
examples are included in the repository that can be run on a laptop. Source code: https://github.com/icpm-kth/uqsa