Hong-Li Zeng, Cheng-Long Yang, Bo Jing, John Barton, Erik Aurell
{"title":"Two fitness inference schemes compared using allele frequencies from 1,068,391 sequences sampled in the UK during the COVID-19 pandemic.","authors":"Hong-Li Zeng, Cheng-Long Yang, Bo Jing, John Barton, Erik Aurell","doi":"10.1088/1478-3975/ad9213","DOIUrl":null,"url":null,"abstract":"<p><p>Throughout the course of the SARS-CoV-2 pandemic, genetic variation has contributed to the spread and persistence of the virus. For example, various mutations have allowed SARS-CoV-2 to escape antibody neutralization or to bind more strongly to the receptors that it uses to enter human cells. Here, we compared two methods that estimate the fitness effects of viral mutations using the abundant sequence data gathered over the course of the pandemic. Both approaches are grounded in population genetics theory but with different assumptions. One approach, tQLE, features an epistatic fitness landscape and assumes that alleles are nearly in linkage equilibrium. Another approach, MPL, assumes a simple, additive fitness landscape, but allows for any level of correlation between alleles. We characterized differences in the distributions of fitness values inferred by each approach and in the ranks of fitness values that they assign to sequences across time. We find that in a large fraction of weeks the two methods are in good agreement as to their top-ranked sequences, \\textit{i.e.} as to which sequences observed that week are most fit. We also find that agreement between the ranking of sequences varies with genetic unimodality in the population in a given week.
.</p>","PeriodicalId":20207,"journal":{"name":"Physical biology","volume":null,"pages":null},"PeriodicalIF":2.0000,"publicationDate":"2024-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Physical biology","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1088/1478-3975/ad9213","RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"BIOCHEMISTRY & MOLECULAR BIOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Throughout the course of the SARS-CoV-2 pandemic, genetic variation has contributed to the spread and persistence of the virus. For example, various mutations have allowed SARS-CoV-2 to escape antibody neutralization or to bind more strongly to the receptors that it uses to enter human cells. Here, we compared two methods that estimate the fitness effects of viral mutations using the abundant sequence data gathered over the course of the pandemic. Both approaches are grounded in population genetics theory but with different assumptions. One approach, tQLE, features an epistatic fitness landscape and assumes that alleles are nearly in linkage equilibrium. Another approach, MPL, assumes a simple, additive fitness landscape, but allows for any level of correlation between alleles. We characterized differences in the distributions of fitness values inferred by each approach and in the ranks of fitness values that they assign to sequences across time. We find that in a large fraction of weeks the two methods are in good agreement as to their top-ranked sequences, \textit{i.e.} as to which sequences observed that week are most fit. We also find that agreement between the ranking of sequences varies with genetic unimodality in the population in a given week.
.
期刊介绍:
Physical Biology publishes articles in the broad interdisciplinary field bridging biology with the physical sciences and engineering. This journal focuses on research in which quantitative approaches – experimental, theoretical and modeling – lead to new insights into biological systems at all scales of space and time, and all levels of organizational complexity.
Physical Biology accepts contributions from a wide range of biological sub-fields, including topics such as:
molecular biophysics, including single molecule studies, protein-protein and protein-DNA interactions
subcellular structures, organelle dynamics, membranes, protein assemblies, chromosome structure
intracellular processes, e.g. cytoskeleton dynamics, cellular transport, cell division
systems biology, e.g. signaling, gene regulation and metabolic networks
cells and their microenvironment, e.g. cell mechanics and motility, chemotaxis, extracellular matrix, biofilms
cell-material interactions, e.g. biointerfaces, electrical stimulation and sensing, endocytosis
cell-cell interactions, cell aggregates, organoids, tissues and organs
developmental dynamics, including pattern formation and morphogenesis
physical and evolutionary aspects of disease, e.g. cancer progression, amyloid formation
neuronal systems, including information processing by networks, memory and learning
population dynamics, ecology, and evolution
collective action and emergence of collective phenomena.