A simple phylogenetic approach to analyze hypermutated HIV proviruses reveals insights into their dynamics and persistence during antiretroviral therapy.

Aniqa Shahid, Bradley R Jones, Maggie C Duncan, Signe MacLennan, Michael J Dapp, Mark H Kuniholm, Bradley Aouizerat, Nancie M Archin, Stephen Gange, Igho Ofotokun, Margaret A Fischl, Seble Kassaye, Harris Goldstein, Kathryn Anastos, Jeffrey B Joy, Zabrina L Brumme
{"title":"A simple phylogenetic approach to analyze hypermutated HIV proviruses reveals insights into their dynamics and persistence during antiretroviral therapy.","authors":"Aniqa Shahid, Bradley R Jones, Maggie C Duncan, Signe MacLennan, Michael J Dapp, Mark H Kuniholm, Bradley Aouizerat, Nancie M Archin, Stephen Gange, Igho Ofotokun, Margaret A Fischl, Seble Kassaye, Harris Goldstein, Kathryn Anastos, Jeffrey B Joy, Zabrina L Brumme","doi":"10.21203/rs.3.rs-4549934/v1","DOIUrl":null,"url":null,"abstract":"<p><p>Hypermutated proviruses, which arise in a single HIV replication cycle when host antiviral APOBEC3 proteins introduce extensive G-to-A mutations throughout the viral genome, persist in all people living with HIV receiving antiretroviral therapy (ART). But, the within-host evolutionary origins of hypermutated sequences are incompletely understood because phylogenetic inference algorithms, which assume that mutations gradually accumulate over generations, incorrectly reconstruct their ancestor-descendant relationships. Using > 1400 longitudinal single-genome-amplified HIV <i>env-gp120</i> sequences isolated from six women over a median 18 years of follow-up - including plasma HIV RNA sequences collected over a median 9 years between seroconversion and ART initiation, and > 500 proviruses isolated over a median 9 years on ART - we evaluated three approaches for removing hypermutation from nucleotide alignments. Our goals were to 1) reconstruct accurate phylogenies that can be used for molecular dating and 2) phylogenetically infer the integration dates of hypermutated proviruses persisting during ART. Two of the tested approaches (stripping all positions containing putative APOBEC3 mutations from the alignment, or replacing individual putative APOBEC3 mutations in hypermutated sequences with the ambiguous base R) consistently normalized tree topologies, eliminated erroneous clustering of hypermutated proviruses, and brought <i>env</i>-intact and hypermutated proviruses into comparable ranges with respect to multiple tree-based metrics. Importantly, these corrected trees produced integration date estimates for <i>env</i>-intact proviruses that were highly concordant with those from benchmark trees that excluded hypermutated sequences, indicating that the corrected trees can be used for molecular dating. Use of these trees to infer the integration dates of hypermutated proviruses persisting during ART revealed that these spanned a wide age range, with the oldest ones dating to shortly after infection. This indicates that hypermutated proviruses, like other provirus types, begin to be seeded into the proviral pool immediately following infection, and can persist for decades. In two of the six participants, hypermutated proviruses differed from <i>env-</i>intact ones in terms of their age distributions, suggesting that different provirus types decay at heterogeneous rates in some hosts. These simple approaches to reconstruct hypermutated provirus' evolutionary histories, allow insights into their <i>in vivo</i> origins and longevity, towards a more comprehensive understanding of HIV persistence during ART.</p>","PeriodicalId":94282,"journal":{"name":"Research square","volume":" ","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-06-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11213167/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Research square","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.21203/rs.3.rs-4549934/v1","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Hypermutated proviruses, which arise in a single HIV replication cycle when host antiviral APOBEC3 proteins introduce extensive G-to-A mutations throughout the viral genome, persist in all people living with HIV receiving antiretroviral therapy (ART). But, the within-host evolutionary origins of hypermutated sequences are incompletely understood because phylogenetic inference algorithms, which assume that mutations gradually accumulate over generations, incorrectly reconstruct their ancestor-descendant relationships. Using > 1400 longitudinal single-genome-amplified HIV env-gp120 sequences isolated from six women over a median 18 years of follow-up - including plasma HIV RNA sequences collected over a median 9 years between seroconversion and ART initiation, and > 500 proviruses isolated over a median 9 years on ART - we evaluated three approaches for removing hypermutation from nucleotide alignments. Our goals were to 1) reconstruct accurate phylogenies that can be used for molecular dating and 2) phylogenetically infer the integration dates of hypermutated proviruses persisting during ART. Two of the tested approaches (stripping all positions containing putative APOBEC3 mutations from the alignment, or replacing individual putative APOBEC3 mutations in hypermutated sequences with the ambiguous base R) consistently normalized tree topologies, eliminated erroneous clustering of hypermutated proviruses, and brought env-intact and hypermutated proviruses into comparable ranges with respect to multiple tree-based metrics. Importantly, these corrected trees produced integration date estimates for env-intact proviruses that were highly concordant with those from benchmark trees that excluded hypermutated sequences, indicating that the corrected trees can be used for molecular dating. Use of these trees to infer the integration dates of hypermutated proviruses persisting during ART revealed that these spanned a wide age range, with the oldest ones dating to shortly after infection. This indicates that hypermutated proviruses, like other provirus types, begin to be seeded into the proviral pool immediately following infection, and can persist for decades. In two of the six participants, hypermutated proviruses differed from env-intact ones in terms of their age distributions, suggesting that different provirus types decay at heterogeneous rates in some hosts. These simple approaches to reconstruct hypermutated provirus' evolutionary histories, allow insights into their in vivo origins and longevity, towards a more comprehensive understanding of HIV persistence during ART.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
用简单的系统发生学方法分析高突变的艾滋病毒前病毒,可以深入了解它们在抗逆转录病毒治疗过程中的动态和持久性。
当宿主抗病毒 APOBEC3 蛋白在整个病毒基因组中引入大量 G 到 A 的突变时,就会在单个 HIV 复制周期中产生高突变前病毒,这种病毒在所有接受抗逆转录病毒疗法(ART)的 HIV 感染者中持续存在。但是,人们对高突变序列的宿主内进化起源还不完全了解,因为系统发育推断算法假定突变会随着世代逐渐积累,从而错误地重建了它们的祖先-后代关系。我们使用了从六位女性中位 18 年的随访中分离出的超过 1400 个纵向单基因组扩增 HIV env-gp120 序列(包括从血清转换到开始抗逆转录病毒疗法之间中位 9 年收集的血浆 HIV RNA 序列,以及在抗逆转录病毒疗法中位 9 年分离出的超过 500 个前病毒),评估了从核苷酸排列中去除高突变的三种方法。我们的目标是:1)重建可用于分子年代测定的精确系统发生;2)从系统发生学角度推断抗逆转录病毒疗法期间持续存在的高突变前病毒的整合日期。其中两种测试方法(从比对中剔除所有含有推定 APOBEC3 突变的位置,或用模糊碱基 R 替换高突变序列中的单个推定 APOBEC3 突变)一致地规范了树的拓扑结构,消除了高突变前病毒的错误聚类,并使 env -intact 和高突变前病毒在多个基于树的指标方面处于可比范围内。重要的是,这些经过校正的树所得出的 env -intact 病毒的整合日期估计值与排除了高突变序列的基准树得出的整合日期估计值高度一致,这表明校正后的树可用于分子年代测定。利用这些树来推断抗逆转录病毒疗法期间持续存在的高突变前病毒的整合日期,发现这些前病毒的年龄跨度很大,最老的可追溯到感染后不久。这表明,高突变前病毒与其他类型的前病毒一样,在感染后立即开始向前病毒库播种,并可持续数十年之久。在六名参与者中,有两名参与者的高突变前病毒在年龄分布上不同于与 env 无关的前病毒,这表明不同类型的前病毒在某些宿主体内以不同的速度衰减。通过这些简单的方法重建高突变前病毒的进化史,可以深入了解它们在体内的起源和寿命,从而更全面地了解抗逆转录病毒疗法期间艾滋病毒的持续存在。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Long non-coding RNA Malat1 fine-tunes bone homeostasis and repair by orchestrating cellular crosstalk and the β-catenin-OPG/Jagged1 pathway. Dietary lipid is largely deposited in skin and rapidly affects insulating properties. Novel Machine Learning of DNA Methylation Patterns to Diagnose Complex Disease: Identification of Cerebral Palsy with Concurrent Epilepsy. The context-dependent epigenetic and organogenesis programs determine 3D vs. 2D cellular fitness of MYC-driven murine liver cancer cells. GZMK+CD8+ T cells Target A Specific Acinar Cell Type in Sjögren's Disease.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1