[hpv16型E6和E7蛋白的氨基酸取代模式:系统地理学和进化]。

E E Zelenova, A A Karlsen, D V Avdoshina, K K Kyuregyan, M G Belikova, I D Trotsenko
{"title":"[hpv16型E6和E7蛋白的氨基酸取代模式:系统地理学和进化]。","authors":"E E Zelenova, A A Karlsen, D V Avdoshina, K K Kyuregyan, M G Belikova, I D Trotsenko","doi":"10.31857/S0026898424040036, EDN: INFCZJ","DOIUrl":null,"url":null,"abstract":"<p><p>The E6 and E7 proteins of the high risk human papillomaviruses (HR HPVs) play a key role in the oncogenesis associated with papillomavirus infection. Data on the variability of these proteins are limited, and the factors affecting their variability are still poorly understood. We analyzed the variability of the currently known sequences of the HPV type 16 (HPV16) E6 and E7 proteins, taking into account their geographic origin and year of sample collection, as well as the direction of their evolution in the major geographic regions of the world. All sequences belonging to HPV16 genome fragments encoding the E6 and E7 oncoproteins were downloaded from the NCBI GenBank database on October 6, 2022. Samples were filtered according to the following parameters: the sequence has to include at least one of the two whole open reading frames, and given date of collection, and the country of origin. A total of 3651 full-genome nucleotide sequences encoding the E6 protein and 4578 full-genome nucleotide sequences encoding the E7 protein were sampled. The nucleotide sequences obtained after sampling and alignment were converted to amino acid sequences and analyzed using the MEGA11, R, RStudio, Jmodeltest 2.1.20, BEAST v1.10.4, Fastcov, and Biostrings software. The highest variability in the E6 protein was recorded for amino acid (AA) residues in the positions 17, 21, 32, 85, and 90. The most variable in E7 were aa positions 28, 29, 51, and 77. The samples were divided geographically into five heterogeneous groups as derived from Africa, Europe, America, South-West and South Asia, and South-East Asia. Unique amino acid substitutions (AA-substitutions) in the E6/E7 proteins of HPV16, presumably characteristic to certain ethnic groups, were identified for a number of countries. They weare mainly localized in the sites of known B- and T-cell epitopes and relatively rarely the domains critical for in structure and protein function. The revealed differences in AA-substitutions in different ethnic groups and their colocalization with the clusters of B- and T-cell epitopes suggested their possible relation to the geographical distribution of alleles and haplotypes of the major histocompatibility complex (HLA). This may lead to the recognition of a different set of B- and T-cell epitopes of the virus in different geographic areas, resulting in the regional differences in the direction of epitopic drift. Phylogenetic analysis of the nucleotide sequences encoding the E6 protein of HPV16 revealed a common ancestor, confirmed regional clustering of the E6 protein sequences sharing common AA-substitutions, and identified cases of reversion of individual AA-substitutions when the change of geographical localization. For the E7 protein, such analysis was not possible due to the high sequence homology. Covariance analysis of the pooled of E6 and E7 sequences revealed that there was no associations between amino acid residues in any aa position within E6 or E7 as well as aa positions of E6 and E7 proteins. The data presented here are important for the development of universal therapeutic vaccines against HPV of high carcinogenic risk.</p>","PeriodicalId":39818,"journal":{"name":"Molekulyarnaya Biologiya","volume":"58 4","pages":"549-574"},"PeriodicalIF":0.0000,"publicationDate":"2024-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"[Amino Acid Substitution Patterns in the E6 and E7 Proteins of HPV Type 16: Phylogeography and Evolution].\",\"authors\":\"E E Zelenova, A A Karlsen, D V Avdoshina, K K Kyuregyan, M G Belikova, I D Trotsenko\",\"doi\":\"10.31857/S0026898424040036, EDN: INFCZJ\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>The E6 and E7 proteins of the high risk human papillomaviruses (HR HPVs) play a key role in the oncogenesis associated with papillomavirus infection. Data on the variability of these proteins are limited, and the factors affecting their variability are still poorly understood. We analyzed the variability of the currently known sequences of the HPV type 16 (HPV16) E6 and E7 proteins, taking into account their geographic origin and year of sample collection, as well as the direction of their evolution in the major geographic regions of the world. All sequences belonging to HPV16 genome fragments encoding the E6 and E7 oncoproteins were downloaded from the NCBI GenBank database on October 6, 2022. Samples were filtered according to the following parameters: the sequence has to include at least one of the two whole open reading frames, and given date of collection, and the country of origin. A total of 3651 full-genome nucleotide sequences encoding the E6 protein and 4578 full-genome nucleotide sequences encoding the E7 protein were sampled. The nucleotide sequences obtained after sampling and alignment were converted to amino acid sequences and analyzed using the MEGA11, R, RStudio, Jmodeltest 2.1.20, BEAST v1.10.4, Fastcov, and Biostrings software. The highest variability in the E6 protein was recorded for amino acid (AA) residues in the positions 17, 21, 32, 85, and 90. The most variable in E7 were aa positions 28, 29, 51, and 77. The samples were divided geographically into five heterogeneous groups as derived from Africa, Europe, America, South-West and South Asia, and South-East Asia. Unique amino acid substitutions (AA-substitutions) in the E6/E7 proteins of HPV16, presumably characteristic to certain ethnic groups, were identified for a number of countries. They weare mainly localized in the sites of known B- and T-cell epitopes and relatively rarely the domains critical for in structure and protein function. The revealed differences in AA-substitutions in different ethnic groups and their colocalization with the clusters of B- and T-cell epitopes suggested their possible relation to the geographical distribution of alleles and haplotypes of the major histocompatibility complex (HLA). This may lead to the recognition of a different set of B- and T-cell epitopes of the virus in different geographic areas, resulting in the regional differences in the direction of epitopic drift. Phylogenetic analysis of the nucleotide sequences encoding the E6 protein of HPV16 revealed a common ancestor, confirmed regional clustering of the E6 protein sequences sharing common AA-substitutions, and identified cases of reversion of individual AA-substitutions when the change of geographical localization. For the E7 protein, such analysis was not possible due to the high sequence homology. Covariance analysis of the pooled of E6 and E7 sequences revealed that there was no associations between amino acid residues in any aa position within E6 or E7 as well as aa positions of E6 and E7 proteins. The data presented here are important for the development of universal therapeutic vaccines against HPV of high carcinogenic risk.</p>\",\"PeriodicalId\":39818,\"journal\":{\"name\":\"Molekulyarnaya Biologiya\",\"volume\":\"58 4\",\"pages\":\"549-574\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-07-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Molekulyarnaya Biologiya\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.31857/S0026898424040036, EDN: INFCZJ\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"Medicine\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Molekulyarnaya Biologiya","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.31857/S0026898424040036, EDN: INFCZJ","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"Medicine","Score":null,"Total":0}
引用次数: 0

摘要

高危人乳头瘤病毒(HR hpv)的E6和E7蛋白在与乳头瘤病毒感染相关的肿瘤发生中起关键作用。关于这些蛋白质可变性的数据是有限的,影响其可变性的因素仍然知之甚少。我们分析了目前已知的HPV16型(HPV16) E6和E7蛋白序列的可变性,考虑到它们的地理起源和样本收集年份,以及它们在世界主要地理区域的进化方向。2022年10月6日,从NCBI GenBank数据库中下载了编码E6和E7癌蛋白的HPV16基因组片段的所有序列。根据以下参数对样本进行过滤:序列必须包括两个完整开放阅读框中的至少一个,并给出采集日期和原产国。共获得编码E6蛋白的全基因组核苷酸序列3651条,编码E7蛋白的全基因组核苷酸序列4578条。将取样比对后得到的核苷酸序列转化为氨基酸序列,使用MEGA11、R、RStudio、Jmodeltest 2.1.20、BEAST v1.10.4、Fastcov和Biostrings软件进行分析。E6蛋白在17、21、32、85和90个位置的氨基酸(AA)残基上变异最大。E7中变化最大的是第28、29、51和77位。这些样本在地理上被分为来自非洲、欧洲、美洲、西南和南亚以及东南亚的五个异质群体。在许多国家发现了HPV16的E6/E7蛋白中独特的氨基酸取代(aa -取代),可能是某些种族群体的特征。它们主要定位于已知的B细胞和t细胞表位,相对较少定位于结构和蛋白质功能的关键区域。不同族群aa -取代的差异及其与B细胞和t细胞表位簇的共定位提示其可能与主要组织相容性复合体(HLA)等位基因和单倍型的地理分布有关。这可能导致在不同的地理区域识别一组不同的病毒B细胞和t细胞表位,从而导致表位漂移方向的区域差异。对编码HPV16 E6蛋白的核苷酸序列进行系统发育分析,发现E6蛋白序列具有共同的祖先,具有共同的aa -取代序列,并发现了地理定位变化时个体aa -取代的逆转情况。对于E7蛋白,由于序列同源性高,无法进行这种分析。对E6和E7序列进行协方差分析,发现E6和E7中aa位氨基酸残基与E6和E7蛋白的aa位氨基酸残基之间不存在关联。本文提供的数据对于开发针对高致癌风险HPV的通用治疗性疫苗具有重要意义。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
[Amino Acid Substitution Patterns in the E6 and E7 Proteins of HPV Type 16: Phylogeography and Evolution].

The E6 and E7 proteins of the high risk human papillomaviruses (HR HPVs) play a key role in the oncogenesis associated with papillomavirus infection. Data on the variability of these proteins are limited, and the factors affecting their variability are still poorly understood. We analyzed the variability of the currently known sequences of the HPV type 16 (HPV16) E6 and E7 proteins, taking into account their geographic origin and year of sample collection, as well as the direction of their evolution in the major geographic regions of the world. All sequences belonging to HPV16 genome fragments encoding the E6 and E7 oncoproteins were downloaded from the NCBI GenBank database on October 6, 2022. Samples were filtered according to the following parameters: the sequence has to include at least one of the two whole open reading frames, and given date of collection, and the country of origin. A total of 3651 full-genome nucleotide sequences encoding the E6 protein and 4578 full-genome nucleotide sequences encoding the E7 protein were sampled. The nucleotide sequences obtained after sampling and alignment were converted to amino acid sequences and analyzed using the MEGA11, R, RStudio, Jmodeltest 2.1.20, BEAST v1.10.4, Fastcov, and Biostrings software. The highest variability in the E6 protein was recorded for amino acid (AA) residues in the positions 17, 21, 32, 85, and 90. The most variable in E7 were aa positions 28, 29, 51, and 77. The samples were divided geographically into five heterogeneous groups as derived from Africa, Europe, America, South-West and South Asia, and South-East Asia. Unique amino acid substitutions (AA-substitutions) in the E6/E7 proteins of HPV16, presumably characteristic to certain ethnic groups, were identified for a number of countries. They weare mainly localized in the sites of known B- and T-cell epitopes and relatively rarely the domains critical for in structure and protein function. The revealed differences in AA-substitutions in different ethnic groups and their colocalization with the clusters of B- and T-cell epitopes suggested their possible relation to the geographical distribution of alleles and haplotypes of the major histocompatibility complex (HLA). This may lead to the recognition of a different set of B- and T-cell epitopes of the virus in different geographic areas, resulting in the regional differences in the direction of epitopic drift. Phylogenetic analysis of the nucleotide sequences encoding the E6 protein of HPV16 revealed a common ancestor, confirmed regional clustering of the E6 protein sequences sharing common AA-substitutions, and identified cases of reversion of individual AA-substitutions when the change of geographical localization. For the E7 protein, such analysis was not possible due to the high sequence homology. Covariance analysis of the pooled of E6 and E7 sequences revealed that there was no associations between amino acid residues in any aa position within E6 or E7 as well as aa positions of E6 and E7 proteins. The data presented here are important for the development of universal therapeutic vaccines against HPV of high carcinogenic risk.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Molekulyarnaya Biologiya
Molekulyarnaya Biologiya Medicine-Medicine (all)
CiteScore
0.70
自引率
0.00%
发文量
131
期刊最新文献
[Donor DNA Modification with Cas9 Targeting Sites Improves the Efficiency of MTC34 Knock-in into the CXCR4 Locus]. [How to Shift the Equilibrium of DNA Break Repair in Favor of Homologous Recombination]. [Human eRF1 Translation Regulation]. [Metabolic Profile of Gut Microbiota and Levels of Trefoil Factors in Adults with Different Metabolic Phenotypes of Obesity]. [Methods to Increase the Efficiency of Knock-in of a Construct Encoding the HIV-1 Fusion Inhibitor, MT-C34 Peptide, into the CXCR4 Locus in the CEM/R5 T Cell Line].
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1