Pub Date : 2023-07-14DOI: 10.1186/s13100-023-00296-4
Jingxuan Chen, Preston J Basting, Shunhua Han, David J Garfinkel, Casey M Bergman
Background: Many computational methods have been developed to detect non-reference transposable element (TE) insertions using short-read whole genome sequencing data. The diversity and complexity of such methods often present challenges to new users seeking to reproducibly install, execute, or evaluate multiple TE insertion detectors.
Results: We previously developed the McClintock meta-pipeline to facilitate the installation, execution, and evaluation of six first-generation short-read TE detectors. Here, we report a completely re-implemented version of McClintock written in Python using Snakemake and Conda that improves its installation, error handling, speed, stability, and extensibility. McClintock 2 now includes 12 short-read TE detectors, auxiliary pre-processing and analysis modules, interactive HTML reports, and a simulation framework to reproducibly evaluate the accuracy of component TE detectors. When applied to the model microbial eukaryote Saccharomyces cerevisiae, we find substantial variation in the ability of McClintock 2 components to identify the precise locations of non-reference TE insertions, with RelocaTE2 showing the highest recall and precision in simulated data. We find that RelocaTE2, TEMP, TEMP2 and TEBreak provide consistent estimates of [Formula: see text]50 non-reference TE insertions per strain and that Ty2 has the highest number of non-reference TE insertions in a species-wide panel of [Formula: see text]1000 yeast genomes. Finally, we show that best-in-class predictors for yeast applied to resequencing data have sufficient resolution to reveal a dyad pattern of integration in nucleosome-bound regions upstream of yeast tRNA genes for Ty1, Ty2, and Ty4, allowing us to extend knowledge about fine-scale target preferences revealed previously for experimentally-induced Ty1 insertions to spontaneous insertions for other copia-superfamily retrotransposons in yeast.
Conclusion: McClintock ( https://github.com/bergmanlab/mcclintock/ ) provides a user-friendly pipeline for the identification of TEs in short-read WGS data using multiple TE detectors, which should benefit researchers studying TE insertion variation in a wide range of different organisms. Application of the improved McClintock system to simulated and empirical yeast genome data reveals best-in-class methods and novel biological insights for one of the most widely-studied model eukaryotes and provides a paradigm for evaluating and selecting non-reference TE detectors in other species.
{"title":"Reproducible evaluation of transposable element detectors with McClintock 2 guides accurate inference of Ty insertion patterns in yeast.","authors":"Jingxuan Chen, Preston J Basting, Shunhua Han, David J Garfinkel, Casey M Bergman","doi":"10.1186/s13100-023-00296-4","DOIUrl":"https://doi.org/10.1186/s13100-023-00296-4","url":null,"abstract":"<p><strong>Background: </strong>Many computational methods have been developed to detect non-reference transposable element (TE) insertions using short-read whole genome sequencing data. The diversity and complexity of such methods often present challenges to new users seeking to reproducibly install, execute, or evaluate multiple TE insertion detectors.</p><p><strong>Results: </strong>We previously developed the McClintock meta-pipeline to facilitate the installation, execution, and evaluation of six first-generation short-read TE detectors. Here, we report a completely re-implemented version of McClintock written in Python using Snakemake and Conda that improves its installation, error handling, speed, stability, and extensibility. McClintock 2 now includes 12 short-read TE detectors, auxiliary pre-processing and analysis modules, interactive HTML reports, and a simulation framework to reproducibly evaluate the accuracy of component TE detectors. When applied to the model microbial eukaryote Saccharomyces cerevisiae, we find substantial variation in the ability of McClintock 2 components to identify the precise locations of non-reference TE insertions, with RelocaTE2 showing the highest recall and precision in simulated data. We find that RelocaTE2, TEMP, TEMP2 and TEBreak provide consistent estimates of [Formula: see text]50 non-reference TE insertions per strain and that Ty2 has the highest number of non-reference TE insertions in a species-wide panel of [Formula: see text]1000 yeast genomes. Finally, we show that best-in-class predictors for yeast applied to resequencing data have sufficient resolution to reveal a dyad pattern of integration in nucleosome-bound regions upstream of yeast tRNA genes for Ty1, Ty2, and Ty4, allowing us to extend knowledge about fine-scale target preferences revealed previously for experimentally-induced Ty1 insertions to spontaneous insertions for other copia-superfamily retrotransposons in yeast.</p><p><strong>Conclusion: </strong>McClintock ( https://github.com/bergmanlab/mcclintock/ ) provides a user-friendly pipeline for the identification of TEs in short-read WGS data using multiple TE detectors, which should benefit researchers studying TE insertion variation in a wide range of different organisms. Application of the improved McClintock system to simulated and empirical yeast genome data reveals best-in-class methods and novel biological insights for one of the most widely-studied model eukaryotes and provides a paradigm for evaluating and selecting non-reference TE detectors in other species.</p>","PeriodicalId":18854,"journal":{"name":"Mobile DNA","volume":null,"pages":null},"PeriodicalIF":4.9,"publicationDate":"2023-07-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10347736/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9863184","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2023-05-26DOI: 10.1186/s13100-023-00295-5
Desmila Idola, Hiroshi Mori, Yuji Nagata, Lisa Nonaka, Hirokazu Yano
Background: The strand-biased circularizing integrative elements (SEs) are putatively non-mobilizable integrative elements for transmitting antimicrobial resistance genes. The transposition mode and the prevalence of SEs in prokaryotes remain vague.
Results: To corroborate the transposition mode and the prevalence of SEs, hypothetical transposition intermediates of an SE were searched for in genomic DNA fractions of an SE host. Then, the SE core genes were defined based on gene knockout experiments, and the synteny blocks of their distant homologs were searched for in the RefSeq complete genome sequence database using PSI-BLAST. A genomic DNA fractionation experiment revealed that SE copies are present in a double-stranded nicked circular form in vivo. Operonic structure of three conserved coding sequences (intA, tfp, intB) and srap located at the left end of SEs were identified as essential for attL × attR recombination. The synteny blocks of tfp and srap homologs were detected in 3.6% of the replicons of Gammaproteobacteria but not in other taxa, implying that SE movement is host-dependent. SEs have been discovered most frequently in the orders Vibrionales (19% of replicons), Pseudomonadales (18%), Alteromonadales (17%), and Aeromonadales (12%). Genomic comparisons revealed 35 new SE members with identifiable termini. SEs are present at 1 to 2 copies per replicon and have a median length of 15.7 kb. Three newly identified SE members carry antimicrobial resistance genes, like tmexCD-toprJ, mcr-9, and blaGMA-1. Further experiments validated that three new SE members possess the strand-biased attL × attR recombination activity.
Conclusions: This study suggested that transposition intermediates of SEs are double-stranded circular DNA. The main hosts of SEs are a subset of free-living Gammaproteobacteria; this represents a rather narrow host range compared to those of mobile DNA element groups discovered to date. As the host range, genetic organization, and movements are unique among the mobile DNA elements, SEs provide a new model system for host-mobile DNA element coevolution studies.
{"title":"Host range of strand-biased circularizing integrative elements: a new class of mobile DNA elements nesting in Gammaproteobacteria.","authors":"Desmila Idola, Hiroshi Mori, Yuji Nagata, Lisa Nonaka, Hirokazu Yano","doi":"10.1186/s13100-023-00295-5","DOIUrl":"https://doi.org/10.1186/s13100-023-00295-5","url":null,"abstract":"<p><strong>Background: </strong>The strand-biased circularizing integrative elements (SEs) are putatively non-mobilizable integrative elements for transmitting antimicrobial resistance genes. The transposition mode and the prevalence of SEs in prokaryotes remain vague.</p><p><strong>Results: </strong>To corroborate the transposition mode and the prevalence of SEs, hypothetical transposition intermediates of an SE were searched for in genomic DNA fractions of an SE host. Then, the SE core genes were defined based on gene knockout experiments, and the synteny blocks of their distant homologs were searched for in the RefSeq complete genome sequence database using PSI-BLAST. A genomic DNA fractionation experiment revealed that SE copies are present in a double-stranded nicked circular form in vivo. Operonic structure of three conserved coding sequences (intA, tfp, intB) and srap located at the left end of SEs were identified as essential for attL × attR recombination. The synteny blocks of tfp and srap homologs were detected in 3.6% of the replicons of Gammaproteobacteria but not in other taxa, implying that SE movement is host-dependent. SEs have been discovered most frequently in the orders Vibrionales (19% of replicons), Pseudomonadales (18%), Alteromonadales (17%), and Aeromonadales (12%). Genomic comparisons revealed 35 new SE members with identifiable termini. SEs are present at 1 to 2 copies per replicon and have a median length of 15.7 kb. Three newly identified SE members carry antimicrobial resistance genes, like tmexCD-toprJ, mcr-9, and bla<sub>GMA-1</sub>. Further experiments validated that three new SE members possess the strand-biased attL × attR recombination activity.</p><p><strong>Conclusions: </strong>This study suggested that transposition intermediates of SEs are double-stranded circular DNA. The main hosts of SEs are a subset of free-living Gammaproteobacteria; this represents a rather narrow host range compared to those of mobile DNA element groups discovered to date. As the host range, genetic organization, and movements are unique among the mobile DNA elements, SEs provide a new model system for host-mobile DNA element coevolution studies.</p>","PeriodicalId":18854,"journal":{"name":"Mobile DNA","volume":null,"pages":null},"PeriodicalIF":4.9,"publicationDate":"2023-05-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10214605/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9532139","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2023-05-22DOI: 10.1186/s13100-023-00294-6
Zheng Zuo
THE1-family retrovirus invaded the primate genome more than 40 million years ago. Dunn-Fletcher et al. reported one THE1B element upstream of CRH gene alters gestation length by upregulating corticotropin-releasing hormone expression in transgenic mice and concluded it has the same role in human as well. However, no promoter or enhancer mark has been detected around this CRH-proximal element in any human tissue or cell, so probably some anti-viral factor exists in primates to prevents it from wreaking havoc. Here I report two paralogous zinc finger genes, ZNF430 and ZNF100, that emerged during the simian lineage to specifically silence THE1B and THE1A, respectively. Contact residue changes in one finger confers each ZNF the unique ability to preferentially repress one THE1 sub-family over the other. The reported THE1B element contains an intact ZNF430 binding site, thus under the repression of ZNF430 in most tissues including placenta, it is questionable whether or not this retrovirus has any role in human pregnancy. Overall, this analysis highlights the need to study human retroviruses' functions in suitable model system.
{"title":"THE1B may have no role in human pregnancy due to ZNF430-mediated silencing.","authors":"Zheng Zuo","doi":"10.1186/s13100-023-00294-6","DOIUrl":"https://doi.org/10.1186/s13100-023-00294-6","url":null,"abstract":"<p><p>THE1-family retrovirus invaded the primate genome more than 40 million years ago. Dunn-Fletcher et al. reported one THE1B element upstream of CRH gene alters gestation length by upregulating corticotropin-releasing hormone expression in transgenic mice and concluded it has the same role in human as well. However, no promoter or enhancer mark has been detected around this CRH-proximal element in any human tissue or cell, so probably some anti-viral factor exists in primates to prevents it from wreaking havoc. Here I report two paralogous zinc finger genes, ZNF430 and ZNF100, that emerged during the simian lineage to specifically silence THE1B and THE1A, respectively. Contact residue changes in one finger confers each ZNF the unique ability to preferentially repress one THE1 sub-family over the other. The reported THE1B element contains an intact ZNF430 binding site, thus under the repression of ZNF430 in most tissues including placenta, it is questionable whether or not this retrovirus has any role in human pregnancy. Overall, this analysis highlights the need to study human retroviruses' functions in suitable model system.</p>","PeriodicalId":18854,"journal":{"name":"Mobile DNA","volume":null,"pages":null},"PeriodicalIF":4.9,"publicationDate":"2023-05-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10204343/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9508945","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2023-05-10DOI: 10.1186/s13100-023-00293-7
Kennedy C Ukadike, Rayan Najjar, Kathryn Ni, Amanda Laine, Xiaoxing Wang, Alison Bays, Martin S Taylor, John LaCava, Tomas Mustelin
Background: Patients with systemic lupus erythematosus (SLE) have autoantibodies against the L1-encoded open-reading frame 1 protein (ORF1p). Here, we report (i) which immune cells ORF1p emanates from, (ii) which L1 loci are transcriptionally active, (iii) whether the cells express L1-dependent interferon and interferon-stimulated genes, and (iv) the effect of inhibition of L1 ORF2p by reverse transcriptase inhibitors.
Results: L1 ORF1p was detected by flow cytometry primarily in SLE CD66b+CD15+ regular and low-density granulocytes, but much less in other immune cell lineages. The amount of ORF1p was higher in neutrophils from patients with SLE disease activity index (SLEDAI) > 6 (p = 0.011) compared to patients with inactive disease, SLEDAI < 4. Patient neutrophils transcribed seven to twelve human-specific L1 loci (L1Hs), but only 3 that are full-length and with an intact ORF1. Besides serving as a source of detectable ORF1p, the most abundant transcript encoded a truncated ORF2p reverse transcriptase predicted to remain cytosolic, while the two other encoded an intact full-length ORF2p. A number of genes encoding proteins that influence L1 transcription positively or negatively were altered in patients, particularly those with active disease, compared to healthy controls. Components of nucleic acid sensing and interferon induction were also altered. SLE neutrophils also expressed type I interferon-inducible genes and interferon β, which were substantially reduced after treatment of the cells with drugs known to inhibit ORF2p reverse transcriptase activity.
Conclusions: We identified L1Hs loci that are transcriptionally active in SLE neutrophils, and a reduction in the epigenetic silencing mechanisms that normally counteract L1 transcription. SLE neutrophils contained L1-encoded ORF1p protein, as well as activation of the type I interferon system, which was inhibited by treatment with reverse transcriptase inhibitors. Our findings will enable a deeper analysis of L1 dysregulation and its potential role in SLE pathogenesis.
{"title":"Expression of L1 retrotransposons in granulocytes from patients with active systemic lupus erythematosus.","authors":"Kennedy C Ukadike, Rayan Najjar, Kathryn Ni, Amanda Laine, Xiaoxing Wang, Alison Bays, Martin S Taylor, John LaCava, Tomas Mustelin","doi":"10.1186/s13100-023-00293-7","DOIUrl":"10.1186/s13100-023-00293-7","url":null,"abstract":"<p><strong>Background: </strong>Patients with systemic lupus erythematosus (SLE) have autoantibodies against the L1-encoded open-reading frame 1 protein (ORF1p). Here, we report (i) which immune cells ORF1p emanates from, (ii) which L1 loci are transcriptionally active, (iii) whether the cells express L1-dependent interferon and interferon-stimulated genes, and (iv) the effect of inhibition of L1 ORF2p by reverse transcriptase inhibitors.</p><p><strong>Results: </strong>L1 ORF1p was detected by flow cytometry primarily in SLE CD66b<sup>+</sup>CD15<sup>+</sup> regular and low-density granulocytes, but much less in other immune cell lineages. The amount of ORF1p was higher in neutrophils from patients with SLE disease activity index (SLEDAI) > 6 (p = 0.011) compared to patients with inactive disease, SLEDAI < 4. Patient neutrophils transcribed seven to twelve human-specific L1 loci (L1Hs), but only 3 that are full-length and with an intact ORF1. Besides serving as a source of detectable ORF1p, the most abundant transcript encoded a truncated ORF2p reverse transcriptase predicted to remain cytosolic, while the two other encoded an intact full-length ORF2p. A number of genes encoding proteins that influence L1 transcription positively or negatively were altered in patients, particularly those with active disease, compared to healthy controls. Components of nucleic acid sensing and interferon induction were also altered. SLE neutrophils also expressed type I interferon-inducible genes and interferon β, which were substantially reduced after treatment of the cells with drugs known to inhibit ORF2p reverse transcriptase activity.</p><p><strong>Conclusions: </strong>We identified L1Hs loci that are transcriptionally active in SLE neutrophils, and a reduction in the epigenetic silencing mechanisms that normally counteract L1 transcription. SLE neutrophils contained L1-encoded ORF1p protein, as well as activation of the type I interferon system, which was inhibited by treatment with reverse transcriptase inhibitors. Our findings will enable a deeper analysis of L1 dysregulation and its potential role in SLE pathogenesis.</p>","PeriodicalId":18854,"journal":{"name":"Mobile DNA","volume":null,"pages":null},"PeriodicalIF":4.9,"publicationDate":"2023-05-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10170740/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10439218","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2023-04-24DOI: 10.1186/s13100-023-00292-8
Thanh Quang Tran, Minyoung Park, Jong Eun Lee, Soo Hyun Kim, Jae-Ho Jeong, Hyon E Choy
Antimicrobial resistant pathogens are a global health threat driven by the indiscriminate use of antimicrobials. Antimicrobial resistance can be acquired by resistance genes encoded by mobile genetic elements. In this study, we identified a strain of Salmonella enterica serovar Gallinarum (SG4021) from an infected chicken in Korea and characterized the presence of resistance genes in its plasmid by whole genome sequencing. The sequence was then compared with that of a plasmid (P2) from strain SG_07Q015, the only other strain of S. Gallinarum isolated in Korea for which a genome sequence is available. The results revealed that both strains harbored nearly identical DNA carrying antibiotic resistance gene cassettes inserted into integron In2 of the transposable element Tn21, namely an aadA1 resistance gene conferring resistance to aminoglycosides and a sul1 resistance gene conferring resistance to sulfonamide. Interestingly, despite the presence of sul1 in SG4021, an antibiotic sensitivity test revealed that it was sensitive to sulfonamides. Further analysis revealed that this disparity was due to the insertion of a ~ 5 kb ISCR16 sequence downstream of the promoter driving sul1 expression in SG4021. Using various mutants, we showed that the insertion of ISCR16 blocked the expression of the sul1 gene from the upstream promoter. Therefore, the functionality of antimicrobial resistance genes determines phenotypic antimicrobial resistance.
{"title":"Analysis of antibiotic resistance gene cassettes in a newly identified Salmonella enterica serovar Gallinarum strain in Korea.","authors":"Thanh Quang Tran, Minyoung Park, Jong Eun Lee, Soo Hyun Kim, Jae-Ho Jeong, Hyon E Choy","doi":"10.1186/s13100-023-00292-8","DOIUrl":"https://doi.org/10.1186/s13100-023-00292-8","url":null,"abstract":"<p><p>Antimicrobial resistant pathogens are a global health threat driven by the indiscriminate use of antimicrobials. Antimicrobial resistance can be acquired by resistance genes encoded by mobile genetic elements. In this study, we identified a strain of Salmonella enterica serovar Gallinarum (SG4021) from an infected chicken in Korea and characterized the presence of resistance genes in its plasmid by whole genome sequencing. The sequence was then compared with that of a plasmid (P2) from strain SG_07Q015, the only other strain of S. Gallinarum isolated in Korea for which a genome sequence is available. The results revealed that both strains harbored nearly identical DNA carrying antibiotic resistance gene cassettes inserted into integron In2 of the transposable element Tn21, namely an aadA1 resistance gene conferring resistance to aminoglycosides and a sul1 resistance gene conferring resistance to sulfonamide. Interestingly, despite the presence of sul1 in SG4021, an antibiotic sensitivity test revealed that it was sensitive to sulfonamides. Further analysis revealed that this disparity was due to the insertion of a ~ 5 kb ISCR16 sequence downstream of the promoter driving sul1 expression in SG4021. Using various mutants, we showed that the insertion of ISCR16 blocked the expression of the sul1 gene from the upstream promoter. Therefore, the functionality of antimicrobial resistance genes determines phenotypic antimicrobial resistance.</p>","PeriodicalId":18854,"journal":{"name":"Mobile DNA","volume":null,"pages":null},"PeriodicalIF":4.9,"publicationDate":"2023-04-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10124037/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9391068","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2023-04-10DOI: 10.1186/s13100-023-00291-9
Pavel Jedlička, Viktor Tokan, Iva Kejnovská, Roman Hobza, Eduard Kejnovský
Background: Canonical telomeres (telomerase-synthetised) are readily forming G-quadruplexes (G4) on the G-rich strand. However, there are examples of non-canonical telomeres among eukaryotes where telomeric tandem repeats are invaded by specific retrotransposons. Drosophila melanogaster represents an extreme example with telomeres composed solely by three retrotransposons-Het-A, TAHRE and TART (HTT). Even though non-canonical telomeres often show strand biased G-distribution, the evidence for the G4-forming potential is limited.
Results: Using circular dichroism spectroscopy and UV absorption melting assay we have verified in vitro G4-formation in the HTT elements of D. melanogaster. Namely 3 in Het-A, 8 in TART and 2 in TAHRE. All the G4s are asymmetrically distributed as in canonical telomeres. Bioinformatic analysis showed that asymmetric distribution of potential quadruplex sequences (PQS) is common in telomeric retrotransposons in other Drosophila species. Most of the PQS are located in the gag gene where PQS density correlates with higher DNA sequence conservation and codon selection favoring G4-forming potential. The importance of G4s in non-canonical telomeres is further supported by analysis of telomere-associated retrotransposons from various eukaryotic species including green algae, Diplomonadida, fungi, insects and vertebrates. Virtually all analyzed telomere-associated retrotransposons contained PQS, frequently with asymmetric strand distribution. Comparison with non-telomeric elements showed independent selection of PQS-rich elements from four distinct LINE clades.
Conclusion: Our findings of strand-biased G4-forming motifs in telomere-associated retrotransposons from various eukaryotic species support the G4-formation as one of the prerequisites for the recruitment of specific retrotransposons to chromosome ends and call for further experimental studies.
背景:规范端粒(端粒酶合成)很容易在富含g的链上形成g -四联体(G4)。然而,在真核生物中有非规范端粒的例子,其中端粒串联重复序列被特定的反转录转座子入侵。黑腹果蝇代表了一个极端的例子,其端粒仅由三个反转录转座子- het - a, TAHRE和TART (HTT)组成。尽管非规范端粒经常表现出链偏g分布,但关于g4形成潜力的证据有限。结果:采用圆二色光谱法和紫外吸收熔融法,证实了黑胃草中HTT元素的g4生成。即Het-A有3个,TART有8个,tare有2个。所有的G4s都不对称地分布在规范端粒中。生物信息学分析表明,潜在四重体序列(PQS)的不对称分布在其他果蝇物种的端粒反转录转座子中也很常见。大多数PQS位于gag基因,PQS密度与较高的DNA序列保守性和有利于g4形成潜力的密码子选择相关。G4s在非规范端粒中的重要性进一步得到了来自各种真核生物物种(包括绿藻、外交家、真菌、昆虫和脊椎动物)的端粒相关反转录转座子的分析的支持。几乎所有分析的端粒相关的反转录转座子都含有PQS,通常具有不对称链分布。与非端粒元素的比较表明,4个不同的LINE分支中pqs丰富的元素是独立选择的。结论:我们在不同真核物种的端粒相关的反转录转座子中发现了链偏置的g4形成基序,支持g4的形成是特定的反转录转座子募集到染色体末端的先决条件之一,需要进一步的实验研究。
{"title":"Telomeric retrotransposons show propensity to form G-quadruplexes in various eukaryotic species.","authors":"Pavel Jedlička, Viktor Tokan, Iva Kejnovská, Roman Hobza, Eduard Kejnovský","doi":"10.1186/s13100-023-00291-9","DOIUrl":"https://doi.org/10.1186/s13100-023-00291-9","url":null,"abstract":"<p><strong>Background: </strong>Canonical telomeres (telomerase-synthetised) are readily forming G-quadruplexes (G4) on the G-rich strand. However, there are examples of non-canonical telomeres among eukaryotes where telomeric tandem repeats are invaded by specific retrotransposons. Drosophila melanogaster represents an extreme example with telomeres composed solely by three retrotransposons-Het-A, TAHRE and TART (HTT). Even though non-canonical telomeres often show strand biased G-distribution, the evidence for the G4-forming potential is limited.</p><p><strong>Results: </strong>Using circular dichroism spectroscopy and UV absorption melting assay we have verified in vitro G4-formation in the HTT elements of D. melanogaster. Namely 3 in Het-A, 8 in TART and 2 in TAHRE. All the G4s are asymmetrically distributed as in canonical telomeres. Bioinformatic analysis showed that asymmetric distribution of potential quadruplex sequences (PQS) is common in telomeric retrotransposons in other Drosophila species. Most of the PQS are located in the gag gene where PQS density correlates with higher DNA sequence conservation and codon selection favoring G4-forming potential. The importance of G4s in non-canonical telomeres is further supported by analysis of telomere-associated retrotransposons from various eukaryotic species including green algae, Diplomonadida, fungi, insects and vertebrates. Virtually all analyzed telomere-associated retrotransposons contained PQS, frequently with asymmetric strand distribution. Comparison with non-telomeric elements showed independent selection of PQS-rich elements from four distinct LINE clades.</p><p><strong>Conclusion: </strong>Our findings of strand-biased G4-forming motifs in telomere-associated retrotransposons from various eukaryotic species support the G4-formation as one of the prerequisites for the recruitment of specific retrotransposons to chromosome ends and call for further experimental studies.</p>","PeriodicalId":18854,"journal":{"name":"Mobile DNA","volume":null,"pages":null},"PeriodicalIF":4.9,"publicationDate":"2023-04-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10088271/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9658689","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2023-03-14DOI: 10.1186/s13100-023-00290-w
Marco Russo, Sara Morelli, Giovanni Capranico
Small-cell lung cancer (SCLC) is an aggressive cancer characterized by immunosuppressive features leading to poor responses to current immunotherapies. Activation of transposable elements (TE) can trigger an innate immune response, which can synergize with immunotherapeutic protocols in patients. However, TE activity in relation to immune gene response is not fully known in human SCLC. Here, we compared TE expression in 104 human SCLC and 24 normal tissues and established their involvement in innate immune responses. We observed that different intergenic TEs, mainly endogenous retroviral (ERV) families, are deregulated in SCLC. Similarly to other cancers, we detected a subset of LTRs that correlate with innate immune gene signatures and cytosolic RNA sensors, such as RIG-I. These LTRs are downregulated in SCLC tumors vs. normal tissues, and are mainly located at transcriptional repressed regions, marked with H3K4me2 in different cell lines. Analyses of different genomic datasets show that chromatin repression is likely due to de-methylase LSD1 activity. Moreover, high expression levels of ERV LTRs predict a better survival upon chemotherapy of SCLC patients. The findings reveal a specific pattern of TE-mediated activation of innate immune genes in SCLC, which can be exploited to establish more effective immunotherapeutic combinations.
{"title":"Expression of down-regulated ERV LTR elements associates with immune activation in human small-cell lung cancers.","authors":"Marco Russo, Sara Morelli, Giovanni Capranico","doi":"10.1186/s13100-023-00290-w","DOIUrl":"https://doi.org/10.1186/s13100-023-00290-w","url":null,"abstract":"<p><p>Small-cell lung cancer (SCLC) is an aggressive cancer characterized by immunosuppressive features leading to poor responses to current immunotherapies. Activation of transposable elements (TE) can trigger an innate immune response, which can synergize with immunotherapeutic protocols in patients. However, TE activity in relation to immune gene response is not fully known in human SCLC. Here, we compared TE expression in 104 human SCLC and 24 normal tissues and established their involvement in innate immune responses. We observed that different intergenic TEs, mainly endogenous retroviral (ERV) families, are deregulated in SCLC. Similarly to other cancers, we detected a subset of LTRs that correlate with innate immune gene signatures and cytosolic RNA sensors, such as RIG-I. These LTRs are downregulated in SCLC tumors vs. normal tissues, and are mainly located at transcriptional repressed regions, marked with H3K4me2 in different cell lines. Analyses of different genomic datasets show that chromatin repression is likely due to de-methylase LSD1 activity. Moreover, high expression levels of ERV LTRs predict a better survival upon chemotherapy of SCLC patients. The findings reveal a specific pattern of TE-mediated activation of innate immune genes in SCLC, which can be exploited to establish more effective immunotherapeutic combinations.</p>","PeriodicalId":18854,"journal":{"name":"Mobile DNA","volume":null,"pages":null},"PeriodicalIF":4.9,"publicationDate":"2023-03-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10012523/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9129154","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2023-02-11DOI: 10.1186/s13100-023-00289-3
Priscilla S Redd, Stephanie Diaz, David Weidner, Jazmine Benjamin, C Nathan Hancock
Background: DNA transposable elements are mobilized by a "cut and paste" mechanism catalyzed by the binding of one or more transposase proteins to terminal inverted repeats (TIRs) to form a transpositional complex. Study of the rice genome indicates that the mPing element has experienced a recent burst in transposition compared to the closely related Ping and Pong elements. A previously developed yeast transposition assay allowed us to probe the role of both internal and terminal sequences in the mobilization of these elements.
Results: We observed that mPing and a synthetic mPong element have significantly higher transposition efficiency than the related autonomous Ping and Pong elements. Systematic mutation of the internal sequences of both mPing and mPong identified multiple regions that promote or inhibit transposition. Simultaneous alteration of single bases on both mPing TIRs resulted in a significant reduction in transposition frequency, indicating that each base plays a role in efficient transposase binding. Testing chimeric mPing and mPong elements verified the important role of both the TIRs and internal regulatory regions. Previous experiments showed that the G at position 16, adjacent to the 5' TIR, allows mPing to have higher mobility. Alteration of the 16th and 17th base from mPing's 3' end or replacement of the 3' end with Pong 3' sequences significantly increased transposition frequency.
Conclusions: As the transposase proteins were consistent throughout this study, we conclude that the observed transposition differences are due to the element sequences. The presence of sub-optimal internal regions and TIR bases supports a model in which transposable elements self-limit their activity to prevent host damage and detection by host regulatory mechanisms. Knowing the role of the TIRs, adjacent sub-TIRs, and internal regulatory sequences allows for the creation of hyperactive elements.
{"title":"Mobility of mPing and its associated elements is regulated by both internal and terminal sequences.","authors":"Priscilla S Redd, Stephanie Diaz, David Weidner, Jazmine Benjamin, C Nathan Hancock","doi":"10.1186/s13100-023-00289-3","DOIUrl":"https://doi.org/10.1186/s13100-023-00289-3","url":null,"abstract":"<p><strong>Background: </strong>DNA transposable elements are mobilized by a \"cut and paste\" mechanism catalyzed by the binding of one or more transposase proteins to terminal inverted repeats (TIRs) to form a transpositional complex. Study of the rice genome indicates that the mPing element has experienced a recent burst in transposition compared to the closely related Ping and Pong elements. A previously developed yeast transposition assay allowed us to probe the role of both internal and terminal sequences in the mobilization of these elements.</p><p><strong>Results: </strong>We observed that mPing and a synthetic mPong element have significantly higher transposition efficiency than the related autonomous Ping and Pong elements. Systematic mutation of the internal sequences of both mPing and mPong identified multiple regions that promote or inhibit transposition. Simultaneous alteration of single bases on both mPing TIRs resulted in a significant reduction in transposition frequency, indicating that each base plays a role in efficient transposase binding. Testing chimeric mPing and mPong elements verified the important role of both the TIRs and internal regulatory regions. Previous experiments showed that the G at position 16, adjacent to the 5' TIR, allows mPing to have higher mobility. Alteration of the 16th and 17th base from mPing's 3' end or replacement of the 3' end with Pong 3' sequences significantly increased transposition frequency.</p><p><strong>Conclusions: </strong>As the transposase proteins were consistent throughout this study, we conclude that the observed transposition differences are due to the element sequences. The presence of sub-optimal internal regions and TIR bases supports a model in which transposable elements self-limit their activity to prevent host damage and detection by host regulatory mechanisms. Knowing the role of the TIRs, adjacent sub-TIRs, and internal regulatory sequences allows for the creation of hyperactive elements.</p>","PeriodicalId":18854,"journal":{"name":"Mobile DNA","volume":null,"pages":null},"PeriodicalIF":4.9,"publicationDate":"2023-02-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9921582/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10712094","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
David Doty, Hunter Fleming, Daniel Hader, Matthew J. Patitz, Lukas A. Vaughan
We present an abstract model of self-assembly of systems composed of “crisscross slats”, which have been experimentally implemented as a single-stranded piece of DNA [21] or as a complete DNA origami structure [28]. We then introduce a more physically realistic “kinetic” model and show how important constants in the model were derived and tuned, and compare simulation-based results to experimental results [21,28]. Using these models, we show how we can apply optimizations to designs of slat systems in order to lower the numbers of unique slat types required to build target structures. In general, we apply two types of techniques to achieve greatly reduced numbers of slat types. Similar to the experimental work implementing DNA origami-based slats, in our designs the slats oriented in horizontal and vertical directions are each restricted to their own plane and sets of them overlap each other in square regions which we refer to as macrotiles . Our first technique extends their previous work of reusing slat types within macrotiles and requires analyses of binding domain patterns to determine the potential for errors consisting of incorrect slat types attaching at undesired translations and reflections. The second technique leverages the power of algorithmic self-assembly to efficiently reuse entire macrotiles which self-assemble in patterns following designed algorithms that dictate the dimensions and patterns of growth. Using these designs, we demonstrate that in kinetic simulations the systems with reduced numbers of slat types self-assemble more quickly than those with greater numbers. This provides evidence that such optimizations will also result in greater assembly speeds in experimental systems. Furthermore, the reduced numbers of slat types required have the potential to vastly reduce the cost and number of lab steps for crisscross assembly experiments.
{"title":"Accelerating Self-Assembly of Crisscross Slat Systems","authors":"David Doty, Hunter Fleming, Daniel Hader, Matthew J. Patitz, Lukas A. Vaughan","doi":"10.4230/LIPIcs.DNA.29.7","DOIUrl":"https://doi.org/10.4230/LIPIcs.DNA.29.7","url":null,"abstract":"We present an abstract model of self-assembly of systems composed of “crisscross slats”, which have been experimentally implemented as a single-stranded piece of DNA [21] or as a complete DNA origami structure [28]. We then introduce a more physically realistic “kinetic” model and show how important constants in the model were derived and tuned, and compare simulation-based results to experimental results [21,28]. Using these models, we show how we can apply optimizations to designs of slat systems in order to lower the numbers of unique slat types required to build target structures. In general, we apply two types of techniques to achieve greatly reduced numbers of slat types. Similar to the experimental work implementing DNA origami-based slats, in our designs the slats oriented in horizontal and vertical directions are each restricted to their own plane and sets of them overlap each other in square regions which we refer to as macrotiles . Our first technique extends their previous work of reusing slat types within macrotiles and requires analyses of binding domain patterns to determine the potential for errors consisting of incorrect slat types attaching at undesired translations and reflections. The second technique leverages the power of algorithmic self-assembly to efficiently reuse entire macrotiles which self-assemble in patterns following designed algorithms that dictate the dimensions and patterns of growth. Using these designs, we demonstrate that in kinetic simulations the systems with reduced numbers of slat types self-assemble more quickly than those with greater numbers. This provides evidence that such optimizations will also result in greater assembly speeds in experimental systems. Furthermore, the reduced numbers of slat types required have the potential to vastly reduce the cost and number of lab steps for crisscross assembly experiments.","PeriodicalId":18854,"journal":{"name":"Mobile DNA","volume":null,"pages":null},"PeriodicalIF":4.9,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"81477749","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Molecular computation involving promiscuous, or non-orthogonal, binding interactions between system components is found commonly in natural biological systems, as well as some proposed human-made molecular computers. Such systems are characterized by the fact that each computational unit, such as a domain within a DNA strand, may bind to several different partners with distinct, prescribed binding strengths. Unfortunately, implementing systems of molecular computation that incorporate non-orthogonal binding is difficult, because researchers lack a robust, general-purpose method for designing molecules with this type of behavior. In this work, we describe and demonstrate a process for the rational design of DNA sequences with prescribed non-orthogonal binding behavior. This process makes use of a model that represents large sets of non-orthogonal DNA sequences using fixed-length binary strings, and estimates the differential binding affinity between pairs of sequences through the Hamming distance between their corresponding binary strings. The real-world applicability of this model is supported by simulations and some experimental data. We then select two previously described systems of molecular computation involving non-orthogonal interactions, and apply our sequence design process to implement them using DNA strand displacement. Our simulated results on these two systems demonstrate both digital and analog computation. We hope that this work motivates the development and implementation of new computational paradigms based on non-orthogonal binding.
{"title":"Rational Design of DNA Sequences with Non-Orthogonal Binding Interactions","authors":"Joseph Don Berleant","doi":"10.4230/LIPIcs.DNA.29.4","DOIUrl":"https://doi.org/10.4230/LIPIcs.DNA.29.4","url":null,"abstract":"Molecular computation involving promiscuous, or non-orthogonal, binding interactions between system components is found commonly in natural biological systems, as well as some proposed human-made molecular computers. Such systems are characterized by the fact that each computational unit, such as a domain within a DNA strand, may bind to several different partners with distinct, prescribed binding strengths. Unfortunately, implementing systems of molecular computation that incorporate non-orthogonal binding is difficult, because researchers lack a robust, general-purpose method for designing molecules with this type of behavior. In this work, we describe and demonstrate a process for the rational design of DNA sequences with prescribed non-orthogonal binding behavior. This process makes use of a model that represents large sets of non-orthogonal DNA sequences using fixed-length binary strings, and estimates the differential binding affinity between pairs of sequences through the Hamming distance between their corresponding binary strings. The real-world applicability of this model is supported by simulations and some experimental data. We then select two previously described systems of molecular computation involving non-orthogonal interactions, and apply our sequence design process to implement them using DNA strand displacement. Our simulated results on these two systems demonstrate both digital and analog computation. We hope that this work motivates the development and implementation of new computational paradigms based on non-orthogonal binding.","PeriodicalId":18854,"journal":{"name":"Mobile DNA","volume":null,"pages":null},"PeriodicalIF":4.9,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"74261429","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}