Pub Date : 2025-02-06DOI: 10.1186/s13100-024-00339-4
Mathias I Nielsen, Justina C Wolters, Omar G Rosas Bringas, Hua Jiang, Luciano H Di Stefano, Mehrnoosh Oghbaie, Samira Hozeifi, Mats J Nitert, Alienke van Pijkeren, Marieke Smit, Lars Ter Morsche, Apostolos Mourtzinos, Vikram Deshpande, Martin S Taylor, Brian T Chait, John LaCava
Background: Both the expression and activities of LINE-1 (L1) retrotransposons are known to occur in numerous cell-types and are implicated in pathobiological contexts such as aging-related inflammation, autoimmunity, and in cancers. L1s encode two proteins that are translated from bicistronic transcripts. The translation product of ORF1 (ORF1p) has been robustly detected by immunoassays and shotgun mass spectrometry (MS). Yet, more sensitive detection methods would enhance the use of ORF1p as a clinical biomarker. In contrast, until now, no direct evidence of endogenous L1 ORF2 translation to protein (ORF2p) has been shown. Instead, assays for ORF2p have been limited to ectopic L1 ORF over-expression contexts and to indirect detection of endogenous ORF2p enzymatic activity, such as by the sequencing of de novo genomic insertions. Immunoassays for endogenous ORF2p have been problematic, producing apparent false positives due to cross-reactivities, and shotgun MS has not yielded reliable evidence of ORF2p peptides in biological samples.
Results: Here we present targeted mass spectrometry assays, selected and parallel reaction monitoring (SRM and PRM, respectively) to detect and quantify L1 ORF1p and ORF2p at their endogenous abundances. We were able to quantify ORF1p and ORF2p present in our samples down to a range in the low attomoles. Confident in our ability to affinity enrich ORF2p, we describe an interactome associated with endogenous ORF2-containing macromolecular assemblies.
Conclusions: This is the first assay to demonstrate sensitive and robust quantitation of endogenous ORF2p. The ability to assay ORF2p directly and quantitatively will improve our understanding of the developmental and diseased cell states where L1 expression and its activity naturally occur. The ability to simultaneously assay endogenous L1 ORF1p and ORF2p is an important step forward for L1 analytical biochemistry. Endogenous ORF2p interactomes can now be presented with confidence that ORF2p is among the enriched proteins.
{"title":"Targeted detection of endogenous LINE-1 proteins and ORF2p interactions.","authors":"Mathias I Nielsen, Justina C Wolters, Omar G Rosas Bringas, Hua Jiang, Luciano H Di Stefano, Mehrnoosh Oghbaie, Samira Hozeifi, Mats J Nitert, Alienke van Pijkeren, Marieke Smit, Lars Ter Morsche, Apostolos Mourtzinos, Vikram Deshpande, Martin S Taylor, Brian T Chait, John LaCava","doi":"10.1186/s13100-024-00339-4","DOIUrl":"10.1186/s13100-024-00339-4","url":null,"abstract":"<p><strong>Background: </strong>Both the expression and activities of LINE-1 (L1) retrotransposons are known to occur in numerous cell-types and are implicated in pathobiological contexts such as aging-related inflammation, autoimmunity, and in cancers. L1s encode two proteins that are translated from bicistronic transcripts. The translation product of ORF1 (ORF1p) has been robustly detected by immunoassays and shotgun mass spectrometry (MS). Yet, more sensitive detection methods would enhance the use of ORF1p as a clinical biomarker. In contrast, until now, no direct evidence of endogenous L1 ORF2 translation to protein (ORF2p) has been shown. Instead, assays for ORF2p have been limited to ectopic L1 ORF over-expression contexts and to indirect detection of endogenous ORF2p enzymatic activity, such as by the sequencing of de novo genomic insertions. Immunoassays for endogenous ORF2p have been problematic, producing apparent false positives due to cross-reactivities, and shotgun MS has not yielded reliable evidence of ORF2p peptides in biological samples.</p><p><strong>Results: </strong>Here we present targeted mass spectrometry assays, selected and parallel reaction monitoring (SRM and PRM, respectively) to detect and quantify L1 ORF1p and ORF2p at their endogenous abundances. We were able to quantify ORF1p and ORF2p present in our samples down to a range in the low attomoles. Confident in our ability to affinity enrich ORF2p, we describe an interactome associated with endogenous ORF2-containing macromolecular assemblies.</p><p><strong>Conclusions: </strong>This is the first assay to demonstrate sensitive and robust quantitation of endogenous ORF2p. The ability to assay ORF2p directly and quantitatively will improve our understanding of the developmental and diseased cell states where L1 expression and its activity naturally occur. The ability to simultaneously assay endogenous L1 ORF1p and ORF2p is an important step forward for L1 analytical biochemistry. Endogenous ORF2p interactomes can now be presented with confidence that ORF2p is among the enriched proteins.</p>","PeriodicalId":18854,"journal":{"name":"Mobile DNA","volume":"16 1","pages":"3"},"PeriodicalIF":4.7,"publicationDate":"2025-02-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11800616/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143365220","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-01-28DOI: 10.1186/s13100-025-00342-3
Alice M Godden, Benjamin Rix, Simone Immler
Background: Piwi-interacting RNAs (piRNA)s are non-coding small RNAs that post-transcriptionally affect gene expression and regulation. Through complementary seed region binding with transposable elements (TEs), piRNAs protect the genome from transposition. A tool to link piRNAs with complementary TE targets will improve our understanding of the role of piRNAs in genome maintenance and gene regulation. Existing tools such as TEsmall can process sRNA-seq datasets to produce differentially expressed piRNAs, and piRScan developed for nematodes can link piRNAs and TEs but it requires knowledge about the target region of interest and works backwards.
Results: We developed FishPi to predict the pairings between piRNA and TEs for available genomes from zebrafish, medaka and tilapia, with full user customisation of parameters including orientation of piRNA, mismatches in the piRNA seed binding to TE and scored output lists of piRNA-TE matches. FishPi works with individual piRNAs or a list of piRNA sequences in fasta format. The software focuses on the piRNA-TE seed region and analyses reference TEs for piRNA complementarity. TE type is examined, counted and stored to a dictionary, with genomic loci recorded. Any updates to piRNA-TE binding rules can easily be incorporated by changing the seed-region options in the graphic user-interface. FishPi provides a graphic interface using tkinter for the user to input piRNA sequences to generate comprehensive reports on piRNA-TE interactions. FishPi can easily be adapted to genomes from other species and taxa opening the interpretation of piRNA functionality to a wide community.
Conclusions: Users will gain insight into genome mobility and FishPi will help further our understanding of the biological role of piRNAs and their interaction with TEs in a similar way that public databases have improved the access to and the understanding of the role of small RNAs.
{"title":"FishPi: a bioinformatic prediction tool to link piRNA and transposable elements.","authors":"Alice M Godden, Benjamin Rix, Simone Immler","doi":"10.1186/s13100-025-00342-3","DOIUrl":"10.1186/s13100-025-00342-3","url":null,"abstract":"<p><strong>Background: </strong>Piwi-interacting RNAs (piRNA)s are non-coding small RNAs that post-transcriptionally affect gene expression and regulation. Through complementary seed region binding with transposable elements (TEs), piRNAs protect the genome from transposition. A tool to link piRNAs with complementary TE targets will improve our understanding of the role of piRNAs in genome maintenance and gene regulation. Existing tools such as TEsmall can process sRNA-seq datasets to produce differentially expressed piRNAs, and piRScan developed for nematodes can link piRNAs and TEs but it requires knowledge about the target region of interest and works backwards.</p><p><strong>Results: </strong>We developed FishPi to predict the pairings between piRNA and TEs for available genomes from zebrafish, medaka and tilapia, with full user customisation of parameters including orientation of piRNA, mismatches in the piRNA seed binding to TE and scored output lists of piRNA-TE matches. FishPi works with individual piRNAs or a list of piRNA sequences in fasta format. The software focuses on the piRNA-TE seed region and analyses reference TEs for piRNA complementarity. TE type is examined, counted and stored to a dictionary, with genomic loci recorded. Any updates to piRNA-TE binding rules can easily be incorporated by changing the seed-region options in the graphic user-interface. FishPi provides a graphic interface using tkinter for the user to input piRNA sequences to generate comprehensive reports on piRNA-TE interactions. FishPi can easily be adapted to genomes from other species and taxa opening the interpretation of piRNA functionality to a wide community.</p><p><strong>Conclusions: </strong>Users will gain insight into genome mobility and FishPi will help further our understanding of the biological role of piRNAs and their interaction with TEs in a similar way that public databases have improved the access to and the understanding of the role of small RNAs.</p>","PeriodicalId":18854,"journal":{"name":"Mobile DNA","volume":"16 1","pages":"2"},"PeriodicalIF":4.7,"publicationDate":"2025-01-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11773700/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143053085","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-01-04DOI: 10.1186/s13100-024-00338-5
Jessica D Choi, Lelani A Del Pinto, Nathan B Sutter
Background: Messenger RNA 3' untranslated regions (3'UTRs) control many aspects of gene expression and determine where the transcript will terminate. The polyadenylation signal (PAS) AAUAAA (AATAAA in DNA) is a key regulator of transcript termination and this hexamer, or a similar sequence, is very frequently found within 30 bp of 3'UTR ends. Short interspersed element (SINE) retrotransposons are found throughout genomes in high copy numbers. When inserted into genes they can disrupt expression, alter splicing, or cause nuclear retention of mRNAs. The genomes of the domestic dog and other carnivores carry hundreds of thousands of Can-SINEs, a tRNA-related SINE with transcription termination potential. Because of this we asked whether Can-SINEs may terminate transcript in some dog genes.
Results: Each of the dog's nine Can-SINE consensus sequences carry an average of three AATAAA PASs on their sense strands but zero on their antisense strands. Consistent with the idea that Can-SINEs can terminate transcripts, we find that sense-oriented Can-SINEs are approximately ten times more frequent at 3' ends of 3'UTRs compared to further upstream within 3'UTRs. Furthermore, the count of AATAAA PASs on head-to-tail SINE sequences differs significantly between sense and antisense-oriented retrotransposons in transcripts. Can-SINEs near 3'UTR ends are likely to carry an AATAAA motif on the mRNA sense strand while those further upstream are not. We identified loci where Can-SINE insertion has truncated or altered a 3'UTR of the dog genome (dog 3'UTR) compared to the human ortholog. Dog 3'UTRs have peaks of AATAAA PAS frequency at 28, 32, and 36 bp from the end. The periodicity is partly explained by TAAA(n) repeats within Can-SINE AT-rich tails. We annotated all repeat-masked Can-SINE copies in the Boxer reference genome and found that the young SINEC_Cf type has a mode of 15 bp length for target site duplications (TSDs). All dog Can-SINE types favor integration at TSDs beginning with A(4).
Conclusion: Dog Can-SINE retrotransposition has imported AATAAA PASs into gene transcripts and led to alteration of 3'UTRs. AATAAA sequences are selectively removed from Can-SINEs in introns and upstream 3'UTR regions but are retained at the far downstream end of 3'UTRs, which we infer reflects their role as termination sequences for these transcripts.
{"title":"SINE retrotransposons import polyadenylation signals to 3'UTRs in dog (Canis familiaris).","authors":"Jessica D Choi, Lelani A Del Pinto, Nathan B Sutter","doi":"10.1186/s13100-024-00338-5","DOIUrl":"https://doi.org/10.1186/s13100-024-00338-5","url":null,"abstract":"<p><strong>Background: </strong>Messenger RNA 3' untranslated regions (3'UTRs) control many aspects of gene expression and determine where the transcript will terminate. The polyadenylation signal (PAS) AAUAAA (AATAAA in DNA) is a key regulator of transcript termination and this hexamer, or a similar sequence, is very frequently found within 30 bp of 3'UTR ends. Short interspersed element (SINE) retrotransposons are found throughout genomes in high copy numbers. When inserted into genes they can disrupt expression, alter splicing, or cause nuclear retention of mRNAs. The genomes of the domestic dog and other carnivores carry hundreds of thousands of Can-SINEs, a tRNA-related SINE with transcription termination potential. Because of this we asked whether Can-SINEs may terminate transcript in some dog genes.</p><p><strong>Results: </strong>Each of the dog's nine Can-SINE consensus sequences carry an average of three AATAAA PASs on their sense strands but zero on their antisense strands. Consistent with the idea that Can-SINEs can terminate transcripts, we find that sense-oriented Can-SINEs are approximately ten times more frequent at 3' ends of 3'UTRs compared to further upstream within 3'UTRs. Furthermore, the count of AATAAA PASs on head-to-tail SINE sequences differs significantly between sense and antisense-oriented retrotransposons in transcripts. Can-SINEs near 3'UTR ends are likely to carry an AATAAA motif on the mRNA sense strand while those further upstream are not. We identified loci where Can-SINE insertion has truncated or altered a 3'UTR of the dog genome (dog 3'UTR) compared to the human ortholog. Dog 3'UTRs have peaks of AATAAA PAS frequency at 28, 32, and 36 bp from the end. The periodicity is partly explained by TAAA(n) repeats within Can-SINE AT-rich tails. We annotated all repeat-masked Can-SINE copies in the Boxer reference genome and found that the young SINEC_Cf type has a mode of 15 bp length for target site duplications (TSDs). All dog Can-SINE types favor integration at TSDs beginning with A(4).</p><p><strong>Conclusion: </strong>Dog Can-SINE retrotransposition has imported AATAAA PASs into gene transcripts and led to alteration of 3'UTRs. AATAAA sequences are selectively removed from Can-SINEs in introns and upstream 3'UTR regions but are retained at the far downstream end of 3'UTRs, which we infer reflects their role as termination sequences for these transcripts.</p>","PeriodicalId":18854,"journal":{"name":"Mobile DNA","volume":"16 1","pages":"1"},"PeriodicalIF":4.7,"publicationDate":"2025-01-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142927290","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-12-31DOI: 10.1186/s13100-024-00340-x
Irina R Arkhipova, Kathleen H Burns, Pascale Lesage
{"title":"Controlling and controlled elements: highlights of the year in mobile DNA research.","authors":"Irina R Arkhipova, Kathleen H Burns, Pascale Lesage","doi":"10.1186/s13100-024-00340-x","DOIUrl":"10.1186/s13100-024-00340-x","url":null,"abstract":"","PeriodicalId":18854,"journal":{"name":"Mobile DNA","volume":"15 1","pages":"27"},"PeriodicalIF":4.7,"publicationDate":"2024-12-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11689530/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142909997","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-11-15DOI: 10.1186/s13100-024-00336-7
Alexander Belyayev, Begoña Quirós de la Peña, Simon Villanueva Corrales, Shook Ling Low, Barbora Frejová, Zuzana Sejfová, Jiřina Josefiová, Eliška Záveská, Yann J K Bertrand, Jindřich Chrtek, Patrik Mráz
Background: The centromere is one of the key regions of the eukaryotic chromosome. While maintaining its function, centromeric DNA may differ among closely related species. Here, we explored the composition and structure of the pericentromeres (a chromosomal region including a functional centromere) of Hieracium alpinum (Asteraceae), a member of one of the most diverse genera in the plant kingdom. Previously, we identified a pericentromere-specific tandem repeat that made it possible to distinguish reads within the Oxford Nanopore library attributed to the pericentromeres, separating them into a discrete subset and allowing comparison of the repeatome composition of this subset with the remaining genome.
Results: We found that the main satellite DNA (satDNA) monomer forms long arrays of linear and block types in the pericentromeric heterochromatin of H. alpinum, and very often, single reads contain forward and reverse arrays and mirror each other. Beside the major, two new minor satDNA families were discovered. In addition to satDNAs, high amounts of LTR retrotransposons (TEs) with dominant of Tekay lineage, were detected in the pericentromeres. We were able to reconstruct four main TEs of the Ty3-gypsy and Ty1-copia superfamilies and compare their relative positions with satDNAs. The latter showed that the conserved domains (CDs) of the TE proteins are located between the newly discovered satDNAs, which appear to be parts of ancient Tekay LTRs that we were able to reconstruct. The dominant satDNA monomer shows a certain similarity to the GAG CD of the Angela retrotransposon.
Conclusions: The species-specific pericentromeric arrays of the H. alpinum genome are heterogeneous, exhibiting both linear and block type structures. High amounts of forward and reverse arrays of the main satDNA monomer point to multiple microinversions that could be the main mechanism for rapid structural evolution stochastically creating the uniqueness of an individual pericentromeric structure. The traces of TEs insertion waves remain in pericentromeres for a long time, thus "keeping memories" of past genomic events. We counted at least four waves of TEs insertions. In pericentromeres, TEs particles can be transformed into satDNA, which constitutes a background pool of minor families that, under certain conditions, can replace the dominant one(s).
{"title":"Analysis of pericentromere composition and structure elucidated the history of the Hieracium alpinum L. genome, revealing waves of transposable elements insertions.","authors":"Alexander Belyayev, Begoña Quirós de la Peña, Simon Villanueva Corrales, Shook Ling Low, Barbora Frejová, Zuzana Sejfová, Jiřina Josefiová, Eliška Záveská, Yann J K Bertrand, Jindřich Chrtek, Patrik Mráz","doi":"10.1186/s13100-024-00336-7","DOIUrl":"10.1186/s13100-024-00336-7","url":null,"abstract":"<p><strong>Background: </strong>The centromere is one of the key regions of the eukaryotic chromosome. While maintaining its function, centromeric DNA may differ among closely related species. Here, we explored the composition and structure of the pericentromeres (a chromosomal region including a functional centromere) of Hieracium alpinum (Asteraceae), a member of one of the most diverse genera in the plant kingdom. Previously, we identified a pericentromere-specific tandem repeat that made it possible to distinguish reads within the Oxford Nanopore library attributed to the pericentromeres, separating them into a discrete subset and allowing comparison of the repeatome composition of this subset with the remaining genome.</p><p><strong>Results: </strong>We found that the main satellite DNA (satDNA) monomer forms long arrays of linear and block types in the pericentromeric heterochromatin of H. alpinum, and very often, single reads contain forward and reverse arrays and mirror each other. Beside the major, two new minor satDNA families were discovered. In addition to satDNAs, high amounts of LTR retrotransposons (TEs) with dominant of Tekay lineage, were detected in the pericentromeres. We were able to reconstruct four main TEs of the Ty3-gypsy and Ty1-copia superfamilies and compare their relative positions with satDNAs. The latter showed that the conserved domains (CDs) of the TE proteins are located between the newly discovered satDNAs, which appear to be parts of ancient Tekay LTRs that we were able to reconstruct. The dominant satDNA monomer shows a certain similarity to the GAG CD of the Angela retrotransposon.</p><p><strong>Conclusions: </strong>The species-specific pericentromeric arrays of the H. alpinum genome are heterogeneous, exhibiting both linear and block type structures. High amounts of forward and reverse arrays of the main satDNA monomer point to multiple microinversions that could be the main mechanism for rapid structural evolution stochastically creating the uniqueness of an individual pericentromeric structure. The traces of TEs insertion waves remain in pericentromeres for a long time, thus \"keeping memories\" of past genomic events. We counted at least four waves of TEs insertions. In pericentromeres, TEs particles can be transformed into satDNA, which constitutes a background pool of minor families that, under certain conditions, can replace the dominant one(s).</p>","PeriodicalId":18854,"journal":{"name":"Mobile DNA","volume":"15 1","pages":"26"},"PeriodicalIF":4.7,"publicationDate":"2024-11-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11566620/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142644426","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-10-26DOI: 10.1186/s13100-024-00334-9
Pascale Lesage, Emilie Brasset, Gael Cristofari, Clément Gilbert, Didier Mazel, Rita Rebollo, Clémentine Vitte
From April 20 to 23, 2024, three hundred ten researchers from around the world gathered in Saint-Malo, France, at the fourth International Congress on Transposable Elements (ICTE 2024), to present their most recent discoveries on transposable elements (TEs) and exchange ideas and methodologies. ICTE has been held every four years since 2008 (except in 2020, when it was exceptionally transformed into a seminar series due to the Covid-19 pandemic) and is organized by the French network on Mobile Genetic Elements (CNRS GDR 3546). This fourth edition offered two keynote presentations and four sessions presenting the latest findings and encouraging discussions on the following topics: (1) TEs, genome evolution and adaptation; (2) TEs in health and diseases; (3) TE control and epigenetics; (4) Transposition mechanisms and applications. The 2024 edition also included a half-day satellite workshop on new challenges in TE annotation, organized in collaboration with the TE Hub. The meeting gathered long-term TE enthusiasts, as well as newcomers to the field, with 77% of the participants attending ICTE for the first time.
2024年4月20日至23日,来自世界各地的110名研究人员齐聚法国圣马洛(Saint-Malo),参加第四届可转座元件国际大会(ICTE 2024),展示他们在可转座元件(TEs)方面的最新发现,并交流思想和方法。国际可转座元件大会自2008年起每四年举办一次(2020年除外,该年因Covid-19大流行而破例改为系列研讨会),由法国移动遗传元件网络(CNRS GDR 3546)主办。第四届会议提供了两个主旨报告和四个分会场,介绍最新研究成果,并鼓励就以下主题展开讨论:(1) TE、基因组进化和适应;(2) TE 在健康和疾病中的作用;(3) TE 控制和表观遗传学;(4) 转座机制和应用。2024 年会议还包括与 TE Hub 合作举办的为期半天的关于 TE 注释新挑战的卫星研讨会。会议聚集了 TE 领域的长期爱好者和新手,77% 的与会者是首次参加 ICTE。
{"title":"International congress on transposable elements (ICTE 2024) in Saint Malo: breaking down transposon waves and their impact.","authors":"Pascale Lesage, Emilie Brasset, Gael Cristofari, Clément Gilbert, Didier Mazel, Rita Rebollo, Clémentine Vitte","doi":"10.1186/s13100-024-00334-9","DOIUrl":"10.1186/s13100-024-00334-9","url":null,"abstract":"<p><p>From April 20 to 23, 2024, three hundred ten researchers from around the world gathered in Saint-Malo, France, at the fourth International Congress on Transposable Elements (ICTE 2024), to present their most recent discoveries on transposable elements (TEs) and exchange ideas and methodologies. ICTE has been held every four years since 2008 (except in 2020, when it was exceptionally transformed into a seminar series due to the Covid-19 pandemic) and is organized by the French network on Mobile Genetic Elements (CNRS GDR 3546). This fourth edition offered two keynote presentations and four sessions presenting the latest findings and encouraging discussions on the following topics: (1) TEs, genome evolution and adaptation; (2) TEs in health and diseases; (3) TE control and epigenetics; (4) Transposition mechanisms and applications. The 2024 edition also included a half-day satellite workshop on new challenges in TE annotation, organized in collaboration with the TE Hub. The meeting gathered long-term TE enthusiasts, as well as newcomers to the field, with 77% of the participants attending ICTE for the first time.</p>","PeriodicalId":18854,"journal":{"name":"Mobile DNA","volume":"15 1","pages":"25"},"PeriodicalIF":4.7,"publicationDate":"2024-10-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11512509/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142504417","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-10-19DOI: 10.1186/s13100-024-00335-8
Frank Qingyun Wang, Xiao Dang, Huidong Su, Yao Lei, Chun Hing She, Caicai Zhang, Xinxin Chen, Xingtian Yang, Jing Yang, Hong Feng, Wanling Yang
Background: Systemic Lupus Erythematosus (SLE) is a complex autoimmune disorder, and transposable elements (TEs) have been hypothesized to play a significant role in its development. However, limited research has explored this connection. Our study aimed to examine the relationship between TE expression and SLE pathogenesis.
Methods: We analyzed whole blood RNA-seq datasets from 198 SLE patients and 84 healthy controls. The REdiscoverTE pipeline was employed to quantify TE and other gene expressions, identifying differentially expressed TEs. A TE score was calculated to measure overall TE expression for each sample. Gene ontology and gene set enrichment analyses were conducted to explore the functional implications of TE upregulation. Independent datasets were utilized to replicate the results and investigate cell type-specific TE expression.
Results: Our analysis identified two distinct patient groups: one with high TE expression and another with TE expression comparable to controls. Patients with high TE expression exhibited upregulation of pathways involving nucleic acid sensors, and TE expression was strongly correlated with interferon (IFN) signatures. Furthermore, these patients displayed deregulated cell composition, including increased neutrophils and decreased regulatory T cells. Neutrophils were suggested as the primary source of TE expression, contributing to IFN production.
Conclusions: Our findings suggest that TE expression may serve as a crucial mediator in maintaining the activation of interferon pathways, acting as an endogenous source of nucleic acid stimulators in SLE patients.
背景:系统性红斑狼疮(SLE)是一种复杂的自身免疫性疾病:系统性红斑狼疮(SLE)是一种复杂的自身免疫性疾病,转座元件(TE)被认为在其发病过程中起着重要作用。然而,对这一关系的研究还很有限。我们的研究旨在探讨TE表达与系统性红斑狼疮发病机制之间的关系:我们分析了 198 名系统性红斑狼疮患者和 84 名健康对照者的全血 RNA-seq 数据集。采用 REdiscoverTE 管道量化 TE 和其他基因的表达,识别差异表达的 TE。通过计算 TE 分数来衡量每个样本的整体 TE 表达。进行了基因本体和基因组富集分析,以探索 TE 上调的功能意义。利用独立数据集来复制结果并研究细胞类型特异性 TE 表达:我们的分析确定了两个不同的患者组:一个是高 TE 表达组,另一个是 TE 表达与对照组相当的患者组。高TE表达的患者表现出涉及核酸传感器的通路上调,TE表达与干扰素(IFN)特征密切相关。此外,这些患者还表现出细胞组成失调,包括中性粒细胞增加和调节性T细胞减少。中性粒细胞被认为是 TE 表达的主要来源,有助于 IFN 的产生:我们的研究结果表明,TE表达可能是维持干扰素通路激活的关键介质,是系统性红斑狼疮患者核酸刺激物的内源性来源。
{"title":"Association of hyperactivated transposon expression with exacerbated immune activation in systemic lupus erythematosus.","authors":"Frank Qingyun Wang, Xiao Dang, Huidong Su, Yao Lei, Chun Hing She, Caicai Zhang, Xinxin Chen, Xingtian Yang, Jing Yang, Hong Feng, Wanling Yang","doi":"10.1186/s13100-024-00335-8","DOIUrl":"https://doi.org/10.1186/s13100-024-00335-8","url":null,"abstract":"<p><strong>Background: </strong>Systemic Lupus Erythematosus (SLE) is a complex autoimmune disorder, and transposable elements (TEs) have been hypothesized to play a significant role in its development. However, limited research has explored this connection. Our study aimed to examine the relationship between TE expression and SLE pathogenesis.</p><p><strong>Methods: </strong>We analyzed whole blood RNA-seq datasets from 198 SLE patients and 84 healthy controls. The REdiscoverTE pipeline was employed to quantify TE and other gene expressions, identifying differentially expressed TEs. A TE score was calculated to measure overall TE expression for each sample. Gene ontology and gene set enrichment analyses were conducted to explore the functional implications of TE upregulation. Independent datasets were utilized to replicate the results and investigate cell type-specific TE expression.</p><p><strong>Results: </strong>Our analysis identified two distinct patient groups: one with high TE expression and another with TE expression comparable to controls. Patients with high TE expression exhibited upregulation of pathways involving nucleic acid sensors, and TE expression was strongly correlated with interferon (IFN) signatures. Furthermore, these patients displayed deregulated cell composition, including increased neutrophils and decreased regulatory T cells. Neutrophils were suggested as the primary source of TE expression, contributing to IFN production.</p><p><strong>Conclusions: </strong>Our findings suggest that TE expression may serve as a crucial mediator in maintaining the activation of interferon pathways, acting as an endogenous source of nucleic acid stimulators in SLE patients.</p>","PeriodicalId":18854,"journal":{"name":"Mobile DNA","volume":"15 1","pages":"23"},"PeriodicalIF":4.7,"publicationDate":"2024-10-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11490001/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142470306","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-10-19DOI: 10.1186/s13100-024-00331-y
Herui Liao, Yanni Sun, Shujun Ou
Genome annotation is an important but challenging task. Accurate identification of short interspersed nuclear elements (SINEs) is particularly difficult due to their lack of highly conserved sequences. AnnoSINE is state-of-the-art software for annotating SINEs in plant genomes, but it is computationally inefficient for large genomes. Moreover, its applicability to animals is limited due to the absence of animal pHMMs in its HMM library. Therefore, we propose AnnoSINE_v2, which extends accurate SINE annotation for animal genomes with greatly optimized computational efficiency. Our results show that AnnoSINE_v2's annotation of SINEs has over 20% higher F1-score compared to the existing tools on animal genomes and enables the processing of complicated genomes, like human and zebrafish, which were beyond the capabilities of AnnoSINE_v1. AnnoSINE_v2 is freely available on Conda and GitHub: https://github.com/liaoherui/AnnoSINE_v2 .
基因组注释是一项重要但极具挑战性的任务。由于缺乏高度保守的序列,准确识别短穿插核元素(SINEs)尤其困难。AnnoSINE 是注释植物基因组中 SINEs 的最先进软件,但它对大型基因组的计算效率较低。此外,由于其 HMM 库中没有动物 pHMMs,它对动物的适用性也很有限。因此,我们提出了 AnnoSINE_v2,它将精确的 SINE 注释扩展到动物基因组,并大大优化了计算效率。我们的研究结果表明,与现有的动物基因组工具相比,AnnoSINE_v2 的 SINE 注释 F1 分数提高了 20% 以上,而且还能处理人类和斑马鱼等复杂基因组,而这些都是 AnnoSINE_v1 所无法胜任的。AnnoSINE_v2 可在 Conda 和 GitHub 上免费获取:https://github.com/liaoherui/AnnoSINE_v2 。
{"title":"Accelerating de novo SINE annotation in plant and animal genomes.","authors":"Herui Liao, Yanni Sun, Shujun Ou","doi":"10.1186/s13100-024-00331-y","DOIUrl":"https://doi.org/10.1186/s13100-024-00331-y","url":null,"abstract":"<p><p>Genome annotation is an important but challenging task. Accurate identification of short interspersed nuclear elements (SINEs) is particularly difficult due to their lack of highly conserved sequences. AnnoSINE is state-of-the-art software for annotating SINEs in plant genomes, but it is computationally inefficient for large genomes. Moreover, its applicability to animals is limited due to the absence of animal pHMMs in its HMM library. Therefore, we propose AnnoSINE_v2, which extends accurate SINE annotation for animal genomes with greatly optimized computational efficiency. Our results show that AnnoSINE_v2's annotation of SINEs has over 20% higher F1-score compared to the existing tools on animal genomes and enables the processing of complicated genomes, like human and zebrafish, which were beyond the capabilities of AnnoSINE_v1. AnnoSINE_v2 is freely available on Conda and GitHub: https://github.com/liaoherui/AnnoSINE_v2 .</p>","PeriodicalId":18854,"journal":{"name":"Mobile DNA","volume":"15 1","pages":"24"},"PeriodicalIF":4.7,"publicationDate":"2024-10-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11490119/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142470305","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-10-16DOI: 10.1186/s13100-024-00332-x
Jacopo Martelossi, Mariangela Iannello, Fabrizio Ghiselli, Andrea Luchetti
Background: Short interspersed nuclear elements (SINEs) are non-autonomous non-LTR retrotransposons widespread across eukaryotes. They exist both as lineage-specific, fast-evolving elements and as ubiquitous superfamilies characterized by highly conserved domains (HCD). Several of these superfamilies have been described in bivalves, however their overall distribution and impact on host genome evolution are still unknown due to the extreme scarcity of transposon libraries for the clade. In this study, we examined more than 40 bivalve genomes to uncover the distribution of HCD-tRNA-related SINEs, discover novel SINE-LINE partnerships, and understand their possible role in shaping bivalve genome evolution.
Results: We found that bivalve HCD SINEs have an ancient origin, and they can rely on at least four different LINE clades. According to a "mosaic" evolutionary scenario, multiple LINE partner can promote the amplification of the same HCD SINE superfamilies while homologues LINE-derived tails are present between different superfamilies. Multiple SINEs were found to be highly similar between phylogenetically related species but separated by extremely long evolutionary timescales, up to ~ 400 million years. Studying their genomic distribution in a subset of five species, we observed different patterns of SINE enrichment in various genomic compartments as well as differences in the tendency of SINEs to form tandem-like and palindromic structures also within intronic sequences. Despite these differences, we observed that SINEs, especially older ones, tend to accumulate preferentially within genes, or in their close proximity, consistently with a model of survival bias for less harmful, short non-coding transposons in euchromatic genomic regions.
Conclusion: Here we conducted a wide characterization of tRNA-related SINEs in bivalves revealing their taxonomic distribution and LINE partnerships across the clade. Moreover, through the study of their genomic distribution in five species, we highlighted commonalities and differences with other previously studied eukaryotes, thus extending our understanding of SINE evolution across the tree of life.
背景:短间隔核元素(SINEs)是广泛存在于真核生物中的非自主性非LTR逆转录转座子。它们既可以作为种系特异的快速进化元件而存在,也可以作为以高度保守结构域(HCD)为特征的无处不在的超家族而存在。这些超家族中有几个已经在双壳类动物中得到了描述,但由于双壳类动物的转座子文库极其稀少,它们的总体分布及其对宿主基因组进化的影响仍然未知。在这项研究中,我们研究了 40 多个双壳类基因组,以揭示 HCD-tRNA 相关 SINE 的分布,发现新的 SINE-LINE 伙伴关系,并了解它们在塑造双壳类基因组进化过程中可能扮演的角色:结果:我们发现双壳类动物的HCD SINEs起源古老,它们至少依赖于四个不同的LINE支系。根据 "马赛克 "进化设想,多个 LINE 伙伴可促进相同 HCD SINE 超家族的扩增,而同源 LINE 衍生的尾部则存在于不同超家族之间。研究发现,多个 SINEs 在系统发育相关的物种之间高度相似,但它们之间的进化时间尺度却极长,最长可达 4 亿年。通过研究它们在五个物种亚群中的基因组分布,我们观察到 SINE 在不同基因组区块中的富集模式不同,而且 SINE 在内含子序列中形成串联结构和回文结构的趋势也不同。尽管存在这些差异,但我们观察到,SINEs,尤其是较老的 SINEs,倾向于优先在基因内或基因附近积累,这与有害性较低的短非编码转座子在染色体外基因组区域的生存偏倚模型是一致的:在这里,我们对双壳类动物中与 tRNA 相关的 SINEs 进行了广泛的特征描述,揭示了它们在分类学上的分布以及整个类群中的 LINE 伙伴关系。此外,通过研究它们在五个物种中的基因组分布,我们强调了它们与之前研究过的其他真核生物的共同点和不同点,从而扩展了我们对整个生命树中 SINE 演化的了解。
{"title":"Widespread HCD-tRNA derived SINEs in bivalves rely on multiple LINE partners and accumulate in genic regions.","authors":"Jacopo Martelossi, Mariangela Iannello, Fabrizio Ghiselli, Andrea Luchetti","doi":"10.1186/s13100-024-00332-x","DOIUrl":"https://doi.org/10.1186/s13100-024-00332-x","url":null,"abstract":"<p><strong>Background: </strong>Short interspersed nuclear elements (SINEs) are non-autonomous non-LTR retrotransposons widespread across eukaryotes. They exist both as lineage-specific, fast-evolving elements and as ubiquitous superfamilies characterized by highly conserved domains (HCD). Several of these superfamilies have been described in bivalves, however their overall distribution and impact on host genome evolution are still unknown due to the extreme scarcity of transposon libraries for the clade. In this study, we examined more than 40 bivalve genomes to uncover the distribution of HCD-tRNA-related SINEs, discover novel SINE-LINE partnerships, and understand their possible role in shaping bivalve genome evolution.</p><p><strong>Results: </strong>We found that bivalve HCD SINEs have an ancient origin, and they can rely on at least four different LINE clades. According to a \"mosaic\" evolutionary scenario, multiple LINE partner can promote the amplification of the same HCD SINE superfamilies while homologues LINE-derived tails are present between different superfamilies. Multiple SINEs were found to be highly similar between phylogenetically related species but separated by extremely long evolutionary timescales, up to ~ 400 million years. Studying their genomic distribution in a subset of five species, we observed different patterns of SINE enrichment in various genomic compartments as well as differences in the tendency of SINEs to form tandem-like and palindromic structures also within intronic sequences. Despite these differences, we observed that SINEs, especially older ones, tend to accumulate preferentially within genes, or in their close proximity, consistently with a model of survival bias for less harmful, short non-coding transposons in euchromatic genomic regions.</p><p><strong>Conclusion: </strong>Here we conducted a wide characterization of tRNA-related SINEs in bivalves revealing their taxonomic distribution and LINE partnerships across the clade. Moreover, through the study of their genomic distribution in five species, we highlighted commonalities and differences with other previously studied eukaryotes, thus extending our understanding of SINE evolution across the tree of life.</p>","PeriodicalId":18854,"journal":{"name":"Mobile DNA","volume":"15 1","pages":"22"},"PeriodicalIF":4.7,"publicationDate":"2024-10-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11481361/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142470308","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-10-12DOI: 10.1186/s13100-024-00329-6
Weronika Mikina, Paweł Hałakuc, Rafał Milanowski
{"title":"Correction: Transposon-derived introns as an element shaping the structure of eukaryotic genomes.","authors":"Weronika Mikina, Paweł Hałakuc, Rafał Milanowski","doi":"10.1186/s13100-024-00329-6","DOIUrl":"https://doi.org/10.1186/s13100-024-00329-6","url":null,"abstract":"","PeriodicalId":18854,"journal":{"name":"Mobile DNA","volume":"15 1","pages":"21"},"PeriodicalIF":4.7,"publicationDate":"2024-10-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11470543/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142470307","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}