首页 > 最新文献

Mobile DNA最新文献

英文 中文
Tandem LTR-retrotransposon structures are common and highly polymorphic in plant genomes. 串联ltr -反转录转座子结构是植物基因组中常见的高度多态性结构。
IF 4.7 2区 生物学 Q1 GENETICS & HEREDITY Pub Date : 2025-03-12 DOI: 10.1186/s13100-025-00347-y
Noemia Morales-Díaz, Svitlana Sushko, Lucía Campos-Dominguez, Venkataramana Kopalli, Agnieszka A Golicz, Raúl Castanera, Josep M Casacuberta

Background: LTR-retrotransposons (LTR-RT) are a major component of plant genomes and important drivers of genome evolution. Most LTR-RT copies in plant genomes are defective elements found as truncated copies, nested insertions or as part of more complex structures. The recent availability of highly contiguous plant genome assemblies based on long-read sequences now allows to perform detailed characterization of these complex structures and to evaluate their importance for plant genome evolution.

Results: The detailed analysis of two rice loci containing complex LTR-RT structures showed that they consist of tandem arrays of LTR copies sharing internal LTRs. Our analyses suggests that these LTR-RT tandems are the result of a single insertion and not of the recombination of two independent LTR-RT elements. Our results also suggest that gypsy elements may be more prone to form these structures. We show that these structures are highly polymorphic in rice and therefore have the potential to generate genetic variability. We have developed a computational pipeline (IDENTAM) that scans genome sequences and identifies tandem LTR-RT candidates. Using this tool, we have detected 266 tandems in a pangenome built from the genomes of 76 accessions of cultivated and wild rice, showing that tandem LTR-RT structures are frequent and highly polymorphic in rice. Running IDENTAM in the Arabidopsis, almond and cotton genomes showed that LTR-RT tandems are frequent in plant genomes of different size, complexity and ploidy level. The complexity of differentiating intra-element variations at the nucleotide level among haplotypes is very high, and we found that graph-based pangenomic methodologies are appropriate to resolve these structures.

Conclusions: Our results show that LTR-RT elements can form tandem arrays. These structures are relatively abundant and highly polymorphic in rice and are widespread in the plant kingdom. Future studies will contribute to understanding how these structures originate and whether the variability that they generate has a functional impact.

背景:ltr -反转录转座子(LTR-RT)是植物基因组的重要组成部分,也是基因组进化的重要驱动因素。植物基因组中的大多数LTR-RT拷贝都是有缺陷的元素,如截断拷贝、嵌套插入或作为更复杂结构的一部分。最近基于长读序列的高度连续植物基因组组装的可用性现在允许对这些复杂结构进行详细表征并评估它们对植物基因组进化的重要性。结果:对两个含有复杂LTR- rt结构的水稻基因座的详细分析表明,它们是由共享内部LTR的LTR拷贝串联阵列组成的。我们的分析表明,这些LTR-RT串联是单次插入的结果,而不是两个独立LTR-RT元件的重组。我们的结果也表明吉普赛人元素可能更容易形成这些结构。我们发现这些结构在水稻中是高度多态的,因此有可能产生遗传变异。我们已经开发了一个计算管道(IDENTAM),扫描基因组序列并识别串联LTR-RT候选物。利用该工具,我们在76份栽培稻和野生稻的基因组构建的泛基因组中检测到266个串联,表明串联LTR-RT结构在水稻中是频繁且高度多态性的。对拟南芥、杏仁和棉花基因组进行IDENTAM分析表明,LTR-RT序列在不同大小、复杂程度和倍性水平的植物基因组中都很常见。在单倍型中区分核苷酸水平上的元件内变异的复杂性非常高,我们发现基于图的全基因组学方法适用于解决这些结构。结论:我们的研究结果表明,LTR-RT元件可以形成串联阵列。这些结构在水稻中相对丰富且高度多态,在植物界广泛存在。未来的研究将有助于了解这些结构是如何产生的,以及它们产生的变异性是否对功能有影响。
{"title":"Tandem LTR-retrotransposon structures are common and highly polymorphic in plant genomes.","authors":"Noemia Morales-Díaz, Svitlana Sushko, Lucía Campos-Dominguez, Venkataramana Kopalli, Agnieszka A Golicz, Raúl Castanera, Josep M Casacuberta","doi":"10.1186/s13100-025-00347-y","DOIUrl":"10.1186/s13100-025-00347-y","url":null,"abstract":"<p><strong>Background: </strong>LTR-retrotransposons (LTR-RT) are a major component of plant genomes and important drivers of genome evolution. Most LTR-RT copies in plant genomes are defective elements found as truncated copies, nested insertions or as part of more complex structures. The recent availability of highly contiguous plant genome assemblies based on long-read sequences now allows to perform detailed characterization of these complex structures and to evaluate their importance for plant genome evolution.</p><p><strong>Results: </strong>The detailed analysis of two rice loci containing complex LTR-RT structures showed that they consist of tandem arrays of LTR copies sharing internal LTRs. Our analyses suggests that these LTR-RT tandems are the result of a single insertion and not of the recombination of two independent LTR-RT elements. Our results also suggest that gypsy elements may be more prone to form these structures. We show that these structures are highly polymorphic in rice and therefore have the potential to generate genetic variability. We have developed a computational pipeline (IDENTAM) that scans genome sequences and identifies tandem LTR-RT candidates. Using this tool, we have detected 266 tandems in a pangenome built from the genomes of 76 accessions of cultivated and wild rice, showing that tandem LTR-RT structures are frequent and highly polymorphic in rice. Running IDENTAM in the Arabidopsis, almond and cotton genomes showed that LTR-RT tandems are frequent in plant genomes of different size, complexity and ploidy level. The complexity of differentiating intra-element variations at the nucleotide level among haplotypes is very high, and we found that graph-based pangenomic methodologies are appropriate to resolve these structures.</p><p><strong>Conclusions: </strong>Our results show that LTR-RT elements can form tandem arrays. These structures are relatively abundant and highly polymorphic in rice and are widespread in the plant kingdom. Future studies will contribute to understanding how these structures originate and whether the variability that they generate has a functional impact.</p>","PeriodicalId":18854,"journal":{"name":"Mobile DNA","volume":"16 1","pages":"10"},"PeriodicalIF":4.7,"publicationDate":"2025-03-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11899658/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143616257","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Marine vs. terrestrial: links between the environment and the diversity of Copia retrotransposon in metazoans. 海洋与陆地:环境与后生动物中Copia反转录转座子多样性之间的联系。
IF 4.7 2区 生物学 Q1 GENETICS & HEREDITY Pub Date : 2025-03-08 DOI: 10.1186/s13100-025-00346-z
Khouloud Klai, Sarah Farhat, Laure Lamothe, Dominique Higuet, Éric Bonnivard

Background: LTR-retrotransposons are widely distributed among the eukaryote tree of life and have extensive impacts on genome evolution. Among the three canonical superfamilies, the Copia superfamily demonstrates the lowest abundances and repartitions among metazoans. To better understand their dynamics, we have conducted the first large-scale study of LTR-retrotransposon diversity in metazoans and we report on the diversity and distribution of the Copia elements.

Results: We have identified over than 2,300 Copia elements from 263 metazoan genomes. The sequences were annotated at the clade level based on the classification of their RT/RNaseH domain. Our results confirmed that Copia are scarce in metazoans. However, we observed a great variation in Copia abundance between taxa. Surprisingly, some genomes, had a record number of copies, especially in Squamata. In contrast, terrestrial Deuterostomia display a clear loss of Copia diversity leading to their disappearance in some taxa. Additionally, we identified 18 new clades, tripling the number of previously defined clades. By studying more than 50 widespread taxa, we believe that most metazoan Copia clades have now been identified. The most striking result is that environment appears to be related to Copia distribution. We defined two sets of clades characterizing marine or terrestrial taxa. This two-sided pattern could be partially explained by horizontal transfers within both environments.

Conclusions: This research enhances our understanding of transposable element evolution and emphasizes the influence of sharing the same ecological contexts on genomic diversity, and highlights the importance of annotating them at the clade level to characterize their evolutionary dynamics.

背景:ltr -反转录转座子广泛分布于真核生物生命树中,对基因组进化有着广泛的影响。在三个典型的超家族中,Copia超家族在后生动物中表现出最低的丰度和重划分。为了更好地了解它们的动态,我们进行了后生动物中ltr -反转录转座子多样性的首次大规模研究,并报道了Copia元件的多样性和分布。结果:我们从263个后生动物基因组中鉴定出2300多个Copia元件。根据其RT/RNaseH结构域的分类,在进化水平上对这些序列进行注释。我们的结果证实,Copia在后生动物中是稀缺的。但不同分类群间Copia丰度差异较大。令人惊讶的是,一些基因组具有创纪录的拷贝数,尤其是在Squamata中。相比之下,陆生后口动物的Copia多样性明显丧失,导致其在某些分类群中消失。此外,我们确定了18个新的分支,是以前定义的分支数量的三倍。通过对50多个分布广泛的分类群的研究,我们认为大多数后生的Copia分支现在已经被确定。最引人注目的结果是环境似乎与Copia分布有关。我们定义了两组演化枝来表征海洋或陆地分类群。这种双面模式可以用两种环境中的水平转移来部分解释。结论:本研究提高了我们对转座因子进化的认识,强调了共享相同生态环境对基因组多样性的影响,并强调了在支系水平上对其进行注释以表征其进化动态的重要性。
{"title":"Marine vs. terrestrial: links between the environment and the diversity of Copia retrotransposon in metazoans.","authors":"Khouloud Klai, Sarah Farhat, Laure Lamothe, Dominique Higuet, Éric Bonnivard","doi":"10.1186/s13100-025-00346-z","DOIUrl":"10.1186/s13100-025-00346-z","url":null,"abstract":"<p><strong>Background: </strong>LTR-retrotransposons are widely distributed among the eukaryote tree of life and have extensive impacts on genome evolution. Among the three canonical superfamilies, the Copia superfamily demonstrates the lowest abundances and repartitions among metazoans. To better understand their dynamics, we have conducted the first large-scale study of LTR-retrotransposon diversity in metazoans and we report on the diversity and distribution of the Copia elements.</p><p><strong>Results: </strong>We have identified over than 2,300 Copia elements from 263 metazoan genomes. The sequences were annotated at the clade level based on the classification of their RT/RNaseH domain. Our results confirmed that Copia are scarce in metazoans. However, we observed a great variation in Copia abundance between taxa. Surprisingly, some genomes, had a record number of copies, especially in Squamata. In contrast, terrestrial Deuterostomia display a clear loss of Copia diversity leading to their disappearance in some taxa. Additionally, we identified 18 new clades, tripling the number of previously defined clades. By studying more than 50 widespread taxa, we believe that most metazoan Copia clades have now been identified. The most striking result is that environment appears to be related to Copia distribution. We defined two sets of clades characterizing marine or terrestrial taxa. This two-sided pattern could be partially explained by horizontal transfers within both environments.</p><p><strong>Conclusions: </strong>This research enhances our understanding of transposable element evolution and emphasizes the influence of sharing the same ecological contexts on genomic diversity, and highlights the importance of annotating them at the clade level to characterize their evolutionary dynamics.</p>","PeriodicalId":18854,"journal":{"name":"Mobile DNA","volume":"16 1","pages":"9"},"PeriodicalIF":4.7,"publicationDate":"2025-03-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11889832/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143586217","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Transposable elements in genomic architecture of Monilinia fungal phytopathogens and TE-driven DMI-resistance adaptation. 念珠菌属植物病原体基因组结构中的转座因子及te驱动的dmi抗性适应。
IF 4.7 2区 生物学 Q1 GENETICS & HEREDITY Pub Date : 2025-03-07 DOI: 10.1186/s13100-025-00343-2
Muhammed Raşit Durak, Hilal Özkılınç

Background: Fungicide resistance poses a significant challenge to plant disease management and influences the evolutionary dynamics of fungal pathogens. Besides being important phytopathogens, Monilinia species have become a model for discovering many fundamental questions related to fungal pathosystems. In this study, DMI-propiconazole sensitivity was investigated in view of transposable element (TE) dynamics in M. fructicola and M. laxa.

Results: Propiconazole-sensitivity of 109 M. fructicola and 20 M. laxa isolates from different regions of Türkiye was assessed. Comprehensive TE identification within the species revealed that Class I elements were predominant, and TEs constituted approximately 9% of the genome for both M. fructicola and M. laxa, with a total of 15,327 and 10,710 TEs, respectively. An experimental evolution plan was developed for Monilinia that allows observing phenotypic and genotypic changes over successive generations under controlled selection pressures. Dynamic changes in TE content were discovered throughout the experimental evolution of M. fructicola under propiconazole pressure. With a net change of 187 TEs, the evolved strain showed an expansion of TE sequences, whereas different TE classes displayed diverse patterns of increase/decrease. Additionally, the presence of a nested TE upstream of the CYP51 gene was observed in less-sensitive M. fructicola isolates but absent in highly-sensitive ones. Gene expressions of CYP51 differed significantly between TE-containing and TE-lacking isolates, strongly supporting the contribution of this TE to fungicide resistance.

Conclusion: This study establishes a critical link between TEs and DMI fungicide resistance by associating a nested TE with reduced sensitivity to propiconazole. We introduce an innovative experimental evolution framework for studying genomic changes under selective pressure and provide a comprehensive characterization of Monilinia TEs. These findings significantly advance our understanding of molecular resistance mechanisms in fungal pathogens, offering insights for more effective disease management.

背景:杀菌剂抗性对植物病害管理提出了重大挑战,并影响真菌病原体的进化动态。除了是重要的植物病原体外,念珠菌已成为发现与真菌病理系统有关的许多基本问题的模型。本研究从转座因子(TE)动力学的角度研究了聚果糖密索菌和拉克斯密索菌对dmi -丙环唑的敏感性。结果:对 rkiye不同地区分离的109株果分枝杆菌和20株laxa分枝杆菌进行丙环唑敏感性分析。综合TE鉴定结果显示,ⅰ类元素占主导地位,TE约占9%,分别为15,327个和10,710个。在可控的选择压力下,为念珠菌制定了一个实验进化计划,以观察其连续几代的表型和基因型变化。在丙环康唑的作用下,果糖乳杆菌的TE含量在整个实验进化过程中都发生了动态变化。进化菌株的TE序列净变化为187个,但不同TE类别的增加/减少模式不同。此外,在低敏感性的果实分枝杆菌分离株中发现了CYP51基因上游的巢状TE,而在高敏感性的分离株中则没有。CYP51基因的表达在TE含量和缺乏TE的菌株之间存在显著差异,这有力地支持了TE对杀菌剂抗性的贡献。结论:本研究通过将巢状TE与丙环唑敏感性降低联系起来,建立了TE与DMI杀菌剂耐药性之间的关键联系。我们引入了一个创新的实验进化框架来研究选择压力下的基因组变化,并提供了Monilinia TEs的全面表征。这些发现大大促进了我们对真菌病原体分子耐药机制的理解,为更有效的疾病管理提供了见解。
{"title":"Transposable elements in genomic architecture of Monilinia fungal phytopathogens and TE-driven DMI-resistance adaptation.","authors":"Muhammed Raşit Durak, Hilal Özkılınç","doi":"10.1186/s13100-025-00343-2","DOIUrl":"10.1186/s13100-025-00343-2","url":null,"abstract":"<p><strong>Background: </strong>Fungicide resistance poses a significant challenge to plant disease management and influences the evolutionary dynamics of fungal pathogens. Besides being important phytopathogens, Monilinia species have become a model for discovering many fundamental questions related to fungal pathosystems. In this study, DMI-propiconazole sensitivity was investigated in view of transposable element (TE) dynamics in M. fructicola and M. laxa.</p><p><strong>Results: </strong>Propiconazole-sensitivity of 109 M. fructicola and 20 M. laxa isolates from different regions of Türkiye was assessed. Comprehensive TE identification within the species revealed that Class I elements were predominant, and TEs constituted approximately 9% of the genome for both M. fructicola and M. laxa, with a total of 15,327 and 10,710 TEs, respectively. An experimental evolution plan was developed for Monilinia that allows observing phenotypic and genotypic changes over successive generations under controlled selection pressures. Dynamic changes in TE content were discovered throughout the experimental evolution of M. fructicola under propiconazole pressure. With a net change of 187 TEs, the evolved strain showed an expansion of TE sequences, whereas different TE classes displayed diverse patterns of increase/decrease. Additionally, the presence of a nested TE upstream of the CYP51 gene was observed in less-sensitive M. fructicola isolates but absent in highly-sensitive ones. Gene expressions of CYP51 differed significantly between TE-containing and TE-lacking isolates, strongly supporting the contribution of this TE to fungicide resistance.</p><p><strong>Conclusion: </strong>This study establishes a critical link between TEs and DMI fungicide resistance by associating a nested TE with reduced sensitivity to propiconazole. We introduce an innovative experimental evolution framework for studying genomic changes under selective pressure and provide a comprehensive characterization of Monilinia TEs. These findings significantly advance our understanding of molecular resistance mechanisms in fungal pathogens, offering insights for more effective disease management.</p>","PeriodicalId":18854,"journal":{"name":"Mobile DNA","volume":"16 1","pages":"8"},"PeriodicalIF":4.7,"publicationDate":"2025-03-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11887251/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143586303","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
On the origin of the P-element invasion in Drosophila simulans. 论拟果蝇p元素入侵的起源。
IF 4.7 2区 生物学 Q1 GENETICS & HEREDITY Pub Date : 2025-02-26 DOI: 10.1186/s13100-025-00345-0
Filip Wierzbicki, Riccardo Pianezza, Divya Selvaraju, Madeleine Maria Eller, Robert Kofler

The horizontal transfer (HT) of the P-element is one of the best documented cases of the HT of a transposable element. The P-element invaded natural D. melanogaster populations between 1950 and 1980 following its HT from Drosophila willistoni, a species endemic to South and Central America. Subsequently, it spread in D. simulans populations between 2006 and 2014, following a HT from D. melanogaster. The geographic region where the spread into D. simulans occurred is unclear, as both involved species are cosmopolitan. The P-element differs between these two species by a single base substitution at site 2040, where D. melanogaster carries a 'G' and D. simulans carries an 'A'. It has been hypothesized that this base substitution was a necessary adaptation that enabled the spread of the P-element in D. simulans, potentially explaining the 30-50-year lag between the invasions of D. melanogaster and D. simulans. To test this hypothesis, we monitored the invasion dynamics of P-elements with both alleles in experimental populations of D. melanogaster and D. simulans. Our results indicate that the allele at site 2040 has a minimal impact on the invasion dynamics of the P-element and, therefore, was not necessary for the invasion of D. simulans. However, we found that the host species significantly influenced the invasion dynamics, with higher P-element copy numbers accumulating in D. melanogaster than in D. simulans. Finally, based on SNPs segregating in natural D. melanogaster populations, we suggest that the horizontal transfer of the P-element from D. melanogaster to D. simulans likely occurred around Tasmania.

p -元的水平转移(HT)是文献记载最好的转座元的水平转移之一。1950 - 1980年间,p元素从中南美洲特有物种威利斯托尼果蝇(Drosophila willistoni)侵染而来,侵入黑腹田鼠自然种群。随后,在2006年至2014年期间,在黑腹d.m anogaster的HT之后,它在d.m anans种群中传播。由于涉及的两个物种都是世界性的,因此尚不清楚在哪个地理区域发生了向拟象d的传播。这两个物种的p元素的不同之处在于在2040位点有一个碱基替换,其中D. melanogaster携带一个“G”,D. simulans携带一个“a”。据推测,这种碱基替换是一种必要的适应,使得p元素在D. simulans中传播,这可能解释了D. melanogaster和D. simulans入侵之间30-50年的滞后。为了验证这一假设,我们在实验群体中监测了p元素与两个等位基因的入侵动态。我们的研究结果表明,位点2040的等位基因对p元素的入侵动力学影响最小,因此不是D. simulans入侵的必要条件。然而,我们发现寄主物种对入侵动态有显著的影响,黑腹田鼠体内积累的p元素拷贝数高于模拟田鼠。最后,基于自然种群的snp分离,我们认为p元素的水平转移可能发生在塔斯马尼亚附近。
{"title":"On the origin of the P-element invasion in Drosophila simulans.","authors":"Filip Wierzbicki, Riccardo Pianezza, Divya Selvaraju, Madeleine Maria Eller, Robert Kofler","doi":"10.1186/s13100-025-00345-0","DOIUrl":"10.1186/s13100-025-00345-0","url":null,"abstract":"<p><p>The horizontal transfer (HT) of the P-element is one of the best documented cases of the HT of a transposable element. The P-element invaded natural D. melanogaster populations between 1950 and 1980 following its HT from Drosophila willistoni, a species endemic to South and Central America. Subsequently, it spread in D. simulans populations between 2006 and 2014, following a HT from D. melanogaster. The geographic region where the spread into D. simulans occurred is unclear, as both involved species are cosmopolitan. The P-element differs between these two species by a single base substitution at site 2040, where D. melanogaster carries a 'G' and D. simulans carries an 'A'. It has been hypothesized that this base substitution was a necessary adaptation that enabled the spread of the P-element in D. simulans, potentially explaining the 30-50-year lag between the invasions of D. melanogaster and D. simulans. To test this hypothesis, we monitored the invasion dynamics of P-elements with both alleles in experimental populations of D. melanogaster and D. simulans. Our results indicate that the allele at site 2040 has a minimal impact on the invasion dynamics of the P-element and, therefore, was not necessary for the invasion of D. simulans. However, we found that the host species significantly influenced the invasion dynamics, with higher P-element copy numbers accumulating in D. melanogaster than in D. simulans. Finally, based on SNPs segregating in natural D. melanogaster populations, we suggest that the horizontal transfer of the P-element from D. melanogaster to D. simulans likely occurred around Tasmania.</p>","PeriodicalId":18854,"journal":{"name":"Mobile DNA","volume":"16 1","pages":"7"},"PeriodicalIF":4.7,"publicationDate":"2025-02-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11863927/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143516163","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Alternative splicing of transposable elements in human breast cancer. 人乳腺癌中转座因子的选择性剪接。
IF 3.1 2区 生物学 Q1 GENETICS & HEREDITY Pub Date : 2025-02-22 DOI: 10.1186/s13100-025-00341-4
Alex Nesta, Diogo F T Veiga, Jacques Banchereau, Olga Anczukow, Christine R Beck

Transposable elements (TEs) drive genome evolution and can affect gene expression through diverse mechanisms. In breast cancer, disrupted regulation of TE sequences may facilitate tumor-specific transcriptomic alterations. We examine 142,514 full-length isoforms derived from long-read RNA sequencing (LR-seq) of 30 breast samples to investigate the effects of TEs on the breast cancer transcriptome. Approximately half of these isoforms contain TE sequences, and these contribute to half of the novel annotated splice junctions. We quantify splicing of these LR-seq derived isoforms in 1,135 breast tumors from The Cancer Genome Atlas (TCGA) and 1,329 healthy tissue samples from the Genotype-Tissue Expression (GTEx), and find 300 TE-overlapping tumor-specific splicing events. Some splicing events are enriched in specific breast cancer subtypes - for example, a TE-driven transcription start site upstream of ERBB2 in HER2 + tumors, and several TE-mediated splicing events are associated with patient survival and poor prognosis. The full-length sequences we capture with LR-seq reveal thousands of isoforms with signatures of RNA editing, including a novel isoform belonging to RHOA; a gene previously implicated in tumor progression. We utilize our full-length isoforms to discover polymorphic TE insertions that alter splicing and validate one of these events in breast cancer cell lines. Together, our results demonstrate the widespread effects of dysregulated TEs on breast cancer transcriptomes and highlight the advantages of long-read isoform sequencing for understanding TE biology. TE-derived isoforms may alter the expression of genes important in cancer and can potentially be used as novel, disease-specific therapeutic targets or biomarkers.One sentence summary: Transposable elements generate alternative isoforms and alter post-transcriptional regulation in human breast cancer.

转座因子(te)驱动基因组进化并通过多种机制影响基因表达。在乳腺癌中,TE序列的紊乱可能促进肿瘤特异性转录组改变。为了研究TEs对乳腺癌转录组的影响,我们对30个乳腺样本的长读RNA测序(LR-seq)得出的142,514个全长同种异构体进行了研究。这些同工异构体中大约有一半含有TE序列,并且这些序列贡献了一半的新注释剪接连接。我们量化了来自癌症基因组图谱(TCGA)的1135个乳腺肿瘤和来自基因型组织表达(GTEx)的1329个健康组织样本中这些LR-seq衍生同种异构体的剪接,发现了300个te重叠的肿瘤特异性剪接事件。一些剪接事件在特定的乳腺癌亚型中丰富-例如,在HER2 +肿瘤中,te驱动的转录起始位点位于ERBB2上游,并且一些te介导的剪接事件与患者的生存和不良预后相关。我们用LR-seq捕获的全长序列揭示了数千种具有RNA编辑特征的同种异构体,包括属于RHOA的新型同种异构体;一个先前与肿瘤进展有关的基因。我们利用我们的全长同种异构体来发现改变剪接的多态TE插入,并在乳腺癌细胞系中验证这些事件之一。总之,我们的研究结果证明了TE失调对乳腺癌转录组的广泛影响,并强调了长读异构体测序在理解TE生物学方面的优势。te衍生的同种异构体可能会改变癌症中重要基因的表达,并可能被用作新的、疾病特异性的治疗靶点或生物标志物。一句话总结:转座因子在人类乳腺癌中产生可选择的异构体并改变转录后调控。
{"title":"Alternative splicing of transposable elements in human breast cancer.","authors":"Alex Nesta, Diogo F T Veiga, Jacques Banchereau, Olga Anczukow, Christine R Beck","doi":"10.1186/s13100-025-00341-4","DOIUrl":"10.1186/s13100-025-00341-4","url":null,"abstract":"<p><p>Transposable elements (TEs) drive genome evolution and can affect gene expression through diverse mechanisms. In breast cancer, disrupted regulation of TE sequences may facilitate tumor-specific transcriptomic alterations. We examine 142,514 full-length isoforms derived from long-read RNA sequencing (LR-seq) of 30 breast samples to investigate the effects of TEs on the breast cancer transcriptome. Approximately half of these isoforms contain TE sequences, and these contribute to half of the novel annotated splice junctions. We quantify splicing of these LR-seq derived isoforms in 1,135 breast tumors from The Cancer Genome Atlas (TCGA) and 1,329 healthy tissue samples from the Genotype-Tissue Expression (GTEx), and find 300 TE-overlapping tumor-specific splicing events. Some splicing events are enriched in specific breast cancer subtypes - for example, a TE-driven transcription start site upstream of ERBB2 in HER2 + tumors, and several TE-mediated splicing events are associated with patient survival and poor prognosis. The full-length sequences we capture with LR-seq reveal thousands of isoforms with signatures of RNA editing, including a novel isoform belonging to RHOA; a gene previously implicated in tumor progression. We utilize our full-length isoforms to discover polymorphic TE insertions that alter splicing and validate one of these events in breast cancer cell lines. Together, our results demonstrate the widespread effects of dysregulated TEs on breast cancer transcriptomes and highlight the advantages of long-read isoform sequencing for understanding TE biology. TE-derived isoforms may alter the expression of genes important in cancer and can potentially be used as novel, disease-specific therapeutic targets or biomarkers.One sentence summary: Transposable elements generate alternative isoforms and alter post-transcriptional regulation in human breast cancer.</p>","PeriodicalId":18854,"journal":{"name":"Mobile DNA","volume":"16 1","pages":"6"},"PeriodicalIF":3.1,"publicationDate":"2025-02-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11846448/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143476766","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A family of Tn7-like transposons evolved to target CRISPR repeats. 一个tn7样转座子家族进化为靶向CRISPR重复序列。
IF 4.7 2区 生物学 Q1 GENETICS & HEREDITY Pub Date : 2025-02-18 DOI: 10.1186/s13100-025-00344-1
Laura Chacon Machado, Joseph E Peters

Tn7 family transposons are mobile genetic elements known for precise target site selection, with some co-opting CRISPR-Cas systems for RNA-guided transposition. We identified a novel group of Tn7-like transposons in Cyanobacteria that preferentially target CRISPR arrays, suggesting a new functional interaction between these elements and CRISPR-Cas systems. Using bioinformatics tools, we characterized their phylogeny, target specificity, and sub-specialization. The array-targeting elements are phylogenetically close to tRNA-targeting elements. The distinct target preference coincides with loss of a C-terminal region in the TnsD protein which is responsible for recognizing target sites when compared to closely related elements. Notably, elements are found integrated into a fixed position within CRISPR spacer regions, a behavior that might minimize negative impacts on the host defense system. These transposons were identified in both plasmid and genomic CRISPR arrays, indicating that their preferred target provides a means for both safe insertion in the host chromosome and a mechanism for dissemination. Attempts to reconstitute these elements in E. coli were unsuccessful, indicating possible dependence on native host factors. Our findings expand the diversity of interactions between Tn7-like transposons and CRISPR systems.

Tn7家族转座子是一种可移动的遗传元件,以精确的目标位点选择而闻名,一些转座子采用CRISPR-Cas系统进行rna引导转座子。我们在蓝藻细菌中发现了一组新的tn7样转座子,它们优先靶向CRISPR阵列,这表明这些元件与CRISPR- cas系统之间存在新的功能相互作用。利用生物信息学工具,我们对它们的系统发育、靶点特异性和亚特化进行了表征。阵列靶向元件在系统发育上接近trna靶向元件。这种明显的靶标偏好与TnsD蛋白中c端区域的缺失相吻合,该区域与密切相关的元件相比负责识别靶标位点。值得注意的是,发现元件被整合到CRISPR间隔区域的固定位置,这种行为可能会最大限度地减少对宿主防御系统的负面影响。这些转座子在质粒和基因组CRISPR阵列中都被鉴定出来,表明它们的首选靶标提供了一种安全插入宿主染色体的手段和传播机制。试图在大肠杆菌中重建这些元素是不成功的,这表明可能依赖于天然宿主因子。我们的发现扩展了tn7样转座子和CRISPR系统之间相互作用的多样性。
{"title":"A family of Tn7-like transposons evolved to target CRISPR repeats.","authors":"Laura Chacon Machado, Joseph E Peters","doi":"10.1186/s13100-025-00344-1","DOIUrl":"10.1186/s13100-025-00344-1","url":null,"abstract":"<p><p>Tn7 family transposons are mobile genetic elements known for precise target site selection, with some co-opting CRISPR-Cas systems for RNA-guided transposition. We identified a novel group of Tn7-like transposons in Cyanobacteria that preferentially target CRISPR arrays, suggesting a new functional interaction between these elements and CRISPR-Cas systems. Using bioinformatics tools, we characterized their phylogeny, target specificity, and sub-specialization. The array-targeting elements are phylogenetically close to tRNA-targeting elements. The distinct target preference coincides with loss of a C-terminal region in the TnsD protein which is responsible for recognizing target sites when compared to closely related elements. Notably, elements are found integrated into a fixed position within CRISPR spacer regions, a behavior that might minimize negative impacts on the host defense system. These transposons were identified in both plasmid and genomic CRISPR arrays, indicating that their preferred target provides a means for both safe insertion in the host chromosome and a mechanism for dissemination. Attempts to reconstitute these elements in E. coli were unsuccessful, indicating possible dependence on native host factors. Our findings expand the diversity of interactions between Tn7-like transposons and CRISPR systems.</p>","PeriodicalId":18854,"journal":{"name":"Mobile DNA","volume":"16 1","pages":"5"},"PeriodicalIF":4.7,"publicationDate":"2025-02-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11837452/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143449492","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A genome-wide study of ruminants uncovers two endogenous retrovirus families recently active in goats. 一项反刍动物全基因组研究揭示了最近在山羊中活跃的两个内源性逆转录病毒家族。
IF 4.7 2区 生物学 Q1 GENETICS & HEREDITY Pub Date : 2025-02-17 DOI: 10.1186/s13100-024-00337-6
Marie Verneret, Caroline Leroux, Thomas Faraut, Vincent Navratil, Emmanuelle Lerat, Jocelyn Turpin

Background: Endogenous retroviruses (ERV) are traces of ancestral retroviral germline infections that constitute a significant portion of mammalian genomes and are classified as LTR-retrotransposons. The exploration of their dynamics and evolutionary history in ruminants remains limited, highlighting the need for a comprehensive and thorough investigation of the ERV landscape in the genomes of cattle, sheep and goat.

Results: Through a de novo bioinformatic analysis, we characterized 24 Class I and II ERV families across four reference assemblies of domestic and wild sheep and goats, and one assembly of cattle. Among these families, 13 are represented by consensus sequences identified in the five analyzed species, while eight are exclusive to small ruminants and three to cattle. The similarity-based approach used to search for the presence of these families in other ruminant species revealed multiple endogenization events over the last 40 million years and distinct evolutionary dynamics among species. The ERV annotation resulted in a high-resolution dataset of 100,534 ERV insertions across the five genomes, representing between 0.5 and 1% of their genomes. Solo-LTRs account for 83.2% of the annotated insertions demonstrating that most of the ERVs are relics of past events. Two Class II families showed higher abundance and copy conservation in small ruminants. One of them is closely related to circulating exogenous retroviruses and is represented by 22 copies sharing identical LTRs and 12 with complete coding capacities in the domestic goat.

Conclusions: Our results suggest the presence of two ERV families with recent transpositional activity in ruminant genomes, particularly in the domestic goat, illustrating distinct evolutionary dynamics among the analyzed species. This work highlights the ongoing influence of ERVs on genomic landscapes and call for further investigation of their evolutionary trajectories in these genomes.

背景:内源性逆转录病毒(ERV)是祖先逆转录病毒生殖系感染的痕迹,构成哺乳动物基因组的重要部分,被归类为ltr -逆转录转座子。在反刍动物中对其动态和进化历史的探索仍然有限,这突出表明需要对牛、绵羊和山羊基因组中的ERV景观进行全面和彻底的调查。结果:通过从头开始的生物信息学分析,我们在家养和野生绵羊和山羊的4个参考组合以及牛的1个组合中鉴定了24个I类和II类ERV家族。在这些科中,有13个是在5个分析物种中鉴定的一致序列,8个是小型反刍动物独有的,3个是牛特有的。基于相似性的方法用于在其他反刍动物物种中寻找这些科的存在,揭示了过去4000万年的多重内源性事件和物种之间独特的进化动态。ERV注释产生了一个高分辨率的数据集,其中包括5个基因组中的100,534个ERV插入,占其基因组的0.5%至1%。solo - ltr占注释插入的83.2%,表明大多数erv是过去事件的遗迹。2个II类科在小型反刍动物中表现出较高的丰度和拷贝保守性。其中一种与循环外源性逆转录病毒密切相关,在家山羊中有22个拷贝具有相同的ltr, 12个具有完整的编码能力。结论:我们的研究结果表明,在反刍动物基因组中,特别是在家养山羊中,存在两个具有最近转位活性的ERV家族,说明了所分析物种之间不同的进化动力学。这项工作强调了erv对基因组景观的持续影响,并呼吁进一步研究它们在这些基因组中的进化轨迹。
{"title":"A genome-wide study of ruminants uncovers two endogenous retrovirus families recently active in goats.","authors":"Marie Verneret, Caroline Leroux, Thomas Faraut, Vincent Navratil, Emmanuelle Lerat, Jocelyn Turpin","doi":"10.1186/s13100-024-00337-6","DOIUrl":"10.1186/s13100-024-00337-6","url":null,"abstract":"<p><strong>Background: </strong>Endogenous retroviruses (ERV) are traces of ancestral retroviral germline infections that constitute a significant portion of mammalian genomes and are classified as LTR-retrotransposons. The exploration of their dynamics and evolutionary history in ruminants remains limited, highlighting the need for a comprehensive and thorough investigation of the ERV landscape in the genomes of cattle, sheep and goat.</p><p><strong>Results: </strong>Through a de novo bioinformatic analysis, we characterized 24 Class I and II ERV families across four reference assemblies of domestic and wild sheep and goats, and one assembly of cattle. Among these families, 13 are represented by consensus sequences identified in the five analyzed species, while eight are exclusive to small ruminants and three to cattle. The similarity-based approach used to search for the presence of these families in other ruminant species revealed multiple endogenization events over the last 40 million years and distinct evolutionary dynamics among species. The ERV annotation resulted in a high-resolution dataset of 100,534 ERV insertions across the five genomes, representing between 0.5 and 1% of their genomes. Solo-LTRs account for 83.2% of the annotated insertions demonstrating that most of the ERVs are relics of past events. Two Class II families showed higher abundance and copy conservation in small ruminants. One of them is closely related to circulating exogenous retroviruses and is represented by 22 copies sharing identical LTRs and 12 with complete coding capacities in the domestic goat.</p><p><strong>Conclusions: </strong>Our results suggest the presence of two ERV families with recent transpositional activity in ruminant genomes, particularly in the domestic goat, illustrating distinct evolutionary dynamics among the analyzed species. This work highlights the ongoing influence of ERVs on genomic landscapes and call for further investigation of their evolutionary trajectories in these genomes.</p>","PeriodicalId":18854,"journal":{"name":"Mobile DNA","volume":"16 1","pages":"4"},"PeriodicalIF":4.7,"publicationDate":"2025-02-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11831830/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143440945","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Targeted detection of endogenous LINE-1 proteins and ORF2p interactions. 内源性LINE-1蛋白和ORF2p相互作用的靶向检测。
IF 4.7 2区 生物学 Q1 GENETICS & HEREDITY Pub Date : 2025-02-06 DOI: 10.1186/s13100-024-00339-4
Mathias I Nielsen, Justina C Wolters, Omar G Rosas Bringas, Hua Jiang, Luciano H Di Stefano, Mehrnoosh Oghbaie, Samira Hozeifi, Mats J Nitert, Alienke van Pijkeren, Marieke Smit, Lars Ter Morsche, Apostolos Mourtzinos, Vikram Deshpande, Martin S Taylor, Brian T Chait, John LaCava

Background: Both the expression and activities of LINE-1 (L1) retrotransposons are known to occur in numerous cell-types and are implicated in pathobiological contexts such as aging-related inflammation, autoimmunity, and in cancers. L1s encode two proteins that are translated from bicistronic transcripts. The translation product of ORF1 (ORF1p) has been robustly detected by immunoassays and shotgun mass spectrometry (MS). Yet, more sensitive detection methods would enhance the use of ORF1p as a clinical biomarker. In contrast, until now, no direct evidence of endogenous L1 ORF2 translation to protein (ORF2p) has been shown. Instead, assays for ORF2p have been limited to ectopic L1 ORF over-expression contexts and to indirect detection of endogenous ORF2p enzymatic activity, such as by the sequencing of de novo genomic insertions. Immunoassays for endogenous ORF2p have been problematic, producing apparent false positives due to cross-reactivities, and shotgun MS has not yielded reliable evidence of ORF2p peptides in biological samples.

Results: Here we present targeted mass spectrometry assays, selected and parallel reaction monitoring (SRM and PRM, respectively) to detect and quantify L1 ORF1p and ORF2p at their endogenous abundances. We were able to quantify ORF1p and ORF2p present in our samples down to a range in the low attomoles. Confident in our ability to affinity enrich ORF2p, we describe an interactome associated with endogenous ORF2-containing macromolecular assemblies.

Conclusions: This is the first assay to demonstrate sensitive and robust quantitation of endogenous ORF2p. The ability to assay ORF2p directly and quantitatively will improve our understanding of the developmental and diseased cell states where L1 expression and its activity naturally occur. The ability to simultaneously assay endogenous L1 ORF1p and ORF2p is an important step forward for L1 analytical biochemistry. Endogenous ORF2p interactomes can now be presented with confidence that ORF2p is among the enriched proteins.

背景:已知LINE-1 (L1)逆转录转座子的表达和活性发生在许多细胞类型中,并与病理生物学背景有关,如衰老相关炎症、自身免疫和癌症。l1编码两种由双链转录本翻译而来的蛋白质。ORF1的翻译产物(ORF1p)已被免疫分析和霰弹枪质谱(MS)检测到。然而,更灵敏的检测方法将增强ORF1p作为临床生物标志物的使用。相比之下,到目前为止,还没有直接证据表明内源性L1 ORF2翻译成蛋白质(ORF2p)。相反,ORF2p的检测仅限于异位L1 ORF过表达背景和间接检测内源性ORF2p酶活性,例如通过从头基因组插入测序。内源性ORF2p的免疫分析一直存在问题,由于交叉反应产生明显的假阳性,霰弹枪质谱法也没有在生物样品中产生ORF2p肽的可靠证据。结果:本研究提出了靶向质谱分析,选择和平行反应监测(分别为SRM和PRM)来检测和量化L1 ORF1p和ORF2p的内源性丰度。我们能够将样本中的ORF1p和ORF2p量化到低原子的范围内。我们对丰富ORF2p亲和力的能力充满信心,我们描述了一个与内源性orf2大分子组装相关的相互作用组。结论:这是首次对内源性ORF2p进行灵敏、可靠的定量分析。直接定量分析ORF2p的能力将提高我们对L1表达及其活性自然发生的发育和病变细胞状态的理解。同时检测内源性L1 ORF1p和ORF2p的能力是L1分析生物化学向前迈出的重要一步。内源性ORF2p相互作用组现在可以确信ORF2p位于富集蛋白中。
{"title":"Targeted detection of endogenous LINE-1 proteins and ORF2p interactions.","authors":"Mathias I Nielsen, Justina C Wolters, Omar G Rosas Bringas, Hua Jiang, Luciano H Di Stefano, Mehrnoosh Oghbaie, Samira Hozeifi, Mats J Nitert, Alienke van Pijkeren, Marieke Smit, Lars Ter Morsche, Apostolos Mourtzinos, Vikram Deshpande, Martin S Taylor, Brian T Chait, John LaCava","doi":"10.1186/s13100-024-00339-4","DOIUrl":"10.1186/s13100-024-00339-4","url":null,"abstract":"<p><strong>Background: </strong>Both the expression and activities of LINE-1 (L1) retrotransposons are known to occur in numerous cell-types and are implicated in pathobiological contexts such as aging-related inflammation, autoimmunity, and in cancers. L1s encode two proteins that are translated from bicistronic transcripts. The translation product of ORF1 (ORF1p) has been robustly detected by immunoassays and shotgun mass spectrometry (MS). Yet, more sensitive detection methods would enhance the use of ORF1p as a clinical biomarker. In contrast, until now, no direct evidence of endogenous L1 ORF2 translation to protein (ORF2p) has been shown. Instead, assays for ORF2p have been limited to ectopic L1 ORF over-expression contexts and to indirect detection of endogenous ORF2p enzymatic activity, such as by the sequencing of de novo genomic insertions. Immunoassays for endogenous ORF2p have been problematic, producing apparent false positives due to cross-reactivities, and shotgun MS has not yielded reliable evidence of ORF2p peptides in biological samples.</p><p><strong>Results: </strong>Here we present targeted mass spectrometry assays, selected and parallel reaction monitoring (SRM and PRM, respectively) to detect and quantify L1 ORF1p and ORF2p at their endogenous abundances. We were able to quantify ORF1p and ORF2p present in our samples down to a range in the low attomoles. Confident in our ability to affinity enrich ORF2p, we describe an interactome associated with endogenous ORF2-containing macromolecular assemblies.</p><p><strong>Conclusions: </strong>This is the first assay to demonstrate sensitive and robust quantitation of endogenous ORF2p. The ability to assay ORF2p directly and quantitatively will improve our understanding of the developmental and diseased cell states where L1 expression and its activity naturally occur. The ability to simultaneously assay endogenous L1 ORF1p and ORF2p is an important step forward for L1 analytical biochemistry. Endogenous ORF2p interactomes can now be presented with confidence that ORF2p is among the enriched proteins.</p>","PeriodicalId":18854,"journal":{"name":"Mobile DNA","volume":"16 1","pages":"3"},"PeriodicalIF":4.7,"publicationDate":"2025-02-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11800616/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143365220","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
FishPi: a bioinformatic prediction tool to link piRNA and transposable elements. FishPi:连接piRNA和转座因子的生物信息学预测工具。
IF 4.7 2区 生物学 Q1 GENETICS & HEREDITY Pub Date : 2025-01-28 DOI: 10.1186/s13100-025-00342-3
Alice M Godden, Benjamin Rix, Simone Immler

Background: Piwi-interacting RNAs (piRNA)s are non-coding small RNAs that post-transcriptionally affect gene expression and regulation. Through complementary seed region binding with transposable elements (TEs), piRNAs protect the genome from transposition. A tool to link piRNAs with complementary TE targets will improve our understanding of the role of piRNAs in genome maintenance and gene regulation. Existing tools such as TEsmall can process sRNA-seq datasets to produce differentially expressed piRNAs, and piRScan developed for nematodes can link piRNAs and TEs but it requires knowledge about the target region of interest and works backwards.

Results: We developed FishPi to predict the pairings between piRNA and TEs for available genomes from zebrafish, medaka and tilapia, with full user customisation of parameters including orientation of piRNA, mismatches in the piRNA seed binding to TE and scored output lists of piRNA-TE matches. FishPi works with individual piRNAs or a list of piRNA sequences in fasta format. The software focuses on the piRNA-TE seed region and analyses reference TEs for piRNA complementarity. TE type is examined, counted and stored to a dictionary, with genomic loci recorded. Any updates to piRNA-TE binding rules can easily be incorporated by changing the seed-region options in the graphic user-interface. FishPi provides a graphic interface using tkinter for the user to input piRNA sequences to generate comprehensive reports on piRNA-TE interactions. FishPi can easily be adapted to genomes from other species and taxa opening the interpretation of piRNA functionality to a wide community.

Conclusions: Users will gain insight into genome mobility and FishPi will help further our understanding of the biological role of piRNAs and their interaction with TEs in a similar way that public databases have improved the access to and the understanding of the role of small RNAs.

背景:piwi相互作用rna (piRNA)是一种非编码小rna,其转录后影响基因表达和调控。通过与转座因子(te)的互补种子区结合,pirna保护基因组免受转座。将pirna与互补TE靶点联系起来的工具将提高我们对pirna在基因组维持和基因调控中的作用的理解。现有的工具如TEsmall可以处理sRNA-seq数据集以产生差异表达的pirna,而为线虫开发的piRScan可以将pirna和te连接起来,但它需要了解感兴趣的目标区域并向后工作。结果:我们开发了FishPi来预测来自斑马鱼、medaka和罗非鱼的可用基因组的piRNA和TE之间的配对,用户可以完全定制参数,包括piRNA的方向、piRNA种子与TE结合的不匹配以及piRNA-TE匹配的评分输出列表。FishPi与单个piRNA或快速格式的piRNA序列列表一起工作。该软件侧重于piRNA- te种子区,并分析了piRNA互补性的参考te。TE型被检查、计数并存储到字典中,并记录基因组位点。piRNA-TE绑定规则的任何更新都可以通过更改图形用户界面中的种子区域选项轻松合并。FishPi提供了一个图形界面,用户可以使用tkinter输入piRNA序列,生成piRNA- te相互作用的综合报告。FishPi可以很容易地适应其他物种和分类群的基因组,为更广泛的社区打开了piRNA功能的解释。结论:用户将深入了解基因组迁移,FishPi将帮助我们进一步了解pirna的生物学作用及其与te的相互作用,就像公共数据库提高了对小rna作用的获取和理解一样。
{"title":"FishPi: a bioinformatic prediction tool to link piRNA and transposable elements.","authors":"Alice M Godden, Benjamin Rix, Simone Immler","doi":"10.1186/s13100-025-00342-3","DOIUrl":"10.1186/s13100-025-00342-3","url":null,"abstract":"<p><strong>Background: </strong>Piwi-interacting RNAs (piRNA)s are non-coding small RNAs that post-transcriptionally affect gene expression and regulation. Through complementary seed region binding with transposable elements (TEs), piRNAs protect the genome from transposition. A tool to link piRNAs with complementary TE targets will improve our understanding of the role of piRNAs in genome maintenance and gene regulation. Existing tools such as TEsmall can process sRNA-seq datasets to produce differentially expressed piRNAs, and piRScan developed for nematodes can link piRNAs and TEs but it requires knowledge about the target region of interest and works backwards.</p><p><strong>Results: </strong>We developed FishPi to predict the pairings between piRNA and TEs for available genomes from zebrafish, medaka and tilapia, with full user customisation of parameters including orientation of piRNA, mismatches in the piRNA seed binding to TE and scored output lists of piRNA-TE matches. FishPi works with individual piRNAs or a list of piRNA sequences in fasta format. The software focuses on the piRNA-TE seed region and analyses reference TEs for piRNA complementarity. TE type is examined, counted and stored to a dictionary, with genomic loci recorded. Any updates to piRNA-TE binding rules can easily be incorporated by changing the seed-region options in the graphic user-interface. FishPi provides a graphic interface using tkinter for the user to input piRNA sequences to generate comprehensive reports on piRNA-TE interactions. FishPi can easily be adapted to genomes from other species and taxa opening the interpretation of piRNA functionality to a wide community.</p><p><strong>Conclusions: </strong>Users will gain insight into genome mobility and FishPi will help further our understanding of the biological role of piRNAs and their interaction with TEs in a similar way that public databases have improved the access to and the understanding of the role of small RNAs.</p>","PeriodicalId":18854,"journal":{"name":"Mobile DNA","volume":"16 1","pages":"2"},"PeriodicalIF":4.7,"publicationDate":"2025-01-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11773700/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143053085","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
SINE retrotransposons import polyadenylation signals to 3'UTRs in dog (Canis familiaris). 犬(Canis familiaris)的sin反转录转座子向3' utr输入聚腺苷化信号。
IF 4.7 2区 生物学 Q1 GENETICS & HEREDITY Pub Date : 2025-01-04 DOI: 10.1186/s13100-024-00338-5
Jessica D Choi, Lelani A Del Pinto, Nathan B Sutter

Background: Messenger RNA 3' untranslated regions (3'UTRs) control many aspects of gene expression and determine where the transcript will terminate. The polyadenylation signal (PAS) AAUAAA (AATAAA in DNA) is a key regulator of transcript termination and this hexamer, or a similar sequence, is very frequently found within 30 bp of 3'UTR ends. Short interspersed element (SINE) retrotransposons are found throughout genomes in high copy numbers. When inserted into genes they can disrupt expression, alter splicing, or cause nuclear retention of mRNAs. The genomes of the domestic dog and other carnivores carry hundreds of thousands of Can-SINEs, a tRNA-related SINE with transcription termination potential. Because of this we asked whether Can-SINEs may terminate transcript in some dog genes.

Results: Each of the dog's nine Can-SINE consensus sequences carry an average of three AATAAA PASs on their sense strands but zero on their antisense strands. Consistent with the idea that Can-SINEs can terminate transcripts, we find that sense-oriented Can-SINEs are approximately ten times more frequent at 3' ends of 3'UTRs compared to further upstream within 3'UTRs. Furthermore, the count of AATAAA PASs on head-to-tail SINE sequences differs significantly between sense and antisense-oriented retrotransposons in transcripts. Can-SINEs near 3'UTR ends are likely to carry an AATAAA motif on the mRNA sense strand while those further upstream are not. We identified loci where Can-SINE insertion has truncated or altered a 3'UTR of the dog genome (dog 3'UTR) compared to the human ortholog. Dog 3'UTRs have peaks of AATAAA PAS frequency at 28, 32, and 36 bp from the end. The periodicity is partly explained by TAAA(n) repeats within Can-SINE AT-rich tails. We annotated all repeat-masked Can-SINE copies in the Boxer reference genome and found that the young SINEC_Cf type has a mode of 15 bp length for target site duplications (TSDs). All dog Can-SINE types favor integration at TSDs beginning with A(4).

Conclusion: Dog Can-SINE retrotransposition has imported AATAAA PASs into gene transcripts and led to alteration of 3'UTRs. AATAAA sequences are selectively removed from Can-SINEs in introns and upstream 3'UTR regions but are retained at the far downstream end of 3'UTRs, which we infer reflects their role as termination sequences for these transcripts.

背景:信使RNA 3‘非翻译区(3’ utr)控制着基因表达的许多方面,并决定转录物终止的位置。多聚腺苷化信号(PAS) AAUAAA (DNA中的AATAAA)是转录终止的关键调节因子,该六聚体或类似序列经常在3'UTR末端30 bp内发现。短穿插元件(sin)逆转录转座子以高拷贝数存在于整个基因组中。当插入基因时,它们可以破坏表达,改变剪接,或导致mrna的核保留。家犬和其他食肉动物的基因组携带数十万个can - sin,这是一种与trna相关的具有转录终止潜力的sin。因此,我们询问can - sin是否可以终止某些狗基因的转录。结果:每只狗的9个Can-SINE一致序列在它们的义链上平均携带3个AATAAA PASs,但在它们的反义链上平均携带0个。与can - sin可以终止转录本的想法一致,我们发现在3‘ utr的3’末端,与在3' utr的上游相比,面向感官的can - sin的频率大约是10倍。此外,在转录本中正反义转座子和反义转座子之间,从头到尾的SINE序列上的AATAAA PASs数存在显著差异。靠近3'UTR末端的can - sin可能在mRNA意义链上携带AATAAA基序,而更上游的则不携带。我们确定了与人类同源基因相比,Can-SINE插入截断或改变狗基因组3'UTR的位点(狗3'UTR)。狗的3' utr在距离末端28,32和36bp处有AATAAA PAS频率峰值。这种周期性的部分原因是在富含can - sin at的尾部出现TAAA(n)重复。我们在Boxer参考基因组中注释了所有重复屏蔽的Can-SINE拷贝,发现年轻的SINEC_Cf型具有15 bp长度的靶位点复制(TSDs)模式。所有犬犬Can-SINE类型都倾向于在以A开头的tsd进行整合(4)。结论:犬Can-SINE逆转录将AATAAA PASs导入基因转录本,导致3' utr的改变。AATAAA序列被选择性地从内含子和上游3'UTR区域的Can-SINEs中移除,但保留在3'UTR的远下游端,我们推断这反映了它们作为这些转录本的终止序列的作用。
{"title":"SINE retrotransposons import polyadenylation signals to 3'UTRs in dog (Canis familiaris).","authors":"Jessica D Choi, Lelani A Del Pinto, Nathan B Sutter","doi":"10.1186/s13100-024-00338-5","DOIUrl":"https://doi.org/10.1186/s13100-024-00338-5","url":null,"abstract":"<p><strong>Background: </strong>Messenger RNA 3' untranslated regions (3'UTRs) control many aspects of gene expression and determine where the transcript will terminate. The polyadenylation signal (PAS) AAUAAA (AATAAA in DNA) is a key regulator of transcript termination and this hexamer, or a similar sequence, is very frequently found within 30 bp of 3'UTR ends. Short interspersed element (SINE) retrotransposons are found throughout genomes in high copy numbers. When inserted into genes they can disrupt expression, alter splicing, or cause nuclear retention of mRNAs. The genomes of the domestic dog and other carnivores carry hundreds of thousands of Can-SINEs, a tRNA-related SINE with transcription termination potential. Because of this we asked whether Can-SINEs may terminate transcript in some dog genes.</p><p><strong>Results: </strong>Each of the dog's nine Can-SINE consensus sequences carry an average of three AATAAA PASs on their sense strands but zero on their antisense strands. Consistent with the idea that Can-SINEs can terminate transcripts, we find that sense-oriented Can-SINEs are approximately ten times more frequent at 3' ends of 3'UTRs compared to further upstream within 3'UTRs. Furthermore, the count of AATAAA PASs on head-to-tail SINE sequences differs significantly between sense and antisense-oriented retrotransposons in transcripts. Can-SINEs near 3'UTR ends are likely to carry an AATAAA motif on the mRNA sense strand while those further upstream are not. We identified loci where Can-SINE insertion has truncated or altered a 3'UTR of the dog genome (dog 3'UTR) compared to the human ortholog. Dog 3'UTRs have peaks of AATAAA PAS frequency at 28, 32, and 36 bp from the end. The periodicity is partly explained by TAAA(n) repeats within Can-SINE AT-rich tails. We annotated all repeat-masked Can-SINE copies in the Boxer reference genome and found that the young SINEC_Cf type has a mode of 15 bp length for target site duplications (TSDs). All dog Can-SINE types favor integration at TSDs beginning with A(4).</p><p><strong>Conclusion: </strong>Dog Can-SINE retrotransposition has imported AATAAA PASs into gene transcripts and led to alteration of 3'UTRs. AATAAA sequences are selectively removed from Can-SINEs in introns and upstream 3'UTR regions but are retained at the far downstream end of 3'UTRs, which we infer reflects their role as termination sequences for these transcripts.</p>","PeriodicalId":18854,"journal":{"name":"Mobile DNA","volume":"16 1","pages":"1"},"PeriodicalIF":4.7,"publicationDate":"2025-01-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142927290","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Mobile DNA
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1