首页 > 最新文献

RNA最新文献

英文 中文
Complementary DNA oligonucleotide direct in-gel quantification (cDINGQ) for precise tRNA fragment analysis. 互补DNA寡核苷酸直接凝胶定量(cDINGQ)用于精确的tRNA片段分析。
IF 5 3区 生物学 Q2 BIOCHEMISTRY & MOLECULAR BIOLOGY
RNA
Pub Date : 2026-01-29 DOI: 10.1261/rna.080749.125
Marko Joerg, Marco Kristen, Lukas Walz, Christine Lietz, Max Mueller, Sebastian Nathal, Virginie Marchand, Yuri Motorin, Marie-Luise Winz, Mark Helm, Kristina Friedland

tRNA-derived fragments have emerged as critical regulators in various biological processes, but reliable methods for their quantification remain a challenge due to their small size and extensive RNA modifications. In this study, we present the newly developed Complementary DNA Oligonucleotide Direct In-Gel Quantification (cDINGQ) method for tRF analysis and compare it with traditional radioactive [32P] Northern blotting, non-radioactive approaches, and high-throughput Illumina sequencing under different experimental conditions. The cDINGQ method, utilizing Cy5-labeled hybridization probes, offers high specificity and sensitivity for detecting tRFs with significantly reduced processing time and costs. By applying these techniques to an Alzheimer's disease (AD) cell model, we demonstrate the reliability of these methods in detecting subtle variations in tRF abundance. Our findings highlight the sensitivity, specificity, and applicability of each method, addressing limitations such as RNA input requirements and probe hybridization conditions. The study further explores the utility of these methods for detecting tRFs in various biological contexts, emphasizing their potential for future research and biomarker discovery in disease-related studies.

trna衍生的片段已成为各种生物过程中的关键调节因子,但由于其小尺寸和广泛的RNA修饰,可靠的定量方法仍然是一个挑战。在本研究中,我们提出了新开发的用于tRF分析的互补DNA寡核苷酸直接凝胶定量(cDINGQ)方法,并在不同的实验条件下将其与传统的放射性[32P] Northern blotting、非放射性方法和高通量Illumina测序进行了比较。cDINGQ方法利用cy5标记的杂交探针,在检测tRFs方面具有很高的特异性和敏感性,显著降低了处理时间和成本。通过将这些技术应用于阿尔茨海默病(AD)细胞模型,我们证明了这些方法在检测tRF丰度的细微变化方面的可靠性。我们的研究结果突出了每种方法的敏感性、特异性和适用性,解决了RNA输入要求和探针杂交条件等局限性。该研究进一步探讨了这些方法在各种生物学背景下检测tRFs的效用,强调了它们在未来研究和疾病相关研究中发现生物标志物的潜力。
{"title":"Complementary DNA oligonucleotide direct in-gel quantification (cDINGQ) for precise tRNA fragment analysis.","authors":"Marko Joerg, Marco Kristen, Lukas Walz, Christine Lietz, Max Mueller, Sebastian Nathal, Virginie Marchand, Yuri Motorin, Marie-Luise Winz, Mark Helm, Kristina Friedland","doi":"10.1261/rna.080749.125","DOIUrl":"https://doi.org/10.1261/rna.080749.125","url":null,"abstract":"<p><p>tRNA-derived fragments have emerged as critical regulators in various biological processes, but reliable methods for their quantification remain a challenge due to their small size and extensive RNA modifications. In this study, we present the newly developed Complementary DNA Oligonucleotide Direct In-Gel Quantification (cDINGQ) method for tRF analysis and compare it with traditional radioactive [<sup>32</sup>P] Northern blotting, non-radioactive approaches, and high-throughput Illumina sequencing under different experimental conditions. The cDINGQ method, utilizing Cy5-labeled hybridization probes, offers high specificity and sensitivity for detecting tRFs with significantly reduced processing time and costs. By applying these techniques to an Alzheimer's disease (AD) cell model, we demonstrate the reliability of these methods in detecting subtle variations in tRF abundance. Our findings highlight the sensitivity, specificity, and applicability of each method, addressing limitations such as RNA input requirements and probe hybridization conditions. The study further explores the utility of these methods for detecting tRFs in various biological contexts, emphasizing their potential for future research and biomarker discovery in disease-related studies.</p>","PeriodicalId":21401,"journal":{"name":"RNA","volume":" ","pages":""},"PeriodicalIF":5.0,"publicationDate":"2026-01-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146087019","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Structure-informed mutagenesis identifies combinatorial contributions to mouse insulin receptor IRES function. 结构信息突变确定了小鼠胰岛素受体IRES功能的组合贡献。
IF 5 3区 生物学 Q2 BIOCHEMISTRY & MOLECULAR BIOLOGY
RNA
Pub Date : 2026-01-28 DOI: 10.1261/rna.080775.125
William B Dahl, Tammy C T Lan, Silvi Rouskin, Michael T Marr

Cells under stress shift their proteome by repressing cap-dependent translation initiation. RNA elements called internal ribosome entry sites (IRES) can allow key cellular transcripts to remain efficiently translated to support an effective stress response. We previously determined that the 5' untranslated region (5'UTR) of the insulin receptor mRNA possesses a capacity for IRES activity that is conserved from insects to mammals. Well-characterized IRESes depend on RNA structures that reduce the protein requirements for translation initiation, thus circumventing translation inhibition. While there are several examples of viral IRES structures solved in vitro, the RNA secondary structures of cellular IRESes remain elusive and little information exists about the secondary structures of these RNAs in vivo. Here we probe the secondary structure of the Insr 5'UTR IRES along with two well-studied viral IRESes from hepatitis C virus and encephalomyocarditis virus using dimethyl sulfate mutational profiling by sequencing (DMS-MaPseq) in vitro and in cells. We find that the structures of viral IRESes in a cellular environment are largely consistent with their known in vitro structures. Using DMS-MaPseq probing as a constraint, we generated a model of the RNA secondary structure of the mouse insulin receptor 5'UTR. With this model as a guide, we employed a mutation strategy which allowed us to identify a conserved segment of RNA, distal from the translation start codon, that is critical for Insr IRES function. This knowledge informed the design of a minimal IRES element with equivalent activity to the full-length Insr 5'UTR across translation contexts.

应激下的细胞通过抑制帽依赖的翻译起始来改变其蛋白质组。称为内部核糖体进入位点(IRES)的RNA元件可以使关键的细胞转录物保持有效的翻译,以支持有效的应激反应。我们先前确定胰岛素受体mRNA的5‘非翻译区(5’ utr)具有从昆虫到哺乳动物的IRES活性能力。表征良好的IRESes依赖于降低翻译起始所需蛋白质的RNA结构,从而绕过翻译抑制。虽然在体外已经解决了几个病毒IRES结构的例子,但细胞IRES的RNA二级结构仍然难以捉摸,并且关于这些RNA在体内的二级结构的信息很少。在这里,我们利用硫酸二甲酯突变谱测序技术(DMS-MaPseq)在体外和细胞中探测了Insr 5'UTR IRES以及丙型肝炎病毒和脑心肌炎病毒的两种已被广泛研究的病毒IRES的二级结构。我们发现病毒IRESes在细胞环境中的结构与体外已知的结构在很大程度上是一致的。使用DMS-MaPseq探针作为约束,我们生成了小鼠胰岛素受体5'UTR的RNA二级结构模型。以该模型为指导,我们采用了一种突变策略,该策略使我们能够识别出距离翻译起始密码子远端的RNA保守片段,这对Insr IRES功能至关重要。这些知识为最小IRES元件的设计提供了依据,该元件在翻译环境中具有与全长Insr 5'UTR等效的活性。
{"title":"Structure-informed mutagenesis identifies combinatorial contributions to mouse insulin receptor IRES function.","authors":"William B Dahl, Tammy C T Lan, Silvi Rouskin, Michael T Marr","doi":"10.1261/rna.080775.125","DOIUrl":"10.1261/rna.080775.125","url":null,"abstract":"<p><p>Cells under stress shift their proteome by repressing cap-dependent translation initiation. RNA elements called internal ribosome entry sites (IRES) can allow key cellular transcripts to remain efficiently translated to support an effective stress response. We previously determined that the 5' untranslated region (5'UTR) of the insulin receptor mRNA possesses a capacity for IRES activity that is conserved from insects to mammals. Well-characterized IRESes depend on RNA structures that reduce the protein requirements for translation initiation, thus circumventing translation inhibition. While there are several examples of viral IRES structures solved in vitro, the RNA secondary structures of cellular IRESes remain elusive and little information exists about the secondary structures of these RNAs in vivo. Here we probe the secondary structure of the Insr 5'UTR IRES along with two well-studied viral IRESes from hepatitis C virus and encephalomyocarditis virus using dimethyl sulfate mutational profiling by sequencing (DMS-MaPseq) in vitro and in cells. We find that the structures of viral IRESes in a cellular environment are largely consistent with their known in vitro structures. Using DMS-MaPseq probing as a constraint, we generated a model of the RNA secondary structure of the mouse insulin receptor 5'UTR. With this model as a guide, we employed a mutation strategy which allowed us to identify a conserved segment of RNA, distal from the translation start codon, that is critical for Insr IRES function. This knowledge informed the design of a minimal IRES element with equivalent activity to the full-length Insr 5'UTR across translation contexts.</p>","PeriodicalId":21401,"journal":{"name":"RNA","volume":" ","pages":""},"PeriodicalIF":5.0,"publicationDate":"2026-01-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146053775","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
MiSiPi.Rna: an integrated tool for characterizing small regulatory RNA processing. MiSiPi。Rna:用于表征小调控Rna加工的集成工具。
IF 5 3区 生物学 Q2 BIOCHEMISTRY & MOLECULAR BIOLOGY
RNA
Pub Date : 2026-01-26 DOI: 10.1261/rna.080864.125
Taiya Jarva, Chris Baugh, Jialin Zhang, Eric Lai, Alex Flynt

Argonaute proteins mediate gene silencing via small regulatory RNAs that are generated by distinctive biogenesis pathways. In animals, three main classes are recognized: ~21-24 nucleotide (nt) microRNAs (miRNAs), ~21-24 nt small-interfering RNAs (siRNAs) and ~24-32 nt Piwi-interacting RNAs (piRNAs). Mechanistic understanding of these pathways was gained from genetic, biochemical and genomic studies in a handful of model systems, where key ribonucleolytic events were identified that specify stereotyped positioning of small RNAs relative to their precursor transcripts. With burgeoning availability of assembled genomes and small RNA data, there are abundant opportunities to characterize the diversity of small RNAs across non-model organisms. While several tools are well-suited to analyze specific small RNA pathways, an integrated package that can help classify and interpret all three major classes of small RNAs is wanting. To address this need, we developed a simple and efficient R package (MiSiPi.Rna) that can generate a variety of plots and statistics for pre-selected loci, which enable the characterization of diverse biogenesis features of miRNAs, siRNAs and piRNAs. MiSiPi.Rna requires minimal computational expertise to run, and will facilitate efforts to annotate and analyze the major classes of Argonaute-based small regulatory RNAs in arbitrary species of choice.

Argonaute蛋白通过不同生物发生途径产生的小调控rna介导基因沉默。在动物中,可识别出三大类:~21-24核苷酸(nt) microRNAs (miRNAs)、~21-24 nt小干扰rna (sirna)和~24-32 nt piwi相互作用rna (piRNAs)。对这些途径的机制理解是通过对少数模型系统的遗传、生化和基因组研究获得的,其中确定了关键的核糖核溶解事件,这些事件指定了小rna相对于其前体转录物的定型定位。随着组装基因组和小RNA数据的迅速发展,有大量的机会来表征非模式生物中小RNA的多样性。虽然有几种工具非常适合分析特定的小RNA途径,但需要一种能够帮助分类和解释所有三大类小RNA的集成包。为了满足这一需求,我们开发了一个简单高效的R包(MiSiPi.Rna),可以为预选位点生成各种图和统计数据,从而能够表征mirna, sirna和pirna的各种生物发生特征。MiSiPi。Rna需要最少的计算专业知识来运行,并且将有助于在任意选择的物种中注释和分析基于argonaute的小调控Rna的主要类别。
{"title":"MiSiPi.Rna: an integrated tool for characterizing small regulatory RNA processing.","authors":"Taiya Jarva, Chris Baugh, Jialin Zhang, Eric Lai, Alex Flynt","doi":"10.1261/rna.080864.125","DOIUrl":"10.1261/rna.080864.125","url":null,"abstract":"<p><p>Argonaute proteins mediate gene silencing via small regulatory RNAs that are generated by distinctive biogenesis pathways. In animals, three main classes are recognized: ~21-24 nucleotide (nt) microRNAs (miRNAs), ~21-24 nt small-interfering RNAs (siRNAs) and ~24-32 nt Piwi-interacting RNAs (piRNAs). Mechanistic understanding of these pathways was gained from genetic, biochemical and genomic studies in a handful of model systems, where key ribonucleolytic events were identified that specify stereotyped positioning of small RNAs relative to their precursor transcripts. With burgeoning availability of assembled genomes and small RNA data, there are abundant opportunities to characterize the diversity of small RNAs across non-model organisms. While several tools are well-suited to analyze specific small RNA pathways, an integrated package that can help classify and interpret all three major classes of small RNAs is wanting. To address this need, we developed a simple and efficient R package (MiSiPi.Rna) that can generate a variety of plots and statistics for pre-selected loci, which enable the characterization of diverse biogenesis features of miRNAs, siRNAs and piRNAs. MiSiPi.Rna requires minimal computational expertise to run, and will facilitate efforts to annotate and analyze the major classes of Argonaute-based small regulatory RNAs in arbitrary species of choice.</p>","PeriodicalId":21401,"journal":{"name":"RNA","volume":" ","pages":""},"PeriodicalIF":5.0,"publicationDate":"2026-01-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146053734","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Identification of 3D motifs in Rfam with JAR3D. 用JAR3D识别Rfam中的三维图案。
IF 5 3区 生物学 Q2 BIOCHEMISTRY & MOLECULAR BIOLOGY
RNA
Pub Date : 2026-01-26 DOI: 10.1261/rna.080764.125
James E Roll, Craig L Zirbel

Many non-protein-coding RNAs have been discovered and more are being discovered each year. At first we know them only by their sequences in a few organisms, but to understand their function and interactions, we need to understand what 3D structures they may form, in whole or in part. Many hairpin and internal loops are known to form structured 3D motifs, many of which are recurrent across different non-coding RNAs, for example, kink turn and sarcin-ricin internal loops, and GNRA, UNCG, and T-loop hairpin loops. As such, a new non-protein-coding RNA may well have one or more known structured 3D loop motifs. The goal of this paper is to introduce a tool which can identify loops in Rfam seed alignments that match well to known 3D loop motifs, and which makes those identifications easily accessible. JAR3D was developed to map sequences of hairpin and internal loops to known 3D motifs, and was extended for this work to 3-way and 4-way junction motifs. We applied JAR3D to 4,166 Rfam seed alignments from Rfam 15.0 and made the results accessible on the JAR3D web page, which makes it easy to inspect and evaluate the possible matches for each loop in each Rfam family. We provide several examples which validate JAR3D's ability to identify the correct loop motif, using 3D structures of RNAs outside of the training set. We created a new page to search for instances of a particular loop motif across all Rfam families, to facilitate studies of how widespread the occurrence of each motif is. We provide statistics on how many Rfam loops appear to match well to a known 3D motif. Match rates are much higher for internal loops than for hairpins or multi-helix junctions.

许多非蛋白编码rna已经被发现,而且每年都有更多的rna被发现。起初,我们只通过它们在少数生物体中的序列来了解它们,但为了了解它们的功能和相互作用,我们需要了解它们可能形成的整体或部分3D结构。已知许多发夹环和内环形成结构化的3D基序,其中许多在不同的非编码rna中反复出现,例如,扭结转和sarcin-ricin内环,以及GNRA, UNCG和t -环发夹环。因此,新的非蛋白编码RNA很可能具有一个或多个已知的结构三维环基序。本文的目标是介绍一种工具,该工具可以识别Rfam种子比对中与已知的3D环路motif匹配良好的环路,并使这些识别易于访问。开发了JAR3D来将发夹和内部环序列映射到已知的3D motif,并为此工作扩展到3-way和4-way结motif。我们将JAR3D应用于Rfam 15.0中的4,166个Rfam种子比对,并使结果可在JAR3D网页上访问,这使得检查和评估每个Rfam家族中每个循环的可能匹配变得容易。我们提供了几个例子来验证JAR3D识别正确循环基序的能力,使用训练集之外的rna的3D结构。我们创建了一个新的页面来搜索所有Rfam家族中特定循环基序的实例,以促进对每个基序发生的广泛程度的研究。我们提供了多少Rfam循环似乎很好地匹配已知的3D图案的统计数据。内环的匹配率比发夹或多螺旋结高得多。
{"title":"Identification of 3D motifs in Rfam with JAR3D.","authors":"James E Roll, Craig L Zirbel","doi":"10.1261/rna.080764.125","DOIUrl":"https://doi.org/10.1261/rna.080764.125","url":null,"abstract":"<p><p>Many non-protein-coding RNAs have been discovered and more are being discovered each year. At first we know them only by their sequences in a few organisms, but to understand their function and interactions, we need to understand what 3D structures they may form, in whole or in part. Many hairpin and internal loops are known to form structured 3D motifs, many of which are recurrent across different non-coding RNAs, for example, kink turn and sarcin-ricin internal loops, and GNRA, UNCG, and T-loop hairpin loops. As such, a new non-protein-coding RNA may well have one or more known structured 3D loop motifs. The goal of this paper is to introduce a tool which can identify loops in Rfam seed alignments that match well to known 3D loop motifs, and which makes those identifications easily accessible. JAR3D was developed to map sequences of hairpin and internal loops to known 3D motifs, and was extended for this work to 3-way and 4-way junction motifs. We applied JAR3D to 4,166 Rfam seed alignments from Rfam 15.0 and made the results accessible on the JAR3D web page, which makes it easy to inspect and evaluate the possible matches for each loop in each Rfam family. We provide several examples which validate JAR3D's ability to identify the correct loop motif, using 3D structures of RNAs outside of the training set. We created a new page to search for instances of a particular loop motif across all Rfam families, to facilitate studies of how widespread the occurrence of each motif is. We provide statistics on how many Rfam loops appear to match well to a known 3D motif. Match rates are much higher for internal loops than for hairpins or multi-helix junctions.</p>","PeriodicalId":21401,"journal":{"name":"RNA","volume":" ","pages":""},"PeriodicalIF":5.0,"publicationDate":"2026-01-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146053679","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Optimal stability of a highly conserved RNA G4 in PDCoV nsp8 supports viral proliferation. PDCoV nsp8中高度保守的RNA G4的最佳稳定性支持病毒增殖。
IF 5 3区 生物学 Q2 BIOCHEMISTRY & MOLECULAR BIOLOGY
RNA
Pub Date : 2026-01-26 DOI: 10.1261/rna.080834.125
Puxian Fang, Congbao Xie, Ting Cheng, Jingjing Sui, Yan Cheng, Tong Ding, Jiahui Guo, Yuhan Zhang, Liurong Fang, Dengguo Wei, Shaobo Xiao

Porcine deltacoronavirus (PDCoV), an emerging enteropathogenic coronavirus, primarily causes diarrhea in piglets and has the potential for cross-species transmission to humans. The recent detection of PDCoV in Haitian children underscores the urgent need for developing antiviral therapeutic strategies. G-quadruplexes (G4s) are implicated in the modulation of viral infection; however, their identification and roles in the PDCoV life cycle remain unclear. Here, we identified a highly conserved G4 structure, designated PDCoV-G4, located within the coding region of PDCoV non-structural protein 8 (nsp8). PDS and TMPyP4, two well-known G4-binding ligands, were found to target PDCoV-G4 and exhibit anti-PDCoV activity. Interestingly, PDS stabilizes the structure of PDCoV-G4, while TMPyP4 disrupts it. The recombinant PDCoV with G4-disruptive mutations (rPDCoV-nsp8mut) displays resistance to both PDS and TMPyP4. Utilizing an embryonated chicken eggs (ECEs) infection model, we observed that TMPyP4 provides superior protective effects for rPDCoV-wt-infected ECEs compared to PDS. However, both PDS and TMPyP4 exhibited diminished protective effects on chicken embryos infected with rPDCoV-nsp8mut, relative to rPDCoV-wt, further confirming their in vivo antiviral activity through targeting PDCoV-G4. These findings demonstrate that the PDCoV-G4 plays a crucial regulatory role in the PDCoV life cycle and pathogenicity, representing a potential target for antiviral therapy.

猪三角冠状病毒(PDCoV)是一种新兴的肠致病性冠状病毒,主要引起仔猪腹泻,并有可能跨物种传播给人类。最近在海地儿童中检测到PDCoV强调了制定抗病毒治疗策略的迫切需要。g -四联体(G4s)与病毒感染的调节有关;然而,它们的鉴定和在PDCoV生命周期中的作用仍不清楚。在此,我们鉴定了位于PDCoV非结构蛋白8 (nsp8)编码区高度保守的G4结构,命名为PDCoV-G4。PDS和TMPyP4是两种众所周知的g4结合配体,它们靶向PDCoV-G4并表现出抗pdcov活性。有趣的是,PDS稳定了PDCoV-G4的结构,而TMPyP4破坏了它。具有g4破坏性突变的重组PDCoV (rPDCoV-nsp8mut)对PDS和TMPyP4均具有抗性。利用鸡胚蛋(ECEs)感染模型,我们观察到TMPyP4对rpdcov -wt感染的鸡胚蛋具有优于PDS的保护作用。然而,与rPDCoV-wt相比,PDS和TMPyP4对感染了rPDCoV-nsp8mut的鸡胚的保护作用减弱,进一步证实了它们通过靶向PDCoV-G4而具有体内抗病毒活性。这些发现表明,PDCoV- g4在PDCoV生命周期和致病性中起着至关重要的调节作用,代表了抗病毒治疗的潜在靶点。
{"title":"Optimal stability of a highly conserved RNA G4 in PDCoV nsp8 supports viral proliferation.","authors":"Puxian Fang, Congbao Xie, Ting Cheng, Jingjing Sui, Yan Cheng, Tong Ding, Jiahui Guo, Yuhan Zhang, Liurong Fang, Dengguo Wei, Shaobo Xiao","doi":"10.1261/rna.080834.125","DOIUrl":"https://doi.org/10.1261/rna.080834.125","url":null,"abstract":"<p><p>Porcine deltacoronavirus (PDCoV), an emerging enteropathogenic coronavirus, primarily causes diarrhea in piglets and has the potential for cross-species transmission to humans. The recent detection of PDCoV in Haitian children underscores the urgent need for developing antiviral therapeutic strategies. G-quadruplexes (G4s) are implicated in the modulation of viral infection; however, their identification and roles in the PDCoV life cycle remain unclear. Here, we identified a highly conserved G4 structure, designated PDCoV-G4, located within the coding region of PDCoV non-structural protein 8 (nsp8). PDS and TMPyP4, two well-known G4-binding ligands, were found to target PDCoV-G4 and exhibit anti-PDCoV activity. Interestingly, PDS stabilizes the structure of PDCoV-G4, while TMPyP4 disrupts it. The recombinant PDCoV with G4-disruptive mutations (rPDCoV-nsp8mut) displays resistance to both PDS and TMPyP4. Utilizing an embryonated chicken eggs (ECEs) infection model, we observed that TMPyP4 provides superior protective effects for rPDCoV-wt-infected ECEs compared to PDS. However, both PDS and TMPyP4 exhibited diminished protective effects on chicken embryos infected with rPDCoV-nsp8mut, relative to rPDCoV-wt, further confirming their in vivo antiviral activity through targeting PDCoV-G4. These findings demonstrate that the PDCoV-G4 plays a crucial regulatory role in the PDCoV life cycle and pathogenicity, representing a potential target for antiviral therapy.</p>","PeriodicalId":21401,"journal":{"name":"RNA","volume":" ","pages":""},"PeriodicalIF":5.0,"publicationDate":"2026-01-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146053811","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Machine Learning for RNA Secondary Structure Prediction: a review of current methods and challenges. RNA二级结构预测的机器学习:当前方法和挑战的综述。
IF 5 3区 生物学 Q2 BIOCHEMISTRY & MOLECULAR BIOLOGY
RNA
Pub Date : 2026-01-23 DOI: 10.1261/rna.080840.125
Giuseppe Sacco, Giovanni Bussi, Guido Sanguinetti

Predicting the secondary structure of RNA is a core challenge in computational biology, essential for understanding molecular function and designing novel therapeutics. The field has evolved from foundational but accuracy-limited thermodynamic approaches to a new data-driven paradigm dominated by machine learning and deep learning. These models learn folding patterns directly from data, leading to significant performance gains. This review surveys the modern landscape of these methods, covering single-sequence, evolutionary-based, and hybrid models that blend machine learning with biophysics. A central theme is the field's "generalization crisis," where powerful models were found to fail on new RNA families, prompting a community-wide shift to stricter, homology-aware benchmarking. In response to the underlying challenge of data scarcity, RNA foundation models have emerged, learning from massive, unlabeled sequence corpora to improve generalization. Finally, we look ahead to the next set of major hurdles-including the accurate prediction of complex motifs like pseudoknots, scaling to kilobase-length transcripts, incorporating the chemical diversity of modified nucleotides, and shifting the prediction target from static structures to the dynamic ensembles that better capture biological function. We also highlight the need for a standardized, prospective benchmarking system to ensure unbiased validation and accelerate progress.

预测RNA的二级结构是计算生物学的核心挑战,对于理解分子功能和设计新的治疗方法至关重要。该领域已经从基础但精度有限的热力学方法发展到以机器学习和深度学习为主导的新的数据驱动范式。这些模型直接从数据中学习折叠模式,从而显著提高了性能。本文综述了这些方法的现代概况,包括单序列、基于进化的模型和混合模型,这些模型将机器学习与生物物理学相结合。一个中心主题是该领域的“泛化危机”,即强大的模型被发现在新的RNA家族中失败,促使整个社区转向更严格的,同源性意识的基准。为了应对数据稀缺的潜在挑战,RNA基础模型已经出现,从大量未标记的序列语料库中学习以提高泛化。最后,我们展望下一组主要的障碍-包括准确预测复杂的基序,如假结,扩展到千碱基长度的转录本,结合修饰核苷酸的化学多样性,以及将预测目标从静态结构转移到更好地捕获生物功能的动态集合。我们还强调需要一个标准化的,前瞻性的基准系统,以确保公正的验证和加速进展。
{"title":"Machine Learning for RNA Secondary Structure Prediction: a review of current methods and challenges.","authors":"Giuseppe Sacco, Giovanni Bussi, Guido Sanguinetti","doi":"10.1261/rna.080840.125","DOIUrl":"https://doi.org/10.1261/rna.080840.125","url":null,"abstract":"<p><p>Predicting the secondary structure of RNA is a core challenge in computational biology, essential for understanding molecular function and designing novel therapeutics. The field has evolved from foundational but accuracy-limited thermodynamic approaches to a new data-driven paradigm dominated by machine learning and deep learning. These models learn folding patterns directly from data, leading to significant performance gains. This review surveys the modern landscape of these methods, covering single-sequence, evolutionary-based, and hybrid models that blend machine learning with biophysics. A central theme is the field's \"generalization crisis,\" where powerful models were found to fail on new RNA families, prompting a community-wide shift to stricter, homology-aware benchmarking. In response to the underlying challenge of data scarcity, RNA foundation models have emerged, learning from massive, unlabeled sequence corpora to improve generalization. Finally, we look ahead to the next set of major hurdles-including the accurate prediction of complex motifs like pseudoknots, scaling to kilobase-length transcripts, incorporating the chemical diversity of modified nucleotides, and shifting the prediction target from static structures to the dynamic ensembles that better capture biological function. We also highlight the need for a standardized, prospective benchmarking system to ensure unbiased validation and accelerate progress.</p>","PeriodicalId":21401,"journal":{"name":"RNA","volume":" ","pages":""},"PeriodicalIF":5.0,"publicationDate":"2026-01-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146041549","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Blocky proline/glutamine patterns in the SFPQ intrinsically disordered region dictate paraspeckle formation as a distinct membraneless organelle. 块状脯氨酸/谷氨酰胺模式在SFPQ内在无序区域指示副斑形成作为一个独特的无膜细胞器。
IF 5 3区 生物学 Q2 BIOCHEMISTRY & MOLECULAR BIOLOGY
RNA
Pub Date : 2026-01-22 DOI: 10.1261/rna.080769.125
Hiro Takakuwa, Takao Yoda, Tomohiro Yamazaki, Tetsuro Hirose

Membraneless organelles (MLOs) formed through phase separation play crucial roles in various cellular processes. Many MLOs remain spatially compartmentalized, avoiding fusion or engulfment. MLOs are formed by dynamic multivalent interactions, often mediated by proteins with intrinsically disordered regions (IDRs). However, the molecular principles behind how IDRs maintain MLO independence remain poorly understood. Here, we investigated the proline/glutamine (P/Q)-rich IDR of SFPQ, a protein identified as a key factor in segregating paraspeckles from nuclear speckles. Paraspeckle segregation analyses, using SFPQ mutants tethered to NEAT1_2 long noncoding RNA, revealed that P/Q residues within the SFPQ IDR, conserved from humans to zebrafish, are crucial for its segregation activity. Beyond amino acid composition, the blocky patterns of P/Q residues, in which proline- and glutamine-rich blocks are repetitively arranged, are required for the segregation from nuclear speckles. Among human IDRs exhibiting PQ-block patterns, BRD4 IDR shows strong sequence similarity to the SFPQ IDR, and exhibits comparable segregation activity. Molecular dynamics simulation suggests that the PQ-blocky patterns required for the paraspeckle segregation do not correlate with the IDR characteristics necessary for self-assembly. Thus, these data suggest that the PQ-blocky patterns in IDRs represent a previously uncharacterized property that contributes to MLO independence, possibly through a mechanism distinct from the conventional phase separation-promoting function of IDRs.

通过相分离形成的无膜细胞器在各种细胞过程中起着至关重要的作用。许多MLOs在空间上保持分隔,避免融合或吞没。MLOs是由动态多价相互作用形成的,通常由具有内在无序区(IDRs)的蛋白质介导。然而,idr维持MLO独立性背后的分子原理仍然知之甚少。在这里,我们研究了富含脯氨酸/谷氨酰胺(P/Q)的SFPQ蛋白的IDR, SFPQ蛋白被认为是分离副斑和核斑的关键因素。利用连接到neat12长链非编码RNA上的SFPQ突变体进行的旁斑分离分析表明,从人类到斑马鱼的SFPQ IDR中的P/Q残基对其分离活性至关重要。除了氨基酸组成外,P/Q残基的块状模式,其中富含脯氨酸和谷氨酰胺的块被重复排列,是与核斑点分离所必需的。在具有PQ-block模式的人类IDR中,BRD4 IDR与SFPQ IDR具有很强的序列相似性,并且具有相似的分离活性。分子动力学模拟表明,副散斑分离所需的pq块状模式与自组装所需的IDR特征无关。因此,这些数据表明,idr中的pq块模式代表了一种以前未被表征的特性,有助于MLO独立性,可能是通过一种不同于idr传统的相分离促进功能的机制。
{"title":"Blocky proline/glutamine patterns in the SFPQ intrinsically disordered region dictate paraspeckle formation as a distinct membraneless organelle.","authors":"Hiro Takakuwa, Takao Yoda, Tomohiro Yamazaki, Tetsuro Hirose","doi":"10.1261/rna.080769.125","DOIUrl":"https://doi.org/10.1261/rna.080769.125","url":null,"abstract":"<p><p>Membraneless organelles (MLOs) formed through phase separation play crucial roles in various cellular processes. Many MLOs remain spatially compartmentalized, avoiding fusion or engulfment. MLOs are formed by dynamic multivalent interactions, often mediated by proteins with intrinsically disordered regions (IDRs). However, the molecular principles behind how IDRs maintain MLO independence remain poorly understood. Here, we investigated the proline/glutamine (P/Q)-rich IDR of SFPQ, a protein identified as a key factor in segregating paraspeckles from nuclear speckles. Paraspeckle segregation analyses, using SFPQ mutants tethered to NEAT1_2 long noncoding RNA, revealed that P/Q residues within the SFPQ IDR, conserved from humans to zebrafish, are crucial for its segregation activity. Beyond amino acid composition, the blocky patterns of P/Q residues, in which proline- and glutamine-rich blocks are repetitively arranged, are required for the segregation from nuclear speckles. Among human IDRs exhibiting PQ-block patterns, BRD4 IDR shows strong sequence similarity to the SFPQ IDR, and exhibits comparable segregation activity. Molecular dynamics simulation suggests that the PQ-blocky patterns required for the paraspeckle segregation do not correlate with the IDR characteristics necessary for self-assembly. Thus, these data suggest that the PQ-blocky patterns in IDRs represent a previously uncharacterized property that contributes to MLO independence, possibly through a mechanism distinct from the conventional phase separation-promoting function of IDRs.</p>","PeriodicalId":21401,"journal":{"name":"RNA","volume":" ","pages":""},"PeriodicalIF":5.0,"publicationDate":"2026-01-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146030579","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Uridine analogs prevent stress granule formation, not by blocking PKR recognition, but by inhibiting the synthesis of T7 RNA Polymerase byproducts. 尿苷类似物阻止应激颗粒的形成,不是通过阻断PKR识别,而是通过抑制T7 RNA聚合酶副产物的合成。
IF 5 3区 生物学 Q2 BIOCHEMISTRY & MOLECULAR BIOLOGY
RNA
Pub Date : 2026-01-22 DOI: 10.1261/rna.080481.125
Sean J Ihn, Emily Jiang, Nevraj S KeJiou, Yifan E Wang, Laura Farlam-Williams, Alexander F Palazzo, Hyun O Lee

mRNA-based therapeutics are commonly produced through T7 RNA Polymerase-mediated in vitro transcription. Introducing these exogenous RNAs into human cells activates an RNA sensor Protein Kinase R (PKR), which suppresses translation initiation and reduces their therapeutic effectiveness. Incorporating uridine analogs into these transcripts prevents PKR activation and translation shutdown, but the underlying mechanism remains unclear. Here, we demonstrate that treating T7 RNA Polymerase-produced transcripts with RNase III, which selectively degrades double-stranded RNA (dsRNA), blocks PKR activation and downstream translation-inhibition events, including eIF2α phosphorylation and stress granule formation in human cells. Interestingly, dsRNAs generated with uridine analogs robustly induce eIF2α phosphorylation and stress granules to the same extent as dsRNA containing uridine. These findings indicate that uridine analogs do not prevent PKR from detecting dsRNA. Instead, we show that uridine analogs decrease the production of T7 RNA Polymerase byproducts, including antisense RNA and dsRNA, which activate PKR and downstream stress responses. Finally, we demonstrate that higher amounts of exogenous RNA, lacking T7 RNA Polymerase byproducts, can induce stress granules independently of PKR and phospho-eIF2α, but dependent on stress granule scaffold proteins G3BP1 and G3BP2. Together, our findings show that uridine analogs mitigate PKR signaling not by blocking mRNA-PKR interactions, but by minimizing dsRNA byproducts from T7 Polymerase transcription. Furthermore, stress granule formation in response to high levels of exogenous RNA can occur through a mechanism that does not depend on PKR but relies on G3BP1 and G3BP2. These insights clarify the role of uridine analogs in PKR activation and may inform future therapeutic RNA design.

基于mrna的疗法通常通过T7 RNA聚合酶介导的体外转录产生。将这些外源性RNA引入人类细胞,激活RNA传感器蛋白激酶R (PKR),从而抑制翻译起始并降低其治疗效果。在这些转录本中加入尿苷类似物可防止PKR激活和翻译关闭,但潜在的机制尚不清楚。在这里,我们证明了用RNase III处理T7 RNA聚合酶产生的转录本,它可以选择性地降解双链RNA (dsRNA),阻断PKR激活和下游的翻译抑制事件,包括人类细胞中eIF2α磷酸化和应激颗粒的形成。有趣的是,由尿苷类似物产生的dsRNA与含有尿苷的dsRNA在相同程度上强烈诱导eIF2α磷酸化和应激颗粒。这些发现表明尿苷类似物不会阻止PKR检测dsRNA。相反,我们发现尿苷类似物减少了T7 RNA聚合酶副产物的产生,包括反义RNA和dsRNA,它们激活PKR和下游应激反应。最后,我们证明了缺乏T7 RNA聚合酶副产物的大量外源RNA可以独立于PKR和磷酸化- eif2 α诱导应激颗粒,但依赖于应激颗粒支架蛋白G3BP1和G3BP2。总之,我们的研究结果表明尿苷类似物不是通过阻断mRNA-PKR相互作用,而是通过减少T7聚合酶转录的dsRNA副产物来减轻PKR信号传导。此外,对高水平外源RNA的应激颗粒形成可以通过不依赖于PKR而依赖于G3BP1和G3BP2的机制发生。这些发现阐明了尿苷类似物在PKR激活中的作用,并可能为未来的治疗性RNA设计提供信息。
{"title":"Uridine analogs prevent stress granule formation, not by blocking PKR recognition, but by inhibiting the synthesis of T7 RNA Polymerase byproducts.","authors":"Sean J Ihn, Emily Jiang, Nevraj S KeJiou, Yifan E Wang, Laura Farlam-Williams, Alexander F Palazzo, Hyun O Lee","doi":"10.1261/rna.080481.125","DOIUrl":"https://doi.org/10.1261/rna.080481.125","url":null,"abstract":"<p><p>mRNA-based therapeutics are commonly produced through T7 RNA Polymerase-mediated in vitro transcription. Introducing these exogenous RNAs into human cells activates an RNA sensor Protein Kinase R (PKR), which suppresses translation initiation and reduces their therapeutic effectiveness. Incorporating uridine analogs into these transcripts prevents PKR activation and translation shutdown, but the underlying mechanism remains unclear. Here, we demonstrate that treating T7 RNA Polymerase-produced transcripts with RNase III, which selectively degrades double-stranded RNA (dsRNA), blocks PKR activation and downstream translation-inhibition events, including eIF2α phosphorylation and stress granule formation in human cells. Interestingly, dsRNAs generated with uridine analogs robustly induce eIF2α phosphorylation and stress granules to the same extent as dsRNA containing uridine. These findings indicate that uridine analogs do not prevent PKR from detecting dsRNA. Instead, we show that uridine analogs decrease the production of T7 RNA Polymerase byproducts, including antisense RNA and dsRNA, which activate PKR and downstream stress responses. Finally, we demonstrate that higher amounts of exogenous RNA, lacking T7 RNA Polymerase byproducts, can induce stress granules independently of PKR and phospho-eIF2α, but dependent on stress granule scaffold proteins G3BP1 and G3BP2. Together, our findings show that uridine analogs mitigate PKR signaling not by blocking mRNA-PKR interactions, but by minimizing dsRNA byproducts from T7 Polymerase transcription. Furthermore, stress granule formation in response to high levels of exogenous RNA can occur through a mechanism that does not depend on PKR but relies on G3BP1 and G3BP2. These insights clarify the role of uridine analogs in PKR activation and may inform future therapeutic RNA design.</p>","PeriodicalId":21401,"journal":{"name":"RNA","volume":" ","pages":""},"PeriodicalIF":5.0,"publicationDate":"2026-01-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146030524","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Miniature NAD+-II riboswitches control bacterial genes for nicotinamide salvage and de novo NAD+ biosynthesis. 微型NAD+-II核开关控制细菌基因的烟酰胺回收和新生的NAD+生物合成。
IF 5 3区 生物学 Q2 BIOCHEMISTRY & MOLECULAR BIOLOGY
RNA
Pub Date : 2026-01-16 DOI: 10.1261/rna.080744.125
Christopher G King, Kenny P Cheng, Ronald R Breaker

Nicotinamide adenine dinucleotide (NAD) is a ubiquitous enzyme cofactor that serves as a carrier of hydride ions for metabolic oxidation-reduction reactions. NAD is also sometimes used as a source of activated adenosine monophosphate (AMP) for adenylation reactions or as a precursor of ADP-ribose upon removal of nicotinamide. Many bacterial riboswitch classes are known to sense nucleotide-derived enzyme cofactors, but NAD is one of several ancient cofactors that have few or no known riboswitch representatives. Two rare riboswitch classes, named NAD+-I and NAD+-II, have been reported that regulate genes relevant to NAD biosynthesis and transport. However, these RNAs exhibit unusual functional and structural properties. Here we report that miniature NAD+-II riboswitches, named mini-NAD+-II, are more abundant and widespread than the longer RNAs that were used to define the original consensus model for this class. The newfound examples are commonly found within lactic acid bacteria, which are notable for varied metabolic fermentation strategies used to maintain sufficient NAD+ Furthermore, the simple H-type pseudoknot core of mini-NAD+-II aptamers is similar to that of class I preQ1 riboswitch (preQ1-I) aptamers. Thus, H-type pseudoknots might serve as a versatile architecture for the natural or synthetic construction of ligand-binding aptamers.

烟酰胺腺嘌呤二核苷酸(NAD)是一种普遍存在的酶辅因子,在代谢氧化还原反应中作为氢化物离子的载体。NAD有时也被用作腺苷化反应中活化的一磷酸腺苷(AMP)的来源,或在去除烟酰胺时作为adp核糖的前体。已知许多细菌的核开关类可以感应核苷酸衍生的酶辅助因子,但NAD是几种很少或没有已知核开关代表的古老辅助因子之一。NAD+-I和NAD+-II是两种罕见的核蛋白开关类,它们调节与NAD生物合成和运输相关的基因。然而,这些rna表现出不同寻常的功能和结构特性。在这里,我们报告了微型NAD+-II核糖开关,称为mini-NAD+-II,比用于定义该类原始共识模型的较长rna更丰富和广泛。新发现的例子通常在乳酸菌中发现,这是值得注意的,用于维持足够的NAD+的各种代谢发酵策略。此外,mini-NAD+-II适配体的简单h型假结核与I类preQ1核糖开关适配体(preQ1-I)相似。因此,h型假结可以作为天然或合成的配体结合适体结构的通用结构。
{"title":"Miniature NAD<sup>+</sup>-II riboswitches control bacterial genes for nicotinamide salvage and de novo NAD<sup>+</sup> biosynthesis.","authors":"Christopher G King, Kenny P Cheng, Ronald R Breaker","doi":"10.1261/rna.080744.125","DOIUrl":"10.1261/rna.080744.125","url":null,"abstract":"<p><p>Nicotinamide adenine dinucleotide (NAD) is a ubiquitous enzyme cofactor that serves as a carrier of hydride ions for metabolic oxidation-reduction reactions. NAD is also sometimes used as a source of activated adenosine monophosphate (AMP) for adenylation reactions or as a precursor of ADP-ribose upon removal of nicotinamide. Many bacterial riboswitch classes are known to sense nucleotide-derived enzyme cofactors, but NAD is one of several ancient cofactors that have few or no known riboswitch representatives. Two rare riboswitch classes, named NAD<sup>+</sup>-I and NAD<sup>+</sup>-II, have been reported that regulate genes relevant to NAD biosynthesis and transport. However, these RNAs exhibit unusual functional and structural properties. Here we report that miniature NAD<sup>+</sup>-II riboswitches, named mini-NAD<sup>+</sup>-II, are more abundant and widespread than the longer RNAs that were used to define the original consensus model for this class. The newfound examples are commonly found within lactic acid bacteria, which are notable for varied metabolic fermentation strategies used to maintain sufficient NAD<sup>+</sup> Furthermore, the simple H-type pseudoknot core of mini-NAD<sup>+</sup>-II aptamers is similar to that of class I preQ<sub>1</sub> riboswitch (preQ<sub>1</sub>-I) aptamers. Thus, H-type pseudoknots might serve as a versatile architecture for the natural or synthetic construction of ligand-binding aptamers.</p>","PeriodicalId":21401,"journal":{"name":"RNA","volume":" ","pages":"162-170"},"PeriodicalIF":5.0,"publicationDate":"2026-01-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12810184/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145542228","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
PANTHER Score: Protein-Affinity for Nucleic Target-binding, Hybridization, and Energy Regression. PANTHER -蛋白亲和力的核酸靶标结合,杂交,和能量回归。
IF 5 3区 生物学 Q2 BIOCHEMISTRY & MOLECULAR BIOLOGY
RNA
Pub Date : 2026-01-16 DOI: 10.1261/rna.080646.125
Parisa Aletayeb, Akash Deep Biswas, Stefano Rocca, Carmine Talarico, Giulio Vistoli, Alessandro Pedretti

Although protein-RNA interactions are crucial for many biological processes, predicting their binding free energies (ΔG) is a challenging task due to limited available experimental data and the complexity of these interactions. To address this issue, we developed a machine learning-based model designed to predict energy-based scores for protein-RNA complexes, called PANTHER Score. By applying a local-to-global approach, we proposed a methodology further subdivided into five steps: (1) We derived 87,117 pairwise local interaction energies from 331,744 MD-derived interactions across 46 curated protein-RNA complexes; (2) we trained ML models on pairwise interaction features to predict local interaction energies without performing MD simulations; (3) we integrated predicted local interaction energies using a local-to-global methodology, to compute model-specific PANTHER Score; (4) we evaluate model-specific PANTHER Score on an independent test set of seven complexes; and (5) we validated and selected the optimal model using an external stress set of 110 complexes with experimental ΔG values for implementation in the PANTHER Scoring pipeline. Among the regression models developed, Random Forest Regression exhibited the highest predictive performance as a model-specific PANTHER Score, achieveing a Pearson correlation (r) of 0.80 and MAE of 1.79 kcal/mol on the test set. It maintained strong predictive capabilities on the stress set (r = 0.64, MAE = 1.63 kcal/mol). Benchmarking against existing tools on the stress test set, the PANTHER Score demonstrated superior accuracy and reliability. This study highlights the effectiveness of MD and machine learning in addressing data limitations through innovative strategies, positioning the PANTHER Score as a robust tool for predicting protein-RNA binding affinities in biomolecular research, drug discovery and mainly in RNA-therapeutics.

尽管蛋白质- rna相互作用对许多生物过程至关重要,但由于可用的实验数据有限和这些相互作用的复杂性,预测它们的结合自由能(ΔG)是一项具有挑战性的任务。为了解决这个问题,我们开发了一个基于机器学习的模型,旨在预测蛋白质- rna复合物的能量分数,称为PANTHER分数。通过应用局部到全局的方法,本文提出的方法可以细分为四个步骤:(1)我们从分子动力学模拟中获得了由46个策划的蛋白质- rna复合物组成的训练集的331,744个相互作用能中获得了87,117个成对的局部相互作用能;(2)在不进行MD运行的情况下,训练基于两两相互作用特征的ML模型来预测局部相互作用能量;(3)将预测的局部相互作用能与本文提出的局部到全局方法相结合,计算模型特有的PANTHER分数;(4)我们在7个复合物的测试集上测试了模型特定的PANTHER分数(5)我们进一步将所有模型暴露于包括110个复合物的外部应力集,实验ΔG允许最终选择在PANTHER评分管道中实施的最佳模型。在所有的多元回归模型中,随机森林回归作为模型特异性的PANTHER评分表现出最高的预测性能,Pearson相关系数(r)为0.80,平均绝对误差(MAE)为1.79 kcal/mol。随机森林回归模型(r)为0.64,MAE为1.63 kcal/mol,对应力集具有较强的预测能力。与现有压力测试工具相比,PANTHER评分显示出更高的准确性和可靠性。本研究强调了机器学习在通过创新策略解决数据限制方面的有效性,将本文提出的PANTHER评分定位为生物分子研究和药物发现中预测蛋白质- rna结合亲和力的有价值工具。
{"title":"PANTHER Score: Protein-Affinity for Nucleic Target-binding, Hybridization, and Energy Regression.","authors":"Parisa Aletayeb, Akash Deep Biswas, Stefano Rocca, Carmine Talarico, Giulio Vistoli, Alessandro Pedretti","doi":"10.1261/rna.080646.125","DOIUrl":"10.1261/rna.080646.125","url":null,"abstract":"<p><p>Although protein-RNA interactions are crucial for many biological processes, predicting their binding free energies (Δ<i>G</i>) is a challenging task due to limited available experimental data and the complexity of these interactions. To address this issue, we developed a machine learning-based model designed to predict energy-based scores for protein-RNA complexes, called PANTHER Score. By applying a local-to-global approach, we proposed a methodology further subdivided into five steps: (1) We derived 87,117 pairwise local interaction energies from 331,744 MD-derived interactions across 46 curated protein-RNA complexes; (2) we trained ML models on pairwise interaction features to predict local interaction energies without performing MD simulations; (3) we integrated predicted local interaction energies using a local-to-global methodology, to compute model-specific PANTHER Score; (4) we evaluate model-specific PANTHER Score on an independent test set of seven complexes; and (5) we validated and selected the optimal model using an external stress set of 110 complexes with experimental Δ<i>G</i> values for implementation in the PANTHER Scoring pipeline. Among the regression models developed, Random Forest Regression exhibited the highest predictive performance as a model-specific PANTHER Score, achieveing a Pearson correlation (<i>r</i>) of 0.80 and MAE of 1.79 kcal/mol on the test set. It maintained strong predictive capabilities on the stress set (<i>r</i> = 0.64, MAE = 1.63 kcal/mol). Benchmarking against existing tools on the stress test set, the PANTHER Score demonstrated superior accuracy and reliability. This study highlights the effectiveness of MD and machine learning in addressing data limitations through innovative strategies, positioning the PANTHER Score as a robust tool for predicting protein-RNA binding affinities in biomolecular research, drug discovery and mainly in RNA-therapeutics.</p>","PeriodicalId":21401,"journal":{"name":"RNA","volume":" ","pages":"131-149"},"PeriodicalIF":5.0,"publicationDate":"2026-01-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12810179/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145669402","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
RNA
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1