{"title":"Using the RNAstructure Software Package to Predict Conserved RNA Structures","authors":"Abhinav Mittal, Sara E. Ali, David H. Mathews","doi":"10.1002/cpz1.70054","DOIUrl":null,"url":null,"abstract":"<p>The structures of many non-coding RNAs (ncRNA) are conserved by evolution to a greater extent than their sequences. By predicting the conserved structure of two or more homologous sequences, the accuracy of secondary structure prediction can be improved as compared to structure prediction for a single sequence. Here, we provide protocols for the use of four programs in the RNAstructure suite to predict conserved structures: Multilign, TurboFold, Dynalign, and PARTS. TurboFold iteratively aligns multiple homologous sequences and estimates the pairing probabilities for the conserved structure. Dynalign, PARTS, and Multilign are dynamic programming algorithms that simultaneously align sequences and identify the common secondary structure. Dynalign uses a pair of homologs and finds the lowest free energy common structure. PARTS uses a pair of homologs and estimates pairing probabilities from the base pairing probabilities estimated for each sequence. Multilign uses two or more homologs and finds the lowest free energy common structure using multiple pairwise calculations with Dynalign. It scales linearly with the number of sequences. We outline the strengths of each program. These programs can be run through web servers, on the command line, or with graphical user interfaces. © 2024 Wiley Periodicals LLC.</p><p><b>Basic Protocol 1</b>: Predicting a structure conserved in three or more sequences with the RNAstructure web server</p><p><b>Basic Protocol 2</b>: Predicting a structure conserved in two sequences with the RNAstructure web server</p><p><b>Alternative Protocol 1</b>: Predicting a structure conserved in multiple sequences in the RNAstructure graphical user interface</p><p><b>Alternative Protocol 2</b>: Predicting a structure conserved in two sequences with Dynalign in the RNAstructure graphical user interface</p><p><b>Alternative Protocol 3</b>: Running TurboFold on the command line</p>","PeriodicalId":93970,"journal":{"name":"Current protocols","volume":"4 11","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-11-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Current protocols","FirstCategoryId":"1085","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/cpz1.70054","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
The structures of many non-coding RNAs (ncRNA) are conserved by evolution to a greater extent than their sequences. By predicting the conserved structure of two or more homologous sequences, the accuracy of secondary structure prediction can be improved as compared to structure prediction for a single sequence. Here, we provide protocols for the use of four programs in the RNAstructure suite to predict conserved structures: Multilign, TurboFold, Dynalign, and PARTS. TurboFold iteratively aligns multiple homologous sequences and estimates the pairing probabilities for the conserved structure. Dynalign, PARTS, and Multilign are dynamic programming algorithms that simultaneously align sequences and identify the common secondary structure. Dynalign uses a pair of homologs and finds the lowest free energy common structure. PARTS uses a pair of homologs and estimates pairing probabilities from the base pairing probabilities estimated for each sequence. Multilign uses two or more homologs and finds the lowest free energy common structure using multiple pairwise calculations with Dynalign. It scales linearly with the number of sequences. We outline the strengths of each program. These programs can be run through web servers, on the command line, or with graphical user interfaces. © 2024 Wiley Periodicals LLC.
Basic Protocol 1: Predicting a structure conserved in three or more sequences with the RNAstructure web server
Basic Protocol 2: Predicting a structure conserved in two sequences with the RNAstructure web server
Alternative Protocol 1: Predicting a structure conserved in multiple sequences in the RNAstructure graphical user interface
Alternative Protocol 2: Predicting a structure conserved in two sequences with Dynalign in the RNAstructure graphical user interface
Alternative Protocol 3: Running TurboFold on the command line
使用 RNAstructure 软件包预测保守的 RNA 结构。
许多非编码 RNA(ncRNA)的结构在进化过程中的保守程度要高于其序列。通过预测两个或更多同源序列的保守结构,二级结构预测的准确性可以比单一序列的结构预测有所提高。在此,我们提供了使用 RNAstructure 套件中的四个程序预测保守结构的协议:Multilign、TurboFold、Dynalign 和 PARTS。TurboFold 对多个同源序列进行迭代对齐,并估计保守结构的配对概率。Dynalign、PARTS 和 Multilign 是动态编程算法,可同时对齐序列并识别共同的二级结构。Dynalign 使用一对同源物,找出自由能最低的共同结构。PARTS 使用一对同源物,并根据每个序列的碱基配对概率估算配对概率。Multilign 使用两个或更多同源物,并通过与 Dynalign 的多次配对计算找到自由能最低的共同结构。它与序列的数量成线性比例。我们概述了每个程序的优势。这些程序可通过网络服务器、命令行或图形用户界面运行。© 2024 Wiley Periodicals LLC.基本协议 1:使用 RNAstructure 网络服务器预测三个或更多序列中的保守结构 基本协议 2:使用 RNAstructure 网络服务器预测两个序列中的保守结构 替代协议 1:在 RNAstructure 图形用户界面中预测多个序列中的保守结构 替代协议 2:在 RNAstructure 图形用户界面中使用 Dynalign 预测两个序列中的保守结构 替代协议 3:在命令行上运行 TurboFold。
本文章由计算机程序翻译,如有差异,请以英文原文为准。