{"title":"经验香农熵与抽象香农熵是否在值上收敛?RNA分子结构的一个案例","authors":"Amirhossein Manzourolajdad","doi":"10.1109/IETC47856.2020.9249220","DOIUrl":null,"url":null,"abstract":"The RNA molecule is capable of folding into different shapes, with some being more stable than others. The structural space of the RNA can be described by Stochastic Context-free Grammars (SCFG), offering a probabilisitic distribution of structural scenarios. In a more accurate folding model, the probability associated with a structural scenario is more informative of its stability. Here, we offer two different ways of calculating the Shannon Entropy of the SCFG-modeled RNA structural space; Grammar Space (GS) Entropy and SCFG Entropy. The former is the Shannon Entropy of the infinite number of structures produced by the model and the latter is the Shannon Entropy of a limited subset of structures all of which belong to the same RNA sequence. After a brief introduction on the two measures, we explore the relationship between these measures on a given set of RNA folding models and biologically functional RNA sequences. We show that these two measures of entropy are indeed correlated. While more theoretical work is needed in understanding the convergence behavior between the two, this observation suggests that GS Entropy is a promising characteristic in future model training approaches.","PeriodicalId":186446,"journal":{"name":"2020 Intermountain Engineering, Technology and Computing (IETC)","volume":"81 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-10-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Do Empirical and Abstract Shannon Entropies Converge in Value? A Case in RNA Molecular Structure\",\"authors\":\"Amirhossein Manzourolajdad\",\"doi\":\"10.1109/IETC47856.2020.9249220\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The RNA molecule is capable of folding into different shapes, with some being more stable than others. The structural space of the RNA can be described by Stochastic Context-free Grammars (SCFG), offering a probabilisitic distribution of structural scenarios. In a more accurate folding model, the probability associated with a structural scenario is more informative of its stability. Here, we offer two different ways of calculating the Shannon Entropy of the SCFG-modeled RNA structural space; Grammar Space (GS) Entropy and SCFG Entropy. The former is the Shannon Entropy of the infinite number of structures produced by the model and the latter is the Shannon Entropy of a limited subset of structures all of which belong to the same RNA sequence. After a brief introduction on the two measures, we explore the relationship between these measures on a given set of RNA folding models and biologically functional RNA sequences. We show that these two measures of entropy are indeed correlated. While more theoretical work is needed in understanding the convergence behavior between the two, this observation suggests that GS Entropy is a promising characteristic in future model training approaches.\",\"PeriodicalId\":186446,\"journal\":{\"name\":\"2020 Intermountain Engineering, Technology and Computing (IETC)\",\"volume\":\"81 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-10-02\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 Intermountain Engineering, Technology and Computing (IETC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/IETC47856.2020.9249220\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 Intermountain Engineering, Technology and Computing (IETC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IETC47856.2020.9249220","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Do Empirical and Abstract Shannon Entropies Converge in Value? A Case in RNA Molecular Structure
The RNA molecule is capable of folding into different shapes, with some being more stable than others. The structural space of the RNA can be described by Stochastic Context-free Grammars (SCFG), offering a probabilisitic distribution of structural scenarios. In a more accurate folding model, the probability associated with a structural scenario is more informative of its stability. Here, we offer two different ways of calculating the Shannon Entropy of the SCFG-modeled RNA structural space; Grammar Space (GS) Entropy and SCFG Entropy. The former is the Shannon Entropy of the infinite number of structures produced by the model and the latter is the Shannon Entropy of a limited subset of structures all of which belong to the same RNA sequence. After a brief introduction on the two measures, we explore the relationship between these measures on a given set of RNA folding models and biologically functional RNA sequences. We show that these two measures of entropy are indeed correlated. While more theoretical work is needed in understanding the convergence behavior between the two, this observation suggests that GS Entropy is a promising characteristic in future model training approaches.