{"title":"Classification of lncRNA and mRNA of Eukaryotic model organism using physicochemical properties and composition of dineuclotides and trineuclotides","authors":"R. Prasad, A. Krishnamachari","doi":"10.1109/PCEMS58491.2023.10136048","DOIUrl":null,"url":null,"abstract":"Unveiling lncRNA and mRNA gene differences at the sequence level is one of the important challenges in molecular and disease biology. In the context of DNA sequence, this difference in a physicochemical signature parameter is very important. In this study, we have proposed a machine learning-based computational approach for the classification of these genomic features. we have considered three important physicochemical properties,solvation energy, hydrogen bonding ensrgy and stacking energy of dinucleotide and trinucleotide of lncRNA and mRNA sequence as well as dinucleotide and trinucleotide composition in their sequences.We have considered lncRNA and mRNA sequences from seven model organisms namely Arabidopsis thliana, C.elegans, Chicken, Chimpanzee, Cow, Platypus, and Zebrafish.","PeriodicalId":330870,"journal":{"name":"2023 2nd International Conference on Paradigm Shifts in Communications Embedded Systems, Machine Learning and Signal Processing (PCEMS)","volume":"195 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-04-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 2nd International Conference on Paradigm Shifts in Communications Embedded Systems, Machine Learning and Signal Processing (PCEMS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/PCEMS58491.2023.10136048","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Unveiling lncRNA and mRNA gene differences at the sequence level is one of the important challenges in molecular and disease biology. In the context of DNA sequence, this difference in a physicochemical signature parameter is very important. In this study, we have proposed a machine learning-based computational approach for the classification of these genomic features. we have considered three important physicochemical properties,solvation energy, hydrogen bonding ensrgy and stacking energy of dinucleotide and trinucleotide of lncRNA and mRNA sequence as well as dinucleotide and trinucleotide composition in their sequences.We have considered lncRNA and mRNA sequences from seven model organisms namely Arabidopsis thliana, C.elegans, Chicken, Chimpanzee, Cow, Platypus, and Zebrafish.