{"title":"Prediction of Selenoproteins Based on Motif Recognition","authors":"Lan Tao, Geng Liu, Xiaoli Wang, Lei Zhang","doi":"10.1109/BMEI.2009.5302720","DOIUrl":null,"url":null,"abstract":"At present available computational methods can not predict selenoproteins correctly because of the special features of selenoproteins. It is known that there are some conservative sections around U in selenoproteins from previous research. So we bring forward a new method to predict selenoproteins based on motif recognition, we use Multiple Em for Motif Elicitation (MEME) to discover motif around U in selenoproteins and then predict selenoproteins based on the motif. The new method found all the selenoproteins in 9 seleno families expect one false positive in family of GPX1 and one in SelS. From the experiment, it is showed that this method can effectively predict almost all the selenoproteins in the known seleno families, and better than the methods of locating the position of U and blasting the Sec/Cys pairing based on handcraft.","PeriodicalId":6389,"journal":{"name":"2009 2nd International Conference on Biomedical Engineering and Informatics","volume":"12 1","pages":"1-5"},"PeriodicalIF":0.0000,"publicationDate":"2009-10-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2009 2nd International Conference on Biomedical Engineering and Informatics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/BMEI.2009.5302720","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
At present available computational methods can not predict selenoproteins correctly because of the special features of selenoproteins. It is known that there are some conservative sections around U in selenoproteins from previous research. So we bring forward a new method to predict selenoproteins based on motif recognition, we use Multiple Em for Motif Elicitation (MEME) to discover motif around U in selenoproteins and then predict selenoproteins based on the motif. The new method found all the selenoproteins in 9 seleno families expect one false positive in family of GPX1 and one in SelS. From the experiment, it is showed that this method can effectively predict almost all the selenoproteins in the known seleno families, and better than the methods of locating the position of U and blasting the Sec/Cys pairing based on handcraft.
由于硒蛋白的特殊性质,现有的计算方法不能正确预测硒蛋白。根据以往的研究,硒蛋白中U附近存在一些保守区。为此,我们提出了一种基于基序识别的预测硒蛋白的新方法,即利用多模态模序激发(Multiple Em for motif Elicitation, MEME)技术发现硒蛋白中U周围的基序,并基于该基序预测硒蛋白。新方法在9个硒蛋白家族中发现了除GPX1家族和SelS家族中的一个假阳性外的所有硒蛋白。实验结果表明,该方法能有效预测已知硒蛋白家族中几乎所有的硒蛋白,且优于手工定位U和爆破Sec/Cys配对的方法。