{"title":"基于蛋白质序列特征和结构域-结构域相互作用数据的疾病相关非单核苷酸多态性鉴定","authors":"Rui Jiang, Mingxin Gan, Jiaxin Wu","doi":"10.1504/IJCBDD.2012.049204","DOIUrl":null,"url":null,"abstract":"<p><p>Recent studies have suggested the common disease-rare variant (CD-RV) hypothesis in the mapping of disease-related genetic variants and have proposed a number of statistical methods to detect associations between rare variants and human inherited diseases. However, most of these methods take the selection of functional variants as a preliminary step in order to maximise the power of statistical tests. To meet this end, we put forward a filtration approach to identify genetic variants that are potentially associated with a query disease of interest from the perspective of one-class novelty learning. We propose to prioritise candidate non-synonymous single nucleotide polymorphisms (nsSNPs) relying on the integrated use of two sequence conservation properties of amino acids calculated from multiple sequence alignment of protein sequences and one functional similarity measure derived from domain-domain interaction data. We show the power of this approach in the detection of disease-related nsSNP via large-scale leave-one-out cross-validation experiments.</p>","PeriodicalId":39227,"journal":{"name":"International Journal of Computational Biology and Drug Design","volume":"5 3-4","pages":"206-21"},"PeriodicalIF":0.0000,"publicationDate":"2012-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1504/IJCBDD.2012.049204","citationCount":"0","resultStr":"{\"title\":\"Identification of disease-related nsSNPs via the integration of protein sequence features and domain-domain interaction data.\",\"authors\":\"Rui Jiang, Mingxin Gan, Jiaxin Wu\",\"doi\":\"10.1504/IJCBDD.2012.049204\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Recent studies have suggested the common disease-rare variant (CD-RV) hypothesis in the mapping of disease-related genetic variants and have proposed a number of statistical methods to detect associations between rare variants and human inherited diseases. However, most of these methods take the selection of functional variants as a preliminary step in order to maximise the power of statistical tests. To meet this end, we put forward a filtration approach to identify genetic variants that are potentially associated with a query disease of interest from the perspective of one-class novelty learning. We propose to prioritise candidate non-synonymous single nucleotide polymorphisms (nsSNPs) relying on the integrated use of two sequence conservation properties of amino acids calculated from multiple sequence alignment of protein sequences and one functional similarity measure derived from domain-domain interaction data. We show the power of this approach in the detection of disease-related nsSNP via large-scale leave-one-out cross-validation experiments.</p>\",\"PeriodicalId\":39227,\"journal\":{\"name\":\"International Journal of Computational Biology and Drug Design\",\"volume\":\"5 3-4\",\"pages\":\"206-21\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2012-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://sci-hub-pdf.com/10.1504/IJCBDD.2012.049204\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International Journal of Computational Biology and Drug Design\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1504/IJCBDD.2012.049204\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2012/9/24 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"Q4\",\"JCRName\":\"Pharmacology, Toxicology and Pharmaceutics\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Computational Biology and Drug Design","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1504/IJCBDD.2012.049204","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2012/9/24 0:00:00","PubModel":"Epub","JCR":"Q4","JCRName":"Pharmacology, Toxicology and Pharmaceutics","Score":null,"Total":0}
Identification of disease-related nsSNPs via the integration of protein sequence features and domain-domain interaction data.
Recent studies have suggested the common disease-rare variant (CD-RV) hypothesis in the mapping of disease-related genetic variants and have proposed a number of statistical methods to detect associations between rare variants and human inherited diseases. However, most of these methods take the selection of functional variants as a preliminary step in order to maximise the power of statistical tests. To meet this end, we put forward a filtration approach to identify genetic variants that are potentially associated with a query disease of interest from the perspective of one-class novelty learning. We propose to prioritise candidate non-synonymous single nucleotide polymorphisms (nsSNPs) relying on the integrated use of two sequence conservation properties of amino acids calculated from multiple sequence alignment of protein sequences and one functional similarity measure derived from domain-domain interaction data. We show the power of this approach in the detection of disease-related nsSNP via large-scale leave-one-out cross-validation experiments.