{"title":"微阵列预处理广义PDNN模型的实现与应用","authors":"Wei Wei, Lin Wan, M. Qian, Minghua Deng","doi":"10.1109/BMEI.2009.5302030","DOIUrl":null,"url":null,"abstract":"The preprocessing of the Microarray data is a hot topic in the bioinformatics research. The key point of a successful preprocessing method is to remove the noise of nonspecific binding and to keep the information of specific binding as much as possible. One way to solve these problems is to understand the principle of the binding between probes and target sequences, and to distinguish specific binding from nonspecific binding correctly. In this paper, we introduce MM probe intensities into position dependent nearest neighbor (PDNN) model, which contain much information of nonspecific binding.We use two-step model to estimate the parameters,which can simplify the computation. Based on the Wilcoxon rank test, we can determine whether a gene is present, with which we can obtain the training data set for the specific binding and non specific binding parameters. We also apply our model to gene expression data (HGU133plus2.0 and HGU133A) . We find that all these improvements increase the precision and stability, and show better result compared to the other four methods( Mas5.0, dChip, RMA and PDNN ).","PeriodicalId":6389,"journal":{"name":"2009 2nd International Conference on Biomedical Engineering and Informatics","volume":"28 1","pages":"1-5"},"PeriodicalIF":0.0000,"publicationDate":"2009-10-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"The Implementation and Application of the Microarray Preprocessing Generalized PDNN Model\",\"authors\":\"Wei Wei, Lin Wan, M. Qian, Minghua Deng\",\"doi\":\"10.1109/BMEI.2009.5302030\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The preprocessing of the Microarray data is a hot topic in the bioinformatics research. The key point of a successful preprocessing method is to remove the noise of nonspecific binding and to keep the information of specific binding as much as possible. One way to solve these problems is to understand the principle of the binding between probes and target sequences, and to distinguish specific binding from nonspecific binding correctly. In this paper, we introduce MM probe intensities into position dependent nearest neighbor (PDNN) model, which contain much information of nonspecific binding.We use two-step model to estimate the parameters,which can simplify the computation. Based on the Wilcoxon rank test, we can determine whether a gene is present, with which we can obtain the training data set for the specific binding and non specific binding parameters. We also apply our model to gene expression data (HGU133plus2.0 and HGU133A) . We find that all these improvements increase the precision and stability, and show better result compared to the other four methods( Mas5.0, dChip, RMA and PDNN ).\",\"PeriodicalId\":6389,\"journal\":{\"name\":\"2009 2nd International Conference on Biomedical Engineering and Informatics\",\"volume\":\"28 1\",\"pages\":\"1-5\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2009-10-30\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2009 2nd International Conference on Biomedical Engineering and Informatics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/BMEI.2009.5302030\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2009 2nd International Conference on Biomedical Engineering and Informatics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/BMEI.2009.5302030","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
The Implementation and Application of the Microarray Preprocessing Generalized PDNN Model
The preprocessing of the Microarray data is a hot topic in the bioinformatics research. The key point of a successful preprocessing method is to remove the noise of nonspecific binding and to keep the information of specific binding as much as possible. One way to solve these problems is to understand the principle of the binding between probes and target sequences, and to distinguish specific binding from nonspecific binding correctly. In this paper, we introduce MM probe intensities into position dependent nearest neighbor (PDNN) model, which contain much information of nonspecific binding.We use two-step model to estimate the parameters,which can simplify the computation. Based on the Wilcoxon rank test, we can determine whether a gene is present, with which we can obtain the training data set for the specific binding and non specific binding parameters. We also apply our model to gene expression data (HGU133plus2.0 and HGU133A) . We find that all these improvements increase the precision and stability, and show better result compared to the other four methods( Mas5.0, dChip, RMA and PDNN ).