{"title":"Predicting Penetration Across the Blood-Brain Barrier A Rough Set Approach","authors":"Jianwen Fang, J. Grzymala-Busse","doi":"10.1109/GrC.2007.110","DOIUrl":null,"url":null,"abstract":"This paper reports on the results of experiments regarding a biomedical data set describing blood-brain barrier penetration ability of molecules. In this data set 415 cases represent organic compounds with known steady-state concentrations of a drug in the brain and blood. In our experiments we used two different discretization algorithms, based on agglomerative and divisive approaches of cluster analysis, respectively, and two different approaches to missing attribute values: deletion of cases with missing attribute values and deletion of attributes with missing values. Using ten-fold cross validation we concluded that the best strategy is based on a divisive approach of cluster analysis and deleting cases affected by missing attribute values. Moreover, prediction accuracy of this strategy is comparable with the other successful approaches reported in this area.","PeriodicalId":259430,"journal":{"name":"2007 IEEE International Conference on Granular Computing (GRC 2007)","volume":"201 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2007-11-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"10","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2007 IEEE International Conference on Granular Computing (GRC 2007)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/GrC.2007.110","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 10
Abstract
This paper reports on the results of experiments regarding a biomedical data set describing blood-brain barrier penetration ability of molecules. In this data set 415 cases represent organic compounds with known steady-state concentrations of a drug in the brain and blood. In our experiments we used two different discretization algorithms, based on agglomerative and divisive approaches of cluster analysis, respectively, and two different approaches to missing attribute values: deletion of cases with missing attribute values and deletion of attributes with missing values. Using ten-fold cross validation we concluded that the best strategy is based on a divisive approach of cluster analysis and deleting cases affected by missing attribute values. Moreover, prediction accuracy of this strategy is comparable with the other successful approaches reported in this area.