K. Neophytou, C. Nicolaou, C. Pattichis, C. Schizas
{"title":"Deriving Quantitative Structure-Activity Relationship Models Using Genetic Programming for Drug Discovery","authors":"K. Neophytou, C. Nicolaou, C. Pattichis, C. Schizas","doi":"10.1109/ITAB.2007.4407401","DOIUrl":null,"url":null,"abstract":"Genetic Programming is a heuristic search algorithm inspired by evolutionary techniques that has been shown to produce satisfactory solutions to problems related to several scientific domains [1]. Presented here is a methodology for the creation of Quantitative Structure-Activity Relationship (QSAR) models for the prediction of chemical activity, using Genetic Programming. QSAR analysis is crucial for drug discovery since good QSAR models enable human experts to select compounds with increased chances of being active for further investigations. Our technique has been tested using the Selwood dataset, a benchmark dataset for the QSAR field [2]. The results indicate that the QSAR models created are accurate, reliable and simple and can thus be used to identify molecular descriptors correlated with measured activity and for the prediction of the activity of untested molecules. The QSAR models we generated predict the activity of untested molecules with an error ranging between 0.46 -0.8 on the scale [-1,1]. These results compare favourably with results sited in the literature for the same dataset [3], [4], Our models are constructed using any combination of the arithmetic operators {+, -, /, *}, the descriptors available and constant values.","PeriodicalId":129874,"journal":{"name":"2007 6th International Special Topic Conference on Information Technology Applications in Biomedicine","volume":"64 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2007-12-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2007 6th International Special Topic Conference on Information Technology Applications in Biomedicine","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ITAB.2007.4407401","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
Genetic Programming is a heuristic search algorithm inspired by evolutionary techniques that has been shown to produce satisfactory solutions to problems related to several scientific domains [1]. Presented here is a methodology for the creation of Quantitative Structure-Activity Relationship (QSAR) models for the prediction of chemical activity, using Genetic Programming. QSAR analysis is crucial for drug discovery since good QSAR models enable human experts to select compounds with increased chances of being active for further investigations. Our technique has been tested using the Selwood dataset, a benchmark dataset for the QSAR field [2]. The results indicate that the QSAR models created are accurate, reliable and simple and can thus be used to identify molecular descriptors correlated with measured activity and for the prediction of the activity of untested molecules. The QSAR models we generated predict the activity of untested molecules with an error ranging between 0.46 -0.8 on the scale [-1,1]. These results compare favourably with results sited in the literature for the same dataset [3], [4], Our models are constructed using any combination of the arithmetic operators {+, -, /, *}, the descriptors available and constant values.