Ahmad Rouhollahi, Hooshang Shafieyan, Jahan Bakhsh Ghasemi
{"title":"A QSPR Study on the GC Retention Times of a Series of Fatty, Dicarboxylic and Amino Acids by MLR and ANN","authors":"Ahmad Rouhollahi, Hooshang Shafieyan, Jahan Bakhsh Ghasemi","doi":"10.1002/adic.200790077","DOIUrl":null,"url":null,"abstract":"<p>Quantitative structure–property relationship (QSPR) analysis has been carried out to a series of fatty, amino and dicarboxylic acids to model their GC retention times. A genetic partial least square method (GAPLS) was applied as a variable selection tool. Modeling of retention times of these compounds as a function of the theoretically derived descriptors was established by multiple linear regression (MLR) and artificial neural network (ANN). The neural network employed here is a connected back-propagation system with a 3-4-1 architecture. Three topological indices for these compounds, namely, mean information index on atomic composition (AAC), average connectivity index chi-0 (X0A) and total information index of atomic composition (IAC) taken as inputs for the regression models. The results indicate that the GA is a very effective variable selection approach for QSPR analysis. The comparison of the two regression methods used showed that ANN has better prediction ability than MLR. The statistical figure of merits of the two models showed the successful modeling of the retention times with molecular descriptors.</p>","PeriodicalId":8193,"journal":{"name":"Annali di chimica","volume":"97 9","pages":"925-933"},"PeriodicalIF":0.0000,"publicationDate":"2007-07-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1002/adic.200790077","citationCount":"7","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Annali di chimica","FirstCategoryId":"1085","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/adic.200790077","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 7
Abstract
Quantitative structure–property relationship (QSPR) analysis has been carried out to a series of fatty, amino and dicarboxylic acids to model their GC retention times. A genetic partial least square method (GAPLS) was applied as a variable selection tool. Modeling of retention times of these compounds as a function of the theoretically derived descriptors was established by multiple linear regression (MLR) and artificial neural network (ANN). The neural network employed here is a connected back-propagation system with a 3-4-1 architecture. Three topological indices for these compounds, namely, mean information index on atomic composition (AAC), average connectivity index chi-0 (X0A) and total information index of atomic composition (IAC) taken as inputs for the regression models. The results indicate that the GA is a very effective variable selection approach for QSPR analysis. The comparison of the two regression methods used showed that ANN has better prediction ability than MLR. The statistical figure of merits of the two models showed the successful modeling of the retention times with molecular descriptors.