{"title":"Computational Model For Chromatographic Relative Retention Time of Polychlorinated Biphenyls Using Sub-structural Molecular Fragments","authors":"S. Saaidpour","doi":"10.12921/CMST.2016.22.01.004","DOIUrl":null,"url":null,"abstract":"Quantitative structure-retention relationship (QSRR) analysis is a useful technique capable of relating chromato- graphic retention time to the chemical structure of a solute. Using the sub-structural molecular fragments (SMF) derived directly from the molecular structures, the gas chromatographic relative retention times (RRTs) of 209 polychlorinated biphenyls (PCBs) on the SE-54 stationary phase were calculated. An eight-variable regression equation with the correlation coefficient of 0.9945 and the root mean square errors of 0.0134 was developed. Forward and backward stepwise regression variable selection and multi-linear regression analysis (MLRA) are combined to describe the effect of molecular structure on the RRT of PCB according to the QSRR method. To quantitatively relate RRT with the molecular structure MLR analysis is performed on the set of 163 sub-structural molecular fragments (SMF) provided by the ISIDA software. The eight fragments selected by variable subset selection, all belonging to the sub-fragments, adequately represent the structural factors influencing the affinity of PCB to SE-54 stationary phase in the separation process. Finally, a QSRR model is selected based on leave-one-out cross-validation and its prediction ability is further tested on 42 representative compounds excluded from model calibration. The prediction results from the MLR model are in good agreement with the experimental values. By applying the MLR method we can predict the test set with squared cross validated correlation coefficient (Q 2) of 0.9913 and root mean square error (RMSE) of 0.0169.","PeriodicalId":10561,"journal":{"name":"computational methods in science and technology","volume":"45 1","pages":"41-53"},"PeriodicalIF":0.0000,"publicationDate":"2016-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"computational methods in science and technology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.12921/CMST.2016.22.01.004","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
Quantitative structure-retention relationship (QSRR) analysis is a useful technique capable of relating chromato- graphic retention time to the chemical structure of a solute. Using the sub-structural molecular fragments (SMF) derived directly from the molecular structures, the gas chromatographic relative retention times (RRTs) of 209 polychlorinated biphenyls (PCBs) on the SE-54 stationary phase were calculated. An eight-variable regression equation with the correlation coefficient of 0.9945 and the root mean square errors of 0.0134 was developed. Forward and backward stepwise regression variable selection and multi-linear regression analysis (MLRA) are combined to describe the effect of molecular structure on the RRT of PCB according to the QSRR method. To quantitatively relate RRT with the molecular structure MLR analysis is performed on the set of 163 sub-structural molecular fragments (SMF) provided by the ISIDA software. The eight fragments selected by variable subset selection, all belonging to the sub-fragments, adequately represent the structural factors influencing the affinity of PCB to SE-54 stationary phase in the separation process. Finally, a QSRR model is selected based on leave-one-out cross-validation and its prediction ability is further tested on 42 representative compounds excluded from model calibration. The prediction results from the MLR model are in good agreement with the experimental values. By applying the MLR method we can predict the test set with squared cross validated correlation coefficient (Q 2) of 0.9913 and root mean square error (RMSE) of 0.0169.