Asya Hristozova, Margarita Batmazyan, K. Simitchiev, Slava Tsoneva, V. Kmetov, Erwin Rosenberg
{"title":"顶空 - 固相微萃取与液相进样 GC-MS 分析精油:通过多元线性回归预测线性保留指数","authors":"Asya Hristozova, Margarita Batmazyan, K. Simitchiev, Slava Tsoneva, V. Kmetov, Erwin Rosenberg","doi":"10.1556/1326.2024.01207","DOIUrl":null,"url":null,"abstract":"Due to the relative independence from the operational parameters, the linear retention indices (LRIs) are useful tool in gas chromatography-mass spectrometry (GC-MS) qualitative analysis. The aim of the current study was to develop a multiple linear regression (MLR) model for the prediction of LRIs as a function of selected molecular descriptors. Liquid injection GC-MS was used for the analysis of Essential oils (Rose, Lavender and Peppermint) separating the ingredients by a semi-standard non-polar stationary phase. As a result, a sum of 103 compounds were identified and their experimental LRIs were derived relying on reference measurements of a standard mixture of n-alkanes (from C8 to C20). As a next step, a set of molecular descriptors was generated for the distinguished chemical structures. Further, a stepwise MLR was applied for the selection of the significant descriptors (variables) which can be used to predict the LRIs. From an exploit set of over 2000 molecular descriptors, it was found that only 16 can be regarded as significant and independent variables. At this point split validation was applied: the identified compounds were randomly divided into training (85%) and validation (15%) sets. The training set (87 compounds) was used to derive two MLR models by applying i) the ‘enter’ algorithm (R2 = 0.9960, RMSЕ = 17) and ii) the ‘stepwise’ one (R2 = 0.9958, RMSЕ = 17). The predictive power was assessed by the validation set (16 compounds) as follows i) q2F1 = 0.9896, RMSE = 25 and ii) q2F1 = 0.9886, RMSE = 26, respectively. The adequateness of both regression approaches was further evaluated. Newly developed headspace-solid phase microextraction (HS-SPME) procedures in combination with GC-MS were used for an alternative analysis of the studied Essential oils. Twelve additional compounds, not previously detected by the liquid sample introduction mode of analysis, were identified for which the values of the significant descriptors were within the working range of the developed MLRs. For the last compounds, the LRIs were calculated and the experimental data was used as an external set for assessment of the regression models. The predictive power for both regression approaches was assessed as follows: Enter RMSE = 41, q2F2 = 0.9503 and Stepwise RMSE = 40, q2F2 = 0.9521.","PeriodicalId":1,"journal":{"name":"Accounts of Chemical Research","volume":"25 11","pages":""},"PeriodicalIF":17.7000,"publicationDate":"2024-02-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Headspace – Solid phase microextraction vs liquid injection GC-MS analysis of essential oils: Prediction of linear retention indices by multiple linear regression\",\"authors\":\"Asya Hristozova, Margarita Batmazyan, K. Simitchiev, Slava Tsoneva, V. Kmetov, Erwin Rosenberg\",\"doi\":\"10.1556/1326.2024.01207\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Due to the relative independence from the operational parameters, the linear retention indices (LRIs) are useful tool in gas chromatography-mass spectrometry (GC-MS) qualitative analysis. The aim of the current study was to develop a multiple linear regression (MLR) model for the prediction of LRIs as a function of selected molecular descriptors. Liquid injection GC-MS was used for the analysis of Essential oils (Rose, Lavender and Peppermint) separating the ingredients by a semi-standard non-polar stationary phase. As a result, a sum of 103 compounds were identified and their experimental LRIs were derived relying on reference measurements of a standard mixture of n-alkanes (from C8 to C20). As a next step, a set of molecular descriptors was generated for the distinguished chemical structures. Further, a stepwise MLR was applied for the selection of the significant descriptors (variables) which can be used to predict the LRIs. From an exploit set of over 2000 molecular descriptors, it was found that only 16 can be regarded as significant and independent variables. At this point split validation was applied: the identified compounds were randomly divided into training (85%) and validation (15%) sets. The training set (87 compounds) was used to derive two MLR models by applying i) the ‘enter’ algorithm (R2 = 0.9960, RMSЕ = 17) and ii) the ‘stepwise’ one (R2 = 0.9958, RMSЕ = 17). The predictive power was assessed by the validation set (16 compounds) as follows i) q2F1 = 0.9896, RMSE = 25 and ii) q2F1 = 0.9886, RMSE = 26, respectively. The adequateness of both regression approaches was further evaluated. Newly developed headspace-solid phase microextraction (HS-SPME) procedures in combination with GC-MS were used for an alternative analysis of the studied Essential oils. Twelve additional compounds, not previously detected by the liquid sample introduction mode of analysis, were identified for which the values of the significant descriptors were within the working range of the developed MLRs. For the last compounds, the LRIs were calculated and the experimental data was used as an external set for assessment of the regression models. The predictive power for both regression approaches was assessed as follows: Enter RMSE = 41, q2F2 = 0.9503 and Stepwise RMSE = 40, q2F2 = 0.9521.\",\"PeriodicalId\":1,\"journal\":{\"name\":\"Accounts of Chemical Research\",\"volume\":\"25 11\",\"pages\":\"\"},\"PeriodicalIF\":17.7000,\"publicationDate\":\"2024-02-23\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Accounts of Chemical Research\",\"FirstCategoryId\":\"92\",\"ListUrlMain\":\"https://doi.org/10.1556/1326.2024.01207\",\"RegionNum\":1,\"RegionCategory\":\"化学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"CHEMISTRY, MULTIDISCIPLINARY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Accounts of Chemical Research","FirstCategoryId":"92","ListUrlMain":"https://doi.org/10.1556/1326.2024.01207","RegionNum":1,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CHEMISTRY, MULTIDISCIPLINARY","Score":null,"Total":0}
Headspace – Solid phase microextraction vs liquid injection GC-MS analysis of essential oils: Prediction of linear retention indices by multiple linear regression
Due to the relative independence from the operational parameters, the linear retention indices (LRIs) are useful tool in gas chromatography-mass spectrometry (GC-MS) qualitative analysis. The aim of the current study was to develop a multiple linear regression (MLR) model for the prediction of LRIs as a function of selected molecular descriptors. Liquid injection GC-MS was used for the analysis of Essential oils (Rose, Lavender and Peppermint) separating the ingredients by a semi-standard non-polar stationary phase. As a result, a sum of 103 compounds were identified and their experimental LRIs were derived relying on reference measurements of a standard mixture of n-alkanes (from C8 to C20). As a next step, a set of molecular descriptors was generated for the distinguished chemical structures. Further, a stepwise MLR was applied for the selection of the significant descriptors (variables) which can be used to predict the LRIs. From an exploit set of over 2000 molecular descriptors, it was found that only 16 can be regarded as significant and independent variables. At this point split validation was applied: the identified compounds were randomly divided into training (85%) and validation (15%) sets. The training set (87 compounds) was used to derive two MLR models by applying i) the ‘enter’ algorithm (R2 = 0.9960, RMSЕ = 17) and ii) the ‘stepwise’ one (R2 = 0.9958, RMSЕ = 17). The predictive power was assessed by the validation set (16 compounds) as follows i) q2F1 = 0.9896, RMSE = 25 and ii) q2F1 = 0.9886, RMSE = 26, respectively. The adequateness of both regression approaches was further evaluated. Newly developed headspace-solid phase microextraction (HS-SPME) procedures in combination with GC-MS were used for an alternative analysis of the studied Essential oils. Twelve additional compounds, not previously detected by the liquid sample introduction mode of analysis, were identified for which the values of the significant descriptors were within the working range of the developed MLRs. For the last compounds, the LRIs were calculated and the experimental data was used as an external set for assessment of the regression models. The predictive power for both regression approaches was assessed as follows: Enter RMSE = 41, q2F2 = 0.9503 and Stepwise RMSE = 40, q2F2 = 0.9521.
期刊介绍:
Accounts of Chemical Research presents short, concise and critical articles offering easy-to-read overviews of basic research and applications in all areas of chemistry and biochemistry. These short reviews focus on research from the author’s own laboratory and are designed to teach the reader about a research project. In addition, Accounts of Chemical Research publishes commentaries that give an informed opinion on a current research problem. Special Issues online are devoted to a single topic of unusual activity and significance.
Accounts of Chemical Research replaces the traditional article abstract with an article "Conspectus." These entries synopsize the research affording the reader a closer look at the content and significance of an article. Through this provision of a more detailed description of the article contents, the Conspectus enhances the article's discoverability by search engines and the exposure for the research.