{"title":"Intelligent Consensus Predictions of the Retention Index of Flavor and Fragrance Compounds Using 2D Descriptors","authors":"Doelima Bera, Ankur Kumar, Joyita Roy, Kunal Roy","doi":"10.1007/s10337-024-04349-5","DOIUrl":null,"url":null,"abstract":"<div><p>The demand for novel flavors and fragrance (F&F) compounds has increased, highlighting the need for a systematic design approach. Currently, the F&F industry relies heavily on experimental approaches without considering the potential consequences of altering the features that contribute to the fragrance of the compound. In silico approaches have great potential to identify the necessary features for creating novel F&F compounds. In the present study, Quantitative Structure–Property Relationship (QSPR) models were developed using 1208 compounds and simple 2D descriptors, focusing on the RI (retention index) as the endpoint to predict the olfactory properties of molecules. Feature selection was initially carried out by multi-layered stepwise regression followed by feature thinning using the Genetic Algorithm (GA) and optimal feature combination selection using the BSS (best subset selection) method. Final models were developed using the Partial Least Squares (PLS) method. Additionally, internal and external validation of the models was performed using different validation metrics suggesting that the developed models are reliable, predictive, reproducible, and robust. To enhance the external prediction of the developed models, an Intelligent Consensus Prediction (ICP) method was employed and <b>CM3</b> (consensus model 3) (best selection of predictions (compound-wise) from individual models) was found to provide the best predictivity. The modeling descriptors suggested that the hydrophobicity, high molecular weight, aromaticity, and presence of large-size fragments (high percentage of carbon) enhance the RI values. Conversely, polarity and hydrophilicity decrease the RI values. This study can be used to optimize the stationary phase according to the flavor and fragrance compounds to obtain the desired retention index (RI values).</p><h3>Graphical abstract</h3>\n<div><figure><div><div><picture><source><img></source></picture></div></div></figure></div></div>","PeriodicalId":518,"journal":{"name":"Chromatographia","volume":"87 9","pages":"581 - 595"},"PeriodicalIF":1.2000,"publicationDate":"2024-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Chromatographia","FirstCategoryId":"92","ListUrlMain":"https://link.springer.com/article/10.1007/s10337-024-04349-5","RegionNum":4,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"BIOCHEMICAL RESEARCH METHODS","Score":null,"Total":0}
引用次数: 0
Abstract
The demand for novel flavors and fragrance (F&F) compounds has increased, highlighting the need for a systematic design approach. Currently, the F&F industry relies heavily on experimental approaches without considering the potential consequences of altering the features that contribute to the fragrance of the compound. In silico approaches have great potential to identify the necessary features for creating novel F&F compounds. In the present study, Quantitative Structure–Property Relationship (QSPR) models were developed using 1208 compounds and simple 2D descriptors, focusing on the RI (retention index) as the endpoint to predict the olfactory properties of molecules. Feature selection was initially carried out by multi-layered stepwise regression followed by feature thinning using the Genetic Algorithm (GA) and optimal feature combination selection using the BSS (best subset selection) method. Final models were developed using the Partial Least Squares (PLS) method. Additionally, internal and external validation of the models was performed using different validation metrics suggesting that the developed models are reliable, predictive, reproducible, and robust. To enhance the external prediction of the developed models, an Intelligent Consensus Prediction (ICP) method was employed and CM3 (consensus model 3) (best selection of predictions (compound-wise) from individual models) was found to provide the best predictivity. The modeling descriptors suggested that the hydrophobicity, high molecular weight, aromaticity, and presence of large-size fragments (high percentage of carbon) enhance the RI values. Conversely, polarity and hydrophilicity decrease the RI values. This study can be used to optimize the stationary phase according to the flavor and fragrance compounds to obtain the desired retention index (RI values).
期刊介绍:
Separation sciences, in all their various forms such as chromatography, field-flow fractionation, and electrophoresis, provide some of the most powerful techniques in analytical chemistry and are applied within a number of important application areas, including archaeology, biotechnology, clinical, environmental, food, medical, petroleum, pharmaceutical, polymer and biopolymer research. Beyond serving analytical purposes, separation techniques are also used for preparative and process-scale applications. The scope and power of separation sciences is significantly extended by combination with spectroscopic detection methods (e.g., laser-based approaches, nuclear-magnetic resonance, Raman, chemiluminescence) and particularly, mass spectrometry, to create hyphenated techniques. In addition to exciting new developments in chromatography, such as ultra high-pressure systems, multidimensional separations, and high-temperature approaches, there have also been great advances in hybrid methods combining chromatography and electro-based separations, especially on the micro- and nanoscale. Integrated biological procedures (e.g., enzymatic, immunological, receptor-based assays) can also be part of the overall analytical process.