{"title":"利用二维描述符对香精香料化合物的保留指数进行智能共识预测","authors":"Doelima Bera, Ankur Kumar, Joyita Roy, Kunal Roy","doi":"10.1007/s10337-024-04349-5","DOIUrl":null,"url":null,"abstract":"<div><p>The demand for novel flavors and fragrance (F&F) compounds has increased, highlighting the need for a systematic design approach. Currently, the F&F industry relies heavily on experimental approaches without considering the potential consequences of altering the features that contribute to the fragrance of the compound. In silico approaches have great potential to identify the necessary features for creating novel F&F compounds. In the present study, Quantitative Structure–Property Relationship (QSPR) models were developed using 1208 compounds and simple 2D descriptors, focusing on the RI (retention index) as the endpoint to predict the olfactory properties of molecules. Feature selection was initially carried out by multi-layered stepwise regression followed by feature thinning using the Genetic Algorithm (GA) and optimal feature combination selection using the BSS (best subset selection) method. Final models were developed using the Partial Least Squares (PLS) method. Additionally, internal and external validation of the models was performed using different validation metrics suggesting that the developed models are reliable, predictive, reproducible, and robust. To enhance the external prediction of the developed models, an Intelligent Consensus Prediction (ICP) method was employed and <b>CM3</b> (consensus model 3) (best selection of predictions (compound-wise) from individual models) was found to provide the best predictivity. The modeling descriptors suggested that the hydrophobicity, high molecular weight, aromaticity, and presence of large-size fragments (high percentage of carbon) enhance the RI values. Conversely, polarity and hydrophilicity decrease the RI values. This study can be used to optimize the stationary phase according to the flavor and fragrance compounds to obtain the desired retention index (RI values).</p><h3>Graphical abstract</h3>\n<div><figure><div><div><picture><source><img></source></picture></div></div></figure></div></div>","PeriodicalId":518,"journal":{"name":"Chromatographia","volume":"87 9","pages":"581 - 595"},"PeriodicalIF":1.2000,"publicationDate":"2024-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Intelligent Consensus Predictions of the Retention Index of Flavor and Fragrance Compounds Using 2D Descriptors\",\"authors\":\"Doelima Bera, Ankur Kumar, Joyita Roy, Kunal Roy\",\"doi\":\"10.1007/s10337-024-04349-5\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>The demand for novel flavors and fragrance (F&F) compounds has increased, highlighting the need for a systematic design approach. Currently, the F&F industry relies heavily on experimental approaches without considering the potential consequences of altering the features that contribute to the fragrance of the compound. In silico approaches have great potential to identify the necessary features for creating novel F&F compounds. In the present study, Quantitative Structure–Property Relationship (QSPR) models were developed using 1208 compounds and simple 2D descriptors, focusing on the RI (retention index) as the endpoint to predict the olfactory properties of molecules. Feature selection was initially carried out by multi-layered stepwise regression followed by feature thinning using the Genetic Algorithm (GA) and optimal feature combination selection using the BSS (best subset selection) method. Final models were developed using the Partial Least Squares (PLS) method. Additionally, internal and external validation of the models was performed using different validation metrics suggesting that the developed models are reliable, predictive, reproducible, and robust. To enhance the external prediction of the developed models, an Intelligent Consensus Prediction (ICP) method was employed and <b>CM3</b> (consensus model 3) (best selection of predictions (compound-wise) from individual models) was found to provide the best predictivity. The modeling descriptors suggested that the hydrophobicity, high molecular weight, aromaticity, and presence of large-size fragments (high percentage of carbon) enhance the RI values. Conversely, polarity and hydrophilicity decrease the RI values. This study can be used to optimize the stationary phase according to the flavor and fragrance compounds to obtain the desired retention index (RI values).</p><h3>Graphical abstract</h3>\\n<div><figure><div><div><picture><source><img></source></picture></div></div></figure></div></div>\",\"PeriodicalId\":518,\"journal\":{\"name\":\"Chromatographia\",\"volume\":\"87 9\",\"pages\":\"581 - 595\"},\"PeriodicalIF\":1.2000,\"publicationDate\":\"2024-07-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Chromatographia\",\"FirstCategoryId\":\"92\",\"ListUrlMain\":\"https://link.springer.com/article/10.1007/s10337-024-04349-5\",\"RegionNum\":4,\"RegionCategory\":\"化学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q4\",\"JCRName\":\"BIOCHEMICAL RESEARCH METHODS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Chromatographia","FirstCategoryId":"92","ListUrlMain":"https://link.springer.com/article/10.1007/s10337-024-04349-5","RegionNum":4,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"BIOCHEMICAL RESEARCH METHODS","Score":null,"Total":0}
引用次数: 0
摘要
对新型香精香料(F&F)化合物的需求与日俱增,凸显了对系统设计方法的需求。目前,F&F 行业主要依赖实验方法,而不考虑改变化合物香味特征的潜在后果。硅学方法具有巨大的潜力,可用于识别创造新型香料和香精化合物的必要特征。在本研究中,利用 1208 种化合物和简单的二维描述符建立了定量结构-性质关系(QSPR)模型,并将 RI(保留指数)作为预测分子嗅觉特性的终点。首先通过多层逐步回归法进行特征选择,然后使用遗传算法(GA)对特征进行精简,并使用最佳子集选择法(BSS)选择最佳特征组合。最后使用偏最小二乘法(PLS)建立模型。此外,还使用不同的验证指标对模型进行了内部和外部验证,表明所开发的模型是可靠的、可预测的、可重现的和稳健的。为了提高所开发模型的外部预测能力,采用了智能共识预测(ICP)方法,发现 CM3(共识模型 3)(从单个模型中选择最佳预测(复合预测))提供了最佳预测能力。建模描述符表明,疏水性、高分子量、芳香性和大尺寸片段(高碳百分比)的存在提高了 RI 值。相反,极性和亲水性会降低 RI 值。这项研究可用于根据香精香料化合物优化固定相,以获得所需的保留指数(RI 值)。
Intelligent Consensus Predictions of the Retention Index of Flavor and Fragrance Compounds Using 2D Descriptors
The demand for novel flavors and fragrance (F&F) compounds has increased, highlighting the need for a systematic design approach. Currently, the F&F industry relies heavily on experimental approaches without considering the potential consequences of altering the features that contribute to the fragrance of the compound. In silico approaches have great potential to identify the necessary features for creating novel F&F compounds. In the present study, Quantitative Structure–Property Relationship (QSPR) models were developed using 1208 compounds and simple 2D descriptors, focusing on the RI (retention index) as the endpoint to predict the olfactory properties of molecules. Feature selection was initially carried out by multi-layered stepwise regression followed by feature thinning using the Genetic Algorithm (GA) and optimal feature combination selection using the BSS (best subset selection) method. Final models were developed using the Partial Least Squares (PLS) method. Additionally, internal and external validation of the models was performed using different validation metrics suggesting that the developed models are reliable, predictive, reproducible, and robust. To enhance the external prediction of the developed models, an Intelligent Consensus Prediction (ICP) method was employed and CM3 (consensus model 3) (best selection of predictions (compound-wise) from individual models) was found to provide the best predictivity. The modeling descriptors suggested that the hydrophobicity, high molecular weight, aromaticity, and presence of large-size fragments (high percentage of carbon) enhance the RI values. Conversely, polarity and hydrophilicity decrease the RI values. This study can be used to optimize the stationary phase according to the flavor and fragrance compounds to obtain the desired retention index (RI values).
期刊介绍:
Separation sciences, in all their various forms such as chromatography, field-flow fractionation, and electrophoresis, provide some of the most powerful techniques in analytical chemistry and are applied within a number of important application areas, including archaeology, biotechnology, clinical, environmental, food, medical, petroleum, pharmaceutical, polymer and biopolymer research. Beyond serving analytical purposes, separation techniques are also used for preparative and process-scale applications. The scope and power of separation sciences is significantly extended by combination with spectroscopic detection methods (e.g., laser-based approaches, nuclear-magnetic resonance, Raman, chemiluminescence) and particularly, mass spectrometry, to create hyphenated techniques. In addition to exciting new developments in chromatography, such as ultra high-pressure systems, multidimensional separations, and high-temperature approaches, there have also been great advances in hybrid methods combining chromatography and electro-based separations, especially on the micro- and nanoscale. Integrated biological procedures (e.g., enzymatic, immunological, receptor-based assays) can also be part of the overall analytical process.