{"title":"Raman spectral feature extraction and analysis methods for olefin polymerization and cracking based on machine learning techniques.","authors":"Yaolan Yang, Jijiang Hu, Shaojie Zheng, Minghao Sun, Fujie Wang, Bogeng Li, Zhen Yao","doi":"10.1039/d4ay01882f","DOIUrl":null,"url":null,"abstract":"<p><p>The use of Raman spectroscopy for real-time gas monitoring has the advantages of response speed, high sensitivity and low cost. However, due to the overlap of each peak position, the Raman spectral data usually exhibit high dimensionality, complex nonlinear relationships and significant noise interference, which makes it difficult to directly determine the composition of the mixture using traditional data processing methods. This work focuses on the optimization of a machine learning model, XGBoost, for predicting gas composition based on Raman spectral data, enhancing predictive accuracy through three different feature extraction and feature selection methods. The superior performance of the XGBoost model is demonstrated by comparison with other machine learning models, including decision trees, random forests, support vector machines and neural networks, using the Raman spectrum of a gas mixture of hydrogen, ethylene, propylene and butene. The results show that XGBoost has better accuracy and generalization ability for quantitative analysis of Raman spectra, making it suitable for complex chemical process monitoring.</p>","PeriodicalId":64,"journal":{"name":"Analytical Methods","volume":" ","pages":""},"PeriodicalIF":2.7000,"publicationDate":"2025-02-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Analytical Methods","FirstCategoryId":"92","ListUrlMain":"https://doi.org/10.1039/d4ay01882f","RegionNum":3,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"CHEMISTRY, ANALYTICAL","Score":null,"Total":0}
引用次数: 0
Abstract
The use of Raman spectroscopy for real-time gas monitoring has the advantages of response speed, high sensitivity and low cost. However, due to the overlap of each peak position, the Raman spectral data usually exhibit high dimensionality, complex nonlinear relationships and significant noise interference, which makes it difficult to directly determine the composition of the mixture using traditional data processing methods. This work focuses on the optimization of a machine learning model, XGBoost, for predicting gas composition based on Raman spectral data, enhancing predictive accuracy through three different feature extraction and feature selection methods. The superior performance of the XGBoost model is demonstrated by comparison with other machine learning models, including decision trees, random forests, support vector machines and neural networks, using the Raman spectrum of a gas mixture of hydrogen, ethylene, propylene and butene. The results show that XGBoost has better accuracy and generalization ability for quantitative analysis of Raman spectra, making it suitable for complex chemical process monitoring.