Jorge E. Cote-Ballesteros, Victor Hugo Grisales Palacios, Jhon Edisson Rodriguez-Castellanos
{"title":"数据驱动工业软测量中基于互信息的混合方法变量选择算法","authors":"Jorge E. Cote-Ballesteros, Victor Hugo Grisales Palacios, Jhon Edisson Rodriguez-Castellanos","doi":"10.18359/rcin.5644","DOIUrl":null,"url":null,"abstract":"\n \n \n \nThe development of virtual sensors predicting the desired output requires a careful selection of input variables for model construction. In an industrial environment, datasets contain many instrumentation system measures; however, these variables are often non-relevant or excessive information. This paper proposes a variable selection algorithm based on mutual information examination, redundancy analysis, and variable reduction for soft-sensor modeling. A relevance calculation is performed in the first stage to select important variables using the mutual information criterion. Then, the detection and exclusion of redundant variables are carried out, penalizing undesired variables. Finally, the most relevant variables subset is determined through a wrapper method using Mallowssans' Cp metric to assess the fitting prediction performance. The approach was successfully applied to estimate the ethanol concentration for a distillation column process using an adaptive network-based fuzzy inference system architecture as a non-linear dynamic regression model. A comparative study was performed considering the application of correlation analysis and the method proposed in this study. Simulation results show the effectiveness of the proposed approach in the variable selection providing a reduction in search of suitable models that achieve faster results for developing soft sensors oriented to industrial applications. \n \n \n \n","PeriodicalId":31201,"journal":{"name":"Ciencia e Ingenieria Neogranadina","volume":"1 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2022-06-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"A Hybrid Approach Variable Selection Algorithm Based on Mutual Information for Data-Driven Industrial Soft-Sensor Applications\",\"authors\":\"Jorge E. Cote-Ballesteros, Victor Hugo Grisales Palacios, Jhon Edisson Rodriguez-Castellanos\",\"doi\":\"10.18359/rcin.5644\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"\\n \\n \\n \\nThe development of virtual sensors predicting the desired output requires a careful selection of input variables for model construction. In an industrial environment, datasets contain many instrumentation system measures; however, these variables are often non-relevant or excessive information. This paper proposes a variable selection algorithm based on mutual information examination, redundancy analysis, and variable reduction for soft-sensor modeling. A relevance calculation is performed in the first stage to select important variables using the mutual information criterion. Then, the detection and exclusion of redundant variables are carried out, penalizing undesired variables. Finally, the most relevant variables subset is determined through a wrapper method using Mallowssans' Cp metric to assess the fitting prediction performance. The approach was successfully applied to estimate the ethanol concentration for a distillation column process using an adaptive network-based fuzzy inference system architecture as a non-linear dynamic regression model. A comparative study was performed considering the application of correlation analysis and the method proposed in this study. Simulation results show the effectiveness of the proposed approach in the variable selection providing a reduction in search of suitable models that achieve faster results for developing soft sensors oriented to industrial applications. \\n \\n \\n \\n\",\"PeriodicalId\":31201,\"journal\":{\"name\":\"Ciencia e Ingenieria Neogranadina\",\"volume\":\"1 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-06-03\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Ciencia e Ingenieria Neogranadina\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.18359/rcin.5644\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Ciencia e Ingenieria Neogranadina","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.18359/rcin.5644","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
A Hybrid Approach Variable Selection Algorithm Based on Mutual Information for Data-Driven Industrial Soft-Sensor Applications
The development of virtual sensors predicting the desired output requires a careful selection of input variables for model construction. In an industrial environment, datasets contain many instrumentation system measures; however, these variables are often non-relevant or excessive information. This paper proposes a variable selection algorithm based on mutual information examination, redundancy analysis, and variable reduction for soft-sensor modeling. A relevance calculation is performed in the first stage to select important variables using the mutual information criterion. Then, the detection and exclusion of redundant variables are carried out, penalizing undesired variables. Finally, the most relevant variables subset is determined through a wrapper method using Mallowssans' Cp metric to assess the fitting prediction performance. The approach was successfully applied to estimate the ethanol concentration for a distillation column process using an adaptive network-based fuzzy inference system architecture as a non-linear dynamic regression model. A comparative study was performed considering the application of correlation analysis and the method proposed in this study. Simulation results show the effectiveness of the proposed approach in the variable selection providing a reduction in search of suitable models that achieve faster results for developing soft sensors oriented to industrial applications.