Jorge S. S. Júnior;Jérôme Mendes;Francisco Souza;Cristiano Premebida
{"title":"Distilling Complex Knowledge Into Explainable T–S Fuzzy Systems","authors":"Jorge S. S. Júnior;Jérôme Mendes;Francisco Souza;Cristiano Premebida","doi":"10.1109/TFUZZ.2024.3506122","DOIUrl":null,"url":null,"abstract":"This article introduces a novel method for distilling knowledge from complex models using fuzzy systems. The complex knowledge comes from a proposed hybrid NFN-LSTM model (teacher) composed of a long shor-term memory (LSTM) coupled to a neo-fuzzy neuron (NFN) structure. The proposed student model, the NFN-MOD, is an explainable Takagi–Sugeno fuzzy model that resembles modular characteristics to mimic the temporal memory of the LSTM part in the teacher model. The NFN-MOD is adaptable across many scenarios, including solo learning (without a teacher), with the estimation of a previously trained teacher, or training in parallel with the teacher. The complexity reduction of the student model is achieved through the pruning of its consequent parameters with the lowest L1-norm. Application of NFN-MOD in industrial case studies (sulfur recovery unit and cement manufacturing process) demonstrates the efficiency of NFN-MOD in distilling complex knowledge from the teacher model NFN-LSTM, with emphasis on parallel training and parameter pruning. In addition, a novel explainability analysis is introduced, which evaluates the influence of antecedent parameters of the student model in relation to the expected real system output.","PeriodicalId":13212,"journal":{"name":"IEEE Transactions on Fuzzy Systems","volume":"33 3","pages":"1037-1048"},"PeriodicalIF":10.7000,"publicationDate":"2024-11-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Fuzzy Systems","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10767755/","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
This article introduces a novel method for distilling knowledge from complex models using fuzzy systems. The complex knowledge comes from a proposed hybrid NFN-LSTM model (teacher) composed of a long shor-term memory (LSTM) coupled to a neo-fuzzy neuron (NFN) structure. The proposed student model, the NFN-MOD, is an explainable Takagi–Sugeno fuzzy model that resembles modular characteristics to mimic the temporal memory of the LSTM part in the teacher model. The NFN-MOD is adaptable across many scenarios, including solo learning (without a teacher), with the estimation of a previously trained teacher, or training in parallel with the teacher. The complexity reduction of the student model is achieved through the pruning of its consequent parameters with the lowest L1-norm. Application of NFN-MOD in industrial case studies (sulfur recovery unit and cement manufacturing process) demonstrates the efficiency of NFN-MOD in distilling complex knowledge from the teacher model NFN-LSTM, with emphasis on parallel training and parameter pruning. In addition, a novel explainability analysis is introduced, which evaluates the influence of antecedent parameters of the student model in relation to the expected real system output.
期刊介绍:
The IEEE Transactions on Fuzzy Systems is a scholarly journal that focuses on the theory, design, and application of fuzzy systems. It aims to publish high-quality technical papers that contribute significant technical knowledge and exploratory developments in the field of fuzzy systems. The journal particularly emphasizes engineering systems and scientific applications. In addition to research articles, the Transactions also includes a letters section featuring current information, comments, and rebuttals related to published papers.