Alnis Murtovi, Alexander Bainczyk, Gerrit Nolte, Maximilian Schlüter, Bernhard Steffen
{"title":"Forest GUMP: a tool for verification and explanation","authors":"Alnis Murtovi, Alexander Bainczyk, Gerrit Nolte, Maximilian Schlüter, Bernhard Steffen","doi":"10.1007/s10009-023-00702-5","DOIUrl":null,"url":null,"abstract":"Abstract In this paper, we present Forest GUMP (for Generalized, Unifying Merge Process) a tool for verification and precise explanation of Random forests. Besides pre/post-condition-based verification and equivalence checking, Forest GUMP also supports three concepts of explanation, the well-known model explanation and outcome explanation , as well as class characterization , i.e., the precise characterization of all samples that are equally classified. Key technology to achieve these results is algebraic aggregation, i.e., the transformation of a Random Forest into a semantically equivalent, concise white-box representation in terms of Algebraic Decision Diagrams (ADDs). The paper sketches the method and demonstrates the use of Forest GUMP along illustrative examples. This way readers should acquire an intuition about the tool, and the way how it should be used to increase the understanding not only of the considered dataset, but also of the character of Random Forests and the ADD technology, here enriched to comprise infeasible path elimination. As Forest GUMP is publicly available all experiments can be reproduced, modified, and complemented using any dataset that is available in the ARFF format.","PeriodicalId":14395,"journal":{"name":"International Journal on Software Tools for Technology Transfer","volume":"45 1","pages":"0"},"PeriodicalIF":1.1000,"publicationDate":"2023-05-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal on Software Tools for Technology Transfer","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1007/s10009-023-00702-5","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"COMPUTER SCIENCE, SOFTWARE ENGINEERING","Score":null,"Total":0}
引用次数: 3
Abstract
Abstract In this paper, we present Forest GUMP (for Generalized, Unifying Merge Process) a tool for verification and precise explanation of Random forests. Besides pre/post-condition-based verification and equivalence checking, Forest GUMP also supports three concepts of explanation, the well-known model explanation and outcome explanation , as well as class characterization , i.e., the precise characterization of all samples that are equally classified. Key technology to achieve these results is algebraic aggregation, i.e., the transformation of a Random Forest into a semantically equivalent, concise white-box representation in terms of Algebraic Decision Diagrams (ADDs). The paper sketches the method and demonstrates the use of Forest GUMP along illustrative examples. This way readers should acquire an intuition about the tool, and the way how it should be used to increase the understanding not only of the considered dataset, but also of the character of Random Forests and the ADD technology, here enriched to comprise infeasible path elimination. As Forest GUMP is publicly available all experiments can be reproduced, modified, and complemented using any dataset that is available in the ARFF format.
期刊介绍:
The International Journal on Software Tools for Technology Transfer (STTT) provides a forum for the discussion of all aspects of tools supporting the development of computer systems. It offers, above all, a tool-oriented link between academic research and industrial practice.
Tool support for the development of reliable and correct computer-based systems is of growing importance, and a wealth of design methodologies, algorithms, and associated tools have been developed in different areas of computer science. However, each area has its own culture and terminology, preventing researchers from taking advantage of the results obtained by colleagues in other fields. Tool builders are often unaware of the work done by others, and thus unable to apply it. The situation is even more critical when considering the transfer of new technology into industrial practice.