{"title":"Performance Assessment and Interpretation of Random Forests by Three-dimensional Visualizations","authors":"R. Hänsch, O. Hellwich","doi":"10.5220/0005310901490156","DOIUrl":null,"url":null,"abstract":"Ensemble learning techniques and in particular Random Forests have been one of the most successful machine learning approaches of the last decade. Despite their success, there exist barely suitable visualizations of Random Forests, which allow a fast and accurate understanding of how well they perform a certain task and what leads to this performance. This paper proposes an exemplar-driven visualization illustrating the most important key concepts of a Random Forest classifier, namely strength and correlation of the individual trees as well as strength of the whole forest. A visual inspection of the results enables not only an easy performance evaluation but also provides further insights why this performance was achieved and how parameters of the underlying Random Forest should be changed in order to further improve the performance. Although the paper focuses on Random Forests for classification tasks, the developed framework is by no means limited to that and can be easily applied to other tree-based ensemble learning methods.","PeriodicalId":326087,"journal":{"name":"International Conference on Information Visualization Theory and Applications","volume":"35 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Conference on Information Visualization Theory and Applications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.5220/0005310901490156","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
Ensemble learning techniques and in particular Random Forests have been one of the most successful machine learning approaches of the last decade. Despite their success, there exist barely suitable visualizations of Random Forests, which allow a fast and accurate understanding of how well they perform a certain task and what leads to this performance. This paper proposes an exemplar-driven visualization illustrating the most important key concepts of a Random Forest classifier, namely strength and correlation of the individual trees as well as strength of the whole forest. A visual inspection of the results enables not only an easy performance evaluation but also provides further insights why this performance was achieved and how parameters of the underlying Random Forest should be changed in order to further improve the performance. Although the paper focuses on Random Forests for classification tasks, the developed framework is by no means limited to that and can be easily applied to other tree-based ensemble learning methods.