Performance Assessment and Interpretation of Random Forests by Three-dimensional Visualizations

International Conference on Information Visualization Theory and Applications Pub Date : 1900-01-01 DOI:10.5220/0005310901490156

R. Hänsch, O. Hellwich

引用次数: 2

Abstract

Ensemble learning techniques and in particular Random Forests have been one of the most successful machine learning approaches of the last decade. Despite their success, there exist barely suitable visualizations of Random Forests, which allow a fast and accurate understanding of how well they perform a certain task and what leads to this performance. This paper proposes an exemplar-driven visualization illustrating the most important key concepts of a Random Forest classifier, namely strength and correlation of the individual trees as well as strength of the whole forest. A visual inspection of the results enables not only an easy performance evaluation but also provides further insights why this performance was achieved and how parameters of the underlying Random Forest should be changed in order to further improve the performance. Although the paper focuses on Random Forests for classification tasks, the developed framework is by no means limited to that and can be easily applied to other tree-based ensemble learning methods.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

随机森林的三维可视化性能评估与解释

集成学习技术，特别是随机森林是过去十年中最成功的机器学习方法之一。尽管它们取得了成功，但却很少有合适的随机森林可视化，这使得我们能够快速准确地了解它们执行特定任务的情况以及导致这种表现的原因。本文提出了一个示例驱动的可视化，说明了随机森林分类器最重要的关键概念，即单个树的强度和相关性以及整个森林的强度。对结果进行视觉检查不仅可以轻松地进行性能评估，还可以进一步了解为什么可以实现这种性能，以及应该如何更改底层随机森林的参数以进一步提高性能。虽然本文的重点是随机森林分类任务，但所开发的框架绝不仅限于此，并且可以很容易地应用于其他基于树的集成学习方法。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

International Conference on Information Visualization Theory and Applications

自引率

0.00%

发文量

期刊最新文献

A Linear Time Algorithm for Visualizing Knotted Structures in 3 Pages Information Visualization for CSV Open Data Files Structure Analysis VISUAL ANALYTICS OF MULTIMODAL BIOLOGICAL DATA Arc and Swarm-based Representations of Customer's Flows among Supermarkets Effect of Displaying Uncertainty in Line and Bar Charts - Presentation and Interpretation