{"title":"LARF: Two-Level Attention-Based Random Forests with a Mixture of Contamination Models","authors":"Andrei Konstantinov, Lev Utkin, Vladimir Muliukha","doi":"10.3390/informatics10020040","DOIUrl":null,"url":null,"abstract":"This paper provides new models of the attention-based random forests called LARF (leaf attention-based random forest). The first idea behind the models is to introduce a two-level attention, where one of the levels is the “leaf” attention, and the attention mechanism is applied to every leaf of trees. The second level is the tree attention depending on the “leaf” attention. The second idea is to replace the softmax operation in the attention with the weighted sum of the softmax operations with different parameters. It is implemented by applying a mixture of Huber’s contamination models and can be regarded as an analog of the multi-head attention, with “heads” defined by selecting a value of the softmax parameter. Attention parameters are simply trained by solving the quadratic optimization problem. To simplify the tuning process of the models, it is proposed to convert the tuning contamination parameters into trainable parameters and to compute them by solving the quadratic optimization problem. Many numerical experiments with real datasets are performed for studying LARFs. The code of the proposed algorithms is available.","PeriodicalId":37100,"journal":{"name":"Informatics","volume":"92 1","pages":"0"},"PeriodicalIF":3.4000,"publicationDate":"2023-04-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Informatics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3390/informatics10020040","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
引用次数: 0
Abstract
This paper provides new models of the attention-based random forests called LARF (leaf attention-based random forest). The first idea behind the models is to introduce a two-level attention, where one of the levels is the “leaf” attention, and the attention mechanism is applied to every leaf of trees. The second level is the tree attention depending on the “leaf” attention. The second idea is to replace the softmax operation in the attention with the weighted sum of the softmax operations with different parameters. It is implemented by applying a mixture of Huber’s contamination models and can be regarded as an analog of the multi-head attention, with “heads” defined by selecting a value of the softmax parameter. Attention parameters are simply trained by solving the quadratic optimization problem. To simplify the tuning process of the models, it is proposed to convert the tuning contamination parameters into trainable parameters and to compute them by solving the quadratic optimization problem. Many numerical experiments with real datasets are performed for studying LARFs. The code of the proposed algorithms is available.
本文提出了一种新的基于注意力的随机森林模型,称为LARF (leaf attention-based random forest)。模型背后的第一个想法是引入两层注意,其中一层是“叶子”注意,注意机制应用于树的每一片叶子。第二级是树的注意,依赖于“叶子”的注意。第二种思路是将注意力中的softmax操作替换为不同参数的softmax操作的加权和。它是通过应用Huber污染模型的混合来实现的,可以看作是多头注意力的模拟,通过选择softmax参数的一个值来定义“头”。通过求解二次优化问题,简单地训练注意力参数。为了简化模型的整定过程,提出将整定污染参数转换为可训练参数,并通过求解二次优化问题对其进行计算。在实际数据集上进行了许多数值实验来研究larf。所提出的算法的代码是可用的。