Segmented Linear Regression Trees

IF 0.9 3区 数学 Q2 MATHEMATICS Acta Mathematica Sinica-English Series Pub Date : 2025-02-15 DOI:10.1007/s10114-025-3349-5
Xiangyu Zheng, Songxi Chen
{"title":"Segmented Linear Regression Trees","authors":"Xiangyu Zheng,&nbsp;Songxi Chen","doi":"10.1007/s10114-025-3349-5","DOIUrl":null,"url":null,"abstract":"<div><p>Tree-based models have been widely applied in both academic and industrial settings due to the natural interpretability, good predictive accuracy, and high scalability. In this paper, we focus on improving the single-tree method and propose the segmented linear regression trees (SLRT) model that replaces the traditional constant leaf model with linear ones. From the parametric view, SLRT can be employed as a recursive change point detect procedure for segmented linear regression (SLR) models, which is much more efficient and flexible than the traditional grid search method. Along this way, we propose to use the conditional Kendall’s <i>τ</i> correlation coefficient to select the underlying change points. From the non-parametric view, we propose an efficient greedy splitting method that selects the splits by analyzing the association between residuals and each candidate split variable. Further, with the SLRT as a single-tree predictor, we propose a linear random forest approach that aggregates the SLRTs by a weighted average. Both simulation and empirical studies showed significant improvements than the CART trees and even the random forest.</p></div>","PeriodicalId":50893,"journal":{"name":"Acta Mathematica Sinica-English Series","volume":"41 2","pages":"498 - 521"},"PeriodicalIF":0.9000,"publicationDate":"2025-02-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Acta Mathematica Sinica-English Series","FirstCategoryId":"100","ListUrlMain":"https://link.springer.com/article/10.1007/s10114-025-3349-5","RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MATHEMATICS","Score":null,"Total":0}
引用次数: 0

Abstract

Tree-based models have been widely applied in both academic and industrial settings due to the natural interpretability, good predictive accuracy, and high scalability. In this paper, we focus on improving the single-tree method and propose the segmented linear regression trees (SLRT) model that replaces the traditional constant leaf model with linear ones. From the parametric view, SLRT can be employed as a recursive change point detect procedure for segmented linear regression (SLR) models, which is much more efficient and flexible than the traditional grid search method. Along this way, we propose to use the conditional Kendall’s τ correlation coefficient to select the underlying change points. From the non-parametric view, we propose an efficient greedy splitting method that selects the splits by analyzing the association between residuals and each candidate split variable. Further, with the SLRT as a single-tree predictor, we propose a linear random forest approach that aggregates the SLRTs by a weighted average. Both simulation and empirical studies showed significant improvements than the CART trees and even the random forest.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
分段线性回归树
基于树的模型由于其自然的可解释性、良好的预测精度和高的可扩展性,在学术和工业环境中得到了广泛的应用。本文着重对单树方法进行改进,提出了分段线性回归树(SLRT)模型,用线性回归树模型代替传统的常叶模型。从参数化角度看,SLRT可以作为分段线性回归(SLR)模型的递归变化点检测过程,比传统的网格搜索方法更高效、更灵活。在此过程中,我们建议使用条件肯德尔τ相关系数来选择潜在的变化点。从非参数的角度出发,我们提出了一种高效的贪婪分割方法,通过分析残差与候选分割变量之间的关联来选择分割。此外,将SLRT作为单树预测器,我们提出了一种线性随机森林方法,通过加权平均值汇总SLRT。仿真和实证研究都表明,该方法比CART树甚至随机森林都有显著的改进。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
CiteScore
1.00
自引率
0.00%
发文量
138
审稿时长
14.5 months
期刊介绍: Acta Mathematica Sinica, established by the Chinese Mathematical Society in 1936, is the first and the best mathematical journal in China. In 1985, Acta Mathematica Sinica is divided into English Series and Chinese Series. The English Series is a monthly journal, publishing significant research papers from all branches of pure and applied mathematics. It provides authoritative reviews of current developments in mathematical research. Contributions are invited from researchers from all over the world.
期刊最新文献
Multilinear Maximal Operators on Triebel–Lizorkin Spaces and Besov Spaces Möbius Inversion and Duality for Summations of Stable Graphs Virtual Element Method for the Elastic Transmission Eigenvalue Problem with Equal Elastic Tensors Asymptotic Lower and Upper Bounds for Linear Elasticity Eigenvalues New Types of Singular Solutions of Conformal Q-curvature Equations
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1