高维鲁棒回归的迁移学习

Xiaohui Yuan, Shujie Ren
{"title":"高维鲁棒回归的迁移学习","authors":"Xiaohui Yuan, Shujie Ren","doi":"arxiv-2406.17567","DOIUrl":null,"url":null,"abstract":"Transfer learning has become an essential technique for utilizing information\nfrom source datasets to improve the performance of the target task. However, in\nthe context of high-dimensional data, heterogeneity arises due to\nheteroscedastic variance or inhomogeneous covariate effects. To solve this\nproblem, this paper proposes a robust transfer learning based on the Huber\nregression, specifically designed for scenarios where the transferable source\ndata set is known. This method effectively mitigates the impact of data\nheteroscedasticity, leading to improvements in estimation and prediction\naccuracy. Moreover, when the transferable source data set is unknown, the paper\nintroduces an efficient detection algorithm to identify informative sources.\nThe effectiveness of the proposed method is proved through numerical simulation\nand empirical analysis using superconductor data.","PeriodicalId":501323,"journal":{"name":"arXiv - STAT - Other Statistics","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2024-06-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Transfer Learning for High Dimensional Robust Regression\",\"authors\":\"Xiaohui Yuan, Shujie Ren\",\"doi\":\"arxiv-2406.17567\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Transfer learning has become an essential technique for utilizing information\\nfrom source datasets to improve the performance of the target task. However, in\\nthe context of high-dimensional data, heterogeneity arises due to\\nheteroscedastic variance or inhomogeneous covariate effects. To solve this\\nproblem, this paper proposes a robust transfer learning based on the Huber\\nregression, specifically designed for scenarios where the transferable source\\ndata set is known. This method effectively mitigates the impact of data\\nheteroscedasticity, leading to improvements in estimation and prediction\\naccuracy. Moreover, when the transferable source data set is unknown, the paper\\nintroduces an efficient detection algorithm to identify informative sources.\\nThe effectiveness of the proposed method is proved through numerical simulation\\nand empirical analysis using superconductor data.\",\"PeriodicalId\":501323,\"journal\":{\"name\":\"arXiv - STAT - Other Statistics\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-06-25\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"arXiv - STAT - Other Statistics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/arxiv-2406.17567\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - STAT - Other Statistics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2406.17567","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

迁移学习已成为利用源数据集信息提高目标任务性能的一项基本技术。然而,在高维数据的背景下,由于异速方差或不均匀的协变量效应,会产生异质性。为了解决这个问题,本文提出了一种基于 Huber 回归的稳健迁移学习方法,专门用于已知可迁移源数据集的情况。这种方法有效地减轻了数据异方差的影响,从而提高了估计和预测的准确性。此外,当可转移源数据集未知时,本文引入了一种高效的检测算法来识别信息源。通过使用超导体数据进行数值模拟和实证分析,证明了所提方法的有效性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Transfer Learning for High Dimensional Robust Regression
Transfer learning has become an essential technique for utilizing information from source datasets to improve the performance of the target task. However, in the context of high-dimensional data, heterogeneity arises due to heteroscedastic variance or inhomogeneous covariate effects. To solve this problem, this paper proposes a robust transfer learning based on the Huber regression, specifically designed for scenarios where the transferable source data set is known. This method effectively mitigates the impact of data heteroscedasticity, leading to improvements in estimation and prediction accuracy. Moreover, when the transferable source data set is unknown, the paper introduces an efficient detection algorithm to identify informative sources. The effectiveness of the proposed method is proved through numerical simulation and empirical analysis using superconductor data.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Censored Data Forecasting: Applying Tobit Exponential Smoothing with Time Aggregation How to survive the Squid Games using probability theory Cross-sectional personal network analysis of adult smoking in rural areas Modeling information spread across networks with communities using a multitype branching process framework Asymptotic confidence intervals for the difference and the ratio of the weighted kappa coefficients of two diagnostic tests subject to a paired design
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1