{"title":"Inference for High-Dimensional Streamed Longitudinal Data","authors":"Senyuan Zheng, Ling Zhou","doi":"10.1007/s10114-025-3305-4","DOIUrl":null,"url":null,"abstract":"<div><p>With the advent of modern devices, such as smartphones and wearable devices, high-dimensional data are collected on many participants for a period of time or even in perpetuity. For this type of data, dependencies between and within data batches exist because data are collected from the same individual over time. Under the framework of streamed data, individual historical data are not available due to the storage and computation burden. It is urgent to develop computationally efficient methods with statistical guarantees to analyze high-dimensional streamed data and make reliable inferences in practice. In addition, the homogeneity assumption on the model parameters may not be valid in practice over time. To address the above issues, in this paper, we develop a new renewable debiased-lasso inference method for high-dimensional streamed data allowing dependences between and within data batches to exist and model parameters to gradually change. We establish the large sample properties of the proposed estimators, including consistency and asymptotic normality. The numerical results, including simulations and real data analysis, show the superior performance of the proposed method.</p></div>","PeriodicalId":50893,"journal":{"name":"Acta Mathematica Sinica-English Series","volume":"41 2","pages":"757 - 779"},"PeriodicalIF":0.8000,"publicationDate":"2025-02-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Acta Mathematica Sinica-English Series","FirstCategoryId":"100","ListUrlMain":"https://link.springer.com/article/10.1007/s10114-025-3305-4","RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MATHEMATICS","Score":null,"Total":0}
引用次数: 0
Abstract
With the advent of modern devices, such as smartphones and wearable devices, high-dimensional data are collected on many participants for a period of time or even in perpetuity. For this type of data, dependencies between and within data batches exist because data are collected from the same individual over time. Under the framework of streamed data, individual historical data are not available due to the storage and computation burden. It is urgent to develop computationally efficient methods with statistical guarantees to analyze high-dimensional streamed data and make reliable inferences in practice. In addition, the homogeneity assumption on the model parameters may not be valid in practice over time. To address the above issues, in this paper, we develop a new renewable debiased-lasso inference method for high-dimensional streamed data allowing dependences between and within data batches to exist and model parameters to gradually change. We establish the large sample properties of the proposed estimators, including consistency and asymptotic normality. The numerical results, including simulations and real data analysis, show the superior performance of the proposed method.
期刊介绍:
Acta Mathematica Sinica, established by the Chinese Mathematical Society in 1936, is the first and the best mathematical journal in China. In 1985, Acta Mathematica Sinica is divided into English Series and Chinese Series. The English Series is a monthly journal, publishing significant research papers from all branches of pure and applied mathematics. It provides authoritative reviews of current developments in mathematical research. Contributions are invited from researchers from all over the world.