{"title":"Approximate tolerance intervals for nonparametric regression models","authors":"Yafan Guo, Derek S. Young","doi":"10.1080/10485252.2023.2277260","DOIUrl":null,"url":null,"abstract":"AbstractTolerance intervals in regression allow the user to quantify, with a specified degree of confidence, bounds for a specified proportion of the sampled population when conditioned on a set of covariate values. While methods are available for tolerance intervals in fully-parametric regression settings, the construction of tolerance intervals for nonparametric regression models has been treated in a limited capacity. This paper fills this gap and develops likelihood-based approaches for the construction of pointwise one-sided and two-sided tolerance intervals for nonparametric regression models. A numerical approach is also presented for constructing simultaneous tolerance intervals. An appealing facet of this work is that the resulting methodology is consistent with what is done for fully-parametric regression tolerance intervals. Extensive coverage studies are presented, which demonstrate very good performance of the proposed methods. The proposed tolerance intervals are calculated and interpreted for analyses involving a fertility dataset and a triceps measurement dataset.Keywords: Bootstrapboundary effectscoverage probabilitiesk-factorsmoothing splineAMS Subject Classifications: 62G0862G15 AcknowledgmentsWe would thank the University of Kentucky Center for Computational Sciences and Information Technology Services Research Computing for their support and use of the Lipscomb Compute Cluster and associated research computing resources. The authors are also thankful to the Associate Editor and two reviewers who provided numerous insightful comments that improved the overall quality of this work.Disclosure statementNo potential conflict of interest was reported by the author(s).Data availability statementThe fertility data are available at the HFC's website bluehttps://www.fertilitydata.org/cgi-bin/data.php. The triceps data are available in the R package MultiKink (Wan and Zhong Citation2020), and can be accessed by typing data(triceps).","PeriodicalId":50112,"journal":{"name":"Journal of Nonparametric Statistics","volume":"33 4","pages":"0"},"PeriodicalIF":0.8000,"publicationDate":"2023-11-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Nonparametric Statistics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1080/10485252.2023.2277260","RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"STATISTICS & PROBABILITY","Score":null,"Total":0}
引用次数: 0
Abstract
AbstractTolerance intervals in regression allow the user to quantify, with a specified degree of confidence, bounds for a specified proportion of the sampled population when conditioned on a set of covariate values. While methods are available for tolerance intervals in fully-parametric regression settings, the construction of tolerance intervals for nonparametric regression models has been treated in a limited capacity. This paper fills this gap and develops likelihood-based approaches for the construction of pointwise one-sided and two-sided tolerance intervals for nonparametric regression models. A numerical approach is also presented for constructing simultaneous tolerance intervals. An appealing facet of this work is that the resulting methodology is consistent with what is done for fully-parametric regression tolerance intervals. Extensive coverage studies are presented, which demonstrate very good performance of the proposed methods. The proposed tolerance intervals are calculated and interpreted for analyses involving a fertility dataset and a triceps measurement dataset.Keywords: Bootstrapboundary effectscoverage probabilitiesk-factorsmoothing splineAMS Subject Classifications: 62G0862G15 AcknowledgmentsWe would thank the University of Kentucky Center for Computational Sciences and Information Technology Services Research Computing for their support and use of the Lipscomb Compute Cluster and associated research computing resources. The authors are also thankful to the Associate Editor and two reviewers who provided numerous insightful comments that improved the overall quality of this work.Disclosure statementNo potential conflict of interest was reported by the author(s).Data availability statementThe fertility data are available at the HFC's website bluehttps://www.fertilitydata.org/cgi-bin/data.php. The triceps data are available in the R package MultiKink (Wan and Zhong Citation2020), and can be accessed by typing data(triceps).
在一组协变量值的条件下,回归中的容忍区间允许用户以指定的置信度量化抽样总体中指定比例的界限。虽然在全参数回归设置中有可用于公差区间的方法,但对非参数回归模型的公差区间的构造的处理能力有限。本文填补了这一空白,并开发了基于似然的方法来构建非参数回归模型的点向单侧和双侧容差区间。提出了一种构造同步公差区间的数值方法。这项工作的一个吸引人的方面是,所得到的方法与对全参数回归容忍区间所做的一致。广泛的覆盖研究表明,所提出的方法具有良好的性能。提出的公差区间计算和解释分析涉及生育数据集和三头肌测量数据集。关键词:自举边界效应覆盖概率因子平滑样条ams学科分类:62G0862G15致谢我们要感谢肯塔基大学计算科学和信息技术服务研究计算中心对Lipscomb计算集群和相关研究计算资源的支持和使用。作者还感谢副编辑和两位审稿人,他们提供了许多有见地的评论,提高了本文的整体质量。披露声明作者未报告潜在的利益冲突。数据可用性声明生育率数据可在HFC的网站bluehttps://www.fertilitydata.org/cgi-bin/data.php上获得。肱三头肌数据可以在R软件包MultiKink (Wan and Zhong Citation2020)中获得,可以通过输入数据(肱三头肌)来访问。
期刊介绍:
Journal of Nonparametric Statistics provides a medium for the publication of research and survey work in nonparametric statistics and related areas. The scope includes, but is not limited to the following topics:
Nonparametric modeling,
Nonparametric function estimation,
Rank and other robust and distribution-free procedures,
Resampling methods,
Lack-of-fit testing,
Multivariate analysis,
Inference with high-dimensional data,
Dimension reduction and variable selection,
Methods for errors in variables, missing, censored, and other incomplete data structures,
Inference of stochastic processes,
Sample surveys,
Time series analysis,
Longitudinal and functional data analysis,
Nonparametric Bayes methods and decision procedures,
Semiparametric models and procedures,
Statistical methods for imaging and tomography,
Statistical inverse problems,
Financial statistics and econometrics,
Bioinformatics and comparative genomics,
Statistical algorithms and machine learning.
Both the theory and applications of nonparametric statistics are covered in the journal. Research applying nonparametric methods to medicine, engineering, technology, science and humanities is welcomed, provided the novelty and quality level are of the highest order.
Authors are encouraged to submit supplementary technical arguments, computer code, data analysed in the paper or any additional information for online publication along with the published paper.