Interval-censored data arise when a failure process is under intermittent observation and failure status is only known at assessment times. We consider the development of predictive algorithms when training samples involve interval censoring. Using censoring unbiased transformations and pseudo-observations, we define observed data loss functions, which are unbiased estimates of the corresponding complete data loss functions. We show that regression trees based on these loss functions can recover the tree structure and yield good predictive accuracy. An application is given to a study involving individuals with psoriatic arthritis where the aim is to identify genetic markers useful for the prediction of axial disease within 10 years of a baseline assessment.
{"title":"Regression trees for interval-censored failure time data based on censoring unbiased transformations and pseudo-observations","authors":"Ce Yang, Xianwei Li, Liqun Diao, Richard J. Cook","doi":"10.1002/cjs.11807","DOIUrl":"10.1002/cjs.11807","url":null,"abstract":"<p>Interval-censored data arise when a failure process is under intermittent observation and failure status is only known at assessment times. We consider the development of predictive algorithms when training samples involve interval censoring. Using censoring unbiased transformations and pseudo-observations, we define observed data loss functions, which are unbiased estimates of the corresponding complete data loss functions. We show that regression trees based on these loss functions can recover the tree structure and yield good predictive accuracy. An application is given to a study involving individuals with psoriatic arthritis where the aim is to identify genetic markers useful for the prediction of axial disease within 10 years of a baseline assessment.</p>","PeriodicalId":55281,"journal":{"name":"Canadian Journal of Statistics-Revue Canadienne De Statistique","volume":"52 4","pages":""},"PeriodicalIF":0.8,"publicationDate":"2024-06-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141513857","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Consider the problem of benchmarking small-area estimates under multiplicative models with positive parameters. The goal is to propose a loss function that guarantees positive constrained estimates of small-area parameters in this situation. The weighted precautionary loss function is introduced to solve the problem. Compared with the weighted Kullback–Leibler (KL) loss function, our proposed loss function penalizes underestimation of the small-area parameters of interest more for small values of parameters. This property is appealing when we estimate disease rates. It tends to give larger estimates of small-area parameters compared with those obtained under the KL loss function. The hierarchical empirical Bayes and constrained hierarchical empirical Bayes estimates of small-area parameters and their corresponding risk functions under the new proposed loss function are obtained. The performance of the proposed methods is investigated using simulation studies and a real dataset.
{"title":"Constrained Bayes in multiplicative area-level models under the precautionary loss function","authors":"Elaheh Torkashvand, Mohammad Jafari Jozani","doi":"10.1002/cjs.11809","DOIUrl":"10.1002/cjs.11809","url":null,"abstract":"<p>Consider the problem of benchmarking small-area estimates under multiplicative models with positive parameters. The goal is to propose a loss function that guarantees positive constrained estimates of small-area parameters in this situation. The weighted precautionary loss function is introduced to solve the problem. Compared with the weighted Kullback–Leibler (KL) loss function, our proposed loss function penalizes underestimation of the small-area parameters of interest more for small values of parameters. This property is appealing when we estimate disease rates. It tends to give larger estimates of small-area parameters compared with those obtained under the KL loss function. The hierarchical empirical Bayes and constrained hierarchical empirical Bayes estimates of small-area parameters and their corresponding risk functions under the new proposed loss function are obtained. The performance of the proposed methods is investigated using simulation studies and a real dataset.</p>","PeriodicalId":55281,"journal":{"name":"Canadian Journal of Statistics-Revue Canadienne De Statistique","volume":"52 4","pages":""},"PeriodicalIF":0.8,"publicationDate":"2024-06-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141513858","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Katherine A. L. Valeriano, Fernanda L. Schumacher, Christian E. Galarza, Larissa A. Matos
Data collected over time are common in applications and may contain censored or missing observations, making it difficult to use standard statistical procedures. This article proposes an algorithm to estimate the parameters of a censored linear regression model with errors serially correlated and innovations following a Student-