Two-step semiparametric empirical likelihood inference from capture–recapture data with missing covariates

IF 1.2 4区 数学 Q2 STATISTICS & PROBABILITY Test Pub Date : 2024-02-14 DOI:10.1007/s11749-024-00921-1
Yang Liu, Yukun Liu, Pengfei Li, Riquan Zhang
{"title":"Two-step semiparametric empirical likelihood inference from capture–recapture data with missing covariates","authors":"Yang Liu, Yukun Liu, Pengfei Li, Riquan Zhang","doi":"10.1007/s11749-024-00921-1","DOIUrl":null,"url":null,"abstract":"<p>Missing covariates are not uncommon in capture–recapture studies. When covariate information is missing at random in capture–recapture data, an empirical full likelihood method has been demonstrated to outperform conditional-likelihood-based methods in abundance estimation. However, the fully observed covariates must be discrete, and the method is not directly applicable to continuous-time capture–recapture data. Based on the Binomial and Poisson regression models, we propose a two-step semiparametric empirical likelihood approach for abundance estimation in the presence of missing covariates, regardless of whether the fully observed covariates are discrete or continuous. We show that the maximum semiparametric empirical likelihood estimators for the underlying parameters and the abundance are asymptotically normal, and more efficient than the counterpart for a completely known non-missingness probability. After scaling, the empirical likelihood ratio test statistic for abundance follows a limiting chi-square distribution with one degree of freedom. The proposed approach is further extended to one-inflated count regression models, and a score-like test is constructed to assess whether one-inflation exists among the number of captures. Our simulation shows that, compared with the previous method, the proposed method not only performs better in correcting bias, but also has a more accurate coverage in the presence of fully observed continuous covariates, although there may be a slight efficiency loss when the fully observed covariates are only discrete. The performance of the new method is illustrated by analyses of the yellow-bellied prinia data and the rana pretiosa data.</p>","PeriodicalId":51189,"journal":{"name":"Test","volume":"73 1","pages":""},"PeriodicalIF":1.2000,"publicationDate":"2024-02-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Test","FirstCategoryId":"100","ListUrlMain":"https://doi.org/10.1007/s11749-024-00921-1","RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"STATISTICS & PROBABILITY","Score":null,"Total":0}
引用次数: 0

Abstract

Missing covariates are not uncommon in capture–recapture studies. When covariate information is missing at random in capture–recapture data, an empirical full likelihood method has been demonstrated to outperform conditional-likelihood-based methods in abundance estimation. However, the fully observed covariates must be discrete, and the method is not directly applicable to continuous-time capture–recapture data. Based on the Binomial and Poisson regression models, we propose a two-step semiparametric empirical likelihood approach for abundance estimation in the presence of missing covariates, regardless of whether the fully observed covariates are discrete or continuous. We show that the maximum semiparametric empirical likelihood estimators for the underlying parameters and the abundance are asymptotically normal, and more efficient than the counterpart for a completely known non-missingness probability. After scaling, the empirical likelihood ratio test statistic for abundance follows a limiting chi-square distribution with one degree of freedom. The proposed approach is further extended to one-inflated count regression models, and a score-like test is constructed to assess whether one-inflation exists among the number of captures. Our simulation shows that, compared with the previous method, the proposed method not only performs better in correcting bias, but also has a more accurate coverage in the presence of fully observed continuous covariates, although there may be a slight efficiency loss when the fully observed covariates are only discrete. The performance of the new method is illustrated by analyses of the yellow-bellied prinia data and the rana pretiosa data.

Abstract Image

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
从具有缺失协变量的捕获-再捕获数据中进行两步半参数经验似然推断
在捕获-再捕获研究中,协变量缺失的情况并不少见。当捕获-再捕获数据中的协变量信息随机缺失时,经验全似然法在丰度估计中的表现优于基于条件似然法的方法。然而,完全观测到的协变量必须是离散的,该方法不能直接用于连续时间的捕获-再捕获数据。基于二项回归和泊松回归模型,我们提出了一种两步半参数经验似然法,用于缺失协变量情况下的丰度估计,无论完全观测到的协变量是离散的还是连续的。我们证明,基础参数和丰度的最大半参数经验似然估计值是渐近正态的,比完全已知非缺失概率的对应估计值更有效。缩放后,丰度的经验似然比检验统计量遵循一个自由度的极限奇平方分布。所提出的方法进一步扩展到单膨胀计数回归模型,并构建了一个类似分数的检验来评估捕获数量中是否存在单膨胀。我们的模拟结果表明,与之前的方法相比,所提出的方法不仅在纠正偏差方面表现更好,而且在完全观测到连续协变量的情况下具有更准确的覆盖范围,不过当完全观测到的协变量只是离散协变量时,可能会有轻微的效率损失。新方法的性能通过对黄腹角雉数据和滇金丝猴数据的分析得到了说明。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Test
Test 数学-统计学与概率论
CiteScore
2.20
自引率
7.70%
发文量
41
审稿时长
>12 weeks
期刊介绍: TEST is an international journal of Statistics and Probability, sponsored by the Spanish Society of Statistics and Operations Research. English is the official language of the journal. The emphasis of TEST is placed on papers containing original theoretical contributions of direct or potential value in applications. In this respect, the methodological contents are considered to be crucial for the papers published in TEST, but the practical implications of the methodological aspects are also relevant. Original sound manuscripts on either well-established or emerging areas in the scope of the journal are welcome. One volume is published annually in four issues. In addition to the regular contributions, each issue of TEST contains an invited paper from a world-wide recognized outstanding statistician on an up-to-date challenging topic, including discussions.
期刊最新文献
Jackknife empirical likelihood for the correlation coefficient with additive distortion measurement errors Nonparametric conditional survival function estimation and plug-in bandwidth selection with multiple covariates Higher-order spatial autoregressive varying coefficient model: estimation and specification test Composite quantile estimation in partially functional linear regression model with randomly censored responses Bayesian inference and cure rate modeling for event history data
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1