Smoothing level selection for density estimators based on the moments

IF 1.2 4区 数学 Q2 STATISTICS & PROBABILITY Journal of Applied Statistics Pub Date : 2023-11-07 DOI:10.1080/02664763.2023.2277125
Rosa M. García-Fernández, Federico Palacios-González
{"title":"Smoothing level selection for density estimators based on the moments","authors":"Rosa M. García-Fernández, Federico Palacios-González","doi":"10.1080/02664763.2023.2277125","DOIUrl":null,"url":null,"abstract":"AbstractThis paper introduces an approach to select the bandwidth or smoothing parameter in multiresolution (MR) density estimation and nonparametric density estimation. It is based on the evolution of the second, third and fourth central moments and the shape of the estimated densities for different bandwidths and resolution levels. The proposed method has been applied to density estimation by means of multiresolution densities as well as kernel density estimation (MRDE and KDE respectively). The results of the simulations and the empirical application demonstrate that the level of resolution resulting from the moments method performs better with multimodal densities than the Bayesian Information Criterion (BIC) for multiresolution densities estimation and the plug-in for kernel densities estimation.KEYWORDS: Multiresolution density estimationkernel density estimationbandwidthmoments and level of resolution Disclosure statementNo potential conflict of interest was reported by the author(s).Notes1 The multiresolution densities are a particular case of semiparametric models (see, [Citation12,Citation14]).2 This is a well-known fact underlying all the bandwidth selection methods.3 Remind that these intervals form a partition of the real line and their amplitude converges to zero as j increases.4 Unless this is done parametrically using the EM algorithm on a mixture model of three double exponential distributions. But for a sample of size 10,000 the process time is too long.5 Note that the values for the Gini coefficient can differ from other publications since our illustration is based on gross income instead of net income.6 The expected value of the density is zero and the central and non-central moments are equal.","PeriodicalId":15239,"journal":{"name":"Journal of Applied Statistics","volume":"312 10","pages":"0"},"PeriodicalIF":1.2000,"publicationDate":"2023-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Applied Statistics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1080/02664763.2023.2277125","RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"STATISTICS & PROBABILITY","Score":null,"Total":0}
引用次数: 0

Abstract

AbstractThis paper introduces an approach to select the bandwidth or smoothing parameter in multiresolution (MR) density estimation and nonparametric density estimation. It is based on the evolution of the second, third and fourth central moments and the shape of the estimated densities for different bandwidths and resolution levels. The proposed method has been applied to density estimation by means of multiresolution densities as well as kernel density estimation (MRDE and KDE respectively). The results of the simulations and the empirical application demonstrate that the level of resolution resulting from the moments method performs better with multimodal densities than the Bayesian Information Criterion (BIC) for multiresolution densities estimation and the plug-in for kernel densities estimation.KEYWORDS: Multiresolution density estimationkernel density estimationbandwidthmoments and level of resolution Disclosure statementNo potential conflict of interest was reported by the author(s).Notes1 The multiresolution densities are a particular case of semiparametric models (see, [Citation12,Citation14]).2 This is a well-known fact underlying all the bandwidth selection methods.3 Remind that these intervals form a partition of the real line and their amplitude converges to zero as j increases.4 Unless this is done parametrically using the EM algorithm on a mixture model of three double exponential distributions. But for a sample of size 10,000 the process time is too long.5 Note that the values for the Gini coefficient can differ from other publications since our illustration is based on gross income instead of net income.6 The expected value of the density is zero and the central and non-central moments are equal.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于矩的密度估计器平滑水平选择
摘要介绍了一种多分辨率(MR)密度估计和非参数密度估计中带宽或平滑参数的选择方法。它基于第二、第三和第四中心矩的演变以及不同带宽和分辨率水平下估计密度的形状。该方法已应用于多分辨率密度估计和核密度估计(分别为MRDE和KDE)。仿真和经验应用结果表明,矩量法在多模态密度下的分辨率水平优于贝叶斯信息准则(BIC)和核密度估计插件。关键词:多分辨率密度估计核密度估计带宽矩和分辨率水平披露声明作者未报告潜在利益冲突。注1:多分辨率密度是半参数模型的特殊情况(参见[Citation12,Citation14])这是所有带宽选择方法背后的一个众所周知的事实提醒一下,这些区间形成了实线的一个分割,它们的振幅随着j的增加收敛于零除非在三个双指数分布的混合模型上使用EM算法进行参数化处理。但是对于大小为10,000的样本,处理时间太长了请注意,基尼系数的值可能与其他出版物不同,因为我们的插图是基于总收入而不是净收入密度的期望值为零,中心矩和非中心矩相等。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Journal of Applied Statistics
Journal of Applied Statistics 数学-统计学与概率论
CiteScore
3.40
自引率
0.00%
发文量
126
审稿时长
6 months
期刊介绍: Journal of Applied Statistics provides a forum for communication between both applied statisticians and users of applied statistical techniques across a wide range of disciplines. These areas include business, computing, economics, ecology, education, management, medicine, operational research and sociology, but papers from other areas are also considered. The editorial policy is to publish rigorous but clear and accessible papers on applied techniques. Purely theoretical papers are avoided but those on theoretical developments which clearly demonstrate significant applied potential are welcomed. Each paper is submitted to at least two independent referees.
期刊最新文献
Framework for constructing an optimal weighted score based on agreement On function-on-function linear quantile regression Quantile regression based method for characterizing risk-specific behavioral patterns in relation to longitudinal left-censored biomarker data collected from heterogeneous populations A two-sample nonparametric test for one-sided location-scale alternative Modeling the time to dropout under phase-wise variable stress fixed cohort setup
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1