A hyperbolic divergence based nonparametric test for two-sample multivariate distributions

IF 0.8 4区 数学 Q3 STATISTICS & PROBABILITY Canadian Journal of Statistics-Revue Canadienne De Statistique Pub Date : 2022-11-26 DOI:10.1002/cjs.11736
Roulin Wang, Wei Fan, Xueqin Wang
{"title":"A hyperbolic divergence based nonparametric test for two-sample multivariate distributions","authors":"Roulin Wang,&nbsp;Wei Fan,&nbsp;Xueqin Wang","doi":"10.1002/cjs.11736","DOIUrl":null,"url":null,"abstract":"<p>Two-sample hypothesis testing, as a fundamental problem in statistical inference, seeks to detect the difference between two probability measures and has numerous real-world applications. Current test procedures for multivariate two-sample problems typically rely on angles and lengths in a Euclidean space, or lengths in a unit hypersphere after representing data with the spherical model. This article introduces a hyperbolic divergence based on hyperbolic lengths in hyperbolic geometry, as well as a subsequent nonparametric approach to testing the multivariate two-sample problem. We investigate the properties of our test procedure and discover that our hyperbolic divergence statistic is strongly consistent and consistent against all other alternatives; we also demonstrate that its limit distribution is an infinite mixture of <math>\n <msup>\n <mrow>\n <mi>χ</mi>\n </mrow>\n <mrow>\n <mn>2</mn>\n </mrow>\n </msup></math> distributions under the null hypothesis and a normal distribution under the alternative hypothesis. To calculate the <math>\n <mrow>\n <mi>P</mi>\n </mrow></math>-value, we employ the permutation method. Furthermore, in numerical studies, we compare our method with several nonparametric procedures under various distributional assumptions and alternatives. We discover that our test procedure has some advantages when the distributions' complex correlation structures differ. Finally, we examine one real data set to show how our method can be used to test two-sample heterogeneity.</p>","PeriodicalId":55281,"journal":{"name":"Canadian Journal of Statistics-Revue Canadienne De Statistique","volume":null,"pages":null},"PeriodicalIF":0.8000,"publicationDate":"2022-11-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Canadian Journal of Statistics-Revue Canadienne De Statistique","FirstCategoryId":"100","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/cjs.11736","RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"STATISTICS & PROBABILITY","Score":null,"Total":0}
引用次数: 0

Abstract

Two-sample hypothesis testing, as a fundamental problem in statistical inference, seeks to detect the difference between two probability measures and has numerous real-world applications. Current test procedures for multivariate two-sample problems typically rely on angles and lengths in a Euclidean space, or lengths in a unit hypersphere after representing data with the spherical model. This article introduces a hyperbolic divergence based on hyperbolic lengths in hyperbolic geometry, as well as a subsequent nonparametric approach to testing the multivariate two-sample problem. We investigate the properties of our test procedure and discover that our hyperbolic divergence statistic is strongly consistent and consistent against all other alternatives; we also demonstrate that its limit distribution is an infinite mixture of χ 2 distributions under the null hypothesis and a normal distribution under the alternative hypothesis. To calculate the P -value, we employ the permutation method. Furthermore, in numerical studies, we compare our method with several nonparametric procedures under various distributional assumptions and alternatives. We discover that our test procedure has some advantages when the distributions' complex correlation structures differ. Finally, we examine one real data set to show how our method can be used to test two-sample heterogeneity.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
两样本多元分布的基于双曲散度的非参数检验
双样本假设检验作为统计推断中的一个基本问题,旨在检测两个概率度量之间的差异,并在现实世界中有许多应用。当前多变量双样本问题的测试程序通常依赖于欧几里得空间中的角度和长度,或者在用球面模型表示数据后的单位超球中的长度。本文介绍了双曲几何中基于双曲长度的双曲散度,以及随后检验多变量双样本问题的非参数方法。我们研究了我们的检验程序的性质,并发现我们的双曲散度统计量对所有其他选择都是强一致和一致的;我们还证明了它的极限分布是零假设下的χ 2分布和备择假设下的正态分布的无限混合。为了计算P值,我们采用置换法。此外,在数值研究中,我们将该方法与不同分布假设和选择下的几种非参数过程进行了比较。我们发现,当分布的复杂相关结构不同时,我们的测试方法具有一定的优势。最后,我们检查了一个真实的数据集,以显示如何使用我们的方法来测试双样本异质性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
CiteScore
1.40
自引率
0.00%
发文量
62
审稿时长
>12 weeks
期刊介绍: The Canadian Journal of Statistics is the official journal of the Statistical Society of Canada. It has a reputation internationally as an excellent journal. The editorial board is comprised of statistical scientists with applied, computational, methodological, theoretical and probabilistic interests. Their role is to ensure that the journal continues to provide an international forum for the discipline of Statistics. The journal seeks papers making broad points of interest to many readers, whereas papers making important points of more specific interest are better placed in more specialized journals. The levels of innovation and impact are key in the evaluation of submitted manuscripts.
期刊最新文献
Issue Information Issue Information Issue Information Censored autoregressive regression models with Student-t innovations Acknowledgement of referees' services remerciements aux membres des jurys
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1