{"title":"Comparison of the Methodology for Hypothesis Testing of the Independence of Two-Dimensional Random Variables Based on a Nonparametric Classifier","authors":"A. V. Lapko, V. A. Lapko, A. V. Bakhtina","doi":"10.3103/s0147688223060084","DOIUrl":null,"url":null,"abstract":"<h3 data-test=\"abstract-sub-heading\">Abstract—</h3><p>The properties of a new method for the hypothesis testing of the independence of random variables based on the use of a nonparametric pattern recognition algorithm corresponding to the maximum likelihood criterion are considered. The estimation of the distribution laws in classes is carried out using the initial statistical data under the assumption of the independence and dependence of the analyzed random variables. Under these conditions, estimates of the probabilities of pattern recognition errors in classes are calculated. A decision is made on the independence or dependence of random variables according to their minimum value. The results of the proposed method are compared using the Pearson criterion and the Pearson, Spearman, and Kendall correlation coefficients. When implementing the Pearson criterion, the formula for optimal discretization of the range of values of a two-dimensional random variable is used. Their effectiveness in complicating the dependence between random variables and changing the volume of initial statistical data is studied using computational experiment.</p>","PeriodicalId":43962,"journal":{"name":"Scientific and Technical Information Processing","volume":"24 1","pages":""},"PeriodicalIF":0.4000,"publicationDate":"2024-03-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Scientific and Technical Information Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3103/s0147688223060084","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"INFORMATION SCIENCE & LIBRARY SCIENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Abstract—
The properties of a new method for the hypothesis testing of the independence of random variables based on the use of a nonparametric pattern recognition algorithm corresponding to the maximum likelihood criterion are considered. The estimation of the distribution laws in classes is carried out using the initial statistical data under the assumption of the independence and dependence of the analyzed random variables. Under these conditions, estimates of the probabilities of pattern recognition errors in classes are calculated. A decision is made on the independence or dependence of random variables according to their minimum value. The results of the proposed method are compared using the Pearson criterion and the Pearson, Spearman, and Kendall correlation coefficients. When implementing the Pearson criterion, the formula for optimal discretization of the range of values of a two-dimensional random variable is used. Their effectiveness in complicating the dependence between random variables and changing the volume of initial statistical data is studied using computational experiment.
期刊介绍:
Scientific and Technical Information Processing is a refereed journal that covers all aspects of management and use of information technology in libraries and archives, information centres, and the information industry in general. Emphasis is on practical applications of new technologies and techniques for information analysis and processing.