Giuseppe Pandolfo, Antonio D’Ambrosio, G. C. Porzio
Title: A note on depth-based classification of circular data
Journal: Electronic Journal of Applied Statistical Analysis, volume 11, pages 447-462
Publication date: 2018-10-14
Publication type: Journal Article
DOI: https://doi.org/10.1285/I20705948V11N2P447
Citations: 9
Abstract
A procedure is developed for classifying objects in circular statistics. It is fully non-parametric and based on depth functions for directional data. Using the so-called DD-plot (depth vs. depth plot), we apply the k-nearest neighbors method to discriminate between competing groups. Three notions of data depth for directional data are considered: the angular simplicial depth, the angular Tukey depth, and the arc distance depth. We compare their performance through the average misclassification rate under different distributional settings, using simulated and real data sets. The results show that the arc distance depth should generally be preferred, and that in some cases it outperforms the classifiers based on the angular simplicial and angular Tukey depths.
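The procedure summarized in the abstract can be illustrated with a minimal Python sketch for two groups of angles. It is not the authors' implementation. It assumes the empirical arc distance depth of an angle is π minus its mean shortest-arc distance to the sample, that neighbors in the DD-plot are found with the Euclidean distance, and that the class is chosen by majority vote. All function names (`arc_distance`, `arc_distance_depth`, `dd_point`, `dd_knn_classify`) and the von Mises example data are hypothetical and do not come from the paper.

```python
import math
import random


def arc_distance(theta, phi):
    """Shortest arc length between two angles in radians, a value in [0, pi]."""
    diff = abs(theta - phi) % (2 * math.pi)
    return min(diff, 2 * math.pi - diff)


def arc_distance_depth(theta, sample):
    """Empirical arc distance depth (assumed form): pi minus the mean arc distance."""
    return math.pi - sum(arc_distance(theta, phi) for phi in sample) / len(sample)


def dd_point(theta, group_a, group_b):
    """DD-plot coordinates of theta: (depth w.r.t. group A, depth w.r.t. group B)."""
    return (arc_distance_depth(theta, group_a), arc_distance_depth(theta, group_b))


def dd_knn_classify(theta, group_a, group_b, k=5):
    """Classify theta by a k-nearest-neighbors majority vote in the DD-plot.

    Returns 0 for group A and 1 for group B. Use an odd k to avoid ties.
    """
    # Map every training angle to its DD-plot coordinates, keeping its label.
    train = [(dd_point(t, group_a, group_b), 0) for t in group_a]
    train += [(dd_point(t, group_a, group_b), 1) for t in group_b]
    target = dd_point(theta, group_a, group_b)
    # Sort training points by Euclidean distance to the target in the DD-plot.
    train.sort(key=lambda item: math.dist(item[0], target))
    votes_for_b = sum(label for _, label in train[:k])
    return 1 if votes_for_b > k / 2 else 0


def misclassification_rate(test_angles, test_labels, group_a, group_b, k=5):
    """Share of test angles assigned to the wrong group."""
    errors = sum(
        dd_knn_classify(t, group_a, group_b, k) != label
        for t, label in zip(test_angles, test_labels)
    )
    return errors / len(test_angles)


if __name__ == "__main__":
    # Illustrative data only: two von Mises samples with opposite mean directions.
    rng = random.Random(0)
    group_a = [rng.vonmisesvariate(0.0, 8.0) for _ in range(50)]
    group_b = [rng.vonmisesvariate(math.pi, 8.0) for _ in range(50)]
    print(dd_knn_classify(0.1, group_a, group_b))
    print(dd_knn_classify(3.0, group_a, group_b))
```

The angular simplicial or angular Tukey depth could be swapped in by replacing `arc_distance_depth`, since the classifier only sees the resulting DD-plot coordinates.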