{"title":"任意维空间中基于点向范数的数据聚类","authors":"Soumita Modak","doi":"10.1080/23737484.2023.2199952","DOIUrl":null,"url":null,"abstract":"ABSTRACT In this paper, a novel nonparametric norm-based clustering algorithm is proposed to classify real-valued continuous data sets given in arbitrary dimensional space. For data univariate, multivariate or high-dimensional, with the number of study variables close to or larger than the data size, our straightforward algorithm is implemented throughout under an univariate set-up, where we make use of the observation-wise (or pointwise) norms which quantify the distances of the observations from the origin zero or the null vector. The method begins with determination of the sample quantile using nonparamteric bootstrapping on the computed norms and always converges independently. By its design, the suggested algorithm is fast enough to detect the number of existing clusters itself and to form well-defined groups. Data study demonstrates its competitiveness in comparison to 2 popular clustering algorithms K-means and K-medoids.","PeriodicalId":36561,"journal":{"name":"Communications in Statistics Case Studies Data Analysis and Applications","volume":"10 1","pages":"121 - 134"},"PeriodicalIF":0.0000,"publicationDate":"2023-04-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Pointwise norm-based clustering of data in arbitrary dimensional space\",\"authors\":\"Soumita Modak\",\"doi\":\"10.1080/23737484.2023.2199952\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"ABSTRACT In this paper, a novel nonparametric norm-based clustering algorithm is proposed to classify real-valued continuous data sets given in arbitrary dimensional space. For data univariate, multivariate or high-dimensional, with the number of study variables close to or larger than the data size, our straightforward algorithm is implemented throughout under an univariate set-up, where we make use of the observation-wise (or pointwise) norms which quantify the distances of the observations from the origin zero or the null vector. The method begins with determination of the sample quantile using nonparamteric bootstrapping on the computed norms and always converges independently. By its design, the suggested algorithm is fast enough to detect the number of existing clusters itself and to form well-defined groups. Data study demonstrates its competitiveness in comparison to 2 popular clustering algorithms K-means and K-medoids.\",\"PeriodicalId\":36561,\"journal\":{\"name\":\"Communications in Statistics Case Studies Data Analysis and Applications\",\"volume\":\"10 1\",\"pages\":\"121 - 134\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-04-03\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Communications in Statistics Case Studies Data Analysis and Applications\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1080/23737484.2023.2199952\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q4\",\"JCRName\":\"Mathematics\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Communications in Statistics Case Studies Data Analysis and Applications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1080/23737484.2023.2199952","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"Mathematics","Score":null,"Total":0}
Pointwise norm-based clustering of data in arbitrary dimensional space
ABSTRACT In this paper, a novel nonparametric norm-based clustering algorithm is proposed to classify real-valued continuous data sets given in arbitrary dimensional space. For data univariate, multivariate or high-dimensional, with the number of study variables close to or larger than the data size, our straightforward algorithm is implemented throughout under an univariate set-up, where we make use of the observation-wise (or pointwise) norms which quantify the distances of the observations from the origin zero or the null vector. The method begins with determination of the sample quantile using nonparamteric bootstrapping on the computed norms and always converges independently. By its design, the suggested algorithm is fast enough to detect the number of existing clusters itself and to form well-defined groups. Data study demonstrates its competitiveness in comparison to 2 popular clustering algorithms K-means and K-medoids.