Examining distributional characteristics of clusters.

Bulletin de la Societe des sciences medicales du Grand-Duche de Luxembourg Pub Date : 2010-01-01

A von Eye

{"title":"Examining distributional characteristics of clusters.","authors":"A von Eye","doi":"","DOIUrl":null,"url":null,"abstract":"<p><p>Standard cluster analysis creates clusters based on the criterion that their members be closer to each other than to members of other clusters. In this article, it is proposed to examine empirical clusters that result from standard clustering, with the goal of assessing whether they contradict distributional assumptions. Four models are proposed. The models consider two data generation processes, the Poisson and the multinormal, as well as two convex shapes of cluster hulls, the spherical and the ellipsoidal. Based on the model, the probability of being in a cluster of a given location, size, and shape is estimated. This probability is compared with the observed proportion of cases. The observed proportion can turn out to be larger, as large, or smaller than expected. Examples are given using simulated and empirical data. The simulation showed that the size of a cluster, the data generation process, and the true distribution of data have the strongest effect on the results obtained with the proposed method. The empirical examples discuss distributional characteristics of cross-sectional and longitudinal clusters of aggressive behavior in adolescents. The examples show that clustering methods do not always yield clusters that contradict distributional assumptions. Some clusters contain even fewer cases than expected.</p>","PeriodicalId":72476,"journal":{"name":"Bulletin de la Societe des sciences medicales du Grand-Duche de Luxembourg","volume":"Spec No 1 1","pages":"14-39"},"PeriodicalIF":0.0000,"publicationDate":"2010-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Bulletin de la Societe des sciences medicales du Grand-Duche de Luxembourg","FirstCategoryId":"1085","ListUrlMain":"","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Standard cluster analysis creates clusters based on the criterion that their members be closer to each other than to members of other clusters. In this article, it is proposed to examine empirical clusters that result from standard clustering, with the goal of assessing whether they contradict distributional assumptions. Four models are proposed. The models consider two data generation processes, the Poisson and the multinormal, as well as two convex shapes of cluster hulls, the spherical and the ellipsoidal. Based on the model, the probability of being in a cluster of a given location, size, and shape is estimated. This probability is compared with the observed proportion of cases. The observed proportion can turn out to be larger, as large, or smaller than expected. Examples are given using simulated and empirical data. The simulation showed that the size of a cluster, the data generation process, and the true distribution of data have the strongest effect on the results obtained with the proposed method. The empirical examples discuss distributional characteristics of cross-sectional and longitudinal clusters of aggressive behavior in adolescents. The examples show that clustering methods do not always yield clusters that contradict distributional assumptions. Some clusters contain even fewer cases than expected.

微信好友朋友圈 QQ好友复制链接

本刊更多论文

研究集群的分布特征。

标准聚类分析是根据聚类的成员彼此之间的距离比其他聚类的成员更近的标准来创建聚类的。在这篇文章中，我们提出了检验由标准聚类产生的经验聚类，目的是评估它们是否与分布假设相矛盾。提出了四种模型。该模型考虑了两种数据生成过程，泊松和多正态，以及两种凸形状的簇壳，球形和椭球体。基于该模型，估计在给定位置、大小和形状的集群中的概率。将这个概率与观察到的病例比例进行比较。观察到的比例可能比预期的更大、同样大或更小。用模拟数据和经验数据给出了实例。仿真结果表明，聚类的大小、数据的生成过程和数据的真实分布对所提方法得到的结果影响最大。实证分析了青少年攻击行为横截面和纵向集群的分布特征。实例表明，聚类方法并不总是产生与分布假设相矛盾的聚类。一些集群包含的病例甚至比预期的还要少。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Bulletin de la Societe des sciences medicales du Grand-Duche de Luxembourg

自引率

0.00%

发文量

期刊最新文献

Immune Network Case Report: Primary Spinal Lymphoma. [In process]. Treating the emotional and motivational inhibition of highly gifted underachievers with music psychotherapy: Meta-analysis of an evaluation study based on a sequential design. [In process]