Comparison of K-Means, BIRCH and Hierarchical Clustering Algorithms in Clustering OCD Symptom Data

Public Research Journal of Engineering, Data Technology and Computer Science Pub Date : 2024-02-01 DOI:10.57152/predatecs.v1i2.1106

Alika Rahmarsyarah Rizalde, Haykal Alya Mubarak, Gilang Ramadhan, Mohd. Adzka Fatan

{"title":"Comparison of K-Means, BIRCH and Hierarchical Clustering Algorithms in Clustering OCD Symptom Data","authors":"Alika Rahmarsyarah Rizalde, Haykal Alya Mubarak, Gilang Ramadhan, Mohd. Adzka Fatan","doi":"10.57152/predatecs.v1i2.1106","DOIUrl":null,"url":null,"abstract":"The hallmarks of Obsessive-Compulsive Disorder (OCD) are intrusive, anxiety-inducing thoughts (called obsessions) and associated repeated activities (called compulsions). To understand the patterns and relationships between OCD data that have been obtained, data will be grouped (clustering). In clustering using several clustering algorithms, namely K-Means, BIRCH, In this work, hierarchical clustering was used to identify the optimal cluster value comparison, and the Davies Bouldin Index (DBI) was used to confirm the results. Then the results of the best cluster value in processing OCD data are using the BIRCH algorithm in the K10 experiment which gets a value of 1.3. While the K-Means algorithm obtained the best cluster at K10 with a value obtained of 1.36 and the Hierarchical clustering algorithm also at the K10 value of 2.03. Thus in this study, the comparison results of the application of 3 clustering algorithms obtained results, namely the BIRCH algorithm shows the value of the resulting cluster is the best in clustering OCD data. This means that the BIRCH algorithm can be used to cluster OCD data more accurately and efficiently.","PeriodicalId":516904,"journal":{"name":"Public Research Journal of Engineering, Data Technology and Computer Science","volume":"75 ","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Public Research Journal of Engineering, Data Technology and Computer Science","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.57152/predatecs.v1i2.1106","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

The hallmarks of Obsessive-Compulsive Disorder (OCD) are intrusive, anxiety-inducing thoughts (called obsessions) and associated repeated activities (called compulsions). To understand the patterns and relationships between OCD data that have been obtained, data will be grouped (clustering). In clustering using several clustering algorithms, namely K-Means, BIRCH, In this work, hierarchical clustering was used to identify the optimal cluster value comparison, and the Davies Bouldin Index (DBI) was used to confirm the results. Then the results of the best cluster value in processing OCD data are using the BIRCH algorithm in the K10 experiment which gets a value of 1.3. While the K-Means algorithm obtained the best cluster at K10 with a value obtained of 1.36 and the Hierarchical clustering algorithm also at the K10 value of 2.03. Thus in this study, the comparison results of the application of 3 clustering algorithms obtained results, namely the BIRCH algorithm shows the value of the resulting cluster is the best in clustering OCD data. This means that the BIRCH algorithm can be used to cluster OCD data more accurately and efficiently.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

K-Means 算法、BIRCH 算法和分层聚类算法在强迫症症状数据聚类中的比较

强迫症（OCD）的特征是侵入性的、引起焦虑的想法（称为强迫症）和相关的重复活动（称为强迫症）。为了解强迫症数据之间的模式和关系，将对数据进行分组（聚类）。在使用几种聚类算法进行聚类时，即 K-Means、BIRCH、在这项工作中，使用分层聚类来确定最佳聚类值比较，并使用戴维斯-博尔丁指数（DBI）来确认结果。然后，在 K10 实验中使用 BIRCH 算法处理 OCD 数据的最佳聚类值结果为 1.3。而 K-Means 算法在 K10 得到的最佳聚类值为 1.36，层次聚类算法的 K10 值也为 2.03。因此，在本研究中，应用 3 种聚类算法得到的结果比较结果显示，BIRCH 算法得到的聚类值在 OCD 数据聚类中是最好的。这说明 BIRCH 算法可以更准确、更高效地对 OCD 数据进行聚类。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Public Research Journal of Engineering, Data Technology and Computer Science

自引率

0.00%

发文量