检测网瘾的聚类和分类方法的经验比较

CTE Workshop Proceedings Pub Date : 2023-12-17 DOI:10.55056/cte.664

O. Klochko, V. Fedorets, V. I. Klochko

{"title":"检测网瘾的聚类和分类方法的经验比较","authors":"O. Klochko, V. Fedorets, V. I. Klochko","doi":"10.55056/cte.664","DOIUrl":null,"url":null,"abstract":"Machine learning methods for clustering and classification are widely used in various domains. However, their performance and applicability may depend on the characteristics of the data and the problem. In this paper, we present an empirical comparison of several clustering and classification methods using WEKA, a free software for machine learning. We apply these methods to the data collected from surveys of students from different majors, aiming to detect the signs of Internet addiction (IA), a behavioural disorder caused by excessive Internet use. We use Expectation Maximization, Farthest First and K-Means for clustering, and AdaBoost, Bagging, Random Forest and Vote for classification. We evaluate the methods based on their accuracy, complexity and interpretability. We also describe the models developed by these methods and discuss their implications for identifying the respondents with IA symptoms and risk groups. The results show that these methods can be effectively used for clustering and classifying IA-related data. However, they have different strengths and limitations when choosing the best method for a specific task.","PeriodicalId":240357,"journal":{"name":"CTE Workshop Proceedings","volume":"5 8","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2023-12-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Empirical comparison of clustering and classification methods for detecting Internet addiction\",\"authors\":\"O. Klochko, V. Fedorets, V. I. Klochko\",\"doi\":\"10.55056/cte.664\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Machine learning methods for clustering and classification are widely used in various domains. However, their performance and applicability may depend on the characteristics of the data and the problem. In this paper, we present an empirical comparison of several clustering and classification methods using WEKA, a free software for machine learning. We apply these methods to the data collected from surveys of students from different majors, aiming to detect the signs of Internet addiction (IA), a behavioural disorder caused by excessive Internet use. We use Expectation Maximization, Farthest First and K-Means for clustering, and AdaBoost, Bagging, Random Forest and Vote for classification. We evaluate the methods based on their accuracy, complexity and interpretability. We also describe the models developed by these methods and discuss their implications for identifying the respondents with IA symptoms and risk groups. The results show that these methods can be effectively used for clustering and classifying IA-related data. However, they have different strengths and limitations when choosing the best method for a specific task.\",\"PeriodicalId\":240357,\"journal\":{\"name\":\"CTE Workshop Proceedings\",\"volume\":\"5 8\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-12-17\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"CTE Workshop Proceedings\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.55056/cte.664\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"CTE Workshop Proceedings","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.55056/cte.664","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

用于聚类和分类的机器学习方法被广泛应用于各个领域。然而，它们的性能和适用性可能取决于数据和问题的特征。在本文中，我们使用免费的机器学习软件 WEKA 对几种聚类和分类方法进行了实证比较。我们将这些方法应用于从不同专业学生的调查中收集的数据，目的是检测网络成瘾（IA）的迹象，这是一种因过度使用互联网而导致的行为障碍。我们使用期望最大化、最远优先和 K-Means 进行聚类，并使用 AdaBoost、Bagging、Random Forest 和 Vote 进行分类。我们根据这些方法的准确性、复杂性和可解释性对其进行评估。我们还描述了这些方法所建立的模型，并讨论了这些模型对识别具有内分泌失调症状的受访者和风险群体的影响。结果表明，这些方法可以有效地用于 IA 相关数据的聚类和分类。然而，在为特定任务选择最佳方法时，这些方法具有不同的优势和局限性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Empirical comparison of clustering and classification methods for detecting Internet addiction

Machine learning methods for clustering and classification are widely used in various domains. However, their performance and applicability may depend on the characteristics of the data and the problem. In this paper, we present an empirical comparison of several clustering and classification methods using WEKA, a free software for machine learning. We apply these methods to the data collected from surveys of students from different majors, aiming to detect the signs of Internet addiction (IA), a behavioural disorder caused by excessive Internet use. We use Expectation Maximization, Farthest First and K-Means for clustering, and AdaBoost, Bagging, Random Forest and Vote for classification. We evaluate the methods based on their accuracy, complexity and interpretability. We also describe the models developed by these methods and discuss their implications for identifying the respondents with IA symptoms and risk groups. The results show that these methods can be effectively used for clustering and classifying IA-related data. However, they have different strengths and limitations when choosing the best method for a specific task.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

CTE Workshop Proceedings

自引率

0.00%

发文量