Agatha Silvani Sekarningtyas, M. A. Ayu, T. Mantoro
{"title":"基于大五理论的推特用户性格分类的k近邻算法","authors":"Agatha Silvani Sekarningtyas, M. A. Ayu, T. Mantoro","doi":"10.1109/ICCED53389.2021.9664857","DOIUrl":null,"url":null,"abstract":"Social media is an application or website-based system that enables users to create and share content or participate in social networking that allows its user to share their thoughts, opinions, or feelings that represent their personality. At present several studies to classify an individual's personality through social media have been developed, especially on social media Twitter. However, most of the analysis on Twitter only uses text based data such as posted tweets. This research presents a study on analyzing the users’ twitter data to classify their types of personality based on Big Five Theory by using their social statistic data. The data were acquired using Twitter API which was taken from Indonesian users with the total of 225 data. This study shows that using K-Nearest Neighbor (K-NN) Algorithm for classification of these data were not resulting in high accuracy. However, this study has shown that amount and balance distribution of training data critically contribute to the performance of classification process.","PeriodicalId":6800,"journal":{"name":"2021 IEEE 7th International Conference on Computing, Engineering and Design (ICCED)","volume":"5 1","pages":"1-6"},"PeriodicalIF":0.0000,"publicationDate":"2021-08-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Using K-Nearest Neighbor Algorithm for Personality Classification of Twitter’s Users Based on the Big Five Theory\",\"authors\":\"Agatha Silvani Sekarningtyas, M. A. Ayu, T. Mantoro\",\"doi\":\"10.1109/ICCED53389.2021.9664857\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Social media is an application or website-based system that enables users to create and share content or participate in social networking that allows its user to share their thoughts, opinions, or feelings that represent their personality. At present several studies to classify an individual's personality through social media have been developed, especially on social media Twitter. However, most of the analysis on Twitter only uses text based data such as posted tweets. This research presents a study on analyzing the users’ twitter data to classify their types of personality based on Big Five Theory by using their social statistic data. The data were acquired using Twitter API which was taken from Indonesian users with the total of 225 data. This study shows that using K-Nearest Neighbor (K-NN) Algorithm for classification of these data were not resulting in high accuracy. However, this study has shown that amount and balance distribution of training data critically contribute to the performance of classification process.\",\"PeriodicalId\":6800,\"journal\":{\"name\":\"2021 IEEE 7th International Conference on Computing, Engineering and Design (ICCED)\",\"volume\":\"5 1\",\"pages\":\"1-6\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-08-05\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 IEEE 7th International Conference on Computing, Engineering and Design (ICCED)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICCED53389.2021.9664857\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE 7th International Conference on Computing, Engineering and Design (ICCED)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCED53389.2021.9664857","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Using K-Nearest Neighbor Algorithm for Personality Classification of Twitter’s Users Based on the Big Five Theory
Social media is an application or website-based system that enables users to create and share content or participate in social networking that allows its user to share their thoughts, opinions, or feelings that represent their personality. At present several studies to classify an individual's personality through social media have been developed, especially on social media Twitter. However, most of the analysis on Twitter only uses text based data such as posted tweets. This research presents a study on analyzing the users’ twitter data to classify their types of personality based on Big Five Theory by using their social statistic data. The data were acquired using Twitter API which was taken from Indonesian users with the total of 225 data. This study shows that using K-Nearest Neighbor (K-NN) Algorithm for classification of these data were not resulting in high accuracy. However, this study has shown that amount and balance distribution of training data critically contribute to the performance of classification process.