Peng Su, W. Mao, D. Zeng, Xiaochen Li, Fei-Yue Wang
{"title":"Handling Class Imbalance Problem in Cultural Modeling","authors":"Peng Su, W. Mao, D. Zeng, Xiaochen Li, Fei-Yue Wang","doi":"10.1109/ISI.2009.5137320","DOIUrl":null,"url":null,"abstract":"Cultural modeling is an emergent and promising research area in social computing. It aims at developing behavioral models of groups and analyzing the impact of culture factors on group behavior using computational methods. Machine learning methods in particular classification, play a central role in such applications. In cultural modeling, it is expected that classifiers yield good performance. However, the performance of standard classifiers is often severely hindered in practice due to the imbalanced distribution of class in cultural data. In this paper, we identify class imbalance problem in cultural modeling domain. To handle the problem, we propose a user involved solution employing the receiver operating characteristic (ROC) analysis for classification algorithms with sampling approaches. Finally, we conduct experiment to verify the effectiveness of the proposed solution.","PeriodicalId":210911,"journal":{"name":"2009 IEEE International Conference on Intelligence and Security Informatics","volume":"157 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2009-06-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2009 IEEE International Conference on Intelligence and Security Informatics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISI.2009.5137320","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 6
Abstract
Cultural modeling is an emergent and promising research area in social computing. It aims at developing behavioral models of groups and analyzing the impact of culture factors on group behavior using computational methods. Machine learning methods in particular classification, play a central role in such applications. In cultural modeling, it is expected that classifiers yield good performance. However, the performance of standard classifiers is often severely hindered in practice due to the imbalanced distribution of class in cultural data. In this paper, we identify class imbalance problem in cultural modeling domain. To handle the problem, we propose a user involved solution employing the receiver operating characteristic (ROC) analysis for classification algorithms with sampling approaches. Finally, we conduct experiment to verify the effectiveness of the proposed solution.