{"title":"基于多策略集成学习的不平衡情感分类","authors":"Zhongqing Wang, Shoushan Li, Guodong Zhou, Peifeng Li, Qiaoming Zhu","doi":"10.1109/IALP.2011.28","DOIUrl":null,"url":null,"abstract":"Recently, sentiment classification has become a hot research topic in natural language processing. But most existing studies assume that the samples in the negative and positive categories are balanced, which might not be true in real applications. In this paper, we investigate sentiment classification tasks where the class distribution of the sam-ples is imbalanced. To handle the imbalanced problem, we propose a multi-strategy ensemble learning approach to this problem. Our ensemble approach integrates sample-ensemble, feature-ensemble, and classifier-ensemble by ex-ploiting multiple classification algorithms. Evaluation across four domains shows that our ensemble approach outper-forms many other popular approaches that handling imbal-anced classification problems, such as re-sampling and cost-sensitive approaches, and is proven effective for imbalanced sentiment classification.","PeriodicalId":297167,"journal":{"name":"2011 International Conference on Asian Language Processing","volume":"13 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2011-11-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":"{\"title\":\"Imbalanced Sentiment Classification with Multi-strategy Ensemble Learning\",\"authors\":\"Zhongqing Wang, Shoushan Li, Guodong Zhou, Peifeng Li, Qiaoming Zhu\",\"doi\":\"10.1109/IALP.2011.28\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Recently, sentiment classification has become a hot research topic in natural language processing. But most existing studies assume that the samples in the negative and positive categories are balanced, which might not be true in real applications. In this paper, we investigate sentiment classification tasks where the class distribution of the sam-ples is imbalanced. To handle the imbalanced problem, we propose a multi-strategy ensemble learning approach to this problem. Our ensemble approach integrates sample-ensemble, feature-ensemble, and classifier-ensemble by ex-ploiting multiple classification algorithms. Evaluation across four domains shows that our ensemble approach outper-forms many other popular approaches that handling imbal-anced classification problems, such as re-sampling and cost-sensitive approaches, and is proven effective for imbalanced sentiment classification.\",\"PeriodicalId\":297167,\"journal\":{\"name\":\"2011 International Conference on Asian Language Processing\",\"volume\":\"13 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2011-11-15\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"6\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2011 International Conference on Asian Language Processing\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/IALP.2011.28\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2011 International Conference on Asian Language Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IALP.2011.28","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Imbalanced Sentiment Classification with Multi-strategy Ensemble Learning
Recently, sentiment classification has become a hot research topic in natural language processing. But most existing studies assume that the samples in the negative and positive categories are balanced, which might not be true in real applications. In this paper, we investigate sentiment classification tasks where the class distribution of the sam-ples is imbalanced. To handle the imbalanced problem, we propose a multi-strategy ensemble learning approach to this problem. Our ensemble approach integrates sample-ensemble, feature-ensemble, and classifier-ensemble by ex-ploiting multiple classification algorithms. Evaluation across four domains shows that our ensemble approach outper-forms many other popular approaches that handling imbal-anced classification problems, such as re-sampling and cost-sensitive approaches, and is proven effective for imbalanced sentiment classification.