Wilawan Yathongkhum, Y. Laosiritaworn, Jakramate Bootkrajang, Pucktada Treeratpituk, Jeerayut Chaijaruwanich
{"title":"基于类别相关特征集的经济和财经新闻混合分类","authors":"Wilawan Yathongkhum, Y. Laosiritaworn, Jakramate Bootkrajang, Pucktada Treeratpituk, Jeerayut Chaijaruwanich","doi":"10.3233/ida-237373","DOIUrl":null,"url":null,"abstract":"A large amount of economic and financial news is now accessible through various news websites and social media platforms. Categorizing them into appropriate categories can be advantageous for various tasks, such as sentiment analysis and news-based market prediction. Unfortunately, news headlines categories may contain ambiguities due to the subjective nature of label assignment by authors or publishers. Consequently, achieving precise classification of news can be time-consuming and still reliant on human expertise. To tackle this challenging task, we proposed a hybrid approach to enhance the performance of economic and financial news classification. This approach combines baseline classifiers with a novel method called the Category Associated Feature Set (CAFS) classifier. CAFS transforms text input from the lexicon-space into the entity-space and discovers associations between entities and classes, akin to association rule learning. Experimental results on three datasets demonstrated that the proposed method is comparable to existing approaches and exhibits a significant improvement in the classification results for out-of-domain datasets. Additionally, employing CAFS in tandem with the existing text classification baselines can provide a general categorizer for distinguishing news categories across various sources without the need for extensive fine-tuning of the parameters associated with those classification baselines. This confirms that utilizing CAFS in a hybrid approach is appropriate and suitable for economic and financial news classification.","PeriodicalId":50355,"journal":{"name":"Intelligent Data Analysis","volume":"5 1","pages":""},"PeriodicalIF":0.9000,"publicationDate":"2023-11-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Economic and financial news hybrid- classification based on category-associated feature set\",\"authors\":\"Wilawan Yathongkhum, Y. Laosiritaworn, Jakramate Bootkrajang, Pucktada Treeratpituk, Jeerayut Chaijaruwanich\",\"doi\":\"10.3233/ida-237373\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"A large amount of economic and financial news is now accessible through various news websites and social media platforms. Categorizing them into appropriate categories can be advantageous for various tasks, such as sentiment analysis and news-based market prediction. Unfortunately, news headlines categories may contain ambiguities due to the subjective nature of label assignment by authors or publishers. Consequently, achieving precise classification of news can be time-consuming and still reliant on human expertise. To tackle this challenging task, we proposed a hybrid approach to enhance the performance of economic and financial news classification. This approach combines baseline classifiers with a novel method called the Category Associated Feature Set (CAFS) classifier. CAFS transforms text input from the lexicon-space into the entity-space and discovers associations between entities and classes, akin to association rule learning. Experimental results on three datasets demonstrated that the proposed method is comparable to existing approaches and exhibits a significant improvement in the classification results for out-of-domain datasets. Additionally, employing CAFS in tandem with the existing text classification baselines can provide a general categorizer for distinguishing news categories across various sources without the need for extensive fine-tuning of the parameters associated with those classification baselines. This confirms that utilizing CAFS in a hybrid approach is appropriate and suitable for economic and financial news classification.\",\"PeriodicalId\":50355,\"journal\":{\"name\":\"Intelligent Data Analysis\",\"volume\":\"5 1\",\"pages\":\"\"},\"PeriodicalIF\":0.9000,\"publicationDate\":\"2023-11-30\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Intelligent Data Analysis\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://doi.org/10.3233/ida-237373\",\"RegionNum\":4,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q4\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Intelligent Data Analysis","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.3233/ida-237373","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
Economic and financial news hybrid- classification based on category-associated feature set
A large amount of economic and financial news is now accessible through various news websites and social media platforms. Categorizing them into appropriate categories can be advantageous for various tasks, such as sentiment analysis and news-based market prediction. Unfortunately, news headlines categories may contain ambiguities due to the subjective nature of label assignment by authors or publishers. Consequently, achieving precise classification of news can be time-consuming and still reliant on human expertise. To tackle this challenging task, we proposed a hybrid approach to enhance the performance of economic and financial news classification. This approach combines baseline classifiers with a novel method called the Category Associated Feature Set (CAFS) classifier. CAFS transforms text input from the lexicon-space into the entity-space and discovers associations between entities and classes, akin to association rule learning. Experimental results on three datasets demonstrated that the proposed method is comparable to existing approaches and exhibits a significant improvement in the classification results for out-of-domain datasets. Additionally, employing CAFS in tandem with the existing text classification baselines can provide a general categorizer for distinguishing news categories across various sources without the need for extensive fine-tuning of the parameters associated with those classification baselines. This confirms that utilizing CAFS in a hybrid approach is appropriate and suitable for economic and financial news classification.
期刊介绍:
Intelligent Data Analysis provides a forum for the examination of issues related to the research and applications of Artificial Intelligence techniques in data analysis across a variety of disciplines. These techniques include (but are not limited to): all areas of data visualization, data pre-processing (fusion, editing, transformation, filtering, sampling), data engineering, database mining techniques, tools and applications, use of domain knowledge in data analysis, big data applications, evolutionary algorithms, machine learning, neural nets, fuzzy logic, statistical pattern recognition, knowledge filtering, and post-processing. In particular, papers are preferred that discuss development of new AI related data analysis architectures, methodologies, and techniques and their applications to various domains.