{"title":"Using Case-Based Reasoning Approach for Text Classification Task","authors":"Igor Nikonov, I. Kurilenko","doi":"10.1109/REEPE49198.2020.9059249","DOIUrl":null,"url":null,"abstract":"This paper discusses the method of solving the text classification task by using case-based reasoning. As a part of the text classification module in the implemented system, modified TF-IDF (Term Frequency - Inverse Document Frequency) measure of the text content based on the known information on the distribution of documents between different categories is proposed. This modification allows to improve the quality of the classification by considering the information on the distribution of the words in the entire case base. Implementation of the prototype of text classification module used in the interactive voice response system is presented. This module allows to get away from button menus to automated voice interaction between the user and the system. The result of computational experiments confirming the effectiveness of the developed system is proposed.","PeriodicalId":142369,"journal":{"name":"2020 International Youth Conference on Radio Electronics, Electrical and Power Engineering (REEPE)","volume":"8 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 International Youth Conference on Radio Electronics, Electrical and Power Engineering (REEPE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/REEPE49198.2020.9059249","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
This paper discusses the method of solving the text classification task by using case-based reasoning. As a part of the text classification module in the implemented system, modified TF-IDF (Term Frequency - Inverse Document Frequency) measure of the text content based on the known information on the distribution of documents between different categories is proposed. This modification allows to improve the quality of the classification by considering the information on the distribution of the words in the entire case base. Implementation of the prototype of text classification module used in the interactive voice response system is presented. This module allows to get away from button menus to automated voice interaction between the user and the system. The result of computational experiments confirming the effectiveness of the developed system is proposed.