Comparative Analysis of Globalisation Techniques for Medical Document Classification
Authors: B. Parlak, S. Aydemir
DOI: 10.55195/jscai.1216800 (https://doi.org/10.55195/jscai.1216800)
Journal: Journal of Artificial Intelligence and Soft Computing Research
Published: 2023-02-17
Citation count: 0
Abstract
Medical document classification is an important topic in text mining, and globalisation techniques play a major role in text classification. This study presents a detailed analysis of two datasets built from medical text summaries of Turkish articles: one contains the Turkish summaries and the other contains the English summaries of the same articles. The aim is to observe how successful local feature selection methods affect classification performance on these two equivalent datasets when different globalisation techniques are applied. The feature selection methods are CHI2, MI, OR, and WLLR; the globalisation techniques are SUM, AVG, and MAX; the classifiers are MNB, DT, and SVM.
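As an illustration of what a globalisation technique does, the sketch below collapses per-class (local) feature scores into a single global score per term under the SUM, AVG, and MAX policies. It is not the authors' implementation. The function names `globalise` and `top_terms`, the toy scores, and the class names are hypothetical, and AVG is assumed here to be the class-prior-weighted average, which is a common definition but may differ from the one used in the paper.

```python
from typing import Dict, List

# Hypothetical local scores: term -> {class -> score}. In practice these would
# come from a local feature selection method such as CHI2, MI, OR, or WLLR.
LOCAL_SCORES: Dict[str, Dict[str, float]] = {
    "insulin": {"endocrinology": 4.0, "cardiology": 0.5},
    "stent": {"endocrinology": 0.25, "cardiology": 3.0},
    "patient": {"endocrinology": 1.5, "cardiology": 1.5},
}

# Hypothetical class priors P(c), used only by the AVG policy.
CLASS_PRIORS: Dict[str, float] = {"endocrinology": 0.25, "cardiology": 0.75}


def globalise(
    local_scores: Dict[str, Dict[str, float]],
    class_priors: Dict[str, float],
    policy: str,
) -> Dict[str, float]:
    """Collapse per-class scores into one global score per term."""
    global_scores: Dict[str, float] = {}
    for term, per_class in local_scores.items():
        if policy == "SUM":
            # Add the local scores over all classes.
            score = sum(per_class.values())
        elif policy == "AVG":
            # Assumption: average weighted by the class prior P(c).
            score = sum(class_priors[c] * s for c, s in per_class.items())
        elif policy == "MAX":
            # Keep the strongest local score.
            score = max(per_class.values())
        else:
            raise ValueError(f"unknown globalisation policy: {policy}")
        global_scores[term] = score
    return global_scores


def top_terms(global_scores: Dict[str, float], k: int) -> List[str]:
    """Return the k highest-scoring terms, breaking ties alphabetically."""
    return sorted(global_scores, key=lambda t: (-global_scores[t], t))[:k]


if __name__ == "__main__":
    for policy in ("SUM", "AVG", "MAX"):
        scores = globalise(LOCAL_SCORES, CLASS_PRIORS, policy)
        print(policy, top_terms(scores, 2))
```

With these toy numbers, SUM and MAX both select "insulin" and "stent", while AVG selects "stent" and "patient" because the larger class dominates the weighted average. This shows how the choice of policy can change the feature set passed to the classifier.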
About the journal:
Journal of Artificial Intelligence and Soft Computing Research (available also at Sciendo (De Gruyter)) is a dynamically developing international journal focused on the latest scientific results and methods constituting traditional artificial intelligence methods and soft computing techniques. Our goal is to bring together scientists representing both approaches and various research communities.