P. Paokanta, M. Ceccarelli, Somdat Srichairatanakool
{"title":"用于筛选β-地中海贫血的机器学习技术的数据类型分类性能的效率","authors":"P. Paokanta, M. Ceccarelli, Somdat Srichairatanakool","doi":"10.1109/ISABEL.2010.5702769","DOIUrl":null,"url":null,"abstract":"Performance of classification methods using Machine Learning Techniques majority depends on the quality of data were used in learning. The transformation techniques are used to increase the efficiency of classification because each type of data is suitable for different classification techniques. This study is aimed at providing comparative performance of different classification techniques by changing the type of data to find the appropriate type of data for each technique. The ß-Thalassemia data is used for classifying genotypes of ß-Thalassemia patients. The results of this study show that the types of data are Nominal scale which can be used as well for Bayesian Networks (BNs) and Multinomial Logistic Regression with the percentage of accuracy 85.83 and 84.25 respectively. Moreover, the data types which such as Interval scale can be used appropriately for K-Nearest Neighbors (KNN), Multi-Layer Perceptron (MLP) and NaiveBayes with the percentage of accuracy 88.98, 87.40 and 84.25 respectively. In the future, we will study the impacts of data separation to be used for classifying genotypes of patients with Thalassemia using the other classification techniques.","PeriodicalId":165367,"journal":{"name":"2010 3rd International Symposium on Applied Sciences in Biomedical and Communication Technologies (ISABEL 2010)","volume":"415 4","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2010-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"16","resultStr":"{\"title\":\"The effeciency of data types for classification performance of Machine Learning Techniques for screening β-Thalassemia\",\"authors\":\"P. Paokanta, M. Ceccarelli, Somdat Srichairatanakool\",\"doi\":\"10.1109/ISABEL.2010.5702769\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Performance of classification methods using Machine Learning Techniques majority depends on the quality of data were used in learning. The transformation techniques are used to increase the efficiency of classification because each type of data is suitable for different classification techniques. This study is aimed at providing comparative performance of different classification techniques by changing the type of data to find the appropriate type of data for each technique. The ß-Thalassemia data is used for classifying genotypes of ß-Thalassemia patients. The results of this study show that the types of data are Nominal scale which can be used as well for Bayesian Networks (BNs) and Multinomial Logistic Regression with the percentage of accuracy 85.83 and 84.25 respectively. Moreover, the data types which such as Interval scale can be used appropriately for K-Nearest Neighbors (KNN), Multi-Layer Perceptron (MLP) and NaiveBayes with the percentage of accuracy 88.98, 87.40 and 84.25 respectively. In the future, we will study the impacts of data separation to be used for classifying genotypes of patients with Thalassemia using the other classification techniques.\",\"PeriodicalId\":165367,\"journal\":{\"name\":\"2010 3rd International Symposium on Applied Sciences in Biomedical and Communication Technologies (ISABEL 2010)\",\"volume\":\"415 4\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2010-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"16\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2010 3rd International Symposium on Applied Sciences in Biomedical and Communication Technologies (ISABEL 2010)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ISABEL.2010.5702769\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2010 3rd International Symposium on Applied Sciences in Biomedical and Communication Technologies (ISABEL 2010)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISABEL.2010.5702769","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
The effeciency of data types for classification performance of Machine Learning Techniques for screening β-Thalassemia
Performance of classification methods using Machine Learning Techniques majority depends on the quality of data were used in learning. The transformation techniques are used to increase the efficiency of classification because each type of data is suitable for different classification techniques. This study is aimed at providing comparative performance of different classification techniques by changing the type of data to find the appropriate type of data for each technique. The ß-Thalassemia data is used for classifying genotypes of ß-Thalassemia patients. The results of this study show that the types of data are Nominal scale which can be used as well for Bayesian Networks (BNs) and Multinomial Logistic Regression with the percentage of accuracy 85.83 and 84.25 respectively. Moreover, the data types which such as Interval scale can be used appropriately for K-Nearest Neighbors (KNN), Multi-Layer Perceptron (MLP) and NaiveBayes with the percentage of accuracy 88.98, 87.40 and 84.25 respectively. In the future, we will study the impacts of data separation to be used for classifying genotypes of patients with Thalassemia using the other classification techniques.