{"title":"基于功能的smile代码分类特征选择","authors":"D. Ratnawati, Marjono, Widodo, S. Anam","doi":"10.1109/ISRITI48646.2019.9034619","DOIUrl":null,"url":null,"abstract":"The classification of the active compound based on their function is important to be done because most of them are unknown their function. The structure of the active compound can be represented by SMILES code that unique, compact and complete. Preprocessing SMILES code is a crucial task before SMILES codes are classified. In this research, preprocessing is extracting SMILES code into several features. The features must represent patterns or information from SMILES codes because the proper features of the SMILES codes will increase the accuracy of classification results. This paper uses features from SMILES codes directed by an expert and based on the previous research. Features will be normalized and are classified by an efficient and good classification method, Extreme Learning Machine (ELM). The experiment results show that first, adding features will increase the average of the accuracy of the system until 10.9% on dataset 1-3-4 (nerve-bacterial-cancer). Second, ELM is superior to SVM and KMNB in terms of both accuracy and processing time.","PeriodicalId":367363,"journal":{"name":"2019 International Seminar on Research of Information Technology and Intelligent Systems (ISRITI)","volume":"48 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"Features Selection for Classification of SMILES Codes Based on Their Function\",\"authors\":\"D. Ratnawati, Marjono, Widodo, S. Anam\",\"doi\":\"10.1109/ISRITI48646.2019.9034619\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The classification of the active compound based on their function is important to be done because most of them are unknown their function. The structure of the active compound can be represented by SMILES code that unique, compact and complete. Preprocessing SMILES code is a crucial task before SMILES codes are classified. In this research, preprocessing is extracting SMILES code into several features. The features must represent patterns or information from SMILES codes because the proper features of the SMILES codes will increase the accuracy of classification results. This paper uses features from SMILES codes directed by an expert and based on the previous research. Features will be normalized and are classified by an efficient and good classification method, Extreme Learning Machine (ELM). The experiment results show that first, adding features will increase the average of the accuracy of the system until 10.9% on dataset 1-3-4 (nerve-bacterial-cancer). Second, ELM is superior to SVM and KMNB in terms of both accuracy and processing time.\",\"PeriodicalId\":367363,\"journal\":{\"name\":\"2019 International Seminar on Research of Information Technology and Intelligent Systems (ISRITI)\",\"volume\":\"48 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-12-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2019 International Seminar on Research of Information Technology and Intelligent Systems (ISRITI)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ISRITI48646.2019.9034619\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 International Seminar on Research of Information Technology and Intelligent Systems (ISRITI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISRITI48646.2019.9034619","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Features Selection for Classification of SMILES Codes Based on Their Function
The classification of the active compound based on their function is important to be done because most of them are unknown their function. The structure of the active compound can be represented by SMILES code that unique, compact and complete. Preprocessing SMILES code is a crucial task before SMILES codes are classified. In this research, preprocessing is extracting SMILES code into several features. The features must represent patterns or information from SMILES codes because the proper features of the SMILES codes will increase the accuracy of classification results. This paper uses features from SMILES codes directed by an expert and based on the previous research. Features will be normalized and are classified by an efficient and good classification method, Extreme Learning Machine (ELM). The experiment results show that first, adding features will increase the average of the accuracy of the system until 10.9% on dataset 1-3-4 (nerve-bacterial-cancer). Second, ELM is superior to SVM and KMNB in terms of both accuracy and processing time.