Gretchen Bonilla-Caraballo, Manuel Rodriguez-Martinez
{"title":"从 SMILES 预测分子性质的深度学习方法。","authors":"Gretchen Bonilla-Caraballo, Manuel Rodriguez-Martinez","doi":"10.1007/978-3-031-67447-1_9","DOIUrl":null,"url":null,"abstract":"<p><p>Machine learning methods have been proposed in lieu of simulations to predict chemical properties of molecules. The trade-off here is paying for the training time once, in exchange for instant predictions on the input data. However, many of these methods rely heavily on feature engineering to prepare the data for these models. Moreover, the use of molecular structural information has been limited, despite having such information encoded in the Simplified Molecular Input Line Entry System (SMILES) format. In this paper we present a framework that relies on SMILES data to predict molecular properties. Our methods are based on 1-D Convolutional Networks and do not require complex feature engineering. Our methods can be applied to learn molecular properties from base data, thus making them accessible to a wider audience. Our experiments show that this method can predict the molecular weight and XLogP properties without any encoding of complex chemical rules.</p>","PeriodicalId":520278,"journal":{"name":"Proceedings of the International Symposium on Intelligent Computing and Networking 2024 : (ISICN 2024). International Symposium on Intelligent Computing and Networking (1st : 2024 : San Juan, P.R.)","volume":"1094 ","pages":"119-138"},"PeriodicalIF":0.0000,"publicationDate":"2024-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11529754/pdf/","citationCount":"0","resultStr":"{\"title\":\"Deep Learning Methods to Help Predict Properties of Molecules from SMILES.\",\"authors\":\"Gretchen Bonilla-Caraballo, Manuel Rodriguez-Martinez\",\"doi\":\"10.1007/978-3-031-67447-1_9\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Machine learning methods have been proposed in lieu of simulations to predict chemical properties of molecules. The trade-off here is paying for the training time once, in exchange for instant predictions on the input data. However, many of these methods rely heavily on feature engineering to prepare the data for these models. Moreover, the use of molecular structural information has been limited, despite having such information encoded in the Simplified Molecular Input Line Entry System (SMILES) format. In this paper we present a framework that relies on SMILES data to predict molecular properties. Our methods are based on 1-D Convolutional Networks and do not require complex feature engineering. Our methods can be applied to learn molecular properties from base data, thus making them accessible to a wider audience. Our experiments show that this method can predict the molecular weight and XLogP properties without any encoding of complex chemical rules.</p>\",\"PeriodicalId\":520278,\"journal\":{\"name\":\"Proceedings of the International Symposium on Intelligent Computing and Networking 2024 : (ISICN 2024). International Symposium on Intelligent Computing and Networking (1st : 2024 : San Juan, P.R.)\",\"volume\":\"1094 \",\"pages\":\"119-138\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11529754/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the International Symposium on Intelligent Computing and Networking 2024 : (ISICN 2024). International Symposium on Intelligent Computing and Networking (1st : 2024 : San Juan, P.R.)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1007/978-3-031-67447-1_9\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2024/8/8 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the International Symposium on Intelligent Computing and Networking 2024 : (ISICN 2024). International Symposium on Intelligent Computing and Networking (1st : 2024 : San Juan, P.R.)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1007/978-3-031-67447-1_9","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/8/8 0:00:00","PubModel":"Epub","JCR":"","JCRName":"","Score":null,"Total":0}
Deep Learning Methods to Help Predict Properties of Molecules from SMILES.
Machine learning methods have been proposed in lieu of simulations to predict chemical properties of molecules. The trade-off here is paying for the training time once, in exchange for instant predictions on the input data. However, many of these methods rely heavily on feature engineering to prepare the data for these models. Moreover, the use of molecular structural information has been limited, despite having such information encoded in the Simplified Molecular Input Line Entry System (SMILES) format. In this paper we present a framework that relies on SMILES data to predict molecular properties. Our methods are based on 1-D Convolutional Networks and do not require complex feature engineering. Our methods can be applied to learn molecular properties from base data, thus making them accessible to a wider audience. Our experiments show that this method can predict the molecular weight and XLogP properties without any encoding of complex chemical rules.