Gretchen Bonilla-Caraballo, Manuel Rodriguez-Martinez
{"title":"Deep Learning Methods to Help Predict Properties of Molecules from SMILES.","authors":"Gretchen Bonilla-Caraballo, Manuel Rodriguez-Martinez","doi":"10.1007/978-3-031-67447-1_9","DOIUrl":null,"url":null,"abstract":"<p><p>Machine learning methods have been proposed in lieu of simulations to predict chemical properties of molecules. The trade-off here is paying for the training time once, in exchange for instant predictions on the input data. However, many of these methods rely heavily on feature engineering to prepare the data for these models. Moreover, the use of molecular structural information has been limited, despite having such information encoded in the Simplified Molecular Input Line Entry System (SMILES) format. In this paper we present a framework that relies on SMILES data to predict molecular properties. Our methods are based on 1-D Convolutional Networks and do not require complex feature engineering. Our methods can be applied to learn molecular properties from base data, thus making them accessible to a wider audience. Our experiments show that this method can predict the molecular weight and XLogP properties without any encoding of complex chemical rules.</p>","PeriodicalId":520278,"journal":{"name":"Proceedings of the International Symposium on Intelligent Computing and Networking 2024 : (ISICN 2024). International Symposium on Intelligent Computing and Networking (1st : 2024 : San Juan, P.R.)","volume":"1094 ","pages":"119-138"},"PeriodicalIF":0.0000,"publicationDate":"2024-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11529754/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the International Symposium on Intelligent Computing and Networking 2024 : (ISICN 2024). International Symposium on Intelligent Computing and Networking (1st : 2024 : San Juan, P.R.)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1007/978-3-031-67447-1_9","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/8/8 0:00:00","PubModel":"Epub","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Machine learning methods have been proposed in lieu of simulations to predict chemical properties of molecules. The trade-off here is paying for the training time once, in exchange for instant predictions on the input data. However, many of these methods rely heavily on feature engineering to prepare the data for these models. Moreover, the use of molecular structural information has been limited, despite having such information encoded in the Simplified Molecular Input Line Entry System (SMILES) format. In this paper we present a framework that relies on SMILES data to predict molecular properties. Our methods are based on 1-D Convolutional Networks and do not require complex feature engineering. Our methods can be applied to learn molecular properties from base data, thus making them accessible to a wider audience. Our experiments show that this method can predict the molecular weight and XLogP properties without any encoding of complex chemical rules.