{"title":"用于分子特性预测的高级深度学习方法","authors":"Chao Pang, Henry H. Y. Tong, Leyi Wei","doi":"10.1002/qub2.23","DOIUrl":null,"url":null,"abstract":"The prediction of molecular properties is a crucial task in the field of drug discovery. Computational methods that can accurately predict molecular properties can significantly accelerate the drug discovery process and reduce the cost of drug discovery. In recent years, iterative updates in computing hardware and the rise of deep learning have created a new and effective path for molecular property prediction. Deep learning methods can leverage the vast amount of data accumulated over the years in drug discovery and do not require complex feature engineering. In this review, we summarize molecular representations and commonly used datasets in molecular property prediction models and present advanced deep learning methods for molecular property prediction, including state‐of‐the‐art deep learning networks such as graph neural networks and Transformer‐based models, as well as state‐of‐the‐art deep learning strategies such as 3D pre‐train, contrastive learning, multi‐task learning, transfer learning, and meta‐learning. We also point out some critical issues such as lack of datasets, low information utilization, and lack of specificity for diseases.","PeriodicalId":45660,"journal":{"name":"Quantitative Biology","volume":"33 2","pages":""},"PeriodicalIF":0.6000,"publicationDate":"2023-11-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Advanced deep learning methods for molecular property prediction\",\"authors\":\"Chao Pang, Henry H. Y. Tong, Leyi Wei\",\"doi\":\"10.1002/qub2.23\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The prediction of molecular properties is a crucial task in the field of drug discovery. Computational methods that can accurately predict molecular properties can significantly accelerate the drug discovery process and reduce the cost of drug discovery. In recent years, iterative updates in computing hardware and the rise of deep learning have created a new and effective path for molecular property prediction. Deep learning methods can leverage the vast amount of data accumulated over the years in drug discovery and do not require complex feature engineering. In this review, we summarize molecular representations and commonly used datasets in molecular property prediction models and present advanced deep learning methods for molecular property prediction, including state‐of‐the‐art deep learning networks such as graph neural networks and Transformer‐based models, as well as state‐of‐the‐art deep learning strategies such as 3D pre‐train, contrastive learning, multi‐task learning, transfer learning, and meta‐learning. We also point out some critical issues such as lack of datasets, low information utilization, and lack of specificity for diseases.\",\"PeriodicalId\":45660,\"journal\":{\"name\":\"Quantitative Biology\",\"volume\":\"33 2\",\"pages\":\"\"},\"PeriodicalIF\":0.6000,\"publicationDate\":\"2023-11-20\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Quantitative Biology\",\"FirstCategoryId\":\"99\",\"ListUrlMain\":\"https://doi.org/10.1002/qub2.23\",\"RegionNum\":4,\"RegionCategory\":\"生物学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q4\",\"JCRName\":\"MATHEMATICAL & COMPUTATIONAL BIOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Quantitative Biology","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1002/qub2.23","RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"MATHEMATICAL & COMPUTATIONAL BIOLOGY","Score":null,"Total":0}
引用次数: 0
摘要
预测分子性质是药物发现领域的一项重要任务。能够准确预测分子性质的计算方法可以大大加快药物发现的进程,降低药物发现的成本。近年来,计算硬件的迭代更新和深度学习的兴起为分子性质预测开辟了一条新的有效途径。深度学习方法可以利用药物发现过程中多年积累的大量数据,而且不需要复杂的特征工程。在这篇综述中,我们总结了分子性质预测模型中的分子表征和常用数据集,并介绍了用于分子性质预测的先进深度学习方法,包括最先进的深度学习网络(如图神经网络和基于 Transformer 的模型),以及最先进的深度学习策略(如 3D 预训练、对比学习、多任务学习、迁移学习和元学习)。我们还指出了一些关键问题,如缺乏数据集、信息利用率低、缺乏疾病特异性等。
Advanced deep learning methods for molecular property prediction
The prediction of molecular properties is a crucial task in the field of drug discovery. Computational methods that can accurately predict molecular properties can significantly accelerate the drug discovery process and reduce the cost of drug discovery. In recent years, iterative updates in computing hardware and the rise of deep learning have created a new and effective path for molecular property prediction. Deep learning methods can leverage the vast amount of data accumulated over the years in drug discovery and do not require complex feature engineering. In this review, we summarize molecular representations and commonly used datasets in molecular property prediction models and present advanced deep learning methods for molecular property prediction, including state‐of‐the‐art deep learning networks such as graph neural networks and Transformer‐based models, as well as state‐of‐the‐art deep learning strategies such as 3D pre‐train, contrastive learning, multi‐task learning, transfer learning, and meta‐learning. We also point out some critical issues such as lack of datasets, low information utilization, and lack of specificity for diseases.
期刊介绍:
Quantitative Biology is an interdisciplinary journal that focuses on original research that uses quantitative approaches and technologies to analyze and integrate biological systems, construct and model engineered life systems, and gain a deeper understanding of the life sciences. It aims to provide a platform for not only the analysis but also the integration and construction of biological systems. It is a quarterly journal seeking to provide an inter- and multi-disciplinary forum for a broad blend of peer-reviewed academic papers in order to promote rapid communication and exchange between scientists in the East and the West. The content of Quantitative Biology will mainly focus on the two broad and related areas: ·bioinformatics and computational biology, which focuses on dealing with information technologies and computational methodologies that can efficiently and accurately manipulate –omics data and transform molecular information into biological knowledge. ·systems and synthetic biology, which focuses on complex interactions in biological systems and the emergent functional properties, and on the design and construction of new biological functions and systems. Its goal is to reflect the significant advances made in quantitatively investigating and modeling both natural and engineered life systems at the molecular and higher levels. The journal particularly encourages original papers that link novel theory with cutting-edge experiments, especially in the newly emerging and multi-disciplinary areas of research. The journal also welcomes high-quality reviews and perspective articles.