Desirable molecule discovery via generative latent space exploration

IF 3.8 3区计算机科学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS Visual Informatics Pub Date : 2023-12-01 DOI:10.1016/j.visinf.2023.10.002

Wanjie Zheng, Jie Li, Yang Zhang

{"title":"Desirable molecule discovery via generative latent space exploration","authors":"Wanjie Zheng, Jie Li, Yang Zhang","doi":"10.1016/j.visinf.2023.10.002","DOIUrl":null,"url":null,"abstract":"<div><p>Drug molecule design is a classic research topic. Drug experts traditionally design molecules relying on their experience. Manual drug design is time-consuming and may produce low-efficacy and off-target molecules. With the popularity of deep learning, drug experts are beginning to use generative models to design drug molecules. A well-trained generative model can learn the distribution of training samples and infinitely generate drug-like molecules similar to the training samples. The automatic process improves design efficiency. However, most existing methods focus on proposing and optimizing generative models. How to discover ideal molecules from massive candidates is still an unresolved challenge. We propose a visualization system to discover ideal drug molecules generated by generative models. In this paper, we investigated the requirements and issues of drug design experts when using generative models, i.e., generating molecular structures with specific constraints and finding other molecular structures similar to potential drug molecular structures. We formalized the first problem as an optimization problem and proposed using a genetic algorithm to solve it. For the second problem, we proposed using a neighborhood sampling algorithm based on the continuity of the latent space to find solutions. We integrated the proposed algorithms into a visualization tool, and a case study for discovering potential drug molecules to make KOR agonists and experiments demonstrated the utility of our approach.</p></div>","PeriodicalId":36903,"journal":{"name":"Visual Informatics","volume":"7 4","pages":"Pages 13-21"},"PeriodicalIF":3.8000,"publicationDate":"2023-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2468502X23000475/pdfft?md5=6a9b7eb869496bec1ac323f40edb7d54&pid=1-s2.0-S2468502X23000475-main.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Visual Informatics","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2468502X23000475","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}

引用次数: 0

Abstract

Drug molecule design is a classic research topic. Drug experts traditionally design molecules relying on their experience. Manual drug design is time-consuming and may produce low-efficacy and off-target molecules. With the popularity of deep learning, drug experts are beginning to use generative models to design drug molecules. A well-trained generative model can learn the distribution of training samples and infinitely generate drug-like molecules similar to the training samples. The automatic process improves design efficiency. However, most existing methods focus on proposing and optimizing generative models. How to discover ideal molecules from massive candidates is still an unresolved challenge. We propose a visualization system to discover ideal drug molecules generated by generative models. In this paper, we investigated the requirements and issues of drug design experts when using generative models, i.e., generating molecular structures with specific constraints and finding other molecular structures similar to potential drug molecular structures. We formalized the first problem as an optimization problem and proposed using a genetic algorithm to solve it. For the second problem, we proposed using a neighborhood sampling algorithm based on the continuity of the latent space to find solutions. We integrated the proposed algorithms into a visualization tool, and a case study for discovering potential drug molecules to make KOR agonists and experiments demonstrated the utility of our approach.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

通过生成潜在空间探索发现理想分子

药物分子设计是一个经典的研究课题。药物专家通常依靠他们的经验来设计分子。人工药物设计耗时长，可能产生低疗效和脱靶分子。随着深度学习的普及，药物专家开始使用生成模型来设计药物分子。一个训练良好的生成模型可以学习训练样本的分布，并无限地生成与训练样本相似的类药物分子。自动化流程提高了设计效率。然而，大多数现有的方法都集中在生成模型的提出和优化上。如何从大量候选分子中发现理想分子仍然是一个未解决的挑战。我们提出了一个可视化系统来发现由生成模型生成的理想药物分子。在本文中，我们研究了药物设计专家在使用生成模型时的要求和问题，即生成具有特定约束的分子结构和寻找与潜在药物分子结构相似的其他分子结构。我们将第一个问题形式化为一个优化问题，并提出使用遗传算法来解决它。对于第二个问题，我们提出了一种基于隐空间连续性的邻域采样算法来求解。我们将提出的算法集成到一个可视化工具中，并进行了一个案例研究，用于发现潜在的药物分子来制造KOR激动剂和实验，证明了我们方法的实用性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊