Xuhan Liu, Jun Zhang, Zhonghuai Hou, Yi Isaac Yang, Yi Qin Gao
Reinforcement learning (RL) is one important branch of artificial intelligence (AI), which intuitively imitates the learning style of human beings. It is commonly derived from solving game playing problems and is extensively used for decision-making, control and optimization problems. It has been extensively applied for solving complicated problems with the property of Markov decision-making processes. With data accumulation and comprehensive analysis, researchers are not only satisfied with predicting the results for experimental systems but also hope to design or control them for the sake of obtaining the desired properties or functions. RL is potentially facilitated to solve a large number of complicated biological and chemical problems, because they could be decomposed into multi-step decision-making process. In practice, substantial progress has been made in the application of RL to the field of biomedicine. In this paper, we will first give a brief description about RL, including its definition, basic theory and different type of methods. Then we will review some detailed applications in various domains, for example, molecular design, reaction planning, molecular simulation and etc. In the end, we will summarize the essentialities of RL approaches to solve more diverse problems compared with other machine learning methods and also outlook the possible trends to overcome their limitations in the future.
{"title":"From predicting to decision making: Reinforcement learning in biomedicine","authors":"Xuhan Liu, Jun Zhang, Zhonghuai Hou, Yi Isaac Yang, Yi Qin Gao","doi":"10.1002/wcms.1723","DOIUrl":"https://doi.org/10.1002/wcms.1723","url":null,"abstract":"<p>Reinforcement learning (RL) is one important branch of artificial intelligence (AI), which intuitively imitates the learning style of human beings. It is commonly derived from solving game playing problems and is extensively used for decision-making, control and optimization problems. It has been extensively applied for solving complicated problems with the property of Markov decision-making processes. With data accumulation and comprehensive analysis, researchers are not only satisfied with predicting the results for experimental systems but also hope to design or control them for the sake of obtaining the desired properties or functions. RL is potentially facilitated to solve a large number of complicated biological and chemical problems, because they could be decomposed into multi-step decision-making process. In practice, substantial progress has been made in the application of RL to the field of biomedicine. In this paper, we will first give a brief description about RL, including its definition, basic theory and different type of methods. Then we will review some detailed applications in various domains, for example, molecular design, reaction planning, molecular simulation and etc. In the end, we will summarize the essentialities of RL approaches to solve more diverse problems compared with other machine learning methods and also outlook the possible trends to overcome their limitations in the future.</p><p>This article is categorized under:\u0000 </p>","PeriodicalId":236,"journal":{"name":"Wiley Interdisciplinary Reviews: Computational Molecular Science","volume":"14 4","pages":""},"PeriodicalIF":16.8,"publicationDate":"2024-07-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141561159","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Qiang Xu, Cheng Ma, Wenhui Mi, Yanchao Wang, Yanming Ma
Orbital-free density functional theory (OFDFT) stands out as a many-body electronic structure approach with a low computational cost that scales linearly with system size, making it well suitable for large-scale simulations. The past decades have witnessed impressive progress in OFDFT, which opens a new avenue to capture the complexity of realistic systems (e.g., solids, liquids, and warm dense matters) and provide a complete description of some complicated physical phenomena under realistic conditions (e.g., dislocation mobility, ductile processes, and vacancy diffusion). In this review, we first present a concise summary of the major methodological advances in OFDFT, placing particular emphasis on kinetic energy density functional and the schemes to evaluate the electron–ion interaction energy. We then give a brief overview of the current status of OFDFT developments in finite-temperature and time-dependent regimes, as well as our developed OFDFT-based software package, named by ATLAS. Finally, we highlight perspectives for further development in this fascinating field, including the major outstanding issues to be solved and forthcoming opportunities to explore large-scale materials.
{"title":"Recent advancements and challenges in orbital-free density functional theory","authors":"Qiang Xu, Cheng Ma, Wenhui Mi, Yanchao Wang, Yanming Ma","doi":"10.1002/wcms.1724","DOIUrl":"https://doi.org/10.1002/wcms.1724","url":null,"abstract":"<p>Orbital-free density functional theory (OFDFT) stands out as a many-body electronic structure approach with a low computational cost that scales linearly with system size, making it well suitable for large-scale simulations. The past decades have witnessed impressive progress in OFDFT, which opens a new avenue to capture the complexity of realistic systems (e.g., solids, liquids, and warm dense matters) and provide a complete description of some complicated physical phenomena under realistic conditions (e.g., dislocation mobility, ductile processes, and vacancy diffusion). In this review, we first present a concise summary of the major methodological advances in OFDFT, placing particular emphasis on kinetic energy density functional and the schemes to evaluate the electron–ion interaction energy. We then give a brief overview of the current status of OFDFT developments in finite-temperature and time-dependent regimes, as well as our developed OFDFT-based software package, named by ATLAS. Finally, we highlight perspectives for further development in this fascinating field, including the major outstanding issues to be solved and forthcoming opportunities to explore large-scale materials.</p><p>This article is categorized under:\u0000 </p>","PeriodicalId":236,"journal":{"name":"Wiley Interdisciplinary Reviews: Computational Molecular Science","volume":"14 3","pages":""},"PeriodicalIF":11.4,"publicationDate":"2024-06-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/wcms.1724","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141315446","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Tobias Harren, Torben Gutermuth, Christoph Grebner, Gerhard Hessler, Matthias Rarey
Structure-based drug design is a widely applied approach in the discovery of new lead compounds for known therapeutic targets. In most structure-based drug design applications, the docking procedure is considered the crucial step. Here, a potential ligand is fitted into the binding site, and a scoring function assesses its binding capability. With the rise of modern machine-learning in drug discovery, novel scoring functions using machine-learning techniques achieved significant performance gains in virtual screening and ligand optimization tasks on retrospective data. However, real-world applications of these methods are still limited. Missing success stories in prospective applications are one reason for this. Additionally, the fast-evolving nature of the field makes it challenging to assess the advantages of each individual method. This review will highlight recent strides toward improved real world applicability of machine-learning based scoring, enabling a better understanding of the potential benefits and pitfalls of these functions on a project. Furthermore, a systematic way of classifying machine-learning based scoring that facilitates comparisons will be presented.
{"title":"Modern machine-learning for binding affinity estimation of protein–ligand complexes: Progress, opportunities, and challenges","authors":"Tobias Harren, Torben Gutermuth, Christoph Grebner, Gerhard Hessler, Matthias Rarey","doi":"10.1002/wcms.1716","DOIUrl":"https://doi.org/10.1002/wcms.1716","url":null,"abstract":"<p>Structure-based drug design is a widely applied approach in the discovery of new lead compounds for known therapeutic targets. In most structure-based drug design applications, the docking procedure is considered the crucial step. Here, a potential ligand is fitted into the binding site, and a scoring function assesses its binding capability. With the rise of modern machine-learning in drug discovery, novel scoring functions using machine-learning techniques achieved significant performance gains in virtual screening and ligand optimization tasks on retrospective data. However, real-world applications of these methods are still limited. Missing success stories in prospective applications are one reason for this. Additionally, the fast-evolving nature of the field makes it challenging to assess the advantages of each individual method. This review will highlight recent strides toward improved real world applicability of machine-learning based scoring, enabling a better understanding of the potential benefits and pitfalls of these functions on a project. Furthermore, a systematic way of classifying machine-learning based scoring that facilitates comparisons will be presented.</p><p>This article is categorized under:\u0000 </p>","PeriodicalId":236,"journal":{"name":"Wiley Interdisciplinary Reviews: Computational Molecular Science","volume":"14 3","pages":""},"PeriodicalIF":11.4,"publicationDate":"2024-06-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/wcms.1716","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141304251","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
For dealing with complex reaction (CR) systems that show typical chemical phenomena in molecular aggregation states, the Red Moon (RM) approach is introduced based on a new efficient and systematic RM methodology. First, the theoretical background with my motivation to develop the RM approach is presented from the recent necessity to perform ‘atomistic’ molecular simulation of large-scale and long-term phenomena of (i) complex chemical reactions, (ii) stereospecificity, and (iii) aggregation structures. The RM methodology uses both the molecular dynamics (MD) method for molecular motions (translation, rotation, and vibration of molecules) that frequently occur on a short-time scale and the Monte Carlo (MC) method for rare events such as chemical reactions that hardly do on that time scale. Then, under the transition rate using both the potential energy difference before and after a rare event trial and its chemical kinetic probability, it is tested and judged by the MC method whether the trial is possible (Metropolis method). Next, typical applications of the RM approach are reviewed in two main research fields, (i) polymerization and (ii) storage battery (rechargeable battery or secondary cell), with various examples of our successful studies. Finally, we conclude that the RM approach using the RM methodology should become an efficient new-generation approach as one promising computational molecular strategy (CMT). We believe it will play an essential role in surveying, at the multilevel resolution, various specificities of CR systems in molecular aggregation states.
{"title":"The computational molecular technology for complex reaction systems: The Red Moon approach","authors":"Masataka Nagaoka","doi":"10.1002/wcms.1714","DOIUrl":"https://doi.org/10.1002/wcms.1714","url":null,"abstract":"<p>For dealing with complex reaction (CR) systems that show typical chemical phenomena in molecular aggregation states, the Red Moon (RM) approach is introduced based on a new efficient and systematic RM methodology. First, the theoretical background with my motivation to develop the RM approach is presented from the recent necessity to perform ‘atomistic’ molecular simulation of large-scale and long-term phenomena of (i) complex chemical reactions, (ii) stereospecificity, and (iii) aggregation structures. The RM methodology uses both the molecular dynamics (MD) method for molecular motions (translation, rotation, and vibration of molecules) that frequently occur on a short-time scale and the Monte Carlo (MC) method for rare events such as chemical reactions that hardly do on that time scale. Then, under the transition rate using both the potential energy difference before and after a rare event trial and its chemical kinetic probability, it is tested and judged by the MC method whether the trial is possible (Metropolis method). Next, typical applications of the RM approach are reviewed in two main research fields, (i) polymerization and (ii) storage battery (rechargeable battery or secondary cell), with various examples of our successful studies. Finally, we conclude that the RM approach using the RM methodology should become an efficient new-generation approach as one promising computational molecular strategy (CMT). We believe it will play an essential role in surveying, at the multilevel resolution, various specificities of CR systems in molecular aggregation states.</p><p>This article is categorized under:\u0000 </p>","PeriodicalId":236,"journal":{"name":"Wiley Interdisciplinary Reviews: Computational Molecular Science","volume":"14 3","pages":""},"PeriodicalIF":11.4,"publicationDate":"2024-05-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/wcms.1714","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140952708","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Time-resolved photoelectron spectroscopy is a powerful pump-probe technique which can probe nonadiabatic dynamics in molecules. Interpretation of the experimental signals however requires input from theoretical simulations. Advances in electronic structure theory, nonadiabatic dynamics, and theory to calculate the ionization yields, have enabled accurate simulation of time-resolved photoelectron spectra leading to successful applications of the technique. We review the basic theory and steps involved in calculating time-resolved photoelectron spectra, and highlight successful applications.
{"title":"Time-resolved photoelectron spectroscopy via trajectory surface hopping","authors":"Pratip Chakraborty, Spiridoula Matsika","doi":"10.1002/wcms.1715","DOIUrl":"https://doi.org/10.1002/wcms.1715","url":null,"abstract":"<p>Time-resolved photoelectron spectroscopy is a powerful pump-probe technique which can probe nonadiabatic dynamics in molecules. Interpretation of the experimental signals however requires input from theoretical simulations. Advances in electronic structure theory, nonadiabatic dynamics, and theory to calculate the ionization yields, have enabled accurate simulation of time-resolved photoelectron spectra leading to successful applications of the technique. We review the basic theory and steps involved in calculating time-resolved photoelectron spectra, and highlight successful applications.</p><p>This article is categorized under:\u0000 </p>","PeriodicalId":236,"journal":{"name":"Wiley Interdisciplinary Reviews: Computational Molecular Science","volume":"14 3","pages":""},"PeriodicalIF":11.4,"publicationDate":"2024-05-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140902733","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Enayat Mohsenzadeh, Vilma Ratautaite, Ernestas Brazys, Simonas Ramanavicius, Sarunas Zukauskas, Deivis Plausinaitis, Arunas Ramanavicius
This paper focuses on the computationally assisted design of molecularly imprinted polymers (MIP), emphasizing the selected strategies and chosen methods of approach. In summary, this paper provides an overview of the MIP fabrication procedure, focusing on key factors and challenges, where the fabrication of MIP includes a step-by-step process with extensive experimental procedures. This brings challenges in optimizing experimental conditions, such as the selection of monomer, cross-linker, and their relevant molar ratios to the template and solvent. Next, the principles of computational methods are elucidated to explore their potential applicability in solving the challenges. The computational approach can tackle the problems and optimize the MIP's design. Finally, the atomistic, quantum mechanical (QM), and combined methods in the recent research studies are overviewed with stress on strategies, analyses, and results. It is demonstrated that optimization of pre-polymerization mixture by employing simulations significantly reduces the trial-and-error experiments. Besides, higher selectivity and sensitivity of MIP are observed. The polymerization and resulting binding sites by computational methods are considered. Several models of binding sites are formed and analyzed to assess the affinities representing the sensitivity and selectivity of modeled cavities. Combined QM/atomistic methods showed more flexibility and versatility for realistic modeling with higher accuracy. This methodological advancement aligns with the principles of green chemistry, offering cost-effective and time-efficient solutions in MIP design.
{"title":"Design of molecularly imprinted polymers (MIP) using computational methods: A review of strategies and approaches","authors":"Enayat Mohsenzadeh, Vilma Ratautaite, Ernestas Brazys, Simonas Ramanavicius, Sarunas Zukauskas, Deivis Plausinaitis, Arunas Ramanavicius","doi":"10.1002/wcms.1713","DOIUrl":"https://doi.org/10.1002/wcms.1713","url":null,"abstract":"<p>This paper focuses on the computationally assisted design of molecularly imprinted polymers (MIP), emphasizing the selected strategies and chosen methods of approach. In summary, this paper provides an overview of the MIP fabrication procedure, focusing on key factors and challenges, where the fabrication of MIP includes a step-by-step process with extensive experimental procedures. This brings challenges in optimizing experimental conditions, such as the selection of monomer, cross-linker, and their relevant molar ratios to the template and solvent. Next, the principles of computational methods are elucidated to explore their potential applicability in solving the challenges. The computational approach can tackle the problems and optimize the MIP's design. Finally, the atomistic, quantum mechanical (QM), and combined methods in the recent research studies are overviewed with stress on strategies, analyses, and results. It is demonstrated that optimization of pre-polymerization mixture by employing simulations significantly reduces the trial-and-error experiments. Besides, higher selectivity and sensitivity of MIP are observed. The polymerization and resulting binding sites by computational methods are considered. Several models of binding sites are formed and analyzed to assess the affinities representing the sensitivity and selectivity of modeled cavities. Combined QM/atomistic methods showed more flexibility and versatility for realistic modeling with higher accuracy. This methodological advancement aligns with the principles of green chemistry, offering cost-effective and time-efficient solutions in MIP design.</p><p>This article is categorized under:\u0000 </p>","PeriodicalId":236,"journal":{"name":"Wiley Interdisciplinary Reviews: Computational Molecular Science","volume":"14 3","pages":""},"PeriodicalIF":11.4,"publicationDate":"2024-05-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140826122","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Bernadette Mohr, Thor van Heesch, Alberto Pérez de Alba Ortíz, Jocelyne Vreede
Molecular dynamics (MD) simulations can provide detailed insights into complex molecular systems, such as DNA, at high resolution in space and time. Using current computer architectures, time scales of tens of microseconds are feasible with contemporary all-atom force fields. However, these timescales are insufficient to accurately characterize large conformational transitions in DNA and compare calculations to experimental data. This review discusses the advantages and drawbacks of two simulation approaches to overcome the timescale challenge. The first approach is based on adding biasing potentials to the system to drive transitions. Umbrella sampling, steered MD, and metadynamics are examples of these methods. A key challenge of such methods is the necessity of selecting one or a few efficient coordinates, commonly referred to as collective variables (CVs), along which to apply the biasing potential. The path-metadynamics methodology addresses this issue by finding the optimal route(s) between states in a multi-dimensional CV space. The second strategy is path sampling, which focuses MD simulations on the transitions. The assumption is that even though transitions between states are rare, they are generally fast. Stopping the simulations as soon as they reach a stable state can significantly increase simulation efficiency. We introduce these methods on the two-dimensional Müller–Brown potential. DNA applications are featured for two different processes: the Watson–Crick–Franklin to Hoogsteen transition in adenine–thymine base pairs and the binding of a DNA-binding protein domain to DNA.
This article is categorized under:
分子动力学(MD)模拟可以在空间和时间上以高分辨率详细了解 DNA 等复杂分子系统。利用当前的计算机架构,几十微秒的时间尺度在当代全原子力场中是可行的。然而,这些时间尺度不足以准确描述 DNA 中的大型构象转变,也不足以将计算结果与实验数据进行比较。本综述讨论了克服时间尺度挑战的两种模拟方法的优缺点。第一种方法基于在系统中添加偏置电位来驱动转变。伞状采样、定向 MD 和元动力学就是这些方法的例子。这些方法面临的一个主要挑战是,必须选择一个或几个有效坐标(通常称为集体变量(CV))来应用偏置电势。路径计量学方法通过在多维 CV 空间中寻找状态之间的最佳路径来解决这一问题。第二种策略是路径采样,它将 MD 模拟的重点放在转换上。我们的假设是,尽管状态之间的转换很少,但转换速度通常很快。一旦达到稳定状态,立即停止模拟,可以显著提高模拟效率。我们在二维 Müller-Brown 势上介绍了这些方法。DNA 应用是两个不同过程的特色:腺嘌呤-胸腺嘧啶碱基对中沃森-克里克-弗兰克林到霍格斯坦的转变,以及 DNA 结合蛋白结构域与 DNA 的结合:
{"title":"Enhanced sampling strategies for molecular simulation of DNA","authors":"Bernadette Mohr, Thor van Heesch, Alberto Pérez de Alba Ortíz, Jocelyne Vreede","doi":"10.1002/wcms.1712","DOIUrl":"https://doi.org/10.1002/wcms.1712","url":null,"abstract":"<p>Molecular dynamics (MD) simulations can provide detailed insights into complex molecular systems, such as DNA, at high resolution in space and time. Using current computer architectures, time scales of tens of microseconds are feasible with contemporary all-atom force fields. However, these timescales are insufficient to accurately characterize large conformational transitions in DNA and compare calculations to experimental data. This review discusses the advantages and drawbacks of two simulation approaches to overcome the timescale challenge. The first approach is based on adding biasing potentials to the system to drive transitions. Umbrella sampling, steered MD, and metadynamics are examples of these methods. A key challenge of such methods is the necessity of selecting one or a few efficient coordinates, commonly referred to as collective variables (CVs), along which to apply the biasing potential. The path-metadynamics methodology addresses this issue by finding the optimal route(s) between states in a multi-dimensional CV space. The second strategy is path sampling, which focuses MD simulations on the transitions. The assumption is that even though transitions between states are rare, they are generally fast. Stopping the simulations as soon as they reach a stable state can significantly increase simulation efficiency. We introduce these methods on the two-dimensional Müller–Brown potential. DNA applications are featured for two different processes: the Watson–Crick–Franklin to Hoogsteen transition in adenine–thymine base pairs and the binding of a DNA-binding protein domain to DNA.</p><p>This article is categorized under:\u0000 </p>","PeriodicalId":236,"journal":{"name":"Wiley Interdisciplinary Reviews: Computational Molecular Science","volume":"14 2","pages":""},"PeriodicalIF":11.4,"publicationDate":"2024-04-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/wcms.1712","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140351641","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Jason Yim, Hannes Stärk, Gabriele Corso, Bowen Jing, Regina Barzilay, Tommi S. Jaakkola
Generative AI is rapidly transforming the frontier of research in computational structural biology. Indeed, recent successes have substantially advanced protein design and drug discovery. One of the key methodologies underlying these advances is diffusion models (DM). Diffusion models originated in computer vision, rapidly taking over image generation and offering superior quality and performance. These models were subsequently extended and modified for uses in other areas including computational structural biology. DMs are well equipped to model high dimensional, geometric data while exploiting key strengths of deep learning. In structural biology, for example, they have achieved state-of-the-art results on protein 3D structure generation and small molecule docking. This review covers the basics of diffusion models, associated modeling choices regarding molecular representations, generation capabilities, prevailing heuristics, as well as key limitations and forthcoming refinements. We also provide best practices around evaluation procedures to help establish rigorous benchmarking and evaluation. The review is intended to provide a fresh view into the state-of-the-art as well as highlight its potentials and current challenges of recent generative techniques in computational structural biology.
{"title":"Diffusion models in protein structure and docking","authors":"Jason Yim, Hannes Stärk, Gabriele Corso, Bowen Jing, Regina Barzilay, Tommi S. Jaakkola","doi":"10.1002/wcms.1711","DOIUrl":"https://doi.org/10.1002/wcms.1711","url":null,"abstract":"<p>Generative AI is rapidly transforming the frontier of research in computational structural biology. Indeed, recent successes have substantially advanced protein design and drug discovery. One of the key methodologies underlying these advances is diffusion models (DM). Diffusion models originated in computer vision, rapidly taking over image generation and offering superior quality and performance. These models were subsequently extended and modified for uses in other areas including computational structural biology. DMs are well equipped to model high dimensional, geometric data while exploiting key strengths of deep learning. In structural biology, for example, they have achieved state-of-the-art results on protein 3D structure generation and small molecule docking. This review covers the basics of diffusion models, associated modeling choices regarding molecular representations, generation capabilities, prevailing heuristics, as well as key limitations and forthcoming refinements. We also provide best practices around evaluation procedures to help establish rigorous benchmarking and evaluation. The review is intended to provide a fresh view into the state-of-the-art as well as highlight its potentials and current challenges of recent generative techniques in computational structural biology.</p><p>This article is categorized under:\u0000 </p>","PeriodicalId":236,"journal":{"name":"Wiley Interdisciplinary Reviews: Computational Molecular Science","volume":"14 2","pages":""},"PeriodicalIF":11.4,"publicationDate":"2024-04-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/wcms.1711","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140351592","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Sarah N. Elliott, Kevin B. Moore III, Clayton R. Mulvihill, Andreas V. Copan, Luna Pratali Maffei, Stephen J. Klippenstein
Stereochemical effects significantly influence chemical processes, yet it is not well understood if they are a leading source of uncertainty in combustion modeling. Stereochemistry influences a combustion model (i) at the earliest stage of its construction when mapping the reaction network, (ii) in the computation of individual thermochemical and rate parameters, and (iii) in the prediction of combustion observables. The present work reviews the importance of enumerating stereochemical species and reactions at each of these steps. Further, it analyzes the separate influence of several types of stereochemistry, including geometric, optical, and fleeting transition state diastereomers. Three reaction networks serve to examine which stages of low-temperature oxidation are most affected by stereochemistry, including the first and second oxidation of n-butane, the third oxidation of n-pentane, and the early stages of pyrolysis of 1- and 2-pentene. The 149 reactions in the n-butane mechanism are expanded to 183 reactions when accounting for diastereomerism. Each of these 183 reactions is parameterized with ab initio kinetics computations to determine that, for the n-butane mechanism, the median factor of diastereomeric deviation is 3.5 at 360 K for rate constants and as high as 1.6 for mechanism reactivity, in terms of ignition delay times, as opposed to a mechanism without stereochemical expansion.
This article is categorized under:
立体化学效应对化学过程有重大影响,但立体化学效应是否是燃烧建模中不确定性的主要来源还不十分清楚。立体化学对燃烧模型的影响包括:(i) 在构建模型的最初阶段绘制反应网络图时;(ii) 在计算单个热化学和速率参数时;(iii) 在预测燃烧观测值时。本研究回顾了在上述每个步骤中列举立体化学物种和反应的重要性。此外,它还分析了几类立体化学的单独影响,包括几何、光学和转瞬即逝的过渡态非对映异构体。三个反应网络用于研究低温氧化的哪些阶段受立体化学的影响最大,包括正丁烷的第一和第二次氧化、正戊烷的第三次氧化以及 1-和 2-戊烯热解的早期阶段。考虑到非对映异构,正丁烷机理中的 149 个反应扩展为 183 个反应。通过对这 183 个反应中的每一个反应进行参数化,并利用 ab initio 动力学计算确定,与没有进行立体化学扩展的机理相比,正丁烷机理的非对映异构体偏差中位系数在 360 K 时的速率常数为 3.5,而机理反应性的非对映异构体偏差中位系数(以点火延迟时间计算)高达 1.6:
{"title":"The role of stereochemistry in combustion processes","authors":"Sarah N. Elliott, Kevin B. Moore III, Clayton R. Mulvihill, Andreas V. Copan, Luna Pratali Maffei, Stephen J. Klippenstein","doi":"10.1002/wcms.1710","DOIUrl":"https://doi.org/10.1002/wcms.1710","url":null,"abstract":"<p>Stereochemical effects significantly influence chemical processes, yet it is not well understood if they are a leading source of uncertainty in combustion modeling. Stereochemistry influences a combustion model (i) at the earliest stage of its construction when mapping the reaction network, (ii) in the computation of individual thermochemical and rate parameters, and (iii) in the prediction of combustion observables. The present work reviews the importance of enumerating stereochemical species and reactions at each of these steps. Further, it analyzes the separate influence of several types of stereochemistry, including geometric, optical, and fleeting transition state diastereomers. Three reaction networks serve to examine which stages of low-temperature oxidation are most affected by stereochemistry, including the first and second oxidation of <i>n</i>-butane, the third oxidation of <i>n</i>-pentane, and the early stages of pyrolysis of 1- and 2-pentene. The 149 reactions in the <i>n</i>-butane mechanism are expanded to 183 reactions when accounting for diastereomerism. Each of these 183 reactions is parameterized with ab initio kinetics computations to determine that, for the <i>n</i>-butane mechanism, the median factor of diastereomeric deviation is 3.5 at 360 K for rate constants and as high as 1.6 for mechanism reactivity, in terms of ignition delay times, as opposed to a mechanism without stereochemical expansion.</p><p>This article is categorized under:\u0000 </p>","PeriodicalId":236,"journal":{"name":"Wiley Interdisciplinary Reviews: Computational Molecular Science","volume":"14 2","pages":""},"PeriodicalIF":11.4,"publicationDate":"2024-03-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/wcms.1710","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140188539","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Ab-initio quantum chemistry simulations are essential for understanding electronic structure of molecules and materials in almost all areas of chemistry. A broad variety of electronic structure theories and implementations has been developed in the past decades to hopefully solve the many-body Schrödinger equation in an approximate manner on modern computers. In this review, we present recent progress in advancing low-rank electronic structure methodologies that rely on the wavefunction sparsity and compressibility to select the important subset of electronic configurations for both weakly and strongly correlated molecules. Representative chemistry applications that require the many-body treatment beyond traditional density functional approximations are discussed. The low-rank electronic structure theories have further prompted us to highlight compressive and expressive principles that are useful to catalyze idea of quantum learning models. The intersection of the low-rank correlated feature design and the modern deep neural network learning provides new feasibilities to predict chemically accurate correlation energies of unknown molecules that are not represented in the training dataset. The results by others and us are discussed to reveal that the electronic feature sets from an extremely low-rank correlation representation, which is very poor for explicit energy computation, are however sufficiently expressive for capturing and transferring electron correlation patterns across distinct molecular compositions, bond types and geometries.
{"title":"Making quantum chemistry compressive and expressive: Toward practical ab-initio simulation","authors":"Jun Yang","doi":"10.1002/wcms.1706","DOIUrl":"https://doi.org/10.1002/wcms.1706","url":null,"abstract":"<p>Ab-initio quantum chemistry simulations are essential for understanding electronic structure of molecules and materials in almost all areas of chemistry. A broad variety of electronic structure theories and implementations has been developed in the past decades to hopefully solve the many-body Schrödinger equation in an approximate manner on modern computers. In this review, we present recent progress in advancing low-rank electronic structure methodologies that rely on the wavefunction sparsity and compressibility to select the important subset of electronic configurations for both weakly and strongly correlated molecules. Representative chemistry applications that require the many-body treatment beyond traditional density functional approximations are discussed. The low-rank electronic structure theories have further prompted us to highlight compressive and expressive principles that are useful to catalyze idea of quantum learning models. The intersection of the low-rank correlated feature design and the modern deep neural network learning provides new feasibilities to predict chemically accurate correlation energies of unknown molecules that are not represented in the training dataset. The results by others and us are discussed to reveal that the electronic feature sets from an extremely low-rank correlation representation, which is very poor for explicit energy computation, are however sufficiently expressive for capturing and transferring electron correlation patterns across distinct molecular compositions, bond types and geometries.</p><p>This article is categorized under:\u0000 </p>","PeriodicalId":236,"journal":{"name":"Wiley Interdisciplinary Reviews: Computational Molecular Science","volume":"14 2","pages":""},"PeriodicalIF":11.4,"publicationDate":"2024-03-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/wcms.1706","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140114230","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}