Supercomput. Front. Innov.最新文献

英文中文

Evaluating the Performance of OpenMP Offloading on the NEC SX-Aurora TSUBASA Vector Engine 在NEC SX-Aurora TSUBASA矢量引擎上评估OpenMP卸载性能

Supercomput. Front. Innov.

Pub Date : 2021-06-01 DOI: 10.14529/jsfi210204

Tim Cramer, B. Kosmynin, Simon Moll, Manoel Römmer, E. Focht, Matthias S. Müller

引用次数: 4

Accelerating Seismic Redatuming Using Tile Low-Rank Approximations on NEC SX-Aurora TSUBASA 利用低秩近似加速NEC SX-Aurora TSUBASA地震数据重建

Supercomput. Front. Innov.

Pub Date : 2021-06-01 DOI: 10.14529/jsfi210201

Yuxi Hong, H. Ltaief, M. Ravasi, L. Gatineau, D. Keyes

隶属于

引用次数: 5

Micro-Workflows Data Stream Processing Model for Industrial Internet of Things 工业物联网微工作流数据流处理模型

Supercomput. Front. Innov.

Pub Date : 2021-04-23 DOI: 10.14529/jsfi210106

Ameer B. A. Alaasam, G. Radchenko, A. Tchernykh

The fog computing paradigm has become prominent in stream processing for IoT systems where cloud computing struggles from high latency challenges. It enables the deployment of computational resources between the edge and cloud layers and helps to resolve constraints, primarily due to the need to react in real-time to state changes, improve the locality of data storage, and overcome external communication channels’ limitations. There is an urgent need for tools and platforms to model, implement, manage, and monitor complex fog computing workflows. Traditional scientific workflow management systems (SWMSs) provide modularity and flexibility to design, execute, and monitor complex computational workflows used in smart industry applications. However, they are mainly focused on batch execution of jobs consisting of tightly coupled tasks. Integrating data streams into SWMSs of IoT systems is challenging. We proposed a microworkflow model to redesign the monolith architecture of workflow systems into a set of smaller and independent workflows that support stream processing. Micro-workflow is an independent data stream processing service that can be deployed on different layers of the fog computing environment. To validate the feasibility and practicability of the micro-workflow refactoring, we provide intensive experimental analysis evaluating the interval between sensor messages, the time interval required to create a message, between sending sensor message and receiving the message in SWMS, including data serialization, network latency, etc. We show that the proposed decoupling support of the independence of implementation, execution, development, maintenance, and cross-platform deployment, where each micro-workflow becomes a standalone computational unit, is a suitable mechanism for IoT stream processing.

雾计算范式在云计算面临高延迟挑战的物联网系统的流处理中变得突出。它支持在边缘和云之间部署计算资源，并有助于解决约束，主要是由于需要实时响应状态变化，改善数据存储的局部性，并克服外部通信通道的限制。现在迫切需要工具和平台来建模、实现、管理和监控复杂的雾计算工作流。传统的科学工作流管理系统(SWMSs)提供模块化和灵活性来设计、执行和监控智能工业应用中使用的复杂计算工作流。然而，它们主要关注由紧密耦合任务组成的作业的批处理执行。将数据流集成到物联网系统的swms中具有挑战性。我们提出了一个微工作流模型，将工作流系统的整体架构重新设计为一组支持流处理的更小且独立的工作流。微工作流是一种独立的数据流处理服务，可以部署在雾计算环境的不同层上。为了验证微工作流重构的可行性和实用性，我们进行了深入的实验分析，评估了传感器消息之间的间隔，创建消息所需的时间间隔，在SWMS中发送传感器消息和接收消息之间的间隔，包括数据序列化，网络延迟等。我们表明，提出的解耦支持实现、执行、开发、维护和跨平台部署的独立性，其中每个微工作流成为一个独立的计算单元，是物联网流处理的合适机制。

{"title":"Micro-Workflows Data Stream Processing Model for Industrial Internet of Things","authors":"Ameer B. A. Alaasam, G. Radchenko, A. Tchernykh","doi":"10.14529/jsfi210106","DOIUrl":"https://doi.org/10.14529/jsfi210106","url":null,"abstract":"The fog computing paradigm has become prominent in stream processing for IoT systems where cloud computing struggles from high latency challenges. It enables the deployment of computational resources between the edge and cloud layers and helps to resolve constraints, primarily due to the need to react in real-time to state changes, improve the locality of data storage, and overcome external communication channels’ limitations. There is an urgent need for tools and platforms to model, implement, manage, and monitor complex fog computing workflows. Traditional scientific workflow management systems (SWMSs) provide modularity and flexibility to design, execute, and monitor complex computational workflows used in smart industry applications. However, they are mainly focused on batch execution of jobs consisting of tightly coupled tasks. Integrating data streams into SWMSs of IoT systems is challenging. We proposed a microworkflow model to redesign the monolith architecture of workflow systems into a set of smaller and independent workflows that support stream processing. Micro-workflow is an independent data stream processing service that can be deployed on different layers of the fog computing environment. To validate the feasibility and practicability of the micro-workflow refactoring, we provide intensive experimental analysis evaluating the interval between sensor messages, the time interval required to create a message, between sending sensor message and receiving the message in SWMS, including data serialization, network latency, etc. We show that the proposed decoupling support of the independence of implementation, execution, development, maintenance, and cross-platform deployment, where each micro-workflow becomes a standalone computational unit, is a suitable mechanism for IoT stream processing.","PeriodicalId":338883,"journal":{"name":"Supercomput. Front. Innov.","volume":"12 1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-04-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126196261","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 2

Developing an Architecture-independent Graph Framework for Modern Vector Processors and NVIDIA GPUs 为现代矢量处理器和NVIDIA gpu开发一个与架构无关的图形框架

Supercomput. Front. Innov.

Pub Date : 2021-01-12 DOI: 10.14529/jsfi200404

I. Afanasyev

This paper describes the first-in-the-world attempt to develop an architectural-independent graph framework named VGL, designed for different modern architectures with high-bandwidth memory. Currently VGL supports two classes of architectures: NEC SX-Aurora TSUBASA vector processors and NVIDIA GPUs. However, VGL can be easily extended to other architectures due to its flexible software structure. VGL is designed to provide users with the possibility of selecting the most suitable architecture for solving a specific graph problem on a given input data, which, in return, allows to significantly outperform existing frameworks and libraries, developed for modern multicore CPUs and NVIDIA GPUs. Since VGL uses an identical set of computational and data abstractions for all architectures, its users can easily port graph algorithms between different target architectures without any source code modifications. Additionally, in this paper we show how graph algorithms should be implemented and optimised for NVIDIA GPU and NEC SX-Aurora TSUBASA architectures, demonstrating that both architectures have multiple similar properties and hardware features.

本文描述了世界上第一次尝试开发一个名为VGL的独立于架构的图形框架，该框架是为不同的具有高带宽内存的现代架构而设计的。目前VGL支持两类架构:NEC SX-Aurora TSUBASA矢量处理器和NVIDIA gpu。然而，VGL由于其灵活的软件结构，可以很容易地扩展到其他体系结构。VGL旨在为用户提供选择最合适的架构来解决给定输入数据上的特定图形问题的可能性，这反过来又允许显着优于现有的框架和库，为现代多核cpu和NVIDIA gpu开发。由于VGL对所有体系结构使用相同的计算和数据抽象集，因此它的用户可以轻松地在不同的目标体系结构之间移植图算法，而无需修改任何源代码。此外，在本文中，我们展示了图形算法应该如何实现和优化NVIDIA GPU和NEC SX-Aurora TSUBASA架构，证明这两种架构具有多个相似的属性和硬件功能。

引用次数: 0

Enhancing the in Situ Visualization of Performance Data in Parallel CFD Applications 增强并行CFD应用中性能数据的现场可视化

Supercomput. Front. Innov.

Pub Date : 2020-12-01 DOI: 10.14529/jsfi200402

Rigel F. C. Alves, A. Knüpfer

This paper continues the work initiated by the authors on the feasibility of using ParaView as visualization software for the analysis of parallel Computational Fluid Dynamics (CFD) codes’ performance. Current performance tools have limited capacity of displaying their data on top of three-dimensional, framed (i.e., time-stepped) representations of the cluster’s topology. In our first paper, a plugin for the open-source performance tool Score-P was introduced, which intercepts an arbitrary number of manually selected code regions (mostly functions) and send their respective measurements–amount of executions and cumulative time spent–to ParaView (through its in situ library, Catalyst), as if they were any other flow-related variable. Our second paper added to such plugin the capacity to (also) map communication data (messages exchanged between MPI ranks) to the simulation’s geometry. So far the tool was limited to codes which already have the in situ adapter; but in this paper, we will take the performance data and display it–also in codes without in situ–on a three-dimensional representation of the hardware resources being used by the simulation. Testing is done with the Multi-Grid and Block Tri-diagonal NPBs, as well as Rolls-Royce’s CFD code, Hydra. The benefits and overhead of the plugin's new functionalities are discussed.

本文继续了作者关于使用ParaView作为可视化软件分析并行计算流体力学(CFD)代码性能的可行性的工作。当前的性能工具在集群拓扑的三维、框架(即时间步进)表示上显示数据的能力有限。在我们的第一篇论文中，介绍了一个开源性能工具Score-P的插件，它拦截任意数量的手动选择的代码区域(主要是函数)，并将它们各自的测量值——执行量和累计花费的时间——发送给ParaView(通过它的原位库Catalyst)，就像它们是任何其他流相关的变量一样。我们的第二篇论文为这样的插件添加了映射通信数据(MPI等级之间交换的消息)到模拟几何的能力。到目前为止，该工具仅限于已经具有原位适配器的代码;但在本文中，我们将采用性能数据并将其显示为模拟所使用的硬件资源的三维表示。测试使用了Multi-Grid和Block三对角线npb，以及Rolls-Royce的CFD代码Hydra。讨论了插件新功能的好处和开销。

{"title":"Enhancing the in Situ Visualization of Performance Data in Parallel CFD Applications","authors":"Rigel F. C. Alves, A. Knüpfer","doi":"10.14529/jsfi200402","DOIUrl":"https://doi.org/10.14529/jsfi200402","url":null,"abstract":"This paper continues the work initiated by the authors on the feasibility of using ParaView as visualization software for the analysis of parallel Computational Fluid Dynamics (CFD) codes’ performance. Current performance tools have limited capacity of displaying their data on top of three-dimensional, framed (i.e., time-stepped) representations of the cluster’s topology. In our first paper, a plugin for the open-source performance tool Score-P was introduced, which intercepts an arbitrary number of manually selected code regions (mostly functions) and send their respective measurements–amount of executions and cumulative time spent–to ParaView (through its in situ library, Catalyst), as if they were any other flow-related variable. Our second paper added to such plugin the capacity to (also) map communication data (messages exchanged between MPI ranks) to the simulation’s geometry. So far the tool was limited to codes which already have the in situ adapter; but in this paper, we will take the performance data and display it–also in codes without in situ–on a three-dimensional representation of the hardware resources being used by the simulation. Testing is done with the Multi-Grid and Block Tri-diagonal NPBs, as well as Rolls-Royce’s CFD code, Hydra. The benefits and overhead of the plugin's new functionalities are discussed.","PeriodicalId":338883,"journal":{"name":"Supercomput. Front. Innov.","volume":"82 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125934271","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Computational Approaches To Identify A Hidden Pharmacological Potential In Large Chemical Libraries 在大型化学文库中识别隐藏药理学潜力的计算方法

Supercomput. Front. Innov.

Pub Date : 2020-10-10 DOI: 10.14529/jsfi200306

D. Druzhilovskiy, L. Stolbov, P. Savosina, P. Pogodin, D. Filimonov, A. Veselovsky, K. Stefanisko, N. Tarasova, M. Nicklaus, V. Poroikov

To improve the discovery of more effective and less toxic pharmaceutical agents, large virtual repositories of synthesizable molecules have been generated to increase the explored chemical-pharmacological space diversity. Such libraries include billions of structural formulae of drug-like molecules associated with data on synthetic schemes, required building blocks, estimated physical-chemical parameters, etc. Clearly, such repositories are “Big Data”. Thus, to identify the most promising compounds with the required pharmacological properties (hits) among billions of available opportunities, special computational methods are necessary. We have proposed using a combined computational approach, which combines structural similarity assessment, machine learning, and molecular modeling. Our approach has been validated in a project aimed at finding new pharmaceutical agents against HIV/AIDS and associated comorbidities from the Synthetically Accessible Virtual Inventory (SAVI), a 1.75 billion compound database. Potential inhibitors of HIV-1 protease and reverse transcriptase and agonists of toll-like receptors and STING, affecting innate immunity, were computationally identified. The activity of the three synthesized compounds has been confirmed in a cell-based assay. These compounds belong to the chemical classes, in which the agonistic effect on TLR 7/8 had not been previously shown. Synthesis and biological testing of several dozens of compounds with predicted antiretroviral activity are currently taking place at the NCI/NIH. We also carried out virtual screening among one billion substances to find compounds potentially possessing anti-SARS-CoV-2 activity. The selected hits' information has been accepted by the European Initiative “JEDI Grand Challenge against COVID-19” for synthesis and further biological evaluation. The possibilities and limitations of the approach are discussed.

为了更好地发现更有效和更低毒性的药物制剂，已经产生了可合成分子的大型虚拟存储库，以增加探索的化学-药理学空间的多样性。这些文库包括数十亿个药物类分子的结构式，以及与合成方案相关的数据、所需的构建块、估计的物理化学参数等。显然，这样的存储库就是“大数据”。因此，要在数十亿个可用的机会中识别具有所需药理特性(hit)的最有希望的化合物，需要特殊的计算方法。我们建议使用结合了结构相似性评估、机器学习和分子建模的组合计算方法。我们的方法已经在一个项目中得到验证，该项目旨在从一个17.5亿美元的化合物数据库——综合可访问虚拟库存(SAVI)中寻找对抗艾滋病毒/艾滋病和相关合并症的新药。通过计算确定了影响先天免疫的HIV-1蛋白酶和逆转录酶的潜在抑制剂以及toll样受体和STING的激动剂。这三种合成化合物的活性已在基于细胞的测定中得到证实。这些化合物属于化学类，其中对tlr7 /8的激动作用以前未被证明。NCI/NIH目前正在对几十种预测具有抗逆转录病毒活性的化合物进行合成和生物学测试。我们还对10亿种物质进行了虚拟筛选，以发现可能具有抗sars - cov -2活性的化合物。选定的命中信息已被欧洲倡议“抗击COVID-19 JEDI大挑战”接受，用于合成和进一步的生物学评估。讨论了该方法的可能性和局限性。

{"title":"Computational Approaches To Identify A Hidden Pharmacological Potential In Large Chemical Libraries","authors":"D. Druzhilovskiy, L. Stolbov, P. Savosina, P. Pogodin, D. Filimonov, A. Veselovsky, K. Stefanisko, N. Tarasova, M. Nicklaus, V. Poroikov","doi":"10.14529/jsfi200306","DOIUrl":"https://doi.org/10.14529/jsfi200306","url":null,"abstract":"To improve the discovery of more effective and less toxic pharmaceutical agents, large virtual repositories of synthesizable molecules have been generated to increase the explored chemical-pharmacological space diversity. Such libraries include billions of structural formulae of drug-like molecules associated with data on synthetic schemes, required building blocks, estimated physical-chemical parameters, etc. Clearly, such repositories are “Big Data”. Thus, to identify the most promising compounds with the required pharmacological properties (hits) among billions of available opportunities, special computational methods are necessary. We have proposed using a combined computational approach, which combines structural similarity assessment, machine learning, and molecular modeling. Our approach has been validated in a project aimed at finding new pharmaceutical agents against HIV/AIDS and associated comorbidities from the Synthetically Accessible Virtual Inventory (SAVI), a 1.75 billion compound database. Potential inhibitors of HIV-1 protease and reverse transcriptase and agonists of toll-like receptors and STING, affecting innate immunity, were computationally identified. The activity of the three synthesized compounds has been confirmed in a cell-based assay. These compounds belong to the chemical classes, in which the agonistic effect on TLR 7/8 had not been previously shown. Synthesis and biological testing of several dozens of compounds with predicted antiretroviral activity are currently taking place at the NCI/NIH. We also carried out virtual screening among one billion substances to find compounds potentially possessing anti-SARS-CoV-2 activity. The selected hits' information has been accepted by the European Initiative “JEDI Grand Challenge against COVID-19” for synthesis and further biological evaluation. The possibilities and limitations of the approach are discussed.","PeriodicalId":338883,"journal":{"name":"Supercomput. Front. Innov.","volume":"42 13","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-10-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133489586","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 4

Computational Characterization of the Substrate Activation in the Active Site of SARS-CoV-2 Main Protease SARS-CoV-2主蛋白酶活性位点底物活化的计算表征

Supercomput. Front. Innov.

Pub Date : 2020-10-10 DOI: 10.14529/jsfi200304

M. Khrenova, V. Tsirelson, A. Nemukhin

Molecular dynamics simulations with the QM(DFT)/MM potentials are utilized to discriminate between reactive and nonreactive complexes of the SARS-CoV-2 main protease and its substrates. Classification of frames along the molecular dynamic trajectories is utilized by analysis of the 2D maps of the Laplacian of electron density. Those are calculated in the plane formed by the carbonyl group of the substrate and a nucleophilic sulfur atom of the cysteine residue that initiates enzymatic reaction. Utilization of the GPU-based DFT code allows fast and accurate simulations with the hybrid functional PBE0 and double-zeta basis set. Exclusion of the polarization functions accelerates the calculations 2-fold, however this does not describe the substrate activation. Larger basis set with d-functions on heavy atoms and p-functions on hydrogen atoms enables to disclose equilibrium between the reactive and nonreactive species along the MD trajectory. The suggested approach can be utilized to choose covalent inhibitors that will readily interact with the catalytic residue of the selected enzyme.

利用QM(DFT)/MM电位的分子动力学模拟来区分SARS-CoV-2主要蛋白酶及其底物的反应性和非反应性复合物。通过对二维电子密度拉普拉斯图的分析，利用了分子动力学轨迹框架的分类。这些是在由底物的羰基和启动酶促反应的半胱氨酸残基的亲核硫原子形成的平面上计算的。利用基于gpu的DFT代码，可以使用混合功能PBE0和双zeta基集进行快速准确的仿真。极化函数的排除使计算速度加快了2倍，但这并不能描述衬底的激活。重原子上的d-函数和氢原子上的p-函数的更大基集能够揭示沿MD轨迹的反应性和非反应性物质之间的平衡。建议的方法可以用来选择共价抑制剂，将很容易地与所选酶的催化残基相互作用。

引用次数: 1

Computational Modeling of the SARS-CoV-2 Main Protease Inhibition by the Covalent Binding of Prospective Drug Molecules 未来药物分子共价结合抑制SARS-CoV-2主要蛋白酶的计算模型

Supercomput. Front. Innov.

Pub Date : 2020-10-10 DOI: 10.14529/jsfi200303

A. Nemukhin, B. Grigorenko, I. Polyakov, S. Lushchekina

We illustrate modern modeling tools applied in the computational design of drugs acting as covalent inhibitors of enzymes. We take the Main protease (M pro ) from the SARS-CoV-2 virus as an important present-day representative. In this work, we construct a compound capable to block M pro , which is composed of fragments of antimalarial drugs and covalent inhibitors of cysteine proteases. To characterize the mechanism of its interaction with the enzyme, the algorithms based on force fields, including molecular mechanics (MM), molecular dynamics (MD) and molecular docking, as well as quantum-based approaches, including quantum chemistry and quantum mechanics/molecular mechanics (QM/MM) methods, should be applied. The use of supercomputers is indispensably important at least in the latter approach. Its application to enzymes assumes that energies and forces in the active sites are computed using methods of quantum chemistry, whereas the rest of protein matrix is described using conventional force fields. For the proposed compound, containing the benzoisothiazolone fragment and the substitute at the uracil ring, we show that it can form a stable covalently bound adduct with the target enzyme, and thus can be recommended for experimental trials.

我们说明现代建模工具应用于计算设计的药物作为共价抑制剂的酶。我们以SARS-CoV-2病毒的主蛋白酶(M pro)为当今的重要代表。在这项工作中，我们构建了一种能够阻断mpro的化合物，该化合物由抗疟疾药物片段和半胱氨酸蛋白酶的共价抑制剂组成。为了表征其与酶的相互作用机制，需要应用基于力场的分子力学(MM)、分子动力学(MD)和分子对接等算法，以及量子化学和量子力学/分子力学(QM/MM)方法等量子方法。超级计算机的使用是必不可少的，至少在后一种方法中是如此。它在酶中的应用假设使用量子化学方法计算活性位点的能量和力，而蛋白质基质的其余部分则使用常规力场来描述。我们发现，该化合物含有苯并异噻唑酮片段和尿嘧啶环上的替代物，可以与目标酶形成稳定的共价加合物，因此可以推荐用于实验试验。

{"title":"Computational Modeling of the SARS-CoV-2 Main Protease Inhibition by the Covalent Binding of Prospective Drug Molecules","authors":"A. Nemukhin, B. Grigorenko, I. Polyakov, S. Lushchekina","doi":"10.14529/jsfi200303","DOIUrl":"https://doi.org/10.14529/jsfi200303","url":null,"abstract":"We illustrate modern modeling tools applied in the computational design of drugs acting as covalent inhibitors of enzymes. We take the Main protease (M pro ) from the SARS-CoV-2 virus as an important present-day representative. In this work, we construct a compound capable to block M pro , which is composed of fragments of antimalarial drugs and covalent inhibitors of cysteine proteases. To characterize the mechanism of its interaction with the enzyme, the algorithms based on force fields, including molecular mechanics (MM), molecular dynamics (MD) and molecular docking, as well as quantum-based approaches, including quantum chemistry and quantum mechanics/molecular mechanics (QM/MM) methods, should be applied. The use of supercomputers is indispensably important at least in the latter approach. Its application to enzymes assumes that energies and forces in the active sites are computed using methods of quantum chemistry, whereas the rest of protein matrix is described using conventional force fields. For the proposed compound, containing the benzoisothiazolone fragment and the substitute at the uracil ring, we show that it can form a stable covalently bound adduct with the target enzyme, and thus can be recommended for experimental trials.","PeriodicalId":338883,"journal":{"name":"Supercomput. Front. Innov.","volume":"17 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-10-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128072929","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 1

In Search of Non-covalent Inhibitors of SARS-CoV-2 Main Protease: Computer Aided Drug Design Using Docking and Quantum Chemistry 寻找SARS-CoV-2主蛋白酶非共价抑制剂:基于对接和量子化学的计算机辅助药物设计

Supercomput. Front. Innov.

Pub Date : 2020-10-01 DOI: 10.14529/jsfi200305

A. Sulimov, D. Kutov, Anna S. Taschilova, I. Ilin, N. Stolpovskaya, K. Shikhaliev, V. Sulimov

Two stages virtual screening of a database containing several thousand low molecular weight organic compounds is performed with the goal to find inhibitors of SARS-CoV-2 main protease. Overall near 41000 different 3D molecular structures have been generated from the initial molecules taking into account several conformers of most molecules. At the first stage the classical SOL docking program is used to determine most promising candidates to become inhibitors. SOL employs the MMFF94 force field, the genetic algorithm (GA) of the global energy optimization, takes into account the desolvation effect arising upon protein-ligand binding and the internal stress energy of the ligand. Parameters of GA are selected to perform the meticulous global optimization, and for docking of one ligand several hours on one computing core are needed on the average. The main protease model is constructed on the base of the protein structure from the Protein Data Bank complex 6W63. More than 1000 ligands structures have been selected for further postprocessing. The SOL score values of these ligands are more negative than the threshold of –6.3 kcal/mol obtained for the native X77 ligand docking. Subsequent calculation of the protein-ligand binding enthalpy by the PM7 quantum-chemical semiempirical method with COSMO solvent model have narrowed down the number of best candidates. Finally, the diverse set of 20 most perspective candidates for the in vitro validation are selected.

对包含数千种低分子量有机化合物的数据库进行了两个阶段的虚拟筛选，目的是找到SARS-CoV-2主要蛋白酶的抑制剂。总的来说，考虑到大多数分子的几种构象，从初始分子中产生了近41000种不同的3D分子结构。在第一阶段，使用经典的SOL对接程序来确定最有希望成为抑制剂的候选者。SOL采用MMFF94力场，采用全局能量优化的遗传算法(GA)，考虑了蛋白质与配体结合时产生的脱溶效应和配体的内应力能。选择遗传算法的参数进行精细的全局优化，一个配体的对接在一个计算核心上平均需要几个小时。主要的蛋白酶模型是基于蛋白质数据库络合物6W63的蛋白质结构构建的。选择了1000多个配体结构进行进一步的后处理。这些配体的SOL得分值比天然X77配体对接时获得的-6.3 kcal/mol的阈值更负。随后用PM7量子化学半经验方法和COSMO溶剂模型计算了蛋白质与配体的结合焓，缩小了最佳候选分子的数量。最后，选择了20个最有前景的体外验证候选物。

{"title":"In Search of Non-covalent Inhibitors of SARS-CoV-2 Main Protease: Computer Aided Drug Design Using Docking and Quantum Chemistry","authors":"A. Sulimov, D. Kutov, Anna S. Taschilova, I. Ilin, N. Stolpovskaya, K. Shikhaliev, V. Sulimov","doi":"10.14529/jsfi200305","DOIUrl":"https://doi.org/10.14529/jsfi200305","url":null,"abstract":"Two stages virtual screening of a database containing several thousand low molecular weight organic compounds is performed with the goal to find inhibitors of SARS-CoV-2 main protease. Overall near 41000 different 3D molecular structures have been generated from the initial molecules taking into account several conformers of most molecules. At the first stage the classical SOL docking program is used to determine most promising candidates to become inhibitors. SOL employs the MMFF94 force field, the genetic algorithm (GA) of the global energy optimization, takes into account the desolvation effect arising upon protein-ligand binding and the internal stress energy of the ligand. Parameters of GA are selected to perform the meticulous global optimization, and for docking of one ligand several hours on one computing core are needed on the average. The main protease model is constructed on the base of the protein structure from the Protein Data Bank complex 6W63. More than 1000 ligands structures have been selected for further postprocessing. The SOL score values of these ligands are more negative than the threshold of –6.3 kcal/mol obtained for the native X77 ligand docking. Subsequent calculation of the protein-ligand binding enthalpy by the PM7 quantum-chemical semiempirical method with COSMO solvent model have narrowed down the number of best candidates. Finally, the diverse set of 20 most perspective candidates for the in vitro validation are selected.","PeriodicalId":338883,"journal":{"name":"Supercomput. Front. Innov.","volume":"42 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122303350","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 9

Perspectives on Supercomputing and Artificial Intelligence Applications in Drug Discovery 超级计算和人工智能在药物发现中的应用展望

Supercomput. Front. Innov.

Pub Date : 2020-10-01 DOI: 10.14529/jsfi200302

Jun Xu, Jiming Ye

This review starts with outlining how science and technology evaluated from last century into high throughput science and technology in modern era due to the Nobel-Prize-level inventions of combinatorial chemistry, polymerase chain reaction, and high-throughput screening. The evolution results in big data accumulated in life sciences and the fields of drug discovery. The big data demands for supercomputing in biology and medicine, although the computing complexity is still a grand challenge for sophisticated biosystems in drug design in this supercomputing era. In order to resolve the real-world issues, artificial intelligence algorithms (specifically machine learning approaches) were introduced, and have demonstrated the power in discovering structure-activity relations hidden in big biochemical data. Particularly, this review summarizes on how people modernize the conventional machine learning algorithms by combing non-numeric pattern recognition and deep learning algorithms, and successfully resolved drug design and high throughput screening issues. The review ends with the perspectives on computational opportunities and challenges in drug discovery by introducing new drug design principles and modeling the process of packing DNA with histones in micrometer scale space, a n example of how a macrocosm object gets into microcosm world.

本文首先概述了由于组合化学、聚合酶链反应和高通量筛选等诺贝尔奖级别的发明，科学技术如何从上个世纪发展到现代的高通量科学技术。这种进化导致了生命科学和药物发现领域积累的大数据。大数据对生物和医学领域的超级计算提出了要求，尽管在这个超级计算时代，计算复杂性仍然是药物设计中复杂生物系统的一个巨大挑战。为了解决现实世界的问题，引入了人工智能算法(特别是机器学习方法)，并展示了发现隐藏在大生化数据中的结构-活性关系的能力。特别地，本文总结了人们如何将非数字模式识别和深度学习算法相结合，使传统的机器学习算法现代化，并成功地解决了药物设计和高通量筛选问题。最后，通过介绍新的药物设计原则和在微米尺度空间内用组蛋白包装DNA的过程建模，回顾了药物发现中的计算机会和挑战，这是一个宏观物体如何进入微观世界的例子。

{"title":"Perspectives on Supercomputing and Artificial Intelligence Applications in Drug Discovery","authors":"Jun Xu, Jiming Ye","doi":"10.14529/jsfi200302","DOIUrl":"https://doi.org/10.14529/jsfi200302","url":null,"abstract":"This review starts with outlining how science and technology evaluated from last century into high throughput science and technology in modern era due to the Nobel-Prize-level inventions of combinatorial chemistry, polymerase chain reaction, and high-throughput screening. The evolution results in big data accumulated in life sciences and the fields of drug discovery. The big data demands for supercomputing in biology and medicine, although the computing complexity is still a grand challenge for sophisticated biosystems in drug design in this supercomputing era. In order to resolve the real-world issues, artificial intelligence algorithms (specifically machine learning approaches) were introduced, and have demonstrated the power in discovering structure-activity relations hidden in big biochemical data. Particularly, this review summarizes on how people modernize the conventional machine learning algorithms by combing non-numeric pattern recognition and deep learning algorithms, and successfully resolved drug design and high throughput screening issues. The review ends with the perspectives on computational opportunities and challenges in drug discovery by introducing new drug design principles and modeling the process of packing DNA with histones in micrometer scale space, a n example of how a macrocosm object gets into microcosm world.","PeriodicalId":338883,"journal":{"name":"Supercomput. Front. Innov.","volume":"85 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124713675","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 1

首页上一页

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

Supercomput. Front. Innov.

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀