首页 > 最新文献

Communications in Information and Systems最新文献

英文 中文
Mathematical artificial intelligence design of mutation-proof COVID-19 monoclonal antibodies. 数学人工智能设计防突变的 COVID-19 单克隆抗体。
IF 0.6 Q4 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2022-01-01 Epub Date: 2022-07-22 DOI: 10.4310/cis.2022.v22.n3.a3
Jiahui Chen, Guo-Wei Wei

Emerging severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) variants have compromised existing vaccines and posed a grand challenge to coronavirus disease 2019 (COVID-19) prevention, control, and global economic recovery. For COVID-19 patients, one of the most effective COVID-19 medications is monoclonal antibody (mAb) therapies. The United States Food and Drug Administration (U.S. FDA) has given the emergency use authorization (EUA) to a few mAbs, including those from Regeneron, Eli Elly, etc. However, they are also undermined by SARS-CoV-2 mutations. It is imperative to develop effective mutation-proof mAbs for treating COVID-19 patients infected by all emerging variants and/or the original SARS-CoV-2. We carry out a deep mutational scanning to present the blueprint of such mAbs using algebraic topology and artificial intelligence (AI). To reduce the risk of clinical trial-related failure, we select five mAbs either with FDA EUA or in clinical trials as our starting point. We demonstrate that topological AI-designed mAbs are effective for variants of concerns and variants of interest designated by the World Health Organization (WHO), as well as the original SARS-CoV-2. Our topological AI methodologies have been validated by tens of thousands of deep mutational data and their predictions have been confirmed by results from tens of experimental laboratories and population-level statistics of genome isolates from hundreds of thousands of patients.

新出现的严重急性呼吸系统综合征冠状病毒 2(SARS-CoV-2)变种破坏了现有的疫苗,并对 2019 年冠状病毒疾病(COVID-19)的预防、控制和全球经济复苏构成了巨大挑战。对于 COVID-19 患者来说,最有效的 COVID-19 药物之一是单克隆抗体(mAb)疗法。美国食品和药物管理局(U.S. FDA)已向一些 mAb 提供了紧急使用授权(EUA),其中包括 Regeneron、Eli Elly 等公司的产品。然而,它们也受到了 SARS-CoV-2 变异的影响。当务之急是开发有效的抗变异 mAbs,用于治疗感染所有新变种和/或原始 SARS-CoV-2 的 COVID-19 患者。我们利用代数拓扑学和人工智能(AI)进行了深度突变扫描,以展示此类 mAbs 的蓝图。为了降低临床试验失败的风险,我们选择了五种已获得美国食品及药物管理局(FDA)EUA 或正在进行临床试验的 mAbs 作为起点。我们证明,拓扑人工智能设计的 mAbs 对关注的变种、世界卫生组织(WHO)指定的感兴趣的变种以及原始 SARS-CoV-2 均有效。我们的拓扑人工智能方法已通过数以万计的深度变异数据进行了验证,其预测结果也得到了数十个实验实验室的结果和来自数十万患者的基因组分离物的群体级统计数据的证实。
{"title":"Mathematical artificial intelligence design of mutation-proof COVID-19 monoclonal antibodies.","authors":"Jiahui Chen, Guo-Wei Wei","doi":"10.4310/cis.2022.v22.n3.a3","DOIUrl":"10.4310/cis.2022.v22.n3.a3","url":null,"abstract":"<p><p>Emerging severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) variants have compromised existing vaccines and posed a grand challenge to coronavirus disease 2019 (COVID-19) prevention, control, and global economic recovery. For COVID-19 patients, one of the most effective COVID-19 medications is monoclonal antibody (mAb) therapies. The United States Food and Drug Administration (U.S. FDA) has given the emergency use authorization (EUA) to a few mAbs, including those from Regeneron, Eli Elly, etc. However, they are also undermined by SARS-CoV-2 mutations. It is imperative to develop effective mutation-proof mAbs for treating COVID-19 patients infected by all emerging variants and/or the original SARS-CoV-2. We carry out a deep mutational scanning to present the blueprint of such mAbs using algebraic topology and artificial intelligence (AI). To reduce the risk of clinical trial-related failure, we select five mAbs either with FDA EUA or in clinical trials as our starting point. We demonstrate that topological AI-designed mAbs are effective for variants of concerns and variants of interest designated by the World Health Organization (WHO), as well as the original SARS-CoV-2. Our topological AI methodologies have been validated by tens of thousands of deep mutational data and their predictions have been confirmed by results from tens of experimental laboratories and population-level statistics of genome isolates from hundreds of thousands of patients.</p>","PeriodicalId":45018,"journal":{"name":"Communications in Information and Systems","volume":"22 3","pages":"339-361"},"PeriodicalIF":0.6,"publicationDate":"2022-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9881605/pdf/nihms-1825681.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10695510","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Carbohydrate-Protein Interactions: Advances and Challenges. 碳水化合物-蛋白质相互作用:进展与挑战。
IF 0.9 Q4 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2021-01-01 DOI: 10.4310/cis.2021.v21.n1.a7
Shuang Zhang, Kyle Yu Chen, Xiaoqin Zou

A carbohydrate, also called saccharide in biochemistry, is a biomolecule consisting of carbon (C), hydrogen (H) and oxygen (O) atoms. For example, sugars are low molecular-weight carbohydrates, and starches are high molecular-weight carbohydrates. Carbohydrates are the most abundant organic substances in nature and essential constituents of all living things. Protein-carbohydrate interactions play important roles in many biological processes, such as cell growth, differentiation, and aggregation. They also have broad applications in pharmaceutical drug design. In this review, we will summarize the characteristic features of protein-carbohydrate interactions and review the computational methods for structure prediction, energy calculations, and kinetic studies of protein-carbohydrate complexes. Finally, we will discuss the challenges in this field.

碳水化合物,在生物化学中也称为糖类,是由碳(C)、氢(H)和氧(O)原子组成的生物分子。例如,糖是低分子量碳水化合物,而淀粉是高分子量碳水化合物。碳水化合物是自然界中含量最丰富的有机物质,是所有生物的基本成分。蛋白质-碳水化合物的相互作用在许多生物过程中起着重要作用,如细胞生长、分化和聚集。它们在药物设计中也有广泛的应用。本文综述了蛋白质-碳水化合物相互作用的特点,综述了蛋白质-碳水化合物复合物的结构预测、能量计算和动力学研究的计算方法。最后,我们将讨论这一领域面临的挑战。
{"title":"Carbohydrate-Protein Interactions: Advances and Challenges.","authors":"Shuang Zhang,&nbsp;Kyle Yu Chen,&nbsp;Xiaoqin Zou","doi":"10.4310/cis.2021.v21.n1.a7","DOIUrl":"https://doi.org/10.4310/cis.2021.v21.n1.a7","url":null,"abstract":"<p><p>A carbohydrate, also called saccharide in biochemistry, is a biomolecule consisting of carbon (C), hydrogen (H) and oxygen (O) atoms. For example, sugars are low molecular-weight carbohydrates, and starches are high molecular-weight carbohydrates. Carbohydrates are the most abundant organic substances in nature and essential constituents of all living things. Protein-carbohydrate interactions play important roles in many biological processes, such as cell growth, differentiation, and aggregation. They also have broad applications in pharmaceutical drug design. In this review, we will summarize the characteristic features of protein-carbohydrate interactions and review the computational methods for structure prediction, energy calculations, and kinetic studies of protein-carbohydrate complexes. Finally, we will discuss the challenges in this field.</p>","PeriodicalId":45018,"journal":{"name":"Communications in Information and Systems","volume":"21 1","pages":"147-163"},"PeriodicalIF":0.9,"publicationDate":"2021-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8336717/pdf/nihms-1690418.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"39290717","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
COVID-19 data sharing and collaboration COVID-19数据共享与协作
IF 0.9 Q4 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2021-01-01 DOI: 10.4310/CIS.2021.V21.N3.A1
D. Duncan
There is an immediate need to study COVID-19, and the COVID-19 Data Archive (COVID-ARC) provides access to data along with user-friendly tools for researchers to perform analyses to better understand COVID-19 and encourage collaboration on this research. The COVID-19 pandemic has been spreading rapidly across the world, and there are still many unknowns about COVID-19. There is an urgent need for scientists around the world to work together to model the virus, study how the virus has changed and will change over time, understand how it spreads, and study transmission after vaccination. COVID-ARC can also prepare scientists for future pandemics by putting the infrastructure in place to enable researchers to aggregate data and perform analyses quickly in the event of an emergency. We have developed a platform of networked and centralized web-accessible data archives to store multimodal data related to COVID-19 and make them broadly available and accessible to the world-wide scientific community to expedite research in this area. COVID-ARC provides tools for researchers to visualize and analyze various types of data as well as a website with tools for training, announcements, virtual information sessions, and a knowledgebase wherein researchers post questions and receive answers from the community.
目前迫切需要研究COVID-19, COVID-19数据档案(COVID-ARC)为研究人员提供了数据访问和用户友好的工具,以便他们进行分析,以更好地了解COVID-19并鼓励在这项研究中开展合作。当前,新冠肺炎疫情在全球范围内迅速蔓延,人们对新冠肺炎仍有许多未知之处。世界各地的科学家迫切需要共同努力,建立病毒模型,研究病毒如何随着时间变化和将如何变化,了解病毒如何传播,并研究疫苗接种后的传播。COVID-ARC还可以通过建立基础设施使研究人员能够在紧急情况下快速汇总数据并进行分析,使科学家为未来的大流行做好准备。我们开发了一个联网和集中式网络数据档案平台,用于存储与COVID-19相关的多模式数据,并使其广泛提供给全世界科学界,以加快这一领域的研究。COVID-ARC为研究人员提供了可视化和分析各种类型数据的工具,以及一个网站,其中包含培训、公告、虚拟信息会议和知识库工具,研究人员可以在其中发布问题并从社区获得答案。
{"title":"COVID-19 data sharing and collaboration","authors":"D. Duncan","doi":"10.4310/CIS.2021.V21.N3.A1","DOIUrl":"https://doi.org/10.4310/CIS.2021.V21.N3.A1","url":null,"abstract":"There is an immediate need to study COVID-19, and the COVID-19 Data Archive (COVID-ARC) provides access to data along with user-friendly tools for researchers to perform analyses to better understand COVID-19 and encourage collaboration on this research. The COVID-19 pandemic has been spreading rapidly across the world, and there are still many unknowns about COVID-19. There is an urgent need for scientists around the world to work together to model the virus, study how the virus has changed and will change over time, understand how it spreads, and study transmission after vaccination. COVID-ARC can also prepare scientists for future pandemics by putting the infrastructure in place to enable researchers to aggregate data and perform analyses quickly in the event of an emergency. We have developed a platform of networked and centralized web-accessible data archives to store multimodal data related to COVID-19 and make them broadly available and accessible to the world-wide scientific community to expedite research in this area. COVID-ARC provides tools for researchers to visualize and analyze various types of data as well as a website with tools for training, announcements, virtual information sessions, and a knowledgebase wherein researchers post questions and receive answers from the community.","PeriodicalId":45018,"journal":{"name":"Communications in Information and Systems","volume":"1 1","pages":""},"PeriodicalIF":0.9,"publicationDate":"2021-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"70404859","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
SARS-CoV-2 becoming more infectious as revealed by algebraic topology and deep learning. 代数拓扑和深度学习揭示了SARS-CoV-2的传染性增强。
IF 0.9 Q4 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2021-01-01 Epub Date: 2021-02-08 DOI: 10.4310/cis.2021.v21.n1.a2
Jiahui Chen, Rui Wang, Guo-Wei Wei

Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) caused by coronavirus disease 2019 (COVID-19) has led to a tremendous human fatality and economic loss. SARS-CoV-2 infectivity is a key reason for the widespread viral transmission, but its rigorous experimental measurement is essentially impossible due to the ongoing genome evolution around the world. We show that artificial intelligence (AI) and algebraic topology (AT) offer an accurate and efficient alternative to the experimental determination of viral infectivity. AI and AT analysis indicates that the on-going mutations make SARS-CoV-2 more infectious.

由2019冠状病毒病(COVID-19)引起的严重急性呼吸综合征冠状病毒2 (SARS-CoV-2)已导致巨大的人员死亡和经济损失。SARS-CoV-2的传染性是病毒广泛传播的一个关键原因,但由于世界各地正在进行的基因组进化,其严格的实验测量基本上是不可能的。我们表明,人工智能(AI)和代数拓扑(AT)提供了一种准确和有效的替代实验确定病毒传染性。AI和AT分析表明,正在进行的突变使SARS-CoV-2更具传染性。
{"title":"SARS-CoV-2 becoming more infectious as revealed by algebraic topology and deep learning.","authors":"Jiahui Chen,&nbsp;Rui Wang,&nbsp;Guo-Wei Wei","doi":"10.4310/cis.2021.v21.n1.a2","DOIUrl":"https://doi.org/10.4310/cis.2021.v21.n1.a2","url":null,"abstract":"<p><p>Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) caused by coronavirus disease 2019 (COVID-19) has led to a tremendous human fatality and economic loss. SARS-CoV-2 infectivity is a key reason for the widespread viral transmission, but its rigorous experimental measurement is essentially impossible due to the ongoing genome evolution around the world. We show that artificial intelligence (AI) and algebraic topology (AT) offer an accurate and efficient alternative to the experimental determination of viral infectivity. AI and AT analysis indicates that the on-going mutations make SARS-CoV-2 more infectious.</p>","PeriodicalId":45018,"journal":{"name":"Communications in Information and Systems","volume":"21 1","pages":"31-36"},"PeriodicalIF":0.9,"publicationDate":"2021-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8528241/pdf/nihms-1698929.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"39541759","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Bayes-inspired theory for optimally building an efficient coarse-grained folding force field. 一种贝叶斯启发理论,用于优化建立高效的粗粒度折叠力场。
IF 0.9 Q4 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2021-01-01 DOI: 10.4310/cis.2021.v21.n1.a4
Travis Hurst, Dong Zhang, Yuanzhe Zhou, Shi-Jie Chen

Because of their potential utility in predicting conformational changes and assessing folding dynamics, coarse-grained (CG) RNA folding models are appealing for rapid characterization of RNA molecules. Previously, we reported the iterative simulated RNA reference state (IsRNA) method for parameterizing a CG force field for RNA folding, which consecutively updates the simulation force field to reflect marginal distributions of folding coordinates in the structure database and extract various energy terms. While the IsRNA model was validated by showing close agreement between the IsRNA-simulated and experimentally observed distributions, here, we expand our theoretical understanding of the model and, in doing so, improve the parameterization process to optimize the subset of included folding coordinates, which leads to accelerated simulations. Using statistical mechanical theory, we analyze the underlying, Bayesian concept that drives parameterization of the energy function, providing a general method for developing predictive, knowledge-based, polymer force fields on the basis of limited data. Furthermore, we propose an optimal parameterization procedure, based on the principal of maximum entropy.

粗粒度(CG)RNA 折叠模型在预测构象变化和评估折叠动力学方面具有潜在的实用性,因此对于快速鉴定 RNA 分子具有吸引力。此前,我们报道了迭代模拟 RNA 参考态(IsRNA)的方法,该方法可为 RNA 折叠的 CG 力场参数化,连续更新模拟力场以反映结构数据库中折叠坐标的边际分布,并提取各种能量项。IsRNA 模型通过显示 IsRNA 模拟分布与实验观察分布之间的密切一致性而得到了验证,在此,我们扩展了对该模型的理论理解,并在此过程中改进了参数化过程,以优化所包含的折叠坐标子集,从而加快了模拟速度。我们利用统计力学理论分析了驱动能量函数参数化的基本贝叶斯概念,为在有限数据的基础上开发基于知识的预测性聚合物力场提供了通用方法。此外,我们还提出了一种基于最大熵原理的最优参数化程序。
{"title":"A Bayes-inspired theory for optimally building an efficient coarse-grained folding force field.","authors":"Travis Hurst, Dong Zhang, Yuanzhe Zhou, Shi-Jie Chen","doi":"10.4310/cis.2021.v21.n1.a4","DOIUrl":"10.4310/cis.2021.v21.n1.a4","url":null,"abstract":"<p><p>Because of their potential utility in predicting conformational changes and assessing folding dynamics, coarse-grained (CG) RNA folding models are appealing for rapid characterization of RNA molecules. Previously, we reported the iterative simulated RNA reference state (IsRNA) method for parameterizing a CG force field for RNA folding, which consecutively updates the simulation force field to reflect marginal distributions of folding coordinates in the structure database and extract various energy terms. While the IsRNA model was validated by showing close agreement between the IsRNA-simulated and experimentally observed distributions, here, we expand our theoretical understanding of the model and, in doing so, improve the parameterization process to optimize the subset of included folding coordinates, which leads to accelerated simulations. Using statistical mechanical theory, we analyze the underlying, Bayesian concept that drives parameterization of the energy function, providing a general method for developing predictive, knowledge-based, polymer force fields on the basis of limited data. Furthermore, we propose an optimal parameterization procedure, based on the principal of maximum entropy.</p>","PeriodicalId":45018,"journal":{"name":"Communications in Information and Systems","volume":"21 1","pages":"65-83"},"PeriodicalIF":0.9,"publicationDate":"2021-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8336718/pdf/nihms-1690260.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"39280490","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Scoring Functions for Protein-RNA Complex Structure Prediction: Advances, Applications, and Future Directions. 蛋白质- rna复合物结构预测的评分功能:进展、应用和未来方向。
IF 0.9 Q4 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2020-01-01 DOI: 10.4310/cis.2020.v20.n1.a1
Liming Qiu, Xiaoqin Zou

Protein-RNA interaction is among the most essential of biological events in living cells, being involved in protein synthesizing, RNA processing and transport, DNA transcription, and regulation of gene expression, and many other critical bio-molecular activities. A thorough understanding of this interaction is of paramount importance in fundamental study of a variety of vital cellular processes and therapeutic application for remedy of a broad range of diseases. Experimental high-resolution 3D structure determination is the primary source of knowledge for protein-RNA complexes. However, due to technical limitations, the existing techniques for experimental structure determination couldn't match the demand from fast growing interest in academia and industry. This problem necessitates the alternative high-throughput computational method for protein-RNA complex structure prediction. Similar to the in silico methods used for protein-protein and protein-DNA interactions, a reliable prediction of protein-RNA complex structure requires a scoring function with commensurate discriminatory power. Derived from determined structures and purposed to predict the to-be-determined structures, the scoring function is not only a predictive tool but also a gauge of our knowledge of protein-RNA interaction. In this review, we present an overview of the status of existing scoring functions and the scientific principle behind their constructions as well as their strengths and limitations. Finally, we will discuss about future directions of the scoring function development for protein-RNA structure prediction.

蛋白质-RNA相互作用是活细胞中最重要的生物事件之一,涉及蛋白质合成、RNA加工和运输、DNA转录、基因表达调控以及许多其他关键的生物分子活动。对这种相互作用的透彻理解对于各种重要细胞过程的基础研究和广泛疾病的治疗应用至关重要。实验高分辨率三维结构测定是蛋白质- rna复合物知识的主要来源。然而,由于技术的限制,现有的实验结构确定技术无法满足学术界和工业界快速增长的需求。这个问题需要另一种高通量计算方法来预测蛋白质- rna复合物的结构。与用于蛋白质-蛋白质和蛋白质- dna相互作用的计算机方法类似,对蛋白质- rna复合物结构的可靠预测需要具有相应判别能力的评分函数。从已确定的结构中提取,用于预测待确定的结构,评分函数不仅是一种预测工具,而且是我们对蛋白质- rna相互作用知识的衡量标准。在这篇综述中,我们介绍了现有的评分函数的现状和科学原理背后的结构,以及他们的优势和局限性。最后,我们将讨论用于蛋白质- rna结构预测的评分函数的未来发展方向。
{"title":"Scoring Functions for Protein-RNA Complex Structure Prediction: Advances, Applications, and Future Directions.","authors":"Liming Qiu,&nbsp;Xiaoqin Zou","doi":"10.4310/cis.2020.v20.n1.a1","DOIUrl":"https://doi.org/10.4310/cis.2020.v20.n1.a1","url":null,"abstract":"<p><p>Protein-RNA interaction is among the most essential of biological events in living cells, being involved in protein synthesizing, RNA processing and transport, DNA transcription, and regulation of gene expression, and many other critical bio-molecular activities. A thorough understanding of this interaction is of paramount importance in fundamental study of a variety of vital cellular processes and therapeutic application for remedy of a broad range of diseases. Experimental high-resolution 3D structure determination is the primary source of knowledge for protein-RNA complexes. However, due to technical limitations, the existing techniques for experimental structure determination couldn't match the demand from fast growing interest in academia and industry. This problem necessitates the alternative high-throughput computational method for protein-RNA complex structure prediction. Similar to the in silico methods used for protein-protein and protein-DNA interactions, a reliable prediction of protein-RNA complex structure requires a scoring function with commensurate discriminatory power. Derived from determined structures and purposed to predict the to-be-determined structures, the scoring function is not only a predictive tool but also a gauge of our knowledge of protein-RNA interaction. In this review, we present an overview of the status of existing scoring functions and the scientific principle behind their constructions as well as their strengths and limitations. Finally, we will discuss about future directions of the scoring function development for protein-RNA structure prediction.</p>","PeriodicalId":45018,"journal":{"name":"Communications in Information and Systems","volume":"20 1","pages":"1-22"},"PeriodicalIF":0.9,"publicationDate":"2020-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8049283/pdf/nihms-1690416.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"38817789","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Generative network complex (GNC) for drug discovery. 用于药物发现的生成网络复合体(GNC)。
IF 0.9 Q4 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2019-01-01 DOI: 10.4310/cis.2019.v19.n3.a2
Christopher Grow, Kaifu Gao, Duc Duy Nguyen, Guo-Wei Wei

It remains a challenging task to generate a vast variety of novel compounds with desirable pharmacological properties. In this work, a generative network complex (GNC) is proposed as a new platform for designing novel compounds, predicting their physical and chemical properties, and selecting potential drug candidates that fulfill various druggable criteria such as binding affinity, solubility, partition coefficient, etc. We combine a SMILES string generator, which consists of an encoder, a drug-property controlled or regulated latent space, and a decoder, with verification deep neural networks, a target-specific three-dimensional (3D) pose generator, and mathematical deep learning networks to generate new compounds, predict their drug properties, construct 3D poses associated with target proteins, and reevaluate druggability, respectively. New compounds were generated in the latent space by either randomized output, controlled output, or optimized output. In our demonstration, 2.08 million and 2.8 million novel compounds are generated respectively for Cathepsin S and BACE targets. These new compounds are very different from the seeds and cover a larger chemical space. For potentially active compounds, their 3D poses are generated using a state-of-the-art method. The resulting 3D complexes are further evaluated for druggability by a championing deep learning algorithm based on algebraic topology, differential geometry, and algebraic graph theories. Performed on supercomputers, the whole process took less than one week. Therefore, our GNC is an efficient new paradigm for discovering new drug candidates.

要生成大量具有理想药理特性的新型化合物,仍然是一项具有挑战性的任务。在这项工作中,我们提出了一种生成网络复合体(GNC),作为设计新型化合物、预测其物理和化学性质以及选择符合各种可药用标准(如结合亲和力、溶解度、分配系数等)的潜在候选药物的新平台。我们将 SMILES 字符串生成器(由编码器、药物特性控制或调节潜空间和解码器组成)与验证深度神经网络、目标特定三维(3D)姿态生成器和数学深度学习网络相结合,分别生成新化合物、预测其药物特性、构建与目标蛋白质相关的三维姿态以及重新评估可药用性。新化合物通过随机输出、控制输出或优化输出在潜空间生成。在我们的演示中,针对 Cathepsin S 和 BACE 目标分别生成了 208 万和 280 万种新型化合物。这些新化合物与种子化合物截然不同,涵盖了更大的化学空间。对于具有潜在活性的化合物,我们采用最先进的方法生成了它们的三维姿态。基于代数拓扑学、微分几何学和代数图理论的深度学习算法将进一步评估生成的三维复合物的可药性。整个过程在超级计算机上完成,耗时不到一周。因此,我们的 GNC 是发现候选新药的高效新范例。
{"title":"Generative network complex (GNC) for drug discovery.","authors":"Christopher Grow, Kaifu Gao, Duc Duy Nguyen, Guo-Wei Wei","doi":"10.4310/cis.2019.v19.n3.a2","DOIUrl":"10.4310/cis.2019.v19.n3.a2","url":null,"abstract":"<p><p>It remains a challenging task to generate a vast variety of novel compounds with desirable pharmacological properties. In this work, a generative network complex (GNC) is proposed as a new platform for designing novel compounds, predicting their physical and chemical properties, and selecting potential drug candidates that fulfill various druggable criteria such as binding affinity, solubility, partition coefficient, etc. We combine a SMILES string generator, which consists of an encoder, a drug-property controlled or regulated latent space, and a decoder, with verification deep neural networks, a target-specific three-dimensional (3D) pose generator, and mathematical deep learning networks to generate new compounds, predict their drug properties, construct 3D poses associated with target proteins, and reevaluate druggability, respectively. New compounds were generated in the latent space by either randomized output, controlled output, or optimized output. In our demonstration, 2.08 million and 2.8 million novel compounds are generated respectively for Cathepsin S and BACE targets. These new compounds are very different from the seeds and cover a larger chemical space. For potentially active compounds, their 3D poses are generated using a state-of-the-art method. The resulting 3D complexes are further evaluated for druggability by a championing deep learning algorithm based on algebraic topology, differential geometry, and algebraic graph theories. Performed on supercomputers, the whole process took less than one week. Therefore, our GNC is an efficient new paradigm for discovering new drug candidates.</p>","PeriodicalId":45018,"journal":{"name":"Communications in Information and Systems","volume":"19 3","pages":"241-277"},"PeriodicalIF":0.9,"publicationDate":"2019-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8274326/pdf/nihms-1069335.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"39182616","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Evolution of Coagulation-Fragmentation Stochastic Processes Using Accurate Chemical Master Equation Approach. 基于精确化学主方程方法的凝固-破碎随机过程演化。
IF 0.9 Q4 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2019-01-01 Epub Date: 2019-04-18 DOI: 10.4310/cis.2019.v19.n1.a3
Farid Manuchehrfar, Wei Tian, Tom Chou, Jie Liang

Coagulation and fragmentation (CF) is a fundamental process in which smaller particles attach to each other to form larger clusters while existing clusters break up into smaller particles . It is a ubiquitous process that plays important roles in many physical and biological phenomena. CF is typically a stochastic process that often occurs in confined spaces with a limited number of available particles . Here, we study the CF process formulated with the discrete Chemical Master Equation (dCME). Using the newly developed Accurate Chemical Master Equation (ACME) method, we examine the time-dependent behavior of the CF system. We investigate the effects of a number of important factors that influence the overall behavior of the system, including the dimensionality, the ratio of attachment to detachment rates among clusters, and the initial conditions. By comparing CF in one and three dimensions, we conclude that systems in three dimensions are more likely to form large clusters. We also demonstrate how the ratio of the attachment to detachment rates affects the dynamics and the steady-state of the system. Finally, we demonstrate the relationship between the formation of large clusters and the initial condition.

凝聚和破碎(CF)是一个基本的过程,在这个过程中,较小的颗粒相互附着形成较大的团簇,而现有的团簇则分解成较小的颗粒。它是一个无处不在的过程,在许多物理和生物现象中起着重要作用。CF通常是一个随机过程,通常发生在有限数量的可用粒子的密闭空间中。在这里,我们研究了用离散化学主方程(dCME)表示的CF过程。使用新开发的精确化学主方程(ACME)方法,我们研究了CF系统的时间依赖行为。我们研究了影响系统整体行为的一些重要因素的影响,包括维数、簇间的附着率和分离率的比率以及初始条件。通过比较一维和三维的CF,我们得出结论,三维的系统更有可能形成大的集群。我们还演示了附着率与分离率的比值如何影响系统的动力学和稳态。最后,我们证明了大簇的形成与初始条件之间的关系。
{"title":"Evolution of Coagulation-Fragmentation Stochastic Processes Using Accurate Chemical Master Equation Approach.","authors":"Farid Manuchehrfar,&nbsp;Wei Tian,&nbsp;Tom Chou,&nbsp;Jie Liang","doi":"10.4310/cis.2019.v19.n1.a3","DOIUrl":"https://doi.org/10.4310/cis.2019.v19.n1.a3","url":null,"abstract":"<p><p>Coagulation and fragmentation (CF) is a fundamental process in which smaller particles attach to each other to form larger clusters while existing clusters break up into smaller particles . It is a ubiquitous process that plays important roles in many physical and biological phenomena. CF is typically a stochastic process that often occurs in confined spaces with a limited number of available particles . Here, we study the CF process formulated with the discrete Chemical Master Equation (dCME). Using the newly developed Accurate Chemical Master Equation (ACME) method, we examine the time-dependent behavior of the CF system. We investigate the effects of a number of important factors that influence the overall behavior of the system, including the dimensionality, the ratio of attachment to detachment rates among clusters, and the initial conditions. By comparing CF in one and three dimensions, we conclude that systems in three dimensions are more likely to form large clusters. We also demonstrate how the ratio of the attachment to detachment rates affects the dynamics and the steady-state of the system. Finally, we demonstrate the relationship between the formation of large clusters and the initial condition.</p>","PeriodicalId":45018,"journal":{"name":"Communications in Information and Systems","volume":"19 1","pages":"37-55"},"PeriodicalIF":0.9,"publicationDate":"2019-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8378664/pdf/nihms-1707545.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"39333873","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Review of quantitative systems pharmacological modeling in thrombosis. 血栓形成定量系统药理模型研究进展。
IF 0.9 Q4 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2019-01-01 Epub Date: 2019-12-06 DOI: 10.4310/cis.2019.v19.n3.a1
Limei Cheng, Guo-Wei Wei, Tarek Leil

Hemostasis and thrombosis are often thought as two sides of the same clotting mechanism whereas hemostasis is a natural protective mechanism to prevent bleeding and thrombosis is a blood clot abnormally formulated inside a blood vessel, blocking the normal blood flow. The evidence to date suggests that at least arterial thrombosis results from the same critical pathways of hemostasis. Analysis of these complex processes and pathways using quantitative systems pharmacological model-based approach can facilitate the delineation of the causal pathways that lead to the emergence of thrombosis. In this paper, we provide an overview of the main molecular and physiological mechanisms associated with hemostasis and thrombosis, and review the models and quantitative system pharmacological modeling approaches that are relevant in characterizing the interplay among the multiple factors and pathways of thrombosis. An emphasis is given to computational models for drug development. Future trends are discussed.

止血和血栓形成通常被认为是同一凝血机制的两个方面,而止血是防止出血的自然保护机制,血栓形成是血管内异常形成的血块,阻断了正常的血液流动。迄今为止的证据表明,至少动脉血栓形成是由相同的关键止血途径引起的。使用基于定量系统药理学模型的方法分析这些复杂的过程和途径可以促进导致血栓形成的因果途径的描述。在本文中,我们概述了与止血和血栓形成相关的主要分子和生理机制,并综述了与表征血栓形成的多因素和途径之间相互作用相关的模型和定量系统药理学建模方法。重点是药物开发的计算模型。讨论了未来的发展趋势。
{"title":"Review of quantitative systems pharmacological modeling in thrombosis.","authors":"Limei Cheng,&nbsp;Guo-Wei Wei,&nbsp;Tarek Leil","doi":"10.4310/cis.2019.v19.n3.a1","DOIUrl":"https://doi.org/10.4310/cis.2019.v19.n3.a1","url":null,"abstract":"<p><p>Hemostasis and thrombosis are often thought as two sides of the same clotting mechanism whereas hemostasis is a natural protective mechanism to prevent bleeding and thrombosis is a blood clot abnormally formulated inside a blood vessel, blocking the normal blood flow. The evidence to date suggests that at least arterial thrombosis results from the same critical pathways of hemostasis. Analysis of these complex processes and pathways using quantitative systems pharmacological model-based approach can facilitate the delineation of the causal pathways that lead to the emergence of thrombosis. In this paper, we provide an overview of the main molecular and physiological mechanisms associated with hemostasis and thrombosis, and review the models and quantitative system pharmacological modeling approaches that are relevant in characterizing the interplay among the multiple factors and pathways of thrombosis. An emphasis is given to computational models for drug development. Future trends are discussed.</p>","PeriodicalId":45018,"journal":{"name":"Communications in Information and Systems","volume":"19 3","pages":"219-240"},"PeriodicalIF":0.9,"publicationDate":"2019-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8153064/pdf/nihms-1069336.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"39026484","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Divide-and-conquer strategy for large-scale Eulerian solvent excluded surface 大规模欧拉溶剂排除表面的分治策略
IF 0.9 Q4 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2018-09-12 DOI: 10.4310/CIS.2018.v18.n4.a5
Rundong Zhao, Menglun Wang, Y. Tong, G. Wei
MotivationSurface generation and visualization are some of the most important tasks in biomolecular modeling and computation. Eulerian solvent excluded surface (ESES) software provides analytical solvent excluded surface (SES) in the Cartesian grid, which is necessary for simulating many biomolecular electrostatic and ion channel models. However, large biomolecules and/or fine grid resolutions give rise to excessively large memory requirements in ESES construction. We introduce an out-of-core and parallel algorithm to improve the ESES software.ResultsThe present approach drastically improves the spatial and temporal efficiency of ESES. The memory footprint and time complexity are analyzed and empirically verified through extensive tests with a large collection of biomolecule examples. Our results show that our algorithm can successfully reduce memory footprint through a straightforward divide-and-conquer strategy to perform the calculation of arbitrarily large proteins on a typical commodity personal computer. On multi-core computers or clusters, our algorithm can reduce the execution time by parallelizing most of the calculation as disjoint subproblems. Various comparisons with the state-of-the-art Cartesian grid based SES calculation were done to validate the present method and show the improved efficiency. This approach makes ESES a robust software for the construction of analytical solvent excluded surfaces.Availability and implementationhttp://weilab.math.msu.edu/ESES.
动机表面生成和可视化是生物分子建模和计算中最重要的任务之一。欧拉溶剂排除表面(ESES)软件提供了笛卡尔网格中的分析溶剂排除表面,这对于模拟许多生物分子静电和离子通道模型是必要的。然而,在ESES构建中,大的生物分子和/或精细的网格分辨率引起了过大的存储器需求。为了改进ESES软件,我们引入了一种核外并行算法。结果该方法大大提高了ESES的时空效率。通过大量生物分子实例的广泛测试,分析并实证验证了记忆足迹和时间复杂性。我们的结果表明,我们的算法可以通过简单的分而治之策略在典型的商品个人计算机上执行任意大蛋白质的计算,成功地减少内存占用。在多核计算机或集群上,我们的算法可以通过将大部分计算并行化为不相交的子问题来减少执行时间。与最先进的基于笛卡尔网格的SES计算进行了各种比较,以验证当前方法,并显示出改进的效率。这种方法使ESES成为一个强大的软件,用于构建分析溶剂排除表面。可用性和实施ationhttp://weilab.math.msu.edu/ESES.
{"title":"Divide-and-conquer strategy for large-scale Eulerian solvent excluded surface","authors":"Rundong Zhao, Menglun Wang, Y. Tong, G. Wei","doi":"10.4310/CIS.2018.v18.n4.a5","DOIUrl":"https://doi.org/10.4310/CIS.2018.v18.n4.a5","url":null,"abstract":"Motivation\u0000Surface generation and visualization are some of the most important tasks in biomolecular modeling and computation. Eulerian solvent excluded surface (ESES) software provides analytical solvent excluded surface (SES) in the Cartesian grid, which is necessary for simulating many biomolecular electrostatic and ion channel models. However, large biomolecules and/or fine grid resolutions give rise to excessively large memory requirements in ESES construction. We introduce an out-of-core and parallel algorithm to improve the ESES software.\u0000\u0000\u0000Results\u0000The present approach drastically improves the spatial and temporal efficiency of ESES. The memory footprint and time complexity are analyzed and empirically verified through extensive tests with a large collection of biomolecule examples. Our results show that our algorithm can successfully reduce memory footprint through a straightforward divide-and-conquer strategy to perform the calculation of arbitrarily large proteins on a typical commodity personal computer. On multi-core computers or clusters, our algorithm can reduce the execution time by parallelizing most of the calculation as disjoint subproblems. Various comparisons with the state-of-the-art Cartesian grid based SES calculation were done to validate the present method and show the improved efficiency. This approach makes ESES a robust software for the construction of analytical solvent excluded surfaces.\u0000\u0000\u0000Availability and implementation\u0000http://weilab.math.msu.edu/ESES.","PeriodicalId":45018,"journal":{"name":"Communications in Information and Systems","volume":"18 4 1","pages":"299-329"},"PeriodicalIF":0.9,"publicationDate":"2018-09-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49172497","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
期刊
Communications in Information and Systems
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1