Alejandro Gómez-García, Daniel A Acuña Jiménez, William J Zamora, Haruna L Barazorda-Ccahuana, Miguel Á Chávez-Fumagalli, Marilia Valli, Adriano D Andricopulo, Vanderlan da S Bolzani, Dionisio A Olmedo, Pablo N Solís, Marvin J Núñez, Johny R Rodríguez Pérez, Hoover A Valencia Sánchez, Héctor F Cortés Hernández, Oscar M Mosquera Martinez, José L Medina-Franco
{"title":"Latin American Natural Product Database (LANaPDB): An Update.","authors":"Alejandro Gómez-García, Daniel A Acuña Jiménez, William J Zamora, Haruna L Barazorda-Ccahuana, Miguel Á Chávez-Fumagalli, Marilia Valli, Adriano D Andricopulo, Vanderlan da S Bolzani, Dionisio A Olmedo, Pablo N Solís, Marvin J Núñez, Johny R Rodríguez Pérez, Hoover A Valencia Sánchez, Héctor F Cortés Hernández, Oscar M Mosquera Martinez, José L Medina-Franco","doi":"10.1021/acs.jcim.4c01560","DOIUrl":null,"url":null,"abstract":"<p><p>Natural product (NP) databases are crucial tools in computer-aided drug design (CADD). Over the past decade, there has been a worldwide effort to assemble information regarding natural products (NPs) isolated and characterized in certain geographical regions. In 2023, it was published LANaPDB, and to our knowledge, this is the first attempt to gather and standardize all the NP databases of Latin America. Herein, we present and analyze in detail the contents of an updated version of LANaPDB, which includes 619 newly added compounds from Colombia, Costa Rica, and Mexico. The present version of LANaPDB has a total of 13 578 compounds, coming from ten databases of seven Latin American countries. A chemoinformatic characterization of LANaPDB was carried out, which includes the structural classification of the compounds, calculation of six physicochemical properties of pharmaceutical interest, and visualization of the chemical space by employing and comparing two different fingerprints (MACCS keys (166-bit) and Morgan2 (2048-bit)). Furthermore, additional analyses were made, and valuable information not included in the first version of LANaPDB was added, which includes structural diversity, molecular complexity, synthetic feasibility, commercial availability, and reported and predicted biological activity. In addition, the LANaPDB compounds were cross-referenced to two of the largest public chemical compound databases annotated with biological activity: ChEMBL and PubChem.</p>","PeriodicalId":44,"journal":{"name":"Journal of Chemical Information and Modeling ","volume":" ","pages":"8495-8509"},"PeriodicalIF":5.6000,"publicationDate":"2024-11-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11600509/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Chemical Information and Modeling ","FirstCategoryId":"92","ListUrlMain":"https://doi.org/10.1021/acs.jcim.4c01560","RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/11/6 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"CHEMISTRY, MEDICINAL","Score":null,"Total":0}
引用次数: 0
Abstract
Natural product (NP) databases are crucial tools in computer-aided drug design (CADD). Over the past decade, there has been a worldwide effort to assemble information regarding natural products (NPs) isolated and characterized in certain geographical regions. In 2023, it was published LANaPDB, and to our knowledge, this is the first attempt to gather and standardize all the NP databases of Latin America. Herein, we present and analyze in detail the contents of an updated version of LANaPDB, which includes 619 newly added compounds from Colombia, Costa Rica, and Mexico. The present version of LANaPDB has a total of 13 578 compounds, coming from ten databases of seven Latin American countries. A chemoinformatic characterization of LANaPDB was carried out, which includes the structural classification of the compounds, calculation of six physicochemical properties of pharmaceutical interest, and visualization of the chemical space by employing and comparing two different fingerprints (MACCS keys (166-bit) and Morgan2 (2048-bit)). Furthermore, additional analyses were made, and valuable information not included in the first version of LANaPDB was added, which includes structural diversity, molecular complexity, synthetic feasibility, commercial availability, and reported and predicted biological activity. In addition, the LANaPDB compounds were cross-referenced to two of the largest public chemical compound databases annotated with biological activity: ChEMBL and PubChem.
期刊介绍:
The Journal of Chemical Information and Modeling publishes papers reporting new methodology and/or important applications in the fields of chemical informatics and molecular modeling. Specific topics include the representation and computer-based searching of chemical databases, molecular modeling, computer-aided molecular design of new materials, catalysts, or ligands, development of new computational methods or efficient algorithms for chemical software, and biopharmaceutical chemistry including analyses of biological activity and other issues related to drug discovery.
Astute chemists, computer scientists, and information specialists look to this monthly’s insightful research studies, programming innovations, and software reviews to keep current with advances in this integral, multidisciplinary field.
As a subscriber you’ll stay abreast of database search systems, use of graph theory in chemical problems, substructure search systems, pattern recognition and clustering, analysis of chemical and physical data, molecular modeling, graphics and natural language interfaces, bibliometric and citation analysis, and synthesis design and reactions databases.