首页 > 最新文献

Journal of Chemical Information and Modeling 最新文献

英文 中文
Exploring Metformin's Therapeutic Potential for Alzheimer's Disease: An In-Silico Perspective Using Well-Tempered Funnel Metadynamics. 探索二甲双胍治疗阿尔茨海默病的潜力:利用井喷式漏斗元动力学的室内视角。
IF 5.6 2区 化学 Q1 CHEMISTRY, MEDICINAL Pub Date : 2025-04-13 DOI: 10.1021/acs.jcim.5c00106
Sunandini Swain,Atanu K Metya
Alzheimer's disease (AD), often referred to as the "diabetes of the brain", is intricately linked to insulin resistance. Metformin, a first-line antidiabetic drug, has been anticipated as a potential treatment for AD and is currently undergoing phase 3 clinical trials. The potential success of metformin in treating AD could herald a new era in the management of this debilitating disease, providing hope for millions of people affected worldwide. Despite this fact, the precise molecular mechanisms underlying the therapeutic effects of metformin on AD remain poorly understood. To pursue this, in this present work, we implement a comprehensive computational approach combining classical molecular dynamics (MD) simulations and the advanced enhanced sampling technique funnel metadynamics (FM) to explore the dynamics and affinity of metformin and acetylcholinesterase (AChE), a novel target for AD. The MD and FM simulations suggest that metformin induces significant configurational changes within the AChE, resulting in weak and nonspecific binding. Furthermore, the presence of metformin alters the conformational landscape of AChE causing the emergence of metastable states and less rigid binding patterns. The binding energies for the metformin-AChE complex are -4.89 ± 1.2 kcal/mol and -1.68 ± 0.2 kcal/mol, as estimated through the molecular mechanics Poisson-Boltzmann surface area (MMPBSA) and FM approaches, respectively. To elucidate the binding energy relevance calculated by MMPBSA and FM approach with experimental inhibitory potency, ΔGexp is calculated using IC50 value reported in prior experimental studies. ΔGexp is estimated to be -3.59 kcal/mol. A comparison of these binding energy values with different methods highlights the moderate inhibitory potency of metformin toward AChE. This work provides molecular-level insights emphasizing the dynamic configurational changes induced by metformin within AChE and underscores its translational potential in the repurposing of AD.
{"title":"Exploring Metformin's Therapeutic Potential for Alzheimer's Disease: An In-Silico Perspective Using Well-Tempered Funnel Metadynamics.","authors":"Sunandini Swain,Atanu K Metya","doi":"10.1021/acs.jcim.5c00106","DOIUrl":"https://doi.org/10.1021/acs.jcim.5c00106","url":null,"abstract":"Alzheimer's disease (AD), often referred to as the \"diabetes of the brain\", is intricately linked to insulin resistance. Metformin, a first-line antidiabetic drug, has been anticipated as a potential treatment for AD and is currently undergoing phase 3 clinical trials. The potential success of metformin in treating AD could herald a new era in the management of this debilitating disease, providing hope for millions of people affected worldwide. Despite this fact, the precise molecular mechanisms underlying the therapeutic effects of metformin on AD remain poorly understood. To pursue this, in this present work, we implement a comprehensive computational approach combining classical molecular dynamics (MD) simulations and the advanced enhanced sampling technique funnel metadynamics (FM) to explore the dynamics and affinity of metformin and acetylcholinesterase (AChE), a novel target for AD. The MD and FM simulations suggest that metformin induces significant configurational changes within the AChE, resulting in weak and nonspecific binding. Furthermore, the presence of metformin alters the conformational landscape of AChE causing the emergence of metastable states and less rigid binding patterns. The binding energies for the metformin-AChE complex are -4.89 ± 1.2 kcal/mol and -1.68 ± 0.2 kcal/mol, as estimated through the molecular mechanics Poisson-Boltzmann surface area (MMPBSA) and FM approaches, respectively. To elucidate the binding energy relevance calculated by MMPBSA and FM approach with experimental inhibitory potency, ΔGexp is calculated using IC50 value reported in prior experimental studies. ΔGexp is estimated to be -3.59 kcal/mol. A comparison of these binding energy values with different methods highlights the moderate inhibitory potency of metformin toward AChE. This work provides molecular-level insights emphasizing the dynamic configurational changes induced by metformin within AChE and underscores its translational potential in the repurposing of AD.","PeriodicalId":44,"journal":{"name":"Journal of Chemical Information and Modeling ","volume":"49 1","pages":""},"PeriodicalIF":5.6,"publicationDate":"2025-04-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143831552","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Prediction of Umami Peptides Based on a Large Language Model of Proteins.
IF 5.6 2区 化学 Q1 CHEMISTRY, MEDICINAL Pub Date : 2025-04-11 DOI: 10.1021/acs.jcim.4c02394
Yi He,Zhenglin Tian,Jingxian Zheng,Haohao Wang,Lu Han,Weiwei Han
Umami peptides possess unique characteristics, making their study highly significant. To better understand umami peptides, this research systematically investigates them using protein language models. First, we collected IC50 and Kd data to construct a protein-peptide affinity model and combined it with protein-peptide docking techniques to explore the affinity relationships between umami peptides, non-umami peptides, and taste receptors. The results indicate that umami peptides exhibit stronger affinity to umami receptors compared to non-umami peptides but show no significant difference in affinity to bitter receptors. Subsequently, we systematically gathered 972 umami peptides and 608 non-umami peptides, developing the largest data set of umami peptides to date. Using protein language models combined with molecular docking and affinity prediction results, we constructed the most accurate umami peptide prediction model, achieving an accuracy of 82% and an area under the curve (AUC) of 0.87. Finally, we developed a user-friendly website for umami peptide analysis, UmamiMeta, accessible at https://hwwlab.com/Webserver/umamimeta, providing a convenient tool for the research and application of umami peptides.
{"title":"Prediction of Umami Peptides Based on a Large Language Model of Proteins.","authors":"Yi He,Zhenglin Tian,Jingxian Zheng,Haohao Wang,Lu Han,Weiwei Han","doi":"10.1021/acs.jcim.4c02394","DOIUrl":"https://doi.org/10.1021/acs.jcim.4c02394","url":null,"abstract":"Umami peptides possess unique characteristics, making their study highly significant. To better understand umami peptides, this research systematically investigates them using protein language models. First, we collected IC50 and Kd data to construct a protein-peptide affinity model and combined it with protein-peptide docking techniques to explore the affinity relationships between umami peptides, non-umami peptides, and taste receptors. The results indicate that umami peptides exhibit stronger affinity to umami receptors compared to non-umami peptides but show no significant difference in affinity to bitter receptors. Subsequently, we systematically gathered 972 umami peptides and 608 non-umami peptides, developing the largest data set of umami peptides to date. Using protein language models combined with molecular docking and affinity prediction results, we constructed the most accurate umami peptide prediction model, achieving an accuracy of 82% and an area under the curve (AUC) of 0.87. Finally, we developed a user-friendly website for umami peptide analysis, UmamiMeta, accessible at https://hwwlab.com/Webserver/umamimeta, providing a convenient tool for the research and application of umami peptides.","PeriodicalId":44,"journal":{"name":"Journal of Chemical Information and Modeling ","volume":"103 1","pages":""},"PeriodicalIF":5.6,"publicationDate":"2025-04-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143822420","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Machine Learning for Predicting the Drug-to-Antibody Ratio (DAR) in the Synthesis of Antibody-Drug Conjugates (ADCs).
IF 5.6 2区 化学 Q1 CHEMISTRY, MEDICINAL Pub Date : 2025-04-11 DOI: 10.1021/acs.jcim.5c00037
Lorenzo Angiolini,Fabrizio Manetti,Ottavia Spiga,Andrea Tafi,Anna Visibelli,Elena Petricci
The pharmaceutical industry faces challenges in developing efficient and cost-effective drug delivery systems. Among various applications, antibody-drug conjugates (ADCs) stand out by combining cytotoxic or bioactive agents with monoclonal antibodies (mAbs) for targeted therapies. However, bioconjugation methods can produce different outcomes, including no bioconjugation, depending on the mAb, the amino acid residues, and the linker-payload (LP) system used. In this work, we developed a machine learning (ML) algorithm capable of predicting bioconjugation outcomes, allowing the design of the best mAb, LP systems, and conditions for the development of efficient ADCs. In particular, we exploited the potential of the XGBoost algorithm in predicting the drug-to-antibody ratio (DAR) in the synthesis of ADCs. Our model demonstrated high predictive accuracy, with R2 scores of 0.85 and 0.95 for lysine and cysteine data sets, respectively. The integration of ML algorithms into bioconjugation processes for ADC synthesis offers a promising approach to streamlining ADC development.
{"title":"Machine Learning for Predicting the Drug-to-Antibody Ratio (DAR) in the Synthesis of Antibody-Drug Conjugates (ADCs).","authors":"Lorenzo Angiolini,Fabrizio Manetti,Ottavia Spiga,Andrea Tafi,Anna Visibelli,Elena Petricci","doi":"10.1021/acs.jcim.5c00037","DOIUrl":"https://doi.org/10.1021/acs.jcim.5c00037","url":null,"abstract":"The pharmaceutical industry faces challenges in developing efficient and cost-effective drug delivery systems. Among various applications, antibody-drug conjugates (ADCs) stand out by combining cytotoxic or bioactive agents with monoclonal antibodies (mAbs) for targeted therapies. However, bioconjugation methods can produce different outcomes, including no bioconjugation, depending on the mAb, the amino acid residues, and the linker-payload (LP) system used. In this work, we developed a machine learning (ML) algorithm capable of predicting bioconjugation outcomes, allowing the design of the best mAb, LP systems, and conditions for the development of efficient ADCs. In particular, we exploited the potential of the XGBoost algorithm in predicting the drug-to-antibody ratio (DAR) in the synthesis of ADCs. Our model demonstrated high predictive accuracy, with R2 scores of 0.85 and 0.95 for lysine and cysteine data sets, respectively. The integration of ML algorithms into bioconjugation processes for ADC synthesis offers a promising approach to streamlining ADC development.","PeriodicalId":44,"journal":{"name":"Journal of Chemical Information and Modeling ","volume":"5 1","pages":""},"PeriodicalIF":5.6,"publicationDate":"2025-04-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143822845","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Influence of Water/Ethanol Mixing Ratio on Gemcitabine Binding to Cucurbit-7-uril Based on Molecular Dynamics Simulations and Three-Dimensional Reference Interaction Site Model. 基于分子动力学模拟和三维参考相互作用位点模型的水/乙醇混合比对吉西他滨与葫芦-7-脲结合的影响
IF 5.6 2区 化学 Q1 CHEMISTRY, MEDICINAL Pub Date : 2025-04-10 DOI: 10.1021/acs.jcim.5c00225
Natthiti Chiangraeng,Norio Yoshida,Haruyuki Nakano,Apinpus Rujiwatra,Piyarat Nimmanpipug
Accurate binding affinities of the cucurbit-7-uril-gemcitabine (CB7-GEM) complex in pure water and ethanol/water-mixed solvents were obtained by combining molecular dynamics simulations and three-dimensional reference interaction site model (3D-RISM) calculations. Point charges of CB7 and GEM molecules, depending on solvent mixture ratios, were determined using 3D-RISM self-consistent field (3D-RISM-SCF) calculations. The calculated binding affinities reveal that the most preferable CB7-GEM complex forms in the pure water system. The complexes in the mixed solvents show lower stability at higher ethanol ratios. Stable conformations at different solvent concentrations appear to be a key factor in the obtained trend of binding affinity enhancement. Conformations in the high-water fractions, associated with higher complex stability, exhibit lower internal energies than those in high-methanol fractions. Disruption of hydrogen-bonding formation also plays a crucial role in the solvation free energies. An explicit solvent model is crucial for accurate calculations of CB7-GEM complexes in these binary mixtures, providing results comparable to the experiments.
{"title":"Influence of Water/Ethanol Mixing Ratio on Gemcitabine Binding to Cucurbit-7-uril Based on Molecular Dynamics Simulations and Three-Dimensional Reference Interaction Site Model.","authors":"Natthiti Chiangraeng,Norio Yoshida,Haruyuki Nakano,Apinpus Rujiwatra,Piyarat Nimmanpipug","doi":"10.1021/acs.jcim.5c00225","DOIUrl":"https://doi.org/10.1021/acs.jcim.5c00225","url":null,"abstract":"Accurate binding affinities of the cucurbit-7-uril-gemcitabine (CB7-GEM) complex in pure water and ethanol/water-mixed solvents were obtained by combining molecular dynamics simulations and three-dimensional reference interaction site model (3D-RISM) calculations. Point charges of CB7 and GEM molecules, depending on solvent mixture ratios, were determined using 3D-RISM self-consistent field (3D-RISM-SCF) calculations. The calculated binding affinities reveal that the most preferable CB7-GEM complex forms in the pure water system. The complexes in the mixed solvents show lower stability at higher ethanol ratios. Stable conformations at different solvent concentrations appear to be a key factor in the obtained trend of binding affinity enhancement. Conformations in the high-water fractions, associated with higher complex stability, exhibit lower internal energies than those in high-methanol fractions. Disruption of hydrogen-bonding formation also plays a crucial role in the solvation free energies. An explicit solvent model is crucial for accurate calculations of CB7-GEM complexes in these binary mixtures, providing results comparable to the experiments.","PeriodicalId":44,"journal":{"name":"Journal of Chemical Information and Modeling ","volume":"6 1","pages":""},"PeriodicalIF":5.6,"publicationDate":"2025-04-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143819349","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Heat Capacity of Ionic Liquids: Toward Interpretable Chemical Structure-Based Machine Learning Approaches.
IF 5.6 2区 化学 Q1 CHEMISTRY, MEDICINAL Pub Date : 2025-04-10 DOI: 10.1021/acs.jcim.5c00238
Ali Esmaeili,Hesamedin Hekmatmehr,Mohammad Moheisen,Saeid Atashrouz,Ali Abedi,Ahmad Mohaddespour
This study focuses on predicting the heat capacity of pure liquid-phase ionic liquids (ILs) using machine learning models from various categories, including support vector machines, instance-based learning, ensemble learning, and neural networks, with linear regression serving as a baseline. A key aim of this work is not only to achieve accurate predictions but also to ensure the interpretability of the results, addressing a gap often overlooked in predictive modeling studies. To accomplish this, we curated and cleaned a comprehensive data set of 13,893 data points covering 322 ILs, using temperature and chemical structure-based features as inputs. We evaluated model performance and conducted a thorough interpretability analysis to reveal the patterns of the top-performing model's predictions, ensuring that they are understandable. All models outperformed the baseline, with XGBoost (eXtreme Gradient Boosting) from the ensemble learning category achieving the best results, with total RMSE, R2, and AARD (%) values of 11.389, 0.997, and 1.212%, respectively. Shallow neural networks also performed competitively, suggesting that complex deep learning architectures may not be necessary. Both 10-fold and leave-one-IL-out (LOILO) cross-validation further validated the robustness of these results. Importantly, the interpretability analysis identified key factors influencing heat capacity predictions, such as anion size (e.g., NTf2 and FAP) and alkyl chain length. These factors were validated by testing the model on previously unseen IL examples. Additionally, a user-friendly web application was developed to make predictions, allowing users to input chemical groups or select compounds from a predefined list of 1633 ILs. This study underscores the importance of combining diverse modeling approaches with robust interpretability techniques to achieve reliable and explainable predictions for IL heat capacity.
{"title":"Heat Capacity of Ionic Liquids: Toward Interpretable Chemical Structure-Based Machine Learning Approaches.","authors":"Ali Esmaeili,Hesamedin Hekmatmehr,Mohammad Moheisen,Saeid Atashrouz,Ali Abedi,Ahmad Mohaddespour","doi":"10.1021/acs.jcim.5c00238","DOIUrl":"https://doi.org/10.1021/acs.jcim.5c00238","url":null,"abstract":"This study focuses on predicting the heat capacity of pure liquid-phase ionic liquids (ILs) using machine learning models from various categories, including support vector machines, instance-based learning, ensemble learning, and neural networks, with linear regression serving as a baseline. A key aim of this work is not only to achieve accurate predictions but also to ensure the interpretability of the results, addressing a gap often overlooked in predictive modeling studies. To accomplish this, we curated and cleaned a comprehensive data set of 13,893 data points covering 322 ILs, using temperature and chemical structure-based features as inputs. We evaluated model performance and conducted a thorough interpretability analysis to reveal the patterns of the top-performing model's predictions, ensuring that they are understandable. All models outperformed the baseline, with XGBoost (eXtreme Gradient Boosting) from the ensemble learning category achieving the best results, with total RMSE, R2, and AARD (%) values of 11.389, 0.997, and 1.212%, respectively. Shallow neural networks also performed competitively, suggesting that complex deep learning architectures may not be necessary. Both 10-fold and leave-one-IL-out (LOILO) cross-validation further validated the robustness of these results. Importantly, the interpretability analysis identified key factors influencing heat capacity predictions, such as anion size (e.g., NTf2 and FAP) and alkyl chain length. These factors were validated by testing the model on previously unseen IL examples. Additionally, a user-friendly web application was developed to make predictions, allowing users to input chemical groups or select compounds from a predefined list of 1633 ILs. This study underscores the importance of combining diverse modeling approaches with robust interpretability techniques to achieve reliable and explainable predictions for IL heat capacity.","PeriodicalId":44,"journal":{"name":"Journal of Chemical Information and Modeling ","volume":"23 1","pages":""},"PeriodicalIF":5.6,"publicationDate":"2025-04-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143819350","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Comparing Molecules Generated by MMPDB and REINVENT4 with Ideas from Drug Discovery Design Teams.
IF 5.6 2区 化学 Q1 CHEMISTRY, MEDICINAL Pub Date : 2025-04-10 DOI: 10.1021/acs.jcim.5c00250
Shan Sun,David J Huggins
This study compares molecules designed by drug discovery project teams from the Sanders Tri-Institutional Therapeutics Discovery Institute with molecules generated by two computational tools: MMPDB and REINVENT4. Seven different test cases with diverse chemotypes are studied in order to explore the potential of these computational tools in complementing human expertise in the early stages of drug discovery. By comparing the molecular structures and properties generated by MMPDB and REINVENT4 to those designed by project design teams, we aim to assess the value of such tools. The results indicate that MMPDB and REINVENT4 cover regions of chemical space larger than those covered by ideas from the drug discovery project teams. However, the chemical spaces covered by the two methods are quite different, and neither method completely covers the chemical space identified by the drug discovery project teams. Thus, the computational methods are complementary to one another and to drug discovery project team ideation. Effective application of generative molecule design tools has the potential to accelerate the identification of novel therapeutic candidates by expanding the chemical space explored during drug discovery and enabling optimal exploration.
{"title":"Comparing Molecules Generated by MMPDB and REINVENT4 with Ideas from Drug Discovery Design Teams.","authors":"Shan Sun,David J Huggins","doi":"10.1021/acs.jcim.5c00250","DOIUrl":"https://doi.org/10.1021/acs.jcim.5c00250","url":null,"abstract":"This study compares molecules designed by drug discovery project teams from the Sanders Tri-Institutional Therapeutics Discovery Institute with molecules generated by two computational tools: MMPDB and REINVENT4. Seven different test cases with diverse chemotypes are studied in order to explore the potential of these computational tools in complementing human expertise in the early stages of drug discovery. By comparing the molecular structures and properties generated by MMPDB and REINVENT4 to those designed by project design teams, we aim to assess the value of such tools. The results indicate that MMPDB and REINVENT4 cover regions of chemical space larger than those covered by ideas from the drug discovery project teams. However, the chemical spaces covered by the two methods are quite different, and neither method completely covers the chemical space identified by the drug discovery project teams. Thus, the computational methods are complementary to one another and to drug discovery project team ideation. Effective application of generative molecule design tools has the potential to accelerate the identification of novel therapeutic candidates by expanding the chemical space explored during drug discovery and enabling optimal exploration.","PeriodicalId":44,"journal":{"name":"Journal of Chemical Information and Modeling ","volume":"22 1","pages":""},"PeriodicalIF":5.6,"publicationDate":"2025-04-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143819351","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
PoseidonQ: A Free Machine Learning Platform for the Development, Analysis, and Validation of Efficient and Portable QSAR Models for Drug Discovery.
IF 5.6 2区 化学 Q1 CHEMISTRY, MEDICINAL Pub Date : 2025-04-09 DOI: 10.1021/acs.jcim.4c02372
Muzammil Kabier, Nicola Gambacorta, Fulvio Ciriaco, Fabrizio Mastrolorito, Sunil Kumar, Bijo Mathew, Orazio Nicolotti

The advent of powerful machine learning algorithms as well as the availability of high volume of pharmacological data has given new fuel to QSAR, opening new unprecedented options for deriving highly predictive models for assisting the rationale design of new bioactive compounds, for screening and prioritizing large molecular libraries, and for repurposing new drugs toward new clinical uses. Here, we present PoseidonQ (an acronym for Personal Optimization Software for Efficient Implementation and Derivation of Online QSAR), a user-friendly software solution designed to simplify the derivation of the QSAR model for drug design and discovery. PoseidonQ incorporates 22 machine learning algorithms, 17 types of molecular fingerprints, and 208 RDKit molecular descriptors and enables the quick derivation of both regression and classification models along with a calculated and easily interpretable applicability domain. Importantly, the platform is automatically linked to the latest version of the ChEMBL database, thus providing streamlined access to large amounts of curated bioactivity data. Importantly, the user is also given the option of gathering high-quality experimental data based on customizable filtering settings. Noteworthy, PoseidonQ facilitates the deployment of trained QSAR models as web-based applications through seamless integration with Streamlit Cloud and GitHub, empowering users to share, refine, and integrate models effortlessly. Interestingly, the translation of QSAR models into web-based applications makes them free accessible, portable, and ready for screening large volumes of new data without limits. By unifying data preparation, model generation, and deployment into an intuitive workflow, PoseidonQ makes advanced QSAR modeling for drug design and discovery accessible to a wide audience of researchers irrespective of their skill levels. PoseidonQ bridges the gap between complex machine learning techniques and practical drug discovery applications, enhancing the efficiency, collaboration, and adoption of QSAR approaches in modern drug discovery programs. PoseidonQ is available for Windows and Linux (ubuntu 22.04 distro) operating systems and can be downloaded for free at https://github.com/Muzatheking12/PoseidonQ.

{"title":"PoseidonQ: A Free Machine Learning Platform for the Development, Analysis, and Validation of Efficient and Portable QSAR Models for Drug Discovery.","authors":"Muzammil Kabier, Nicola Gambacorta, Fulvio Ciriaco, Fabrizio Mastrolorito, Sunil Kumar, Bijo Mathew, Orazio Nicolotti","doi":"10.1021/acs.jcim.4c02372","DOIUrl":"https://doi.org/10.1021/acs.jcim.4c02372","url":null,"abstract":"<p><p>The advent of powerful machine learning algorithms as well as the availability of high volume of pharmacological data has given new fuel to QSAR, opening new unprecedented options for deriving highly predictive models for assisting the rationale design of new bioactive compounds, for screening and prioritizing large molecular libraries, and for repurposing new drugs toward new clinical uses. Here, we present PoseidonQ (an acronym for Personal Optimization Software for Efficient Implementation and Derivation of Online QSAR), a user-friendly software solution designed to simplify the derivation of the QSAR model for drug design and discovery. PoseidonQ incorporates 22 machine learning algorithms, 17 types of molecular fingerprints, and 208 RDKit molecular descriptors and enables the quick derivation of both regression and classification models along with a calculated and easily interpretable applicability domain. Importantly, the platform is automatically linked to the latest version of the ChEMBL database, thus providing streamlined access to large amounts of curated bioactivity data. Importantly, the user is also given the option of gathering high-quality experimental data based on customizable filtering settings. Noteworthy, PoseidonQ facilitates the deployment of trained QSAR models as web-based applications through seamless integration with Streamlit Cloud and GitHub, empowering users to share, refine, and integrate models effortlessly. Interestingly, the translation of QSAR models into web-based applications makes them free accessible, portable, and ready for screening large volumes of new data without limits. By unifying data preparation, model generation, and deployment into an intuitive workflow, PoseidonQ makes advanced QSAR modeling for drug design and discovery accessible to a wide audience of researchers irrespective of their skill levels. PoseidonQ bridges the gap between complex machine learning techniques and practical drug discovery applications, enhancing the efficiency, collaboration, and adoption of QSAR approaches in modern drug discovery programs. PoseidonQ is available for Windows and Linux (ubuntu 22.04 distro) operating systems and can be downloaded for free at https://github.com/Muzatheking12/PoseidonQ.</p>","PeriodicalId":44,"journal":{"name":"Journal of Chemical Information and Modeling ","volume":" ","pages":""},"PeriodicalIF":5.6,"publicationDate":"2025-04-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143809990","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
af3cli: Streamlining AlphaFold3 Input Preparation.
IF 5.6 2区 化学 Q1 CHEMISTRY, MEDICINAL Pub Date : 2025-04-09 DOI: 10.1021/acs.jcim.5c00276
Philipp Döpner, Stefan Kemnitz, Mark Doerr, Lukas Schulig

With the release of AlphaFold3, modeling capabilities have expanded beyond protein structure prediction to embrace the inherent complexity of biomolecular systems, including nucleic acids, ions, small molecules, and their interactions. The increased complexity of these assemblies is reflected in the input file generation process, presenting a significant hurdle for researchers without advanced computational expertise. While AlphaFold Server comes with a user-friendly graphical user interface, it supports only a subset of the features of AlphaFold3. To address this, we present af3cli, an open-source tool designed to facilitate the generation of AlphaFold3 input files, specifically tailored to the standalone version of AlphaFold3 and its unrestricted functionality. Featuring a user-friendly command-line interface and an accompanying Python library, af3cli simplifies the input generation process while maintaining flexibility and customization, which makes af3cli especially useful for fast (automated) generation of a large number of input files since it enables direct incorporation of FASTA files, keeps track of IDs, and validates the JSON file. Through practical examples, we demonstrate its capabilities for constructing input data for diverse biological structures, ranging from simple proteins to complex systems, and demonstrate its seamless integration into both manual and automated workflows.

{"title":"af3cli: Streamlining AlphaFold3 Input Preparation.","authors":"Philipp Döpner, Stefan Kemnitz, Mark Doerr, Lukas Schulig","doi":"10.1021/acs.jcim.5c00276","DOIUrl":"https://doi.org/10.1021/acs.jcim.5c00276","url":null,"abstract":"<p><p>With the release of AlphaFold3, modeling capabilities have expanded beyond protein structure prediction to embrace the inherent complexity of biomolecular systems, including nucleic acids, ions, small molecules, and their interactions. The increased complexity of these assemblies is reflected in the input file generation process, presenting a significant hurdle for researchers without advanced computational expertise. While AlphaFold Server comes with a user-friendly graphical user interface, it supports only a subset of the features of AlphaFold3. To address this, we present af3cli, an open-source tool designed to facilitate the generation of AlphaFold3 input files, specifically tailored to the standalone version of AlphaFold3 and its unrestricted functionality. Featuring a user-friendly command-line interface and an accompanying Python library, af3cli simplifies the input generation process while maintaining flexibility and customization, which makes af3cli especially useful for fast (automated) generation of a large number of input files since it enables direct incorporation of FASTA files, keeps track of IDs, and validates the JSON file. Through practical examples, we demonstrate its capabilities for constructing input data for diverse biological structures, ranging from simple proteins to complex systems, and demonstrate its seamless integration into both manual and automated workflows.</p>","PeriodicalId":44,"journal":{"name":"Journal of Chemical Information and Modeling ","volume":" ","pages":""},"PeriodicalIF":5.6,"publicationDate":"2025-04-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143809981","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Unified Deep Learning of Molecular and Protein Language Representations with T5ProtChem.
IF 5.6 2区 化学 Q1 CHEMISTRY, MEDICINAL Pub Date : 2025-04-08 DOI: 10.1021/acs.jcim.5c00051
Thomas Kelly, Song Xia, Jieyu Lu, Yingkai Zhang

Deep learning has revolutionized difficult tasks in chemistry and biology, yet existing language models often treat these domains separately, relying on concatenated architectures and independently pretrained weights. These approaches fail to fully exploit the shared atomic foundations of molecular and protein sequences. Here, we introduce T5ProtChem, a unified model based on the T5 architecture, designed to simultaneously process molecular and protein sequences. Using a new pretraining objective, ProtiSMILES, T5ProtChem bridges the molecular and protein domains, enabling efficient, generalizable protein-chemical modeling. The model achieves a state-of-the-art performance in tasks such as binding affinity prediction and reaction prediction, while having a strong performance in protein function prediction. Additionally, it supports novel applications, including covalent binder classification and sequence-level adduct prediction. These results demonstrate the versatility of unified language models for drug discovery, protein engineering, and other interdisciplinary efforts in computational biology and chemistry.

{"title":"Unified Deep Learning of Molecular and Protein Language Representations with T5ProtChem.","authors":"Thomas Kelly, Song Xia, Jieyu Lu, Yingkai Zhang","doi":"10.1021/acs.jcim.5c00051","DOIUrl":"https://doi.org/10.1021/acs.jcim.5c00051","url":null,"abstract":"<p><p>Deep learning has revolutionized difficult tasks in chemistry and biology, yet existing language models often treat these domains separately, relying on concatenated architectures and independently pretrained weights. These approaches fail to fully exploit the shared atomic foundations of molecular and protein sequences. Here, we introduce T5ProtChem, a unified model based on the T5 architecture, designed to simultaneously process molecular and protein sequences. Using a new pretraining objective, ProtiSMILES, T5ProtChem bridges the molecular and protein domains, enabling efficient, generalizable protein-chemical modeling. The model achieves a state-of-the-art performance in tasks such as binding affinity prediction and reaction prediction, while having a strong performance in protein function prediction. Additionally, it supports novel applications, including covalent binder classification and sequence-level adduct prediction. These results demonstrate the versatility of unified language models for drug discovery, protein engineering, and other interdisciplinary efforts in computational biology and chemistry.</p>","PeriodicalId":44,"journal":{"name":"Journal of Chemical Information and Modeling ","volume":" ","pages":""},"PeriodicalIF":5.6,"publicationDate":"2025-04-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143802015","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Activity Regulation and Conformation Response of Janus Kinase 3 Mediated by Phosphorylation: Exploration from Correlation Network Analysis and Markov Model.
IF 5.6 2区 化学 Q1 CHEMISTRY, MEDICINAL Pub Date : 2025-04-08 DOI: 10.1021/acs.jcim.5c00096
Jianzhong Chen, Jian Wang, Wanchun Yang, Lu Zhao, Jing Su

The activity of the enzyme JAK3 is modulated by tyrosine phosphorylation, yet the underlying molecular details remain not fully understood. In this study, we employed a GaMD trajectory-based Markov model and correlation network analysis (CNA) to investigate the impact of single phosphorylation (SP) at Y980 (pY980) and double phosphorylation (DP) at Y980/Y981 (pY980/pY981) on the conformational dynamics of JAK3 bound by inhibitors IZA and MI1. The Markov model analysis indicated that both SP and DP result in fewer conformational states and significantly influence the conformational dynamics of the P-loop, αC-helix, and loop1-loop3, while maintaining the hinge region's high rigidity. The CNA findings revealed that phosphorylation alters the communication network among different structural regions of JAK3, providing a rational explanation for how phosphorylation affects the conformational dynamics of the distant P-loop and loop1-loop3. Moreover, the conformational changes mediated by SP and DP further affect the interactions between the inhibitors and the hot spots (L828, V836, E903, Y904, L905, and L956) of JAK3. This work offers valuable theoretical insights into the molecular mechanisms that regulate JAK3 activity.

{"title":"Activity Regulation and Conformation Response of Janus Kinase 3 Mediated by Phosphorylation: Exploration from Correlation Network Analysis and Markov Model.","authors":"Jianzhong Chen, Jian Wang, Wanchun Yang, Lu Zhao, Jing Su","doi":"10.1021/acs.jcim.5c00096","DOIUrl":"https://doi.org/10.1021/acs.jcim.5c00096","url":null,"abstract":"<p><p>The activity of the enzyme JAK3 is modulated by tyrosine phosphorylation, yet the underlying molecular details remain not fully understood. In this study, we employed a GaMD trajectory-based Markov model and correlation network analysis (CNA) to investigate the impact of single phosphorylation (SP) at Y980 (pY980) and double phosphorylation (DP) at Y980/Y981 (pY980/pY981) on the conformational dynamics of JAK3 bound by inhibitors IZA and MI1. The Markov model analysis indicated that both SP and DP result in fewer conformational states and significantly influence the conformational dynamics of the P-loop, αC-helix, and loop1-loop3, while maintaining the hinge region's high rigidity. The CNA findings revealed that phosphorylation alters the communication network among different structural regions of JAK3, providing a rational explanation for how phosphorylation affects the conformational dynamics of the distant P-loop and loop1-loop3. Moreover, the conformational changes mediated by SP and DP further affect the interactions between the inhibitors and the hot spots (L828, V836, E903, Y904, L905, and L956) of JAK3. This work offers valuable theoretical insights into the molecular mechanisms that regulate JAK3 activity.</p>","PeriodicalId":44,"journal":{"name":"Journal of Chemical Information and Modeling ","volume":" ","pages":""},"PeriodicalIF":5.6,"publicationDate":"2025-04-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143809979","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Journal of Chemical Information and Modeling
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1