首页 > 最新文献

Journal of Chemical Theory and Computation最新文献

英文 中文
Role of Noise-Modulated Self-Propulsion in Driving Spatiotemporal Orders in Active Systems.
IF 5.5 1区 化学 Q2 CHEMISTRY, PHYSICAL Pub Date : 2025-04-17 DOI: 10.1021/acs.jctc.5c00093
Kaustav Mondal,Tarpan Maiti,Pushpita Ghosh
Fluctuations play a pivotal role in driving spatiotemporal order in active matter systems. In this study, we employ a novel analytical framework to investigate the impact of dichotomous noise on the self-propelling velocity of active particle systems such as polymerizing actin filaments or reproducing elongated bacteria. By incorporating dichotomous fluctuations with Ornstein-Zernike correlations into a continuum-based model, we derive a bifurcation condition in the noise parameter space, revealing a noise-induced instability that drives the emergence of traveling waves. This approach demonstrates how specific noise strengths and correlation times expand the instability region by introducing effective new degrees of freedom that alter the system's stability matrix. Advance numerical simulations, meticulously designed to handle the properties of dichotomous noise, validate these theoretical predictions and reveal excellent agreement. A key finding is the observation of wave-reversal behavior, driven by the sign alternation of the noise-modulated advection term and modulated by the relaxation time. Remarkably, we identify a finite parameter range where this reversal is suppressed, offering new insights into noise-induced bifurcations and spatiotemporal dynamics. Our combined analytical and numerical approach provides a deeper understanding of the role of noise in shaping self-organization and pattern formation in biological and synthetic active systems.
{"title":"Role of Noise-Modulated Self-Propulsion in Driving Spatiotemporal Orders in Active Systems.","authors":"Kaustav Mondal,Tarpan Maiti,Pushpita Ghosh","doi":"10.1021/acs.jctc.5c00093","DOIUrl":"https://doi.org/10.1021/acs.jctc.5c00093","url":null,"abstract":"Fluctuations play a pivotal role in driving spatiotemporal order in active matter systems. In this study, we employ a novel analytical framework to investigate the impact of dichotomous noise on the self-propelling velocity of active particle systems such as polymerizing actin filaments or reproducing elongated bacteria. By incorporating dichotomous fluctuations with Ornstein-Zernike correlations into a continuum-based model, we derive a bifurcation condition in the noise parameter space, revealing a noise-induced instability that drives the emergence of traveling waves. This approach demonstrates how specific noise strengths and correlation times expand the instability region by introducing effective new degrees of freedom that alter the system's stability matrix. Advance numerical simulations, meticulously designed to handle the properties of dichotomous noise, validate these theoretical predictions and reveal excellent agreement. A key finding is the observation of wave-reversal behavior, driven by the sign alternation of the noise-modulated advection term and modulated by the relaxation time. Remarkably, we identify a finite parameter range where this reversal is suppressed, offering new insights into noise-induced bifurcations and spatiotemporal dynamics. Our combined analytical and numerical approach provides a deeper understanding of the role of noise in shaping self-organization and pattern formation in biological and synthetic active systems.","PeriodicalId":45,"journal":{"name":"Journal of Chemical Theory and Computation","volume":"75 1","pages":""},"PeriodicalIF":5.5,"publicationDate":"2025-04-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143846329","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Hybrid Bottom-Up and Data-Driven Machine Learning Approach for Accurate Coarse-Graining of Large Molecular Complexes.
IF 5.5 1区 化学 Q2 CHEMISTRY, PHYSICAL Pub Date : 2025-04-16 DOI: 10.1021/acs.jctc.5c00063
Korbinian Liebl,Gregory A Voth
Bottom-up coarse-graining refers to the development of low-resolution simulation models that are thermodynamically consistent with certain distributions from fully atomistic simulations. Force-matching and relative entropy minimization represent two major, frequently applied methods that allow to develop such bottom-up models. Nevertheless, atomistic simulations can often provide only limited sampling of the phase space. For bottom-up coarse-graining, these limitations may result in overfitting of the atomistic reference data, especially for large molecular complexes, where the learning may be agnostic of the actual affinities between binding partners. As a solution to this problem, we devise a data-driven machine learning hybrid coarse-graining concept that represents a regularized version of the relative entropy minimization approach. We demonstrate that this new approach allows one to develop coarse-grained models for molecular complexes that reproduce the targeted binding affinity but also describe the underlying complex structure accurately. The trained models therefore show diverse behavior as they can undergo frequent unbinding and binding events and are also transferable for simulating entire protein lattices, e.g., for a virus capsid.
{"title":"A Hybrid Bottom-Up and Data-Driven Machine Learning Approach for Accurate Coarse-Graining of Large Molecular Complexes.","authors":"Korbinian Liebl,Gregory A Voth","doi":"10.1021/acs.jctc.5c00063","DOIUrl":"https://doi.org/10.1021/acs.jctc.5c00063","url":null,"abstract":"Bottom-up coarse-graining refers to the development of low-resolution simulation models that are thermodynamically consistent with certain distributions from fully atomistic simulations. Force-matching and relative entropy minimization represent two major, frequently applied methods that allow to develop such bottom-up models. Nevertheless, atomistic simulations can often provide only limited sampling of the phase space. For bottom-up coarse-graining, these limitations may result in overfitting of the atomistic reference data, especially for large molecular complexes, where the learning may be agnostic of the actual affinities between binding partners. As a solution to this problem, we devise a data-driven machine learning hybrid coarse-graining concept that represents a regularized version of the relative entropy minimization approach. We demonstrate that this new approach allows one to develop coarse-grained models for molecular complexes that reproduce the targeted binding affinity but also describe the underlying complex structure accurately. The trained models therefore show diverse behavior as they can undergo frequent unbinding and binding events and are also transferable for simulating entire protein lattices, e.g., for a virus capsid.","PeriodicalId":45,"journal":{"name":"Journal of Chemical Theory and Computation","volume":"29 1","pages":""},"PeriodicalIF":5.5,"publicationDate":"2025-04-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143846358","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Improving the Hit Rates of Virtual Screening by Active Learning from Bioactivity Feedback.
IF 5.5 1区 化学 Q2 CHEMISTRY, PHYSICAL Pub Date : 2025-04-16 DOI: 10.1021/acs.jctc.4c01618
Xun Deng,Junlong Liu,Zhike Liu,Jiansheng Wu,Fuli Feng,Jieping Ye,Zheng Wang
Virtual screening has been widely used to identify potential hit candidates that can bind to the target protein in drug discovery. Contemporary screening methods typically rely on oversimplified scoring functions, frequently yielding one-digit hit rates (or even zero) among top-ranked candidates. The substantial cost of laboratory validation further constrains the exploration of candidate molecules. We find that test-time prediction refinement is almost blank in this area, which means bioactivity feedback in the wet-lab experiments is neglected. Here, we introduce an Active Learning from Bioactivity Feedback (ALBF) framework to enhance the weak hit rate of current virtual screening methods. ALBF spends the budget of wet-lab experiments iteratively and leverages the target-specific bioactivity insights from current wet-lab tests to refine the score results (i.e., rankings). Our framework consists of two components: a novel query strategy that considers the evaluation quality and its overall influence on other top-scored molecules; and an efficient score optimization strategy that propagates the bioactivity feedback to structurally similar molecules. We evaluated ALBF on diverse subsets of the well-known DUD-E and LIT-PCBA benchmarks. Our active learning protocol averagely enhances top-100 hit rates by 60% and 30% on DUD-E and LIT-PCBA with 50 to 200 bioactivity queries on the selected molecules that are deployed in ten rounds. The consistently superior performance demonstrates ALBF's potential to enhance both the accuracy and cost-effectiveness of active learning-based laboratory testing.
{"title":"Improving the Hit Rates of Virtual Screening by Active Learning from Bioactivity Feedback.","authors":"Xun Deng,Junlong Liu,Zhike Liu,Jiansheng Wu,Fuli Feng,Jieping Ye,Zheng Wang","doi":"10.1021/acs.jctc.4c01618","DOIUrl":"https://doi.org/10.1021/acs.jctc.4c01618","url":null,"abstract":"Virtual screening has been widely used to identify potential hit candidates that can bind to the target protein in drug discovery. Contemporary screening methods typically rely on oversimplified scoring functions, frequently yielding one-digit hit rates (or even zero) among top-ranked candidates. The substantial cost of laboratory validation further constrains the exploration of candidate molecules. We find that test-time prediction refinement is almost blank in this area, which means bioactivity feedback in the wet-lab experiments is neglected. Here, we introduce an Active Learning from Bioactivity Feedback (ALBF) framework to enhance the weak hit rate of current virtual screening methods. ALBF spends the budget of wet-lab experiments iteratively and leverages the target-specific bioactivity insights from current wet-lab tests to refine the score results (i.e., rankings). Our framework consists of two components: a novel query strategy that considers the evaluation quality and its overall influence on other top-scored molecules; and an efficient score optimization strategy that propagates the bioactivity feedback to structurally similar molecules. We evaluated ALBF on diverse subsets of the well-known DUD-E and LIT-PCBA benchmarks. Our active learning protocol averagely enhances top-100 hit rates by 60% and 30% on DUD-E and LIT-PCBA with 50 to 200 bioactivity queries on the selected molecules that are deployed in ten rounds. The consistently superior performance demonstrates ALBF's potential to enhance both the accuracy and cost-effectiveness of active learning-based laboratory testing.","PeriodicalId":45,"journal":{"name":"Journal of Chemical Theory and Computation","volume":"26 1","pages":""},"PeriodicalIF":5.5,"publicationDate":"2025-04-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143846328","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Small Molecules Targeting the Structural Dynamics of AR-V7 Partially Disordered Proteins Using Deep Ensemble Docking.
IF 5.5 1区 化学 Q2 CHEMISTRY, PHYSICAL Pub Date : 2025-04-15 DOI: 10.1021/acs.jctc.5c00171
Pantelis Karatzas,Z Faidon Brotzakis,Haralambos Sarimveis
The extensive conformational dynamics of partially disordered proteins hinders the efficiency of traditional in-silico structure-based drug discovery approaches due to the challenge of screening large chemical spaces of compounds, albeit with an excessive number of transient binding sites, quickly making this problem intractable. In this study, using the monomer of the AR-V7 transcription factor splicing variant related to prostate cancer as a test case, we present a deep ensemble docking pipeline that accelerates the screening of small molecule binders targeting partially disordered proteins at functional regions. By swiftly identifying the conformational ensemble of AR-V7 and reducing the dimension of binding sites by a factor of 90, we identify functionally relevant binding sites along the AR-V7 structural ensemble at phase separation-prone regions that have been experimentally shown to contribute to enhanced transcription activity and the onset of tumor growth. Following this, we combine physics-based molecular docking and multiobjective classification machine learning models to speed up the screening for binders in a larger chemical space able to target these functional multiple binding sites of AR-V7. This step increases the multibinding site hit rate of small molecules by a factor of 17 compared to naive molecular docking. Finally, assessing in atomistic molecular dynamics the effect of a selected binder on AR-V7 dynamics, we find that in the presence of the ChEMBL22003 compound, AR-V7 exhibits less conformational entropy, smaller solvent exposure of phase separation-prone regions, and higher solvent exposure of other protein regions, promoting this compound as a potential AR-V7 phase separation modulator.
{"title":"Small Molecules Targeting the Structural Dynamics of AR-V7 Partially Disordered Proteins Using Deep Ensemble Docking.","authors":"Pantelis Karatzas,Z Faidon Brotzakis,Haralambos Sarimveis","doi":"10.1021/acs.jctc.5c00171","DOIUrl":"https://doi.org/10.1021/acs.jctc.5c00171","url":null,"abstract":"The extensive conformational dynamics of partially disordered proteins hinders the efficiency of traditional in-silico structure-based drug discovery approaches due to the challenge of screening large chemical spaces of compounds, albeit with an excessive number of transient binding sites, quickly making this problem intractable. In this study, using the monomer of the AR-V7 transcription factor splicing variant related to prostate cancer as a test case, we present a deep ensemble docking pipeline that accelerates the screening of small molecule binders targeting partially disordered proteins at functional regions. By swiftly identifying the conformational ensemble of AR-V7 and reducing the dimension of binding sites by a factor of 90, we identify functionally relevant binding sites along the AR-V7 structural ensemble at phase separation-prone regions that have been experimentally shown to contribute to enhanced transcription activity and the onset of tumor growth. Following this, we combine physics-based molecular docking and multiobjective classification machine learning models to speed up the screening for binders in a larger chemical space able to target these functional multiple binding sites of AR-V7. This step increases the multibinding site hit rate of small molecules by a factor of 17 compared to naive molecular docking. Finally, assessing in atomistic molecular dynamics the effect of a selected binder on AR-V7 dynamics, we find that in the presence of the ChEMBL22003 compound, AR-V7 exhibits less conformational entropy, smaller solvent exposure of phase separation-prone regions, and higher solvent exposure of other protein regions, promoting this compound as a potential AR-V7 phase separation modulator.","PeriodicalId":45,"journal":{"name":"Journal of Chemical Theory and Computation","volume":"5 1","pages":""},"PeriodicalIF":5.5,"publicationDate":"2025-04-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143836613","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Calculating Bond Capacities by Linear Response Methods. 用线性响应法计算债券容量。
IF 5.5 1区 化学 Q2 CHEMISTRY, PHYSICAL Pub Date : 2025-04-15 DOI: 10.1021/acs.jctc.5c00233
Jonas E S Mikkelsen,Frank Jensen
Bond capacities can be considered as atom-atom condensed versions of the density response function. They quantify the ease with which the electron density can be transferred between atoms due to differences in potential and are thus central quantities for modeling charge flow in force fields. We describe an implementation of calculating bond capacities by linear response methods with the minimal basis iterative stockholder definition of atoms in molecules. The calculated bond capacities are moderately sensitive to the level of theory at Hartree-Fock, density functional theory, and multiconfigurational self-consistent field and are insensitive to basis set quality beyond a polarized double-ζ quality. The dependence of bond capacities on chemical structure displays a high degree of transferability and conforms to the concept of functional groups. Bond capacities connect all atom pairs in a molecule; however, the magnitude rapidly diminishes as a function of the number of connecting bonds for the nonconjugated system, while a less rapid decay and oscillating pattern is observed for conjugated systems.
{"title":"Calculating Bond Capacities by Linear Response Methods.","authors":"Jonas E S Mikkelsen,Frank Jensen","doi":"10.1021/acs.jctc.5c00233","DOIUrl":"https://doi.org/10.1021/acs.jctc.5c00233","url":null,"abstract":"Bond capacities can be considered as atom-atom condensed versions of the density response function. They quantify the ease with which the electron density can be transferred between atoms due to differences in potential and are thus central quantities for modeling charge flow in force fields. We describe an implementation of calculating bond capacities by linear response methods with the minimal basis iterative stockholder definition of atoms in molecules. The calculated bond capacities are moderately sensitive to the level of theory at Hartree-Fock, density functional theory, and multiconfigurational self-consistent field and are insensitive to basis set quality beyond a polarized double-ζ quality. The dependence of bond capacities on chemical structure displays a high degree of transferability and conforms to the concept of functional groups. Bond capacities connect all atom pairs in a molecule; however, the magnitude rapidly diminishes as a function of the number of connecting bonds for the nonconjugated system, while a less rapid decay and oscillating pattern is observed for conjugated systems.","PeriodicalId":45,"journal":{"name":"Journal of Chemical Theory and Computation","volume":"40 1","pages":""},"PeriodicalIF":5.5,"publicationDate":"2025-04-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143836611","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Learning Pairwise Interaction for Extrapolative and Interpretable Machine Learning Interatomic Potentials with Physics-Informed Neural Network.
IF 5.5 1区 化学 Q2 CHEMISTRY, PHYSICAL Pub Date : 2025-04-14 DOI: 10.1021/acs.jctc.5c00090
Hoje Chun,Minjoon Hong,Seung Hyo Noh,Byungchan Han
Achieving both robust extrapolation and physical interpretability in machine learning interatomic potentials (ML-IPs) for atomistic simulation remains a significant challenge, particularly in data-scarce areas such as chemical reactions or complex, multicomponent materials at extreme conditions. Here, we present a pairwise-decomposed physics-informed neural network (P2Net) that parametrizes an analytical bond-order potential (BOP) layer to decouple the energy contributions of atomic pairs. By leveraging fundamental physical principles, P2Net demonstrates excellence at extrapolating beyond its training regime and accurately capturing molecular geometries far from equilibrium. The pairwise energy decomposition further empowers the bond analyses for deprotonation and SN2 reactions, which is not easy with most ML-IPs. The atomic pair energy offers how to elucidate the evolution of interatomic interactions as a reaction proceeds. Our methodology highlights enhanced data efficiency in building ML-IPs and facilitates more informative postsimulation analysis, thereby broadening the applicability of ML-IPs to complex and reactive systems.
{"title":"Learning Pairwise Interaction for Extrapolative and Interpretable Machine Learning Interatomic Potentials with Physics-Informed Neural Network.","authors":"Hoje Chun,Minjoon Hong,Seung Hyo Noh,Byungchan Han","doi":"10.1021/acs.jctc.5c00090","DOIUrl":"https://doi.org/10.1021/acs.jctc.5c00090","url":null,"abstract":"Achieving both robust extrapolation and physical interpretability in machine learning interatomic potentials (ML-IPs) for atomistic simulation remains a significant challenge, particularly in data-scarce areas such as chemical reactions or complex, multicomponent materials at extreme conditions. Here, we present a pairwise-decomposed physics-informed neural network (P2Net) that parametrizes an analytical bond-order potential (BOP) layer to decouple the energy contributions of atomic pairs. By leveraging fundamental physical principles, P2Net demonstrates excellence at extrapolating beyond its training regime and accurately capturing molecular geometries far from equilibrium. The pairwise energy decomposition further empowers the bond analyses for deprotonation and SN2 reactions, which is not easy with most ML-IPs. The atomic pair energy offers how to elucidate the evolution of interatomic interactions as a reaction proceeds. Our methodology highlights enhanced data efficiency in building ML-IPs and facilitates more informative postsimulation analysis, thereby broadening the applicability of ML-IPs to complex and reactive systems.","PeriodicalId":45,"journal":{"name":"Journal of Chemical Theory and Computation","volume":"6 1","pages":""},"PeriodicalIF":5.5,"publicationDate":"2025-04-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143831544","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Assessing Nonbonded Aggregates Populations: Application to the Concentration-Dependent IR O–H Band of Phenol
IF 5.7 1区 化学 Q2 CHEMISTRY, PHYSICAL Pub Date : 2025-04-14 DOI: 10.1021/acs.jctc.5c0028110.1021/acs.jctc.5c00281
J. Pablo Gálvez, José Zúñiga and Javier Cerezo*, 

In this work, we present two alternative computational strategies to determine the populations of nonbonded aggregates. One approach extracts these populations from molecular dynamics (MD) simulations, while the other employs quantum mechanical partition functions for the most relevant minima of the multimolecular potential energy surfaces (PESs), identified by automated conformational sampling. In both cases, we adopt a common graph-theory-based framework, introduced in this work, for identifying aggregate conformations, which enables a consistent comparative assessment of both methodologies and provides insight into the underlying approximations. We apply both strategies to investigate phenol aggregates, up to the tetramer, at different concentrations in phenol/carbon tetrachloride mixtures. Subsequently, we simulate the concentration-dependent OH stretching IR region by averaging the harmonic Infrared (IR) spectra of aggregates using the populations predicted by each strategy. Our results indicate that the populations extracted from MD trajectories yield OH stretching signals that closely follow the experimental trends, outperforming the spectra from populations obtained by systematic conformational searches. Such a better performance of MD is attributed to a better description of the entropic contributions. Moreover, the proposed protocol not only successfully addresses a very challenging problem but also offers a benchmark to assess the accuracy of the intermolecular force fields.

{"title":"Assessing Nonbonded Aggregates Populations: Application to the Concentration-Dependent IR O–H Band of Phenol","authors":"J. Pablo Gálvez,&nbsp;José Zúñiga and Javier Cerezo*,&nbsp;","doi":"10.1021/acs.jctc.5c0028110.1021/acs.jctc.5c00281","DOIUrl":"https://doi.org/10.1021/acs.jctc.5c00281https://doi.org/10.1021/acs.jctc.5c00281","url":null,"abstract":"<p >In this work, we present two alternative computational strategies to determine the populations of nonbonded aggregates. One approach extracts these populations from molecular dynamics (MD) simulations, while the other employs quantum mechanical partition functions for the most relevant minima of the multimolecular potential energy surfaces (PESs), identified by automated conformational sampling. In both cases, we adopt a common graph-theory-based framework, introduced in this work, for identifying aggregate conformations, which enables a consistent comparative assessment of both methodologies and provides insight into the underlying approximations. We apply both strategies to investigate phenol aggregates, up to the tetramer, at different concentrations in phenol/carbon tetrachloride mixtures. Subsequently, we simulate the concentration-dependent OH stretching IR region by averaging the harmonic Infrared (IR) spectra of aggregates using the populations predicted by each strategy. Our results indicate that the populations extracted from MD trajectories yield OH stretching signals that closely follow the experimental trends, outperforming the spectra from populations obtained by systematic conformational searches. Such a better performance of MD is attributed to a better description of the entropic contributions. Moreover, the proposed protocol not only successfully addresses a very challenging problem but also offers a benchmark to assess the accuracy of the intermolecular force fields.</p>","PeriodicalId":45,"journal":{"name":"Journal of Chemical Theory and Computation","volume":"21 8","pages":"3888–3901 3888–3901"},"PeriodicalIF":5.7,"publicationDate":"2025-04-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143854169","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Calculating a Phase Diagram of a Simple Water Model Using Unsupervised Machine Learning on Simulation Data
IF 5.7 1区 化学 Q2 CHEMISTRY, PHYSICAL Pub Date : 2025-04-14 DOI: 10.1021/acs.jctc.4c0145610.1021/acs.jctc.4c01456
Peter Ogrin,  and , Tomaz Urbic*, 

We use unsupervised machine learning to construct a phase diagram of a simple 2D rose water model. The machine learning method that we use is a combination of dimensionality reduction methods and clustering algorithms. Two different data sets from the same simulations are used as input data for machine learning. These are angular distribution functions and a set of different thermodynamic, dynamic, and structural properties. To evaluate the efficiency of the method, the machine learning results are compared to manually determined phase diagrams. We show that the methods successfully predict the phase diagram of the rose water model. Furthermore, the phase diagrams obtained from the two data sets are in semiquantitative agreement with each other. Four different solid phases, one liquid phase, and one gaseous phase were determined. The method we have presented is straightforward and easy to implement. It requires almost no prior knowledge of the system to obtain a phase diagram. The method can also be used to distinguish between the different parts of the same phase that have different properties or a sufficiently different structure, and in this way find local differences and anomalies.

{"title":"Calculating a Phase Diagram of a Simple Water Model Using Unsupervised Machine Learning on Simulation Data","authors":"Peter Ogrin,&nbsp; and ,&nbsp;Tomaz Urbic*,&nbsp;","doi":"10.1021/acs.jctc.4c0145610.1021/acs.jctc.4c01456","DOIUrl":"https://doi.org/10.1021/acs.jctc.4c01456https://doi.org/10.1021/acs.jctc.4c01456","url":null,"abstract":"<p >We use unsupervised machine learning to construct a phase diagram of a simple 2D rose water model. The machine learning method that we use is a combination of dimensionality reduction methods and clustering algorithms. Two different data sets from the same simulations are used as input data for machine learning. These are angular distribution functions and a set of different thermodynamic, dynamic, and structural properties. To evaluate the efficiency of the method, the machine learning results are compared to manually determined phase diagrams. We show that the methods successfully predict the phase diagram of the rose water model. Furthermore, the phase diagrams obtained from the two data sets are in semiquantitative agreement with each other. Four different solid phases, one liquid phase, and one gaseous phase were determined. The method we have presented is straightforward and easy to implement. It requires almost no prior knowledge of the system to obtain a phase diagram. The method can also be used to distinguish between the different parts of the same phase that have different properties or a sufficiently different structure, and in this way find local differences and anomalies.</p>","PeriodicalId":45,"journal":{"name":"Journal of Chemical Theory and Computation","volume":"21 8","pages":"3867–3887 3867–3887"},"PeriodicalIF":5.7,"publicationDate":"2025-04-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://pubs.acs.org/doi/epdf/10.1021/acs.jctc.4c01456","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143854170","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Calculating a Phase Diagram of a Simple Water Model Using Unsupervised Machine Learning on Simulation Data.
IF 5.5 1区 化学 Q2 CHEMISTRY, PHYSICAL Pub Date : 2025-04-14 DOI: 10.1021/acs.jctc.4c01456
Peter Ogrin,Tomaz Urbic
We use unsupervised machine learning to construct a phase diagram of a simple 2D rose water model. The machine learning method that we use is a combination of dimensionality reduction methods and clustering algorithms. Two different data sets from the same simulations are used as input data for machine learning. These are angular distribution functions and a set of different thermodynamic, dynamic, and structural properties. To evaluate the efficiency of the method, the machine learning results are compared to manually determined phase diagrams. We show that the methods successfully predict the phase diagram of the rose water model. Furthermore, the phase diagrams obtained from the two data sets are in semiquantitative agreement with each other. Four different solid phases, one liquid phase, and one gaseous phase were determined. The method we have presented is straightforward and easy to implement. It requires almost no prior knowledge of the system to obtain a phase diagram. The method can also be used to distinguish between the different parts of the same phase that have different properties or a sufficiently different structure, and in this way find local differences and anomalies.
{"title":"Calculating a Phase Diagram of a Simple Water Model Using Unsupervised Machine Learning on Simulation Data.","authors":"Peter Ogrin,Tomaz Urbic","doi":"10.1021/acs.jctc.4c01456","DOIUrl":"https://doi.org/10.1021/acs.jctc.4c01456","url":null,"abstract":"We use unsupervised machine learning to construct a phase diagram of a simple 2D rose water model. The machine learning method that we use is a combination of dimensionality reduction methods and clustering algorithms. Two different data sets from the same simulations are used as input data for machine learning. These are angular distribution functions and a set of different thermodynamic, dynamic, and structural properties. To evaluate the efficiency of the method, the machine learning results are compared to manually determined phase diagrams. We show that the methods successfully predict the phase diagram of the rose water model. Furthermore, the phase diagrams obtained from the two data sets are in semiquantitative agreement with each other. Four different solid phases, one liquid phase, and one gaseous phase were determined. The method we have presented is straightforward and easy to implement. It requires almost no prior knowledge of the system to obtain a phase diagram. The method can also be used to distinguish between the different parts of the same phase that have different properties or a sufficiently different structure, and in this way find local differences and anomalies.","PeriodicalId":45,"journal":{"name":"Journal of Chemical Theory and Computation","volume":"218 1","pages":""},"PeriodicalIF":5.5,"publicationDate":"2025-04-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143831819","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Learning Pairwise Interaction for Extrapolative and Interpretable Machine Learning Interatomic Potentials with Physics-Informed Neural Network
IF 5.7 1区 化学 Q2 CHEMISTRY, PHYSICAL Pub Date : 2025-04-14 DOI: 10.1021/acs.jctc.5c0009010.1021/acs.jctc.5c00090
Hoje Chun, Minjoon Hong, Seung Hyo Noh and Byungchan Han*, 

Achieving both robust extrapolation and physical interpretability in machine learning interatomic potentials (ML-IPs) for atomistic simulation remains a significant challenge, particularly in data-scarce areas such as chemical reactions or complex, multicomponent materials at extreme conditions. Here, we present a pairwise-decomposed physics-informed neural network (P2Net) that parametrizes an analytical bond-order potential (BOP) layer to decouple the energy contributions of atomic pairs. By leveraging fundamental physical principles, P2Net demonstrates excellence at extrapolating beyond its training regime and accurately capturing molecular geometries far from equilibrium. The pairwise energy decomposition further empowers the bond analyses for deprotonation and SN2 reactions, which is not easy with most ML-IPs. The atomic pair energy offers how to elucidate the evolution of interatomic interactions as a reaction proceeds. Our methodology highlights enhanced data efficiency in building ML-IPs and facilitates more informative postsimulation analysis, thereby broadening the applicability of ML-IPs to complex and reactive systems.

{"title":"Learning Pairwise Interaction for Extrapolative and Interpretable Machine Learning Interatomic Potentials with Physics-Informed Neural Network","authors":"Hoje Chun,&nbsp;Minjoon Hong,&nbsp;Seung Hyo Noh and Byungchan Han*,&nbsp;","doi":"10.1021/acs.jctc.5c0009010.1021/acs.jctc.5c00090","DOIUrl":"https://doi.org/10.1021/acs.jctc.5c00090https://doi.org/10.1021/acs.jctc.5c00090","url":null,"abstract":"<p >Achieving both robust extrapolation and physical interpretability in machine learning interatomic potentials (ML-IPs) for atomistic simulation remains a significant challenge, particularly in data-scarce areas such as chemical reactions or complex, multicomponent materials at extreme conditions. Here, we present a pairwise-decomposed physics-informed neural network (P2Net) that parametrizes an analytical bond-order potential (BOP) layer to decouple the energy contributions of atomic pairs. By leveraging fundamental physical principles, P2Net demonstrates excellence at extrapolating beyond its training regime and accurately capturing molecular geometries far from equilibrium. The pairwise energy decomposition further empowers the bond analyses for deprotonation and S<sub>N</sub>2 reactions, which is not easy with most ML-IPs. The atomic pair energy offers how to elucidate the evolution of interatomic interactions as a reaction proceeds. Our methodology highlights enhanced data efficiency in building ML-IPs and facilitates more informative postsimulation analysis, thereby broadening the applicability of ML-IPs to complex and reactive systems.</p>","PeriodicalId":45,"journal":{"name":"Journal of Chemical Theory and Computation","volume":"21 8","pages":"4030–4039 4030–4039"},"PeriodicalIF":5.7,"publicationDate":"2025-04-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143854174","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Journal of Chemical Theory and Computation
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1