Firas Shuaib, Guido Ori, Philippe Thomas, Olivier Masson, Assil Bouzid
{"title":"基于多核相似性的非晶系统聚类和通过主动学习的机器学习原子间电位","authors":"Firas Shuaib, Guido Ori, Philippe Thomas, Olivier Masson, Assil Bouzid","doi":"10.1111/jace.20128","DOIUrl":null,"url":null,"abstract":"<p>We present a hybrid similarity kernel that exemplifies the integration of short- and long-range descriptors via the use of an average kernel approach. This technique allows for a direct measure of the similarity between amorphous configurations, and when combined with an active learning (AL) spectral clustering approach, it leads to a classification of the amorphous configurations into uncorrelated clusters. Subsequently, a minimum size database is built by considering a small fraction of configurations belonging to each cluster and a machine learning interatomic potential (MLIP), within the Gaussian approximation scheme, is fitted by relying on a Bayesian optimization of the potential hyperparameters. This step is embedded within an AL loop that allows to sequentially increase the size of the learning database whenever the MLIP fails to meet a predefined energy convergence threshold. As such, MLIP are fitted in an almost fully automatized fashion. This approach is tested on two diverse amorphous systems that were previously generated using first-principles molecular dynamics. Accurate potentials with less than 2 meV/atom root mean square energy error compared to the reference data are obtained. This accuracy is achieved with only 175 configurations sampling the studied systems at various temperatures. The robustness of these potentials is then confirmed by producing models with several thousands of atoms featuring a good agreement with reference ab initio and experimental data.</p>","PeriodicalId":200,"journal":{"name":"Journal of the American Ceramic Society","volume":"108 1","pages":""},"PeriodicalIF":3.5000,"publicationDate":"2024-09-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1111/jace.20128","citationCount":"0","resultStr":"{\"title\":\"Multikernel similarity-based clustering of amorphous systems and machine-learned interatomic potentials by active learning\",\"authors\":\"Firas Shuaib, Guido Ori, Philippe Thomas, Olivier Masson, Assil Bouzid\",\"doi\":\"10.1111/jace.20128\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>We present a hybrid similarity kernel that exemplifies the integration of short- and long-range descriptors via the use of an average kernel approach. This technique allows for a direct measure of the similarity between amorphous configurations, and when combined with an active learning (AL) spectral clustering approach, it leads to a classification of the amorphous configurations into uncorrelated clusters. Subsequently, a minimum size database is built by considering a small fraction of configurations belonging to each cluster and a machine learning interatomic potential (MLIP), within the Gaussian approximation scheme, is fitted by relying on a Bayesian optimization of the potential hyperparameters. This step is embedded within an AL loop that allows to sequentially increase the size of the learning database whenever the MLIP fails to meet a predefined energy convergence threshold. As such, MLIP are fitted in an almost fully automatized fashion. This approach is tested on two diverse amorphous systems that were previously generated using first-principles molecular dynamics. Accurate potentials with less than 2 meV/atom root mean square energy error compared to the reference data are obtained. This accuracy is achieved with only 175 configurations sampling the studied systems at various temperatures. The robustness of these potentials is then confirmed by producing models with several thousands of atoms featuring a good agreement with reference ab initio and experimental data.</p>\",\"PeriodicalId\":200,\"journal\":{\"name\":\"Journal of the American Ceramic Society\",\"volume\":\"108 1\",\"pages\":\"\"},\"PeriodicalIF\":3.5000,\"publicationDate\":\"2024-09-20\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://onlinelibrary.wiley.com/doi/epdf/10.1111/jace.20128\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of the American Ceramic Society\",\"FirstCategoryId\":\"88\",\"ListUrlMain\":\"https://onlinelibrary.wiley.com/doi/10.1111/jace.20128\",\"RegionNum\":3,\"RegionCategory\":\"材料科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"MATERIALS SCIENCE, CERAMICS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of the American Ceramic Society","FirstCategoryId":"88","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1111/jace.20128","RegionNum":3,"RegionCategory":"材料科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MATERIALS SCIENCE, CERAMICS","Score":null,"Total":0}
引用次数: 0
摘要
我们提出了一种混合相似性内核,它通过使用平均内核方法体现了短程和长程描述符的整合。这种技术可以直接测量非定常构型之间的相似性,并与主动学习(AL)光谱聚类方法相结合,将非定常构型分类为不相关的聚类。随后,通过考虑属于每个聚类的一小部分构型,建立一个最小规模的数据库,并在高斯近似方案中,依靠对势能超参数的贝叶斯优化,拟合机器学习原子间势能(MLIP)。这一步骤被嵌入到一个 AL 循环中,每当 MLIP 无法达到预定的能量收敛阈值时,就可以依次增加学习数据库的大小。因此,MLIP 几乎是以完全自动化的方式拟合的。这种方法在两个不同的无定形系统上进行了测试,这两个系统是之前使用第一原理分子动力学生成的。与参考数据相比,获得的精确势能均方根能量误差小于 2 meV/原子。在不同温度下,仅用 175 个配置对所研究的体系进行采样,就达到了这一精确度。然后,通过生成具有数千个原子的模型,证实了这些势的稳健性,其特点是与参考的 ab initio 和实验数据具有良好的一致性。
Multikernel similarity-based clustering of amorphous systems and machine-learned interatomic potentials by active learning
We present a hybrid similarity kernel that exemplifies the integration of short- and long-range descriptors via the use of an average kernel approach. This technique allows for a direct measure of the similarity between amorphous configurations, and when combined with an active learning (AL) spectral clustering approach, it leads to a classification of the amorphous configurations into uncorrelated clusters. Subsequently, a minimum size database is built by considering a small fraction of configurations belonging to each cluster and a machine learning interatomic potential (MLIP), within the Gaussian approximation scheme, is fitted by relying on a Bayesian optimization of the potential hyperparameters. This step is embedded within an AL loop that allows to sequentially increase the size of the learning database whenever the MLIP fails to meet a predefined energy convergence threshold. As such, MLIP are fitted in an almost fully automatized fashion. This approach is tested on two diverse amorphous systems that were previously generated using first-principles molecular dynamics. Accurate potentials with less than 2 meV/atom root mean square energy error compared to the reference data are obtained. This accuracy is achieved with only 175 configurations sampling the studied systems at various temperatures. The robustness of these potentials is then confirmed by producing models with several thousands of atoms featuring a good agreement with reference ab initio and experimental data.
期刊介绍:
The Journal of the American Ceramic Society contains records of original research that provide insight into or describe the science of ceramic and glass materials and composites based on ceramics and glasses. These papers include reports on discovery, characterization, and analysis of new inorganic, non-metallic materials; synthesis methods; phase relationships; processing approaches; microstructure-property relationships; and functionalities. Of great interest are works that support understanding founded on fundamental principles using experimental, theoretical, or computational methods or combinations of those approaches. All the published papers must be of enduring value and relevant to the science of ceramics and glasses or composites based on those materials.
Papers on fundamental ceramic and glass science are welcome including those in the following areas:
Enabling materials for grand challenges[...]
Materials design, selection, synthesis and processing methods[...]
Characterization of compositions, structures, defects, and properties along with new methods [...]
Mechanisms, Theory, Modeling, and Simulation[...]
JACerS accepts submissions of full-length Articles reporting original research, in-depth Feature Articles, Reviews of the state-of-the-art with compelling analysis, and Rapid Communications which are short papers with sufficient novelty or impact to justify swift publication.