{"title":"Data-efficient and Interpretable Inverse Materials Design using a Disentangled Variational Autoencoder","authors":"Cheng Zeng, Zulqarnain Khan, Nathan L. Post","doi":"arxiv-2409.06740","DOIUrl":null,"url":null,"abstract":"Inverse materials design has proven successful in accelerating novel material\ndiscovery. Many inverse materials design methods use unsupervised learning\nwhere a latent space is learned to offer a compact description of materials\nrepresentations. A latent space learned this way is likely to be entangled, in\nterms of the target property and other properties of the materials. This makes\nthe inverse design process ambiguous. Here, we present a semi-supervised\nlearning approach based on a disentangled variational autoencoder to learn a\nprobabilistic relationship between features, latent variables and target\nproperties. This approach is data efficient because it combines all labelled\nand unlabelled data in a coherent manner, and it uses expert-informed prior\ndistributions to improve model robustness even with limited labelled data. It\nis in essence interpretable, as the learnable target property is disentangled\nout of the other properties of the materials, and an extra layer of\ninterpretability can be provided by a post-hoc analysis of the classification\nhead of the model. We demonstrate this new approach on an experimental\nhigh-entropy alloy dataset with chemical compositions as input and single-phase\nformation as the single target property. While single property is used in this\nwork, the disentangled model can be extended to customize for inverse design of\nmaterials with multiple target properties.","PeriodicalId":501234,"journal":{"name":"arXiv - PHYS - Materials Science","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2024-09-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - PHYS - Materials Science","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2409.06740","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Inverse materials design has proven successful in accelerating novel material
discovery. Many inverse materials design methods use unsupervised learning
where a latent space is learned to offer a compact description of materials
representations. A latent space learned this way is likely to be entangled, in
terms of the target property and other properties of the materials. This makes
the inverse design process ambiguous. Here, we present a semi-supervised
learning approach based on a disentangled variational autoencoder to learn a
probabilistic relationship between features, latent variables and target
properties. This approach is data efficient because it combines all labelled
and unlabelled data in a coherent manner, and it uses expert-informed prior
distributions to improve model robustness even with limited labelled data. It
is in essence interpretable, as the learnable target property is disentangled
out of the other properties of the materials, and an extra layer of
interpretability can be provided by a post-hoc analysis of the classification
head of the model. We demonstrate this new approach on an experimental
high-entropy alloy dataset with chemical compositions as input and single-phase
formation as the single target property. While single property is used in this
work, the disentangled model can be extended to customize for inverse design of
materials with multiple target properties.