Machine Learning Models for Efficient Property Prediction of ABX3 Materials: A High-Throughput Approach
Soundous Touati, Ali Benghia, Zoulikha Hebboul, Ibn Khaldoun Lefkaier, Mohammed Benali Kanoun, and Souraya Goumri-Said*
ACS Omega, vol. 9, no. 48, pp. 47519–47531. Published 2024-11-18.
DOI: 10.1021/acsomega.4c06139 (https://pubs.acs.org/doi/10.1021/acsomega.4c06139)
Citations: 0
Abstract
ABX3 materials have recently attracted significant attention for their diverse applications in photovoltaics, catalysis, and optoelectronics and for their remarkable energy-conversion efficiency. Progress has nevertheless been slow because experiments are expensive and density functional theory (DFT) calculations are time-consuming. In this study, we use the extreme gradient boosting (XGBoost) algorithm to facilitate the discovery and characterization of ABX3 compounds based on large data sets generated by DFT calculations. Although XGBoost is a powerful tool for accelerating this discovery, different levels of DFT approximation can significantly affect the predicted band gaps, potentially introducing discrepancies with experimental values. In the first step, we predict the space group of 13947 oxides and halides using the Open Quantum Materials Database and elemental features, obtaining classification accuracies ranging from 82.39% to 99.14% across these materials. XGBoost regression algorithms are then applied to the data set to predict volume (optimal accuracy 98.41%, mean absolute error (MAE) 2.395 Å³, root-mean-square error (RMSE) 4.416 Å³), formation energy (optimal accuracy 97.36%, MAE 0.075 eV/atom, RMSE 0.132 eV/atom), and band gap energy (optimal accuracy 87.00%, MAE 0.391 eV, RMSE 0.574 eV). Finally, these prediction models are used to identify the possible space groups for each of 1252 new ABX3 formulas and then to predict the volume, formation energy, and band gap energy for each candidate space group. Through these predictive models, machine learning accelerates the exploration of new materials with enhanced performance and functionality.
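The two regression error measures quoted in the abstract, MAE and RMSE, follow their standard definitions, which can be illustrated with a minimal sketch. The function names and the sample band-gap values below are assumptions made up for illustration; they are not taken from the paper or its data set.

```python
import math


def mean_absolute_error(y_true, y_pred):
    """Return the mean of |prediction - reference| over all samples."""
    if len(y_true) != len(y_pred) or len(y_true) == 0:
        raise ValueError("inputs must be non-empty and of equal length")
    return sum(abs(p - t) for t, p in zip(y_true, y_pred)) / len(y_true)


def root_mean_square_error(y_true, y_pred):
    """Return the square root of the mean squared error."""
    if len(y_true) != len(y_pred) or len(y_true) == 0:
        raise ValueError("inputs must be non-empty and of equal length")
    mse = sum((p - t) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true)
    return math.sqrt(mse)


# Hypothetical reference and predicted band gaps in eV (illustrative only).
reference_gaps = [1.0, 2.0, 3.0, 4.0]
predicted_gaps = [1.5, 2.0, 2.0, 4.5]

mae = mean_absolute_error(reference_gaps, predicted_gaps)
rmse = root_mean_square_error(reference_gaps, predicted_gaps)
print(f"MAE = {mae:.3f} eV, RMSE = {rmse:.3f} eV")
```

Because RMSE squares each error before averaging, it weights large deviations more heavily and is never smaller than MAE. The reported pairs are consistent with this (for example, 0.574 eV versus 0.391 eV for the band gap).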
ACS Omega (Chemical Engineering: General Chemical Engineering)
CiteScore: 6.60
Self-citation rate: 4.90%
Articles published: 3945
Review time: 2.4 months
About the journal:
ACS Omega is an open-access global publication for scientific articles that describe new findings in chemistry and interfacing areas of science, without any perceived evaluation of immediate impact.