Developing new high-entropy alloys with enhanced hardness using a hybrid machine learning approach: integrating interpretability and NSGA-II optimization

IF 3.9 3区 材料科学 Q2 MATERIALS SCIENCE, MULTIDISCIPLINARY Journal of Materials Science Pub Date : 2025-03-05 DOI:10.1007/s10853-025-10729-5
Debsundar Dey, Anik Pal, Pranjal Biyani, Pritam Mandal, Snehanshu Pal, Suchandan Das, Santanu Dey, Manojit Ghosh
{"title":"Developing new high-entropy alloys with enhanced hardness using a hybrid machine learning approach: integrating interpretability and NSGA-II optimization","authors":"Debsundar Dey,&nbsp;Anik Pal,&nbsp;Pranjal Biyani,&nbsp;Pritam Mandal,&nbsp;Snehanshu Pal,&nbsp;Suchandan Das,&nbsp;Santanu Dey,&nbsp;Manojit Ghosh","doi":"10.1007/s10853-025-10729-5","DOIUrl":null,"url":null,"abstract":"<div><p>This study uses machine learning (ML) to simplify the complex and time-consuming process of predicting the hardness of high-entropy alloys (HEAs). A stacking regression model combined with a Transformed Target Regressor (TTR) is proposed, utilizing three top-performing base models such as support vector regression (SVR), LightGBM (LGBM), and random forest (RF). The model incorporates 20 key thermodynamic, mismatch, and combination parameters (physical features) along with 18 different elements to enhance generalization and account for various input feature effects, specifically to predict the hardness of HEAs. Feature selection was done in two stages using the Pearson correlation coefficient (<i>P</i><sub><i>c</i></sub>) and conditional mutual information-based feature selection (CMIFS) methods. The impact of alloy composition and physical features on hardness was analyzed with SHapley Additive exPlanations (SHAP) values and partial dependence plots (PDPs), helping to better understand the model’s predictions. The stacked model outperformed the individual models, achieving an overall <i>R</i><sup>2</sup> score of 0.88 and 0.99 for composition and physical features-based data, respectively. Additionally, the non-dominated sorting genetic algorithm II (NSGA-II) was used to optimize the hardness of the HEAs, resulting in a more than 24% increase in hardness compared to the initial data. The optimized composition of Al<sub>17.24</sub>Fe<sub>24.79</sub>Cr<sub>1.95</sub>Mo<sub>6.84</sub> Ti<sub>13.03</sub> Nb<sub>7.89</sub> Hf<sub>8.26</sub> was identified as having the highest hardness. This ML workflow serves as a general framework to optimize alloy chemical spaces and input features to achieve desired properties. Overall, this model provides interpretability and generalization through ensemble learning, offering insights for designing high hardness HEAs.</p><h3>Graphical abstract</h3>\n<div><figure><div><div><picture><source><img></source></picture></div></div></figure></div></div>","PeriodicalId":645,"journal":{"name":"Journal of Materials Science","volume":"60 10","pages":"4820 - 4845"},"PeriodicalIF":3.9000,"publicationDate":"2025-03-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Materials Science","FirstCategoryId":"88","ListUrlMain":"https://link.springer.com/article/10.1007/s10853-025-10729-5","RegionNum":3,"RegionCategory":"材料科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MATERIALS SCIENCE, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0

Abstract

This study uses machine learning (ML) to simplify the complex and time-consuming process of predicting the hardness of high-entropy alloys (HEAs). A stacking regression model combined with a Transformed Target Regressor (TTR) is proposed, utilizing three top-performing base models such as support vector regression (SVR), LightGBM (LGBM), and random forest (RF). The model incorporates 20 key thermodynamic, mismatch, and combination parameters (physical features) along with 18 different elements to enhance generalization and account for various input feature effects, specifically to predict the hardness of HEAs. Feature selection was done in two stages using the Pearson correlation coefficient (Pc) and conditional mutual information-based feature selection (CMIFS) methods. The impact of alloy composition and physical features on hardness was analyzed with SHapley Additive exPlanations (SHAP) values and partial dependence plots (PDPs), helping to better understand the model’s predictions. The stacked model outperformed the individual models, achieving an overall R2 score of 0.88 and 0.99 for composition and physical features-based data, respectively. Additionally, the non-dominated sorting genetic algorithm II (NSGA-II) was used to optimize the hardness of the HEAs, resulting in a more than 24% increase in hardness compared to the initial data. The optimized composition of Al17.24Fe24.79Cr1.95Mo6.84 Ti13.03 Nb7.89 Hf8.26 was identified as having the highest hardness. This ML workflow serves as a general framework to optimize alloy chemical spaces and input features to achieve desired properties. Overall, this model provides interpretability and generalization through ensemble learning, offering insights for designing high hardness HEAs.

Graphical abstract

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
使用混合机器学习方法开发具有增强硬度的新型高熵合金:整合可解释性和NSGA-II优化
本研究使用机器学习(ML)来简化高熵合金(HEAs)硬度预测的复杂且耗时的过程。利用支持向量回归(SVR)、LightGBM (LGBM)和随机森林(RF)三种性能最好的基础模型,提出了一种结合变换目标回归(TTR)的叠加回归模型。该模型结合了20个关键的热力学、失配和组合参数(物理特征)以及18个不同的元素,以增强泛化并考虑各种输入特征效应,特别是用于预测HEAs的硬度。采用Pearson相关系数(Pc)和条件互信息特征选择(CMIFS)方法分两个阶段进行特征选择。利用SHapley加性解释(SHAP)值和部分相关图(pdp)分析合金成分和物理特征对硬度的影响,有助于更好地理解模型的预测结果。堆叠模型优于单个模型,对于基于成分和物理特征的数据,其总体R2得分分别为0.88和0.99。此外,采用非支配排序遗传算法II (NSGA-II)对HEAs的硬度进行优化,硬度比初始数据提高24%以上。优化后的组合物Al17.24Fe24.79Cr1.95Mo6.84 Ti13.03 Nb7.89 Hf8.26的硬度最高。该机器学习工作流程可作为优化合金化学空间和输入特征以实现所需属性的一般框架。总体而言,该模型通过集成学习提供了可解释性和泛化性,为设计高硬度HEAs提供了见解。图形抽象
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Journal of Materials Science
Journal of Materials Science 工程技术-材料科学:综合
CiteScore
7.90
自引率
4.40%
发文量
1297
审稿时长
2.4 months
期刊介绍: The Journal of Materials Science publishes reviews, full-length papers, and short Communications recording original research results on, or techniques for studying the relationship between structure, properties, and uses of materials. The subjects are seen from international and interdisciplinary perspectives covering areas including metals, ceramics, glasses, polymers, electrical materials, composite materials, fibers, nanostructured materials, nanocomposites, and biological and biomedical materials. The Journal of Materials Science is now firmly established as the leading source of primary communication for scientists investigating the structure and properties of all engineering materials.
期刊最新文献
The influence of carbon sources on the structural and electrochemical performance of silicon–carbon composite anodes for lithium-ion batteries Fabrication of modified bamboo fiber/PLA/PBAT composite for interfacial compatibilization and toughening Correction: Electron beam-induced crosslinking and nano-tin integration: a novel strategy for enhancing the mechanical, structural, and electrical properties of PVA/PVP nanocomposite films A series of highly water-stable Ln-MOFs for efficient Fe3+ ion fluorescent probe Rational design of hierarchical ZnO microspheres for high-performance acetone gas detection
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1