Classified dataset, regression and machine learning modeling for prediction of phase transformation temperatures in steels

IF 1.9 3区 材料科学 Q4 CHEMISTRY, PHYSICAL Calphad-computer Coupling of Phase Diagrams and Thermochemistry Pub Date : 2024-09-18 DOI:10.1016/j.calphad.2024.102748
Jinlei Lu, Guanglong Xu, Fuwen Chen, Yuwen Cui
{"title":"Classified dataset, regression and machine learning modeling for prediction of phase transformation temperatures in steels","authors":"Jinlei Lu,&nbsp;Guanglong Xu,&nbsp;Fuwen Chen,&nbsp;Yuwen Cui","doi":"10.1016/j.calphad.2024.102748","DOIUrl":null,"url":null,"abstract":"<div><p>The prediction of the characteristic Martensite Start (<em>M</em><sub><em>s</em></sub>) temperature and Austenitic Nose Tip Temperature (ANTT) in steels is of scientific and technological importance; however, it faces significant challenges due to multiphysical complexity.</p><p>In this study, we introduced a structured framework for data classification and hierarchical iterations aimed at predicting Ms (Martensite start temperature) and ANTT (Austenite non-transforming temperature). This framework was incorporated into two optimization models, leading to enhancements in accuracy, extrapolation capabilities, and generalization performance. First, we classified the collected Ms datasets hierarchically based on the alloying elements presented in steels, including carbon, austenite stabilizers, non-austenitization elements, and data credibility. Regression analyses of Ms temperatures concerning chemical compositions were then carried out using phenomenological variables from binary systems to multi-component systems in alignment with the spirit of CALPHAD modeling, which is renowned for its robust extrapolation abilities. By iteratively fitting the hierarchically classified datasets and implementing hierarchical iterations, we developed the CALPHAD-guided phenomenological variable (CGPV) Ms regression model. This model achieved improved accuracy levels, with R<sup>2</sup> values of 0.9 for training and 0.87 for testing, surpassing most conventional regression models that do not account for compositional interactions. Furthermore, the CALPHAD-guided machine learning (CGML) model, constructed based on the classified datasets and hierarchical iterations but without utilizing phenomenological variables, demonstrated strong performance with R<sup>2</sup> values of 0.98 and 0.86 for training and testing, respectively. The CGML model was demonstrated not only to reliably filter out problematic data in a dataset but also to unveil the unnoticed coupling between carbon and other alloying elements on <em>M</em><sub><em>s</em></sub>. Finally, the CGML method has been readily transferred to predict ANTT with high accuracy as well.</p></div>","PeriodicalId":9436,"journal":{"name":"Calphad-computer Coupling of Phase Diagrams and Thermochemistry","volume":"87 ","pages":"Article 102748"},"PeriodicalIF":1.9000,"publicationDate":"2024-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Calphad-computer Coupling of Phase Diagrams and Thermochemistry","FirstCategoryId":"88","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0364591624000907","RegionNum":3,"RegionCategory":"材料科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"CHEMISTRY, PHYSICAL","Score":null,"Total":0}
引用次数: 0

Abstract

The prediction of the characteristic Martensite Start (Ms) temperature and Austenitic Nose Tip Temperature (ANTT) in steels is of scientific and technological importance; however, it faces significant challenges due to multiphysical complexity.

In this study, we introduced a structured framework for data classification and hierarchical iterations aimed at predicting Ms (Martensite start temperature) and ANTT (Austenite non-transforming temperature). This framework was incorporated into two optimization models, leading to enhancements in accuracy, extrapolation capabilities, and generalization performance. First, we classified the collected Ms datasets hierarchically based on the alloying elements presented in steels, including carbon, austenite stabilizers, non-austenitization elements, and data credibility. Regression analyses of Ms temperatures concerning chemical compositions were then carried out using phenomenological variables from binary systems to multi-component systems in alignment with the spirit of CALPHAD modeling, which is renowned for its robust extrapolation abilities. By iteratively fitting the hierarchically classified datasets and implementing hierarchical iterations, we developed the CALPHAD-guided phenomenological variable (CGPV) Ms regression model. This model achieved improved accuracy levels, with R2 values of 0.9 for training and 0.87 for testing, surpassing most conventional regression models that do not account for compositional interactions. Furthermore, the CALPHAD-guided machine learning (CGML) model, constructed based on the classified datasets and hierarchical iterations but without utilizing phenomenological variables, demonstrated strong performance with R2 values of 0.98 and 0.86 for training and testing, respectively. The CGML model was demonstrated not only to reliably filter out problematic data in a dataset but also to unveil the unnoticed coupling between carbon and other alloying elements on Ms. Finally, the CGML method has been readily transferred to predict ANTT with high accuracy as well.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
用于预测钢中相变温度的分类数据集、回归和机器学习模型
预测钢材中的马氏体起始温度(Ms)和奥氏体鼻尖温度(ANTT)具有重要的科学和技术意义;然而,由于多重物理复杂性,预测工作面临着巨大挑战。该框架被纳入两个优化模型,从而提高了准确性、外推能力和泛化性能。首先,我们根据钢中的合金元素(包括碳、奥氏体稳定剂、非奥氏体化元素和数据可信度)对收集的 Ms 数据集进行了分级分类。然后,利用从二元体系到多组分体系的现象变量,对有关化学成分的 Ms 温度进行回归分析,这与 CALPHAD 建模的精神是一致的,CALPHAD 以其强大的外推能力而著称。通过迭代拟合分层分类数据集和实施分层迭代,我们开发出了 CALPHAD 引导的现象学变量(CGPV)Ms 回归模型。该模型提高了准确度,训练的 R2 值为 0.9,测试的 R2 值为 0.87,超过了大多数不考虑成分相互作用的传统回归模型。此外,基于分类数据集和分层迭代构建的 CALPHAD 引导的机器学习(CGML)模型表现出色,训练和测试的 R2 值分别为 0.98 和 0.86。结果表明,CGML 模型不仅能可靠地过滤掉数据集中的问题数据,还能揭示出 Ms 上碳与其他合金元素之间未被注意到的耦合关系。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
CiteScore
4.00
自引率
16.70%
发文量
94
审稿时长
2.5 months
期刊介绍: The design of industrial processes requires reliable thermodynamic data. CALPHAD (Computer Coupling of Phase Diagrams and Thermochemistry) aims to promote computational thermodynamics through development of models to represent thermodynamic properties for various phases which permit prediction of properties of multicomponent systems from those of binary and ternary subsystems, critical assessment of data and their incorporation into self-consistent databases, development of software to optimize and derive thermodynamic parameters and the development and use of databanks for calculations to improve understanding of various industrial and technological processes. This work is disseminated through the CALPHAD journal and its annual conference.
期刊最新文献
Exploration of high-ductility ternary refractory complex concentrated alloys using first-principles calculations and machine learning Interdiffusivity matrices and atomic mobilities in fcc Ni–Fe–Mo alloys: Experiment and modeling The Al2O3-SiO2-“V2O3” phase diagram at 1873 K Experimental investigation and thermodynamic calculation of the Al-Cr-Pd ternary system High temperature phase relations and structure determination of solid solutions in the ternary Nd-Dy-Cu system
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1