考虑虚拟样本生成的ZIF-8’BET比表面积预测双阶段堆叠机器学习方法及实验验证

IF 3.7 2区 化学 Q2 CHEMISTRY, MULTIDISCIPLINARY Langmuir Pub Date : 2025-01-17 DOI:10.1021/acs.langmuir.4c04088
Fengfei Chen, Hongguang Zhou, Xiaohui Yu, Yunpeng Zhao, Chenchen Wang, Bin Dai, Sheng Han
{"title":"考虑虚拟样本生成的ZIF-8’BET比表面积预测双阶段堆叠机器学习方法及实验验证","authors":"Fengfei Chen, Hongguang Zhou, Xiaohui Yu, Yunpeng Zhao, Chenchen Wang, Bin Dai, Sheng Han","doi":"10.1021/acs.langmuir.4c04088","DOIUrl":null,"url":null,"abstract":"The widespread application of metal–organic frameworks (MOFs) in wastewater and gas treatment has created an increasing demand for accurate and rapid assessment of their BET specific surface area. However, experimental methods for acquiring sufficient statistical data are often costly and time-consuming. Therefore, this study proposes a dual-stage stacking model with Gaussian mixture model-virtual sample generation (GMM-VSG) technology for the BET specific surface area prediction. In this study, 90 real samples were selected from the MOF database and 300 virtual samples were generated. The performance on both real and virtual samples was evaluated by using four machine learning models, including Bayesian regression (Bayes), adaptive boosting (AdaBoost), random forest (RF), and extreme gradient boosting (XGBoost). Subsequently, three best-performing models and a linear regression model were selected for constructing a two-stage stacking model, with <i>R</i><sup>2</sup> value of 0.974. Finally, experimental conditions were adjusted based on feature importance analysis during the validation process, and the result shows that the prediction accuracy of the BET specific surface area is 0.943. This study contributes to the development of more efficient and accurate evaluation methods.","PeriodicalId":50,"journal":{"name":"Langmuir","volume":"14 1","pages":""},"PeriodicalIF":3.7000,"publicationDate":"2025-01-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Dual-Stage Stacking Machine Learning Method Considering Virtual Sample Generation for the Prediction of ZIF-8′ BET Specific Surface Area with Experimental Validation\",\"authors\":\"Fengfei Chen, Hongguang Zhou, Xiaohui Yu, Yunpeng Zhao, Chenchen Wang, Bin Dai, Sheng Han\",\"doi\":\"10.1021/acs.langmuir.4c04088\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The widespread application of metal–organic frameworks (MOFs) in wastewater and gas treatment has created an increasing demand for accurate and rapid assessment of their BET specific surface area. However, experimental methods for acquiring sufficient statistical data are often costly and time-consuming. Therefore, this study proposes a dual-stage stacking model with Gaussian mixture model-virtual sample generation (GMM-VSG) technology for the BET specific surface area prediction. In this study, 90 real samples were selected from the MOF database and 300 virtual samples were generated. The performance on both real and virtual samples was evaluated by using four machine learning models, including Bayesian regression (Bayes), adaptive boosting (AdaBoost), random forest (RF), and extreme gradient boosting (XGBoost). Subsequently, three best-performing models and a linear regression model were selected for constructing a two-stage stacking model, with <i>R</i><sup>2</sup> value of 0.974. Finally, experimental conditions were adjusted based on feature importance analysis during the validation process, and the result shows that the prediction accuracy of the BET specific surface area is 0.943. This study contributes to the development of more efficient and accurate evaluation methods.\",\"PeriodicalId\":50,\"journal\":{\"name\":\"Langmuir\",\"volume\":\"14 1\",\"pages\":\"\"},\"PeriodicalIF\":3.7000,\"publicationDate\":\"2025-01-17\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Langmuir\",\"FirstCategoryId\":\"92\",\"ListUrlMain\":\"https://doi.org/10.1021/acs.langmuir.4c04088\",\"RegionNum\":2,\"RegionCategory\":\"化学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"CHEMISTRY, MULTIDISCIPLINARY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Langmuir","FirstCategoryId":"92","ListUrlMain":"https://doi.org/10.1021/acs.langmuir.4c04088","RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"CHEMISTRY, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0

摘要

金属有机骨架(MOFs)在废水和气体处理中的广泛应用,对其BET比表面积的准确和快速评估产生了越来越大的需求。然而,获取足够统计数据的实验方法往往既昂贵又费时。因此,本研究提出了一种采用高斯混合模型-虚拟样品生成(GMM-VSG)技术的双级叠加模型,用于BET比表面积预测。本研究从MOF数据库中选取90个真实样本,生成300个虚拟样本。通过使用四种机器学习模型,包括贝叶斯回归(Bayes)、自适应增强(AdaBoost)、随机森林(RF)和极端梯度增强(XGBoost),对真实和虚拟样本的性能进行了评估。随后,选取三个表现最好的模型和一个线性回归模型构建两阶段叠加模型,R2值为0.974。最后,在验证过程中,根据特征重要性分析调整实验条件,结果表明,BET比表面积的预测精度为0.943。本研究有助于开发更高效、准确的评价方法。
本文章由计算机程序翻译,如有差异,请以英文原文为准。

摘要图片

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Dual-Stage Stacking Machine Learning Method Considering Virtual Sample Generation for the Prediction of ZIF-8′ BET Specific Surface Area with Experimental Validation
The widespread application of metal–organic frameworks (MOFs) in wastewater and gas treatment has created an increasing demand for accurate and rapid assessment of their BET specific surface area. However, experimental methods for acquiring sufficient statistical data are often costly and time-consuming. Therefore, this study proposes a dual-stage stacking model with Gaussian mixture model-virtual sample generation (GMM-VSG) technology for the BET specific surface area prediction. In this study, 90 real samples were selected from the MOF database and 300 virtual samples were generated. The performance on both real and virtual samples was evaluated by using four machine learning models, including Bayesian regression (Bayes), adaptive boosting (AdaBoost), random forest (RF), and extreme gradient boosting (XGBoost). Subsequently, three best-performing models and a linear regression model were selected for constructing a two-stage stacking model, with R2 value of 0.974. Finally, experimental conditions were adjusted based on feature importance analysis during the validation process, and the result shows that the prediction accuracy of the BET specific surface area is 0.943. This study contributes to the development of more efficient and accurate evaluation methods.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Langmuir
Langmuir 化学-材料科学:综合
CiteScore
6.50
自引率
10.30%
发文量
1464
审稿时长
2.1 months
期刊介绍: Langmuir is an interdisciplinary journal publishing articles in the following subject categories: Colloids: surfactants and self-assembly, dispersions, emulsions, foams Interfaces: adsorption, reactions, films, forces Biological Interfaces: biocolloids, biomolecular and biomimetic materials Materials: nano- and mesostructured materials, polymers, gels, liquid crystals Electrochemistry: interfacial charge transfer, charge transport, electrocatalysis, electrokinetic phenomena, bioelectrochemistry Devices and Applications: sensors, fluidics, patterning, catalysis, photonic crystals However, when high-impact, original work is submitted that does not fit within the above categories, decisions to accept or decline such papers will be based on one criteria: What Would Irving Do? Langmuir ranks #2 in citations out of 136 journals in the category of Physical Chemistry with 113,157 total citations. The journal received an Impact Factor of 4.384*. This journal is also indexed in the categories of Materials Science (ranked #1) and Multidisciplinary Chemistry (ranked #5).
期刊最新文献
Self-Assembly of Mycolic Acid in Water: Monolayer or Bilayer Composition Conversion-Induced Disassembly of Amphiphilic ABA Triblock Copolymer Vesicles: A Monte Carlo Study Beyond Surfactants: Janus Particles for Functional Interfaces and Coatings Propensity of Water Self-Ions at Air(Oil)–Water Interfaces Revealed by Deep Potential Molecular Dynamics with Enhanced Sampling Encapsulation of Oil Droplets Using Film-Forming Janus Nanoparticles
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1