A Multimodal Learning Model based on a QSPR approach for the estimation of RON, MON and CN, for any C, H, O hydrocarbons

IF 6.7 1区 工程技术 Q2 ENERGY & FUELS Fuel Pub Date : 2024-10-30 DOI:10.1016/j.fuel.2024.133438
{"title":"A Multimodal Learning Model based on a QSPR approach for the estimation of RON, MON and CN, for any C, H, O hydrocarbons","authors":"","doi":"10.1016/j.fuel.2024.133438","DOIUrl":null,"url":null,"abstract":"<div><div>With the increasing demand for alternative fuels and the use of biomass as feedstock for fuel production, a wider range of oxygenated hydrocarbons as fuel additives needs to be considered. Consequently, the development of robust methods for predicting criteria such as Research Octane Number (RON), Motor Octane Number (MON), and Cetane Number (CN) will play a crucial role in characterizing novel fuels.</div><div>In this paper, we propose a robust deep-learning model based on a Quantitative Structure-Property Relationship (QSPR) approach for estimating RON, MON, and CN of any C, H, and O molecules. We developed a multimodal learning model that combines two types of data, using an Artificial Neural Network (ANN) as the foundation. The Mordred algorithm was used to determine 457 descriptors to characterize hydrocarbons. These numerical values represent the first type of data considered in this study.</div><div>To account for the effects of mesomerism or chirality in molecules, the InChIKey notation was used. This 27-character notation represents the second type of data, treated as text data. To encode these textual variables into numeric data, we employed a Word Embedding method.</div><div>The predictions from the final model were successfully tested against a large set of experimental data and compared with those from five recent learning models—GNN, ANN, GPR, DLMO, and PIGNN—found in the literature. The GNN (Graph Neural Networks) model relies on the molecular architecture, the ANN (Artificial Neural Networks) model is based on a limited number of chemical groups, while the GPR (Gaussian Process Regression) model is primarily based on Joback groups. The two most recent methodologies, DLMO (Deep Learning Mixing Operator) and PIGNN (Physics-Informed Graph Neural Network), utilize more sophisticated algorithms.</div><div>Full comparisons and multiple tests demonstrate the very robust and predictive capabilities of our newly proposed multimodal learning model. The prediction tool is available via a web page at <span><span>http://ehlcathol.eu/</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":325,"journal":{"name":"Fuel","volume":null,"pages":null},"PeriodicalIF":6.7000,"publicationDate":"2024-10-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Fuel","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0016236124025870","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENERGY & FUELS","Score":null,"Total":0}
引用次数: 0

Abstract

With the increasing demand for alternative fuels and the use of biomass as feedstock for fuel production, a wider range of oxygenated hydrocarbons as fuel additives needs to be considered. Consequently, the development of robust methods for predicting criteria such as Research Octane Number (RON), Motor Octane Number (MON), and Cetane Number (CN) will play a crucial role in characterizing novel fuels.
In this paper, we propose a robust deep-learning model based on a Quantitative Structure-Property Relationship (QSPR) approach for estimating RON, MON, and CN of any C, H, and O molecules. We developed a multimodal learning model that combines two types of data, using an Artificial Neural Network (ANN) as the foundation. The Mordred algorithm was used to determine 457 descriptors to characterize hydrocarbons. These numerical values represent the first type of data considered in this study.
To account for the effects of mesomerism or chirality in molecules, the InChIKey notation was used. This 27-character notation represents the second type of data, treated as text data. To encode these textual variables into numeric data, we employed a Word Embedding method.
The predictions from the final model were successfully tested against a large set of experimental data and compared with those from five recent learning models—GNN, ANN, GPR, DLMO, and PIGNN—found in the literature. The GNN (Graph Neural Networks) model relies on the molecular architecture, the ANN (Artificial Neural Networks) model is based on a limited number of chemical groups, while the GPR (Gaussian Process Regression) model is primarily based on Joback groups. The two most recent methodologies, DLMO (Deep Learning Mixing Operator) and PIGNN (Physics-Informed Graph Neural Network), utilize more sophisticated algorithms.
Full comparisons and multiple tests demonstrate the very robust and predictive capabilities of our newly proposed multimodal learning model. The prediction tool is available via a web page at http://ehlcathol.eu/.

Abstract Image

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于 QSPR 方法的多模态学习模型,用于估算任何 C、H、O 碳氢化合物的 RON、MON 和 CN 值
随着对替代燃料的需求不断增加,以及使用生物质作为燃料生产的原料,需要考虑将更多含氧碳氢化合物作为燃料添加剂。因此,开发用于预测研究辛烷值(RON)、汽车辛烷值(MON)和十六烷值(CN)等标准的稳健方法将在鉴定新型燃料的特性方面发挥至关重要的作用。在本文中,我们提出了一种基于定量结构-特性关系(QSPR)方法的稳健深度学习模型,用于估算任何 C、H 和 O 分子的 RON、MON 和 CN。我们以人工神经网络(ANN)为基础,开发了一种结合两类数据的多模态学习模型。我们使用 Mordred 算法确定了 457 个描述符来描述碳氢化合物的特征。这些数值代表了本研究中考虑的第一类数据。为了考虑分子中的中观性或手性的影响,我们使用了 InChIKey 符号。这种 27 个字符的符号代表第二类数据,被视为文本数据。为了将这些文本变量编码为数字数据,我们采用了单词嵌入法。最终模型的预测结果成功地通过了大量实验数据的测试,并与文献中最新发现的五种学习模型--GNN、ANN、GPR、DLMO 和 PIGNN 的预测结果进行了比较。GNN(图神经网络)模型依赖于分子结构,ANN(人工神经网络)模型基于数量有限的化学基团,而 GPR(高斯过程回归)模型主要基于乔巴克基团。最新的两种方法,即 DLMO(深度学习混合运算器)和 PIGNNN(物理信息图神经网络),则采用了更复杂的算法。全面比较和多重测试表明,我们新提出的多模态学习模型具有非常强大的预测能力。预测工具可通过网页 http://ehlcathol.eu/ 获取。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Fuel
Fuel 工程技术-工程:化工
CiteScore
12.80
自引率
20.30%
发文量
3506
审稿时长
64 days
期刊介绍: The exploration of energy sources remains a critical matter of study. For the past nine decades, fuel has consistently held the forefront in primary research efforts within the field of energy science. This area of investigation encompasses a wide range of subjects, with a particular emphasis on emerging concerns like environmental factors and pollution.
期刊最新文献
Multi-scale reservoir response characteristics of coalbed methane development through stress relief via horizontal well cavity completion in tectonically deformed coals Analysis of the combustion speed in a spark ignition engine fuelled with hydrogen and gasoline blends at different air fuel ratios Experimental study on elastic development characteristics and imbibition mechanism of shale oil reservoirs at micro-nano scale La-doped bimetallic PCN-777 (La-Zr) catalyst synthesized by oil-bath heating for efficient oxidative desulfurization of dibenzothiophene Evaluation of the synergistic effect of platinum-supported graphene materials on methylcyclohexane decomposition via flow reactor experiments and reactive force-field molecular dynamics simulations
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1