利用机器学习模型预测离子液体的毒性,并将其应用于天然气水合物

IF 1 4区 工程技术 Q4 ENGINEERING, CHEMICAL Process Safety Progress Pub Date : 2024-03-23 DOI:10.1002/prs.12599
Nurul Hannah Abdullah, Dzulkarnain Zaini, Bhajan Lal
{"title":"利用机器学习模型预测离子液体的毒性,并将其应用于天然气水合物","authors":"Nurul Hannah Abdullah, Dzulkarnain Zaini, Bhajan Lal","doi":"10.1002/prs.12599","DOIUrl":null,"url":null,"abstract":"Ionic liquids (ILs) are highly favored in the oil and gas industry as gas hydrate inhibitors due to their dual functionality as thermodynamic inhibitor and kinetic hydrate inhibitor. Though known as the “green alternatives,” concerns about the effects of ILs in the environment are rising such that ILs can stabilize in water systems. Furthermore, there are insufficient data on the toxicity of ILs, limiting the use of ILs for industrial applications. Ridge, LASSO, decision tree, random forest, extra tree, gradient boost, and support vector regressions were used to develop IL toxicity predictive models. Random forest yielded the strongest predictive performance, scoring the highest <jats:italic>R</jats:italic><jats:sup>2</jats:sup> value of 0.86, with mean absolute error and root mean square error values of 0.32 and 0.43, respectively. Feature selections were conducted to investigate the contributions of the five molecular descriptors involved in developing regression models in this work. Descriptor MSD<jats:sup>C</jats:sup> was found to contribute the highest at 67% in predicting the toxicity of ILs, followed by SNar<jats:sup>A</jats:sup> and MAXDP<jats:sup>C</jats:sup>, demonstrating contributions of 15.2% and 14.1%, respectively. Further quantitative structure–activity relationship model validations were executed; the use of three descriptors resulted in a 2% increase in predictive performance for decision tree regression, whereas <jats:italic>R</jats:italic><jats:sup>2</jats:sup> values remained the same for random forest, extra tree, and gradient boosting.","PeriodicalId":20680,"journal":{"name":"Process Safety Progress","volume":"86 1","pages":""},"PeriodicalIF":1.0000,"publicationDate":"2024-03-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Prediction of ionic liquids toxicity using machine learning models for application to gas hydrate\",\"authors\":\"Nurul Hannah Abdullah, Dzulkarnain Zaini, Bhajan Lal\",\"doi\":\"10.1002/prs.12599\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Ionic liquids (ILs) are highly favored in the oil and gas industry as gas hydrate inhibitors due to their dual functionality as thermodynamic inhibitor and kinetic hydrate inhibitor. Though known as the “green alternatives,” concerns about the effects of ILs in the environment are rising such that ILs can stabilize in water systems. Furthermore, there are insufficient data on the toxicity of ILs, limiting the use of ILs for industrial applications. Ridge, LASSO, decision tree, random forest, extra tree, gradient boost, and support vector regressions were used to develop IL toxicity predictive models. Random forest yielded the strongest predictive performance, scoring the highest <jats:italic>R</jats:italic><jats:sup>2</jats:sup> value of 0.86, with mean absolute error and root mean square error values of 0.32 and 0.43, respectively. Feature selections were conducted to investigate the contributions of the five molecular descriptors involved in developing regression models in this work. Descriptor MSD<jats:sup>C</jats:sup> was found to contribute the highest at 67% in predicting the toxicity of ILs, followed by SNar<jats:sup>A</jats:sup> and MAXDP<jats:sup>C</jats:sup>, demonstrating contributions of 15.2% and 14.1%, respectively. Further quantitative structure–activity relationship model validations were executed; the use of three descriptors resulted in a 2% increase in predictive performance for decision tree regression, whereas <jats:italic>R</jats:italic><jats:sup>2</jats:sup> values remained the same for random forest, extra tree, and gradient boosting.\",\"PeriodicalId\":20680,\"journal\":{\"name\":\"Process Safety Progress\",\"volume\":\"86 1\",\"pages\":\"\"},\"PeriodicalIF\":1.0000,\"publicationDate\":\"2024-03-23\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Process Safety Progress\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://doi.org/10.1002/prs.12599\",\"RegionNum\":4,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q4\",\"JCRName\":\"ENGINEERING, CHEMICAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Process Safety Progress","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.1002/prs.12599","RegionNum":4,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"ENGINEERING, CHEMICAL","Score":null,"Total":0}
引用次数: 0

摘要

离子液体(ILs)具有热力学抑制剂和动力学水合物抑制剂的双重功能,因此作为天然气水合物抑制剂在石油和天然气行业备受青睐。虽然离子液体被称为 "绿色替代品",但人们对其环境影响的担忧也在增加,因为离子液体可在水系统中稳定存在。此外,有关 ILs 毒性的数据不足,限制了 ILs 在工业中的应用。研究人员使用 Ridge、LASSO、决策树、随机森林、额外树、梯度提升和支持向量回归等方法建立了惰性气体毒性预测模型。随机森林的预测性能最强,R2 值最高,为 0.86,平均绝对误差和均方根误差值分别为 0.32 和 0.43。对特征进行了选择,以研究本研究中建立回归模型所涉及的五个分子描述符的贡献。结果发现,描述符 MSDC 在预测 IL 毒性方面的贡献率最高,达到 67%,其次是 SNarA 和 MAXDPC,贡献率分别为 15.2% 和 14.1%。进一步对定量结构-活性关系模型进行了验证;使用三个描述符后,决策树回归的预测性能提高了 2%,而随机森林、额外树和梯度提升的 R2 值保持不变。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Prediction of ionic liquids toxicity using machine learning models for application to gas hydrate
Ionic liquids (ILs) are highly favored in the oil and gas industry as gas hydrate inhibitors due to their dual functionality as thermodynamic inhibitor and kinetic hydrate inhibitor. Though known as the “green alternatives,” concerns about the effects of ILs in the environment are rising such that ILs can stabilize in water systems. Furthermore, there are insufficient data on the toxicity of ILs, limiting the use of ILs for industrial applications. Ridge, LASSO, decision tree, random forest, extra tree, gradient boost, and support vector regressions were used to develop IL toxicity predictive models. Random forest yielded the strongest predictive performance, scoring the highest R2 value of 0.86, with mean absolute error and root mean square error values of 0.32 and 0.43, respectively. Feature selections were conducted to investigate the contributions of the five molecular descriptors involved in developing regression models in this work. Descriptor MSDC was found to contribute the highest at 67% in predicting the toxicity of ILs, followed by SNarA and MAXDPC, demonstrating contributions of 15.2% and 14.1%, respectively. Further quantitative structure–activity relationship model validations were executed; the use of three descriptors resulted in a 2% increase in predictive performance for decision tree regression, whereas R2 values remained the same for random forest, extra tree, and gradient boosting.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Process Safety Progress
Process Safety Progress 工程技术-工程:化工
CiteScore
2.20
自引率
10.00%
发文量
99
审稿时长
6-12 weeks
期刊介绍: Process Safety Progress covers process safety for engineering professionals. It addresses such topics as incident investigations/case histories, hazardous chemicals management, hazardous leaks prevention, risk assessment, process hazards evaluation, industrial hygiene, fire and explosion analysis, preventive maintenance, vapor cloud dispersion, and regulatory compliance, training, education, and other areas in process safety and loss prevention, including emerging concerns like plant and/or process security. Papers from the annual Loss Prevention Symposium and other AIChE safety conferences are automatically considered for publication, but unsolicited papers, particularly those addressing process safety issues in emerging technologies and industries are encouraged and evaluated equally.
期刊最新文献
Numerical study of failure modes of hazardous material tanks exposed to fire accidents in the process industry So, you cannot vent: A deep dive into other explosion protection methods Risk and consequence analysis of ammonia storage units in a nuclear fuel cycle facility Diagnosing electrostatic problems and hazards in industrial processes: Case studies Numerical simulation study on propane gas leakage and diffusion law in slope terrain
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1