Quo Vadis ChatGPT? From large language models to Large Knowledge Models

IF 3.9 2区 工程技术 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Computers & Chemical Engineering Pub Date : 2024-10-28 DOI:10.1016/j.compchemeng.2024.108895
Venkat Venkatasubramanian, Arijit Chakraborty
{"title":"Quo Vadis ChatGPT? From large language models to Large Knowledge Models","authors":"Venkat Venkatasubramanian,&nbsp;Arijit Chakraborty","doi":"10.1016/j.compchemeng.2024.108895","DOIUrl":null,"url":null,"abstract":"<div><div>The startling success of ChatGPT and other large language models (LLMs) using transformer-based generative neural network architecture in applications such as natural language processing and image synthesis has many researchers excited about potential opportunities in process systems engineering (PSE). The almost human-like performance of LLMs in these areas is indeed very impressive, surprising, and a major breakthrough. Their capabilities are very useful in certain tasks, such as writing first drafts of documents, code writing assistance, text summarization, etc. However, their success is limited in highly scientific domains as they cannot yet reason, plan, or explain due to their lack of in-depth mechanistic domain knowledge. This is a problem in domains such as chemical engineering as they are governed by fundamental laws of physics and chemistry (and biology), constitutive relations, and highly technical knowledge about materials, processes, and systems. Although purely data-driven machine learning has its immediate uses, the long-term success of AI in scientific and engineering domains would depend on developing hybrid AI systems that combine first principles and technical knowledge effectively. We call these hybrid AI systems <em>Large Knowledge Models</em> (LKMs), as they will not be limited to only NLP-based techniques or NLP-like applications. In this paper, we discuss the challenges and opportunities in developing such systems in chemical engineering.</div></div>","PeriodicalId":286,"journal":{"name":"Computers & Chemical Engineering","volume":"192 ","pages":"Article 108895"},"PeriodicalIF":3.9000,"publicationDate":"2024-10-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computers & Chemical Engineering","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0098135424003132","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
引用次数: 0

Abstract

The startling success of ChatGPT and other large language models (LLMs) using transformer-based generative neural network architecture in applications such as natural language processing and image synthesis has many researchers excited about potential opportunities in process systems engineering (PSE). The almost human-like performance of LLMs in these areas is indeed very impressive, surprising, and a major breakthrough. Their capabilities are very useful in certain tasks, such as writing first drafts of documents, code writing assistance, text summarization, etc. However, their success is limited in highly scientific domains as they cannot yet reason, plan, or explain due to their lack of in-depth mechanistic domain knowledge. This is a problem in domains such as chemical engineering as they are governed by fundamental laws of physics and chemistry (and biology), constitutive relations, and highly technical knowledge about materials, processes, and systems. Although purely data-driven machine learning has its immediate uses, the long-term success of AI in scientific and engineering domains would depend on developing hybrid AI systems that combine first principles and technical knowledge effectively. We call these hybrid AI systems Large Knowledge Models (LKMs), as they will not be limited to only NLP-based techniques or NLP-like applications. In this paper, we discuss the challenges and opportunities in developing such systems in chemical engineering.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
ChatGPT 何去何从?从大型语言模型到大型知识模型
ChatGPT 和其他使用基于变压器的生成神经网络架构的大型语言模型 (LLM) 在自然语言处理和图像合成等应用领域取得了惊人的成功,这让许多研究人员对流程系统工程 (PSE) 的潜在机遇感到兴奋。LLM 在这些领域中几乎与人类无异的表现确实令人印象深刻,令人惊讶,是一项重大突破。它们的能力在某些任务中非常有用,如撰写文档初稿、协助编写代码、文本摘要等。然而,由于缺乏深入的机械领域知识,它们还不能进行推理、规划或解释,因此在高度科学的领域中,它们的成功是有限的。这在化学工程等领域是个问题,因为这些领域受物理和化学(以及生物学)基本规律、构成关系以及有关材料、过程和系统的高技术知识的制约。尽管纯数据驱动的机器学习有其直接用途,但人工智能在科学和工程领域的长期成功将取决于能否开发出有效结合第一原理和技术知识的混合人工智能系统。我们称这些混合人工智能系统为大型知识模型(LKM),因为它们将不仅仅局限于基于 NLP 的技术或类似 NLP 的应用。在本文中,我们将讨论在化学工程领域开发此类系统所面临的挑战和机遇。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Computers & Chemical Engineering
Computers & Chemical Engineering 工程技术-工程:化工
CiteScore
8.70
自引率
14.00%
发文量
374
审稿时长
70 days
期刊介绍: Computers & Chemical Engineering is primarily a journal of record for new developments in the application of computing and systems technology to chemical engineering problems.
期刊最新文献
Editorial Board Highly accelerated kinetic Monte Carlo models for depolymerisation systems An optimization approach for sustainable and resilient closed-loop floating solar photovoltaic supply chain network design A semi-supervised learning algorithm for high and low-frequency variable imbalances in industrial data Prediction of viscoelastic and printability properties on binder-free TiO2-based ceramic pastes by DIW through a machine learning approach
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1