Quo Vadis ChatGPT? From large language models to Large Knowledge Models

IF 3.9 2区工程技术 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Computers & Chemical Engineering Pub Date : 2024-10-28 DOI:10.1016/j.compchemeng.2024.108895

Venkat Venkatasubramanian, Arijit Chakraborty

{"title":"Quo Vadis ChatGPT? From large language models to Large Knowledge Models","authors":"Venkat Venkatasubramanian, Arijit Chakraborty","doi":"10.1016/j.compchemeng.2024.108895","DOIUrl":null,"url":null,"abstract":"<div><div>The startling success of ChatGPT and other large language models (LLMs) using transformer-based generative neural network architecture in applications such as natural language processing and image synthesis has many researchers excited about potential opportunities in process systems engineering (PSE). The almost human-like performance of LLMs in these areas is indeed very impressive, surprising, and a major breakthrough. Their capabilities are very useful in certain tasks, such as writing first drafts of documents, code writing assistance, text summarization, etc. However, their success is limited in highly scientific domains as they cannot yet reason, plan, or explain due to their lack of in-depth mechanistic domain knowledge. This is a problem in domains such as chemical engineering as they are governed by fundamental laws of physics and chemistry (and biology), constitutive relations, and highly technical knowledge about materials, processes, and systems. Although purely data-driven machine learning has its immediate uses, the long-term success of AI in scientific and engineering domains would depend on developing hybrid AI systems that combine first principles and technical knowledge effectively. We call these hybrid AI systems <em>Large Knowledge Models</em> (LKMs), as they will not be limited to only NLP-based techniques or NLP-like applications. In this paper, we discuss the challenges and opportunities in developing such systems in chemical engineering.</div></div>","PeriodicalId":286,"journal":{"name":"Computers & Chemical Engineering","volume":"192 ","pages":"Article 108895"},"PeriodicalIF":3.9000,"publicationDate":"2024-10-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computers & Chemical Engineering","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0098135424003132","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}

引用次数: 0

Abstract

The startling success of ChatGPT and other large language models (LLMs) using transformer-based generative neural network architecture in applications such as natural language processing and image synthesis has many researchers excited about potential opportunities in process systems engineering (PSE). The almost human-like performance of LLMs in these areas is indeed very impressive, surprising, and a major breakthrough. Their capabilities are very useful in certain tasks, such as writing first drafts of documents, code writing assistance, text summarization, etc. However, their success is limited in highly scientific domains as they cannot yet reason, plan, or explain due to their lack of in-depth mechanistic domain knowledge. This is a problem in domains such as chemical engineering as they are governed by fundamental laws of physics and chemistry (and biology), constitutive relations, and highly technical knowledge about materials, processes, and systems. Although purely data-driven machine learning has its immediate uses, the long-term success of AI in scientific and engineering domains would depend on developing hybrid AI systems that combine first principles and technical knowledge effectively. We call these hybrid AI systems Large Knowledge Models (LKMs), as they will not be limited to only NLP-based techniques or NLP-like applications. In this paper, we discuss the challenges and opportunities in developing such systems in chemical engineering.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

ChatGPT 何去何从？从大型语言模型到大型知识模型

ChatGPT 和其他使用基于变压器的生成神经网络架构的大型语言模型 (LLM) 在自然语言处理和图像合成等应用领域取得了惊人的成功，这让许多研究人员对流程系统工程 (PSE) 的潜在机遇感到兴奋。LLM 在这些领域中几乎与人类无异的表现确实令人印象深刻，令人惊讶，是一项重大突破。它们的能力在某些任务中非常有用，如撰写文档初稿、协助编写代码、文本摘要等。然而，由于缺乏深入的机械领域知识，它们还不能进行推理、规划或解释，因此在高度科学的领域中，它们的成功是有限的。这在化学工程等领域是个问题，因为这些领域受物理和化学（以及生物学）基本规律、构成关系以及有关材料、过程和系统的高技术知识的制约。尽管纯数据驱动的机器学习有其直接用途，但人工智能在科学和工程领域的长期成功将取决于能否开发出有效结合第一原理和技术知识的混合人工智能系统。我们称这些混合人工智能系统为大型知识模型（LKM），因为它们将不仅仅局限于基于 NLP 的技术或类似 NLP 的应用。在本文中，我们将讨论在化学工程领域开发此类系统所面临的挑战和机遇。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Computers & Chemical Engineering 工程技术-工程：化工

CiteScore

8.70

自引率

14.00%

发文量

374

审稿时长

70 days

期刊介绍： Computers & Chemical Engineering is primarily a journal of record for new developments in the application of computing and systems technology to chemical engineering problems.