Multimodal Generative Artificial Intelligence Tackles Visual Problems in Chemistry

IF 2.5 3区 教育学 Q2 CHEMISTRY, MULTIDISCIPLINARY Journal of Chemical Education Pub Date : 2024-06-26 DOI:10.1021/acs.jchemed.4c00138
Eman A. Alasadi,  and , Carlos R. Baiz*, 
{"title":"Multimodal Generative Artificial Intelligence Tackles Visual Problems in Chemistry","authors":"Eman A. Alasadi,&nbsp; and ,&nbsp;Carlos R. Baiz*,&nbsp;","doi":"10.1021/acs.jchemed.4c00138","DOIUrl":null,"url":null,"abstract":"<p >The introduction of multimodal capabilities in large language models (LLMs) marks a significant advancement in the field of artificial intelligence (AI). In particular, the ability to process and interpret visual data, including complex graphs and plots frequently encountered in chemistry, expands the potential of these models. This integration of text and image processing allows multimodal AI to tackle a broader range of problems, especially in areas where visual information is central to understanding and solving problems. This study provides an examination of GPT-4’s image input capabilities, specifically targeting its efficacy in interpreting and solving chemistry problems that require graphical information. This study evaluates GPT-4’s image input feature, focusing on its accuracy in interpreting chemical diagrams, structures, and tabular data, and its utility as an interactive, conversational tutor in chemistry education. The research assesses the consistency of the AI’s responses to visual data of varying quality and its ability to parse handwritten problems and answers. Further, the study examines GPT-4’s capacity for molecular structure analysis and spectral data interpretation, vital for advanced problem-solving in chemistry. Through analysis, we demonstrate how the image processing capabilities of GPT-4 could be leveraged for pedagogical purposes, particularly in undergraduate chemistry courses. In addition, we provide advice for prompt development to improve response quality.</p>","PeriodicalId":43,"journal":{"name":"Journal of Chemical Education","volume":null,"pages":null},"PeriodicalIF":2.5000,"publicationDate":"2024-06-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Chemical Education","FirstCategoryId":"92","ListUrlMain":"https://pubs.acs.org/doi/10.1021/acs.jchemed.4c00138","RegionNum":3,"RegionCategory":"教育学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"CHEMISTRY, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0

Abstract

The introduction of multimodal capabilities in large language models (LLMs) marks a significant advancement in the field of artificial intelligence (AI). In particular, the ability to process and interpret visual data, including complex graphs and plots frequently encountered in chemistry, expands the potential of these models. This integration of text and image processing allows multimodal AI to tackle a broader range of problems, especially in areas where visual information is central to understanding and solving problems. This study provides an examination of GPT-4’s image input capabilities, specifically targeting its efficacy in interpreting and solving chemistry problems that require graphical information. This study evaluates GPT-4’s image input feature, focusing on its accuracy in interpreting chemical diagrams, structures, and tabular data, and its utility as an interactive, conversational tutor in chemistry education. The research assesses the consistency of the AI’s responses to visual data of varying quality and its ability to parse handwritten problems and answers. Further, the study examines GPT-4’s capacity for molecular structure analysis and spectral data interpretation, vital for advanced problem-solving in chemistry. Through analysis, we demonstrate how the image processing capabilities of GPT-4 could be leveraged for pedagogical purposes, particularly in undergraduate chemistry courses. In addition, we provide advice for prompt development to improve response quality.

Abstract Image

Abstract Image

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
多模态生成人工智能解决化学中的视觉问题
在大型语言模型(LLM)中引入多模态功能标志着人工智能(AI)领域的重大进步。特别是处理和解释可视化数据(包括化学中经常遇到的复杂图形和绘图)的能力,拓展了这些模型的潜力。这种文本和图像处理的整合使得多模态人工智能能够解决更广泛的问题,尤其是在视觉信息对于理解和解决问题至关重要的领域。本研究考察了 GPT-4 的图像输入能力,特别是其在解释和解决需要图形信息的化学问题方面的功效。本研究对 GPT-4 的图像输入功能进行了评估,重点关注其在解释化学图表、结构和表格数据方面的准确性,以及其作为化学教育中交互式对话辅导的实用性。这项研究评估了人工智能对不同质量的视觉数据做出反应的一致性,以及解析手写问题和答案的能力。此外,本研究还考察了 GPT-4 的分子结构分析和光谱数据解读能力,这对解决化学高级问题至关重要。通过分析,我们展示了如何将 GPT-4 的图像处理能力用于教学目的,特别是本科生化学课程。此外,我们还提供了及时开发以提高响应质量的建议。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Journal of Chemical Education
Journal of Chemical Education 化学-化学综合
CiteScore
5.60
自引率
50.00%
发文量
465
审稿时长
6.5 months
期刊介绍: The Journal of Chemical Education is the official journal of the Division of Chemical Education of the American Chemical Society, co-published with the American Chemical Society Publications Division. Launched in 1924, the Journal of Chemical Education is the world’s premier chemical education journal. The Journal publishes peer-reviewed articles and related information as a resource to those in the field of chemical education and to those institutions that serve them. JCE typically addresses chemical content, activities, laboratory experiments, instructional methods, and pedagogies. The Journal serves as a means of communication among people across the world who are interested in the teaching and learning of chemistry. This includes instructors of chemistry from middle school through graduate school, professional staff who support these teaching activities, as well as some scientists in commerce, industry, and government.
期刊最新文献
Complementary Instrumental Techniques Applied to Pain Relieving Tablets in an Undergraduate Laboratory Experiment Breaking the Access to Education Barrier: Enhancing HPLC Learning with Virtual Reality Evaluation of the Use of a 360° Immersive Visit of the Organic Chemistry Practical Laboratory for Pharmacy Students Integration of Teaching Laboratory Activities Based on the Valorization of Industrial Waste into Chemical Education to Address the Emerging Sustainable Development Goals Critical Chemical Literacy as a Main Goal of Chemistry Education Aiming for Climate Empowerment and Agency
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1