A human-robot interaction system for automated chemical experiments based on vision and natural language processing semantics

IF 8 2区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Engineering Applications of Artificial Intelligence Pub Date : 2025-04-15 Epub Date: 2025-02-17 DOI:10.1016/j.engappai.2025.110226
Zhuang Yang , Yu Du , Dong Liu , Kesong Zhao , Ming Cong
{"title":"A human-robot interaction system for automated chemical experiments based on vision and natural language processing semantics","authors":"Zhuang Yang ,&nbsp;Yu Du ,&nbsp;Dong Liu ,&nbsp;Kesong Zhao ,&nbsp;Ming Cong","doi":"10.1016/j.engappai.2025.110226","DOIUrl":null,"url":null,"abstract":"<div><div>Using collaborative robots to replace researchers in performing repetitive and hazardous chemical experiments can effectively enhance experimental efficiency. However, this technology still faces several challenges, including understanding researchers' natural language instructions, autonomously generating action sequences, and more. Therefore, we developed a general control framework for robots in automated chemical experiments based on visual and natural language semantic information. Firstly, starting with the recognition of keywords within Chinese language instructions, we established a domain dictionary for chemical experiment operations and proposed an instruction understanding model based on the bidirectional long-short-term memory and conditional random field(BiLSTM-CRF), enhancing the robot's cognitive ability towards user instructions. Then, a rule matching method for chemical experimental information and a multimodal information feature matching mechanism were established for command content verification and the automatic generation of multiple types of structured language. At the same time, a robot feedback mechanism was added, enabling human-computer interaction and establishing closed-loop control of the system. Finally, propose a robot action sequence generation mechanism based on hierarchical finite state machines(HFSM), transforming structured language into operational strategies for chemical experiments required by the robot. Experimental results show that on the instruction task comprehension dataset created in this paper, the proposed method improves the F1 score by up to 4.44% in the instruction keyword extraction task compared to other models. In addition, compared to traditional manual teaching control, this method significantly reduces time costs. This verifies that the method effectively enhances the robot's ability to comprehend Chinese instructions and generates reliable executable action sequences.</div></div>","PeriodicalId":50523,"journal":{"name":"Engineering Applications of Artificial Intelligence","volume":"146 ","pages":"Article 110226"},"PeriodicalIF":8.0000,"publicationDate":"2025-04-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Engineering Applications of Artificial Intelligence","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S095219762500226X","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/2/17 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
引用次数: 0

Abstract

Using collaborative robots to replace researchers in performing repetitive and hazardous chemical experiments can effectively enhance experimental efficiency. However, this technology still faces several challenges, including understanding researchers' natural language instructions, autonomously generating action sequences, and more. Therefore, we developed a general control framework for robots in automated chemical experiments based on visual and natural language semantic information. Firstly, starting with the recognition of keywords within Chinese language instructions, we established a domain dictionary for chemical experiment operations and proposed an instruction understanding model based on the bidirectional long-short-term memory and conditional random field(BiLSTM-CRF), enhancing the robot's cognitive ability towards user instructions. Then, a rule matching method for chemical experimental information and a multimodal information feature matching mechanism were established for command content verification and the automatic generation of multiple types of structured language. At the same time, a robot feedback mechanism was added, enabling human-computer interaction and establishing closed-loop control of the system. Finally, propose a robot action sequence generation mechanism based on hierarchical finite state machines(HFSM), transforming structured language into operational strategies for chemical experiments required by the robot. Experimental results show that on the instruction task comprehension dataset created in this paper, the proposed method improves the F1 score by up to 4.44% in the instruction keyword extraction task compared to other models. In addition, compared to traditional manual teaching control, this method significantly reduces time costs. This verifies that the method effectively enhances the robot's ability to comprehend Chinese instructions and generates reliable executable action sequences.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于视觉和自然语言处理语义的自动化化学实验人机交互系统
利用协作机器人代替人工进行重复性、危险性的化学实验,可以有效提高实验效率。然而,这项技术仍然面临着一些挑战,包括理解研究人员的自然语言指令,自主生成动作序列等等。因此,我们开发了一个基于视觉和自然语言语义信息的自动化化学实验机器人通用控制框架。首先,从中文指令中的关键词识别入手,建立化学实验操作领域词典,提出基于双向长短期记忆和条件随机场(BiLSTM-CRF)的指令理解模型,增强机器人对用户指令的认知能力。然后,建立了化工实验信息的规则匹配方法和多模态信息特征匹配机制,用于命令内容验证和多类型结构化语言的自动生成。同时增加了机器人反馈机构,实现了人机交互,建立了系统的闭环控制。最后,提出了一种基于层次有限状态机(HFSM)的机器人动作序列生成机制,将结构化语言转化为机器人所需的化学实验操作策略。实验结果表明,在本文创建的指令任务理解数据集上,与其他模型相比,该方法在指令关键字提取任务中的F1分数提高了4.44%。此外,与传统的人工教学控制相比,该方法显著降低了时间成本。验证了该方法有效地提高了机器人对中文指令的理解能力,并生成了可靠的可执行动作序列。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Engineering Applications of Artificial Intelligence
Engineering Applications of Artificial Intelligence 工程技术-工程:电子与电气
CiteScore
9.60
自引率
10.00%
发文量
505
审稿时长
68 days
期刊介绍: Artificial Intelligence (AI) is pivotal in driving the fourth industrial revolution, witnessing remarkable advancements across various machine learning methodologies. AI techniques have become indispensable tools for practicing engineers, enabling them to tackle previously insurmountable challenges. Engineering Applications of Artificial Intelligence serves as a global platform for the swift dissemination of research elucidating the practical application of AI methods across all engineering disciplines. Submitted papers are expected to present novel aspects of AI utilized in real-world engineering applications, validated using publicly available datasets to ensure the replicability of research outcomes. Join us in exploring the transformative potential of AI in engineering.
期刊最新文献
Novel artificial intelligence driven model-based control framework for solar–electric vehicle home energy optimization Exploring semantic dependency for reasoning over temporal knowledge graph Enhancing the safety assessment of open-pit mine slopes with interpretable, data-driven stacking learning and three-dimensional stability analysis Dual-domain data enhancement and lightweight deep architecture for robust powder bed defect detection Modeling function-level relationships for vulnerability detection in graph neural networks
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1