Beyond Words: An Intelligent Human-Machine Dialogue System with Multimodal Generation and Emotional Comprehension

Yaru Zhao, Bo Cheng, Yakun Huang, Zhiguo Wan
{"title":"Beyond Words: An Intelligent Human-Machine Dialogue System with Multimodal Generation and Emotional Comprehension","authors":"Yaru Zhao, Bo Cheng, Yakun Huang, Zhiguo Wan","doi":"10.1155/2023/9267487","DOIUrl":null,"url":null,"abstract":"Intelligent service robots have become an indispensable aspect of modern-day society, playing a crucial role in various domains ranging from healthcare to hospitality. Among these robotic systems, human-machine dialogue systems are particularly noteworthy as they deliver both auditory and visual services to users, effectively bridging the communication gap between humans and machines. Despite their utility, the majority of existing approaches to these systems primarily concentrate on augmenting the logical coherence of the system’s responses, inadvertently neglecting the significance of user emotions in shaping a comprehensive communication experience. To tackle this shortcoming, we propose the development of an innovative human-machine dialogue system that is both intelligent and emotionally sensitive, employing multimodal generation techniques. This system is architecturally comprised of three components: (1) data collection and processing, responsible for gathering and preparing relevant information, (2) a dialogue engine, which generates contextually appropriate responses, and (3) an interaction module, responsible for facilitating the communication interface between users and the system. To validate our proposed approach, we have constructed a prototype system and conducted an evaluation of the performance of the core dialogue engine by utilizing an open dataset. The results of our study indicate that our system demonstrates a remarkable level of multimodal generation response, ultimately offering a more human-like dialogue experience.","PeriodicalId":507857,"journal":{"name":"International Journal of Intelligent Systems","volume":"15 5","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2023-12-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Intelligent Systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1155/2023/9267487","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Intelligent service robots have become an indispensable aspect of modern-day society, playing a crucial role in various domains ranging from healthcare to hospitality. Among these robotic systems, human-machine dialogue systems are particularly noteworthy as they deliver both auditory and visual services to users, effectively bridging the communication gap between humans and machines. Despite their utility, the majority of existing approaches to these systems primarily concentrate on augmenting the logical coherence of the system’s responses, inadvertently neglecting the significance of user emotions in shaping a comprehensive communication experience. To tackle this shortcoming, we propose the development of an innovative human-machine dialogue system that is both intelligent and emotionally sensitive, employing multimodal generation techniques. This system is architecturally comprised of three components: (1) data collection and processing, responsible for gathering and preparing relevant information, (2) a dialogue engine, which generates contextually appropriate responses, and (3) an interaction module, responsible for facilitating the communication interface between users and the system. To validate our proposed approach, we have constructed a prototype system and conducted an evaluation of the performance of the core dialogue engine by utilizing an open dataset. The results of our study indicate that our system demonstrates a remarkable level of multimodal generation response, ultimately offering a more human-like dialogue experience.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
超越言语:具有多模态生成和情感理解功能的智能人机对话系统
智能服务机器人已成为现代社会不可或缺的一部分,在从医疗到酒店等各个领域发挥着至关重要的作用。在这些机器人系统中,人机对话系统尤其值得一提,因为它们能为用户提供听觉和视觉服务,有效地弥合了人与机器之间的沟通鸿沟。尽管人机对话系统非常实用,但现有的大多数人机对话系统主要集中在增强系统反应的逻辑连贯性上,无意中忽视了用户情感在塑造全面交流体验中的重要性。为了解决这一缺陷,我们建议开发一种创新的人机对话系统,该系统采用多模态生成技术,既具有智能性,又具有情感敏感性。该系统在结构上由三个部分组成:(1) 数据收集和处理,负责收集和准备相关信息;(2) 对话引擎,根据语境生成适当的回应;(3) 交互模块,负责促进用户与系统之间的交流界面。为了验证我们提出的方法,我们构建了一个原型系统,并利用一个开放数据集对核心对话引擎的性能进行了评估。研究结果表明,我们的系统在多模态生成响应方面表现出了卓越的水平,最终提供了更类似于人类的对话体验。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
A New Artificial Intelligence-Based Model for Amyotrophic Lateral Sclerosis Prediction Pulmonary Nodule Detection from 3D CT Image with a Two-Stage Network Real-Time Frequency Adaptive Tracking Control of the WPT System Based on Apparent Power Detection Beyond Words: An Intelligent Human-Machine Dialogue System with Multimodal Generation and Emotional Comprehension A New Pareto Discrete NSGAII Algorithm for Disassembly Line Balance Problem
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1