基于预训练模型和口语对话代理的人脸识别系统开发

Sinan Chen, Masahide Nakamura
{"title":"基于预训练模型和口语对话代理的人脸识别系统开发","authors":"Sinan Chen, Masahide Nakamura","doi":"10.1109/BalkanCom55633.2022.9900847","DOIUrl":null,"url":null,"abstract":"Our research group is currently studying and developing listening services using spoken dialogue agents and IoT technologies to assist the “mind” of the elderly at home. However, the user identification function, an essential part of the service, has not yet been realized. It is difficult to determine the identity of the person who interacts with the spoken dialogue agent. Although with the rapid development of the artificial intelligence field, various smart devices and services using deep learning have appeared in the face recognition technology, problems exist, including costs and computational resources to build and apply a recognition model. The purpose of this paper is to develop a facial identification system using the pre-trained model and spoken dialogue agent. Our key ideas include automatic training data generation by spoken dialogue between the user and the agent and the acquisition and comparison of facial features using a pre-trained model. In this way, our face identification system can be easier introduced and expected with only a general-purpose computer and a Web camera, without needing a conventional Internet connection and manual labeling of training data.","PeriodicalId":114443,"journal":{"name":"2022 International Balkan Conference on Communications and Networking (BalkanCom)","volume":"557 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-08-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Developing a Facial Identification System Using Pre-Trained Model and Spoken Dialogue Agent\",\"authors\":\"Sinan Chen, Masahide Nakamura\",\"doi\":\"10.1109/BalkanCom55633.2022.9900847\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Our research group is currently studying and developing listening services using spoken dialogue agents and IoT technologies to assist the “mind” of the elderly at home. However, the user identification function, an essential part of the service, has not yet been realized. It is difficult to determine the identity of the person who interacts with the spoken dialogue agent. Although with the rapid development of the artificial intelligence field, various smart devices and services using deep learning have appeared in the face recognition technology, problems exist, including costs and computational resources to build and apply a recognition model. The purpose of this paper is to develop a facial identification system using the pre-trained model and spoken dialogue agent. Our key ideas include automatic training data generation by spoken dialogue between the user and the agent and the acquisition and comparison of facial features using a pre-trained model. In this way, our face identification system can be easier introduced and expected with only a general-purpose computer and a Web camera, without needing a conventional Internet connection and manual labeling of training data.\",\"PeriodicalId\":114443,\"journal\":{\"name\":\"2022 International Balkan Conference on Communications and Networking (BalkanCom)\",\"volume\":\"557 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-08-22\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 International Balkan Conference on Communications and Networking (BalkanCom)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/BalkanCom55633.2022.9900847\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 International Balkan Conference on Communications and Networking (BalkanCom)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/BalkanCom55633.2022.9900847","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

摘要

我们的研究小组目前正在研究和开发使用口语对话代理和物联网技术的听力服务,以帮助家中老年人的“心灵”。然而,作为该服务必不可少的一部分,用户识别功能尚未实现。很难确定与口语对话代理交互的人的身份。尽管随着人工智能领域的快速发展,人脸识别技术中已经出现了各种使用深度学习的智能设备和服务,但存在一些问题,包括建立和应用识别模型的成本和计算资源。本文的目的是利用预先训练好的模型和语音对话代理开发一个人脸识别系统。我们的主要思想包括通过用户和智能体之间的口头对话自动生成训练数据,以及使用预训练模型获取和比较面部特征。通过这种方式,我们的人脸识别系统可以更容易地引入和预期,只需要一台通用计算机和一个网络摄像头,而不需要传统的互联网连接和人工标记训练数据。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Developing a Facial Identification System Using Pre-Trained Model and Spoken Dialogue Agent
Our research group is currently studying and developing listening services using spoken dialogue agents and IoT technologies to assist the “mind” of the elderly at home. However, the user identification function, an essential part of the service, has not yet been realized. It is difficult to determine the identity of the person who interacts with the spoken dialogue agent. Although with the rapid development of the artificial intelligence field, various smart devices and services using deep learning have appeared in the face recognition technology, problems exist, including costs and computational resources to build and apply a recognition model. The purpose of this paper is to develop a facial identification system using the pre-trained model and spoken dialogue agent. Our key ideas include automatic training data generation by spoken dialogue between the user and the agent and the acquisition and comparison of facial features using a pre-trained model. In this way, our face identification system can be easier introduced and expected with only a general-purpose computer and a Web camera, without needing a conventional Internet connection and manual labeling of training data.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
An Architecture of MySQL Storage Engines to Increase the Resource Utilization Molecular Communication Transmitter Architectures for the Internet of Bio-Nano Things Markov Model Based Traffic Classification with Multiple Features Statistical Modeling of Sand and Dust Storm Attenuation Calibrated Probabilistic QoT Regression for Unestablished Lightpaths in Optical Networks
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1