{"title":"A Deep Convolutional Neural Network Based Virtual Elderly Companion Agent","authors":"Ming-Che Lee, Sheng-Cheng Yeh, Sheng Yu Chiu, Jia-Wei Chang","doi":"10.1145/3083187.3083220","DOIUrl":null,"url":null,"abstract":"This study presents a Virtual Elderly Companion Agent that based on speech spectrograms and deep convolutional neural networks. The system can dynamically detect and analyze the user's emotion from the dialogue and give appropriate positive feedback. The proposed system architecture is divided into two parts. The client side supports Android operating system; the server side is implemented in python, and applied GoogleLeNet and AlexNet for emotion recognition. The system supports natural language speech input, and then analyzes the converted speech spectrogram to provide appropriate feedback.","PeriodicalId":123321,"journal":{"name":"Proceedings of the 8th ACM on Multimedia Systems Conference","volume":"31 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-06-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 8th ACM on Multimedia Systems Conference","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3083187.3083220","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4
Abstract
This study presents a Virtual Elderly Companion Agent that based on speech spectrograms and deep convolutional neural networks. The system can dynamically detect and analyze the user's emotion from the dialogue and give appropriate positive feedback. The proposed system architecture is divided into two parts. The client side supports Android operating system; the server side is implemented in python, and applied GoogleLeNet and AlexNet for emotion recognition. The system supports natural language speech input, and then analyzes the converted speech spectrogram to provide appropriate feedback.