基于跨语料库语音数据集的意大利老年人情感识别

AIxAS@AI*IA Pub Date : 2022-11-14 DOI:10.48550/arXiv.2211.07307

F. Gasparini, A. Grossi

{"title":"基于跨语料库语音数据集的意大利老年人情感识别","authors":"F. Gasparini, A. Grossi","doi":"10.48550/arXiv.2211.07307","DOIUrl":null,"url":null,"abstract":"The aim of this work is to define a speech emotion recognition (SER) model able to recognize positive, neutral and negative emotions in natural conversations of Italian elderly people. Several datasets for SER are available in the literature. However most of them are in English or Chinese, have been recorded while actors and actresses pronounce short phrases and thus are not related to natural conversation. Moreover only few speeches among all the databases are related to elderly people. Therefore, in this work, a multi-language and multi-age corpus is considered merging a dataset in English, that includes also elderly people, with a dataset in Italian. A general model, trained on young and adult English actors and actresses is proposed, based on XGBoost. Then two strategies of domain adaptation are proposed to adapt the model either to elderly people and to Italian speakers. The results suggest that this approach increases the classification performance, underlining also that new datasets should be collected.","PeriodicalId":308455,"journal":{"name":"AIxAS@AI*IA","volume":"195 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-11-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Sentiment Recognition of Italian Elderly through Domain Adaptation on Cross-corpus Speech Dataset\",\"authors\":\"F. Gasparini, A. Grossi\",\"doi\":\"10.48550/arXiv.2211.07307\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The aim of this work is to define a speech emotion recognition (SER) model able to recognize positive, neutral and negative emotions in natural conversations of Italian elderly people. Several datasets for SER are available in the literature. However most of them are in English or Chinese, have been recorded while actors and actresses pronounce short phrases and thus are not related to natural conversation. Moreover only few speeches among all the databases are related to elderly people. Therefore, in this work, a multi-language and multi-age corpus is considered merging a dataset in English, that includes also elderly people, with a dataset in Italian. A general model, trained on young and adult English actors and actresses is proposed, based on XGBoost. Then two strategies of domain adaptation are proposed to adapt the model either to elderly people and to Italian speakers. The results suggest that this approach increases the classification performance, underlining also that new datasets should be collected.\",\"PeriodicalId\":308455,\"journal\":{\"name\":\"AIxAS@AI*IA\",\"volume\":\"195 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-11-14\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"AIxAS@AI*IA\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.48550/arXiv.2211.07307\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"AIxAS@AI*IA","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.48550/arXiv.2211.07307","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

这项工作的目的是定义一个语音情感识别(SER)模型，能够识别意大利老年人自然对话中的积极、中性和消极情绪。文献中有几个SER的数据集。然而，大多数都是用英语或汉语录制的，演员们都是用简短的短语发音，因此与自然对话无关。此外，所有数据库中与老年人有关的演讲很少。因此，在这项工作中，考虑将英语数据集(也包括老年人)与意大利语数据集合并为一个多语言和多年龄的语料库。提出了一种基于XGBoost的通用模型，对青年和成年英语男女演员进行训练。然后提出了两种领域适应策略来适应老年人和意大利语使用者。结果表明，这种方法提高了分类性能，也强调了应该收集新的数据集。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Sentiment Recognition of Italian Elderly through Domain Adaptation on Cross-corpus Speech Dataset

The aim of this work is to define a speech emotion recognition (SER) model able to recognize positive, neutral and negative emotions in natural conversations of Italian elderly people. Several datasets for SER are available in the literature. However most of them are in English or Chinese, have been recorded while actors and actresses pronounce short phrases and thus are not related to natural conversation. Moreover only few speeches among all the databases are related to elderly people. Therefore, in this work, a multi-language and multi-age corpus is considered merging a dataset in English, that includes also elderly people, with a dataset in Italian. A general model, trained on young and adult English actors and actresses is proposed, based on XGBoost. Then two strategies of domain adaptation are proposed to adapt the model either to elderly people and to Italian speakers. The results suggest that this approach increases the classification performance, underlining also that new datasets should be collected.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

AIxAS@AI*IA

自引率

0.00%

发文量

期刊最新文献

Sentiment Recognition of Italian Elderly through Domain Adaptation on Cross-corpus Speech Dataset