ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP): Latest Articles


CLIP-Font: Sementic Self-Supervised Few-Shot Font Generation with Clip

ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Pub Date : 2024-04-14 DOI: 10.1109/icassp48485.2024.10447490

Jialu Xiong, Yefei Wang, Jinshan Zeng

Citations: 0

Elevating Visual Prompting in Transfer Learning Via Pruned Model Ensembles: No Retrain, No Pain

ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Pub Date : 2024-04-14 DOI: 10.1109/icassp48485.2024.10447808

Brian Zhang, Yuguang Yao, Sijia Liu

Citations: 0

Binauralmusic: A Diverse Dataset for Improving Cross-Modal Binaural Audio Generation

ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Pub Date : 2024-04-14 DOI: 10.1109/icassp48485.2024.10448509

Yunqi Li, Shulin Liu, Haonan Cheng, Long Ye

Citations: 0

Enriching Music Descriptions with A Finetuned-LLM and Metadata for Text-to-Music Retrieval

ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Pub Date : 2024-04-14 DOI: 10.1109/icassp48485.2024.10446380

Seungheon Doh, Minhee Lee, Dasaem Jeong, Juhan Nam

Citations: 0

Dynamic Speech Emotion Recognition Using A Conditional Neural Process

ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Pub Date : 2024-04-14 DOI: 10.1109/icassp48485.2024.10447805

Luz Martinez-Lucas, Carlos Busso

The problem of predicting emotional attributes from speech has often focused on predicting a single value from a sentence or short speaking turn. These methods often ignore that natural emotions are both dynamic and dependent on context. To model the dynamic nature of emotions, we can treat the prediction of emotion from speech as a time-series problem. We refer to the problem of predicting these emotional traces as dynamic speech emotion recognition. Previous studies in this area have used models that treat all emotional traces as coming from the same underlying distribution. Since emotions are dependent on contextual information, these methods might obscure the context of an emotional interaction. This paper uses a neural process model with a segment-level speech emotion recognition (SER) model for this problem. This type of model leverages information from the time-series and predictions from the SER model to learn a prior that defines a distribution over emotional traces. Our proposed model performs 21% better than a bidirectional long short-term memory (BiLSTM) baseline when predicting emotional traces for valence.


Citations: 0

An MVDR-Embedded U-Net Beamformer for Effective and Robust Multichannel Speech Enhancement

ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Pub Date : 2024-04-14 DOI: 10.1109/icassp48485.2024.10448366

Ching-Hua Lee, Kashyap Patel, Chouchang Yang, Yilin Shen, Hongxia Jin

Citations: 0

AUTOSGM: A Unified Lowpass Regularization Framework for Accelerated Learning

ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Pub Date : 2024-04-14 DOI: 10.1109/icassp48485.2024.10448203

Oluwasegun Ayokunle Somefun, Stefan Lee, V. J. Mathews

Citations: 0

A Modified Cramér-Rao Bound for Discrete-Time Markovian Dynamic Systems

ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Pub Date : 2024-04-14 DOI: 10.1109/icassp48485.2024.10446252

Sara El Bouch, J. Galy, É. Chaumette, J. Vilà‐Valls

Citations: 0

Trades++: Enhancing Multi-Object Tracking of Real Low Confidence Targets Using a Pyramid-Like Self-Attention Model

ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Pub Date : 2024-04-14 DOI: 10.1109/icassp48485.2024.10446257

Chenxin Wen, Yanlei Gao, Jie Li

Citations: 0

Maskstr: Guide Scene Text Recognition Models with Masking

ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Pub Date : 2024-04-14 DOI: 10.1109/icassp48485.2024.10446874

Baole Wei, Minghang He, Liangcai Gao, Duoyou Zhou, Xiang Bai, Zhi Tang

Citations: 0
