{"title":"Sound event detection using non-negative dictionaries learned from annotated overlapping events","authors":"O. Dikmen, A. Mesaros","doi":"10.1109/WASPAA.2013.6701861","DOIUrl":null,"url":null,"abstract":"Detection of overlapping sound events generally requires training class models either from separate data for each class or by making assumptions about the dominating events in the mixed signals. Methods based on sound source separation are currently used in this task, but involve the problem of assigning separated components to sources. In this paper, we propose a method which bypasses the need to build separate sound models. Instead, non-negative dictionaries for the sound content and their annotations are learned in a coupled sense. In the testing stage, time activations of the sound dictionary columns are estimated and used to reconstruct annotations using the annotation dictionary. The method requires no separate training data for classes and in general very promising results are obtained using only a small amount of data.","PeriodicalId":341888,"journal":{"name":"2013 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics","volume":"350 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2013-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"53","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2013 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/WASPAA.2013.6701861","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Citations: 53
Abstract
Detection of overlapping sound events generally requires training class models either from separate data for each class or by making assumptions about the dominating events in the mixed signals. Methods based on sound source separation are currently used for this task, but they involve the problem of assigning separated components to sources. In this paper, we propose a method that bypasses the need to build separate sound models. Instead, non-negative dictionaries for the sound content and the corresponding annotations are learned in a coupled manner. In the testing stage, time activations of the sound dictionary columns are estimated and used to reconstruct the annotations via the annotation dictionary. The method requires no separate training data per class, and very promising results are generally obtained using only a small amount of data.
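The coupled-dictionary idea described above can be sketched with a toy NMF implementation. Everything here is an assumption for illustration: synthetic data stands in for real spectrogram features and annotations, and plain Euclidean-cost multiplicative updates are used, whereas the paper's actual model and cost function may differ. The key structure matches the abstract: one dictionary is learned over stacked features and annotations so both parts share activations, and at test time only the sound part is used to estimate activations, from which annotations are reconstructed.

```python
import numpy as np

rng = np.random.default_rng(0)

def nmf(X, rank, n_iter=500, W=None):
    """Euclidean-cost NMF via Lee-Seung multiplicative updates.
    If W is given it is held fixed and only the activations H are updated
    (this is the test-stage activation estimation)."""
    eps = 1e-9
    fixed_W = W is not None
    if W is None:
        W = rng.random((X.shape[0], rank)) + eps
    H = rng.random((rank, X.shape[1])) + eps
    for _ in range(n_iter):
        H *= (W.T @ X) / (W.T @ W @ H + eps)
        if not fixed_W:
            W *= (X @ H.T) / (W @ H @ H.T + eps)
    return W, H

# --- synthetic stand-ins for real features/annotations (assumption) ---
rank, n_feat, n_cls = 3, 20, 3
H_train = rng.random((rank, 60))
W_sound_true = rng.random((n_feat, rank))   # hypothetical spectral atoms
W_annot_true = np.eye(n_cls)                # one annotation row per event class
V_train = W_sound_true @ H_train            # "spectrogram" features
A_train = W_annot_true @ H_train            # frame-level event annotations

# Training: learn one dictionary over the stacked features+annotations;
# the shared activations H couple the sound and annotation parts.
X = np.vstack([V_train, A_train])
W, H_fit = nmf(X, rank)
W_sound, W_annot = W[:n_feat], W[n_feat:]

# Testing: estimate activations of the sound dictionary on new audio only,
# then reconstruct the annotations with the annotation dictionary.
H_test = rng.random((rank, 15))
V_test = W_sound_true @ H_test
_, H_est = nmf(V_test, rank, W=W_sound)
A_hat = W_annot @ H_est                     # predicted event activity
```

Because the two dictionary parts are learned jointly, no separate per-class model is trained, and the separated-component-to-source assignment problem does not arise: the annotation dictionary directly maps activations to event labels.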