Shift-variant non-negative matrix deconvolution for music transcription

2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) Pub Date : 2012-03-25 DOI:10.1109/ICASSP.2012.6287833

Holger Kirchhoff, S. Dixon, Anssi Klapuri

引用次数: 23

Abstract

In this paper, we address the task of semi-automatic music transcription in which the user provides prior information about the polyphonic mixture under analysis. We propose a non-negative matrix deconvolution framework for this task that allows instruments to be represented by a different basis function for each fundamental frequency (“shift variance”). Two different types of user input are studied: information about the types of instruments, which enables the use of basis functions from an instrument database, and a manual transcription of a number of notes which enables the template estimation from the data under analysis itself. Experiments are performed on a data set of mixtures of acoustical instruments up to a polyphony of five. The results confirm a significant loss in accuracy when database templates are used and show the superiority of the Kullback-Leibler divergence over the least squares error cost function.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

音乐转录的移位非负矩阵反褶积

在本文中，我们解决了半自动音乐转录的任务，其中用户提供了有关分析中的复调混合的先验信息。我们为这项任务提出了一个非负矩阵反卷积框架，该框架允许仪器由每个基频(“移位方差”)的不同基函数表示。研究了两种不同类型的用户输入:一种是关于仪器类型的信息，这种信息可以使用仪器数据库中的基函数，另一种是手工抄写一些音符，这种笔记可以根据分析中的数据本身进行模板估计。实验是在一组声学仪器的混合数据集上进行的，最多可达五复调。结果证实，当使用数据库模板时，准确性会有显著的损失，并显示了Kullback-Leibler散度比最小二乘误差代价函数的优越性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

自引率

0.00%

发文量