首页 > 最新文献

2003 IEEE XIII Workshop on Neural Networks for Signal Processing (IEEE Cat. No.03TH8718)最新文献

英文 中文
Blind separation and deconvolution of MIMO system driven by colored inputs using SIMO-model-based ICA with information-geometric learning 基于信息几何学习的simo模型ICA对彩色输入驱动的MIMO系统进行盲分离和反卷积
H. Saruwatari, H. Yamajo, T. Takatani, T. Nishikawa, K. Shikano
We propose a new two-stage blind separation and deconvolution algorithm for multiple-input multiple-output (MIMO)- FIR system driven by colored sound sources, in which a new single-input multiple-output (SIMO)-model-based ICA (SIMO-ICA) and blind multichannel inverse filtering are combined. SIMO-ICA can separate the mixed signals, not into monaural source signals but into SIMO-model-based signals from independent sources. After SIMO-ICA, a simple blind deconvolution technique for the SIMO model can be applied even when each source signal is temporally correlated. The simulation results reveal that the proposed algorithm can successfully achieve the separation and deconvolution for a convolutive mixture of speech.
针对彩色声源驱动的多输入多输出(MIMO)- FIR系统,提出了一种新的两级盲分离和反卷积算法,该算法将一种新的基于单输入多输出(SIMO)模型的ICA (SIMO-ICA)和盲多通道反滤波相结合。SIMO-ICA可以将混合信号分离,而不是将其分离为单源信号,而是将其分离为独立源的基于simo模型的信号。在SIMO- ica之后,即使每个源信号都是时间相关的,SIMO模型也可以应用简单的盲反褶积技术。仿真结果表明,该算法能够成功地实现卷积混合语音的分离和反卷积。
{"title":"Blind separation and deconvolution of MIMO system driven by colored inputs using SIMO-model-based ICA with information-geometric learning","authors":"H. Saruwatari, H. Yamajo, T. Takatani, T. Nishikawa, K. Shikano","doi":"10.1109/NNSP.2003.1318037","DOIUrl":"https://doi.org/10.1109/NNSP.2003.1318037","url":null,"abstract":"We propose a new two-stage blind separation and deconvolution algorithm for multiple-input multiple-output (MIMO)- FIR system driven by colored sound sources, in which a new single-input multiple-output (SIMO)-model-based ICA (SIMO-ICA) and blind multichannel inverse filtering are combined. SIMO-ICA can separate the mixed signals, not into monaural source signals but into SIMO-model-based signals from independent sources. After SIMO-ICA, a simple blind deconvolution technique for the SIMO model can be applied even when each source signal is temporally correlated. The simulation results reveal that the proposed algorithm can successfully achieve the separation and deconvolution for a convolutive mixture of speech.","PeriodicalId":315958,"journal":{"name":"2003 IEEE XIII Workshop on Neural Networks for Signal Processing (IEEE Cat. No.03TH8718)","volume":"29 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121139304","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Regularized discriminative clustering 正则化判别聚类
Samuel Kaski, J. Sinkkonen, Arto Klami
A generative distributional clustering model for continuous data is reviewed and methods for optimizing and regularizing it are introduced and compared. Based on pairs of auxiliary and primary data, the primary data space is partitioned into Voronoi regions that are maximally homogeneous in terms of auxiliary data. Then only variation in the primary data associated with variation in the auxiliary data influences the clusters. Because the whole primary space is partitioned, new samples can be easily clustered in terms of primary data alone. In experiments, the approach is shown to produce more homogeneous clusters than alternative methods. Two regularization methods are demonstrated to further improve the results: an entropy-type penalty for unequal cluster sizes, and the inclusion of a K-means component to the model. The latter can alternatively be interpreted as special kind of joint distribution modeling where the emphasis between discrimination and unsupervised modeling of primary data can be tuned.
对连续数据的生成分布聚类模型进行了综述,并对其优化和正则化的方法进行了介绍和比较。基于辅助数据和主要数据对,将主要数据空间划分为辅助数据最大齐次的Voronoi区域。然后,只有与辅助数据相关的原始数据的变化才会影响聚类。由于整个主空间被划分,因此可以很容易地仅根据主数据聚类新的样本。在实验中,该方法被证明比其他方法产生更均匀的簇。本文演示了两种正则化方法来进一步改善结果:对不相等簇大小的熵型惩罚,以及在模型中包含K-means成分。后者可以被解释为特殊类型的联合分布建模,其中可以调整原始数据的判别和无监督建模之间的重点。
{"title":"Regularized discriminative clustering","authors":"Samuel Kaski, J. Sinkkonen, Arto Klami","doi":"10.1109/NNSP.2003.1318028","DOIUrl":"https://doi.org/10.1109/NNSP.2003.1318028","url":null,"abstract":"A generative distributional clustering model for continuous data is reviewed and methods for optimizing and regularizing it are introduced and compared. Based on pairs of auxiliary and primary data, the primary data space is partitioned into Voronoi regions that are maximally homogeneous in terms of auxiliary data. Then only variation in the primary data associated with variation in the auxiliary data influences the clusters. Because the whole primary space is partitioned, new samples can be easily clustered in terms of primary data alone. In experiments, the approach is shown to produce more homogeneous clusters than alternative methods. Two regularization methods are demonstrated to further improve the results: an entropy-type penalty for unequal cluster sizes, and the inclusion of a K-means component to the model. The latter can alternatively be interpreted as special kind of joint distribution modeling where the emphasis between discrimination and unsupervised modeling of primary data can be tuned.","PeriodicalId":315958,"journal":{"name":"2003 IEEE XIII Workshop on Neural Networks for Signal Processing (IEEE Cat. No.03TH8718)","volume":"17 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128079079","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 14
Stochastic complexities of hidden Markov models 隐马尔可夫模型的随机复杂性
Keisuke Yamazaki, Sumio Watanabe
Hidden Markov models are now used in many fields, for example, speech recognition, natural language processing etc. However, the mathematical foundation of analysis for the models has not yet been constructed, since the HMMs are non-identifiable. In recent years, we have developed the algebraic geometrical method that allows us to analyze the non-regular and non-identifiable models. In this paper, we apply this method to the HMM and reveal the asymptotic order of its stochastic complexity in the mathematically rigorous way.
隐马尔可夫模型目前应用于许多领域,如语音识别、自然语言处理等。然而,模型分析的数学基础尚未建立,因为hmm是不可识别的。近年来,我们发展了代数几何方法,使我们能够分析非规则和不可识别的模型。本文将此方法应用于HMM,并以数学严谨的方式揭示了其随机复杂度的渐近阶。
{"title":"Stochastic complexities of hidden Markov models","authors":"Keisuke Yamazaki, Sumio Watanabe","doi":"10.1109/NNSP.2003.1318017","DOIUrl":"https://doi.org/10.1109/NNSP.2003.1318017","url":null,"abstract":"Hidden Markov models are now used in many fields, for example, speech recognition, natural language processing etc. However, the mathematical foundation of analysis for the models has not yet been constructed, since the HMMs are non-identifiable. In recent years, we have developed the algebraic geometrical method that allows us to analyze the non-regular and non-identifiable models. In this paper, we apply this method to the HMM and reveal the asymptotic order of its stochastic complexity in the mathematically rigorous way.","PeriodicalId":315958,"journal":{"name":"2003 IEEE XIII Workshop on Neural Networks for Signal Processing (IEEE Cat. No.03TH8718)","volume":"398 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131848121","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 16
Multi-modal audio-visual event recognition for football analysis 用于足球分析的多模态视听事件识别
M. Barnard, J. Odobez, Samy Bengio
The recognition of events within multi-modal data is a challenging problem. In this paper we focus on the recognition of events by using both audio and video data. We investigate the use of data fusion techniques in order to recognise these sequences within the framework of hidden Markov models (HMM) used to model audio and video data sequences. Specifically we look at the recognition of play and break sequences in football and the segmentation of football games based on these two events. Recognising relatively simple semantic events such as this is an important step towards full automatic indexing of such video material. These experiments were done using approximately 3 hours of data from two games of the Euro96 competition. We propose that modelling the audio and video streams separately for each sequence and fusing the decisions from each stream should yield an accurate and robust method of segmenting multi-modal data.
多模态数据中的事件识别是一个具有挑战性的问题。在本文中,我们重点研究了同时使用音频和视频数据的事件识别。我们研究了数据融合技术的使用,以便在用于建模音频和视频数据序列的隐马尔可夫模型(HMM)框架内识别这些序列。具体来说,我们着眼于足球比赛中比赛和中断序列的识别以及基于这两个事件的足球比赛分割。识别诸如此类相对简单的语义事件是实现此类视频材料全自动索引的重要一步。这些实验使用了96年欧洲杯两场比赛中大约3小时的数据。我们提出,为每个序列分别建模音频和视频流,并融合每个流的决策,应该产生一种准确而稳健的多模态数据分割方法。
{"title":"Multi-modal audio-visual event recognition for football analysis","authors":"M. Barnard, J. Odobez, Samy Bengio","doi":"10.1109/NNSP.2003.1318046","DOIUrl":"https://doi.org/10.1109/NNSP.2003.1318046","url":null,"abstract":"The recognition of events within multi-modal data is a challenging problem. In this paper we focus on the recognition of events by using both audio and video data. We investigate the use of data fusion techniques in order to recognise these sequences within the framework of hidden Markov models (HMM) used to model audio and video data sequences. Specifically we look at the recognition of play and break sequences in football and the segmentation of football games based on these two events. Recognising relatively simple semantic events such as this is an important step towards full automatic indexing of such video material. These experiments were done using approximately 3 hours of data from two games of the Euro96 competition. We propose that modelling the audio and video streams separately for each sequence and fusing the decisions from each stream should yield an accurate and robust method of segmenting multi-modal data.","PeriodicalId":315958,"journal":{"name":"2003 IEEE XIII Workshop on Neural Networks for Signal Processing (IEEE Cat. No.03TH8718)","volume":"146 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133104277","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 29
Correlation-based feature detection using pulsed neural networks 基于相关性的脉冲神经网络特征检测
A. Heittmann, U. Ramacher
The feature extraction and detection in visual scenes set up the basis for robust image processing and scene analysis. While the receptive fields of simple cells in the visual cortex are modeled by Gabor functions, simple cells are commonly treated as linear filters. In this paper, we demonstrate how the non-linear operations on pulses like correlation, synchronization and detection of decorrelation can be used for implementation of feature detectors. Using essentially two data-driven adaption rules dependent on dendritic currents and to membrane potentials, linear detection of intensity gradients can be realized. As a technical application, a feature detector sensitive to orientation is presented.
视觉场景中的特征提取和检测为鲁棒图像处理和场景分析奠定了基础。虽然视觉皮层中简单细胞的接受野是由Gabor函数建模的,但简单细胞通常被视为线性过滤器。在本文中,我们演示了如何使用脉冲的非线性操作,如相关、同步和去相关检测来实现特征检测器。利用基于树突电流和膜电位的两个数据驱动的自适应规则,可以实现强度梯度的线性检测。作为一种技术应用,提出了一种对方向敏感的特征检测器。
{"title":"Correlation-based feature detection using pulsed neural networks","authors":"A. Heittmann, U. Ramacher","doi":"10.1109/NNSP.2003.1318047","DOIUrl":"https://doi.org/10.1109/NNSP.2003.1318047","url":null,"abstract":"The feature extraction and detection in visual scenes set up the basis for robust image processing and scene analysis. While the receptive fields of simple cells in the visual cortex are modeled by Gabor functions, simple cells are commonly treated as linear filters. In this paper, we demonstrate how the non-linear operations on pulses like correlation, synchronization and detection of decorrelation can be used for implementation of feature detectors. Using essentially two data-driven adaption rules dependent on dendritic currents and to membrane potentials, linear detection of intensity gradients can be realized. As a technical application, a feature detector sensitive to orientation is presented.","PeriodicalId":315958,"journal":{"name":"2003 IEEE XIII Workshop on Neural Networks for Signal Processing (IEEE Cat. No.03TH8718)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114622496","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
Blending of missile control modes with neural networks 导弹控制模式与神经网络的融合
J. Farges, P. Fabiani, S. L. Ménec
A solution using neural networks for the problem of the choice of the control mode of a missile is proposed and implemented. The test on 7000 interceptions shows that this approach makes it possible to reduce the number of failures (miss distance larger than 5 meters) compared to the use of expert rules.
提出并实现了一种利用神经网络解决导弹控制方式选择问题的方法。对7000次拦截的测试表明,与使用专家规则相比,这种方法可以减少失败次数(脱靶距离大于5米)。
{"title":"Blending of missile control modes with neural networks","authors":"J. Farges, P. Fabiani, S. L. Ménec","doi":"10.1109/NNSP.2003.1318012","DOIUrl":"https://doi.org/10.1109/NNSP.2003.1318012","url":null,"abstract":"A solution using neural networks for the problem of the choice of the control mode of a missile is proposed and implemented. The test on 7000 interceptions shows that this approach makes it possible to reduce the number of failures (miss distance larger than 5 meters) compared to the use of expert rules.","PeriodicalId":315958,"journal":{"name":"2003 IEEE XIII Workshop on Neural Networks for Signal Processing (IEEE Cat. No.03TH8718)","volume":"40 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128643840","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Fast and efficient sequential learning algorithms using direct-link RBF networks 基于直连RBF网络的快速高效的顺序学习算法
V. Asirvadam, S. McLoone, G. Irwin
Novel fast and efficient sequential learning algorithms are proposed for direct-link radial basis function (DRBF) networks. The dynamic DRBF network is trained using the recently proposed decomposed/parallel recursive Levenberg Marquardt (PRLM) algorithm by neglecting the interneuron weight interactions. The resulting sequential learning approach enables weights to be updated in an efficient parallel manner and facilitates a minimal update extension for real-time applications. Simulation results for two benchmark problems show the feasibility of the new training algorithms.
针对直连径向基函数(DRBF)网络,提出了一种新的快速高效的序列学习算法。动态DRBF网络的训练采用了最近提出的分解/并行递归Levenberg Marquardt (PRLM)算法,忽略了神经元间权的相互作用。由此产生的顺序学习方法能够以有效的并行方式更新权重,并为实时应用程序提供最小的更新扩展。两个基准问题的仿真结果表明了新训练算法的可行性。
{"title":"Fast and efficient sequential learning algorithms using direct-link RBF networks","authors":"V. Asirvadam, S. McLoone, G. Irwin","doi":"10.1109/NNSP.2003.1318020","DOIUrl":"https://doi.org/10.1109/NNSP.2003.1318020","url":null,"abstract":"Novel fast and efficient sequential learning algorithms are proposed for direct-link radial basis function (DRBF) networks. The dynamic DRBF network is trained using the recently proposed decomposed/parallel recursive Levenberg Marquardt (PRLM) algorithm by neglecting the interneuron weight interactions. The resulting sequential learning approach enables weights to be updated in an efficient parallel manner and facilitates a minimal update extension for real-time applications. Simulation results for two benchmark problems show the feasibility of the new training algorithms.","PeriodicalId":315958,"journal":{"name":"2003 IEEE XIII Workshop on Neural Networks for Signal Processing (IEEE Cat. No.03TH8718)","volume":"93 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123661463","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 10
Improving independent component analysis performances by variable selection 通过变量选择提高独立成分分析性能
F. Vrins, J. Lee, M. Verleysen, V. Vigneron, C. Jutten
Blind source separation (BSS) consists in recovering unobserved signals from observed mixtures of them. In most cases the whole set of mixtures is used for the separation, possibly after a dimension reduction by PCA. This paper aims to show that in many applications the quality of the separation can be improved by first selecting a subset of some mixtures among the available ones, possibly by an information content criterion, and performing PCA and BSS afterwards. The benefit of this procedure is shown on simulated electrocardiographic data by extracting the fetal electrocardiogram signal from mixtures recorded on the abdomen of a pregnant woman.
盲源分离(BSS)是指从观察到的混合信号中恢复未观察到的信号。在大多数情况下,整个混合物被用于分离,可能是在PCA降维之后。本文旨在表明,在许多应用中,可以通过首先从可用的混合物中选择一些混合物的子集(可能是通过信息含量标准),然后执行主成分分析和BSS,从而提高分离的质量。通过从孕妇腹部记录的混合物中提取胎儿心电图信号,模拟心电图数据显示了该程序的好处。
{"title":"Improving independent component analysis performances by variable selection","authors":"F. Vrins, J. Lee, M. Verleysen, V. Vigneron, C. Jutten","doi":"10.1109/NNSP.2003.1318035","DOIUrl":"https://doi.org/10.1109/NNSP.2003.1318035","url":null,"abstract":"Blind source separation (BSS) consists in recovering unobserved signals from observed mixtures of them. In most cases the whole set of mixtures is used for the separation, possibly after a dimension reduction by PCA. This paper aims to show that in many applications the quality of the separation can be improved by first selecting a subset of some mixtures among the available ones, possibly by an information content criterion, and performing PCA and BSS afterwards. The benefit of this procedure is shown on simulated electrocardiographic data by extracting the fetal electrocardiogram signal from mixtures recorded on the abdomen of a pregnant woman.","PeriodicalId":315958,"journal":{"name":"2003 IEEE XIII Workshop on Neural Networks for Signal Processing (IEEE Cat. No.03TH8718)","volume":"35 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125120946","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 16
Underdetermined blind separation of sparse sources with instantaneous and convolutive mixtures 瞬态和卷积混合稀疏源的欠定盲分离
D. Luengo, I. Santamaría, L. Vielva, C. Pantaleón
We consider the underdetermined blind source separation problem with linear instantaneous and convolutive mixtures when the input signals are sparse, or have been rendered sparse. In the underdetermined case the problem requires solving three sub-problems: detecting the number of sources, estimating the mixing matrix, and finding an adequate inversion strategy to obtain the sources. This paper solves the first two problems. We assume that the number of sources is unknown, and estimate it by means of an information theoretic criterion (MDL). Then the mixing matrix is expressed in spheric coordinates and we estimate sequentially the angles and amplitudes of each column, and their order. The performance of the method is illustrated through simulations.
本文研究了当输入信号稀疏或被渲染为稀疏时,具有线性瞬时和卷积混合的欠定盲源分离问题。在欠定情况下,该问题需要解决三个子问题:检测源数量,估计混合矩阵,找到适当的反演策略来获取源。本文解决了前两个问题。我们假设源的数量是未知的,并通过信息理论准则(MDL)来估计它。然后用球坐标系表示混合矩阵,并依次估计各柱的角度和振幅及其阶数。仿真结果表明了该方法的有效性。
{"title":"Underdetermined blind separation of sparse sources with instantaneous and convolutive mixtures","authors":"D. Luengo, I. Santamaría, L. Vielva, C. Pantaleón","doi":"10.1109/NNSP.2003.1318027","DOIUrl":"https://doi.org/10.1109/NNSP.2003.1318027","url":null,"abstract":"We consider the underdetermined blind source separation problem with linear instantaneous and convolutive mixtures when the input signals are sparse, or have been rendered sparse. In the underdetermined case the problem requires solving three sub-problems: detecting the number of sources, estimating the mixing matrix, and finding an adequate inversion strategy to obtain the sources. This paper solves the first two problems. We assume that the number of sources is unknown, and estimate it by means of an information theoretic criterion (MDL). Then the mixing matrix is expressed in spheric coordinates and we estimate sequentially the angles and amplitudes of each column, and their order. The performance of the method is illustrated through simulations.","PeriodicalId":315958,"journal":{"name":"2003 IEEE XIII Workshop on Neural Networks for Signal Processing (IEEE Cat. No.03TH8718)","volume":"22 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123953690","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 14
Recognition of musical instruments by generalized min-max classifiers 基于广义最小-最大分类器的乐器识别
G. Costantini, A. Rizzi, D. Casali
The correct classification of single musical sources is a relevant aspect for the source separation task and the automatic transcription of polyphonic music. In this paper, we deal with a classification problem concerning the recognition of six different musical instruments: violin, clarinet, flute, oboe, saxophone and piano. A satisfactory solution of such a recognition problem depends mainly on both the preprocessing procedure (set of features extracted from row data) and the adopted classification system. As concerns feature extraction, a suitable signal preprocessing based on FFT, QFT (Q-constant frequency transform) and cepstrum coefficients are employed. We adopt min-max neurofuzzy networks as the classification model, both in their classical and generalized version. The synthesis of these classifiers is performed by the adaptive resolution training technique (ARC, PARC and GPARC algorithms), since it assures good performances and an excellent automation degree.
单一音源的正确分类是音源分离任务和复调音乐自动转写的一个相关方面。本文研究了小提琴、单簧管、长笛、双簧管、萨克斯管和钢琴六种不同乐器的分类问题。这种识别问题的满意解决方案主要取决于预处理程序(从行数据中提取的特征集)和所采用的分类系统。在特征提取方面,采用了基于FFT、QFT (Q-constant frequency transform)和倒谱系数的信号预处理。我们采用最小-最大神经模糊网络作为分类模型,包括经典模型和广义模型。这些分类器的综合是通过自适应分辨率训练技术(ARC, PARC和GPARC算法)进行的,因为它保证了良好的性能和良好的自动化程度。
{"title":"Recognition of musical instruments by generalized min-max classifiers","authors":"G. Costantini, A. Rizzi, D. Casali","doi":"10.1109/NNSP.2003.1318055","DOIUrl":"https://doi.org/10.1109/NNSP.2003.1318055","url":null,"abstract":"The correct classification of single musical sources is a relevant aspect for the source separation task and the automatic transcription of polyphonic music. In this paper, we deal with a classification problem concerning the recognition of six different musical instruments: violin, clarinet, flute, oboe, saxophone and piano. A satisfactory solution of such a recognition problem depends mainly on both the preprocessing procedure (set of features extracted from row data) and the adopted classification system. As concerns feature extraction, a suitable signal preprocessing based on FFT, QFT (Q-constant frequency transform) and cepstrum coefficients are employed. We adopt min-max neurofuzzy networks as the classification model, both in their classical and generalized version. The synthesis of these classifiers is performed by the adaptive resolution training technique (ARC, PARC and GPARC algorithms), since it assures good performances and an excellent automation degree.","PeriodicalId":315958,"journal":{"name":"2003 IEEE XIII Workshop on Neural Networks for Signal Processing (IEEE Cat. No.03TH8718)","volume":"9 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126386103","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
期刊
2003 IEEE XIII Workshop on Neural Networks for Signal Processing (IEEE Cat. No.03TH8718)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1