
2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP): Latest Papers

Coordinated beamforming in MIMO FBMC/OQAM systems
Yao Cheng, Peng Li, M. Haardt
In this contribution, we propose a coordinated transmit beamforming technique for point-to-point multiple-input multiple-output (MIMO) filter bank based multi-carrier with offset quadrature amplitude modulation (FBMC/OQAM) systems. To enable reliable transmission when the number of transmit antennas does not exceed the number of receive antennas and the channel is not flat fading, we design a joint, iterative procedure to calculate the precoding matrix and the decoding matrix for each subcarrier. Simulation results show that the proposed algorithm outperforms existing transmission strategies for MIMO FBMC/OQAM systems. It is also observed that, with the proposed coordinated beamforming scheme, the MIMO FBMC/OQAM system achieves a bit error rate (BER) performance similar to that of its counterpart based on orthogonal frequency division multiplexing with cyclic prefix insertion (CP-OFDM), while offering higher spectral efficiency, greater robustness against synchronization errors, and lower out-of-band radiation.
DOI: https://doi.org/10.1109/ICASSP.2014.6853643 · Pages: 484-488 · Published: 2014-05-04
Citations: 14
Verification based ECG biometrics with cardiac irregular conditions using heartbeat level and segment level information fusion
Ming Li, Xin Li
We propose a robust ECG-based human verification system for both healthy and cardiac irregular conditions using heartbeat-level and segment-level information fusion. At the heartbeat level, we first propose a novel beat normalization and outlier removal algorithm, applied after peak detection, to extract normalized representative beats. Then, after principal component analysis (PCA), we apply linear discriminant analysis (LDA) and within-class covariance normalization (WCCN) for beat variability compensation, followed by cosine similarity and Snorm for scoring. At the segment level, we adopt the hierarchical Dirichlet process auto-regressive hidden Markov model (HDP-AR-HMM) in the Bayesian non-parametric framework for unsupervised joint segmentation and clustering without any peak detection. It automatically decodes each raw signal into a string vector. We then apply an n-gram language model and hypothesis testing for scoring. Combining the two subsystems further improves performance and outperforms the PCA baseline by 25% relative on the PTB database.
DOI: https://doi.org/10.1109/ICASSP.2014.6854306 · Pages: 3769-3773 · Published: 2014-05-04
Citations: 20
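The heartbeat-level subsystem in the entry above scores beats with cosine similarity. As a minimal sketch of that scoring step only, the function below compares an enrolled beat vector with a test beat vector. The name `cosine_score` and the toy inputs are illustrative assumptions, and the PCA, LDA, WCCN and Snorm stages mentioned in the abstract are not reproduced here.

```python
import math


def cosine_score(enrolled, test):
    """Cosine similarity between two equal-length feature vectors.

    Hypothetical helper for illustration; not the authors' implementation.
    """
    if len(enrolled) != len(test):
        raise ValueError("vectors must have the same length")
    dot = sum(a * b for a, b in zip(enrolled, test))
    norm_enrolled = math.sqrt(sum(a * a for a in enrolled))
    norm_test = math.sqrt(sum(b * b for b in test))
    if norm_enrolled == 0.0 or norm_test == 0.0:
        raise ValueError("a zero vector has no direction")
    return dot / (norm_enrolled * norm_test)
```

A score near 1 means the two vectors point in nearly the same direction; a verification system would typically compare such a score with a threshold chosen on held-out data.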
Using monocular depth cues for modeling stereoscopic 3D saliency
Iana Iatsun, M. Larabi, C. Fernandez-Maloigne
Saliency is one of the most important features in human visual perception. It is now widely used for perceptually optimizing image processing algorithms. Several models have been proposed for 2D images, but only a few attempts have been made for 3D ones. In this paper, we propose a stereoscopic 3D saliency model relying on 2D saliency features jointly with depth obtained from monocular cues. On the one hand, the use of 2D saliency features is justified psychophysically by the similarity observed between 2D and 3D attention maps. On the other hand, 3D perception is significantly based on monocular cues. Validation of our model against attention maps using state-of-the-art measures, including Kullback-Leibler divergence (KLD), area under the curve (AUC) and correlation coefficient (CC), showed very good performance.
DOI: https://doi.org/10.1109/ICASSP.2014.6853664 · Pages: 589-593 · Published: 2014-05-04
Citations: 6
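The saliency entry above validates its model with KLD and CC against attention maps. The sketch below shows how these two measures are commonly computed, assuming both maps are given as flattened lists of non-negative values. The function names, the direction of the divergence (reference relative to prediction) and the small `eps` guard are assumptions for illustration; the abstract does not specify them, and AUC is omitted.

```python
import math


def normalize(values):
    """Scale non-negative values so they sum to 1 (a discrete distribution)."""
    total = sum(values)
    if total <= 0:
        raise ValueError("map must have positive total mass")
    return [v / total for v in values]


def kl_divergence(reference, predicted, eps=1e-12):
    """KL divergence of the reference map from the predicted map (assumed direction)."""
    p = normalize(reference)
    q = normalize(predicted)
    # eps avoids log(0) and division by zero where the prediction is empty.
    return sum(pi * math.log((pi + eps) / (qi + eps)) for pi, qi in zip(p, q) if pi > 0)


def correlation_coefficient(a, b):
    """Pearson correlation coefficient between two flattened maps."""
    n = len(a)
    mean_a = sum(a) / n
    mean_b = sum(b) / n
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    if var_a == 0 or var_b == 0:
        raise ValueError("correlation is undefined for a constant map")
    return cov / math.sqrt(var_a * var_b)
```

Lower KLD and higher CC both indicate closer agreement between a predicted saliency map and the reference attention map.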
A postfilter to modify the modulation spectrum in HMM-based speech synthesis
Shinnosuke Takamichi, T. Toda, Graham Neubig, S. Sakti, Satoshi Nakamura
In this paper, we propose a postfilter to compensate the modulation spectrum in HMM-based speech synthesis. To alleviate over-smoothing effects, which are a main cause of quality degradation in HMM-based speech synthesis, it is necessary to consider features that can capture over-smoothing. Global Variance (GV) is one well-known example of such a feature, and the effectiveness of parameter generation algorithms considering GV has been confirmed. However, the quality gap between natural speech and synthetic speech is still large. In this paper, we introduce the Modulation Spectrum (MS) of the speech parameter trajectory as a new feature to effectively capture the over-smoothing effect, and we propose a postfilter based on the MS. The MS is represented as a power spectrum of the parameter trajectory. The generated speech parameter sequence is filtered to ensure that its MS has a pattern similar to that of natural speech. Experimental results show quality improvements when the proposed methods are applied to spectral and F0 components, compared with conventional methods considering GV.
DOI: https://doi.org/10.1109/ICASSP.2014.6853604 · Pages: 290-294 · Published: 2014-05-04
Citations: 76
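The entry above describes the MS as a power spectrum of the parameter trajectory. A minimal sketch of that definition, assuming a single one-dimensional trajectory and a plain discrete Fourier transform, is given below. The function name `modulation_spectrum` is hypothetical, and details the abstract does not state (windowing, segment length, log scaling, and the postfilter itself) are left out.

```python
import cmath
import math


def modulation_spectrum(trajectory):
    """Power spectrum |DFT|^2 of a 1-D parameter trajectory.

    Naive O(N^2) DFT for clarity; an illustrative assumption, not the
    authors' exact feature extraction.
    """
    n = len(trajectory)
    spectrum = []
    for k in range(n):
        coeff = sum(
            x * cmath.exp(-2j * math.pi * k * t / n)
            for t, x in enumerate(trajectory)
        )
        spectrum.append(abs(coeff) ** 2)
    return spectrum
```

An over-smoothed trajectory varies slowly, so its power concentrates in the lowest bins; comparing this spectrum with that of natural speech is what motivates filtering the generated sequence.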
An unmixing-based method for the analysis of thermal hyperspectral images
M. Cubero-Castan, J. Chanussot, X. Briottet, M. Shimoni, V. Achard
The estimation of surface emissivity and temperature from thermal hyperspectral data is a challenge. Methods exist that estimate the temperature and emissivity of a pixel composed of a single material. However, estimating the temperature of a mixed pixel, i.e., a pixel composed of more than one material, is more complex and has scarcely been investigated in the literature. This paper addresses this issue by proposing an estimator which linearizes the Black Body law around the mean temperature of each material. The performance of this estimator is studied using simulated data with different hyperspectral sensor configurations and under various noise conditions. The obtained results are encouraging and show an accuracy of 0.5 K on the estimated temperature when using a high spectral resolution sensor.
DOI: https://doi.org/10.1109/ICASSP.2014.6855120 · Pages: 7809-7813 · Published: 2014-05-04
Citations: 5
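The entry above linearizes the Black Body law around a mean temperature. Assuming this refers to Planck's law of spectral radiance, the sketch below shows a first-order Taylor expansion in temperature. The function names are hypothetical and the paper's actual estimator for mixed pixels is not reproduced; only the linearization idea is illustrated.

```python
import math

H = 6.62607015e-34  # Planck constant, J*s
C = 2.99792458e8    # speed of light, m/s
K_B = 1.380649e-23  # Boltzmann constant, J/K


def planck(wavelength_m, temp_k):
    """Black-body spectral radiance B(lambda, T) per unit wavelength."""
    x = H * C / (wavelength_m * K_B * temp_k)
    return 2.0 * H * C ** 2 / wavelength_m ** 5 / math.expm1(x)


def planck_dT(wavelength_m, temp_k):
    """Analytic derivative dB/dT = B * x * e^x / (T * (e^x - 1))."""
    x = H * C / (wavelength_m * K_B * temp_k)
    return planck(wavelength_m, temp_k) * x * math.exp(x) / (temp_k * math.expm1(x))


def planck_linearized(wavelength_m, temp_k, mean_temp_k):
    """First-order approximation of B around mean_temp_k (illustrative)."""
    delta = temp_k - mean_temp_k
    return planck(wavelength_m, mean_temp_k) + planck_dT(wavelength_m, mean_temp_k) * delta
```

At a wavelength of 10 µm and a mean temperature of 300 K, the linear approximation stays very close to the exact radiance for deviations of about 1 K, which is the regime in which such a linearization is useful.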
Sentiment retrieval on web reviews using spontaneous natural speech
Jose Costa Pereira, J. Luque, Xavier Anguera Miró
This paper addresses the problem of document retrieval based on sentiment polarity criteria. A query based on natural spontaneous speech, expressing an opinion about a certain topic, is used to search a repository of documents containing favorable or unfavorable opinions. The goal is to retrieve documents whose opinions most closely resemble the one in the query. A semantic system based on speech transcripts is augmented with information from full-length text articles. Posterior probabilities extracted from the articles are used to regularize their transcription counterparts. This paper makes three important contributions. First, we introduce a framework for polarity analysis of sentiments that can accommodate combinations of different modalities and can deal with the absence of any modality. Second, we show that it is possible to improve average precision in sentiment retrieval from speech transcriptions by means of regularization. Third, we demonstrate the robustness of our approach by training regularizers on one dataset while performing sentiment retrieval experiments, with substantial gains, on another dataset.
DOI: https://doi.org/10.1109/ICASSP.2014.6854470 · Pages: 4583-4587 · Published: 2014-05-04
Citations: 10
Reduced complexity sphere decoding using a geometrical approach
M. Abbasi, A. Tadaion, S. Gazor
In this paper, we propose a reduced-complexity algorithm for sphere detection (SD), which is used in multiple-input multiple-output (MIMO) detection, without any performance degradation. The trade-off between complexity and bit error rate is a main challenge in wireless MIMO systems. The maximum likelihood (ML) detector is considered the optimum detector in the literature. Since the complexity of naive ML detectors is very high, SD algorithms have been proposed to lower it. In this paper, we use the result of the geometrical decoder (GD) proposed in [8], which performs as the ML detector and has lower complexity than the SD algorithm. We propose a method to further reduce the complexity of this SD algorithm. We show that the complexity is reduced by almost 60%, i.e., the number of nodes visited by the proposed SD method is on average 60% less than that of the original one.
DOI: https://doi.org/10.1109/ICASSP.2014.6853934 · Pages: 1926-1930 · Published: 2014-05-04
Citations: 1
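The entry above measures complexity by the number of nodes a sphere detector visits. To make that measure concrete, the sketch below implements a textbook depth-first sphere decoder, not the geometrical method of the paper. It assumes a real-valued model with an upper-triangular matrix `R` (for example from a QR decomposition), BPSK symbols in {-1, +1}, and an initially infinite radius; all function names are hypothetical. A brute-force ML search is included for comparison.

```python
import itertools


def sphere_decode(R, y):
    """Find s in {-1,+1}^n minimizing ||y - R s||^2 for upper-triangular R.

    Returns (best_symbols, best_distance, visited_nodes).
    Generic illustration only; not the algorithm proposed in the paper.
    """
    n = len(y)
    best = {"s": None, "dist": float("inf"), "nodes": 0}
    s = [0] * n

    def search(level, partial):
        for symbol in (-1, 1):
            s[level] = symbol
            best["nodes"] += 1
            # Row `level` of R only involves symbols level..n-1, already fixed.
            residual = y[level] - sum(R[level][j] * s[j] for j in range(level, n))
            dist = partial + residual ** 2
            if dist >= best["dist"]:
                continue  # outside the current sphere: prune this branch
            if level == 0:
                best["dist"] = dist  # new best leaf shrinks the sphere radius
                best["s"] = list(s)
            else:
                search(level - 1, dist)

    search(n - 1, 0.0)
    return best["s"], best["dist"], best["nodes"]


def brute_force_ml(R, y):
    """Exhaustive ML search over all 2^n symbol vectors, for comparison."""
    n = len(y)

    def cost(s):
        return sum(
            (y[i] - sum(R[i][j] * s[j] for j in range(n))) ** 2 for i in range(n)
        )

    best = min(itertools.product((-1, 1), repeat=n), key=cost)
    return list(best), cost(best)
```

Both functions return the same symbol vector; the decoder's advantage is that pruning lets it skip branches of the search tree, and the visited-node count is the quantity that complexity reductions of the kind reported above refer to.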
Automatic spatial gain control for an informed spatial filter
Sebastian Braun, O. Thiergart, Emanuël Habets
When capturing speech in a multi-talker telecommunication scenario, it is desirable to keep the enhanced signal at an equal loudness level for each speaker. Single-channel automatic gain control systems are not able to adjust the levels of different talkers when they are simultaneously active. In this work, an automatic spatial gain control (ASGC) algorithm is proposed that adjusts the directional response of an existing informed spatial filter such that the direct sound of multiple sources can be kept at a constant desired loudness level at the output. The spatial filter additionally reduces diffuse sound and ambient noise. It is shown that the proposed ASGC works well within the tested scenario and is able to adjust the levels of different speakers even during double-talk scenarios.
DOI: https://doi.org/10.1109/ICASSP.2014.6853713 · Pages: 830-834 · Published: 2014-05-04
Citations: 3
Label propagation through edge-preserving filters
Richard Rzeszutek, D. Androutsos
In this paper, we investigate methods for propagating automatically generated or user-defined labels through an image using edge-preserving filters. We focus on the domain transform filter, as it has been used for propagation purposes in the past. The method we present addresses some of the numerical issues that arise when using the filter directly and also improves on the results by better respecting the underlying image structure during label propagation. Finally, we demonstrate that a filter-based approach is preferable to global optimization for interpolating automatically generated sparse features.
DOI: https://doi.org/10.1109/ICASSP.2014.6853666 · Pages: 599-603 · Published: 2014-05-04
Citations: 4
Active target detection with mobile agents
Sunav Choudhary, Naveen Kumar, S. Narayanan, U. Mitra
A strategy for active target detection suitable for mobile agents in a field is presented. In particular, there is an interest in autonomous underwater vehicles. By exploiting notions from group testing, the proposed algorithm decides when to collect new samples depending on whether the mobile agent perceives that the sensor measurements correspond to noise or to a target pattern. Under suitable assumptions about the field emanating from the target, i.e., that the target signature is locally low rank in the field, one can efficiently sample the field to locate the target using O(m log m log n) samples on an n × n grid, where m ≪ n is a parameter specifying the group size.
DOI: https://doi.org/10.1109/ICASSP.2014.6854390 · Pages: 4185-4189 · Published: 2014-05-04
Citations: 10