2008 IEEE International Conference on Acoustics, Speech and Signal Processing最新文献

英文中文

Root-music based direction-of-arrival estimation methods for arbitrary non-uniform arrays 基于根音乐的任意非均匀阵列到达方向估计方法

2008 IEEE International Conference on Acoustics, Speech and Signal Processing

Pub Date : 2008-05-12 DOI: 10.1109/ICASSP.2008.4518110

M. Rübsamen, A. Gershman

Two computationally efficient high-resolution methods are proposed for direction-of-arrival (DOA) estimation in arbitrary nonuniform sensor arrays. Our first algorithm is based on the fact that the spectral MUSIC function is periodic in angle. Expanding this function using Fourier series, we reformulate the DOA estimation problem as an equivalent polynomial rooting problem. Our second approach applies the inverse Fourier transform to the so-obtained root-MUSIC polynomial to compute the null-spectrum without any polynomial rooting, using a simple line search. The proposed techniques are shown to offer substantially improved performance-to- complexity tradeoffs as compared to the existing root-MUSIC-type methods applicable to non-uniform arrays.

针对任意非均匀传感器阵列，提出了两种计算效率较高的高分辨率到达方向估计方法。我们的第一个算法是基于谱MUSIC函数在角度上是周期性的这一事实。利用傅里叶级数展开该函数，我们将DOA估计问题重新表述为一个等价多项式生根问题。我们的第二种方法将傅里叶反变换应用于获得的根music多项式，使用简单的直线搜索，在没有任何多项式生根的情况下计算零谱。与适用于非均匀数组的现有root- music类型方法相比，所提出的技术显示出大大改进了性能与复杂性之间的权衡。

引用次数: 23

Adaptive rate estimation for H.264/AVC intra mode decision H.264/AVC模式内决策的自适应速率估计

2008 IEEE International Conference on Acoustics, Speech and Signal Processing

Pub Date : 2008-05-12 DOI: 10.1109/ICASSP.2008.4517820

Ke-Ying Liao, J. Yang, Ming-Ting Sun

In this paper, a modified bit-rate estimation method is proposed to reduce the computation for 4x4 intra mode decision of H.264/AVC video encoder. The number of coded bits is modeled by a linear combination of existing coding parameters, which are highly related to the entropy coding of H.264/AVC. Furthermore, to improve the accuracy of the estimation, the proposed scheme is made adaptive to the information obtained from previously coded blocks. Comparing to the original rate distortion optimized (RDO) encoding process, which needs to calculate the actual encoded bits of H.264/AVC for each coding mode, the proposed adaptive rate estimation can save about 28% and 21% of the total encoding time for QCIF and VGA sequences, respectively. For the coding performance, the proposed method achieves nearly no loss in visual quality with only slight bit-rate increases.

为了减少H.264/AVC视频编码器4 × 4帧内模式决策的计算量，提出了一种改进的码率估计方法。编码的比特数由现有编码参数的线性组合来建模，这与H.264/AVC的熵编码高度相关。此外，为了提高估计的准确性，该方案对先前编码块中获得的信息进行了自适应处理。原始的RDO编码过程需要计算每种编码模式下H.264/AVC的实际编码比特数，相比之下，本文提出的自适应率估计可以分别为QCIF和VGA序列节省约28%和21%的总编码时间。在编码性能方面，所提出的方法几乎没有视觉质量的损失，只有轻微的比特率增加。

引用次数: 1

On fusion of timbre-motivated features for singing voice detection and singer identification 基于音色动机特征的歌唱声音检测与歌手识别

2008 IEEE International Conference on Acoustics, Speech and Signal Processing

Pub Date : 2008-05-12 DOI: 10.1109/ICASSP.2008.4518087

T. Nwe, Haizhou Li

Timbre is the quality of sound which allows the ear to distinguish between musical sounds. In this paper, we study timbre effects in identification of singing voice segments in popular songs. Firstly, we identify between singing voice and instrumental segments in a song. Then, singing voice segments are further categorized according to their singer identity. Timbre-motivated effects are formulated by fusion of systems that use the features from vibrato, harmonic information and other features extracted using Mel and Log frequency scale filter banks. Statistical methods to select singing voice segments with high confidence measure are proposed for better performance in singer identification process. The experiments conducted on a database of 214 popular songs show that the proposed approach is effective.

音色是声音的质量，它可以让耳朵区分不同的音乐声音。本文研究了流行歌曲唱段识别中的音色效应。首先，我们识别歌曲中的唱腔和器乐部分。然后，根据歌唱者的身份，进一步对歌唱声段进行分类。音色驱动的效果是由使用Mel和Log频率尺度滤波器组提取的振动、谐波信息和其他特征的系统融合而成的。为了在歌手识别过程中取得更好的效果，提出了采用统计方法选取具有高置信度度量的演唱人声片段。在214首流行歌曲的数据库上进行的实验表明，该方法是有效的。

引用次数: 27

Point-to-point MIMO MMSE vector precoding and thp achieving capacity 点对点MIMO MMSE矢量预编码和thp实现能力

2008 IEEE International Conference on Acoustics, Speech and Signal Processing

Pub Date : 2008-05-12 DOI: 10.1109/ICASSP.2008.4518262

M. Joham, H. Brunner, R. Hunger, D. Schmidt, W. Utschick

Non-linear precoding for point-to-point (P2P) multiple-input multiple-output (MIMO) systems is considered. First, the minimum mean square error (MMSE) optimal vector precoding (VP) is presented for different receiver structures, viz., weighted identity matrix, diagonal matrix, weighted unitary matrix, and matrix without particular structure. Whereas the former two structures can also be applied to the vector broadcast channel, the latter two are only realizable for cooperative receivers. Second, VP is derived that minimizes the MSE but is restricted to maximize the mutual information of the MIMO channel. Third, the corresponding Tomlinson-Harashima precoding (THP) is found by applying the nearest-plane approximation to the computation of the perturbation signal. The resulting maximum mutual information THP clearly outperforms the state-of-the-art P2P-MIMO THP based on the generalized triangular decomposition (GTD) with respect to MSE and BER.

研究了点对点多输入多输出(MIMO)系统的非线性预编码问题。首先，针对不同的接收机结构，即加权单位矩阵、对角矩阵、加权酉矩阵和无特定结构矩阵，提出了最小均方误差(MMSE)最优矢量预编码(VP)。而前两种结构也可以应用于矢量广播信道，后两种结构仅适用于合作接收机。其次，推导出最小化MSE的VP，但限制为最大化MIMO信道的互信息。第三，通过对扰动信号的计算应用最接近平面逼近，得到相应的Tomlinson-Harashima预编码(THP)。由此产生的最大互信息THP明显优于基于MSE和BER的广义三角分解(GTD)的最先进的P2P-MIMO THP。

引用次数: 3

Radar target doa estimation: Moving window VS AML estimator 雷达目标方位估计:移动窗口与AML估计器

2008 IEEE International Conference on Acoustics, Speech and Signal Processing

Pub Date : 2008-05-12 DOI: 10.1109/ICASSP.2008.4517906

M. Greco, F. Gini, A. Farina, L. Timmoneri

In this paper we compare two radar target direction-of-arrival (DOA) estimation algorithms, the classical moving window (MW) and the asymptotic maximum likelihood (AML) estimators. The first technique for azimuth DOA estimation exploits multiple detections in the same time-on-target and the second one exploits the fact that the radar antenna mechanical scanning impresses an amplitude modulation on the signals backscattered by the target. Performances of the estimators are numerically investigated through Monte Carlo simulation in terms of root-mean-square-error (RMSE), probability of detection for a fixed probability of false alarm, and probability of "splitting". The obtained results show that the asymptotic maximum likelihood estimator generally outperforms the classical moving window estimator.

本文比较了两种雷达目标到达方向(DOA)估计算法，即经典的移动窗口(MW)估计算法和渐近极大似然(AML)估计算法。第一种方位角估计技术利用了同一时间内的多次探测，第二种技术利用了雷达天线机械扫描对目标后向散射信号进行幅度调制的特性。通过蒙特卡罗模拟对估计器的性能进行了数值研究，包括均方根误差(RMSE)、固定概率的虚警检测概率和“分裂”概率。结果表明，渐近极大似然估计量总体上优于经典的移动窗估计量。

引用次数: 3

Joint algorithm/code-level optimization of H.264 video decoder for mobile multimedia applications 面向移动多媒体应用的H.264视频解码器联合算法/码级优化

2008 IEEE International Conference on Acoustics, Speech and Signal Processing

Pub Date : 2008-05-12 DOI: 10.1109/ICASSP.2008.4518078

Ting-Yu Huang, Guo-An Jian, J. Chu, Ching-Lung Su, Jiun-In Guo

In this paper, we propose a joint algorithm/code-level optimization scheme to make it feasible to perform real-time H.264/AVC video decoding software on ARM-based platform for mobile multimedia applications. In the algorithm-level optimization, we propose various techniques like fast interpolation scheme, zero-skipping technique for texture decoding, fast boundary strength decision for in-loop filtering, and pattern matching algorithm for CAVLD. In the code-level optimization, we propose the design techniques on minimizing memory access and branch times. The experimental result shows that we have reduced the complexity of H.264 video decoder up to 93% as compared to the reference software JM9.7. The optimized H.264 video decoder can achieve the QCIF@30Hz video decoding on an ARM9 processor when operating at 120MHz clock.

本文提出了一种联合算法/代码级优化方案，使H.264/AVC视频实时解码软件在arm平台上的移动多媒体应用成为可能。在算法级优化方面，我们提出了快速插值方案、纹理解码的跳零技术、环内滤波的快速边界强度决策以及CAVLD的模式匹配算法等多种技术。在代码级优化中，我们提出了最小化内存访问和分支时间的设计技术。实验结果表明，与参考软件JM9.7相比，我们将H.264视频解码器的复杂度降低了93%。优化后的H.264视频解码器可以在ARM9处理器上实现QCIF@30Hz视频解码，工作频率为120MHz。

引用次数: 6

Sparse measurements, compressed sampling, and DNA microarrays 稀疏测量，压缩采样和DNA微阵列

2008 IEEE International Conference on Acoustics, Speech and Signal Processing

Pub Date : 2008-05-12 DOI: 10.1109/ICASSP.2008.4517676

H. Vikalo, F. Parvaresh, Sidhant Misra, B. Hassibi

DNA microarrays comprising tens of thousands of probe spots are currently being employed to test multitude of targets in a single experiment. Typically, each microarray spot contains a large number of copies of a single probe designed to capture a single target, and hence collects only a single data point. This is a wasteful use of the sensing resources in comparative DNA microarray experiments, where a test sample is measured relative to a reference sample. Since only a small fraction of the total number of genes represented by the two samples is differentially expressed, a vast number of probe spots will not provide any useful information. To this end we consider an alternative design, the so-called compressed microarrays, wherein each spot is a composite of several different probes and the total number of spots is potentially much smaller than the number of targets being tested. Fewer spots directly translates to significantly lower costs due to cheaper array manufacturing, simpler image acquisition and processing, and smaller amount of genomic material needed for experiments. To recover signals from compressed microarray measurements, we leverage ideas from compressive sampling. Moreover, we propose an algorithm which has far less computational complexity than the widely-used linear-programming-based methods, and can also recover signals with less sparsity.

由成千上万个探针点组成的DNA微阵列目前被用于在一次实验中测试多个目标。通常，每个微阵列点包含单个探针的大量副本，用于捕获单个目标，因此只收集单个数据点。在比较DNA微阵列实验中，这是对传感资源的浪费，其中测试样本相对于参考样本进行测量。由于两个样本所代表的基因总数中只有一小部分是差异表达的，因此大量的探针点将无法提供任何有用的信息。为此，我们考虑了一种替代设计，即所谓的压缩微阵列，其中每个点是几个不同探针的组合，并且点的总数可能比被测试的目标数量要小得多。由于更便宜的阵列制造，更简单的图像采集和处理，以及实验所需的更少的基因组材料，更少的斑点直接转化为显着降低的成本。为了从压缩微阵列测量中恢复信号，我们利用压缩采样的思想。此外，我们提出了一种算法，其计算复杂度远低于广泛使用的基于线性规划的方法，并且还可以恢复稀疏度较小的信号。

{"title":"Sparse measurements, compressed sampling, and DNA microarrays","authors":"H. Vikalo, F. Parvaresh, Sidhant Misra, B. Hassibi","doi":"10.1109/ICASSP.2008.4517676","DOIUrl":"https://doi.org/10.1109/ICASSP.2008.4517676","url":null,"abstract":"DNA microarrays comprising tens of thousands of probe spots are currently being employed to test multitude of targets in a single experiment. Typically, each microarray spot contains a large number of copies of a single probe designed to capture a single target, and hence collects only a single data point. This is a wasteful use of the sensing resources in comparative DNA microarray experiments, where a test sample is measured relative to a reference sample. Since only a small fraction of the total number of genes represented by the two samples is differentially expressed, a vast number of probe spots will not provide any useful information. To this end we consider an alternative design, the so-called compressed microarrays, wherein each spot is a composite of several different probes and the total number of spots is potentially much smaller than the number of targets being tested. Fewer spots directly translates to significantly lower costs due to cheaper array manufacturing, simpler image acquisition and processing, and smaller amount of genomic material needed for experiments. To recover signals from compressed microarray measurements, we leverage ideas from compressive sampling. Moreover, we propose an algorithm which has far less computational complexity than the widely-used linear-programming-based methods, and can also recover signals with less sparsity.","PeriodicalId":333742,"journal":{"name":"2008 IEEE International Conference on Acoustics, Speech and Signal Processing","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2008-05-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129238901","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 3

Globally optimal decentralized spatial smoothing for wireless sensor networks with local interactions 具有局部交互的无线传感器网络全局最优分散空间平滑

2008 IEEE International Conference on Acoustics, Speech and Signal Processing

Pub Date : 2008-05-12 DOI: 10.1109/ICASSP.2008.4518097

S. Barbarossa, T. Battisti, A. Swami

In most sensor network applications, the vector containing the observations gathered by the sensors lies in a space of dimension equal to the number of nodes, typically because of observation noise, even though the useful signal belongs to a subspace of much smaller dimension. This motivates smoothing or rank reduction. We formulate a convex optimization problem, where we incorporate a fidelity constraint that prevents the final smoothed estimate from diverging too far from the observations. This leads to a distributed algorithm in which nodes exchange updates only with neighboring nodes. We show that the widely studied consensus algorithm is indeed only a very specific case of our more general formulation. Finally, we study the convergence rate and propose some approaches to maximize it.

在大多数传感器网络应用中，包含传感器收集的观测值的向量位于维度等于节点数的空间中，这通常是因为观测噪声，即使有用信号属于维度小得多的子空间。这激发了平滑或秩减少。我们制定了一个凸优化问题，其中我们纳入了保真度约束，以防止最终平滑估计偏离观测值太远。这导致了一种分布式算法，其中节点仅与相邻节点交换更新。我们表明，广泛研究的共识算法实际上只是我们更一般的公式的一个非常具体的例子。最后，我们研究了收敛速度，并提出了一些最大化收敛速度的方法。

引用次数: 1

Autofocus in multistatic passive SAR imaging 多基地被动SAR成像中的自动对焦

2008 IEEE International Conference on Acoustics, Speech and Signal Processing

Pub Date : 2008-05-12 DOI: 10.1109/ICASSP.2008.4517850

Kuang-Hung Liu, D. Munson

Synthetic aperture radar (SAR) imaging suffers from image focus degradation in the presence of phase errors in the received signals due to unknown platform or target motion. We study automatic focusing (autofocus) under a multistatic passive scenario, where the illumination is provided by a set of stationary UHF transmitters and the goal is to image a moving aircraft. We employ heuristic iterative estimation algorithms that maximize a sharpness metric of the image. A similar method has been studied for the case of mono-static radar, where only one antenna is used for both transmitting and receiving. We present simulation results to help assess the effectiveness of the proposed autofocus approach.

合成孔径雷达(SAR)成像由于平台或目标运动未知，在接收信号中存在相位误差，导致图像焦距下降。我们研究了多静态被动场景下的自动对焦(autofocus)，其中照明由一组固定的UHF发射机提供，目标是对移动的飞机进行成像。我们采用启发式迭代估计算法，最大化图像的清晰度度量。在单静态雷达中，只使用一个天线进行发射和接收，研究了类似的方法。我们给出了仿真结果来帮助评估所提出的自动对焦方法的有效性。

引用次数: 8

Smart tone reproduction 智能音色再现

2008 IEEE International Conference on Acoustics, Speech and Signal Processing

Pub Date : 2008-05-12 DOI: 10.1109/ICASSP.2008.4517851

Dun-Yu Hsiao, H. Liao

In this paper, we propose an effective scheme to enhance the visual details at the minimal cost of user adjustments. The uprising importance of automatic tone reproduction comes from the increasing population of digital archive programs, which contains a large number of images/videos either old irreproducible, or poorly captured. We attempt to solve above issues by a new local normalization step and an adaptive contrast assessment process. With those two processes, our method can effectively enhance poor quality regions and simultaneously preserving good quality ones with default parameter settings. The experimental results demonstrate that our method is superior to many existing algorithms when applied to aid digital archiving issues.

在本文中，我们提出了一种有效的方案来增强视觉细节，以最小的用户调整成本。自动音色再现的重要性来自于数字档案程序的不断增加，其中包含大量的图像/视频，要么是旧的不可复制的，要么是捕捉不好的。我们尝试通过新的局部归一化步骤和自适应对比评估过程来解决上述问题。通过这两个过程，我们的方法可以有效地增强质量较差的区域，同时保留默认参数设置的质量较好的区域。实验结果表明，该方法在辅助数字存档问题上优于现有的许多算法。

引用次数: 1

首页上一页

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

2008 IEEE International Conference on Acoustics, Speech and Signal Processing

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀