2008 IEEE International Conference on Acoustics, Speech and Signal Processing最新文献

英文中文

Efficient 4D motion compensated lossless compression of dynamic volumetric medical image data 高效四维运动补偿无损压缩动态体医学图像数据

2008 IEEE International Conference on Acoustics, Speech and Signal Processing

Pub Date : 2008-05-12 DOI: 10.1109/ICASSP.2008.4517668

V. Sanchez, P. Nasiopoulos, R. Abugharbieh

Dynamic volumetric (four dimensional- 4D) medical images are typically huge in file size and require a vast amount of resources for storage and transmission purposes. In this paper, we propose an efficient lossless compression method for 4D medical images that is based on a multi-frame motion compensation process employing a 4D search, variable block- sizes and bi-directional prediction. Data redundancies are reduced by recursively applying multi-frame motion compensation in the spatial and temporal dimensions. The proposed method also uses a novel differential coding algorithm to reduce redundancies in motion vectors and a new context-based adaptive binary arithmetic coder (CABAC) for compression of the residual data. Performance evaluations on real medical images of varying modality resulted in lossless compression ratios of up to 16:1.

动态体积(四维- 4D)医学图像通常具有巨大的文件大小，并且需要大量的资源用于存储和传输。在本文中，我们提出了一种高效的四维医学图像无损压缩方法，该方法基于多帧运动补偿过程，采用四维搜索、可变块大小和双向预测。通过递归地在空间和时间维度上应用多帧运动补偿来减少数据冗余。该方法还使用了一种新的差分编码算法来减少运动向量的冗余，并使用了一种新的基于上下文的自适应二进制算法编码器(CABAC)来压缩残差数据。对不同模态的真实医学图像进行性能评估，得到高达16:1的无损压缩比。

引用次数: 25

Media processor architecture for video and imaging on camera phones 用于照相手机上的视频和成像的媒体处理器架构

2008 IEEE International Conference on Acoustics, Speech and Signal Processing

Pub Date : 2008-05-12 DOI: 10.1109/ICASSP.2008.4518866

Joseph Meehan

A dedicated media processor is used in many camera phones to accelerate video and image processing. Increased demand for higher pixel resolution and higher quality image and video processing necessitates dramatically increased signal processing capability. To provide the increased performance at acceptable cost and power requires redesign of the traditional architecture. By wisely partitioning algorithms across programmable and fixed- function blocks, the performance goals can be met while still maintaining flexibility for new feature requirements and new standards. In this paper we provide an overview of media processor architectures for camera phones and describe the system architecture, power, and performance. We also address the challenges in supporting new imaging trends and high resolution video at low power and cost.

许多照相手机都使用专用的媒体处理器来加速视频和图像处理。对更高像素分辨率和更高质量的图像和视频处理的需求不断增加，这就需要显著提高信号处理能力。为了在可接受的成本和功耗下提供更高的性能，需要重新设计传统架构。通过在可编程和固定功能块之间明智地划分算法，可以在满足性能目标的同时保持对新功能需求和新标准的灵活性。本文概述了相机手机的媒体处理器架构，并描述了系统架构、功耗和性能。我们还解决了以低功耗和低成本支持新成像趋势和高分辨率视频的挑战。

引用次数: 6

Confidence measures for acoustic detection of film slates based on time-domain features 基于时域特征的膜板声检测置信度测度

2008 IEEE International Conference on Acoustics, Speech and Signal Processing

Pub Date : 2008-05-12 DOI: 10.1109/ICASSP.2008.4517565

M. Schlosser

An acoustic detector for film slates is proposed to assist a human operator with the synchronization of audio and video in post-production. To be computationally efficient, the signal analysis is restricted to time-domain features. Although the features are statistically dependent, separate classifiers are trained for each of them. The statistical dependence is taken into account during the combination of the log-likelihood ratios provided by the individual classifiers. The overall confidence in a classification is determined as a weighted sum of the individual log-likelihood ratios, where the weights depend on the correlation between the different features. Experimental results for real-world recordings from film sets show that the confidence measures allow for a fast identification of the film slates while minimizing the interference from false detections.

提出了一种用于电影板的声学检测器，以帮助操作员在后期制作中实现音频和视频的同步。为了提高计算效率，信号分析被限制在时域特征上。尽管这些特征在统计上是相关的，但是为每一个特征都训练了单独的分类器。在组合单个分类器提供的对数似然比时，考虑了统计依赖性。分类的总体置信度由单个对数似然比的加权和确定，其中权重取决于不同特征之间的相关性。来自电影集的真实世界记录的实验结果表明，置信度措施允许快速识别电影板，同时最大限度地减少假检测的干扰。

引用次数: 0

A programmable analog radial-basis-function based classifier 基于径向基函数的可编程模拟分类器

2008 IEEE International Conference on Acoustics, Speech and Signal Processing

Pub Date : 2008-05-12 DOI: 10.1109/ICASSP.2008.4517887

Sheng-Yu Peng, Yu-Chi Tsao, P. Hasler, David V. Anderson

A 16 x 16 programmable analog radial-basis-function (RBF) based classifier is demonstrated. The distribution of each feature is modeled by a Gaussian function, which is realized by a proposed floating-gate bump circuit having bell-shaped transfer characteristics. The maximum likelihood, mean, and variance of the distribution are stored in floating-gate transistors and are independently programmable. By cascading these floating-gate bump circuits, the overall transfer characteristics approximate a multivariate Gaussian distribution with a diagonal covariance matrix. An array of these circuits constitutes a compact RBF-based classifier. When followed by a winner-take-all circuit, the analog classifier can implement vector quantization. Automatic gender identification is implemented on a 16 x 16 analog vector quantizer chip as one possible audio application of this work. The performance of the analog classifier is comparable to that of digital counter -parts. The proposed approach can be at least two orders of magnitude more power efficient than the digital microprocessors at the same task.

演示了一个16 × 16可编程模拟径向基函数(RBF)分类器。每个特征的分布用高斯函数建模，该模型由具有钟形传递特性的浮门碰撞电路实现。分布的最大似然、均值和方差存储在浮栅晶体管中，并可独立编程。通过级联这些浮门碰撞电路，总体传输特性近似于具有对角协方差矩阵的多元高斯分布。这些电路的阵列构成了一个紧凑的基于rbf的分类器。当采用赢家通吃电路时，模拟分类器可以实现矢量量化。自动性别识别是在一个16 × 16模拟矢量量化芯片上实现的，作为这项工作的一个可能的音频应用。模拟分类器的性能可与数字计数器相媲美。所提出的方法可以比数字微处理器在相同的任务中至少节省两个数量级的功率。

引用次数: 6

A new color image regularization scheme for blind image deconvolution 一种新的彩色图像正则化盲图像反卷积方案

2008 IEEE International Conference on Acoustics, Speech and Signal Processing

Pub Date : 2008-05-12 DOI: 10.1109/ICASSP.2008.4517844

Yu He, Kim-Hui Yap, Li Chen, Lap-Pui Chau

This paper proposes a new regularization scheme to address blind color image deconvolution. Conventional blind monochromatic image deconvolution algorithms handle each color channel independently, thereby ignoring the inter-channel correlation present in the color images. Further, most existing blind color deconvolution algorithms do not take the parametric information of the blurs into consideration. In view of these, a regularization scheme is proposed to perform blind color image deconvolution. A new regularization operator is developed in the blur domain. A reinforcement blur modeling scheme is adopted to evaluate the relevance of manifold parametric blur structures, and the information is integrated into the deconvolution scheme. In addition, a regularization scheme for image is developed to recover edges of color images and reduce color artifacts. Experimental results show that the method is able to achieve satisfactory restored color images under noisy environment.

提出了一种新的正则化方案来解决彩色图像的盲反卷积问题。传统的盲单色图像反卷积算法独立处理每个颜色通道，从而忽略了彩色图像中存在的通道间相关性。此外，现有的大多数盲颜色反卷积算法都没有考虑到模糊的参数信息。针对这些问题，提出了一种用于彩色图像盲反卷积的正则化方案。提出了一种新的模糊域正则化算子。采用强化模糊建模方案评估流形参数模糊结构的相关性，并将信息集成到反卷积方案中。此外，还提出了一种图像正则化方案，以恢复彩色图像的边缘，减少彩色伪影。实验结果表明，该方法能够在噪声环境下获得满意的彩色图像恢复效果。

引用次数: 5

A robust and computationally efficient method for tonal active noise control using a simplified secondary path model 采用简化的二次路径模型，提出了一种鲁棒且计算效率高的调性主动噪声控制方法

2008 IEEE International Conference on Acoustics, Speech and Signal Processing

Pub Date : 2008-05-12 DOI: 10.1109/ICASSP.2008.4517617

Flavio P. Ribeiro, V. Nascimento

We present a computationally efficient method for sinusoidal noise cancellation based on the FXLMS algorithm. It features subsampling in order to increase convergence speed and decrease computational requirements, and most importantly, does not require extra noise added to the filter output for secondary path identification. In addition, it is robust to secondary path variations and in low SNR scenarios, which are frequently found in practical active noise control systems, features fast tracking and can be directly generalized to multichannel systems. We illustrate its operation with simulations.

基于FXLMS算法，提出了一种计算效率高的正弦噪声消除方法。它的特点是子采样，以提高收敛速度和减少计算需求，最重要的是，不需要在次要路径识别的滤波器输出中添加额外的噪声。此外，它对次级路径变化和低信噪比情况具有鲁棒性，这在实际的主动噪声控制系统中经常发现，具有快速跟踪的特点，可以直接推广到多通道系统。我们用模拟来说明它的操作。

引用次数: 0

A nobel key-search method for side channel attacks based on pattern recognition 基于模式识别的边信道攻击的诺贝尔键搜索方法

2008 IEEE International Conference on Acoustics, Speech and Signal Processing

Pub Date : 2008-05-12 DOI: 10.1109/ICASSP.2008.4517974

You-Seok Lee, YongJe Choi, Dong‐Guk Han, Ho-Yeon Kim, Hyoung-Nam Kim

Differential power analysis (DPA) has been known as an efficient attack for finding secret keys of cryptosystems but its efficiency may be lowered due to the misalignment of the acquired signals. Though the misalignment problem has been now solvable by various successful approaches in DPA, a lot of power traces are still required to find correct keys. Since the required number of power traces is directly connected with the efficiency of SCAs, we propose a key-search method even with relatively reduced number of power traces based on recognizing special patterns of the signal caused by cryptographic operations. Experimental results show that the proposed method is able to search correct keys with much smaller number of traces than the minimum number of traces with which the conventional methods of the energy-based DPA and frequency-based DPA succeed in finding keys.

差分功率分析(DPA)被认为是一种有效的密码系统密钥查找方法，但由于获取的信号不一致，其效率会降低。虽然在DPA中，失调问题已经通过各种成功的方法得到了解决，但仍然需要大量的电源走线来找到正确的键。由于所需的功率走线数量与sca的效率直接相关，因此我们提出了一种基于识别由密码操作引起的信号的特殊模式的密钥搜索方法，即使功率走线数量相对较少。实验结果表明，与传统的基于能量的DPA和基于频率的DPA的最小轨迹数相比，该方法能够以更少的轨迹数搜索到正确的密钥。

引用次数: 3

CCA for joint blind source separation of multiple datasets with application to group FMRI analysis 多数据集联合盲源分离的CCA方法及其在成组FMRI分析中的应用

2008 IEEE International Conference on Acoustics, Speech and Signal Processing

Pub Date : 2008-05-12 DOI: 10.1109/ICASSP.2008.4517990

Yi-Ou Li, Wei Wang, T. Adalı, V. Calhoun

In this work, we propose a scheme for joint blind source separation (BSS) of multiple datasets using canonical correlation analysis (CCA). The proposed scheme jointly extracts sources from each dataset in the order of between-set source correlations. We show that, when sources are uncorrelated within each dataset and correlated across different datasets only on corresponding indices, (i) CCA on two datasets achieves BSS when the sources from the two datasets have distinct between-set correlation coefficients, and (ii) CCA on multiple datasets (M-CCA) achieves BSS with a more relaxed condition on the between-set source correlation coefficients compared to CCA on two datasets. We present simulation results to demonstrate the properties of CCA and M-CCA on joint BSS. We apply M-CCA to group functional magnetic resonance imaging (fMRI) data acquired from several subjects performing a visuomotor task and obtain interesting brain activations as well as their correlation profiles across different subjects in the group.

本文提出了一种基于典型相关分析(CCA)的多数据集联合盲源分离(BSS)方案。该方案按照集间源相关性的顺序从每个数据集中联合提取源。我们表明，当每个数据集中的源不相关，并且不同数据集之间仅在相应的指标上相关时，(i)当两个数据集的源具有不同的集间相关系数时，两个数据集上的CCA实现了BSS，并且(ii)与两个数据集上的CCA相比，多个数据集上的CCA (M-CCA)在集间源相关系数条件更宽松的情况下实现了BSS。仿真结果验证了CCA和M-CCA在联合BSS中的性能。我们将M-CCA应用于从执行视觉运动任务的几个受试者中获得的组功能磁共振成像(fMRI)数据，并获得了有趣的大脑激活及其在组中不同受试者之间的相关性曲线。

引用次数: 15

Automatic phonetics-driven reconstruction of medical dictations on multiple levels of segmentation 基于多级分割的医学听写自动语音驱动重建

2008 IEEE International Conference on Acoustics, Speech and Signal Processing

Pub Date : 2008-05-12 DOI: 10.1109/ICASSP.2008.4518610

Štefan Petrík, F. Pernkopf

Automatic phonetic reconstruction of medical dictations from non- literal and automatically recognized speech transcripts leads to closer-to-literal transcripts for training. In this paper, we introduce an extended alignment method assessing multiple levels of text segmentation and show how open issues like wrong segmentation in the recognized transcript can be resolved. Furthermore, the effect of context-dependent reconstruction and the phonetic similarity threshold on the quality of the reconstructed transcription is measured. Experiments show an increase in precision between 0.7% and 4.7% absolute without loss in recall for the combined system incorporating all of these techniques in comparison to the system in the previous work.

从非文字和自动识别的语音抄本中自动重建医学听写的语音，从而获得更接近文字的训练抄本。在本文中，我们介绍了一种扩展的对齐方法来评估多层次的文本分割，并展示了如何解决识别文本中的错误分割等开放性问题。此外，还测量了上下文相关重构和语音相似阈值对重构转录质量的影响。实验表明，与之前的系统相比，结合所有这些技术的组合系统的绝对精度提高了0.7%到4.7%，而召回率没有下降。

引用次数: 2

Active noise control of noisy periodic signals using signal separation 利用信号分离技术对含噪周期信号进行主动噪声控制

2008 IEEE International Conference on Acoustics, Speech and Signal Processing

Pub Date : 2008-05-12 DOI: 10.1109/ICASSP.2008.4517935

G. Kannan, A. Milani, I. Panahi

Periodic signals (since they can be easily predicted) can be canceled much more effectively when compared to non- periodic/stochastic signals. A large class of acoustic noise sources have an underlying periodic process that generates a periodic noise component, and thus the acoustic noise can in general be modeled as the sum of a periodic signal and a random signal (usually a background noise). In this paper we present the idea that separating the acoustic noise into periodic and random noise components and doing separate active noise control(ANC) for each tends to increase the over-all noise attenuation level (NAL). Formulae for the exact improvement in noise attenuation levels are derived. A novel signal separation and noise cancelation scheme based on adaptive filtering is developed and its effectiveness is shown for several periodic signal in white noise cases.

与非周期/随机信号相比，周期信号(因为它们很容易预测)可以更有效地消除。一大类噪声源具有产生周期性噪声分量的潜在周期性过程，因此声学噪声通常可以建模为周期信号和随机信号(通常是背景噪声)的总和。本文提出了将噪声分成周期性噪声和随机噪声两部分并分别进行主动噪声控制(ANC)的思想，这有利于提高总体噪声衰减水平(NAL)。导出了精确改进噪声衰减水平的公式。提出了一种新的基于自适应滤波的信号分离与噪声消除方案，并对白噪声情况下的多个周期信号进行了验证。

引用次数: 12

首页上一页

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

2008 IEEE International Conference on Acoustics, Speech and Signal Processing

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀