2012 IEEE International Symposium on Multimedia最新文献

英文中文

Spectral Noise Gate Technique Applied to Birdsong Preprocessing on Embedded Unit 频谱噪声门技术在嵌入式单元鸟鸣信号预处理中的应用

2012 IEEE International Symposium on Multimedia

Pub Date : 2012-12-10 DOI: 10.1109/ISM.2012.12

Davi Miara Kiapuchinski, C. Lima, Celso A. A. Kaestner

This paper proposes an approach for audio preprocessing and noise removal from recordings obtained in natural environments. The method is inspired in the acoustic signature of the audio, and aims to preprocess the recordings of bird songs obtained directly in the field. Using the Spectral Noise Gate technique, the undesired noise is removed on a real application in real time during the recording using an embedded environment. In addition, important statistic features of the audio signal are computed. The main purpose on approach is to eliminate the manual and tedious process of preparing the audio recordings done in the field in order to make them ready to be used as input in other tasks, such as the automatic classification of bird species from recorded bird songs. This is necessary because classification results depend widely from the quality of the input data.

本文提出了一种对自然环境下的录音进行音频预处理和去噪的方法。该方法受到音频声学特征的启发，旨在对直接在野外获得的鸟类鸣叫录音进行预处理。使用频谱噪声门技术，在使用嵌入式环境录制过程中实时去除实际应用中不需要的噪声。此外，还计算了音频信号的重要统计特征。该方法的主要目的是消除在野外进行的手工和繁琐的录音准备过程，以便将其用作其他任务的输入，例如从记录的鸟类鸣叫中自动分类鸟类物种。这是必要的，因为分类结果很大程度上取决于输入数据的质量。

引用次数: 15

Facial Expression Recognition Using Dual Layer Hierarchical SVM Ensemble Classification 基于双层层次支持向量机集成分类的面部表情识别

2012 IEEE International Symposium on Multimedia

Pub Date : 2012-12-10 DOI: 10.1109/ISM.2012.104

Mahesh Babu Mariappan, Myunghoon Suk, B. Prabhakaran

In this paper, we present our approach for automatic facial expression recognition. We use a feature extraction technique inspired by our empirical study on human recognition of facial expressions. We propose our dual-layer hierarchical SVM ensemble mechanism for classification. We also provide system architecture and system implementation details in this paper.

在本文中，我们提出了一种自动面部表情识别方法。我们使用了一种特征提取技术，灵感来自于我们对人类面部表情识别的实证研究。提出了一种双层分层支持向量机集成分类机制。本文还提供了系统架构和系统实现细节。

引用次数: 6

Person-Independent Deformable Templates for Fast Face Recognition 用于快速人脸识别的独立于人的可变形模板

2012 IEEE International Symposium on Multimedia

Pub Date : 2012-12-10 DOI: 10.1109/ISM.2012.89

S. Clippingdale, Mahito Fujii

A face tracking and recognition system has been developed based on deformable template matching [1]. Person-dependent deformable templates are used for recognition, and person-independent deformable templates for tracking. The computational load associated with recognition is greater than that associated with tracking, because the number of person-dependent templates that must be deformed and matched against the input frame is equal to the number of registered individuals, whereas there is only a single person-independent template (per pose cell) for tracking. In this work, we show how person-independent templates can be used for recognition as well as tracking, resulting in a substantial reduction in the computation associated with recognition in the system of [1] (and potentially by extension in similar systems), at relatively small cost in recognition performance.

开发了一种基于可变形模板匹配的人脸跟踪识别系统。与人相关的可变形模板用于识别，与人无关的可变形模板用于跟踪。与识别相关的计算负荷大于与跟踪相关的计算负荷，因为必须根据输入帧进行变形和匹配的人相关模板的数量等于注册个体的数量，而用于跟踪的人相关模板(每个位姿单元)只有一个。在这项工作中，我们展示了如何将独立于人的模板用于识别和跟踪，从而大大减少了[1]系统中与识别相关的计算(并可能扩展到类似的系统中)，而识别性能的成本相对较小。

引用次数: 1

Discriminative Multiple Canonical Correlation Analysis for Multi-feature Information Fusion 多特征信息融合的判别多重典型相关分析

2012 IEEE International Symposium on Multimedia

Pub Date : 2012-12-10 DOI: 10.1109/ISM.2012.15

Lei Gao, L. Qi, E. Chen, L. Guan

This paper presents a novel approach for multi-feature information fusion. The proposed method is based on the Discriminative Multiple Canonical Correlation Analysis (DMCCA), which can extract more discriminative characteristics for recognition from multi-feature information representation. It represents the different patterns among multiple subsets of features identified by minimizing the Frobenius norm. We will demonstrate that the Canonical Correlation Analysis (CCA), the Multiple Canonical Correlation Analysis (MCCA), and the Discriminative Canonical Correlation Analysis (DCCA) are special cases of the DMCCA. The effectiveness of the DMCCA is demonstrated through experimentation in speaker recognition and speech-based emotion recognition. Experimental results show that the proposed approach outperforms the traditional methods of serial fusion, CCA, MCCA and DCCA.

提出了一种新的多特征信息融合方法。该方法基于判别多重典型相关分析(Discriminative Multiple Canonical Correlation Analysis, DMCCA)，可以从多特征信息表示中提取更多的判别特征用于识别。它表示通过最小化Frobenius范数确定的多个特征子集之间的不同模式。我们将证明典型相关分析(CCA)，多典型相关分析(MCCA)和判别典型相关分析(DCCA)是DMCCA的特殊情况。通过说话人识别和基于语音的情感识别实验，验证了该方法的有效性。实验结果表明，该方法优于传统的串行融合、CCA、MCCA和DCCA方法。

引用次数: 23

Segmentation Tree Based Multiple Object Image Retrieval 基于分割树的多目标图像检索

2012 IEEE International Symposium on Multimedia

Pub Date : 2012-12-10 DOI: 10.1109/ISM.2012.49

Wei-bang Chen, Chengcui Zhang, Song Gao

Inaccurate image segmentation often has a negative impact on object-based image retrieval. Researchers have attempted to alleviate this problem by using hierarchical image representation. However, these attempts suffer from the inefficiency in building the hierarchical image representation and the high computational complexity in matching two hierarchically represented images. Existing approaches construct the hierarchical image representation in two steps. The first step is to perform segmentation at different image resolutions, and the second step is to construct a hierarchical representation of the image by associating segments from different resolutions. In this research, an innovative all-in-one run approach is proposed that concurrently performs image segmentation and hierarchical tree construction, producing a hierarchical region tree to represent the image. In addition, an efficient hierarchical region tree matching algorithm is proposed with a reasonably low time complexity and used in multiple object image retrieval. The experimental results demonstrate the efficacy and efficiency of the proposed approach.

不准确的图像分割往往会对基于对象的图像检索产生负面影响。研究人员试图通过使用分层图像表示来缓解这个问题。然而，这些尝试受到构建分层图像表示的低效率和匹配两个分层表示的图像的高计算复杂度的影响。现有的方法分两步构建分层图像表示。第一步是在不同的图像分辨率下进行分割，第二步是通过关联不同分辨率的图像片段来构建图像的分层表示。在本研究中，提出了一种创新的一体化运行方法，同时进行图像分割和分层树构建，生成分层区域树来表示图像。此外，提出了一种时间复杂度较低的高效层次区域树匹配算法，并将其应用于多目标图像检索中。实验结果证明了该方法的有效性和有效性。

引用次数: 6

A Study on Difficulty Level Recognition of Piano Sheet Music 钢琴活页乐谱难度等级识别研究

2012 IEEE International Symposium on Multimedia

Pub Date : 2012-12-10 DOI: 10.1109/ISM.2012.11

Shih-Chuan Chiu, Min-Syan Chen

Looking for a piano sheet music with proper difficulty for a piano learner is always an important work to his/her teacher. In the paper, we study on a new and challenging issue of recognizing the difficulty level of piano sheet music. To analyze the semantic content of music, we focus on symbolic music, i.e., sheet music or score. Specifically, difficulty level recognition is formulated as a regression problem to predict the difficulty level of piano sheet music. Since the existing symbolic music features are not able to capture the characteristics of difficulty, we propose a set of new features. To improve the performance, a feature selection approach, RReliefF, is used to select relevant features. An extensive performance study is conducted over two real datasets with different characteristics to evaluate the accuracy of the regression approach for predicting difficulty level. The best performance evaluated in terms of the R2 statistics over two datasets reaches 39.9% and 38.8%, respectively.

为钢琴学习者寻找一首难度适中的钢琴乐谱一直是钢琴教师的一项重要工作。本文研究了钢琴活页乐谱难度等级识别这一具有挑战性的新问题。为了分析音乐的语义内容，我们将重点放在符号音乐上，即乐谱或乐谱。具体来说，难度等级识别被表述为一个回归问题来预测钢琴活页乐谱的难度等级。由于现有的符号音乐特征无法捕捉难度特征，我们提出了一套新的特征。为了提高性能，使用特征选择方法RReliefF来选择相关的特征。在两个具有不同特征的真实数据集上进行了广泛的性能研究，以评估回归方法预测难度级别的准确性。根据两个数据集的R2统计值评估的最佳性能分别达到39.9%和38.8%。

引用次数: 14

Automatic Camera Control for Tracking a Presenter during a Talk 在演讲过程中跟踪主讲人的自动摄像机控制

2012 IEEE International Symposium on Multimedia

Pub Date : 2012-12-10 DOI: 10.1109/ISM.2012.96

M. Winkler, Kai Michael Höver, Aristotelis Hadjakos, M. Mühlhäuser

Today, talks, presentations, and lectures are often captured on video to give a broad audience the possibility to (re-)access the content. As presenters are often moving around during a talk it is necessary to guide recording cameras. We present an automatic solution for user tracking and camera control. It uses a depth camera for user tracking, and a scalable networking architecture based on publish/subscribe messaging for controlling multiple video cameras. Furthermore, we present our experiences with the system during actual lectures at an university.

今天，谈话、演示和讲座经常被拍摄成视频，让广大观众有可能(重新)访问内容。由于演讲人在演讲过程中经常走动，因此有必要引导录音摄像机。我们提出了一个用户跟踪和相机控制的自动解决方案。它使用深度摄像头进行用户跟踪，并使用基于发布/订阅消息的可扩展网络架构来控制多个摄像头。此外，我们还在一所大学的实际讲座中介绍了我们使用该系统的经验。

引用次数: 21

Mediating Multimedia Traffic with Strict Delivery Constraints 具有严格交付约束的多媒体流量中介

2012 IEEE International Symposium on Multimedia

Pub Date : 2012-12-10 DOI: 10.1109/ISM.2012.53

Michael Karl, Tatiana Polishchuk, T. Herfet, A. Gurtov

Internet multimedia traffic currently occupies more than half of the total Internet traffic and it continues to expand tremendously. Targeting to meet strict constraints imposed by the requirements of real-time multimedia applications appropriate error-correction techniques should be implemented within the data dissemination network. We propose to introduce multipurpose relay nodes called Mediators into several positions within the tree networks typical for multicasting and broadcasting scenarios. By utilizing the error-correction domain separation paradigm in combination with selective insertion of the supplementary data from parallel networks, when the corresponding content is available, the proposed mechanism reduces the total network load and improves scalability of multicast/broadcast transmission. We share our view on how the existing application frameworks could benefit from the incremental deployment of the proposed mechanism. Experimental results confirm suitability and applicability of our assumptions.

互联网多媒体流量目前占据了互联网总流量的一半以上，并且还在不断地急剧增长。为了满足实时多媒体应用的严格要求，应在数据传播网络内实施适当的纠错技术。我们建议在多播和广播场景的树状网络中引入称为中介器的多用途中继节点。该机制利用纠错域分离模式，结合并行网络中补充数据的选择性插入，在相应内容可用的情况下，降低了网络总负载，提高了组播/广播传输的可扩展性。对于现有的应用程序框架如何从提议的机制的增量部署中获益，我们分享了我们的观点。实验结果证实了我们假设的适宜性和适用性。

引用次数: 2

2D-FRFT Based Rotation Invariant Digital Image Watermarking 基于2D-FRFT的旋转不变数字图像水印

2012 IEEE International Symposium on Multimedia

Pub Date : 2012-12-10 DOI: 10.1109/ISM.2012.60

Lei Gao, L. Qi, Shou-yi Yang, Yongjin Wang, Tie Yun, L. Guan

The extraction of rotation invariant representation is important for many signal processing problems such as image analysis, computer vision, and pattern recognition. In this paper, we present a systematic analysis of the Two-Dimensional Fractional Fourier Transform (2D-FRFT), and show that under certain conditions, the 2D-FRFT technique possesses the attractive property of rotation invariance. Based on our analysis, we proposed a novel digital image watermarking method which combines 2D chirp signal with the addition and rotation invariant properties of 2D-FRFT to achieve improved robustness and security. The effectiveness of the proposed solution is demonstrated through experiments.

旋转不变表示的提取对于图像分析、计算机视觉和模式识别等信号处理问题具有重要意义。本文对二维分数阶傅里叶变换(2D-FRFT)进行了系统的分析，并证明在一定条件下，2D-FRFT技术具有旋转不变性的吸引人的特性。在此基础上，我们提出了一种新的数字图像水印方法，该方法将二维啁啾信号与二维frft的加法和旋转不变性结合起来，以提高鲁棒性和安全性。通过实验验证了该方法的有效性。

引用次数: 5

Feature-Based Multi-sensor Images Alignment and Enhancement 基于特征的多传感器图像对齐与增强

2012 IEEE International Symposium on Multimedia

Pub Date : 2012-12-10 DOI: 10.1109/ISM.2012.45

Myung-Ho Ju, Sung-Yong Kim, Hang-Bong Kang

This paper presents an efficient image alignment and image enhancement method for multi-sensor images. The shape of the object captured in one of the multi-sensor images can be found by similar edges but different contrasts in the other multi-sensor image. Using this cue, our approach is based on the magnitudes of the oriented edges and results in the fast alignment method by feature matching between multi-sensor images. To enhance the image with aligned multi-sensor images, we estimate a salient region mask which covers the information of all input images. Our experimental results show that our proposed method can efficiently align multi-sensor images and enhance them better than the current methods.

针对多传感器图像，提出了一种高效的图像对齐和图像增强方法。在一幅多传感器图像中捕获的物体的形状可以通过在另一幅多传感器图像中相似的边缘而不同的对比度来发现。利用这一线索，我们的方法基于定向边缘的大小，并通过多传感器图像之间的特征匹配得到快速对齐方法。为了用对齐的多传感器图像增强图像，我们估计了一个覆盖所有输入图像信息的显著区域掩模。实验结果表明，该方法可以有效地对多传感器图像进行对齐，并具有较好的增强效果。

引用次数: 0

首页上一页

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

2012 IEEE International Symposium on Multimedia

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀