2013 IEEE International Symposium on Broadband Multimedia Systems and Broadcasting (BMSB): Latest Papers

Predicting Key Recognition Difficulty in Polyphonic Audio
C. Chuan, Aleksey Charapko
In this paper, we present statistical models to predict the difficulty of recognizing musical keys from polyphonic audio signals. Automatic audio key finding has been studied for many years, and various approaches have been proposed and reported. Reports of these methods' performance are usually based on the proposers' own data sets. Without details on the data set, i.e., how challenging the data set is, directly comparing the effectiveness of these methods is not meaningful or even possible. Thus, in this study we focus on predicting the difficulty level of key recognition as perceived by human experts. Given an audio recording, represented as the extracted acoustic features, we apply multiple linear regression and a proportional odds model to predict the difficulty level of the recording, annotated by experts as an integer on a 5-point Likert scale. We use four metrics to evaluate our prediction results: root mean square error, Pearson correlation coefficient, exact accuracy, and adjacent accuracy. We also examine the differences between experts' annotations and discuss their consistency.
DOI: https://doi.org/10.1109/ISM.2013.82. Pages 421-426. Published 2013-12-09.
Citations: 1
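The four evaluation metrics named in the abstract of "Predicting Key Recognition Difficulty in Polyphonic Audio" can be illustrated with a short Python sketch. This is not the authors' code. It assumes that continuous regression outputs are rounded to the nearest level on the 1 to 5 scale before accuracy is computed, and that "adjacent accuracy" counts a prediction as correct when it lies within one level of the expert annotation. The abstract defines neither detail, and all function names and sample values here are hypothetical.

```python
import math


def rmse(y_true, y_pred):
    """Root mean square error between annotations and raw predictions."""
    return math.sqrt(sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true))


def pearson(y_true, y_pred):
    """Pearson correlation coefficient between annotations and raw predictions."""
    n = len(y_true)
    mean_t = sum(y_true) / n
    mean_p = sum(y_pred) / n
    cov = sum((t - mean_t) * (p - mean_p) for t, p in zip(y_true, y_pred))
    var_t = sum((t - mean_t) ** 2 for t in y_true)
    var_p = sum((p - mean_p) ** 2 for p in y_pred)
    return cov / math.sqrt(var_t * var_p)


def to_level(score, lo=1, hi=5):
    """Assumption: round half up, then clip to the 5-point scale."""
    return min(hi, max(lo, int(math.floor(score + 0.5))))


def exact_accuracy(y_true, y_pred):
    """Fraction of recordings whose rounded prediction equals the annotation."""
    return sum(t == to_level(p) for t, p in zip(y_true, y_pred)) / len(y_true)


def adjacent_accuracy(y_true, y_pred):
    """Assumption: a rounded prediction within one level counts as correct."""
    return sum(abs(t - to_level(p)) <= 1 for t, p in zip(y_true, y_pred)) / len(y_true)


# Hypothetical expert annotations and model outputs, for illustration only.
expert = [1, 2, 3, 4, 5, 3]
predicted = [1.2, 2.6, 2.9, 2.4, 4.8, 3.1]
print(rmse(expert, predicted), pearson(expert, predicted))
print(exact_accuracy(expert, predicted), adjacent_accuracy(expert, predicted))
```

Reporting adjacent accuracy next to exact accuracy is useful on an ordinal scale, because it separates near misses from predictions that are off by two or more levels.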
Resolution Control for Size Bias Elimination in Multi-resolution Visual Matching
S. Clippingdale
Visual matching for tracking and recognition, for example in video indexing, often uses image features measured at multiple resolutions. As a tracked object moves away from the camera, appearing progressively smaller, the higher resolutions consecutively become unavailable for matching, causing step changes in the similarity or “match score” of the tracked object. If several candidate matches (hypotheses) are maintained for a tracked region, this effect causes a bias toward larger region hypotheses that match at one extra resolution relative to even slightly smaller hypotheses. The effect is subtle and appears intermittent because it occurs only around a specific discrete set of object sizes. We describe the problem and the class of visual matching methods that it affects, and propose a solution. We present experimental results from a real video indexing system to illustrate both the problem and the effectiveness of the proposed solution.
DOI: https://doi.org/10.1109/ISM.2013.87. Pages 451-456. Published 2013-12-09.
Citations: 0
Towards an Evaluation of Denoising Algorithms with Respect to Realistic Camera Noise
Tamara Seybold, Christian Keimel, Marion Knopp, W. Stechele
The development and tuning of denoising algorithms is usually based on readily processed test images that are artificially degraded with additive white Gaussian noise (AWGN). While AWGN allows us to easily generate test data in a repeatable manner, it does not reflect the noise characteristics in a real digital camera. Realistic camera noise is signal-dependent and spatially correlated due to the demosaicking step required to obtain full-color images. Hence, the noise characteristic is fundamentally different from AWGN. Using such unrealistic data to test, optimize and compare denoising algorithms may lead to incorrect parameter tuning or suboptimal choices in research on denoising algorithms. In this paper, we therefore propose an approach to evaluate denoising algorithms with respect to realistic camera noise: we describe a new camera noise model that includes the full processing chain of a single-sensor camera. We determine the visual quality of noisy and denoised test sequences using a subjective test with 18 participants. We show that the noise characteristics have a significant effect on visual quality. Quality metrics, which are required to compare denoising results, are applied, and we evaluate the performance of 10 full-reference metrics and one no-reference metric with our realistic test data. We conclude that a more realistic noise model should be used in future research to improve the quality estimation of digital images and videos and to improve the research on denoising algorithms.
DOI: https://doi.org/10.1109/ISM.2013.39. Pages 203-210. Published 2013-12-09.
Citations: 22
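The contrast drawn in the camera-noise abstract above, between AWGN and signal-dependent noise, can be made concrete with a small Python sketch. It is not the camera model proposed in the paper. As a stated assumption it uses a simple variance law `a * x + b` (noise variance growing linearly with pixel intensity `x`), and it ignores the spatial correlation introduced by demosaicking. The parameters `a`, `b`, and `sigma` and all function names are hypothetical.

```python
import math
import random


def add_awgn(pixels, sigma, rng):
    """Additive white Gaussian noise: the same standard deviation at every intensity."""
    return [x + rng.gauss(0.0, sigma) for x in pixels]


def add_signal_dependent(pixels, a, b, rng):
    """Assumed model: Gaussian noise whose variance is a * x + b for intensity x."""
    return [x + rng.gauss(0.0, math.sqrt(a * x + b)) for x in pixels]


def noise_std(clean, noisy):
    """Empirical standard deviation of the noise (noisy minus clean)."""
    errors = [n - c for c, n in zip(clean, noisy)]
    mean = sum(errors) / len(errors)
    return math.sqrt(sum((e - mean) ** 2 for e in errors) / len(errors))


rng = random.Random(0)
dark = [0.05] * 5000    # flat dark patch, intensities in [0, 1]
bright = [0.9] * 5000   # flat bright patch

# AWGN: the measured noise level is about the same in both patches.
print(noise_std(dark, add_awgn(dark, 0.05, rng)),
      noise_std(bright, add_awgn(bright, 0.05, rng)))

# Signal-dependent noise: the bright patch is visibly noisier than the dark one.
print(noise_std(dark, add_signal_dependent(dark, 0.01, 0.0001, rng)),
      noise_std(bright, add_signal_dependent(bright, 0.01, 0.0001, rng)))
```

A denoiser tuned for a single global `sigma` would, under this assumed model, tend to over-smooth dark regions or under-smooth bright ones, which is the kind of mistuning the abstract warns about.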
Unsupervised Co-segmentation of Complex Image Set via Bi-harmonic Distance Governed Multi-level Deformable Graph Clustering
Jizhou Ma, Shuai Li, A. Hao, Hong Qin
Despite the recent success of extensive co-segmentation studies, they still suffer from limitations in accommodating multiple-foreground, large-scale, high-variability image sets, as well as in their underlying capability for parallel implementation. To address this, this paper proposes a bi-harmonic distance governed flexible method for the robust coherent segmentation of the overlapping/similar contents co-existing in an image group, which is independent of supervised learning and any other user-specified prior. The central idea is the novel integration of bi-harmonic distance metric design and multi-level deformable graph generation for multi-level clustering, which gives rise to a host of unique advantages: accommodating multiple-foreground images, respecting both local structures and global semantics of images, being more robust and accurate, and being convenient for parallel acceleration. The critical pipeline of our method involves intrinsic content-coherent measuring, super-pixel assisted bottom-up clustering, and multi-level deformable graph clustering based cross-image optimization. We conduct extensive experiments on the iCoseg benchmark and Oxford flower datasets, and make comprehensive evaluations to demonstrate the superiority of our method via comparison with state-of-the-art methods collected in the MSRC database.
DOI: https://doi.org/10.1109/ISM.2013.16. Pages 38-45. Published 2013-12-09.
Citations: 3
A Quantitative Analysis of a Virtual Programming Lab
Jan Vanvinkenroye, Christoph Grüninger, C. Heine, T. Richter
We implemented a survey with one learning group using the web-based tools and a control group working with a traditional setup based on editor and compiler. In a recent publication, we described the design and implementation of a web-based programming lab (ViPLab) targeted at undergraduate Engineering and Mathematics courses. This work provides a quantitative analysis of the user feedback, experience and learning success. The survey shows that web-based installations are as efficient as classical tools, while Windows users prefer the web-based chain over the editor/compiler installation on Linux. This justifies the use of web-based installations in programming beginner courses, if the learning target focuses on programming and not a particular tool chain.
DOI: https://doi.org/10.1109/ISM.2013.88. Pages 457-461. Published 2013-12-09.
Citations: 6
Pitch Marking Using the Fundamental Signal for Speech Modifications via TDPSOLA
F. Ykhlef, L. Bendaouia
The quality of synthetic speech offered by pitch and duration modifications via Time Domain Pitch Synchronous Overlap Add method (TD-PSOLA) relies on an accurate positioning of pitch marks. In this paper, we propose a new pitch marking technique of voiced regions based on the fundamental signal of the speech waveform. By using the valleys of the fundamental signal, we locate a set of precise intervals where the exact instants of pitch marks are expected to be found. The fundamental signal is composed only of the fundamental frequency (pitch) of speech. It is represented by a specific signal named "mean based signal" (MBS). The optimal pitch marks are found by extracting the set of global peak instants within the obtained intervals. To improve the performance of the proposed technique, we have proposed a post-processing stage which allows us to correct the erroneous pitch marks that may occur due to some synchronization problems. The proposed technique is evaluated on the CMU ARCTIC database by using objective and subjective measures. The experiments demonstrate that the proposed technique allows pitch and duration modifications via TD-PSOLA with high quality.
DOI: https://doi.org/10.1109/ISM.2013.28. Pages 118-124. Published 2013-12-09.
Citations: 3
Accurate Detection of Moving Objects in Traffic Video Streams over Limited Bandwidth Networks
Bo-Hao Chen, Shih-Chia Huang
Automated detection of moving objects is an essential task for any intelligent transportation system. However, conventional motion detection techniques often suffer from the loss of moving objects due to bit-rate variation in video streams transmitted via wireless video communication systems. To achieve motion detection that is both reliable and accurate in video streams of variable bit-rate, this paper proposes a novel motion detection approach which is based on grey relational analysis, and which integrates a multi-quality background generation module and a moving object detection module. As our experimental results demonstrate, the proposed approach attained superior motion detection performance compared to other state-of-the-art techniques based on qualitative and quantitative evaluations. Quantitative evaluations produced F1 and Similarity accuracy scores for the proposed approach that were up to 59.96% and 55.42% higher than those of the other compared techniques, respectively.
DOI: https://doi.org/10.1109/ISM.2013.20. Pages 69-75. Published 2013-12-09.
Citations: 2
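The F1 and Similarity scores cited in the motion-detection abstract above are pixel-level comparisons between a detected foreground mask and a ground-truth mask. The Python sketch below assumes the definitions commonly used in that literature, F1 = 2PR / (P + R) and Similarity = TP / (TP + FP + FN). The abstract itself does not give the formulas, so treat both as assumptions. The function name and the toy masks are hypothetical.

```python
def detection_scores(pred_mask, true_mask):
    """Return (F1, Similarity) for two binary masks given as lists of rows."""
    tp = fp = fn = 0
    for pred_row, true_row in zip(pred_mask, true_mask):
        for pred, true in zip(pred_row, true_row):
            if pred and true:
                tp += 1   # moving pixel correctly detected
            elif pred and not true:
                fp += 1   # background pixel flagged as moving
            elif true and not pred:
                fn += 1   # moving pixel missed
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    f1 = 2 * precision * recall / (precision + recall) if precision + recall else 0.0
    similarity = tp / (tp + fp + fn) if tp + fp + fn else 0.0
    return f1, similarity


# Toy 2x3 masks: 1 marks a pixel labeled as part of a moving object.
detected = [[1, 1, 0],
            [0, 1, 0]]
ground_truth = [[1, 0, 0],
                [0, 1, 1]]
print(detection_scores(detected, ground_truth))
```

Neither score counts true negatives, so a large static background does not inflate the result. That matters for traffic video, where most pixels are background.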
An Improvement in Media Discovery Service Using Name Spotting
Manish Goswami, Lan Yang
The Digital Object Repository in the Digital Object Architecture stores a large number of audio/video media files. Lack of metadata in audio/video media files limits the media discovery service in the Digital Object Architecture from searching those media files. In this paper we designed a system that uses a name spotting module to extract the names, stores the extracted names with audio/video media files, simulates the media discovery service and reports the findings related to the improvement in searching the media files.
DOI: https://doi.org/10.1109/ISM.2013.83. Pages 427-432. Published 2013-12-09.
Citations: 0
Efficient Super Resolution Using Edge Directed Unsharp Masking Sharpening Method
Kuo-Shiuan Peng, F. Lin, Yi-Pai Huang, H. Shieh
This paper investigated the potential of real-time implementation in single image super resolution using the edge directed unsharp masking sharpening (EDUMS) method. To achieve efficient real-time implementation with unsharp masking sharpening, the resolution enhancement process needed only simple filtering operations without iterations. Also, with edge directed information as the prior of the unsharp masking sharpening method, the jaggy artifact was efficiently suppressed. Clear edge structures and vivid details of high resolution images with minimum artifacts were presented by the proposed method.
DOI: https://doi.org/10.1109/ISM.2013.100. Pages 508-509. Published 2013-12-09.
Citations: 5
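The super-resolution abstract above builds on unsharp masking, which needs only one blur and one weighted addition per sample, with no iterations. The Python sketch below shows plain unsharp masking on a 1-D signal. It does not reproduce the edge-directed prior of the EDUMS method, which the abstract does not specify. The box blur, the `amount` and `radius` parameters, and the function names are illustrative assumptions.

```python
def box_blur(signal, radius=1):
    """Moving-average blur; windows are truncated at the signal borders."""
    n = len(signal)
    blurred = []
    for i in range(n):
        window = signal[max(0, i - radius):min(n, i + radius + 1)]
        blurred.append(sum(window) / len(window))
    return blurred


def unsharp_mask(signal, amount=1.0, radius=1):
    """Sharpen by adding back the detail removed by the blur:
    output = x + amount * (x - blur(x))."""
    blurred = box_blur(signal, radius)
    return [x + amount * (x - b) for x, b in zip(signal, blurred)]


# A step edge: flat regions are unchanged, the samples next to the edge overshoot.
edge = [0.0, 0.0, 0.0, 10.0, 10.0, 10.0]
print(unsharp_mask(edge))
```

The overshoot on both sides of the step is what makes edges look crisper. Applied uniformly it also amplifies staircase ("jaggy") artifacts after upscaling, which is presumably why the paper steers the sharpening with edge direction information.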
Trimmed Non-local Means Technique for Mixed Noise Removal in Color Images
Krystian Radlak, B. Smolka
In this paper, a novel approach to mixed noise removal in color images is proposed. The described method is a generalization of the Non-Local Means algorithm, where the pixels in the filtering window are ordered and only the most centrally located pixels in the filtering window are considered and used to calculate the weights needed for the averaging operation. The comparison with existing state-of-the-art denoising schemes in terms of image restoration quality measures shows that the new approach yields significantly better results in suppressing mixed noise in color digital images.
DOI: https://doi.org/10.1109/ISM.2013.78. Pages 405-406. Published 2013-12-09.
Citations: 7