首页 > 最新文献

2012 IEEE International Conference on Multimedia and Expo Workshops最新文献

英文 中文
A Human Caregiver Support System in Elderly Monitoring Facility 老年人监护设施中的人类照顾者支持系统
Pub Date : 2012-07-09 DOI: 10.1109/ICMEW.2012.82
M. A. Hossain, D. Ahmed
The number of elderly population is increasing worldwide and often they need assistance in their daily activities. In many situations, these elders are placed in elderly care facilities in order to receive continuous assistance from the human caregivers. The caregivers usually keep a watchful eye on the elders and help them in their activities of daily living. However, study shows that the human caregivers often suffer from boredom for being engaged in monitoring the elderly, which also compromises the care and assistance needed for the vulnerable elderly. In order to address this issue, we propose a human caregiver support system that aims to comprehend elderly persons' activities and decides what services to provide them in different situations and when to notify the human caregiver about any incident that happens in the care facility. Our preliminary experiment shows the potential of such system.
世界范围内老年人口的数量正在增加,他们在日常活动中往往需要帮助。在许多情况下,这些老人被安置在老年护理设施中,以便从人类照顾者那里得到持续的帮助。护理员通常会密切关注老人,并帮助他们进行日常生活活动。然而,研究表明,人类照顾者往往因为忙于照顾老人而感到无聊,这也损害了对弱势老年人的照顾和帮助。为了解决这个问题,我们提出了一个人类照顾者支持系统,旨在了解老年人的活动,并决定在不同情况下为他们提供什么服务,以及何时通知人类照顾者在护理设施中发生的任何事件。我们的初步实验表明了这种系统的潜力。
{"title":"A Human Caregiver Support System in Elderly Monitoring Facility","authors":"M. A. Hossain, D. Ahmed","doi":"10.1109/ICMEW.2012.82","DOIUrl":"https://doi.org/10.1109/ICMEW.2012.82","url":null,"abstract":"The number of elderly population is increasing worldwide and often they need assistance in their daily activities. In many situations, these elders are placed in elderly care facilities in order to receive continuous assistance from the human caregivers. The caregivers usually keep a watchful eye on the elders and help them in their activities of daily living. However, study shows that the human caregivers often suffer from boredom for being engaged in monitoring the elderly, which also compromises the care and assistance needed for the vulnerable elderly. In order to address this issue, we propose a human caregiver support system that aims to comprehend elderly persons' activities and decides what services to provide them in different situations and when to notify the human caregiver about any incident that happens in the care facility. Our preliminary experiment shows the potential of such system.","PeriodicalId":385797,"journal":{"name":"2012 IEEE International Conference on Multimedia and Expo Workshops","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2012-07-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126956562","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
Depth and Geometry from a Single 2D Image Using Triangulation 深度和几何从一个单一的2D图像使用三角测量
Pub Date : 2012-07-09 DOI: 10.1109/ICMEW.2012.95
Yasir Salih, A. Malik
We present a novel method for computing depth of field and geometry from a single 2D image. This technique, unlike the existing ones measures the absolute depth of field and distances in the scene from single image only using the concept of triangulation. This algorithm requires minimum inputs such as camera height, camera pitch angle and camera field of view for computing the depth of field and 3D coordinates of any given point in the image. In addition, this method can be used to compute the actual size of an object in the scene (width and height) as well as the distance between different objects in the image. The proposed methodology has the potential to be implemented in high impact applications such as distance measurement from mobile phones, robot navigation and aerial surveillances.
我们提出了一种从单个二维图像计算景深和几何形状的新方法。与现有的技术不同,该技术仅使用三角测量的概念,从单个图像中测量场景中的绝对景深和距离。该算法需要最小的输入,如相机高度,相机俯仰角和相机视场,用于计算图像中任何给定点的景深和3D坐标。此外,该方法还可以用于计算场景中物体的实际尺寸(宽度和高度)以及图像中不同物体之间的距离。所提出的方法有可能在高影响应用中实施,例如移动电话的距离测量、机器人导航和空中监视。
{"title":"Depth and Geometry from a Single 2D Image Using Triangulation","authors":"Yasir Salih, A. Malik","doi":"10.1109/ICMEW.2012.95","DOIUrl":"https://doi.org/10.1109/ICMEW.2012.95","url":null,"abstract":"We present a novel method for computing depth of field and geometry from a single 2D image. This technique, unlike the existing ones measures the absolute depth of field and distances in the scene from single image only using the concept of triangulation. This algorithm requires minimum inputs such as camera height, camera pitch angle and camera field of view for computing the depth of field and 3D coordinates of any given point in the image. In addition, this method can be used to compute the actual size of an object in the scene (width and height) as well as the distance between different objects in the image. The proposed methodology has the potential to be implemented in high impact applications such as distance measurement from mobile phones, robot navigation and aerial surveillances.","PeriodicalId":385797,"journal":{"name":"2012 IEEE International Conference on Multimedia and Expo Workshops","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2012-07-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122001685","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 19
Theoretical Framework for Evaluating Partial Checksum Protection in Wireless Video Streaming 无线视频流中部分校验和保护评估的理论框架
Pub Date : 2012-07-09 DOI: 10.1109/ICMEW.2012.118
J. Korhonen, Søren Forchhammer, K. J. Larsen
The benefits of passing partially corrupted packets to the application instead of discarding them have been debated actively, since Lightweight User Data gram Protocol (UDP Lite) was introduced. UDP Lite allows partial check summing in order to omit bit errors in the non-critical part of the packet payload. Several studies have shown that data throughput over a link prone to bit errors can be significantly improved with partial check summing. However, the higher throughput comes at the cost of bit errors appearing in the non-critical parts of the payload. Therefore, the overall benefit depends highly on the capability of coping with errors at the application layer. In this paper, we present a theoretical framework for defining the optimal level of partial checksum protection, assuming that the bit error characteristics and the perceptual impact of bit errors appearing in the non-protected parts of the payload are known. We have also derived experimentally the distortion levels for video sequences coded with different bit rates and protection levels in the presence of bit errors. The results show that in some scenarios it is possible to improve the perceived overall video quality by using partial check summing.
将部分损坏的数据包传递给应用程序而不是丢弃它们的好处,自从轻量级用户数据报文协议(UDP Lite)引入以来,一直存在激烈的争论。UDP Lite允许部分校验和,以便忽略数据包有效载荷的非关键部分的位错误。一些研究表明,部分校验和可以显著提高易出错链路上的数据吞吐量。然而,更高的吞吐量是以在负载的非关键部分出现位错误为代价的。因此,总体收益在很大程度上取决于在应用层处理错误的能力。在本文中,我们提出了一个理论框架来定义部分校验和保护的最佳水平,假设误码特征和误码在有效载荷的非保护部分出现的感知影响是已知的。我们还通过实验推导了在存在比特错误的情况下,用不同比特率和保护级别编码的视频序列的失真水平。结果表明,在某些情况下,使用部分校验和可以提高感知到的整体视频质量。
{"title":"Theoretical Framework for Evaluating Partial Checksum Protection in Wireless Video Streaming","authors":"J. Korhonen, Søren Forchhammer, K. J. Larsen","doi":"10.1109/ICMEW.2012.118","DOIUrl":"https://doi.org/10.1109/ICMEW.2012.118","url":null,"abstract":"The benefits of passing partially corrupted packets to the application instead of discarding them have been debated actively, since Lightweight User Data gram Protocol (UDP Lite) was introduced. UDP Lite allows partial check summing in order to omit bit errors in the non-critical part of the packet payload. Several studies have shown that data throughput over a link prone to bit errors can be significantly improved with partial check summing. However, the higher throughput comes at the cost of bit errors appearing in the non-critical parts of the payload. Therefore, the overall benefit depends highly on the capability of coping with errors at the application layer. In this paper, we present a theoretical framework for defining the optimal level of partial checksum protection, assuming that the bit error characteristics and the perceptual impact of bit errors appearing in the non-protected parts of the payload are known. We have also derived experimentally the distortion levels for video sequences coded with different bit rates and protection levels in the presence of bit errors. The results show that in some scenarios it is possible to improve the perceived overall video quality by using partial check summing.","PeriodicalId":385797,"journal":{"name":"2012 IEEE International Conference on Multimedia and Expo Workshops","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2012-07-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124687321","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
OpenGL SC Implementation over an OpenGL ES 1.1 Graphics Board OpenGL SC在OpenGL ES 1.1图形板上的实现
Pub Date : 2012-07-09 DOI: 10.1109/ICMEW.2012.127
Nakhoon Baek, Hwanyong Lee
OpenGL SC, the safety critical profile of OpenGL plays the major role for the graphical user interfaces, especially in the safety-critical markets, including avionics, military, medical and automotive applications. In other side, OpenGL ES, the embedded systems version of OpenGL, has many commercial implementations. In this demonstration, we show that the OpenGL SC features can be provided over the wide-spread OpenGL ES graphics boards. This is the most cost-effective way of implementing OpenGL SC, at this time. Our result is the first implementation based on OpenGL ES 1.1 hardware. We will demonstrate this OpenGL SC-over-OpenGL ES 1.1 implementation, and show its successful behaviors.
OpenGL SC, OpenGL的安全关键配置文件在图形用户界面中起着主要作用,特别是在安全关键市场,包括航空电子,军事,医疗和汽车应用。另一方面,OpenGL的嵌入式系统版本OpenGL ES有许多商业实现。在这个演示中,我们展示了OpenGL SC特性可以在广泛的OpenGL ES图形板上提供。这是目前实现OpenGL SC最具成本效益的方法。我们的结果是基于OpenGL ES 1.1硬件的第一个实现。我们将演示这个OpenGL SC-over-OpenGL ES 1.1实现,并展示其成功的行为。
{"title":"OpenGL SC Implementation over an OpenGL ES 1.1 Graphics Board","authors":"Nakhoon Baek, Hwanyong Lee","doi":"10.1109/ICMEW.2012.127","DOIUrl":"https://doi.org/10.1109/ICMEW.2012.127","url":null,"abstract":"OpenGL SC, the safety critical profile of OpenGL plays the major role for the graphical user interfaces, especially in the safety-critical markets, including avionics, military, medical and automotive applications. In other side, OpenGL ES, the embedded systems version of OpenGL, has many commercial implementations. In this demonstration, we show that the OpenGL SC features can be provided over the wide-spread OpenGL ES graphics boards. This is the most cost-effective way of implementing OpenGL SC, at this time. Our result is the first implementation based on OpenGL ES 1.1 hardware. We will demonstrate this OpenGL SC-over-OpenGL ES 1.1 implementation, and show its successful behaviors.","PeriodicalId":385797,"journal":{"name":"2012 IEEE International Conference on Multimedia and Expo Workshops","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2012-07-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124859977","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Crowd Density Estimation Based on Local Binary Pattern Co-Occurrence Matrix 基于局部二值模式共现矩阵的人群密度估计
Pub Date : 2012-07-09 DOI: 10.1109/ICMEW.2012.71
Zhe Wang, Hong Liu, Yueliang Qian, Tao Xu
Crowd density estimation is important for intelligent video surveillance. Many methods based on texture features have been proposed to solve this problem. Most of the existing algorithms only estimate crowd density on the whole image while ignore crowd density in local region. In this paper, we propose a novel texture descriptor based on Local Binary Pattern (LBP) Co-occurrence Matrix (LBPCM) for crowd density estimation. LBPCM is constructed from several overlapping cells in an image block, which is going to be classified into different crowd density levels. LBPCM describes both the statistical properties and the spatial information of LBP and thus makes full use of LBP for local texture features. Additionally, we both extract LBPCM on gray and gradient images to improve the performance of crowd density estimation. Finally, the sliding window technique is used to detect the potential crowded area. The experimental results show the proposed method has better performance than other texture based crowd density estimation methods.
人群密度估计对智能视频监控具有重要意义。人们提出了许多基于纹理特征的方法来解决这一问题。现有的算法大多只对整个图像的人群密度进行估计,而忽略了局部区域的人群密度。本文提出了一种基于局部二值模式共现矩阵(LBPCM)的纹理描述符,用于人群密度估计。LBPCM由图像块中的多个重叠单元构成,并将其划分为不同的人群密度水平。LBPCM既描述了LBP的统计特性,又描述了LBP的空间信息,从而充分利用了LBP的局部纹理特征。此外,我们在灰度和梯度图像上都提取了LBPCM,以提高人群密度估计的性能。最后,利用滑动窗口技术检测潜在拥挤区域。实验结果表明,该方法比其他基于纹理的人群密度估计方法具有更好的性能。
{"title":"Crowd Density Estimation Based on Local Binary Pattern Co-Occurrence Matrix","authors":"Zhe Wang, Hong Liu, Yueliang Qian, Tao Xu","doi":"10.1109/ICMEW.2012.71","DOIUrl":"https://doi.org/10.1109/ICMEW.2012.71","url":null,"abstract":"Crowd density estimation is important for intelligent video surveillance. Many methods based on texture features have been proposed to solve this problem. Most of the existing algorithms only estimate crowd density on the whole image while ignore crowd density in local region. In this paper, we propose a novel texture descriptor based on Local Binary Pattern (LBP) Co-occurrence Matrix (LBPCM) for crowd density estimation. LBPCM is constructed from several overlapping cells in an image block, which is going to be classified into different crowd density levels. LBPCM describes both the statistical properties and the spatial information of LBP and thus makes full use of LBP for local texture features. Additionally, we both extract LBPCM on gray and gradient images to improve the performance of crowd density estimation. Finally, the sliding window technique is used to detect the potential crowded area. The experimental results show the proposed method has better performance than other texture based crowd density estimation methods.","PeriodicalId":385797,"journal":{"name":"2012 IEEE International Conference on Multimedia and Expo Workshops","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2012-07-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129972319","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 46
ROI-Based Video Stabilization Algorithm for Hand-Held Cameras 基于roi的手持摄像机稳像算法
Pub Date : 2012-07-09 DOI: 10.1109/ICMEW.2012.60
Dong-Bok Lee, Ick-hyun Choi, B. Song, T. Lee
Recently, a content-preserving warping algorithm utilizing 3D motion has been acknowledged as state-of-the-art thanks to its superior stabilization performance. However, the huge computational cost of this technique is a serious burden in spite of its excellent performance. Thus, we propose a fast video stabilization algorithm that provides significantly reduced computational complexity over the state-of-the-art with the same stabilization performance. First, we estimate the 3D information of the feature points in each input frame and define the region of interest (ROI) based on the estimated 3D information. Next, we apply the proposed ROI-based pre-warping and content-preserving warping sequentially to the input frame. From intensive simulation results, we find that the proposed algorithm reduces computational complexity by 15% of that of the state-of-the-art method, while keeping almost equivalent stabilization performance.
最近,一种利用3D运动的内容保留翘曲算法因其优越的稳定性能而被公认为最先进的算法。然而,尽管该技术具有优异的性能,但其巨大的计算成本是一个严重的负担。因此,我们提出了一种快速视频防抖算法,该算法在具有相同防抖性能的情况下显着降低了最先进的计算复杂度。首先,我们估计每个输入帧中特征点的三维信息,并根据估计的三维信息定义感兴趣区域(ROI)。接下来,我们将提出的基于roi的预翘曲和内容保留翘曲依次应用于输入帧。从密集的仿真结果中,我们发现所提出的算法将计算复杂度降低了最先进方法的15%,同时保持了几乎相同的稳定性能。
{"title":"ROI-Based Video Stabilization Algorithm for Hand-Held Cameras","authors":"Dong-Bok Lee, Ick-hyun Choi, B. Song, T. Lee","doi":"10.1109/ICMEW.2012.60","DOIUrl":"https://doi.org/10.1109/ICMEW.2012.60","url":null,"abstract":"Recently, a content-preserving warping algorithm utilizing 3D motion has been acknowledged as state-of-the-art thanks to its superior stabilization performance. However, the huge computational cost of this technique is a serious burden in spite of its excellent performance. Thus, we propose a fast video stabilization algorithm that provides significantly reduced computational complexity over the state-of-the-art with the same stabilization performance. First, we estimate the 3D information of the feature points in each input frame and define the region of interest (ROI) based on the estimated 3D information. Next, we apply the proposed ROI-based pre-warping and content-preserving warping sequentially to the input frame. From intensive simulation results, we find that the proposed algorithm reduces computational complexity by 15% of that of the state-of-the-art method, while keeping almost equivalent stabilization performance.","PeriodicalId":385797,"journal":{"name":"2012 IEEE International Conference on Multimedia and Expo Workshops","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2012-07-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130011831","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
A Robust Wavelet-based Approach to Fingerprint Indentification 基于鲁棒小波的指纹识别方法
Pub Date : 2012-07-09 DOI: 10.1109/ICMEW.2012.78
M. Omidyeganeh, A. Javadtalab, S. Ghaemmaghami, S. Shirmohammadi
A robust fingerprint recognition system based on marginal statistics of 2D wavelet transform is introduced which significantly improves the accuracy of previous wavelet based approaches due to 1) a better selection of features extracted from the wavelet transform, and 2) a more accurate distance measure to find the similarity between fingerprints. A combination of Jain and Poincare algorithms is employed to locate the fingerprint reference point. The main part of the fingerprint image is chosen as a rectangle with the reference point at its center. The image is then divided into nonoverlapping sub-images, the wavelet transform is applied to the bi-level sub-images, and Generalized Gaussian Density (GGD) features are extracted from each wavelet sub band. Finally, the fingerprint recognition is done through the k-Nearest Neighbor (k-NN) classification employing Kullback-Leibler Distance (KLD) measure. Our test results confirm the superiority of the proposed method over the current fingerprint recognition methods.
提出了一种基于二维小波变换边缘统计量的鲁棒指纹识别系统,该系统可以更好地选择小波变换提取的特征,并且可以更准确地寻找指纹之间的相似度,从而大大提高了以往基于小波变换的方法的识别精度。结合Jain和Poincare算法对指纹参考点进行定位。选取指纹图像的主体部分作为一个矩形,以参考点为中心。然后将图像分割成互不重叠的子图像,对两级子图像进行小波变换,从每个小波子带提取广义高斯密度特征。最后,采用Kullback-Leibler距离(KLD)测度,通过k-最近邻(k-NN)分类完成指纹识别。我们的测试结果证实了该方法相对于现有指纹识别方法的优越性。
{"title":"A Robust Wavelet-based Approach to Fingerprint Indentification","authors":"M. Omidyeganeh, A. Javadtalab, S. Ghaemmaghami, S. Shirmohammadi","doi":"10.1109/ICMEW.2012.78","DOIUrl":"https://doi.org/10.1109/ICMEW.2012.78","url":null,"abstract":"A robust fingerprint recognition system based on marginal statistics of 2D wavelet transform is introduced which significantly improves the accuracy of previous wavelet based approaches due to 1) a better selection of features extracted from the wavelet transform, and 2) a more accurate distance measure to find the similarity between fingerprints. A combination of Jain and Poincare algorithms is employed to locate the fingerprint reference point. The main part of the fingerprint image is chosen as a rectangle with the reference point at its center. The image is then divided into nonoverlapping sub-images, the wavelet transform is applied to the bi-level sub-images, and Generalized Gaussian Density (GGD) features are extracted from each wavelet sub band. Finally, the fingerprint recognition is done through the k-Nearest Neighbor (k-NN) classification employing Kullback-Leibler Distance (KLD) measure. Our test results confirm the superiority of the proposed method over the current fingerprint recognition methods.","PeriodicalId":385797,"journal":{"name":"2012 IEEE International Conference on Multimedia and Expo Workshops","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2012-07-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124430704","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Robust Background Subtraction Based on Perceptual Mixture-of-Gaussians with Dynamic Adaptation Speed 基于动态自适应速度的感知高斯混合鲁棒背景减法
Pub Date : 2012-07-09 DOI: 10.1109/ICMEW.2012.75
Mahfuzul Haque, M. Murshed
In this paper, we propose a new background subtraction technique based on perceptual mixture-of-Gaussians (PMOG). Unlike numerous variants of the classical MOG based approach [1], which can ensure reliable detection result only in known operating environments through proper parameter tuning, PMOG shows superior detection performance across dynamic unconstrained scenarios without any tuning. This is due to PMOG's intrinsic capability of exploiting several perceptual characteristics of human visual system for better understanding of the operating environment to avoid blind reliance on statistical observations. Furthermore, the proposed technique dynamically varies the model adaptation speed, i.e., learning rate, based on observed scene statistics for faster adaptation of changed background and better persistency of detected foreground entities. Comprehensive experimental evaluation on a number of standard datasets validates the robustness of the technique compared to the state-of-the-art.
在本文中,我们提出了一种新的基于感知混合高斯(PMOG)的背景减去技术。经典的基于MOG方法[1]的众多变体只有在已知的操作环境中通过适当的参数调优才能确保可靠的检测结果,而PMOG不需要任何调优就能在动态无约束场景中表现出卓越的检测性能。这是由于PMOG利用人类视觉系统的几个感知特征来更好地理解操作环境的内在能力,以避免盲目依赖统计观察。此外,该技术基于观察到的场景统计量动态改变模型的自适应速度,即学习率,从而更快地适应变化的背景,更好地持续检测到前景实体。对许多标准数据集的综合实验评估验证了该技术与最先进技术相比的鲁棒性。
{"title":"Robust Background Subtraction Based on Perceptual Mixture-of-Gaussians with Dynamic Adaptation Speed","authors":"Mahfuzul Haque, M. Murshed","doi":"10.1109/ICMEW.2012.75","DOIUrl":"https://doi.org/10.1109/ICMEW.2012.75","url":null,"abstract":"In this paper, we propose a new background subtraction technique based on perceptual mixture-of-Gaussians (PMOG). Unlike numerous variants of the classical MOG based approach [1], which can ensure reliable detection result only in known operating environments through proper parameter tuning, PMOG shows superior detection performance across dynamic unconstrained scenarios without any tuning. This is due to PMOG's intrinsic capability of exploiting several perceptual characteristics of human visual system for better understanding of the operating environment to avoid blind reliance on statistical observations. Furthermore, the proposed technique dynamically varies the model adaptation speed, i.e., learning rate, based on observed scene statistics for faster adaptation of changed background and better persistency of detected foreground entities. Comprehensive experimental evaluation on a number of standard datasets validates the robustness of the technique compared to the state-of-the-art.","PeriodicalId":385797,"journal":{"name":"2012 IEEE International Conference on Multimedia and Expo Workshops","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2012-07-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115585274","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 10
Towards a Video Browser for the Digital Native 面向数字原生代的视频浏览器
Pub Date : 2012-07-09 DOI: 10.1109/ICMEW.2012.29
Brett Adams, S. Greenhill, S. Venkatesh
Almost every aspect of how we create, transmit, and consume video has changed, but video interfaces still mimic those from video's inception. We extend Temporal Semantic Compression for interactive video browsing, which uses an arbitrary frame-by-frame interest measure to sub-sample video in real time, with user interface elements that visualize these measures and the effect of compressing on them. We experiment with a novel interest measure for popularity, and design novel visualizations for expressing interest measures and the compression interaction. We conduct the first formative evaluation of the TSC paradigm, with 8 subjects, and report design implications arising from it.
我们制作、传输和消费视频的方式几乎每个方面都发生了变化,但视频接口仍然模仿视频最初的样子。我们将时态语义压缩扩展到交互式视频浏览,它使用任意逐帧兴趣度量来实时对视频进行子采样,并使用用户界面元素将这些度量和压缩对它们的影响可视化。我们尝试了一种新颖的流行兴趣度量,并设计了新颖的可视化来表达兴趣度量和压缩交互。我们对8个研究对象进行了TSC范式的第一次形成性评估,并报告了由此产生的设计启示。
{"title":"Towards a Video Browser for the Digital Native","authors":"Brett Adams, S. Greenhill, S. Venkatesh","doi":"10.1109/ICMEW.2012.29","DOIUrl":"https://doi.org/10.1109/ICMEW.2012.29","url":null,"abstract":"Almost every aspect of how we create, transmit, and consume video has changed, but video interfaces still mimic those from video's inception. We extend Temporal Semantic Compression for interactive video browsing, which uses an arbitrary frame-by-frame interest measure to sub-sample video in real time, with user interface elements that visualize these measures and the effect of compressing on them. We experiment with a novel interest measure for popularity, and design novel visualizations for expressing interest measures and the compression interaction. We conduct the first formative evaluation of the TSC paradigm, with 8 subjects, and report design implications arising from it.","PeriodicalId":385797,"journal":{"name":"2012 IEEE International Conference on Multimedia and Expo Workshops","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2012-07-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123337222","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
A Textural Based Hidden Markov Model for Animation Genre Discrimination 基于纹理的隐马尔可夫动画类型判别模型
Pub Date : 2012-07-09 DOI: 10.1109/ICMEW.2012.102
Joseph Santarcangelo, Xiao-Ping Zhang
This paper develops a novel method to automatically categorize different animation genres in a video database made for children, this is the first such research done in animation genre categorization. The method is based on statistically modeling the temporal texture attributes of the video sequence. The features are extracted from gray-level co-occurrence matrices and a hidden Markov models (HMM) are used as a classifier. It was found the method had 16.66% better accuracy compared to other methods with the same number of parameters and dimensions of feature vector.
本文提出了一种针对儿童视频数据库中不同动画类型的自动分类方法,这在动画类型分类领域尚属首次。该方法基于对视频序列的时间纹理属性进行统计建模。从灰度共生矩阵中提取特征,并使用隐马尔可夫模型作为分类器。结果表明,在相同参数个数和特征向量维数的情况下,该方法的准确率比其他方法提高了16.66%。
{"title":"A Textural Based Hidden Markov Model for Animation Genre Discrimination","authors":"Joseph Santarcangelo, Xiao-Ping Zhang","doi":"10.1109/ICMEW.2012.102","DOIUrl":"https://doi.org/10.1109/ICMEW.2012.102","url":null,"abstract":"This paper develops a novel method to automatically categorize different animation genres in a video database made for children, this is the first such research done in animation genre categorization. The method is based on statistically modeling the temporal texture attributes of the video sequence. The features are extracted from gray-level co-occurrence matrices and a hidden Markov models (HMM) are used as a classifier. It was found the method had 16.66% better accuracy compared to other methods with the same number of parameters and dimensions of feature vector.","PeriodicalId":385797,"journal":{"name":"2012 IEEE International Conference on Multimedia and Expo Workshops","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2012-07-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123979560","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
期刊
2012 IEEE International Conference on Multimedia and Expo Workshops
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1