
2013 IEEE International Symposium on Broadband Multimedia Systems and Broadcasting (BMSB): Latest Articles

A New Approach for 2D-3D Heterogeneous Face Recognition
Xiaolong Wang, V. Ly, G. Guo, C. Kambhamettu
This paper proposes a novel scheme for heterogeneous face recognition that matches visible images to depth images. In the proposed technique, we adopt Partial Least Squares (PLS) to handle the correlation mapping between 2D and 3D. A considerable performance improvement is observed compared with Canonical Correlation Analysis (CCA). To further improve performance, a fusion scheme based on PLS and CCA is advocated. We evaluate the approach on a popular face dataset, FRGC v2.0. Experimental results demonstrate that the proposed scheme is effective for 2D-3D face recognition.
DOI: 10.1109/ISM.2013.58 (https://doi.org/10.1109/ISM.2013.58), pp. 301-304, published 2013-12-09
Citations: 13
Eye Blink Detection for Smart Glasses
Hoang Le, Thanh Dang, Feng Liu
An eye blink is a quick closing and opening of the eyelids. Eye blink detection has a wide range of applications in human-computer interaction and vision health care research. Existing approaches are often poorly suited to resource-limited platforms such as smart glasses, which have a limited energy supply and typically cannot afford strong imaging and computational capabilities. In this paper, we present an efficient and robust eye blink detection method for smart glasses. Our method first employs an eigen-eye approach to detect eye closure in individual video frames. It then learns eye blink patterns from the eye-closure detection results and detects eye blinks using a Gradient Boosting method. Finally, it uses a non-maximum suppression algorithm to remove repeated detections of the same eye blink among consecutive video frames. Experiments with our prototype smart glasses, equipped with a low-power camera and an embedded processor, show accurate detection (more than 96% accuracy) on video frames as small as 16 × 12 pixels at 96 fps, which enables a number of applications in health care, driving safety, and human-computer interaction.
DOI: 10.1109/ISM.2013.59 (https://doi.org/10.1109/ISM.2013.59), pp. 305-308, published 2013-12-09
Citations: 24
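The last step described in the eye-blink abstract above, suppressing repeated detections of one blink across consecutive frames, can be illustrated with a minimal temporal non-maximum suppression sketch. The per-frame scores, the `threshold` and `window` parameters, and the function name are assumptions made for illustration; the abstract does not specify them, and this is not the authors' implementation.

```python
def suppress_repeated_detections(scores, threshold=0.5, window=3):
    """Return the frame indices kept after temporal non-maximum suppression.

    A frame is kept only if its score reaches `threshold` and it is the
    first maximum within +/- `window` frames (all values are hypothetical).
    """
    kept = []
    for i, score in enumerate(scores):
        if score < threshold:
            continue
        lo = max(0, i - window)
        hi = min(len(scores), i + window + 1)
        # Keep the frame only if it is the (earliest) local maximum.
        if score == max(scores[lo:hi]) and scores.index(score, lo, hi) == i:
            kept.append(i)
    return kept


# Hypothetical per-frame blink scores: two blinks, each firing on several frames.
scores = [0.1, 0.6, 0.9, 0.7, 0.2, 0.1, 0.1, 0.8, 0.8, 0.3]
print(suppress_repeated_detections(scores))
```

Each blink is reported once, at the frame where its score peaks; ties are resolved in favour of the earliest frame.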
A Hybrid Contextual User Perception Model for Streamed Video Quality Assessment
M. Diallo, N. Maréchal, H. Afifi
Service providers pursue user satisfaction in order to reduce churn, promote new services, and improve ARPU (Average Revenue per User). In this work, a novel hybrid assessment technique is presented. It refines known mathematical models for quality assessment using both context information and subjective tests. The model is then enriched with new features such as content characteristics, device type, and network status, and compared with the state of the art. The effect of application parameters (startup time and buffering ratio) on user-perceived quality is also analyzed.
DOI: 10.1109/ISM.2013.104 (https://doi.org/10.1109/ISM.2013.104), pp. 518-519, published 2013-12-09
Citations: 8
Detection of Most Popular Routes and Effective Time Segments Using Trajectory Distributions
Kazuma Ito, Hung-Hsuan Huang, K. Kawagoe
Several innovative studies have used GPS devices to detect the Most Popular Route (MPR) in order to support tourists travelling in unfamiliar areas. The MPR is the route, among all possible routes, along which the largest number of moving objects travel. Current MPR detection methods do not take the time of trajectory measurement into account, even though road conditions vary with the time segment. A detected MPR may therefore be inappropriate outside the time segment in which it was defined. The aim of this study is to propose a new MPR detection method that considers the time segment of trajectory measurement. In addition, a "Popularity Measure" is proposed to verify the suitability of the detected MPR. The MPRs detected by the existing and proposed methods are compared using this popularity measure.
DOI: 10.1109/ISM.2013.107 (https://doi.org/10.1109/ISM.2013.107), pp. 530-531, published 2013-12-09
Citations: 0
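The idea in the route-detection abstract above, that the most popular route can change once the time of measurement is considered, can be sketched as follows. The data layout (hour of day plus a route given as a tuple of waypoints) and the popularity value (the share of trajectories in the time segment that follow the winning route) are assumptions for illustration only; the abstract does not define the authors' "Popularity Measure".

```python
from collections import Counter


def most_popular_route(trajectories, start_hour, end_hour):
    """Return (route, popularity) for trajectories recorded in [start_hour, end_hour).

    `popularity` is a hypothetical measure: the fraction of trajectories in
    the time segment that follow the most frequent route.
    """
    in_segment = [route for hour, route in trajectories
                  if start_hour <= hour < end_hour]
    if not in_segment:
        return None, 0.0
    route, count = Counter(in_segment).most_common(1)[0]
    return route, count / len(in_segment)


ABD = ("A", "B", "D")
ACD = ("A", "C", "D")
# Hypothetical trajectories: (hour of measurement, route taken).
trajectories = [(7, ABD), (8, ABD), (8, ABD), (9, ACD),
                (18, ACD), (18, ACD), (19, ACD)]

print(most_popular_route(trajectories, 0, 24))   # whole day
print(most_popular_route(trajectories, 6, 12))   # morning segment only
```

In this toy data the whole-day winner is A-C-D, while the morning segment favours A-B-D, which is the kind of mismatch the abstract describes.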
A Simple Desktop Compression and Streaming System
I. Hadžić, Hans C. Woithe, Martin D. Carroll
We present a compression algorithm and a streaming protocol designed for streaming of computer-desktop graphics. The encoder has low memory requirements and can be broken into a large number of independent contexts with a high degree of data locality. The encoder also uses only simple arithmetic, which makes it amenable to hardware or highly parallel software implementation. The decoder is trivial and requires no memory, which makes it suitable for use on devices with limited computing capabilities. The streaming protocol runs over UDP and has its own unique error recovery mechanism specifically designed for interactive applications.
DOI: 10.1109/ISM.2013.65 (https://doi.org/10.1109/ISM.2013.65), pp. 339-346, published 2013-12-09
Citations: 1
Keypoint Reduction for Smart Image Retrieval
K. Yuasa, T. Wada
Content-based image retrieval (CBIR) is the problem of retrieving images using image content as the query. It has been investigated in many applications, such as human identification, information embedding in real-world objects, and life-logging. Many studies of CBIR have shown that local image features defined on image keypoints, such as SIFT, SURF, and LBP, are effective for fast and occlusion-robust image retrieval. In CBIR using local features, it is clear that not all features are necessary for retrieval: distinctive features have stronger discriminative power than commonly observed features, and some local features are fragile under observation distortions. This paper presents an importance measure, based on diverse density, that represents both the robustness and the distinctiveness of a local feature. Using this measure, we can reduce the number of local features associated with each database entry. In experiments, a database indexed with the reduced set of local features performs better than one indexed with all local features.
DOI: 10.1109/ISM.2013.67 (https://doi.org/10.1109/ISM.2013.67), pp. 351-358, published 2013-12-09
Citations: 4
A Quantitative Analysis of a Virtual Programming Lab
Jan Vanvinkenroye, Christoph Grüninger, C. Heine, T. Richter
In a recent publication, we described the design and implementation of a web-based programming lab (ViPLab) targeted at undergraduate Engineering and Mathematics courses. This work provides a quantitative analysis of user feedback, experience, and learning success. We conducted a survey with one learning group using the web-based tools and a control group working with a traditional setup based on an editor and a compiler. The survey shows that web-based installations are as efficient as classical tools, while Windows users prefer the web-based tool chain over the editor/compiler installation on Linux. This justifies the use of web-based installations in beginner programming courses when the learning target is programming itself rather than a particular tool chain.
DOI: 10.1109/ISM.2013.88 (https://doi.org/10.1109/ISM.2013.88), pp. 457-461, published 2013-12-09
Citations: 6
Resolution Control for Size Bias Elimination in Multi-resolution Visual Matching
S. Clippingdale
Visual matching for tracking and recognition, for example in video indexing, often uses image features measured at multiple resolutions. As a tracked object moves away from the camera, appearing progressively smaller, the higher resolutions consecutively become unavailable for matching, causing step changes in the similarity or “match score” of the tracked object. If several candidate matches (hypotheses) are maintained for a tracked region, this effect causes a bias toward larger region hypotheses that match at one extra resolution relative to even slightly smaller hypotheses. The effect is subtle and appears intermittent because it occurs only around a specific discrete set of object sizes. We describe the problem and the class of visual matching methods that it affects, and propose a solution. We present experimental results from a real video indexing system to illustrate both the problem and the effectiveness of the proposed solution.
DOI: 10.1109/ISM.2013.87 (https://doi.org/10.1109/ISM.2013.87), pp. 451-456, published 2013-12-09
Citations: 0
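The size bias described in the multi-resolution matching abstract above can be made concrete with a toy model. Everything here is an assumption for illustration: feature level k is taken to have a wavelength of object_size / 2^(k+1) pixels and to be usable only while that wavelength is at least `min_wavelength` pixels, and the match score is taken to be a plain sum over usable levels. The sketch shows only the problem, not the resolution control proposed in the paper.

```python
def usable_levels(object_size, min_wavelength=4.0, max_levels=4):
    """Count feature levels that can still be measured on an object of the
    given size in pixels (hypothetical model: finer levels drop out first)."""
    return sum(1 for k in range(max_levels)
               if object_size / 2 ** (k + 1) >= min_wavelength)


def summed_match_score(object_size, per_level_score=1.0):
    """Un-normalised match score: one equal contribution per usable level."""
    return usable_levels(object_size) * per_level_score


# Two region hypotheses that differ by a single pixel straddle a step.
for size in (64, 33, 32, 31, 16, 15):
    print(size, usable_levels(size), summed_match_score(size))
```

A hypothesis of size 32 matches at one more level than a hypothesis of size 31, so a summed score favours the slightly larger region even when both fit the image equally well at the levels they share.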
Unsupervised Co-segmentation of Complex Image Set via Bi-harmonic Distance Governed Multi-level Deformable Graph Clustering
Jizhou Ma, Shuai Li, A. Hao, Hong Qin
Despite the recent success of co-segmentation studies, existing methods remain limited in handling multiple-foreground, large-scale, high-variability image sets and in their suitability for parallel implementation. To address this, this paper proposes a flexible method governed by bi-harmonic distance for robust, coherent segmentation of the overlapping or similar contents that co-exist in an image group; it requires neither supervised learning nor any user-specified prior. The central idea is a novel integration of bi-harmonic distance metric design and multi-level deformable graph generation for multi-level clustering, which brings several advantages: it accommodates multiple-foreground images, respects both the local structures and the global semantics of images, is more robust and accurate, and is convenient for parallel acceleration. The core pipeline of our method involves intrinsic content-coherence measurement, super-pixel assisted bottom-up clustering, and cross-image optimization based on multi-level deformable graph clustering. We conduct extensive experiments on the iCoseg benchmark and Oxford flower datasets, and make comprehensive evaluations to demonstrate the superiority of our method via comparison with state-of-the-art methods collected in the MSRC database.
DOI: 10.1109/ISM.2013.16 (https://doi.org/10.1109/ISM.2013.16), pp. 38-45, published 2013-12-09
Citations: 3
Towards an Evaluation of Denoising Algorithms with Respect to Realistic Camera Noise
Tamara Seybold, Christian Keimel, Marion Knopp, W. Stechele
The development and tuning of denoising algorithms is usually based on readily processed test images that are artificially degraded with additive white Gaussian noise (AWGN). While AWGN allows us to generate test data easily and repeatably, it does not reflect the noise characteristics of a real digital camera. Realistic camera noise is signal-dependent and, because of the demosaicking step required to obtain full-color images, spatially correlated. Hence, its characteristics are fundamentally different from those of AWGN. Using such unrealistic data to test, optimize, and compare denoising algorithms may lead to incorrect parameter tuning or suboptimal choices in denoising research. In this paper, we therefore propose an approach to evaluating denoising algorithms with respect to realistic camera noise: we describe a new camera noise model that includes the full processing chain of a single-sensor camera. We determine the visual quality of noisy and denoised test sequences using a subjective test with 18 participants, and show that the noise characteristics have a significant effect on visual quality. We also apply quality metrics, which are required to compare denoising results, and evaluate the performance of 10 full-reference metrics and one no-reference metric on our realistic test data. We conclude that a more realistic noise model should be used in future research to improve the quality estimation of digital images and videos and to improve research on denoising algorithms.
DOI: 10.1109/ISM.2013.39 (https://doi.org/10.1109/ISM.2013.39), pp. 203-210, published 2013-12-09
Citations: 22
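The contrast drawn in the denoising abstract above, between AWGN and signal-dependent camera noise, can be illustrated with a small sketch. It assumes a common Gaussian approximation in which the noise variance is gain × signal + read_sigma²; the `gain` and `read_sigma` values are hypothetical, the spatial correlation introduced by demosaicking is not modeled, and this is not the camera noise model proposed in the paper.

```python
import random


def add_awgn(pixels, sigma, rng):
    """Additive white Gaussian noise: the same variance at every intensity."""
    return [p + rng.gauss(0.0, sigma) for p in pixels]


def add_signal_dependent_noise(pixels, gain, read_sigma, rng):
    """Gaussian approximation of shot plus read noise (assumed model):
    variance = gain * signal + read_sigma ** 2, so bright pixels are noisier."""
    return [p + rng.gauss(0.0, (gain * p + read_sigma ** 2) ** 0.5)
            for p in pixels]


def sample_variance(values):
    mean = sum(values) / len(values)
    return sum((v - mean) ** 2 for v in values) / (len(values) - 1)


rng = random.Random(0)            # fixed seed so the run is repeatable
dark = [10.0] * 5000              # flat dark patch
bright = [200.0] * 5000           # flat bright patch

for name, patch in (("dark", dark), ("bright", bright)):
    awgn_var = sample_variance(add_awgn(patch, 3.0, rng))
    cam_var = sample_variance(add_signal_dependent_noise(patch, 0.5, 2.0, rng))
    print(name, round(awgn_var, 1), round(cam_var, 1))
```

Under AWGN the two patches show roughly the same variance, whereas under the signal-dependent model the bright patch is much noisier, which is why an algorithm tuned on AWGN may be mis-tuned for real camera data.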