首页 > 最新文献

Proceedings 11th International Conference on Image Analysis and Processing最新文献

英文 中文
A local color descriptor for efficient scene-object recognition 一种用于高效场景物体识别的局部颜色描述符
Pub Date : 2001-09-26 DOI: 10.1109/ICIAP.2001.957049
E. Bigorgne, C. Achard, J. Devars
This paper presents an effective use of local descriptors for object or scene recognition and indexing. This approach is in keeping with model-based recognition systems and consists of an extension of a standard point-to-point matching between two images. Aiming at this, we address the use of Full-Zernike moments as a reliable local characterization of the image signal. A fundamental characteristic of the used descriptors is then their ability to "absorb" a given set of potential image modifications. Their design calls principally for the theory of invariants. A built-in invariance to similarities allows one to manage narrow bounded perspective transformations. Moreover we provide a study of the substantial and costless contribution of the use of color information. In order to achieve photometric invariance, different types of normalization are evaluated through a model-based object recognition task.
本文提出了一种有效地利用局部描述符进行对象或场景识别和索引的方法。这种方法与基于模型的识别系统保持一致,并由两个图像之间标准点对点匹配的扩展组成。针对这一点,我们解决了使用全泽尼克矩作为图像信号的可靠的局部表征。所使用的描述符的一个基本特征是它们能够“吸收”给定的一组潜在的图像修改。它们的设计主要需要不变量理论。内置的相似性不变性允许管理窄界透视图转换。此外,我们提供了大量的和无成本的使用颜色信息的贡献的研究。为了实现光度不变性,通过基于模型的目标识别任务评估不同类型的归一化。
{"title":"A local color descriptor for efficient scene-object recognition","authors":"E. Bigorgne, C. Achard, J. Devars","doi":"10.1109/ICIAP.2001.957049","DOIUrl":"https://doi.org/10.1109/ICIAP.2001.957049","url":null,"abstract":"This paper presents an effective use of local descriptors for object or scene recognition and indexing. This approach is in keeping with model-based recognition systems and consists of an extension of a standard point-to-point matching between two images. Aiming at this, we address the use of Full-Zernike moments as a reliable local characterization of the image signal. A fundamental characteristic of the used descriptors is then their ability to \"absorb\" a given set of potential image modifications. Their design calls principally for the theory of invariants. A built-in invariance to similarities allows one to manage narrow bounded perspective transformations. Moreover we provide a study of the substantial and costless contribution of the use of color information. In order to achieve photometric invariance, different types of normalization are evaluated through a model-based object recognition task.","PeriodicalId":365627,"journal":{"name":"Proceedings 11th International Conference on Image Analysis and Processing","volume":"299 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2001-09-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115999755","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Integrated 2D and 3D images for face recognition 集成2D和3D图像,用于人脸识别
Pub Date : 2001-09-26 DOI: 10.1109/ICIAP.2001.956984
Yingjie Wang, C. Chua, Yeong-Khing Ho, Ying Ren
This paper presents a feature-based face recognition system based on both 3D range data as well as 2D gray-level facial images. Ten 2D feature points and four 3D feature points are designed to be robust against changes of facial expressions and viewpoints and are described by Gabor filter responses in the 2D domain and point signature in the 3D domain. Localizing feature points in a new facial image is based on 3D-2D correspondence, average layout and corresponding bunch (covering a wide range of possible variations on each point). Extracted shape features from 3D feature points and texture features from 2D feature points are first projected into their own subspace using PCA. In subspace, the corresponding shape and texture weight vectors are then integrated to form an augmented vector which is used to represent each facial image. For a given test facial image, the best match in the model library is identified according to a classifier. Similarity function and support vector machine (SVM) are two types of classifier considered. Experimental results involving 2D persons with different facial expressions and extracted from different viewpoints have demonstrated the efficiency of our algorithm.
本文提出了一种基于三维距离数据和二维灰度图像的特征人脸识别系统。设计了10个二维特征点和4个三维特征点,对面部表情和视点的变化具有鲁棒性,并在二维域中使用Gabor滤波器响应,在三维域中使用点签名进行描述。在新的人脸图像中定位特征点是基于3D-2D对应,平均布局和对应束(覆盖每个点的广泛可能变化)。首先利用PCA将从三维特征点提取的形状特征和从二维特征点提取的纹理特征投影到各自的子空间中。然后在子空间中,对相应的形状和纹理权重向量进行积分,形成一个增广向量,用于表示每个面部图像。对于给定的测试面部图像,根据分类器识别模型库中的最佳匹配。相似函数和支持向量机(SVM)是两种分类器。实验结果表明,该算法具有不同的面部表情和不同的视点。
{"title":"Integrated 2D and 3D images for face recognition","authors":"Yingjie Wang, C. Chua, Yeong-Khing Ho, Ying Ren","doi":"10.1109/ICIAP.2001.956984","DOIUrl":"https://doi.org/10.1109/ICIAP.2001.956984","url":null,"abstract":"This paper presents a feature-based face recognition system based on both 3D range data as well as 2D gray-level facial images. Ten 2D feature points and four 3D feature points are designed to be robust against changes of facial expressions and viewpoints and are described by Gabor filter responses in the 2D domain and point signature in the 3D domain. Localizing feature points in a new facial image is based on 3D-2D correspondence, average layout and corresponding bunch (covering a wide range of possible variations on each point). Extracted shape features from 3D feature points and texture features from 2D feature points are first projected into their own subspace using PCA. In subspace, the corresponding shape and texture weight vectors are then integrated to form an augmented vector which is used to represent each facial image. For a given test facial image, the best match in the model library is identified according to a classifier. Similarity function and support vector machine (SVM) are two types of classifier considered. Experimental results involving 2D persons with different facial expressions and extracted from different viewpoints have demonstrated the efficiency of our algorithm.","PeriodicalId":365627,"journal":{"name":"Proceedings 11th International Conference on Image Analysis and Processing","volume":"263 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2001-09-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123394486","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 18
Visual grouping and object recognition 视觉分组和对象识别
Pub Date : 2001-09-26 DOI: 10.1109/ICIAP.2001.957078
Jitendra Malik
We develop a two-stage framework for parsing and understanding images, a process of image segmentation grouping pixels to form regions of coherent color and texture, and a process of recognition - comparing assemblies of such regions, hypothesized to correspond to a single object, with views of stored prototypes. We treat segmenting images into regions as an optimization problem: partition the image into regions such that there is high similarity within a region and low similarity across regions. This is formalized as the minimization of the normalized cut between regions. Using ideas from spectral graph theory, the minimization can be set as an eigenvalue problem. Visual attributes such as color, texture, contour and motion are encoded in this framework by suitable specification of graph edge weights. The recognition problem requires us to compare assemblies of image regions with previously stored proto-typical views of known objects. We have devised a novel algorithm for shape matching based on a relationship descriptor called the shape context. This enables us to compute similarity measures between shapes which, together with similarity measures for texture and color, can be used for object recognition. The shape matching algorithm has yielded excellent results on a variety of different 2D and 3D recognition problems.
我们开发了一个用于解析和理解图像的两阶段框架,一个图像分割过程,分组像素以形成连贯的颜色和纹理区域,以及一个识别过程-比较这些区域的集合,假设对应于单个对象,与存储原型的视图。我们将图像分割为区域作为一个优化问题:将图像划分为区域,使区域内具有高相似性,区域间具有低相似性。这被形式化为区域间标准化切割的最小化。利用谱图理论的思想,最小化可以被设置为特征值问题。该框架通过适当的图边权重规范对颜色、纹理、轮廓和运动等视觉属性进行编码。识别问题要求我们将图像区域集合与先前存储的已知对象的原型视图进行比较。我们设计了一种新的基于形状上下文关系描述符的形状匹配算法。这使我们能够计算形状之间的相似性度量,这些相似性度量与纹理和颜色的相似性度量一起可用于对象识别。形状匹配算法在各种不同的二维和三维识别问题上取得了优异的效果。
{"title":"Visual grouping and object recognition","authors":"Jitendra Malik","doi":"10.1109/ICIAP.2001.957078","DOIUrl":"https://doi.org/10.1109/ICIAP.2001.957078","url":null,"abstract":"We develop a two-stage framework for parsing and understanding images, a process of image segmentation grouping pixels to form regions of coherent color and texture, and a process of recognition - comparing assemblies of such regions, hypothesized to correspond to a single object, with views of stored prototypes. We treat segmenting images into regions as an optimization problem: partition the image into regions such that there is high similarity within a region and low similarity across regions. This is formalized as the minimization of the normalized cut between regions. Using ideas from spectral graph theory, the minimization can be set as an eigenvalue problem. Visual attributes such as color, texture, contour and motion are encoded in this framework by suitable specification of graph edge weights. The recognition problem requires us to compare assemblies of image regions with previously stored proto-typical views of known objects. We have devised a novel algorithm for shape matching based on a relationship descriptor called the shape context. This enables us to compute similarity measures between shapes which, together with similarity measures for texture and color, can be used for object recognition. The shape matching algorithm has yielded excellent results on a variety of different 2D and 3D recognition problems.","PeriodicalId":365627,"journal":{"name":"Proceedings 11th International Conference on Image Analysis and Processing","volume":"39 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2001-09-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132258081","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
Noise filtering of periodic image sequences 周期性图像序列的噪声滤波
Pub Date : 2001-09-26 DOI: 10.1109/ICIAP.2001.957004
A. Plebe
This work describes a method for filtering image sequences degraded by noise, where the main object is moving with an almost periodic displacement. This object is assumed to be the only region of interest in the image, and tracking its movement against the background is the goal of the image processing. Under such circumstances, it is argued that a noise reduction strategy based on the knowledge of the motion will be more efficient than other classical methods for dynamic image sequences. This kind of problem is not unusual in the processing of scientific images, especially in the medical field. In this case the presence of noise is critical not only for the degradation of the visual quality, but also for the effectiveness of subsequent processing tasks, such as analysis and clinical interpretation.
这项工作描述了一种过滤被噪声退化的图像序列的方法,其中主要对象以几乎周期性的位移移动。该对象被假定为图像中唯一感兴趣的区域,并且跟踪其在背景下的运动是图像处理的目标。在这种情况下,基于运动知识的降噪策略将比其他经典方法更有效地处理动态图像序列。这种问题在科学图像处理中并不少见,尤其是在医学领域。在这种情况下,噪声的存在不仅对视觉质量的退化至关重要,而且对后续处理任务的有效性也至关重要,例如分析和临床解释。
{"title":"Noise filtering of periodic image sequences","authors":"A. Plebe","doi":"10.1109/ICIAP.2001.957004","DOIUrl":"https://doi.org/10.1109/ICIAP.2001.957004","url":null,"abstract":"This work describes a method for filtering image sequences degraded by noise, where the main object is moving with an almost periodic displacement. This object is assumed to be the only region of interest in the image, and tracking its movement against the background is the goal of the image processing. Under such circumstances, it is argued that a noise reduction strategy based on the knowledge of the motion will be more efficient than other classical methods for dynamic image sequences. This kind of problem is not unusual in the processing of scientific images, especially in the medical field. In this case the presence of noise is critical not only for the degradation of the visual quality, but also for the effectiveness of subsequent processing tasks, such as analysis and clinical interpretation.","PeriodicalId":365627,"journal":{"name":"Proceedings 11th International Conference on Image Analysis and Processing","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2001-09-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132387172","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
A neural network-based image processing system for detection of vandal acts in unmanned railway environments 一种基于神经网络的图像处理系统,用于无人驾驶铁路环境中的破坏行为检测
Pub Date : 2001-09-26 DOI: 10.1109/ICIAP.2001.957064
C. Sacchi, C. Regazzoni, G. Vernazza
Lately, the interest in advanced video-based surveillance applications has been increasing. This is especially true in the field of urban railway transport where video-based surveillance can be exploited to face many relevant security aspects (e.g. vandalism, overcrowding, abandoned object detection etc.). This paper aims at investigating an open problem in the implementation of video-based surveillance systems for transport applications, i.e., the implementation of reliable image understanding modules in order to recognize dangerous situations with reduced false alarm and misdetection rates. We considered the use of a neural network-based classifier for detecting vandal behavior in metro stations. The achieved results show that the classifier achieves very good performance even in the presence of high scene complexity.
最近,人们对基于视频的高级监控应用越来越感兴趣。在城市轨道交通领域尤其如此,视频监控可以被利用来面对许多相关的安全问题(例如破坏行为、过度拥挤、废弃物体检测等)。本文旨在研究交通应用中基于视频的监控系统实现中的一个开放问题,即实现可靠的图像理解模块,以减少误报和误检率来识别危险情况。我们考虑使用基于神经网络的分类器来检测地铁站的破坏行为。实验结果表明,该分类器在场景复杂度较高的情况下也能取得很好的分类效果。
{"title":"A neural network-based image processing system for detection of vandal acts in unmanned railway environments","authors":"C. Sacchi, C. Regazzoni, G. Vernazza","doi":"10.1109/ICIAP.2001.957064","DOIUrl":"https://doi.org/10.1109/ICIAP.2001.957064","url":null,"abstract":"Lately, the interest in advanced video-based surveillance applications has been increasing. This is especially true in the field of urban railway transport where video-based surveillance can be exploited to face many relevant security aspects (e.g. vandalism, overcrowding, abandoned object detection etc.). This paper aims at investigating an open problem in the implementation of video-based surveillance systems for transport applications, i.e., the implementation of reliable image understanding modules in order to recognize dangerous situations with reduced false alarm and misdetection rates. We considered the use of a neural network-based classifier for detecting vandal behavior in metro stations. The achieved results show that the classifier achieves very good performance even in the presence of high scene complexity.","PeriodicalId":365627,"journal":{"name":"Proceedings 11th International Conference on Image Analysis and Processing","volume":"124 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2001-09-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132552683","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 27
Improved minimum distance classification with Gaussian outlier detection for industrial inspection 基于高斯离群点检测的工业检测改进最小距离分类
Pub Date : 2001-09-26 DOI: 10.1109/ICIAP.2001.957073
D. Toth, T. Aach
A pattern recognition system used for industrial inspection has to be highly reliable and fast. The reliability is essential for reducing the cost caused by incorrect decisions, while speed is necessary for real-time operation. We address the problem of inspecting optical media like compact disks and digital versatile disks. As the disks are checked during production and the output of the production line has to be sufficiently high, the time available for the whole examination is very short, ie, about 1 sec per disk. In such real-time applications, the well-known minimum distance algorithm is often used as classifier. However, its main drawback is the unreliability when the training data are not well clustered in feature-space. Here we describe a method for off-line outlier detection, which cleans the training data set and yields substantially better classification results. It works on a statistical test basis. In addition, two improved versions of the minimum distance classifier, which both yield higher rates of correct classification with practically no speed-loss are presented. To evaluate the results, we compare them to the results obtained using a standard minimum distance classifier, a k-nearest neighbor classifier, and a fuzzy k-nearest neighbor classifier.
用于工业检测的模式识别系统必须具有高可靠性和快速性。可靠性对于降低错误决策所带来的成本至关重要,而速度对于实时操作至关重要。我们解决了检查光盘和数字多功能磁盘等光学介质的问题。由于磁盘是在生产过程中检查的,并且生产线的输出必须足够高,因此整个检查的可用时间非常短,即每个磁盘大约1秒。在这类实时应用中,常用最小距离算法作为分类器。然而,它的主要缺点是当训练数据在特征空间中没有很好地聚类时不可靠。在这里,我们描述了一种离线异常值检测方法,该方法可以清理训练数据集并产生更好的分类结果。它在统计测试的基础上起作用。此外,还提出了两种改进的最小距离分类器,这两种方法都能在几乎没有速度损失的情况下获得更高的正确分类率。为了评估结果,我们将它们与使用标准最小距离分类器、k近邻分类器和模糊k近邻分类器获得的结果进行比较。
{"title":"Improved minimum distance classification with Gaussian outlier detection for industrial inspection","authors":"D. Toth, T. Aach","doi":"10.1109/ICIAP.2001.957073","DOIUrl":"https://doi.org/10.1109/ICIAP.2001.957073","url":null,"abstract":"A pattern recognition system used for industrial inspection has to be highly reliable and fast. The reliability is essential for reducing the cost caused by incorrect decisions, while speed is necessary for real-time operation. We address the problem of inspecting optical media like compact disks and digital versatile disks. As the disks are checked during production and the output of the production line has to be sufficiently high, the time available for the whole examination is very short, ie, about 1 sec per disk. In such real-time applications, the well-known minimum distance algorithm is often used as classifier. However, its main drawback is the unreliability when the training data are not well clustered in feature-space. Here we describe a method for off-line outlier detection, which cleans the training data set and yields substantially better classification results. It works on a statistical test basis. In addition, two improved versions of the minimum distance classifier, which both yield higher rates of correct classification with practically no speed-loss are presented. To evaluate the results, we compare them to the results obtained using a standard minimum distance classifier, a k-nearest neighbor classifier, and a fuzzy k-nearest neighbor classifier.","PeriodicalId":365627,"journal":{"name":"Proceedings 11th International Conference on Image Analysis and Processing","volume":"9 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2001-09-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123330493","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 18
A hierarchical framework for modal correspondence matching 模态对应匹配的层次框架
Pub Date : 2001-09-26 DOI: 10.1109/ICIAP.2001.957030
Marco Carcassoni, E. Hancock
The modal correspondence method of L.S. Shapiro and J.M. Brady (see Image and Vision Computing, vol.10, p.283-8, 1992) aims to match point-sets by comparing the eigenvectors of a pairwise point proximity matrix. Although elegant by means of its matrix representation, the method is notoriously susceptible to differences in the relational structure of the point-sets under consideration. We demonstrate how the method can be rendered robust to structural differences by adopting a hierarchical approach. We place the modal matching problem in a probabilistic setting in which the arrangement of pairwise clusters can be used to constrain the individual point correspondences. We commence by using an iterative pairwise clustering method which can be applied to locate the main structure in the point-sets under study. Once we have located point clusters, we compute within-cluster and between-cluster proximity matrices. The modal coefficients for these two sets of proximity matrices are used to compute the probabilities that the detected cluster-centres are in correspondence and also the probabilities that individual points are in correspondence. We develop an evidence-combining framework which draws on these two sets of probabilities to locate point correspondences. In this way, the arrangement of the cluster-centre correspondences constrain the individual point correspondences.
L.S. Shapiro和J.M. Brady的模态对应方法(见《图像与视觉计算》,vol.10, p.283- 8,1992)旨在通过比较成对点接近矩阵的特征向量来匹配点集。虽然它的矩阵表示方式很优雅,但该方法很容易受到所考虑的点集关系结构差异的影响。我们演示了如何通过采用分层方法使该方法对结构差异具有鲁棒性。我们将模态匹配问题置于一个概率设置中,其中成对簇的排列可以用来约束单个点对应。我们首先使用迭代成对聚类方法,该方法可用于定位所研究的点集中的主要结构。一旦我们定位了点簇,我们计算簇内和簇间接近矩阵。这两组接近矩阵的模态系数用于计算检测到的簇中心对应的概率以及单个点对应的概率。我们开发了一个证据组合框架,利用这两组概率来定位点对应。这样,簇中心对应的排列约束了单个点对应。
{"title":"A hierarchical framework for modal correspondence matching","authors":"Marco Carcassoni, E. Hancock","doi":"10.1109/ICIAP.2001.957030","DOIUrl":"https://doi.org/10.1109/ICIAP.2001.957030","url":null,"abstract":"The modal correspondence method of L.S. Shapiro and J.M. Brady (see Image and Vision Computing, vol.10, p.283-8, 1992) aims to match point-sets by comparing the eigenvectors of a pairwise point proximity matrix. Although elegant by means of its matrix representation, the method is notoriously susceptible to differences in the relational structure of the point-sets under consideration. We demonstrate how the method can be rendered robust to structural differences by adopting a hierarchical approach. We place the modal matching problem in a probabilistic setting in which the arrangement of pairwise clusters can be used to constrain the individual point correspondences. We commence by using an iterative pairwise clustering method which can be applied to locate the main structure in the point-sets under study. Once we have located point clusters, we compute within-cluster and between-cluster proximity matrices. The modal coefficients for these two sets of proximity matrices are used to compute the probabilities that the detected cluster-centres are in correspondence and also the probabilities that individual points are in correspondence. We develop an evidence-combining framework which draws on these two sets of probabilities to locate point correspondences. In this way, the arrangement of the cluster-centre correspondences constrain the individual point correspondences.","PeriodicalId":365627,"journal":{"name":"Proceedings 11th International Conference on Image Analysis and Processing","volume":"178 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2001-09-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123344154","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Effective moving cast shadow detection for monocular color image sequences 单眼彩色图像序列的有效运动投影检测
Pub Date : 2001-09-26 DOI: 10.1109/ICIAP.2001.957043
Gsk Fung, N. Yung, G. Pang, A. Lai
For an accurate scene analysis in monocular image sequences, a robust segmentation of a moving object from the static background is generally required. However, the existence of moving cast shadow may lead to an inaccurate object segmentation, and as a result, lead to further erroneous scene analysis. An effective detection of moving cast shadow in monocular color image sequences is developed. Firstly, by realizing the various characteristics of shadow in luminance, chrominance, and gradient density, an indicator, called shadow confidence score, of the probability of the region classified as cast shadow is calculated. Secondly the canny edge detector is employed to detect edge pixels in the detected region. These pixels are then bounded by their convex hull, which estimates the position of the object. Lastly, by analyzing the shadow confidence score and the bounding hull, the cast shadow is identified as those regions outside the bounding hull and with high shadow confidence score. A number of typical outdoor scenes are evaluated and it is shown that our method can effectively detect the associated cast shadow from the object of interest.
为了在单目图像序列中进行准确的场景分析,通常需要从静态背景中对运动物体进行鲁棒分割。然而,移动阴影的存在可能会导致物体分割不准确,从而导致进一步错误的场景分析。提出了一种有效的单眼彩色图像序列运动阴影检测方法。首先,通过实现阴影在亮度、色度、梯度密度等方面的各种特征,计算出被分类为阴影区域的概率指标——阴影置信度分数。其次,利用canny边缘检测器对检测区域的边缘像素进行检测;然后这些像素被它们的凸包包围,凸包估计物体的位置。最后,通过分析阴影置信度得分和边界船体,将投射阴影识别为边界船体外阴影置信度得分高的区域。对一些典型的户外场景进行了评估,结果表明我们的方法可以有效地检测到感兴趣对象的相关投影。
{"title":"Effective moving cast shadow detection for monocular color image sequences","authors":"Gsk Fung, N. Yung, G. Pang, A. Lai","doi":"10.1109/ICIAP.2001.957043","DOIUrl":"https://doi.org/10.1109/ICIAP.2001.957043","url":null,"abstract":"For an accurate scene analysis in monocular image sequences, a robust segmentation of a moving object from the static background is generally required. However, the existence of moving cast shadow may lead to an inaccurate object segmentation, and as a result, lead to further erroneous scene analysis. An effective detection of moving cast shadow in monocular color image sequences is developed. Firstly, by realizing the various characteristics of shadow in luminance, chrominance, and gradient density, an indicator, called shadow confidence score, of the probability of the region classified as cast shadow is calculated. Secondly the canny edge detector is employed to detect edge pixels in the detected region. These pixels are then bounded by their convex hull, which estimates the position of the object. Lastly, by analyzing the shadow confidence score and the bounding hull, the cast shadow is identified as those regions outside the bounding hull and with high shadow confidence score. A number of typical outdoor scenes are evaluated and it is shown that our method can effectively detect the associated cast shadow from the object of interest.","PeriodicalId":365627,"journal":{"name":"Proceedings 11th International Conference on Image Analysis and Processing","volume":"10 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2001-09-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115539143","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 40
Attentive billboards 细心的广告牌
Pub Date : 1900-01-01 DOI: 10.1109/ICIAP.2001.957002
I. Haritaoglu, M. Flickner
We describe a real-time vision system for electronic billboards that can detect and count number of people standing in front of the billboards, determine how long they have been looking at the advertisements currently shown on the billboards, and try to obtain demographics information about the audience automatically to determine when and which advertisements might be shown on the electronic billboard to reach a targeted audience. As the location of billboard, such as coffee shops and stores, are very crowded areas where people either are waiting or moving together individual person cannot be isolated but are partially or total occluded by other people. We combined silhouette and motion-based people detection with fast infrared illumination-based pupil detection to detect people and determine whether they are looking at the billboards or not. Experimental results demonstrate the robustness and real-time performance of the algorithm.
我们描述了一种用于电子广告牌的实时视觉系统,该系统可以检测和计算站在广告牌前的人数,确定他们观看广告牌上当前显示的广告的时间,并尝试自动获取有关受众的人口统计信息,以确定何时以及哪些广告可以在电子广告牌上显示以达到目标受众。由于广告牌的位置,如咖啡店和商店,是非常拥挤的地方,人们要么在等待,要么在一起移动,个人不能被隔离,而是部分或全部被其他人遮挡。我们将基于轮廓和动作的人检测与基于快速红外照明的瞳孔检测相结合,以检测人们并确定他们是否在看广告牌。实验结果证明了该算法的鲁棒性和实时性。
{"title":"Attentive billboards","authors":"I. Haritaoglu, M. Flickner","doi":"10.1109/ICIAP.2001.957002","DOIUrl":"https://doi.org/10.1109/ICIAP.2001.957002","url":null,"abstract":"We describe a real-time vision system for electronic billboards that can detect and count number of people standing in front of the billboards, determine how long they have been looking at the advertisements currently shown on the billboards, and try to obtain demographics information about the audience automatically to determine when and which advertisements might be shown on the electronic billboard to reach a targeted audience. As the location of billboard, such as coffee shops and stores, are very crowded areas where people either are waiting or moving together individual person cannot be isolated but are partially or total occluded by other people. We combined silhouette and motion-based people detection with fast infrared illumination-based pupil detection to detect people and determine whether they are looking at the billboards or not. Experimental results demonstrate the robustness and real-time performance of the algorithm.","PeriodicalId":365627,"journal":{"name":"Proceedings 11th International Conference on Image Analysis and Processing","volume":"65 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115828394","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 13
期刊
Proceedings 11th International Conference on Image Analysis and Processing
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1