Latest articles from 2009 Digital Image Computing: Techniques and Applications


Biometric Authentication Based on Infrared Thermal Hand Vein Patterns

2009 Digital Image Computing: Techniques and Applications

Pub Date : 2009-12-01 DOI: 10.1109/DICTA.2009.63

Amioy Kumar, M. Hanmandlu, V. Madasu, B. Lovell

Hand vein patterns are regarded as one of the safest biometric modalities because of their strong resilience against impostor attacks. This paper presents a new approach to biometric authentication using infrared thermal hand vein patterns. In contrast to existing hand vein features, which are based solely on edge detection, we propose box- and branch-point-based approaches for multiple feature representations. A robust peg-free camera setup is employed for infrared thermal imaging. A region of interest (ROI) is extracted from the vein patterns and convolved with a Gabor filter. Only the real part of this convolution is preserved for further processing. Multiple features are extracted from the real parts of the convolved images using the proposed branch-point-based feature extraction techniques. The multiple features are then integrated at the decision level. AND and OR fusion rules are employed to combine the decisions taken by the individual matchers. Experiments conducted on a database of 100 users yield a False Acceptance Rate (FAR) of 0.1% at a Genuine Acceptance Rate (GAR) of 99% for decision-level fusion.
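The decision-level fusion step in this abstract can be illustrated with a minimal sketch. The scores, thresholds, and the function names `decide` and `fuse` are hypothetical and are not taken from the paper; the sketch only shows how AND and OR rules combine the accept/reject decisions of individual matchers.

```python
from typing import Sequence


def decide(score: float, threshold: float) -> bool:
    """Accept when a (hypothetical) similarity score reaches its threshold."""
    return score >= threshold


def fuse(decisions: Sequence[bool], rule: str) -> bool:
    """Combine individual matcher decisions with the AND or OR rule."""
    if rule == "AND":
        return all(decisions)  # accept only if every matcher accepts
    if rule == "OR":
        return any(decisions)  # accept if at least one matcher accepts
    raise ValueError(f"unknown fusion rule: {rule}")


# Illustrative, made-up scores from two matchers and their thresholds.
scores = [0.82, 0.41]
thresholds = [0.60, 0.50]
decisions = [decide(s, t) for s, t in zip(scores, thresholds)]

print("AND fusion accepts:", fuse(decisions, "AND"))
print("OR fusion accepts:", fuse(decisions, "OR"))
```

In general, the AND rule trades fewer false acceptances for more false rejections, and the OR rule does the opposite, so the choice depends on which error is more costly for the application.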


Citations: 32

A Fast Corner Detector Based on the Chord-to-Point Distance Accumulation Technique

2009 Digital Image Computing: Techniques and Applications

Pub Date : 2009-12-01 DOI: 10.1109/DICTA.2009.91

M. Awrangjeb, Guojun Lu, C. Fraser, M. Ravanbakhsh

The previously proposed contour-based multi-scale corner detector based on the chord-to-point distance accumulation (CPDA) technique has proved its superior robustness over many other single- and multi-scale detectors. However, the original CPDA detector is computationally expensive since it calculates the CPDA discrete curvature on each point of the curve. The proposed improvement obtains a set of probable candidate points before the CPDA curvature estimation. The CPDA curvature is estimated on these chosen candidate points only. Consequently, the improved CPDA detector becomes faster, while retaining a similar robustness to the original CPDA detector.
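The idea of estimating CPDA curvature only at candidate points can be sketched as follows. This is a simplified, single-chord-length illustration under stated assumptions: the abstract does not say how candidates are chosen, so they are passed in as a hypothetical `candidates` list, and all function names are illustrative rather than the authors' implementation.

```python
import math
from typing import Dict, Sequence, Tuple

Point = Tuple[float, float]


def point_to_chord_distance(p: Point, a: Point, b: Point) -> float:
    """Perpendicular distance from p to the line through a and b."""
    (px, py), (ax, ay), (bx, by) = p, a, b
    chord_len = math.hypot(bx - ax, by - ay)
    if chord_len == 0.0:
        return math.hypot(px - ax, py - ay)
    cross = (bx - ax) * (py - ay) - (by - ay) * (px - ax)
    return abs(cross) / chord_len


def cpda_at(curve: Sequence[Point], t: int, chord: int) -> float:
    """Accumulate distances from curve[t] to every chord that spans it.

    A chord joins curve[j] and curve[j + chord]; j runs over the positions
    for which curve[t] lies strictly between the two chord endpoints.
    """
    total = 0.0
    for j in range(t - chord + 1, t):
        if j < 0 or j + chord >= len(curve):
            continue  # open curve: skip chords that fall off either end
        total += point_to_chord_distance(curve[t], curve[j], curve[j + chord])
    return total


def cpda_on_candidates(
    curve: Sequence[Point], candidates: Sequence[int], chord: int
) -> Dict[int, float]:
    """Evaluate the accumulated distance only at the candidate indices."""
    return {t: cpda_at(curve, t, chord) for t in candidates}


# An L-shaped curve: a horizontal run followed by a vertical run.
curve = [(float(x), 0.0) for x in range(6)] + [(5.0, float(y)) for y in range(1, 6)]
result = cpda_on_candidates(curve, candidates=[2, 5], chord=3)
print(result)
```

Index 2 lies on the straight segment and accumulates zero distance, while index 5 (the corner) accumulates a clearly positive value. Restricting the evaluation to a short candidate list is what saves the per-point cost of the original detector.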


Citations: 56

Improved Single Image Dehazing Using Geometry

2009 Digital Image Computing: Techniques and Applications

Pub Date : 2009-12-01 DOI: 10.1109/DICTA.2009.25

Peter Carr, R. Hartley

Images captured in foggy weather conditions exhibit losses in quality that depend on distance. If the depth and atmospheric conditions are known, one can enhance the images (to some degree) by compensating for the effects of the fog. Recently, several investigations have presented methods for recovering depth maps using only the information contained in a single foggy image. Each technique estimates the depth of each pixel independently, and assumes neighbouring pixels will have similar depths. In this work, we exploit the fact that images containing fog are captured by outdoor cameras. As a result, the scene geometry is usually dominated by a ground plane. More importantly, objects that appear towards the top of the image are usually farther away. We show how this preference (implemented as a soft constraint) is compatible with the alpha-expansion optimization technique and illustrate how it can be used to improve the robustness of any single image dehazing technique.
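The statement that fog can be compensated for when depth and atmospheric conditions are known can be made concrete with a small sketch. It assumes the commonly used atmospheric scattering model, I = J·t + A·(1 − t) with t = exp(−β·d), which the abstract does not state explicitly; the parameter names `airlight`, `beta`, and `t_min` are illustrative.

```python
import math


def transmission(depth: float, beta: float) -> float:
    """Fraction of scene radiance that survives the fog over this depth."""
    return math.exp(-beta * depth)


def add_haze(radiance: float, depth: float, airlight: float, beta: float) -> float:
    """Forward model (assumed): I = J * t + A * (1 - t)."""
    t = transmission(depth, beta)
    return radiance * t + airlight * (1.0 - t)


def dehaze_pixel(
    observed: float, depth: float, airlight: float, beta: float, t_min: float = 0.1
) -> float:
    """Invert the assumed model: J = (I - A) / t + A.

    The transmission is clamped at t_min so that very distant pixels do not
    amplify noise without bound.
    """
    t = max(transmission(depth, beta), t_min)
    return (observed - airlight) / t + airlight


# Round trip on a single grey-level pixel with made-up parameters.
hazy = add_haze(radiance=0.2, depth=10.0, airlight=0.9, beta=0.05)
restored = dehaze_pixel(hazy, depth=10.0, airlight=0.9, beta=0.05)
print(hazy, restored)
```

Because the correction divides by the transmission, errors in the estimated depth matter most for distant pixels, which is one reason a geometric prior on depth can help robustness.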


Citations: 82

Feature and Classifier Selection for Automatic Classification of Lesions in Dynamic Contrast-Enhanced MRI of the Breast

2009 Digital Image Computing: Techniques and Applications

Pub Date : 2009-12-01 DOI: 10.1109/DICTA.2009.29

Y. Gal, A. Mehnert, A. Bradley, D. Kennedy, S. Crozier

The clinical interpretation of breast MRI remains largely subjective, and the reported findings qualitative. Although the sensitivity of the method for detecting breast cancer is high, its specificity is poor. Computerised interpretation offers the possibility of improving specificity through objective quantitative measurement. This paper reviews the plethora of such features that have been proposed and presents a preliminary study of the most discriminatory features for dynamic contrast-enhanced MRI of the breast. In particular the results of a feature/classifier selection experiment are presented based on 20 lesions (10 malignant and 10 benign) from 20 routine clinical breast MRI examinations. Each lesion was segmented manually by a clinical radiographer and its diagnostic status confirmed by cytopathology or histopathology. The results show that textural and kinetic, rather than morphometric, features are the most important for lesion classification. They also show that the SVM classifier with sigmoid kernel performs better than other well-known classifiers: Fisher's linear discriminant function, Bayes linear classifier, logistic regression, and SVM with other kernels (distance, exponential, and radial).


Citations: 13

An Efficient and Accurate Iris Segmentation Technique

2009 Digital Image Computing: Techniques and Applications

Pub Date : 2009-12-01 DOI: 10.1109/DICTA.2009.65

Nitin K. Mahadeo, Nandita Bhattacharjee

Accurate segmentation is a crucial phase in the implementation of an iris recognition system. In this paper we investigate a novel technique for iris segmentation. Morphological operations and area computation are applied together with other iris segmentation techniques in order to increase the speed and accuracy of the preprocessing phase. A rough approximation of the pupil’s location is first determined in the initial stage, followed by edge detection and a circular Hough transform for accurate iris segmentation. The edge image used to localize the outer iris border is modified, increasing the speed and accuracy of the process. Finally, we investigate the effect of eyelid detection using a parabolic curve fitting technique. Two data sets of eye images are used to evaluate the proposed techniques. Experimental results show that the proposed segmentation technique is efficient and performs well on both data sets.


Citations: 8

Solving Multilabel Graph Cut Problems with Multilabel Swap

2009 Digital Image Computing: Techniques and Applications

Pub Date : 2009-12-01 DOI: 10.1109/DICTA.2009.90

Peter Carr, R. Hartley

Approximate solutions to labelling problems can be found using binary graph cuts and either the alpha-expansion or alpha-beta swap algorithms. In some specific cases, an exact solution can be computed by constructing a multilabel graph. However, in many practical applications the multilabel graph construction is infeasible due to its excessively large memory requirements. In this work, we expand the concept of alpha-beta swap to consider larger sets of labels at each iteration, and demonstrate how this approach is able to produce good approximate solutions to problems which can be solved using multilabel graph cuts. Furthermore, we show how alpha-expansion is a special case of multilabel swap, and from this new formulation, illustrate how alpha-expansion is now able to handle binary energy functions which do not satisfy the triangle inequality. Compared to alpha-beta swap, multilabel swap is able to produce an approximate solution in a shorter amount of time. We demonstrate the merits of our approach by considering the denoising and stereo problems. We illustrate how multilabel swap can be used in a recursive fashion to produce a good solution quickly and without requiring excessive amounts of memory.


Citations: 17

Optimizing Sharpness Measure for Bright Lesion Detection in Retinal Image Analysis

2009 Digital Image Computing: Techniques and Applications

Pub Date : 2009-12-01 DOI: 10.1109/DICTA.2009.14

Benson S. Y. Lam, Yongsheng Gao, Alan Wee-Chung Liew

Because of the spherical shape of the retina and illumination effects, detecting bright lesions in a retinal image is a challenging problem. Existing methods depend heavily on prior knowledge about lesions: either a user-defined parameter is employed, or a supervised learning technique is adopted to estimate the parameter. In this paper, a novel sharpness measure is proposed, which indicates the degree of sharpness of bright lesions in the whole retinal image. It has a sudden jump at the optimal parameter. A polynomial fitting technique is used to capture this jump. We have tested our method on a publicly available dataset. Experimental results show that the proposed unsupervised approach is able to detect bright lesions accurately in an unhealthy retinal image and that it outperforms the existing supervised learning method. Also, the proposed method reports no abnormality for a healthy retinal image.


Citations: 0

A Fast Number Theoretic Finite Radon Transform

2009 Digital Image Computing: Techniques and Applications

Pub Date : 2009-12-01 DOI: 10.1109/DICTA.2009.67

S. Chandra, I. Svalbe

This paper presents a new fast method to map between images and their digital projections based on the Number Theoretic Transform (NTT) and the Finite Radon Transform (FRT). The FRT is a Discrete Radon Transform (DRT) defined on the same finite geometry as the Finite or Discrete Fourier Transform (DFT). Consequently, it may be inverted directly and exactly via the Fast Fourier Transform (FFT) without any interpolation or filtering [1]. As with the FFT, the FRT can be adapted to square images of arbitrary sizes such as dyadic images, prime-adic images and arbitrary-sized images. However, its simplest form is that of prime-sized images [2]. The FRT also preserves the discrete versions of both the Fourier Slice Theorem (FST) and Convolution Property of the Radon Transform (RT). The NTT is also defined on the same geometry as the DFT and preserves the Circular Convolution Property (CCP) of the DFT [3, 4]. This paper shows that the Slice Theorem is also valid within the NTT and that it can be utilized as a new exact, integer-only and fast inversion scheme for the FRT, with the same computational complexity as the FFT. Digital convolutions and exact digital filtering of projections can also be performed using this Number Theoretic FRT (NFRT).
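For the simplest case mentioned in this abstract, a prime-sized image, the FRT and its exact integer inverse can be sketched directly from the definition. This is the plain O(p³) projection and back-projection form, not the NTT-based fast scheme the paper proposes, and the index convention (projection m, translate t, pixel `image[y][x]`) is one common choice that is assumed here.

```python
from typing import List

Image = List[List[int]]


def frt(image: Image) -> List[List[int]]:
    """Finite Radon Transform of a p x p integer image, with p prime.

    Returns p + 1 projections of length p. Projection m < p sums the pixels
    on the discrete line x = (t + m * y) mod p; projection p holds row sums.
    """
    p = len(image)
    proj = [
        [sum(image[y][(t + m * y) % p] for y in range(p)) for t in range(p)]
        for m in range(p)
    ]
    proj.append([sum(image[t][x] for x in range(p)) for t in range(p)])
    return proj


def ifrt(proj: List[List[int]]) -> Image:
    """Exact inverse by back-projection, using integer arithmetic only."""
    p = len(proj) - 1
    total = sum(proj[0])  # every projection sums to the image total
    image = [[0] * p for _ in range(p)]
    for y in range(p):
        for x in range(p):
            acc = sum(proj[m][(x - m * y) % p] for m in range(p))
            # acc counts pixel (x, y) p times and every other row once.
            image[y][x] = (acc - total + proj[p][y]) // p
    return image


# Round trip on a 5 x 5 test image (5 is prime).
test_image = [[(3 * x + 7 * y * y + x * y) % 11 for x in range(5)] for y in range(5)]
projections = frt(test_image)
recovered = ifrt(projections)
print(recovered == test_image)
```

The inverse needs no interpolation or filtering because, for prime p, the p lines through a pixel (one per slope m) cover every pixel outside that pixel's own row exactly once.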


Citations: 19

Crowd Counting Using Multiple Local Features

2009 Digital Image Computing: Techniques and Applications

Pub Date : 2009-12-01 DOI: 10.1109/DICTA.2009.22

D. Ryan, S. Denman, C. Fookes, S. Sridharan

In public venues, crowd size is a key indicator of crowd safety and stability. Crowding levels can be detected using holistic image features; however, this requires a large amount of training data to capture the wide variations in crowd distribution. If a crowd counting algorithm is to be deployed across a large number of cameras, such a large and burdensome training requirement is far from ideal. In this paper we propose an approach that uses local features to count the number of people in each foreground blob segment, so that the total crowd estimate is the sum of the group sizes. This results in an approach that is scalable to crowd volumes not seen in the training data, and can be trained on a very small data set. As a local approach is used, the proposed algorithm can easily be used to estimate crowd density throughout different regions of the scene and be used in a multi-camera environment. A unique localised approach to ground truth annotation that reduces the required training data is also presented, as a localised approach to crowd counting has different training requirements from a holistic one. Testing on a large pedestrian database compares the proposed technique to existing holistic techniques and demonstrates improved accuracy, and superior performance when test conditions are unseen in the training set or a minimal training set is used.
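The structure of the local approach, estimating a group size per foreground blob and summing, can be sketched as below. The two features (`area`, `perimeter`), the linear model, and its weights are hypothetical placeholders; the abstract does not specify which local features or which regressor the authors use.

```python
from dataclasses import dataclass
from typing import Sequence, Tuple


@dataclass
class Blob:
    area: float       # number of foreground pixels in the blob
    perimeter: float  # number of boundary pixels of the blob


# (weight for area, weight for perimeter, bias): hypothetical trained values.
Weights = Tuple[float, float, float]


def estimate_group_size(blob: Blob, weights: Weights) -> float:
    """Map one blob's local features to a non-negative people estimate."""
    w_area, w_perimeter, bias = weights
    estimate = w_area * blob.area + w_perimeter * blob.perimeter + bias
    return max(0.0, estimate)


def estimate_crowd(blobs: Sequence[Blob], weights: Weights) -> float:
    """Total crowd estimate is the sum of the per-blob group sizes."""
    return sum(estimate_group_size(b, weights) for b in blobs)


# Made-up blobs and weights, for illustration only.
blobs = [Blob(area=100.0, perimeter=40.0), Blob(area=250.0, perimeter=70.0)]
weights = (0.01, 0.0, 0.0)
total = estimate_crowd(blobs, weights)
print(round(total, 2))
```

Because each blob is handled independently, the same function can report density for any sub-region of the scene by summing only the blobs that fall inside it.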


Citations: 298

The Dynamic Decision Switch for Multiple Pixel Connected Component Labeling Algorithm

2009 Digital Image Computing: Techniques and Applications

Pub Date : 2009-12-01 DOI: 10.1109/DICTA.2009.30

Wen-Shan Wang, Ya-Fei Hung, Jen-Kuei Yang, S. Tseng

Connected component labeling is an indispensable task in computer vision applications and one of the most time-consuming. Many labeling algorithms have been introduced, such as scan plus connection table, scan plus union-find, and contour tracing. These algorithms typically represent each binary pixel, which is either 1 or 0, with byte data rather than bit data because of the heavy cost of bitwise operations. This paper proposes a mechanism that uses bit data to represent the binary image pixels and labels multiple pixels in one labeling step, turning the weakness of bit data into a strength. According to test results on an ARM926EJ-S, this new mechanism can double the speed of the scanning and analysis phases of an array-based scan plus union-find algorithm. In addition, the much smaller binary image buffer needed by this mechanism is critical for embedded devices with limited hardware resources, which are gradually being adopted in the field of computer vision.
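The general idea of labeling several bit-packed pixels at once can be sketched with a run-based scan plus union-find. This is an illustrative stand-in, not the paper's dynamic decision switch: each image row is assumed to be stored as one integer bitmask, runs of 1 bits are extracted with bitwise operations, and 4-connectivity is assumed.

```python
from typing import List, Tuple


def find(parent: List[int], i: int) -> int:
    """Return the root label of i, shortening the path along the way."""
    while parent[i] != i:
        parent[i] = parent[parent[i]]
        i = parent[i]
    return i


def union(parent: List[int], a: int, b: int) -> None:
    """Merge the label sets containing a and b."""
    ra, rb = find(parent, a), find(parent, b)
    if ra != rb:
        parent[max(ra, rb)] = min(ra, rb)


def runs_in_row(bits: int) -> List[Tuple[int, int]]:
    """Return (start, end) bit positions, end exclusive, of each run of 1s."""
    runs = []
    pos = 0
    while bits:
        zeros = (bits & -bits).bit_length() - 1  # trailing zero count
        bits >>= zeros
        pos += zeros
        inverted = ~bits
        ones = (inverted & -inverted).bit_length() - 1  # trailing one count
        runs.append((pos, pos + ones))
        bits >>= ones
        pos += ones
    return runs


def count_components(rows: List[int]) -> int:
    """Count 4-connected components in a bit-packed binary image."""
    parent: List[int] = []
    previous: List[Tuple[int, int, int]] = []  # (start, end, label) per run
    for bits in rows:
        current = []
        for start, end in runs_in_row(bits):
            label = len(parent)
            parent.append(label)
            for p_start, p_end, p_label in previous:
                if p_start < end and start < p_end:  # runs share a column
                    union(parent, label, p_label)
            current.append((start, end, label))
        previous = current
    return sum(1 for i in range(len(parent)) if find(parent, i) == i)


# Four rows of a 5-pixel-wide image, one integer per row.
rows = [0b11011, 0b00001, 0b11100, 0b00100]
print(count_components(rows))
```

Treating a whole run as one unit means a single label assignment and at most a few union operations cover many pixels, and the packed rows take one eighth of the memory of a byte-per-pixel buffer.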


Citations: 4
