首页 > 最新文献

2009 IEEE Applied Imagery Pattern Recognition Workshop (AIPR 2009)最新文献

英文 中文
A comparison of bicubic and biquintic interpolators suitable for real-time hardware implementation 适合于实时硬件实现的双三次和双五次插值器的比较
Pub Date : 2009-10-01 DOI: 10.1109/AIPR.2009.5466320
Jonathan Fry, M. Pusateri
Digital multispectral night vision goggles incorporate both imagers and displays that often have different resolutions. While both thermal imager and micro-display technologies continue to produce larger arrays, thermal imagers still lag well behind displays and can require interpolation by a factor of 2.5 in both horizontal and vertical directions. In goggle applications, resizing the imagery streams to the size of the display must occur in real-time with minimal latency. In addition to low latency, a resizing algorithm must produce acceptable imagery, necessitating an understanding of the resized image fidelity and spatial smoothness. While both spatial and spatial frequency domain resizing techniques are available, most spatial frequency techniques require a complete frame for operation introducing unacceptable latency. Spatial domain techniques can be implemented on a neighborhood basis allowing latencies equivalent to several row clock pulses to be achieved. We have already implemented bilinear re-sampling in hardware and, while bilinear re-sampling supports moderate up-sizes with reasonable image quality, its deficiencies are apparent at interpolation ratios of two and greater. We are developing hardware implementations of both bicubic and biquintic resizing algorithms. We present the results of comparison between hardware ready versions of the bicubic and biquintic algorithms with the existing bilinear. We also discuss the hardware requirements for bicubic and biquintic compared to the existing bilinear resizing.
数字多光谱夜视镜包括成像仪和显示器,通常具有不同的分辨率。虽然热成像仪和微显示技术都在继续生产更大的阵列,但热成像仪仍然远远落后于显示器,并且在水平和垂直方向上都需要2.5倍的插值。在谷歌眼镜应用程序中,将图像流的大小调整到显示器的大小必须以最小的延迟实时发生。除了低延迟外,调整大小算法还必须产生可接受的图像,这就需要了解调整后的图像保真度和空间平滑度。虽然空间和空间频域调整技术都是可用的,但大多数空间频率技术需要一个完整的操作框架,从而引入不可接受的延迟。空间域技术可以在邻域基础上实现,允许实现相当于几行时钟脉冲的延迟。我们已经在硬件上实现了双线性重采样,虽然双线性重采样支持适度的放大尺寸和合理的图像质量,但它的缺陷在插值比为2或更大时是明显的。我们正在开发双三次和双五次调整大小算法的硬件实现。我们给出了双三次和双五次算法的硬件版本与现有双线性算法的比较结果。我们还讨论了双三次和双五次调整与现有双线性调整的硬件要求。
{"title":"A comparison of bicubic and biquintic interpolators suitable for real-time hardware implementation","authors":"Jonathan Fry, M. Pusateri","doi":"10.1109/AIPR.2009.5466320","DOIUrl":"https://doi.org/10.1109/AIPR.2009.5466320","url":null,"abstract":"Digital multispectral night vision goggles incorporate both imagers and displays that often have different resolutions. While both thermal imager and micro-display technologies continue to produce larger arrays, thermal imagers still lag well behind displays and can require interpolation by a factor of 2.5 in both horizontal and vertical directions. In goggle applications, resizing the imagery streams to the size of the display must occur in real-time with minimal latency. In addition to low latency, a resizing algorithm must produce acceptable imagery, necessitating an understanding of the resized image fidelity and spatial smoothness. While both spatial and spatial frequency domain resizing techniques are available, most spatial frequency techniques require a complete frame for operation introducing unacceptable latency. Spatial domain techniques can be implemented on a neighborhood basis allowing latencies equivalent to several row clock pulses to be achieved. We have already implemented bilinear re-sampling in hardware and, while bilinear re-sampling supports moderate up-sizes with reasonable image quality, its deficiencies are apparent at interpolation ratios of two and greater. We are developing hardware implementations of both bicubic and biquintic resizing algorithms. We present the results of comparison between hardware ready versions of the bicubic and biquintic algorithms with the existing bilinear. We also discuss the hardware requirements for bicubic and biquintic compared to the existing bilinear resizing.","PeriodicalId":266025,"journal":{"name":"2009 IEEE Applied Imagery Pattern Recognition Workshop (AIPR 2009)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2009-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121338951","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Evaluation methods for curvilinear feature extraction 曲线特征提取的评价方法
Pub Date : 2009-10-01 DOI: 10.1109/AIPR.2009.5466322
Peter Doucette, Ann Martin, Chris Kavanagh, Tim McIntyre, Steven Barton, J. Grodecki, S. Malitz, Matthew Tang, J. Nolting
The application of quantitative performance evaluation methods can provide useful insights in determining the utility of computer-assisted methods for delineating geographic features from remotely sensed images. Evaluation concepts are demonstrated with road centerlines in particular, but are applicable to similar feature types such as paths, trails, or rivers. The two comparative measures used to differentiate conventional versus computer-assisted delineation are 1) user clock time, and 2) spatial consistency. Our evaluation results with road centerlines demonstrate how such quantitative analyses can be used to determine the utility of computer-assisted methods from both developmental and operational perspectives.
定量性能评估方法的应用可以为确定从遥感图像中描绘地理特征的计算机辅助方法的效用提供有用的见解。评估概念特别以道路中心线为例,但也适用于类似的特征类型,如路径、小径或河流。用于区分传统和计算机辅助描绘的两个比较指标是:1)用户时钟时间和2)空间一致性。我们对道路中心线的评估结果表明,从发展和运营的角度来看,如何使用这种定量分析来确定计算机辅助方法的效用。
{"title":"Evaluation methods for curvilinear feature extraction","authors":"Peter Doucette, Ann Martin, Chris Kavanagh, Tim McIntyre, Steven Barton, J. Grodecki, S. Malitz, Matthew Tang, J. Nolting","doi":"10.1109/AIPR.2009.5466322","DOIUrl":"https://doi.org/10.1109/AIPR.2009.5466322","url":null,"abstract":"The application of quantitative performance evaluation methods can provide useful insights in determining the utility of computer-assisted methods for delineating geographic features from remotely sensed images. Evaluation concepts are demonstrated with road centerlines in particular, but are applicable to similar feature types such as paths, trails, or rivers. The two comparative measures used to differentiate conventional versus computer-assisted delineation are 1) user clock time, and 2) spatial consistency. Our evaluation results with road centerlines demonstrate how such quantitative analyses can be used to determine the utility of computer-assisted methods from both developmental and operational perspectives.","PeriodicalId":266025,"journal":{"name":"2009 IEEE Applied Imagery Pattern Recognition Workshop (AIPR 2009)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2009-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124012613","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Advanced hyperspectral detection based on elliptically contoured distribution models and operator feedback 基于椭圆轮廓分布模型和算子反馈的高级高光谱检测
Pub Date : 2009-10-01 DOI: 10.1109/AIPR.2009.5466308
A. Schaum
In autonomous hyperspectral remote sensing systems, the physical causes of false alarms are not all understood. Some arise from vagaries in sensor performance, especially in non-visible wavelengths. Consequently, many false target declarations are characterized simply as outliers, anomalies conforming to no physical or statistical models. Other false alarms arise from clutter spectra too similar to target spectra. To eliminate the recurrence of such difficult errors, deployed systems should allow operator feedback to their signal processing systems. Here we describe how a hyperspectral system using even advanced detection algorithms, based on a elliptically contoured distribution models, can be enhanced by allowing it to learn from its mistakes.
在自主高光谱遥感系统中,误报的物理原因并不完全清楚。有些是由于传感器性能的变化引起的,特别是在非可见光波段。因此,许多错误的目标声明被简单地描述为异常值,不符合物理或统计模型的异常。其他误报是由杂波光谱与目标光谱太相似引起的。为了消除此类困难错误的再次发生,部署的系统应允许操作员向其信号处理系统反馈。在这里,我们描述了如何使用基于椭圆轮廓分布模型的先进检测算法,通过允许它从错误中学习来增强高光谱系统。
{"title":"Advanced hyperspectral detection based on elliptically contoured distribution models and operator feedback","authors":"A. Schaum","doi":"10.1109/AIPR.2009.5466308","DOIUrl":"https://doi.org/10.1109/AIPR.2009.5466308","url":null,"abstract":"In autonomous hyperspectral remote sensing systems, the physical causes of false alarms are not all understood. Some arise from vagaries in sensor performance, especially in non-visible wavelengths. Consequently, many false target declarations are characterized simply as outliers, anomalies conforming to no physical or statistical models. Other false alarms arise from clutter spectra too similar to target spectra. To eliminate the recurrence of such difficult errors, deployed systems should allow operator feedback to their signal processing systems. Here we describe how a hyperspectral system using even advanced detection algorithms, based on a elliptically contoured distribution models, can be enhanced by allowing it to learn from its mistakes.","PeriodicalId":266025,"journal":{"name":"2009 IEEE Applied Imagery Pattern Recognition Workshop (AIPR 2009)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2009-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131920286","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 9
MRI brain image segmentation for spotting tumors using improved mountain clustering approach 基于改进山聚类方法的MRI脑图像分割
Pub Date : 2009-10-01 DOI: 10.1109/AIPR.2009.5466301
N. Verma, Payal Gupta, P. Agrawal, Yan Cui
This paper presents improved mountain clustering technique based MRI (magnetic resonance imaging) brain image segmentation for spotting tumors. The proposed technique is compared with some existing techniques such as K-Means and FCM, clustering. The performance of all these clustering techniques is compared in terms of cluster entropy as a measure of information and also is visually compared for image segmentation of various brain tumor MRI images. The cluster entropy is heuristically determined, but is found to be effective in forming correct clusters as verified by visual assessment.
提出了一种基于改进山聚类技术的磁共振脑图像分割方法。将该方法与现有的K-Means、FCM、聚类等方法进行了比较。对所有这些聚类技术的性能进行了比较,以聚类熵作为信息度量,并对各种脑肿瘤MRI图像的图像分割进行了视觉比较。聚类熵是启发式确定的,但发现通过视觉评估可以有效地形成正确的聚类。
{"title":"MRI brain image segmentation for spotting tumors using improved mountain clustering approach","authors":"N. Verma, Payal Gupta, P. Agrawal, Yan Cui","doi":"10.1109/AIPR.2009.5466301","DOIUrl":"https://doi.org/10.1109/AIPR.2009.5466301","url":null,"abstract":"This paper presents improved mountain clustering technique based MRI (magnetic resonance imaging) brain image segmentation for spotting tumors. The proposed technique is compared with some existing techniques such as K-Means and FCM, clustering. The performance of all these clustering techniques is compared in terms of cluster entropy as a measure of information and also is visually compared for image segmentation of various brain tumor MRI images. The cluster entropy is heuristically determined, but is found to be effective in forming correct clusters as verified by visual assessment.","PeriodicalId":266025,"journal":{"name":"2009 IEEE Applied Imagery Pattern Recognition Workshop (AIPR 2009)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2009-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132683815","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 13
User guided visualization for target search 用户引导的可视化目标搜索
Pub Date : 2009-10-01 DOI: 10.1109/AIPR.2009.5466318
J. Irvine
Automated target cueing (ATC) can assist analysts in searching large volumes of imagery. Performance of most automated systems is less than perfect, requiring an analyst to review the results to dismiss false alarms or confirm correct detections. This paper explores methods for improving the presentation and visualization of the ATC output, enabling more efficient and effective review of the detections flagged by the ATC. The approach relies on the interaction between the user and the ATC results. Confirmation of correct detections and dismissal of false alarms provides information to update the visualization. We present a description of the visualization method and illustrate it with results using panchromatic imagery of vehicles.
自动目标提示(ATC)可以帮助分析人员搜索大量图像。大多数自动化系统的性能并不完美,需要分析人员审查结果以排除错误警报或确认正确的检测。本文探讨了改进ATC输出的呈现和可视化的方法,使ATC标记的检测能够更高效和有效地进行审查。该方法依赖于用户和ATC结果之间的交互。正确检测的确认和假警报的排除提供了更新可视化的信息。我们提出了可视化方法的描述,并用车辆全色图像的结果来说明它。
{"title":"User guided visualization for target search","authors":"J. Irvine","doi":"10.1109/AIPR.2009.5466318","DOIUrl":"https://doi.org/10.1109/AIPR.2009.5466318","url":null,"abstract":"Automated target cueing (ATC) can assist analysts in searching large volumes of imagery. Performance of most automated systems is less than perfect, requiring an analyst to review the results to dismiss false alarms or confirm correct detections. This paper explores methods for improving the presentation and visualization of the ATC output, enabling more efficient and effective review of the detections flagged by the ATC. The approach relies on the interaction between the user and the ATC results. Confirmation of correct detections and dismissal of false alarms provides information to update the visualization. We present a description of the visualization method and illustrate it with results using panchromatic imagery of vehicles.","PeriodicalId":266025,"journal":{"name":"2009 IEEE Applied Imagery Pattern Recognition Workshop (AIPR 2009)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2009-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124760605","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Biologically inspired motion detection neural network models evolved using genetic algorithms 生物启发的运动检测神经网络模型使用遗传算法进化
Pub Date : 2009-10-01 DOI: 10.1109/AIPR.2009.5466326
S. Azary, P. Anderson, R. Gaborski
In this paper we describe a method to evolve biologically inspired motion detection systems utilizing artificial neural networks (ANN's). Previously, the evolution of neural networks has focused on feed-forward neural networks or networks with predefined architectures. The purpose of this paper is to present a novel method for evolving neural networks with no predefined architectures to solve various problems including motion detection models. The neural network models are evolved with genetic algorithms using an encoding that defines a functional network with no restriction on recurrence, activation function types, or the number of nodes that compose the final ANN. The genetic algorithm operates on a population of potential solutions where each potential network is represented in a chromosome. The structure of each chromosome in the population is defined with a weight matrix which allows for efficient simulation of outputs. Each chromosome is evaluated by a fitness function that scores how well the actual output of an ANN compares to the expected output. Crossovers and mutations are made with specified probabilities between population members to evolve new members of the population. After a number of iterations a near optimal network is evolved that solves the problem at hand. The approach has proven to be sufficient to create biologically realistic motion detection neural network models with results that are comparable to results obtained from the standard Reichardt model.
在本文中,我们描述了一种利用人工神经网络(ANN’s)进化生物启发运动检测系统的方法。以前,神经网络的发展主要集中在前馈神经网络或具有预定义架构的网络上。本文的目的是提出一种新的方法来进化神经网络没有预定义的架构,以解决各种问题,包括运动检测模型。神经网络模型是用遗传算法进化的,使用一种编码来定义一个功能网络,不限制递归、激活函数类型或组成最终人工神经网络的节点数量。遗传算法对一群潜在的解决方案进行操作,其中每个潜在的网络用一条染色体表示。种群中每条染色体的结构都用一个权重矩阵来定义,以便有效地模拟输出。每个染色体由适应度函数评估,该函数对人工神经网络的实际输出与预期输出进行评分。在种群成员之间以特定的概率进行交叉和突变,以进化出新的种群成员。经过多次迭代,一个接近最优的网络就能解决当前的问题。该方法已被证明足以创建生物逼真的运动检测神经网络模型,其结果与标准Reichardt模型的结果相当。
{"title":"Biologically inspired motion detection neural network models evolved using genetic algorithms","authors":"S. Azary, P. Anderson, R. Gaborski","doi":"10.1109/AIPR.2009.5466326","DOIUrl":"https://doi.org/10.1109/AIPR.2009.5466326","url":null,"abstract":"In this paper we describe a method to evolve biologically inspired motion detection systems utilizing artificial neural networks (ANN's). Previously, the evolution of neural networks has focused on feed-forward neural networks or networks with predefined architectures. The purpose of this paper is to present a novel method for evolving neural networks with no predefined architectures to solve various problems including motion detection models. The neural network models are evolved with genetic algorithms using an encoding that defines a functional network with no restriction on recurrence, activation function types, or the number of nodes that compose the final ANN. The genetic algorithm operates on a population of potential solutions where each potential network is represented in a chromosome. The structure of each chromosome in the population is defined with a weight matrix which allows for efficient simulation of outputs. Each chromosome is evaluated by a fitness function that scores how well the actual output of an ANN compares to the expected output. Crossovers and mutations are made with specified probabilities between population members to evolve new members of the population. After a number of iterations a near optimal network is evolved that solves the problem at hand. The approach has proven to be sufficient to create biologically realistic motion detection neural network models with results that are comparable to results obtained from the standard Reichardt model.","PeriodicalId":266025,"journal":{"name":"2009 IEEE Applied Imagery Pattern Recognition Workshop (AIPR 2009)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2009-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131292015","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Face recognition using a hybrid model 人脸识别使用混合模型
Pub Date : 2009-10-01 DOI: 10.1109/AIPR.2009.5466296
Yuheng Wang, P. Anderson, R. Gaborski
This paper introduces a hybrid face recognition model that combines biologically inspired features and Local Binary Features. The structure of the model is mainly based on the human visual ventral pathway. Previously, object-centered models focus on extracting global view-invariant representation of faces (I. Biederman, 1987) while feed-forward view-based models (HMAX model by Riesenhuber and Poggio, 1999) extract local features of faces by simulating responses of neurons in the human visual system. In this paper we first review the current main face recognition algorithms: Local Binary Pattern model and R&P model. This is followed by a detailed description of their implementation and advantages in overcoming intra-class variance. Results from our model are compared to the original Riesenhuber and Poggio model and Local Binary Pattern model (T. Ahonen et al, 2005). Then the paper will focus on our hybrid biological model which takes advantages of both structural information and biological features. Our model shows improved recognition rates and increased tolerance to intra-personal view differences.
介绍了一种结合生物特征和局部二值特征的混合人脸识别模型。该模型的结构主要基于人类视觉腹侧通路。以前,以对象为中心的模型侧重于提取人脸的全局视图不变表示(I. Biederman, 1987),而前馈视图模型(Riesenhuber和Poggio的HMAX模型,1999)通过模拟人类视觉系统中神经元的反应来提取人脸的局部特征。本文首先综述了当前主要的人脸识别算法:局部二值模式模型和R&P模型。接下来是对它们的实现和克服类内差异的优势的详细描述。我们的模型结果与原始的Riesenhuber和Poggio模型以及局部二元模式模型(T. Ahonen et al, 2005)进行了比较。然后,本文将重点介绍我们的混合生物模型,该模型既利用了结构信息,又利用了生物特征。我们的模型显示了更高的识别率和对个人观点差异的容忍度。
{"title":"Face recognition using a hybrid model","authors":"Yuheng Wang, P. Anderson, R. Gaborski","doi":"10.1109/AIPR.2009.5466296","DOIUrl":"https://doi.org/10.1109/AIPR.2009.5466296","url":null,"abstract":"This paper introduces a hybrid face recognition model that combines biologically inspired features and Local Binary Features. The structure of the model is mainly based on the human visual ventral pathway. Previously, object-centered models focus on extracting global view-invariant representation of faces (I. Biederman, 1987) while feed-forward view-based models (HMAX model by Riesenhuber and Poggio, 1999) extract local features of faces by simulating responses of neurons in the human visual system. In this paper we first review the current main face recognition algorithms: Local Binary Pattern model and R&P model. This is followed by a detailed description of their implementation and advantages in overcoming intra-class variance. Results from our model are compared to the original Riesenhuber and Poggio model and Local Binary Pattern model (T. Ahonen et al, 2005). Then the paper will focus on our hybrid biological model which takes advantages of both structural information and biological features. Our model shows improved recognition rates and increased tolerance to intra-personal view differences.","PeriodicalId":266025,"journal":{"name":"2009 IEEE Applied Imagery Pattern Recognition Workshop (AIPR 2009)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2009-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126095860","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Robust object recognition using a cascade of geometric consistency filters 鲁棒目标识别使用级联的几何一致性滤波器
Pub Date : 2009-10-01 DOI: 10.1109/AIPR.2009.5466299
Yuetian Xu, R. Madison
Bag-of-words is a popular and successful approach to performing object recognition. Its performance is limited by not considering relative geometry information. This limitation is particularly stark when there is significant image noise. We propose a “bag-of-phrases” model which extends bag-of-words by enforcing geometric consistency through application of a “geometric grammar” in a filter cascade. Experimental results on a computer generated dataset show increased robustness to clutter and noise as demonstrated by more than two orders of magnitude reduction in false positives compared with bag-of-words.
词袋是一种流行且成功的对象识别方法。由于不考虑相对几何信息,其性能受到限制。当有明显的图像噪声时,这种限制尤其明显。我们提出了一个“短语袋”模型,该模型通过在过滤器级联中应用“几何语法”来加强几何一致性,从而扩展了单词袋。在计算机生成的数据集上的实验结果表明,与词袋相比,误报率降低了两个数量级以上,从而提高了对杂波和噪声的鲁棒性。
{"title":"Robust object recognition using a cascade of geometric consistency filters","authors":"Yuetian Xu, R. Madison","doi":"10.1109/AIPR.2009.5466299","DOIUrl":"https://doi.org/10.1109/AIPR.2009.5466299","url":null,"abstract":"Bag-of-words is a popular and successful approach to performing object recognition. Its performance is limited by not considering relative geometry information. This limitation is particularly stark when there is significant image noise. We propose a “bag-of-phrases” model which extends bag-of-words by enforcing geometric consistency through application of a “geometric grammar” in a filter cascade. Experimental results on a computer generated dataset show increased robustness to clutter and noise as demonstrated by more than two orders of magnitude reduction in false positives compared with bag-of-words.","PeriodicalId":266025,"journal":{"name":"2009 IEEE Applied Imagery Pattern Recognition Workshop (AIPR 2009)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2009-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133796074","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Data driven approach to estimating fire danger from satellite images and weather information 从卫星图像和天气信息估计火灾危险的数据驱动方法
Pub Date : 2009-10-01 DOI: 10.1109/AIPR.2009.5466309
N. Markuzon, S. Kolitz
Wildfires cause extensive damage to nature and human developments. Substantial funds are spent preparing for and fighting them. This work develops a data driven approach to modeling the probabilistic risk of a currently burning fire becoming large and dangerous. We based our model upon observations of fire, weather and surrounding extracted from remote satellites. Data driven models reached good recognition accuracy in predicting fire danger in the coming day or two. We intend using the predictions in planning algorithms, e.g. flight plans for unmanned fire surveillance aircraft, to fight the fires in a more efficient and timely manner.
野火对自然和人类发展造成广泛破坏。大量的资金被用于准备和抗击它们。这项工作开发了一种数据驱动的方法来模拟当前燃烧的火灾变得大而危险的概率风险。我们的模型是基于从远程卫星上提取的对火灾、天气和周围环境的观测。数据驱动模型在预测未来一两天的火灾危险方面达到了较好的识别精度。我们打算在规划算法中使用预测,例如无人消防监视飞机的飞行计划,以更有效和及时的方式扑灭火灾。
{"title":"Data driven approach to estimating fire danger from satellite images and weather information","authors":"N. Markuzon, S. Kolitz","doi":"10.1109/AIPR.2009.5466309","DOIUrl":"https://doi.org/10.1109/AIPR.2009.5466309","url":null,"abstract":"Wildfires cause extensive damage to nature and human developments. Substantial funds are spent preparing for and fighting them. This work develops a data driven approach to modeling the probabilistic risk of a currently burning fire becoming large and dangerous. We based our model upon observations of fire, weather and surrounding extracted from remote satellites. Data driven models reached good recognition accuracy in predicting fire danger in the coming day or two. We intend using the predictions in planning algorithms, e.g. flight plans for unmanned fire surveillance aircraft, to fight the fires in a more efficient and timely manner.","PeriodicalId":266025,"journal":{"name":"2009 IEEE Applied Imagery Pattern Recognition Workshop (AIPR 2009)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2009-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129190678","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
Biologically-inspired visible and infrared camera technology development 受生物启发的可见光和红外相机技术的发展
Pub Date : 2009-10-01 DOI: 10.1109/AIPR.2009.5466298
E. Williams, M. Pusateri, J. Scott
Visible band and Infrared (IR) band camera and vision system development has been inspired by the human and animal vision systems. This paper will discuss the development of the Electro-Optical/Infrared (EO/IR) spectrum cameras from the front end optics, the detector or photon to electron convertor, preprocessing such as non-uniformity correction, automatic gain control, foveal vision processing done by the human eye, the gimbal system (human or animal eye ball and head motion), and the analog and digital paths of the data (optic nerve in humans). The computer vision algorithms (human or animal brain vision processing) will not be discussed in this paper. The Integrated Design Services in the College of Engineering at Penn State University has been developing EO/IR camera and sensor based computer vision systems for several years and combined with more than twenty years of developing imaging sensor stabilized platforms will use this imaging system development expertise to describe how the human and animal vision systems inspired the design and development of the computer based vision system. This paper will illustrate a block diagram of both the human eye and a typical EO/IR camera while comparing the two imaging systems.
可见波段和红外(IR)波段相机和视觉系统的发展受到了人类和动物视觉系统的启发。本文将讨论光电/红外(EO/IR)光谱相机的发展,从前端光学,探测器或光子到电子转换器,预处理如非均匀性校正,自动增益控制,人眼的中央凹视觉处理,框架系统(人类或动物的眼球和头部运动),以及数据的模拟和数字路径(人类视神经)。计算机视觉算法(人类或动物的大脑视觉处理)将不会在本文中讨论。宾夕法尼亚州立大学工程学院的综合设计服务部门多年来一直在开发基于EO/IR相机和传感器的计算机视觉系统,并结合20多年来开发成像传感器稳定平台的经验,将利用这种成像系统开发专业知识来描述人类和动物视觉系统如何启发了基于计算机的视觉系统的设计和开发。本文将说明一个框图的人的眼睛和一个典型的EO/IR相机,同时比较两种成像系统。
{"title":"Biologically-inspired visible and infrared camera technology development","authors":"E. Williams, M. Pusateri, J. Scott","doi":"10.1109/AIPR.2009.5466298","DOIUrl":"https://doi.org/10.1109/AIPR.2009.5466298","url":null,"abstract":"Visible band and Infrared (IR) band camera and vision system development has been inspired by the human and animal vision systems. This paper will discuss the development of the Electro-Optical/Infrared (EO/IR) spectrum cameras from the front end optics, the detector or photon to electron convertor, preprocessing such as non-uniformity correction, automatic gain control, foveal vision processing done by the human eye, the gimbal system (human or animal eye ball and head motion), and the analog and digital paths of the data (optic nerve in humans). The computer vision algorithms (human or animal brain vision processing) will not be discussed in this paper. The Integrated Design Services in the College of Engineering at Penn State University has been developing EO/IR camera and sensor based computer vision systems for several years and combined with more than twenty years of developing imaging sensor stabilized platforms will use this imaging system development expertise to describe how the human and animal vision systems inspired the design and development of the computer based vision system. This paper will illustrate a block diagram of both the human eye and a typical EO/IR camera while comparing the two imaging systems.","PeriodicalId":266025,"journal":{"name":"2009 IEEE Applied Imagery Pattern Recognition Workshop (AIPR 2009)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2009-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127364078","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
期刊
2009 IEEE Applied Imagery Pattern Recognition Workshop (AIPR 2009)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1