A system for aircraft recognition in perspective aerial images
Subhodev Das, B. Bhanu, Xingzhi Wu, R. Braithwaite
Proceedings of 1994 IEEE Workshop on Applications of Computer Vision, 5 December 1994. DOI: 10.1109/ACV.1994.341305
Abstract: Recognition of aircraft in complex, perspective aerial imagery must be accomplished in the presence of clutter, occlusion, shadow, and various forms of image degradation. This paper presents a system for aircraft recognition under real-world conditions that is based on a hierarchical database of object models. The approach involves three key processes: (a) the qualitative object recognition process performs model-based symbolic feature extraction and generic object recognition; (b) the refocused matching and evaluation process refines the extracted features for more specific classification with input from (a); and (c) the primitive feature extraction process regulates the extracted features based on their saliency and interacts with (a) and (b). Experimental results showing the qualitative recognition of aircraft in perspective aerial images are presented.
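The generic-to-specific flow the abstract describes (a qualitative match against a hierarchical model database, then refocused matching toward a more specific class) can be caricatured in a few lines. Everything below — the feature names, the hierarchy, and the overlap score — is invented for illustration and is not the authors' database or matching rule.

```python
# Hypothetical hierarchical model database: a generic class lists the symbolic
# features it requires; specific models under it list fuller feature sets.
DATABASE = {
    "aircraft": {
        "required": {"wings", "fuselage"},
        "models": {
            "transport": {"wings", "fuselage", "four-engines"},
            "fighter":   {"wings", "fuselage", "delta-wing"},
        },
    },
}

def recognize(features):
    """Generic-to-specific match over the (illustrative) hierarchy."""
    for generic, node in DATABASE.items():
        if node["required"] <= features:   # qualitative, generic-level match
            # 'refocused' step: pick the specific model with best feature overlap
            best = max(node["models"],
                       key=lambda m: len(node["models"][m] & features))
            return generic, best
    return None   # no generic class accounts for the extracted features
```

A feature set containing a delta wing would thus refine "aircraft" to the "fighter" model; a lone wing feature matches nothing at the generic level.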
Anatomy of a hand-filled form reader
A. K. Chhabra
Proceedings of 1994 IEEE Workshop on Applications of Computer Vision, 5 December 1994. DOI: 10.1109/ACV.1994.341309
Abstract: We describe a prototype generic form reader (GFR) system for reading hand-filled forms. The system can read run-on or touching handprinted characters. A one-time form specification is required for each type of form that the system is expected to read. The form specification includes the geometric locations of registration marks and fields of interest, field grammars, and system parameters. The GFR begins by detecting registration marks, computing image skew, extracting deskewed fields, and computing connected components in the field images. Next, the connected components are split into segments using heuristics about good splitting points. The system is liberal in splitting: a split segment may be part of a character or a complete character, but should be no more than one character. Next, the segments are adaptively regrouped into 'seg-groups' with the aid of a dynamic programming algorithm that matches the character answers for the seg-groups against the field grammar specification. The single-character recognizer (SCR) uses high-order combinations of raw geometric features derived from segments and seg-groups. The high-order combining rules are derived by statistical discriminant analysis of the raw features. The GFR system provides generic tools that can be applied to other document image analysis problems besides forms reading.
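The adaptive regrouping step — dynamic programming that assembles liberally split segments into 'seg-groups' consistent with a field grammar — can be sketched as follows. The scoring function and the grammar (here reduced to an exact character count) are stand-ins, not the paper's SCR or grammar formalism.

```python
# Illustrative DP over split points: group n_segments consecutive segments into
# exactly n_chars characters, maximizing total recognizer confidence.
# score(i, j) is a hypothetical recognizer score that segments i..j-1 form one
# character; max_group bounds how many segments one character may span.

def regroup(n_segments, n_chars, score, max_group=3):
    """Return the best (start, end) group boundaries, or None if impossible."""
    NEG = float("-inf")
    # best[i][k] = best score grouping the first i segments into k characters
    best = [[NEG] * (n_chars + 1) for _ in range(n_segments + 1)]
    back = [[None] * (n_chars + 1) for _ in range(n_segments + 1)]
    best[0][0] = 0.0
    for i in range(1, n_segments + 1):
        for k in range(1, n_chars + 1):
            for g in range(1, min(max_group, i) + 1):
                prev = best[i - g][k - 1]
                if prev > NEG and prev + score(i - g, i) > best[i][k]:
                    best[i][k] = prev + score(i - g, i)
                    back[i][k] = i - g
    if best[n_segments][n_chars] == NEG:
        return None          # grammar (character count) cannot be satisfied
    cuts, i, k = [], n_segments, n_chars
    while k > 0:             # walk back-pointers to recover the boundaries
        j = back[i][k]
        cuts.append((j, i))
        i, k = j, k - 1
    return list(reversed(cuts))
```

With a scorer that strongly prefers two particular pairings, the DP recovers those groups; a character count the segments cannot satisfy yields no parse.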
Frameless registration of MR and CT 3D volumetric data sets
Rakesh Kumar, Kristin J. Dana, P. Anandan, Neil E. Okamoto, J. Bergen, P. Hemler, T. Sumanaweera, P. Elsen, J. Adler
Proceedings of 1994 IEEE Workshop on Applications of Computer Vision, 5 December 1994. DOI: 10.1109/ACV.1994.341316
Abstract: In this paper we present techniques for frameless registration of 3D Magnetic Resonance (MR) and Computed Tomography (CT) volumetric data of the head and spine. We present techniques for estimating a 3D affine or rigid transform which can be used to resample the CT (or MR) data to align with the MR (or CT) data. Our technique transforms the MR and CT data sets with spatial filters so they can be directly matched. The matching is done by a direct optimization technique using a gradient-based descent approach and a coarse-to-fine control strategy over a 4D pyramid. We present results on registering the head and spine data by matching 3D edges, and results on registering cranial ventricle data by matching images filtered by a Laplacian of a Gaussian.
Model supported exploitation: quick look, detection and counting, and change detection
C. Huang, J. Mundy, Charlie Rothwell
Proceedings of 1994 IEEE Workshop on Applications of Computer Vision, 5 December 1994. DOI: 10.1109/ACV.1994.341302
Abstract: Over the last several years the concept of model-supported exploitation (MSE) has evolved to a point where relatively simple computer vision algorithms can extract significant intelligence information from aerial images in a robust and reliable manner. Information extraction is enabled by the use of detailed 3D site models, which provide an extensive context for the application of image analysis algorithms. This paper reviews the basic MSE concept and illustrates the approach using three operational concepts taken from the RADIUS project: quick-look, detection and counting, and focussed change detection.
Genetic labeling and its application to depalletizing robot vision
M. Hashimoto, K. Sumi
Proceedings of 1994 IEEE Workshop on Applications of Computer Vision, 5 December 1994. DOI: 10.1109/ACV.1994.341307
Abstract: Genetic labeling is a new labeling algorithm based on a genetic algorithm (GA). Although several applications of GAs to low-level image processing, such as line detection, have been studied, they still require much computing time. We apply a GA to labeling for scene interpretation. In the proposed chromosome coding, each bit represents the existence of an object. Genetic operations enable efficient labeling based on the building-block hypothesis. We have developed a vision system for a depalletizing robot using this technique: object candidates are properly labeled, and the position of cartons is recognized. In experiments on real images, we estimated that genetic labeling is about 100 times faster than an improved enumeration method, and we found the reliability and speed of the system to be practical.
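The chromosome coding described above — one bit per object candidate, with genetic operations searching for a consistent labeling — might look like the sketch below. The fitness function, the conflict constraints, and all GA parameters are invented for illustration and are not the authors' formulation.

```python
import random

def genetic_label(n_bits, fitness, pop_size=60, generations=100,
                  p_mut=0.02, seed=1):
    """Search for the best bit-labeling: bit i = 'candidate i exists'."""
    rng = random.Random(seed)
    pop = [[rng.randint(0, 1) for _ in range(n_bits)] for _ in range(pop_size)]
    best = max(pop, key=fitness)
    for _ in range(generations):
        ranked = sorted(pop, key=fitness, reverse=True)
        best = max(best, ranked[0], key=fitness)
        parents = ranked[:pop_size // 2]       # truncation selection
        pop = [list(best)]                     # elitism: keep the best labeling
        while len(pop) < pop_size:
            a, b = rng.sample(parents, 2)
            cut = rng.randrange(1, n_bits)     # one-point crossover
            child = a[:cut] + b[cut:]
            for i in range(n_bits):            # bit-flip mutation
                if rng.random() < p_mut:
                    child[i] ^= 1
            pop.append(child)
    return max(pop, key=fitness)

# Illustrative scene constraint: two pairs of mutually exclusive (overlapping)
# object hypotheses; labeling both members of a pair is heavily penalized.
conflicts = [(0, 1), (2, 3)]

def fitness(bits):
    score = sum(bits)                          # reward labeled candidates
    for i, j in conflicts:
        if bits[i] and bits[j]:
            score -= 10                        # penalty for impossible scenes
    return score
```

The building-block intuition is visible here: crossover recombines conflict-free sub-patterns from different parents instead of enumerating all labelings.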
An automated stereoscopic coal profiling system-CCLPS
Philip W. Smith, N. Nandhakumar
Proceedings of 1994 IEEE Workshop on Applications of Computer Vision, 5 December 1994. DOI: 10.1109/ACV.1994.341283
Abstract: This paper describes the design of a binocular stereo system called CCLPS (Computerized Coal Profiling System) that provides dense, accurate disparity maps of coal as it is being transported in open rail cars. After a quantitative analysis of previously developed cepstral correspondence techniques, which highlights the shortcomings of the cepstrum's matching ability in the presence of random noise and severe foreshortening distortion, we present a modified power cepstral approach that is less sensitive to these effects, along with analytical arguments verifying its robustness. The design of the CCLPS system is then discussed in detail and its performance is verified.
Image mosaicing for tele-reality applications
R. Szeliski
Proceedings of 1994 IEEE Workshop on Applications of Computer Vision, 5 December 1994. DOI: 10.1109/ACV.1994.341287
Abstract: This paper presents some techniques for automatically deriving realistic 2-D scenes and 3-D geometric models from video sequences. These techniques can be used to build environments and 3-D models for virtual reality applications based on recreating a true scene, i.e., tele-reality applications. The fundamental technique used in this paper is image mosaicing, i.e., the automatic alignment of multiple images into larger aggregates which are then used to represent portions of a 3-D scene. The paper first examines the easiest problems, those of flat scene and panoramic scene mosaicing. It then progresses to more complicated scenes with depth, and concludes with full 3-D models. The paper also discusses a number of novel applications based on tele-reality technology.
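The core mosaicing operation — aligning overlapping images and compositing them into a larger aggregate — reduces, in one dimension, to the sketch below. Real mosaicing aligns 2-D images under richer motion models; this illustrates only the flat-scene case with a pure (positive, integer) translation, and all names are illustrative.

```python
def align(left, right, max_offset):
    """Find the offset of `right` relative to `left` minimizing overlap MSE."""
    best, best_err = 0, float("inf")
    for off in range(1, max_offset + 1):
        overlap = len(left) - off
        if overlap <= 0:
            break
        err = sum((left[off + i] - right[i]) ** 2
                  for i in range(overlap)) / overlap
        if err < best_err:
            best, best_err = off, err
    return best

def composite(left, right, off):
    """Paste `right` at `off`, averaging samples where the images overlap."""
    out = list(left) + [0.0] * (off + len(right) - len(left))
    for i, v in enumerate(right):
        if off + i < len(left):
            out[off + i] = (out[off + i] + v) / 2   # blend the overlap
        else:
            out[off + i] = v                        # extend the mosaic
    return out
```

Two overlapping windows cut from the same underlying "scene" are realigned exactly, and compositing reproduces the larger scene from its fragments.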
Recursive identification of gesture inputs using hidden Markov models
J. Schlenzig, E. Hunter, R. Jain
Proceedings of 1994 IEEE Workshop on Applications of Computer Vision, 5 December 1994. DOI: 10.1109/ACV.1994.341308
Abstract: Human-machine interfaces play a role of growing importance as computer technology continues to evolve. Motivated by the desire to provide users with an intuitive gesture input system, we describe the design of a recursive filter applied to the vision-based gesture interpretation problem. The gestures are modeled as a hidden Markov model, with the state representing the gesture sequences and the observations being the current static hand pose. At each time step the recursive filter updates its estimate of what gesture is occurring based on the currently extracted pose information. The result is a robust system which provides the user with continual feedback during compound gestures.
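The recursive update the abstract describes — fold the gesture-transition model into the current belief, then reweight by the likelihood of the observed hand pose — is the standard HMM forward filter. The tiny two-state model in the usage below is invented; the paper's state space represents gesture sequences.

```python
def hmm_filter_step(belief, trans, emit, obs):
    """One forward-filter update: predict with `trans`, correct with `emit`.

    belief[i]  - current probability of HMM state i
    trans[i][j] - probability of moving from state i to state j
    emit[j][o]  - probability of observing pose o in state j
    obs         - index of the currently observed static hand pose
    """
    n = len(belief)
    predicted = [sum(belief[i] * trans[i][j] for i in range(n))
                 for j in range(n)]
    updated = [predicted[j] * emit[j][obs] for j in range(n)]
    total = sum(updated)
    return [u / total for u in updated]   # normalize to a distribution
```

Applied once per frame, this keeps a running distribution over gestures, which is exactly what allows continual feedback mid-gesture rather than a decision only at the end.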
Modelling issues in vision based aircraft navigation during landing
Tarun Soni, B. Sridhar
Proceedings of 1994 IEEE Workshop on Applications of Computer Vision, 5 December 1994. DOI: 10.1109/ACV.1994.341293
Abstract: This paper investigates the feasibility of using visual and infrared imaging sensors to aid in locating the aircraft during operations such as landing in bad weather. The choice of the airport model used is crucial to algorithms which estimate position based on pattern recognition. In this paper we describe the effects the choice of a model has on the behaviour of such matching algorithms. Three basic models are chosen: a line segment based model, an area based model, and a texture based model. It is seen that a sparse line segment based model is not adequate to identify the runway, since it matches a number of false artifacts in the image. An enhanced line segment based model containing a large number of features compares favourably with the area based model. The texture based model is seen to need a number of camera and weather dependent parameters, and its performance is not seen to be substantially better. Thus either a proper area based model or a pseudo-area based model (based on a very large number of line features) can be seen to provide the best performance for such landmark identification and position determination algorithms.
Real-time visual tracking using correlation techniques
M. W. Eklund, G. Ravichandran, M. Trivedi, S. B. Marapane
Proceedings of 1994 IEEE Workshop on Applications of Computer Vision, 5 December 1994. DOI: 10.1109/ACV.1994.341319
Abstract: A real-time correspondence-based tracking algorithm is detailed. The system uses a pipeline processor, a general purpose processor, a camera, and a display. The Minimum Noise and Correlation Energy (MINACE) filter is used in the tracking algorithm as it provides a good combination of speed, accuracy, and flexibility for the targeted hardware system. The system designed is fast, and tracking is accomplished at a rate of 15 Hz. The system is adaptive and does not rely on a previous model of the object; the training image for filter synthesis is acquired from previous image frames, and the filter is synthesized online to accommodate 3-D variations of the target being tracked. The system tracks an object consistently, as is demonstrated by the low deviation of the results in the evaluation. The correlation filter-based tracking algorithm has proved to be useful in our research in cooperative mobile robots. A visual servoing system has been implemented using this tracking algorithm for convoying of multiple mobile robots.
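The MINACE filter is synthesized in the frequency domain; as a hedged stand-in, the sketch below shows the adaptive tracking loop with plain normalized cross-correlation, re-acquiring the template from the previous frame in the spirit of the paper's online filter synthesis. The 1-D frames and all parameters are illustrative.

```python
import math

def ncc(a, b):
    """Normalized cross-correlation of two equal-length windows."""
    ma, mb = sum(a) / len(a), sum(b) / len(b)
    num = sum((x - ma) * (y - mb) for x, y in zip(a, b))
    den = math.sqrt(sum((x - ma) ** 2 for x in a) *
                    sum((y - mb) ** 2 for y in b))
    return num / den if den else 0.0

def track(frames, start, width):
    """Track a pattern across 1-D frames, updating the template each frame."""
    pos = start
    template = frames[0][pos:pos + width]
    path = [pos]
    for frame in frames[1:]:
        # correlate the template against every window of the new frame
        scores = [(ncc(template, frame[p:p + width]), p)
                  for p in range(len(frame) - width + 1)]
        pos = max(scores)[1]                 # peak of the correlation surface
        template = frame[pos:pos + width]    # adaptive: resynthesize from frame
        path.append(pos)
    return path
```

Re-acquiring the template each frame is what lets the tracker follow gradual appearance change (the paper's 3-D target variations), at the cost of possible drift that a fixed-model tracker would not suffer.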