首页 > 最新文献

Proceedings 11th International Conference on Image Analysis and Processing最新文献

英文 中文
A neurodynamical retinal network based on reaction-diffusion systems 基于反应扩散系统的神经动力学视网膜网络
Pub Date : 2001-09-26 DOI: 10.1109/ICIAP.2001.957010
M. Keil, G. Cristóbal, H. Neumann
A dynamical model for retinal processing is presented. The model describes the output of retinal ganglion cells whose receptive field is composed of a center and a surround combining linearly. However, in comparison to the classical difference-of-Gaussian (DOG) model, center and surround are generated in two separate layers of reaction-diffusion systems, through a difference in the speed of activity-propagation between both layers. Thus, intra-layer coupling is based exclusively on next-neighbor interactions. This makes the model suitable for VLSI implementation. Furthermore, the layers are connected by equations with feedback-inhibition to form ON-center/OFF-surround and OFF-center/OFF-surround receptive fields. The model's output in the early dynamics corresponds to high-resolution contrast information, whereas the output at later times can be considered as correlated with local brightness and darkness, respectively. To examine this in more detail, simulations with the Hermann/Hering-grid and grating induction were carried out.
提出了一种视网膜加工的动力学模型。该模型描述了接受野由中心和周围线性组合而成的视网膜神经节细胞的输出。然而,与经典的高斯差分(DOG)模型相比,中心和环绕是通过两层之间活动传播速度的差异在反应扩散系统的两个独立层中产生的。因此,层内耦合完全基于相邻交互。这使得该模型适合VLSI的实现。此外,通过反馈抑制方程将各层连接起来,形成ON-center/OFF-surround和OFF-center/OFF-surround感受场。模型的早期动态输出对应于高分辨率对比度信息,而后期的输出可以被认为分别与局部亮度和黑暗相关。为了更详细地验证这一点,使用Hermann/ hering网格和光栅感应进行了模拟。
{"title":"A neurodynamical retinal network based on reaction-diffusion systems","authors":"M. Keil, G. Cristóbal, H. Neumann","doi":"10.1109/ICIAP.2001.957010","DOIUrl":"https://doi.org/10.1109/ICIAP.2001.957010","url":null,"abstract":"A dynamical model for retinal processing is presented. The model describes the output of retinal ganglion cells whose receptive field is composed of a center and a surround combining linearly. However, in comparison to the classical difference-of-Gaussian (DOG) model, center and surround are generated in two separate layers of reaction-diffusion systems, through a difference in the speed of activity-propagation between both layers. Thus, intra-layer coupling is based exclusively on next-neighbor interactions. This makes the model suitable for VLSI implementation. Furthermore, the layers are connected by equations with feedback-inhibition to form ON-center/OFF-surround and OFF-center/OFF-surround receptive fields. The model's output in the early dynamics corresponds to high-resolution contrast information, whereas the output at later times can be considered as correlated with local brightness and darkness, respectively. To examine this in more detail, simulations with the Hermann/Hering-grid and grating induction were carried out.","PeriodicalId":365627,"journal":{"name":"Proceedings 11th International Conference on Image Analysis and Processing","volume":"64 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2001-09-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123209245","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Representing volumetric vascular structures using curve skeletons 用曲线骨架表示体积血管结构
Pub Date : 2001-09-26 DOI: 10.1109/ICIAP.2001.957058
Ingela Nyström, G. S. D. Baja, S. Svensson
This paper describes a technique to represent relevant information of tree-like structures in a compact way. The technique is general. In the application described here, the images are obtained with contrast-enhanced magnetic resonance angiography (MRA). After segmentation, the vessels are reduced to fully reversible surface skeletons. Thereafter a novel approach to curve skeletonization based on the detection of junctions and curves in the surface skeleton is used. This procedure results in a good description of the tree structure of the vessels, where they are represented with a much smaller number of voxels. This representation is suitable for further quantitative analysis, e.g., measurements of vessel width and length.
本文描述了一种以紧凑的方式表示树状结构相关信息的技术。这种技术是通用的。在这里描述的应用中,图像是通过对比增强磁共振血管造影(MRA)获得的。分割后,血管被还原为完全可逆的表面骨架。在此基础上,提出了一种基于曲面骨架中结点和曲线检测的曲线骨架化方法。这个过程可以很好地描述血管的树形结构,其中它们用更少的体素表示。这种表示法适用于进一步的定量分析,例如,测量容器的宽度和长度。
{"title":"Representing volumetric vascular structures using curve skeletons","authors":"Ingela Nyström, G. S. D. Baja, S. Svensson","doi":"10.1109/ICIAP.2001.957058","DOIUrl":"https://doi.org/10.1109/ICIAP.2001.957058","url":null,"abstract":"This paper describes a technique to represent relevant information of tree-like structures in a compact way. The technique is general. In the application described here, the images are obtained with contrast-enhanced magnetic resonance angiography (MRA). After segmentation, the vessels are reduced to fully reversible surface skeletons. Thereafter a novel approach to curve skeletonization based on the detection of junctions and curves in the surface skeleton is used. This procedure results in a good description of the tree structure of the vessels, where they are represented with a much smaller number of voxels. This representation is suitable for further quantitative analysis, e.g., measurements of vessel width and length.","PeriodicalId":365627,"journal":{"name":"Proceedings 11th International Conference on Image Analysis and Processing","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2001-09-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130772339","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 10
Similarity-based approach to earthenware reconstruction 基于相似性的陶器重建方法
Pub Date : 2001-09-26 DOI: 10.1109/ICIAP.2001.957055
M. Kanoh, Shogo Yasuhara, H. Itoh, Shohei Kato
This paper proposes an earthenware reconstruction system, which can automatically reconstruct some earthenware from numerous given potsherds in two-dimensional grayscale images. The system supposes that given potsherds are thin and moderately flat and small. Some earthenware having three-dimensional shape, such as crocks, can be reconstructed by the system within the supposition. The system performs earthenware reconstruction through two phases. At the first phase, potsherds are joined automatically in two dimensions. An efficient joint detection algorithm using surface pattern and shape similarity is proposed at this phase. At the second phase, three-dimensional shape is recovered by an adequate three-dimensional transformation. Some experimental results of reconstruction from numerous potsherds are also reported.
本文提出了一种陶器重建系统,该系统可以从大量给定的二维灰度图像中自动重建出部分陶器。该系统假定给定的陶片是薄的、适度平坦的和小的。一些具有三维形状的陶器,如陶器,可以通过假设中的系统进行重构。该系统通过两个阶段对陶器进行改造。在第一阶段,陶片在两个维度上自动连接。在此阶段,提出了一种利用表面图案和形状相似度的高效联合检测算法。在第二阶段,通过适当的三维变换恢复三维形状。本文还报道了大量陶片重建的一些实验结果。
{"title":"Similarity-based approach to earthenware reconstruction","authors":"M. Kanoh, Shogo Yasuhara, H. Itoh, Shohei Kato","doi":"10.1109/ICIAP.2001.957055","DOIUrl":"https://doi.org/10.1109/ICIAP.2001.957055","url":null,"abstract":"This paper proposes an earthenware reconstruction system, which can automatically reconstruct some earthenware from numerous given potsherds in two-dimensional grayscale images. The system supposes that given potsherds are thin and moderately flat and small. Some earthenware having three-dimensional shape, such as crocks, can be reconstructed by the system within the supposition. The system performs earthenware reconstruction through two phases. At the first phase, potsherds are joined automatically in two dimensions. An efficient joint detection algorithm using surface pattern and shape similarity is proposed at this phase. At the second phase, three-dimensional shape is recovered by an adequate three-dimensional transformation. Some experimental results of reconstruction from numerous potsherds are also reported.","PeriodicalId":365627,"journal":{"name":"Proceedings 11th International Conference on Image Analysis and Processing","volume":"28 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2001-09-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126078981","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Circle detection based on orientation matching 基于方向匹配的圆检测
Pub Date : 2001-09-26 DOI: 10.1109/ICIAP.2001.956995
M. Ceccarelli, A. Petrosino, G. Laccetti
The paper reports a correlation-based method for the detection of circular objects which is capable of overcoming well-known problems arising by the use of gradient-based voting schemes. Specifically, the method is: (a) capable of detecting circular objects on the basis of both magnitude and direction of the image gradient; and (b) of dealing with three-dimensional spherical objects by considering shadows depending on the direction of light. Experimental results about the accuracy of the method and comparisons with the Hough transform and the Hausdorff matching are reported.
本文报告了一种基于相关性的圆形物体检测方法,该方法能够克服使用基于梯度的投票方案所产生的众所周知的问题。具体而言,该方法:(a)能够根据图像梯度的大小和方向检测圆形物体;(b)根据光的方向考虑阴影来处理三维球面物体。实验结果表明了该方法的精度,并与霍夫变换和豪斯多夫匹配进行了比较。
{"title":"Circle detection based on orientation matching","authors":"M. Ceccarelli, A. Petrosino, G. Laccetti","doi":"10.1109/ICIAP.2001.956995","DOIUrl":"https://doi.org/10.1109/ICIAP.2001.956995","url":null,"abstract":"The paper reports a correlation-based method for the detection of circular objects which is capable of overcoming well-known problems arising by the use of gradient-based voting schemes. Specifically, the method is: (a) capable of detecting circular objects on the basis of both magnitude and direction of the image gradient; and (b) of dealing with three-dimensional spherical objects by considering shadows depending on the direction of light. Experimental results about the accuracy of the method and comparisons with the Hough transform and the Hausdorff matching are reported.","PeriodicalId":365627,"journal":{"name":"Proceedings 11th International Conference on Image Analysis and Processing","volume":"21 10","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2001-09-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"113977298","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 16
Towards teleconferencing by view synthesis and large-baseline stereo 面向视点合成和大基线立体的远程会议
Pub Date : 2001-09-26 DOI: 10.1109/ICIAP.2001.957008
F. Isgrò, E. Trucco, Li-Qun Xu
We address the application of computer vision to semi-immersive teleconferencing, and present a prototype vision system synthesising a physically plausible video of a speaker to be displayed at a remote conferencing station. The main system components are a hierarchical, efficient large-baseline disparity estimation and a view synthesis module. We illustrate and discuss some results with a real-speaker sequence. We regard the development of such a system in the domain of advanced teleconferencing as the main contribution of this work.
我们解决了计算机视觉在半沉浸式远程会议中的应用,并提出了一个原型视觉系统,该系统合成了一个在远程会议站显示的演讲者的物理上合理的视频。系统的主要组成部分是分层、高效的大基线视差估计和视图合成模块。我们用一个实录序列来说明和讨论一些结果。我们认为在高级远程会议领域开发这样一个系统是本工作的主要贡献。
{"title":"Towards teleconferencing by view synthesis and large-baseline stereo","authors":"F. Isgrò, E. Trucco, Li-Qun Xu","doi":"10.1109/ICIAP.2001.957008","DOIUrl":"https://doi.org/10.1109/ICIAP.2001.957008","url":null,"abstract":"We address the application of computer vision to semi-immersive teleconferencing, and present a prototype vision system synthesising a physically plausible video of a speaker to be displayed at a remote conferencing station. The main system components are a hierarchical, efficient large-baseline disparity estimation and a view synthesis module. We illustrate and discuss some results with a real-speaker sequence. We regard the development of such a system in the domain of advanced teleconferencing as the main contribution of this work.","PeriodicalId":365627,"journal":{"name":"Proceedings 11th International Conference on Image Analysis and Processing","volume":"4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2001-09-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114440161","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 18
A naive approach to compose aerial images in a mosaic fashion 以马赛克的方式组合航空图像的一种天真的方法
Pub Date : 2001-09-26 DOI: 10.1109/ICIAP.2001.957061
D. Tegolo, Cesare Valenti
There is growing interest in multiple sequence image analysis to represent those data in a new landscape, for instance reconstruction of old films, mosaicing of images. This paper focuses attention on the mosaic problem; it introduces a naive method to link together images where a common part of the scene is present among two images. An application has been developed to test the method on aerial sequences of images. Given the long distance of aircraft from the scene, the method assumes images without distortions and without problems of prospective. Moreover, the application does not need any additional parameters coming from human experience and for this reason it can be thought of as a full automated application. In our experimentation the method shows good results and it appears robust and accurate for that kind of image.
人们对多序列图像分析越来越感兴趣,以便在新的场景中表示这些数据,例如旧电影的重建,图像的拼接。本文关注的是马赛克问题;它引入了一种简单的方法,将两幅图像中出现的场景的共同部分连接在一起。开发了一个应用程序来测试该方法在航空图像序列上的应用。考虑到飞机距离现场较远,该方法假设的图像没有失真,也没有前瞻性问题。此外,应用程序不需要任何来自人类经验的额外参数,因此它可以被认为是一个完全自动化的应用程序。实验结果表明,该方法具有较好的鲁棒性和准确性。
{"title":"A naive approach to compose aerial images in a mosaic fashion","authors":"D. Tegolo, Cesare Valenti","doi":"10.1109/ICIAP.2001.957061","DOIUrl":"https://doi.org/10.1109/ICIAP.2001.957061","url":null,"abstract":"There is growing interest in multiple sequence image analysis to represent those data in a new landscape, for instance reconstruction of old films, mosaicing of images. This paper focuses attention on the mosaic problem; it introduces a naive method to link together images where a common part of the scene is present among two images. An application has been developed to test the method on aerial sequences of images. Given the long distance of aircraft from the scene, the method assumes images without distortions and without problems of prospective. Moreover, the application does not need any additional parameters coming from human experience and for this reason it can be thought of as a full automated application. In our experimentation the method shows good results and it appears robust and accurate for that kind of image.","PeriodicalId":365627,"journal":{"name":"Proceedings 11th International Conference on Image Analysis and Processing","volume":"268 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2001-09-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132637106","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
Adaptive color image compression based on visual attention 基于视觉注意力的自适应彩色图像压缩
Pub Date : 2001-09-26 DOI: 10.1109/ICIAP.2001.957045
N. Ouerhani, J. Bracamonte, Heinz Hugli, M. Ansorge, F. Pellandini
This paper reports an adaptive still color image compression method which produces automatically selected ROI with a higher reconstruction quality with respect to the rest of the input image. The ROI are generated on-the fly with a purely data-driven technique based on visual attention. Inspired from biological vision, the multicue visual attention algorithm detects the most visually salient regions of an image. Thus, when operating in systems with low bit rate constraints, the adaptive coding scheme favors the allocation of a higher number of bits to those image regions that are more conspicuous to the human visual system. The compressed image files produced by this adaptive method are fully compatible with the JPEG standard, which favors their widespread utilization.
本文报道了一种自适应静态彩色图像压缩方法,该方法可以产生相对于输入图像的其余部分具有更高重建质量的自动选择感兴趣区域。ROI是基于视觉注意力的纯数据驱动技术动态生成的。受生物视觉的启发,多重视觉注意算法检测图像中视觉上最显著的区域。因此,当在具有低比特率约束的系统中运行时,自适应编码方案倾向于将更高数量的比特分配给那些对人类视觉系统更明显的图像区域。这种自适应方法产生的压缩图像文件与JPEG标准完全兼容,有利于其广泛应用。
{"title":"Adaptive color image compression based on visual attention","authors":"N. Ouerhani, J. Bracamonte, Heinz Hugli, M. Ansorge, F. Pellandini","doi":"10.1109/ICIAP.2001.957045","DOIUrl":"https://doi.org/10.1109/ICIAP.2001.957045","url":null,"abstract":"This paper reports an adaptive still color image compression method which produces automatically selected ROI with a higher reconstruction quality with respect to the rest of the input image. The ROI are generated on-the fly with a purely data-driven technique based on visual attention. Inspired from biological vision, the multicue visual attention algorithm detects the most visually salient regions of an image. Thus, when operating in systems with low bit rate constraints, the adaptive coding scheme favors the allocation of a higher number of bits to those image regions that are more conspicuous to the human visual system. The compressed image files produced by this adaptive method are fully compatible with the JPEG standard, which favors their widespread utilization.","PeriodicalId":365627,"journal":{"name":"Proceedings 11th International Conference on Image Analysis and Processing","volume":"11 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2001-09-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134007679","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 51
Real-time disparity analysis for applications in immersive teleconference scenarios-a comparative study 沉浸式电话会议场景应用的实时差异分析——比较研究
Pub Date : 2001-09-26 DOI: 10.1109/ICIAP.2001.957033
O. Schreer, N. Brandenburg, P. Kauff
We present results of a comparative study of different fast disparity analysis approaches. Two of them are well known standard algorithms, while the third is a new approach based on a hybrid block- and pixel-recursive matching scheme. The key idea of the new algorithm is to choose efficiently a small number of candidate vectors in order to reduce the computational effort by simultaneously achieving spatial and temporal consistency in the resulting disparity map. The latter aspect is very important for 3D video conferencing applications, where novel views of the conferee have to be synthesised in order to provide motion parallax. For this application, processing of a video in ITU-Rec. 601 resolution is required. Our new algorithm is able to provide disparity vector fields for both directions (left/spl rarr/right and right/spl rarr/left) in real-time on one 800 MHz Pentium III in reasonable quality. The different disparity algorithms are compared with respect to reliability, quality of the resulting disparities and speed of the algorithm.
我们提出了不同快速差异分析方法的比较研究结果。其中两种是众所周知的标准算法,而第三种是基于混合块和像素递归匹配方案的新方法。该算法的核心思想是高效地选择少量候选向量,从而在得到的视差图中同时实现时空一致性,从而减少计算量。后一个方面对于3D视频会议应用来说非常重要,在这种应用中,为了提供运动视差,必须合成参与者的新视图。对于本应用程序,在ITU-Rec中处理视频。需要601分辨率。我们的新算法能够在一个800 MHz的Pentium III上以合理的质量实时提供两个方向(左/spl rarr/右和右/spl rarr/左)的视差矢量场。对不同的视差算法在可靠性、视差质量和算法速度等方面进行了比较。
{"title":"Real-time disparity analysis for applications in immersive teleconference scenarios-a comparative study","authors":"O. Schreer, N. Brandenburg, P. Kauff","doi":"10.1109/ICIAP.2001.957033","DOIUrl":"https://doi.org/10.1109/ICIAP.2001.957033","url":null,"abstract":"We present results of a comparative study of different fast disparity analysis approaches. Two of them are well known standard algorithms, while the third is a new approach based on a hybrid block- and pixel-recursive matching scheme. The key idea of the new algorithm is to choose efficiently a small number of candidate vectors in order to reduce the computational effort by simultaneously achieving spatial and temporal consistency in the resulting disparity map. The latter aspect is very important for 3D video conferencing applications, where novel views of the conferee have to be synthesised in order to provide motion parallax. For this application, processing of a video in ITU-Rec. 601 resolution is required. Our new algorithm is able to provide disparity vector fields for both directions (left/spl rarr/right and right/spl rarr/left) in real-time on one 800 MHz Pentium III in reasonable quality. The different disparity algorithms are compared with respect to reliability, quality of the resulting disparities and speed of the algorithm.","PeriodicalId":365627,"journal":{"name":"Proceedings 11th International Conference on Image Analysis and Processing","volume":"15 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2001-09-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117091859","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 14
From cross-country autonomous navigation to intelligent deep space communications: visual sensor processing at JPL 从越野自主导航到智能深空通信:喷气推进实验室的视觉传感器处理
Pub Date : 2001-09-26 DOI: 10.1109/ICIAP.2001.957054
R. Manduchi, L. Matthies, F. Pollara
We describe ongoing work at JPL in two fields: autonomous navigation for terrestrial vehicles, and prioritized progressive transmission for deep space communications. While such applications may seem rather disparate, they have in common the need for autonomous reasoning about visual information. We first review a number of techniques currently under development that make use of data from color and infrared cameras, multispectral sensors, and laser rangefinder for estimating properties of the terrain cover in outdoor vegetated terrain. We then discuss how onboard visual analysis mechanisms for Mars rovers can be used for prioritizing the data to be transmitted to Earth, in order to maximize the science return of a mission.
我们描述了喷气推进实验室在两个领域正在进行的工作:地面飞行器的自主导航,以及深空通信的优先渐进传输。虽然这些应用程序可能看起来相当不同,但它们都需要对视觉信息进行自主推理。我们首先回顾了目前正在开发的一些技术,这些技术利用彩色和红外相机、多光谱传感器和激光测距仪的数据来估计室外植被地形的地形覆盖特性。然后,我们讨论了火星探测器的机载视觉分析机制如何用于优先考虑要传输到地球的数据,以最大限度地提高任务的科学回报。
{"title":"From cross-country autonomous navigation to intelligent deep space communications: visual sensor processing at JPL","authors":"R. Manduchi, L. Matthies, F. Pollara","doi":"10.1109/ICIAP.2001.957054","DOIUrl":"https://doi.org/10.1109/ICIAP.2001.957054","url":null,"abstract":"We describe ongoing work at JPL in two fields: autonomous navigation for terrestrial vehicles, and prioritized progressive transmission for deep space communications. While such applications may seem rather disparate, they have in common the need for autonomous reasoning about visual information. We first review a number of techniques currently under development that make use of data from color and infrared cameras, multispectral sensors, and laser rangefinder for estimating properties of the terrain cover in outdoor vegetated terrain. We then discuss how onboard visual analysis mechanisms for Mars rovers can be used for prioritizing the data to be transmitted to Earth, in order to maximize the science return of a mission.","PeriodicalId":365627,"journal":{"name":"Proceedings 11th International Conference on Image Analysis and Processing","volume":"16 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2001-09-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116872738","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Effectiveness evaluation of word characteristics obtained from 3D image information for lipreading 三维图像信息获取的词特征在唇读中的有效性评价
Pub Date : 2001-09-26 DOI: 10.1109/ICIAP.2001.957025
Koji Uda, N. Tagawa, A. Minagawa, T. Moriya
Speech recognition using image information is worthy of remark as one of the next generation of man machine interfaces (MMIs). Several methods that use either voice information or voice information and image information for recognizing words, context and speech have been proposed. Compared to methods that use only voice information, the benefit of using image information is that it is not affected by unwanted sound noise, and so it is applicable in several different environments. However, in general, several constraints are required to capture an image, for example, camera position and the relationship between camera and face. We investigated the effectiveness of using three-dimensional image information for word recognition and found that these constraints are removed. To confirm the effectiveness of the proposed method, the characteristics of two- and three-dimensional images were compared. The results of the word recognition experiment show that the recognition rate for three-dimensional characteristics is higher than that for two-dimensional characteristics.
基于图像信息的语音识别作为下一代人机界面之一,值得关注。人们提出了几种利用语音信息或语音信息和图像信息来识别单词、上下文和语音的方法。与仅使用语音信息的方法相比,使用图像信息的好处是它不受不必要的声音噪声的影响,因此它适用于几种不同的环境。然而,通常情况下,需要几个约束条件来捕获图像,例如,相机的位置和相机与人脸之间的关系。我们研究了使用三维图像信息进行单词识别的有效性,发现这些限制被消除了。为了验证该方法的有效性,对比了二维和三维图像的特征。单词识别实验结果表明,三维特征的识别率高于二维特征的识别率。
{"title":"Effectiveness evaluation of word characteristics obtained from 3D image information for lipreading","authors":"Koji Uda, N. Tagawa, A. Minagawa, T. Moriya","doi":"10.1109/ICIAP.2001.957025","DOIUrl":"https://doi.org/10.1109/ICIAP.2001.957025","url":null,"abstract":"Speech recognition using image information is worthy of remark as one of the next generation of man machine interfaces (MMIs). Several methods that use either voice information or voice information and image information for recognizing words, context and speech have been proposed. Compared to methods that use only voice information, the benefit of using image information is that it is not affected by unwanted sound noise, and so it is applicable in several different environments. However, in general, several constraints are required to capture an image, for example, camera position and the relationship between camera and face. We investigated the effectiveness of using three-dimensional image information for word recognition and found that these constraints are removed. To confirm the effectiveness of the proposed method, the characteristics of two- and three-dimensional images were compared. The results of the word recognition experiment show that the recognition rate for three-dimensional characteristics is higher than that for two-dimensional characteristics.","PeriodicalId":365627,"journal":{"name":"Proceedings 11th International Conference on Image Analysis and Processing","volume":"6 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2001-09-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125363114","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
期刊
Proceedings 11th International Conference on Image Analysis and Processing
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1