首页 > 最新文献

Proceedings 11th International Conference on Image Analysis and Processing最新文献

英文 中文
Comparison and combination of adaptive query shifting and feature relevance learning for content-based image retrieval 基于内容的图像检索中自适应查询移位与特征相关学习的比较与结合
Pub Date : 2001-09-26 DOI: 10.1109/ICIAP.2001.957046
G. Giacinto, F. Roli, G. Fumera
Despite the efforts to reduce the semantic gap between user perception of similarity and feature-based representation of images, user interaction is essential to improve retrieval performance in content-based image retrieval. To this end a number of relevance feedback mechanisms are currently adopted to refine image queries. They are aimed either to locally modify the feature space or to shift the query point towards more promising regions of the feature space. A novel adaptive query shifting mechanism is proposed to improve retrieval performance beyond that provided by other relevance feedback mechanisms. In addition we discuss the extent to which query shifting may provide better performance than feature weighting and provide experimental results on the complementarity of the two approaches. Finally, some combinational approaches are proposed to exploit such complementarities.
尽管人们努力减少用户对图像相似性感知和基于特征的图像表示之间的语义差距,但在基于内容的图像检索中,用户交互对于提高检索性能至关重要。为此,目前采用了一些相关反馈机制来改进图像查询。它们的目的要么是局部修改特征空间,要么是将查询点转移到特征空间中更有希望的区域。为了提高检索性能,提出了一种新的自适应查询转移机制。此外,我们还讨论了查询移位在多大程度上比特征加权提供更好的性能,并提供了两种方法互补性的实验结果。最后,提出了一些利用这种互补性的组合方法。
{"title":"Comparison and combination of adaptive query shifting and feature relevance learning for content-based image retrieval","authors":"G. Giacinto, F. Roli, G. Fumera","doi":"10.1109/ICIAP.2001.957046","DOIUrl":"https://doi.org/10.1109/ICIAP.2001.957046","url":null,"abstract":"Despite the efforts to reduce the semantic gap between user perception of similarity and feature-based representation of images, user interaction is essential to improve retrieval performance in content-based image retrieval. To this end a number of relevance feedback mechanisms are currently adopted to refine image queries. They are aimed either to locally modify the feature space or to shift the query point towards more promising regions of the feature space. A novel adaptive query shifting mechanism is proposed to improve retrieval performance beyond that provided by other relevance feedback mechanisms. In addition we discuss the extent to which query shifting may provide better performance than feature weighting and provide experimental results on the complementarity of the two approaches. Finally, some combinational approaches are proposed to exploit such complementarities.","PeriodicalId":365627,"journal":{"name":"Proceedings 11th International Conference on Image Analysis and Processing","volume":"20 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2001-09-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121336379","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 11
Root growth analysis in physiological coordinates 生理坐标下的根生长分析
Pub Date : 2001-09-26 DOI: 10.1109/ICIAP.2001.957074
N. Kirchgessner, H. Spies, H. Scharr, U. Schurr
We present a method for botanical growth analysis of plant roots in physiological coordinates which is necessary for the evaluation of growth mechanisms in roots. The presented framework can be used on long image sequences up to several days. First the displacement vector field is estimated by the structure tensor method. Secondly the physiological coordinates of the root are determined by active contours fitted to the root boundary. Then the middle line as the object coordinate axis of the root is calculated. In the third step the displacement field is sampled and projected on the middle line. This yields an array of tangential displacements along the root which is used to calculate the spatially resolved expansion rate of the root along its length. The performance of the presented framework is demonstrated on both synthetic and real data.
提出了一种植物根系生长的生理坐标分析方法,这是评价根系生长机制所必需的。所提出的框架可用于长达数天的长图像序列。首先用结构张量法估计位移向量场。其次,通过拟合根边界的活动轮廓确定根的生理坐标;然后计算出以中线为对象坐标轴线的根。第三步,对位移场进行采样并在中线上进行投影。这产生沿根的切向位移阵列,用于计算沿其长度的根的空间分辨膨胀率。在综合数据和实际数据上验证了该框架的性能。
{"title":"Root growth analysis in physiological coordinates","authors":"N. Kirchgessner, H. Spies, H. Scharr, U. Schurr","doi":"10.1109/ICIAP.2001.957074","DOIUrl":"https://doi.org/10.1109/ICIAP.2001.957074","url":null,"abstract":"We present a method for botanical growth analysis of plant roots in physiological coordinates which is necessary for the evaluation of growth mechanisms in roots. The presented framework can be used on long image sequences up to several days. First the displacement vector field is estimated by the structure tensor method. Secondly the physiological coordinates of the root are determined by active contours fitted to the root boundary. Then the middle line as the object coordinate axis of the root is calculated. In the third step the displacement field is sampled and projected on the middle line. This yields an array of tangential displacements along the root which is used to calculate the spatially resolved expansion rate of the root along its length. The performance of the presented framework is demonstrated on both synthetic and real data.","PeriodicalId":365627,"journal":{"name":"Proceedings 11th International Conference on Image Analysis and Processing","volume":"20 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2001-09-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121241494","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 11
Similarity-based approach to earthenware reconstruction 基于相似性的陶器重建方法
Pub Date : 2001-09-26 DOI: 10.1109/ICIAP.2001.957055
M. Kanoh, Shogo Yasuhara, H. Itoh, Shohei Kato
This paper proposes an earthenware reconstruction system, which can automatically reconstruct some earthenware from numerous given potsherds in two-dimensional grayscale images. The system supposes that given potsherds are thin and moderately flat and small. Some earthenware having three-dimensional shape, such as crocks, can be reconstructed by the system within the supposition. The system performs earthenware reconstruction through two phases. At the first phase, potsherds are joined automatically in two dimensions. An efficient joint detection algorithm using surface pattern and shape similarity is proposed at this phase. At the second phase, three-dimensional shape is recovered by an adequate three-dimensional transformation. Some experimental results of reconstruction from numerous potsherds are also reported.
本文提出了一种陶器重建系统,该系统可以从大量给定的二维灰度图像中自动重建出部分陶器。该系统假定给定的陶片是薄的、适度平坦的和小的。一些具有三维形状的陶器,如陶器,可以通过假设中的系统进行重构。该系统通过两个阶段对陶器进行改造。在第一阶段,陶片在两个维度上自动连接。在此阶段,提出了一种利用表面图案和形状相似度的高效联合检测算法。在第二阶段,通过适当的三维变换恢复三维形状。本文还报道了大量陶片重建的一些实验结果。
{"title":"Similarity-based approach to earthenware reconstruction","authors":"M. Kanoh, Shogo Yasuhara, H. Itoh, Shohei Kato","doi":"10.1109/ICIAP.2001.957055","DOIUrl":"https://doi.org/10.1109/ICIAP.2001.957055","url":null,"abstract":"This paper proposes an earthenware reconstruction system, which can automatically reconstruct some earthenware from numerous given potsherds in two-dimensional grayscale images. The system supposes that given potsherds are thin and moderately flat and small. Some earthenware having three-dimensional shape, such as crocks, can be reconstructed by the system within the supposition. The system performs earthenware reconstruction through two phases. At the first phase, potsherds are joined automatically in two dimensions. An efficient joint detection algorithm using surface pattern and shape similarity is proposed at this phase. At the second phase, three-dimensional shape is recovered by an adequate three-dimensional transformation. Some experimental results of reconstruction from numerous potsherds are also reported.","PeriodicalId":365627,"journal":{"name":"Proceedings 11th International Conference on Image Analysis and Processing","volume":"28 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2001-09-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126078981","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Adaptive color image compression based on visual attention 基于视觉注意力的自适应彩色图像压缩
Pub Date : 2001-09-26 DOI: 10.1109/ICIAP.2001.957045
N. Ouerhani, J. Bracamonte, Heinz Hugli, M. Ansorge, F. Pellandini
This paper reports an adaptive still color image compression method which produces automatically selected ROI with a higher reconstruction quality with respect to the rest of the input image. The ROI are generated on-the fly with a purely data-driven technique based on visual attention. Inspired from biological vision, the multicue visual attention algorithm detects the most visually salient regions of an image. Thus, when operating in systems with low bit rate constraints, the adaptive coding scheme favors the allocation of a higher number of bits to those image regions that are more conspicuous to the human visual system. The compressed image files produced by this adaptive method are fully compatible with the JPEG standard, which favors their widespread utilization.
本文报道了一种自适应静态彩色图像压缩方法,该方法可以产生相对于输入图像的其余部分具有更高重建质量的自动选择感兴趣区域。ROI是基于视觉注意力的纯数据驱动技术动态生成的。受生物视觉的启发,多重视觉注意算法检测图像中视觉上最显著的区域。因此,当在具有低比特率约束的系统中运行时,自适应编码方案倾向于将更高数量的比特分配给那些对人类视觉系统更明显的图像区域。这种自适应方法产生的压缩图像文件与JPEG标准完全兼容,有利于其广泛应用。
{"title":"Adaptive color image compression based on visual attention","authors":"N. Ouerhani, J. Bracamonte, Heinz Hugli, M. Ansorge, F. Pellandini","doi":"10.1109/ICIAP.2001.957045","DOIUrl":"https://doi.org/10.1109/ICIAP.2001.957045","url":null,"abstract":"This paper reports an adaptive still color image compression method which produces automatically selected ROI with a higher reconstruction quality with respect to the rest of the input image. The ROI are generated on-the fly with a purely data-driven technique based on visual attention. Inspired from biological vision, the multicue visual attention algorithm detects the most visually salient regions of an image. Thus, when operating in systems with low bit rate constraints, the adaptive coding scheme favors the allocation of a higher number of bits to those image regions that are more conspicuous to the human visual system. The compressed image files produced by this adaptive method are fully compatible with the JPEG standard, which favors their widespread utilization.","PeriodicalId":365627,"journal":{"name":"Proceedings 11th International Conference on Image Analysis and Processing","volume":"11 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2001-09-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134007679","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 51
Representing volumetric vascular structures using curve skeletons 用曲线骨架表示体积血管结构
Pub Date : 2001-09-26 DOI: 10.1109/ICIAP.2001.957058
Ingela Nyström, G. S. D. Baja, S. Svensson
This paper describes a technique to represent relevant information of tree-like structures in a compact way. The technique is general. In the application described here, the images are obtained with contrast-enhanced magnetic resonance angiography (MRA). After segmentation, the vessels are reduced to fully reversible surface skeletons. Thereafter a novel approach to curve skeletonization based on the detection of junctions and curves in the surface skeleton is used. This procedure results in a good description of the tree structure of the vessels, where they are represented with a much smaller number of voxels. This representation is suitable for further quantitative analysis, e.g., measurements of vessel width and length.
本文描述了一种以紧凑的方式表示树状结构相关信息的技术。这种技术是通用的。在这里描述的应用中,图像是通过对比增强磁共振血管造影(MRA)获得的。分割后,血管被还原为完全可逆的表面骨架。在此基础上,提出了一种基于曲面骨架中结点和曲线检测的曲线骨架化方法。这个过程可以很好地描述血管的树形结构,其中它们用更少的体素表示。这种表示法适用于进一步的定量分析,例如,测量容器的宽度和长度。
{"title":"Representing volumetric vascular structures using curve skeletons","authors":"Ingela Nyström, G. S. D. Baja, S. Svensson","doi":"10.1109/ICIAP.2001.957058","DOIUrl":"https://doi.org/10.1109/ICIAP.2001.957058","url":null,"abstract":"This paper describes a technique to represent relevant information of tree-like structures in a compact way. The technique is general. In the application described here, the images are obtained with contrast-enhanced magnetic resonance angiography (MRA). After segmentation, the vessels are reduced to fully reversible surface skeletons. Thereafter a novel approach to curve skeletonization based on the detection of junctions and curves in the surface skeleton is used. This procedure results in a good description of the tree structure of the vessels, where they are represented with a much smaller number of voxels. This representation is suitable for further quantitative analysis, e.g., measurements of vessel width and length.","PeriodicalId":365627,"journal":{"name":"Proceedings 11th International Conference on Image Analysis and Processing","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2001-09-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130772339","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 10
A naive approach to compose aerial images in a mosaic fashion 以马赛克的方式组合航空图像的一种天真的方法
Pub Date : 2001-09-26 DOI: 10.1109/ICIAP.2001.957061
D. Tegolo, Cesare Valenti
There is growing interest in multiple sequence image analysis to represent those data in a new landscape, for instance reconstruction of old films, mosaicing of images. This paper focuses attention on the mosaic problem; it introduces a naive method to link together images where a common part of the scene is present among two images. An application has been developed to test the method on aerial sequences of images. Given the long distance of aircraft from the scene, the method assumes images without distortions and without problems of prospective. Moreover, the application does not need any additional parameters coming from human experience and for this reason it can be thought of as a full automated application. In our experimentation the method shows good results and it appears robust and accurate for that kind of image.
人们对多序列图像分析越来越感兴趣,以便在新的场景中表示这些数据,例如旧电影的重建,图像的拼接。本文关注的是马赛克问题;它引入了一种简单的方法,将两幅图像中出现的场景的共同部分连接在一起。开发了一个应用程序来测试该方法在航空图像序列上的应用。考虑到飞机距离现场较远,该方法假设的图像没有失真,也没有前瞻性问题。此外,应用程序不需要任何来自人类经验的额外参数,因此它可以被认为是一个完全自动化的应用程序。实验结果表明,该方法具有较好的鲁棒性和准确性。
{"title":"A naive approach to compose aerial images in a mosaic fashion","authors":"D. Tegolo, Cesare Valenti","doi":"10.1109/ICIAP.2001.957061","DOIUrl":"https://doi.org/10.1109/ICIAP.2001.957061","url":null,"abstract":"There is growing interest in multiple sequence image analysis to represent those data in a new landscape, for instance reconstruction of old films, mosaicing of images. This paper focuses attention on the mosaic problem; it introduces a naive method to link together images where a common part of the scene is present among two images. An application has been developed to test the method on aerial sequences of images. Given the long distance of aircraft from the scene, the method assumes images without distortions and without problems of prospective. Moreover, the application does not need any additional parameters coming from human experience and for this reason it can be thought of as a full automated application. In our experimentation the method shows good results and it appears robust and accurate for that kind of image.","PeriodicalId":365627,"journal":{"name":"Proceedings 11th International Conference on Image Analysis and Processing","volume":"268 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2001-09-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132637106","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
ISAR image analysis by subspace method: automatic extraction and identification of ship profile 基于子空间方法的ISAR图像分析:船舶轮廓的自动提取与识别
Pub Date : 2001-09-26 DOI: 10.1109/ICIAP.2001.957063
A. Maki, K. Fukui, K. Onoguchi, Ken-ichi Maeda
This paper deals with automatic identification of ships in images produced by inverse synthetic aperture radar (ISAR). The ISAR technique reconstructs a rapidly updating sequence of range-Doppler image frames of the target. Due to the physics of imaging based on the target's angular motions, however, images are invariably noisy, and not all frames contain equally useful information. The thrust of this research is to cope with these issues by introducing: (i) a multiframe algorithm to stably extract profiling as a basic feature reflecting the entire characteristics of a target; and (ii) subspace analysis for identification of the extracted profiling especially using the recently proposed constrained mutual subspace method (CMSM). Through preliminary experiments we demonstrate the effective performance of the proposed scheme.
研究了逆合成孔径雷达(ISAR)图像中船舶的自动识别问题。ISAR技术重建了一个快速更新的目标距离-多普勒图像帧序列。然而,由于基于目标角运动的成像物理,图像总是有噪声的,并且不是所有帧都包含同样有用的信息。本研究的重点是通过引入:(i)一种多帧算法来稳定地提取轮廓作为反映目标整体特征的基本特征;(ii)子空间分析用于识别提取的剖面,特别是使用最近提出的约束互子空间方法(CMSM)。通过初步实验验证了该方案的有效性。
{"title":"ISAR image analysis by subspace method: automatic extraction and identification of ship profile","authors":"A. Maki, K. Fukui, K. Onoguchi, Ken-ichi Maeda","doi":"10.1109/ICIAP.2001.957063","DOIUrl":"https://doi.org/10.1109/ICIAP.2001.957063","url":null,"abstract":"This paper deals with automatic identification of ships in images produced by inverse synthetic aperture radar (ISAR). The ISAR technique reconstructs a rapidly updating sequence of range-Doppler image frames of the target. Due to the physics of imaging based on the target's angular motions, however, images are invariably noisy, and not all frames contain equally useful information. The thrust of this research is to cope with these issues by introducing: (i) a multiframe algorithm to stably extract profiling as a basic feature reflecting the entire characteristics of a target; and (ii) subspace analysis for identification of the extracted profiling especially using the recently proposed constrained mutual subspace method (CMSM). Through preliminary experiments we demonstrate the effective performance of the proposed scheme.","PeriodicalId":365627,"journal":{"name":"Proceedings 11th International Conference on Image Analysis and Processing","volume":"77 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2001-09-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132915323","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Real-time disparity analysis for applications in immersive teleconference scenarios-a comparative study 沉浸式电话会议场景应用的实时差异分析——比较研究
Pub Date : 2001-09-26 DOI: 10.1109/ICIAP.2001.957033
O. Schreer, N. Brandenburg, P. Kauff
We present results of a comparative study of different fast disparity analysis approaches. Two of them are well known standard algorithms, while the third is a new approach based on a hybrid block- and pixel-recursive matching scheme. The key idea of the new algorithm is to choose efficiently a small number of candidate vectors in order to reduce the computational effort by simultaneously achieving spatial and temporal consistency in the resulting disparity map. The latter aspect is very important for 3D video conferencing applications, where novel views of the conferee have to be synthesised in order to provide motion parallax. For this application, processing of a video in ITU-Rec. 601 resolution is required. Our new algorithm is able to provide disparity vector fields for both directions (left/spl rarr/right and right/spl rarr/left) in real-time on one 800 MHz Pentium III in reasonable quality. The different disparity algorithms are compared with respect to reliability, quality of the resulting disparities and speed of the algorithm.
我们提出了不同快速差异分析方法的比较研究结果。其中两种是众所周知的标准算法,而第三种是基于混合块和像素递归匹配方案的新方法。该算法的核心思想是高效地选择少量候选向量,从而在得到的视差图中同时实现时空一致性,从而减少计算量。后一个方面对于3D视频会议应用来说非常重要,在这种应用中,为了提供运动视差,必须合成参与者的新视图。对于本应用程序,在ITU-Rec中处理视频。需要601分辨率。我们的新算法能够在一个800 MHz的Pentium III上以合理的质量实时提供两个方向(左/spl rarr/右和右/spl rarr/左)的视差矢量场。对不同的视差算法在可靠性、视差质量和算法速度等方面进行了比较。
{"title":"Real-time disparity analysis for applications in immersive teleconference scenarios-a comparative study","authors":"O. Schreer, N. Brandenburg, P. Kauff","doi":"10.1109/ICIAP.2001.957033","DOIUrl":"https://doi.org/10.1109/ICIAP.2001.957033","url":null,"abstract":"We present results of a comparative study of different fast disparity analysis approaches. Two of them are well known standard algorithms, while the third is a new approach based on a hybrid block- and pixel-recursive matching scheme. The key idea of the new algorithm is to choose efficiently a small number of candidate vectors in order to reduce the computational effort by simultaneously achieving spatial and temporal consistency in the resulting disparity map. The latter aspect is very important for 3D video conferencing applications, where novel views of the conferee have to be synthesised in order to provide motion parallax. For this application, processing of a video in ITU-Rec. 601 resolution is required. Our new algorithm is able to provide disparity vector fields for both directions (left/spl rarr/right and right/spl rarr/left) in real-time on one 800 MHz Pentium III in reasonable quality. The different disparity algorithms are compared with respect to reliability, quality of the resulting disparities and speed of the algorithm.","PeriodicalId":365627,"journal":{"name":"Proceedings 11th International Conference on Image Analysis and Processing","volume":"15 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2001-09-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117091859","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 14
From cross-country autonomous navigation to intelligent deep space communications: visual sensor processing at JPL 从越野自主导航到智能深空通信:喷气推进实验室的视觉传感器处理
Pub Date : 2001-09-26 DOI: 10.1109/ICIAP.2001.957054
R. Manduchi, L. Matthies, F. Pollara
We describe ongoing work at JPL in two fields: autonomous navigation for terrestrial vehicles, and prioritized progressive transmission for deep space communications. While such applications may seem rather disparate, they have in common the need for autonomous reasoning about visual information. We first review a number of techniques currently under development that make use of data from color and infrared cameras, multispectral sensors, and laser rangefinder for estimating properties of the terrain cover in outdoor vegetated terrain. We then discuss how onboard visual analysis mechanisms for Mars rovers can be used for prioritizing the data to be transmitted to Earth, in order to maximize the science return of a mission.
我们描述了喷气推进实验室在两个领域正在进行的工作:地面飞行器的自主导航,以及深空通信的优先渐进传输。虽然这些应用程序可能看起来相当不同,但它们都需要对视觉信息进行自主推理。我们首先回顾了目前正在开发的一些技术,这些技术利用彩色和红外相机、多光谱传感器和激光测距仪的数据来估计室外植被地形的地形覆盖特性。然后,我们讨论了火星探测器的机载视觉分析机制如何用于优先考虑要传输到地球的数据,以最大限度地提高任务的科学回报。
{"title":"From cross-country autonomous navigation to intelligent deep space communications: visual sensor processing at JPL","authors":"R. Manduchi, L. Matthies, F. Pollara","doi":"10.1109/ICIAP.2001.957054","DOIUrl":"https://doi.org/10.1109/ICIAP.2001.957054","url":null,"abstract":"We describe ongoing work at JPL in two fields: autonomous navigation for terrestrial vehicles, and prioritized progressive transmission for deep space communications. While such applications may seem rather disparate, they have in common the need for autonomous reasoning about visual information. We first review a number of techniques currently under development that make use of data from color and infrared cameras, multispectral sensors, and laser rangefinder for estimating properties of the terrain cover in outdoor vegetated terrain. We then discuss how onboard visual analysis mechanisms for Mars rovers can be used for prioritizing the data to be transmitted to Earth, in order to maximize the science return of a mission.","PeriodicalId":365627,"journal":{"name":"Proceedings 11th International Conference on Image Analysis and Processing","volume":"16 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2001-09-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116872738","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Effectiveness evaluation of word characteristics obtained from 3D image information for lipreading 三维图像信息获取的词特征在唇读中的有效性评价
Pub Date : 2001-09-26 DOI: 10.1109/ICIAP.2001.957025
Koji Uda, N. Tagawa, A. Minagawa, T. Moriya
Speech recognition using image information is worthy of remark as one of the next generation of man machine interfaces (MMIs). Several methods that use either voice information or voice information and image information for recognizing words, context and speech have been proposed. Compared to methods that use only voice information, the benefit of using image information is that it is not affected by unwanted sound noise, and so it is applicable in several different environments. However, in general, several constraints are required to capture an image, for example, camera position and the relationship between camera and face. We investigated the effectiveness of using three-dimensional image information for word recognition and found that these constraints are removed. To confirm the effectiveness of the proposed method, the characteristics of two- and three-dimensional images were compared. The results of the word recognition experiment show that the recognition rate for three-dimensional characteristics is higher than that for two-dimensional characteristics.
基于图像信息的语音识别作为下一代人机界面之一,值得关注。人们提出了几种利用语音信息或语音信息和图像信息来识别单词、上下文和语音的方法。与仅使用语音信息的方法相比,使用图像信息的好处是它不受不必要的声音噪声的影响,因此它适用于几种不同的环境。然而,通常情况下,需要几个约束条件来捕获图像,例如,相机的位置和相机与人脸之间的关系。我们研究了使用三维图像信息进行单词识别的有效性,发现这些限制被消除了。为了验证该方法的有效性,对比了二维和三维图像的特征。单词识别实验结果表明,三维特征的识别率高于二维特征的识别率。
{"title":"Effectiveness evaluation of word characteristics obtained from 3D image information for lipreading","authors":"Koji Uda, N. Tagawa, A. Minagawa, T. Moriya","doi":"10.1109/ICIAP.2001.957025","DOIUrl":"https://doi.org/10.1109/ICIAP.2001.957025","url":null,"abstract":"Speech recognition using image information is worthy of remark as one of the next generation of man machine interfaces (MMIs). Several methods that use either voice information or voice information and image information for recognizing words, context and speech have been proposed. Compared to methods that use only voice information, the benefit of using image information is that it is not affected by unwanted sound noise, and so it is applicable in several different environments. However, in general, several constraints are required to capture an image, for example, camera position and the relationship between camera and face. We investigated the effectiveness of using three-dimensional image information for word recognition and found that these constraints are removed. To confirm the effectiveness of the proposed method, the characteristics of two- and three-dimensional images were compared. The results of the word recognition experiment show that the recognition rate for three-dimensional characteristics is higher than that for two-dimensional characteristics.","PeriodicalId":365627,"journal":{"name":"Proceedings 11th International Conference on Image Analysis and Processing","volume":"6 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2001-09-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125363114","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
期刊
Proceedings 11th International Conference on Image Analysis and Processing
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1