
12th International Conference on Image Analysis and Processing, 2003. Proceedings: latest articles

Modeling the world: the virtualization pipeline
Pub Date : 2003-09-17 DOI: 10.1109/ICIAP.2003.1234044
J. Kautz, H. Lensch, M. Goesele, J. Lang, H. Seidel
High-quality virtual 3D models are quickly emerging as a new multimedia data type with applications in such diverse areas as e-commerce, online encyclopaedias, and virtual museums, to name just a few. The paper presents new algorithms and techniques for the acquisition of, and real-time interaction with, complex textured 3D objects and shows how these results can be seamlessly integrated with previous work into a single framework for the acquisition, processing, and interactive display of high-quality 3D models. In addition to pure geometry, such algorithms also have to take into account the texture of an object (which is crucial for a realistic appearance) and its reflectance behavior. The measurement of accurate material properties is an important step towards photorealistic rendering, where both the general surface properties and the spatially varying effects of the object are needed. Recent work on the image-based reconstruction of spatially varying BRDFs (bidirectional reflectance distribution functions) enables the generation of high-quality models of real objects from a sparse set of input data. Efficient use of the capabilities of advanced PC graphics hardware allows for interactive rendering under arbitrary viewing and lighting conditions and realistically reproduces the appearance of the original object.
Citations: 2
A graphics hardware implementation of the generalized Hough transform for fast object recognition, scale, and 3D pose detection
Pub Date : 2003-09-17 DOI: 10.1109/ICIAP.2003.1234048
R. Strzodka, Ivo Ihrke, M. Magnor
The generalized Hough transform constitutes a well-known approach to object recognition and pose detection. To attain reliable detection results, however, a very large number of candidate object poses and scale factors need to be considered. We employ an inexpensive, consumer-market graphics card as the "poor man's" parallel processing system. We describe the implementation of a fast and enhanced version of the generalized Hough transform on graphics hardware. Thanks to the high bandwidth of on-board texture memory, a single pose can be evaluated in less than 3 ms, independent of the number of edge pixels in the image. From known object geometry, our hardware-accelerated generalized Hough transform algorithm is capable of detecting an object's 3D pose, scale, and position in the image in less than one minute. A good pose estimate is delivered in less than 10 seconds.
Citations: 60
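To make the voting idea behind the generalized Hough transform concrete, here is a minimal CPU sketch in NumPy. It is not the authors' graphics-hardware implementation: it handles translation only, omits the gradient-orientation lookup of the classic formulation, and the names `hough_votes`, `offsets`, and `image` are hypothetical, as is the toy data.

```python
import numpy as np

def hough_votes(edge_mask, template_offsets):
    """Accumulate votes for the template reference point.

    edge_mask: 2D boolean array marking edge pixels.
    template_offsets: (N, 2) integer array of (row, col) offsets from the
        reference point to each template edge point.
    """
    h, w = edge_mask.shape
    acc = np.zeros((h, w), dtype=np.int32)
    rows, cols = np.nonzero(edge_mask)
    for dr, dc in template_offsets:
        # An edge pixel at (r, c) that matches this template point implies
        # a reference point at (r - dr, c - dc).
        rr = rows - dr
        cc = cols - dc
        ok = (rr >= 0) & (rr < h) & (cc >= 0) & (cc < w)
        np.add.at(acc, (rr[ok], cc[ok]), 1)
    return acc

# Hypothetical toy data: an L-shaped template placed at reference (10, 12).
offsets = np.array([[0, 0], [1, 0], [2, 0], [2, 1], [2, 2]])
image = np.zeros((32, 32), dtype=bool)
image[10 + offsets[:, 0], 12 + offsets[:, 1]] = True

acc = hough_votes(image, offsets)
peak = np.unravel_index(np.argmax(acc), acc.shape)
```

Searching over scale and rotation would amount to transforming `template_offsets` for each candidate pose and repeating the vote, which is why the number of poses to evaluate grows so quickly.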
Estimation of 3D gazed position using view lines
Pub Date : 2003-09-17 DOI: 10.1109/ICIAP.2003.1234094
Ikuhisa Mitsugami, N. Ukita, M. Kidode
We propose a new wearable system that can estimate the 3D position of a gazed point by measuring multiple binocular view lines. In principle, 3D measurement is possible by the triangulation of binocular view lines. However, it is difficult to measure these lines accurately with a device for eye tracking, because of errors caused by (1) difficulty in calibrating the device and (2) the limitation that a human cannot gaze very accurately at a distant point. Concerning (1), the accuracy of calibration can be improved by considering the optical properties of a camera in the device. To solve (2), we propose a stochastic algorithm that determines a gazed 3D position by integrating information of view lines observed at multiple head positions. We validated the effectiveness of the proposed algorithm experimentally.
Citations: 25
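The triangulation of two view lines mentioned in the abstract above can be sketched as follows. Because measured lines rarely intersect exactly, this sketch returns the midpoint of the shortest segment between them. This is a generic geometric construction, not the authors' stochastic algorithm, and the function name and eye positions are hypothetical.

```python
import numpy as np

def closest_point_between_lines(p1, d1, p2, d2):
    """Midpoint of the shortest segment joining two 3D lines p + t * d."""
    p1, d1, p2, d2 = (np.asarray(v, dtype=float) for v in (p1, d1, p2, d2))
    r = p1 - p2
    a, b, c = d1 @ d1, d1 @ d2, d2 @ d2
    d, e = d1 @ r, d2 @ r
    denom = a * c - b * b
    if abs(denom) < 1e-12:
        raise ValueError("view lines are (nearly) parallel")
    # Parameters of the mutually closest points on each line.
    t1 = (b * e - c * d) / denom
    t2 = (a * e - b * d) / denom
    return 0.5 * ((p1 + t1 * d1) + (p2 + t2 * d2))

# Hypothetical setup: two eyes 6 cm apart looking at one target (metres).
left_eye = np.array([-0.03, 0.0, 0.0])
right_eye = np.array([0.03, 0.0, 0.0])
target = np.array([0.1, 0.2, 1.0])
estimate = closest_point_between_lines(
    left_eye, target - left_eye, right_eye, target - right_eye
)
```

For distant targets the two lines become nearly parallel and `denom` approaches zero, so small angular errors produce large depth errors. That sensitivity is consistent with the abstract's motivation for combining observations from multiple head positions.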
Old fashioned state-of-the-art image classification
Pub Date : 2003-09-17 DOI: 10.1109/ICIAP.2003.1234110
A. Barla, F. Odone, A. Verri
In this paper we present a statistical learning scheme for image classification based on a mixture of old-fashioned ideas and state-of-the-art learning tools. We represent input images through high-dimensional and usually sparse histograms which, depending on the task, are either color histograms or co-occurrence matrices. Support vector machines are trained on these sparse inputs directly, to solve problems like indoor/outdoor classification and cityscape retrieval from image databases. The experimental results indicate that the use of a kernel function derived from the computer vision literature leads to better recognition results than off-the-shelf kernels. According to our findings, it appears that image classification problems can be addressed with no need for explicit feature extraction or dimensionality reduction stages. We argue that this might be used as the starting point for developing image classification systems which can be easily tuned to a number of different tasks.
Citations: 22
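The abstract above does not name the kernel it borrows from the computer vision literature. As an illustration only, the sketch below computes histogram intersection, a well-known similarity measure for histograms from that literature. Treating it as the authors' kernel is an assumption, and `histogram_intersection_kernel` and the toy histograms are hypothetical.

```python
import numpy as np

def histogram_intersection_kernel(A, B):
    """Gram matrix K[i, j] = sum_k min(A[i, k], B[j, k])."""
    A = np.asarray(A, dtype=float)
    B = np.asarray(B, dtype=float)
    # Broadcast to shape (len(A), len(B), n_bins), then sum over the bins.
    return np.minimum(A[:, None, :], B[None, :, :]).sum(axis=2)

# Hypothetical normalised 3-bin histograms.
hists = np.array([[0.5, 0.5, 0.0],
                  [0.0, 0.5, 0.5],
                  [1.0, 0.0, 0.0]])
K = histogram_intersection_kernel(hists, hists)
```

A Gram matrix like `K` could be handed to any SVM implementation that accepts precomputed kernels, so the histograms themselves are used directly, with no separate feature extraction or dimensionality reduction step.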
Content-based video summarization and adaptation for ubiquitous media access
Pub Date : 2003-09-17 DOI: 10.1109/ICIAP.2003.1234098
Shih-Fu Chang
Today's mobile and wireless users access multimedia content from different types of networks and terminals. Content analysis plays a critical role in developing effective solutions that meet the unique resource constraints and user preferences of such usage environments. Specifically, content analysis is central to automatic discovery of syntactic-level summaries and generation of concise semantic-level summaries. Content analysis also provides a promising direction for finding optimal adaptation methods under various resource-utility constraints. The paper presents brief overviews of these emerging, fruitful areas and of promising research directions.
Citations: 16
An eigenvector method for shape-from-shading
Pub Date : 2003-09-17 DOI: 10.1109/ICIAP.2003.1234095
A. Robles-Kelly, E. Hancock
We explore how spectral methods for graph seriation can be used to develop a new shape-from-shading algorithm. We characterise the field of surface normals using a transition matrix whose elements are computed from the sectional curvature between different image locations. We use a graph seriation method to define a curvature minimising surface integration path for the purposes of height reconstruction. To smooth the reconstructed surface, we fit quadric patches to the height data. The smoothed surface normal directions are updated ensuring compliance with Lambert's law. The processes of height recovery and surface normal adjustment are interleaved and iterated until a stable surface is obtained. We provide results on synthetic and real-world imagery.
Citations: 1
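Lambert's law, which the abstract above uses as a constraint on the surface normals, says that the brightness of a matte surface is proportional to the cosine of the angle between the surface normal and the light direction. The sketch below only evaluates that law for given normals. It does not reproduce the seriation or surface-fitting steps, and the function name and sample vectors are hypothetical.

```python
import numpy as np

def lambertian_intensity(normals, light, albedo=1.0):
    """Brightness albedo * max(0, n . s) for unit normal n and light s."""
    normals = np.asarray(normals, dtype=float)
    light = np.asarray(light, dtype=float)
    n = normals / np.linalg.norm(normals, axis=-1, keepdims=True)
    s = light / np.linalg.norm(light)
    # Surfaces facing away from the light receive no illumination.
    return albedo * np.clip(n @ s, 0.0, None)

# Hypothetical normals: facing the light, side-on, and facing away.
normals = np.array([[0.0, 0.0, 1.0],
                    [1.0, 0.0, 0.0],
                    [0.0, 0.0, -1.0]])
intensity = lambertian_intensity(normals, light=[0.0, 0.0, 1.0])
```

A single brightness value fixes only the angle between the normal and the light direction, not the normal itself, which is why shape-from-shading methods need additional constraints such as the curvature-based integration path described in the abstract.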
Recognising moving hand shapes
Pub Date : 2003-09-17 DOI: 10.1109/ICIAP.2003.1234018
E. Holden, R. Owens
The paper presents a new hand shape representation technique that characterises the finger-only topology of the hand, by adapting an existing technique from speech signal processing. From a moving hand sequence, the tracking algorithm determines the centre of the largest convex subset of the hand, using a combination of pattern matching and condensation algorithms. A hand shape feature represents the topological formation of the finger-only regions of the hand using a linear predictive coding parameter set called cepstral coefficients. Experimental results demonstrate the effectiveness of detecting the shape feature from motion sequences.
Citations: 17
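In speech processing, cepstral coefficients are commonly obtained from linear predictive coding (LPC) coefficients by a standard recursion, sketched below. The sketch assumes the all-pole model 1 / (1 - sum_k a_k z^-k), so the sign convention on `a` is an assumption. How the authors turn a hand shape into a one-dimensional signal is not stated in the abstract and is not attempted here. The function name is hypothetical.

```python
def lpc_to_cepstrum(a, n_ceps):
    """Cepstral coefficients c_1..c_n_ceps from LPC coefficients a_1..a_p.

    Assumes the all-pole model H(z) = 1 / (1 - sum_k a[k-1] * z**-k).
    The gain-dependent term c_0 is not computed.
    """
    p = len(a)
    c = [0.0] * (n_ceps + 1)  # c[0] is left unused
    for n in range(1, n_ceps + 1):
        acc = a[n - 1] if n <= p else 0.0
        # Only terms whose LPC index n - k lies within 1..p contribute.
        for k in range(max(1, n - p), n):
            acc += (k / n) * c[k] * a[n - k - 1]
        c[n] = acc
    return c[1:]

# First-order check: for a single coefficient a_1 the cepstrum is a_1**n / n.
ceps = lpc_to_cepstrum([0.5], n_ceps=3)
```

The first-order case gives a convenient sanity check, because the logarithm of 1 / (1 - a z^-1) expands into the series with terms a^n / n.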
Towards automatic transcription of Syriac handwriting
Pub Date : 2003-09-17 DOI: 10.1109/ICIAP.2003.1234126
W. Clocksin, P. P. Fernando
We describe a method implemented for the recognition of Syriac handwriting from historical manuscripts. The Syriac language has been a neglected area for handwriting recognition research, yet is interesting because the preponderance of scribe-written manuscripts offers a challenging yet tractable medium for OCR research between the extremes of typewritten text and free handwriting. Like Arabic, Syriac is written in a cursive form from right-to-left, and letter shape depends on the position within the word. The method described does not need to find character strokes or contours. Both whole words and character shapes were used in recognition experiments. After segmentation using a novel probabilistic method, features of these shapes are found that tolerate variation in formation and image quality. Each shape is recognised individually using a discriminative support vector machine with 10-fold cross-validation. We describe experiments using a variety of segmentation methods and combinations of features on characters and words. Images from scribe-written historical manuscripts are used, and the recognition results are compared with those for images taken from clearer 19th century typeset documents. Recognition rates vary from 61-100%, depending on the algorithms used and the size and source of the data set.
Citations: 35
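The 10-fold cross-validation mentioned in the abstract above partitions the labelled shapes into ten folds and holds each fold out once for testing. A minimal sketch of the index bookkeeping follows. The function name, the seed, and the sample count are hypothetical, and the classifier itself is omitted.

```python
import numpy as np

def k_fold_indices(n_samples, k=10, seed=0):
    """Yield (train_idx, test_idx) pairs for shuffled k-fold validation."""
    rng = np.random.default_rng(seed)
    perm = rng.permutation(n_samples)
    folds = np.array_split(perm, k)  # fold sizes differ by at most one
    for i in range(k):
        test_idx = folds[i]
        train_idx = np.concatenate([folds[j] for j in range(k) if j != i])
        yield train_idx, test_idx

# Hypothetical data set of 25 samples.
splits = list(k_fold_indices(25, k=10))
```

Each sample appears in exactly one test fold, so a recognition rate averaged over the folds uses every labelled shape for testing once without ever testing on a shape the classifier was trained on.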
Camera calibration and 3D reconstruction using interval analysis
Pub Date : 2003-09-17 DOI: 10.1109/ICIAP.2003.1234078
B. Telle, M. Aldon, N. Ramdani
The paper deals with the problem of error estimation in 3D reconstruction and shows how interval analysis can be used for this purpose in 3D vision applications. Describing an image point by an interval assumes that its localization is unknown but bounded. We present a new method based on interval analysis tools to propagate this bounded uncertainty. This way of computing can produce guaranteed results, since a datum is represented not by its most probable value but by an interval that contains the true value. We validate our method by computing a guaranteed model for a projective camera, and we achieve a guaranteed 3D reconstruction.
Citations: 14
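The idea of propagating bounded uncertainty can be illustrated with a few lines of interval arithmetic applied to a pinhole projection u = f X / Z. This is a sketch under simplifying assumptions, not the authors' method: the `Interval` class, `project`, and all numbers are hypothetical, and ordinary floating-point rounding is used, whereas a rigorous implementation would round outward.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class Interval:
    lo: float
    hi: float

    def __mul__(self, other):
        # The product range is spanned by the four endpoint products.
        products = [self.lo * other.lo, self.lo * other.hi,
                    self.hi * other.lo, self.hi * other.hi]
        return Interval(min(products), max(products))

    def __truediv__(self, other):
        if other.lo <= 0.0 <= other.hi:
            raise ZeroDivisionError("divisor interval contains zero")
        return self * Interval(1.0 / other.hi, 1.0 / other.lo)

    def contains(self, value):
        return self.lo <= value <= self.hi

def project(f, X, Z):
    """Pinhole projection u = f * X / Z with interval-valued inputs."""
    return f * X / Z

# Hypothetical bounds: focal length exact, X and Z known to within 0.1.
f = Interval(500.0, 500.0)
X = Interval(0.9, 1.1)
Z = Interval(4.9, 5.1)
u = project(f, X, Z)
```

Any true values inside the input intervals yield an image coordinate inside `u`, which is the sense in which the result is guaranteed rather than merely likely.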
Visual self-localisation using automatic topology construction
Pub Date : 2003-09-17 DOI: 10.1109/ICIAP.2003.1234077
P. Baldassarri, P. Puliti, A. Montesanto, G. Tascini
The paper proposes a machine learning method for self-localising a mobile agent using the images supplied by a single omni-directional camera. The images acquired by the camera may be viewed as an implicit topological representation of the environment. The environment is unknown a priori, and the topological representation is derived by an unsupervised neural network architecture. The architecture includes a self-organising neural network, a growing neural gas, which is well known for its topology-preserving quality. Because the topology is not defined a priori, the network grows as it discovers the topology during learning. The implemented system is able to recognise the input frames correctly and to reconstruct a topological map of the environment. Each node of the neural network identifies a single zone of the environment, and the connections between the nodes correspond to the real spatial connections in the environment.
Citations: 1
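The core adaptation step of a growing neural gas can be sketched as follows: the unit nearest to an input moves towards it, its existing graph neighbours move slightly, and an edge links the two nearest units. This is a deliberately reduced sketch, not the authors' system. Edge ageing, error accumulation, and node insertion are omitted, and the function name, learning rates, and toy data are hypothetical.

```python
import numpy as np

def gng_adapt(units, edges, x, eps_winner=0.05, eps_neighbor=0.006):
    """One adaptation step; modifies units and edges in place."""
    x = np.asarray(x, dtype=float)
    dist = np.linalg.norm(units - x, axis=1)
    order = np.argsort(dist)
    s1, s2 = int(order[0]), int(order[1])
    # Move the winner towards the input.
    units[s1] += eps_winner * (x - units[s1])
    # Move the winner's current topological neighbours by a smaller amount.
    for edge in edges:
        if s1 in edge:
            (other,) = edge - {s1}
            units[other] += eps_neighbor * (x - units[other])
    # Connect the two nearest units (competitive Hebbian learning).
    edges.add(frozenset((s1, s2)))
    return s1, s2

# Hypothetical toy data: three 2D units, no edges yet, one input sample.
units = np.array([[0.0, 0.0], [1.0, 0.0], [5.0, 5.0]])
edges = set()
winner, runner_up = gng_adapt(units, edges, [0.2, 0.0])
```

Because edges are only ever created between units that are jointly closest to some input, the resulting graph tends to mirror which regions of the input space are adjacent, which matches the abstract's description of nodes as zones and edges as spatial connections.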