Proceedings 2000 International Conference on Image Processing (Cat. No.00CH37101): Latest Articles

An address generator, for an N-dimensional pseudo-Hilbert scan in a hyper-rectangular, parallelepiped region
Pub Date : 2000-12-01 DOI: 10.1109/ICIP.2000.901064
Y. Bandoh, S. Kamata
The Hilbert curve is a one-to-one mapping between N-dimensional (N-D) space and 1-D space. The Hilbert curve has been applied to image processing as a scanning technique (Hilbert scan). Applications to multi-dimensional image processing have also been studied. In these applications, we use the N-D Hilbert scan, which maps N-D data to 1-D data along the N-D Hilbert curve. However, the N-D Hilbert scan is limited to data in a hyper-cube region. In this paper, we present a novel algorithm for generating N-D pseudo-Hilbert curves in a hyper-rectangular parallelepiped region. Our algorithm is suitable for real-time processing and is easy to implement in hardware, since it is a simple, non-recursive computation using look-up tables.
Citations: 22
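For orientation, the ordinary 2-D Hilbert scan on a square of side 2^k can be computed without recursion using a few bit operations per level. The sketch below is the textbook index-to-coordinate conversion, not the authors' look-up-table address generator, and it does not cover rectangular regions or N > 2; the function name `hilbert_d2xy` is an illustrative choice.

```python
def hilbert_d2xy(n, d):
    """Map scan index d (0 <= d < n*n) to (x, y) on an n-by-n Hilbert curve.

    Assumes n is a power of two. This is the classic square-region curve,
    shown only to illustrate what a Hilbert scan address generator produces.
    """
    x = y = 0
    t = d
    s = 1
    while s < n:
        rx = 1 & (t // 2)      # which half along x at this level
        ry = 1 & (t ^ rx)      # which half along y at this level
        if ry == 0:
            if rx == 1:
                # Flip the sub-square before swapping axes.
                x = s - 1 - x
                y = s - 1 - y
            x, y = y, x        # rotate the sub-square
        x += s * rx
        y += s * ry
        t //= 4                # move on to the next pair of index bits
        s *= 2
    return x, y


if __name__ == "__main__":
    # Print the scan order for a 4x4 block.
    print([hilbert_d2xy(4, d) for d in range(16)])
```

Consecutive indices always map to neighbouring pixels, which is the locality property that makes the scan attractive for image processing.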
Region-based scanning for image compression
Pub Date : 2000-12-01 DOI: 10.1109/ICIP.2000.899858
S. Kamata, Yu-ichi Hayashi
G. Peano published a paper on space-filling curves in 1890. These curves have several applications in image processing, computer graphics, and related areas. This paper presents a region-based scanning technique for image compression, which we call an adaptive space filling scan (ASFS). To generate the ASFS, we make use of a minimum spanning tree technique. Several image compression experiments confirm that, in comparison to JPEG, images of acceptable quality can be obtained at the same bit-rates.
Citations: 11
Shape approximation through recursive scalable layer generation
Pub Date : 2000-12-01 DOI: 10.1109/ICIP.2000.899864
G. Melnikov, A. Katsaggelos
This paper presents an efficient recursive algorithm for generating operationally optimal intra-mode scalable layer decompositions of object contours. The problem is posed in terms of minimizing the shape distortion at full reconstruction subject to the total (for all scalable layers) bit budget constraint. Based on the chosen vertex-based representation, we solve the problem of determining the number and locations of approximating vertices for all scalable layers jointly and optimally. The number of scalable layers is not constrained but is, rather, a by-product of the proposed optimization. The algorithm employs two different coding strategies: one for the base layer and one for the enhancement layers. By carefully defining the scalable layer recursion and the base layer segment costs, the problem is solved by executing a directed acyclic graph (DAG) shortest path algorithm.
Citations: 4
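The abstract above ends with a DAG shortest path computation. As a generic illustration of that final step only, the sketch below relaxes edges in topological order. It assumes the nodes are already numbered so that every edge goes from a lower to a higher index (for example, candidate vertices ordered along a contour); the edge costs here are made-up numbers, whereas the paper derives its costs from its own rate and distortion definitions.

```python
import math


def dag_shortest_path(num_nodes, edges):
    """Shortest path from node 0 to node num_nodes - 1 in a DAG.

    edges maps u -> list of (v, cost) with u < v, so visiting nodes in
    index order is a valid topological order (an assumption of this sketch).
    Returns (total_cost, path); total_cost is inf if the target is unreachable.
    """
    dist = [math.inf] * num_nodes
    prev = [None] * num_nodes
    dist[0] = 0.0
    for u in range(num_nodes):
        if dist[u] == math.inf:
            continue  # node not reachable from the source
        for v, cost in edges.get(u, []):
            if dist[u] + cost < dist[v]:
                dist[v] = dist[u] + cost
                prev[v] = u
    # Walk the predecessor links back from the target.
    path = []
    node = num_nodes - 1
    while node is not None:
        path.append(node)
        node = prev[node]
    path.reverse()
    return dist[-1], path


if __name__ == "__main__":
    # Hypothetical segment costs between four candidate vertices.
    example_edges = {0: [(1, 1.0), (2, 4.0)], 1: [(2, 2.0), (3, 6.0)], 2: [(3, 1.0)]}
    print(dag_shortest_path(4, example_edges))
```

Because each edge is relaxed exactly once, the running time is linear in the number of nodes plus edges.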
A new resolution progressive coding scheme using a sorting algorithm
Pub Date : 2000-12-01 DOI: 10.1109/ICIP.2000.899329
Tomoko Takahara, M. Okuda, M. Ikehara, Shinichi Takahashi
It has been proposed that resolution progressiveness is one of the most important techniques for image compression. In this paper, we offer a new resolution progressive coding method. The main point stresses how to exploit self-similarity across different scales of wavelet transform by sorting the coefficients. Wavelet transformed images inherently have correlation between subbands at different levels in a spatial decomposition hierarchy, and thus exploiting the correlation of the hierarchy helps to compress the amount of information of images. A number of methods with SNR progressive coding have been presented so far; however, this proposed method concerns resolution progressiveness as well as SNR progressiveness, which can be only applied in each subband, and still be effective and computationally very simple for image compression.
Citations: 0
A scheme for joint watermarking and compression of video
Pub Date : 2000-12-01 DOI: 10.1109/ICIP.2000.899232
R. Dugad, N. Ahuja
We present a scheme for jointly watermarking and compressing digital video. The amount of watermark added is adapted to the expected degradation of the watermark due to compression. This results in a more robust watermark. This is achieved without any appreciable decrease in the quality of the decoded video compared to the case when the watermark is not adaptive. Results are presented for the flower garden sequence.
Citations: 10
Video compression using integer DCT
Pub Date : 2000-12-01 DOI: 10.1109/ICIP.2000.899842
Ying-Jui Chen, S. Oraintara, Truong Q. Nguyen
This paper describes the implementation of the integer discrete cosine transform (IntDCT) using the Walsh-Hadamard transform and the lifting scheme. The implementation consists of shifts and adds, and all internal nodes have finite precision. A general-purpose 8-pt IntDCT scheme with a complexity of 45 adds and 18 shifts is proposed, which gives performance comparable to the floating-point DCT (FloatDCT). For this particular scheme with 8-bit input, perfect reconstruction (PR) is preserved even when all the internal nodes are limited to 16-bit words, making Pentium MMX optimization possible. The proposed IntDCT has been incorporated into the H.263+ coder, and the resulting system performs as well as the original. Further extension to the MPEG coder is straightforward. The proposed IntDCT is reversible, has low power consumption, and is very suitable for source coding, communication, etc. in a mobile environment.
Citations: 70
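The reversibility claimed for the lifting-based IntDCT can be illustrated on its basic building block, a plane rotation. A rotation factors into three shear (lifting) steps, and rounding inside each step does not break invertibility because the inverse subtracts exactly the same rounded quantity. The sketch below is a generic lifting rotation, not the paper's 8-pt structure; the 8-bit coefficient precision `SHIFT` and all function names are assumptions chosen for illustration.

```python
import math

SHIFT = 8  # hypothetical coefficient precision: coefficients are k / 2**SHIFT


def lifting_coefficients(theta):
    """Quantize the lifting coefficients p = (cos - 1) / sin and u = sin."""
    p = round((math.cos(theta) - 1.0) / math.sin(theta) * (1 << SHIFT))
    u = round(math.sin(theta) * (1 << SHIFT))
    return p, u


def forward_rotation(x0, x1, p, u):
    """Approximate rotation by theta using three integer lifting steps."""
    x0 += (p * x1) >> SHIFT   # in hardware the multiply becomes shifts and adds
    x1 += (u * x0) >> SHIFT
    x0 += (p * x1) >> SHIFT
    return x0, x1


def inverse_rotation(y0, y1, p, u):
    """Undo forward_rotation exactly by reversing the steps."""
    y0 -= (p * y1) >> SHIFT
    y1 -= (u * y0) >> SHIFT
    y0 -= (p * y1) >> SHIFT
    return y0, y1


if __name__ == "__main__":
    p, u = lifting_coefficients(math.pi / 8)
    y = forward_rotation(100, 0, p, u)
    print(y, inverse_rotation(*y, p, u))
```

Each step changes one value using only the other, so the inverse can always recompute the same integer and subtract it, whatever rounding rule is used.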
A coarse-to-fine approach for the generation and tracking of mesh objects from a natural image sequence
Pub Date : 2000-12-01 DOI: 10.1109/ICIP.2000.899855
Jui-Tai Ko, Sheng-Jyh Wang
A new scheme, which automatically generates a hierarchical mesh from a real image, is proposed. First, three distinct meshes are generated according to a three-layer image pyramid. By eliminating the duplicated mesh nodes between these three meshes, a hierarchical mesh is formed. Because each hierarchy represents the image in different resolutions, this hierarchical structure can achieve progressive coding. Furthermore, to generate a dynamic mesh for an image sequence, this hierarchical structure can help to reduce the computation complexity through the tracking of mesh nodes in the sequence. A method is also proposed to predict the moving direction of mesh nodes and to facilitate the motion estimation.
Citations: 0
A generalized 3D shape sampling method and file format for storage or indexing
Pub Date : 2000-12-01 DOI: 10.1109/ICIP.2000.899825
Jiann-Jone Chen, C. Chiang, David W. Lin
Several ways of 2D shape (or contour) description in terms of Fourier and wavelet transform coefficients have been proposed. They provide data compression capability, and some of the descriptors are invariant under scaling, rotation, and choice of the starting point for contour tracing. Several methods for 3D shape description also exist. However, they lack either the simplicity, the generality, or the data compression ability comparable to the 2D methods. We propose a generalized sampling method for efficient description of free-form 3D shape surfaces. The key idea is to warp a 3D spherical coordinate system onto the 3D surface, so that the spatial coordinates of each point on the surface may be represented parametrically as {x(α, β), y(α, β), z(α, β)}, where 0 ≤ α ≤ 2π and 0 ≤ β ≤ π, with α and β giving normalized arc lengths on the 3D surface. Fourier, wavelet, or other 2D transforms can then be applied to the three coordinate functions for purposes of data compression, database storage, or indexing. Simulations show that the wavelet-based method yields efficient 3D shape compression based on this generalized sampling approach.
Citations: 2
Recovering frontal-pose image from a single profile image
Pub Date : 2000-12-01 DOI: 10.1109/ICIP.2000.899288
Jianbo Ma, N. Ahuja, C. Neti, A. Senior
In appearance-based face recognition, lip reading, etc., eigenfaces and eigenlips are used for recognition. Pose changes of the human head in a video sequence often cause errors in the eigenspace comparison stage, because the frontal-pose assumption is violated. We propose a new method to compensate for pose changes by exploiting the general symmetry of the human face. From the imaging geometry we show that a frontal pose can be recovered from only one profile view. The resulting pose compensation method has the following advantages: (1) it only requires one profile image; (2) it does not need any 3D model; (3) it does not need accurate feature detection. Experimental results in the context of lip images are given to show the effectiveness of our method.
Citations: 4
Fusing audio and visual features of speech
Pub Date : 2000-12-01 DOI: 10.1109/ICIP.2000.899333
Hao Pan, Zhi-Pei Liang, Thomas S. Huang
In this paper, the audio and visual features of speech are integrated using a novel fused-HMM. We assume that the two sets of features may have different data rates and durations. Hidden Markov models (HMMs) are first used to model them separately, and then a general Bayesian fusion method, which is optimal in the maximum entropy sense, is employed to fuse them together. In particular, an efficient learning algorithm is introduced. Instead of maximizing the joint likelihood of the fused-HMM, the learning algorithm maximizes the two HMMs separately, and then fuses the HMMs together. In addition, an inference algorithm is proposed. We have tested the proposed method in person verification experiments. Results show that the proposed method significantly reduces the recognition error rates compared to the unimodal HMMs and the loosely-coupled fusion model.
Citations: 8