Pub Date : 2007-11-12DOI: 10.1109/ICIP.2007.4378886
J. Gai, Yong Li, R. Stevenson
Augmenting electro-optical (EO) based target tracking systems with infrared (IR) modality has been shown to be effective in increasing the accuracy rate of the tracking system. A key issue in designing such a multimodal tracking system is how to combine information observed from different sensor types in a systematic way to obtain desirable performance. In this paper, we present an investigation into integrating EO and IR sensors within hidden Markov model (HMM) based frameworks. We propose to use a coupled hidden Markov model (CHMM) to improve upon the existing fusion schemes. Another contribution is that we propose to use a robust t-distribution based subspace representation in the CHMM to model appearance changes of the target. Numerical experiments demonstrate that the proposed CHMM tracking system has improved performance over other integration schemes for situations where the target object is corrupted by noise or occlusion.
{"title":"Coupled Hidden Markov Models for Robust EO/IR Target Tracking","authors":"J. Gai, Yong Li, R. Stevenson","doi":"10.1109/ICIP.2007.4378886","DOIUrl":"https://doi.org/10.1109/ICIP.2007.4378886","url":null,"abstract":"Augmenting electro-optical (EO) based target tracking systems with infrared (IR) modality has been shown to be effective in increasing the accuracy rate of the tracking system. A key issue in designing such a multimodal tracking system is how to combine information observed from different sensor types in a systematic way to obtain desirable performance. In this paper, we present an investigation into integrating EO and IR sensors within hidden Markov model (HMM) based frameworks. We propose to use a coupled hidden Markov model (CHMM) to improve upon the existing fusion schemes. Another contribution is that we propose to use a robust t-distribution based subspace representation in the CHMM to model appearance changes of the target. Numerical experiments demonstrate that the proposed CHMM tracking system has improved performance over other integration schemes for situations where the target object is corrupted by noise or occlusion.","PeriodicalId":131177,"journal":{"name":"2007 IEEE International Conference on Image Processing","volume":"69 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115873106","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2007-11-12DOI: 10.1109/ICIP.2007.4379104
M. Brucher, C. Heinrich, F. Heitz, J. Armspach
This communication deals with data reduction and regression. A set of high dimensional data (e.g., images) usually has only a few degrees of freedom with corresponding variables that are used to parameterize the original data set. Data understanding, visualization and classification are the usual goals. The proposed method reduces data considering a unique set of low-dimensional variables and a user-defined cost function in the multidimensional scaling framework. Mapping of the reduced variables to the original data is also addressed, which is another contribution of this work. Typical data reduction methods, such as Isomap or LLE, do not deal with this important aspect of manifold learning. We also tackle the inversion of the mapping, which makes it possible to project high-dimensional noisy points onto the manifold, like PCA with linear models. We present an application of our approach to several standard data sets such as the SwissRoll.
{"title":"Unsupervised Nonlinear Manifold Learning","authors":"M. Brucher, C. Heinrich, F. Heitz, J. Armspach","doi":"10.1109/ICIP.2007.4379104","DOIUrl":"https://doi.org/10.1109/ICIP.2007.4379104","url":null,"abstract":"This communication deals with data reduction and regression. A set of high dimensional data (e.g., images) usually has only a few degrees of freedom with corresponding variables that are used to parameterize the original data set. Data understanding, visualization and classification are the usual goals. The proposed method reduces data considering a unique set of low-dimensional variables and a user-defined cost function in the multidimensional scaling framework. Mapping of the reduced variables to the original data is also addressed, which is another contribution of this work. Typical data reduction methods, such as Isomap or LLE, do not deal with this important aspect of manifold learning. We also tackle the inversion of the mapping, which makes it possible to project high-dimensional noisy points onto the manifold, like PCA with linear models. We present an application of our approach to several standard data sets such as the SwissRoll.","PeriodicalId":131177,"journal":{"name":"2007 IEEE International Conference on Image Processing","volume":"217 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124290862","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2007-11-12DOI: 10.1109/ICIP.2007.4379106
P. Kisilev, D. Shaked, Suk Hwan Lim
In this work, we propose noise and signal activity estimation method that discriminates noise from signal based on local and global properties of the image data. The method yields pixel-wise maps of the noise variance and of the signal activity. Using these maps to guide imaging algorithms such as image enhancement and print defect detection improves their performance. The proposed method does not assume a white Gaussian noise model; it is very efficient computationally and, as such, is useful for a wide variety of applications.
{"title":"Noise and Signal Activity Maps for Better Imaging Algorithms","authors":"P. Kisilev, D. Shaked, Suk Hwan Lim","doi":"10.1109/ICIP.2007.4379106","DOIUrl":"https://doi.org/10.1109/ICIP.2007.4379106","url":null,"abstract":"In this work, we propose noise and signal activity estimation method that discriminates noise from signal based on local and global properties of the image data. The method yields pixel-wise maps of the noise variance and of the signal activity. Using these maps to guide imaging algorithms such as image enhancement and print defect detection improves their performance. The proposed method does not assume a white Gaussian noise model; it is very efficient computationally and, as such, is useful for a wide variety of applications.","PeriodicalId":131177,"journal":{"name":"2007 IEEE International Conference on Image Processing","volume":"67 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124341764","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2007-11-12DOI: 10.1109/ICIP.2007.4379291
S. Weng, Yao Zhao, Jeng-Shyang Pan, R. Ni
A novel reversible data hiding scheme based on an integer transform is presented in this paper. The invertible integer transform exploits the correlations among four pixels in a quad. Data embedding is carried out by expanding the differences between one pixel and each of its three neighboring pixels. However, the high hiding capacity can not be achieved only by difference expansion, so the companding technique is introduced into the embedding process so as to further increase hiding capacity. A series of experiments are conducted to verify the feasibility and effectiveness of the proposed approach.
{"title":"A Novel Reversible Watermarking Based on an Integer Transform","authors":"S. Weng, Yao Zhao, Jeng-Shyang Pan, R. Ni","doi":"10.1109/ICIP.2007.4379291","DOIUrl":"https://doi.org/10.1109/ICIP.2007.4379291","url":null,"abstract":"A novel reversible data hiding scheme based on an integer transform is presented in this paper. The invertible integer transform exploits the correlations among four pixels in a quad. Data embedding is carried out by expanding the differences between one pixel and each of its three neighboring pixels. However, the high hiding capacity can not be achieved only by difference expansion, so the companding technique is introduced into the embedding process so as to further increase hiding capacity. A series of experiments are conducted to verify the feasibility and effectiveness of the proposed approach.","PeriodicalId":131177,"journal":{"name":"2007 IEEE International Conference on Image Processing","volume":"7 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114959899","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2007-11-12DOI: 10.1109/ICIP.2007.4380004
Waqar Zia, K. Diepold, T. Stockhammer
Robust video conversational applications for hand-held devices come with numerous challenges, e.g. real-time processing, complexity constrained devices and small end-to-end delays, etc. Transmission losses of compressed video data result in spatio-temporal error propagation in the decoded video sequence. To ensure some QoS, the video codec has to be well tuned to combat the degradation resulting from losses. Several feedback based error mitigation technique are assessed in this work. The proposed error robustness technique based on reference picture selection (RPS) and error tracking enhances the overall performance of the target system by more than 4 dB for moderate radio link control (RLC) PDU loss rates of 1.5%. This enhancement is achieved without any additional computational complexity.
{"title":"Complexity Constrained Robust Video Transmission for Hand-Held Devices","authors":"Waqar Zia, K. Diepold, T. Stockhammer","doi":"10.1109/ICIP.2007.4380004","DOIUrl":"https://doi.org/10.1109/ICIP.2007.4380004","url":null,"abstract":"Robust video conversational applications for hand-held devices come with numerous challenges, e.g. real-time processing, complexity constrained devices and small end-to-end delays, etc. Transmission losses of compressed video data result in spatio-temporal error propagation in the decoded video sequence. To ensure some QoS, the video codec has to be well tuned to combat the degradation resulting from losses. Several feedback based error mitigation technique are assessed in this work. The proposed error robustness technique based on reference picture selection (RPS) and error tracking enhances the overall performance of the target system by more than 4 dB for moderate radio link control (RLC) PDU loss rates of 1.5%. This enhancement is achieved without any additional computational complexity.","PeriodicalId":131177,"journal":{"name":"2007 IEEE International Conference on Image Processing","volume":"19 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117149306","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2007-11-12DOI: 10.1109/ICIP.2007.4379287
Takashi Miyaki, T. Yamasaki, K. Aizawa
This paper describes an object tracking scheme employs sensor fusion approach which is composed of visual information and location information estimated from Wi-Fi signals. Location information is calculated by a set of received signal strength values of beacon packets from Wi-Fi access points (APs) around the targets. Different from the conventional approaches which use another kind of sensors, our approach can cover wider areas both indoor and outdoor with lower cost because of characteristics of Wi-Fi signals. Particle filter is applied to combine these two different kinds of sensory input to track the target continuously. Wi-Fi observation model is involved in a conventional visual particle filtering scheme in order to evaluate importance weights of each particle. By using multiple modality, robust tracking performance is achieved even if reliability of one sensory input declines. In this paper, we present experimental results applied to outdoor surveillance camera environment.
{"title":"Tracking Persons using Particle Filter Fusing Visual and Wi-Fi Localizations for Widely Distributed Camera","authors":"Takashi Miyaki, T. Yamasaki, K. Aizawa","doi":"10.1109/ICIP.2007.4379287","DOIUrl":"https://doi.org/10.1109/ICIP.2007.4379287","url":null,"abstract":"This paper describes an object tracking scheme employs sensor fusion approach which is composed of visual information and location information estimated from Wi-Fi signals. Location information is calculated by a set of received signal strength values of beacon packets from Wi-Fi access points (APs) around the targets. Different from the conventional approaches which use another kind of sensors, our approach can cover wider areas both indoor and outdoor with lower cost because of characteristics of Wi-Fi signals. Particle filter is applied to combine these two different kinds of sensory input to track the target continuously. Wi-Fi observation model is involved in a conventional visual particle filtering scheme in order to evaluate importance weights of each particle. By using multiple modality, robust tracking performance is achieved even if reliability of one sensory input declines. In this paper, we present experimental results applied to outdoor surveillance camera environment.","PeriodicalId":131177,"journal":{"name":"2007 IEEE International Conference on Image Processing","volume":"46 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116026042","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2007-11-12DOI: 10.1109/ICIP.2007.4379853
A. Sinha, Xiaolin Wu
We propose a new superresolution algorithm based on a fast motion estimation technique. Two stages of this algorithm, namely, motion estimation and high-resolution reconstruction, rely on an area-based interpolation scheme that involves intersecting two pixel grids in arbitrary orientation, displacement, and scaling. We develop a fast approximate solution of the above problem, whose exact solution is prohibitively expensive. Also, gradient descent algorithm is used for fast convergence of the motion estimation algorithm. Experimental results demonstrate the good performance of the proposed superresolution algorithm as well its robustness against noise.
{"title":"Fast Generalized Motion Estimation and Superresolution","authors":"A. Sinha, Xiaolin Wu","doi":"10.1109/ICIP.2007.4379853","DOIUrl":"https://doi.org/10.1109/ICIP.2007.4379853","url":null,"abstract":"We propose a new superresolution algorithm based on a fast motion estimation technique. Two stages of this algorithm, namely, motion estimation and high-resolution reconstruction, rely on an area-based interpolation scheme that involves intersecting two pixel grids in arbitrary orientation, displacement, and scaling. We develop a fast approximate solution of the above problem, whose exact solution is prohibitively expensive. Also, gradient descent algorithm is used for fast convergence of the motion estimation algorithm. Experimental results demonstrate the good performance of the proposed superresolution algorithm as well its robustness against noise.","PeriodicalId":131177,"journal":{"name":"2007 IEEE International Conference on Image Processing","volume":"4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116442625","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2007-11-12DOI: 10.1109/ICIP.2007.4379571
R. Chittineni, M. Su, O. Nalcioglu
MRI is the most accurate imaging modality to monitor response of breast cancer undergoing neoadjuvant chemotherapy, by comparing the tumor volume measured in follow up MRI, taken during the course of therapy, to its baseline value. Due to the deformable nature of the breast, its' shape in MR acquisitions taken in different studies varies significantly. If these images can be co-registered, the location of lesion in each study can be matched. Breast MR images collected often include large areas outside the breast, such as the thoracic region and surrounding air, which may pose a hindrance to registration algorithms. In this paper, we describe a segmentation algorithm to delineate the breast region from the chest by using the invariant, rigid structure such as the chest, as opposed to the use of varying breast outlines employed in currently available solutions. This ensures robustness and reproducibility of our algorithm.
{"title":"Breast Delineation using Active Contours to Facilitate Coregistration of Serial MRI Studies for Therapy Response Evaluation","authors":"R. Chittineni, M. Su, O. Nalcioglu","doi":"10.1109/ICIP.2007.4379571","DOIUrl":"https://doi.org/10.1109/ICIP.2007.4379571","url":null,"abstract":"MRI is the most accurate imaging modality to monitor response of breast cancer undergoing neoadjuvant chemotherapy, by comparing the tumor volume measured in follow up MRI, taken during the course of therapy, to its baseline value. Due to the deformable nature of the breast, its' shape in MR acquisitions taken in different studies varies significantly. If these images can be co-registered, the location of lesion in each study can be matched. Breast MR images collected often include large areas outside the breast, such as the thoracic region and surrounding air, which may pose a hindrance to registration algorithms. In this paper, we describe a segmentation algorithm to delineate the breast region from the chest by using the invariant, rigid structure such as the chest, as opposed to the use of varying breast outlines employed in currently available solutions. This ensures robustness and reproducibility of our algorithm.","PeriodicalId":131177,"journal":{"name":"2007 IEEE International Conference on Image Processing","volume":"17 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116443758","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2007-11-12DOI: 10.1109/ICIP.2007.4379138
K. Hara, R. Kurazume, Kohei Inoue, K. Urahama
The Chan-Vese level set algorithm has been successfully applied to segmentation of images on Cartesian coordinate meshes, including ordinary planar images. In this paper we present a Chan-Vese model for segmentation of images on polar coordinate meshes, such as topography and remote sensing images. The image segmentation is accomplished by formulating the associated evolution equation in the polar coordinate system and then numerically solving the partial differential equation on an overset grid system called the Yin-Yang grid, which is free from the problem of singularity at the poles. We include examples of segmentations of real earth data that demonstrate the performance of our method.
{"title":"Segmentation of Images on Polar Coordinate Meshes","authors":"K. Hara, R. Kurazume, Kohei Inoue, K. Urahama","doi":"10.1109/ICIP.2007.4379138","DOIUrl":"https://doi.org/10.1109/ICIP.2007.4379138","url":null,"abstract":"The Chan-Vese level set algorithm has been successfully applied to segmentation of images on Cartesian coordinate meshes, including ordinary planar images. In this paper we present a Chan-Vese model for segmentation of images on polar coordinate meshes, such as topography and remote sensing images. The image segmentation is accomplished by formulating the associated evolution equation in the polar coordinate system and then numerically solving the partial differential equation on an overset grid system called the Yin-Yang grid, which is free from the problem of singularity at the poles. We include examples of segmentations of real earth data that demonstrate the performance of our method.","PeriodicalId":131177,"journal":{"name":"2007 IEEE International Conference on Image Processing","volume":"106 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116460429","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2007-11-12DOI: 10.1109/ICIP.2007.4379811
T. Ell
The linear filtering of color images using hypercomplex convolution and Fourier transforms provides a holistic treatment of color by representing pixels as 3-space vector quantities within the quaternion algebra. But, this technique is limited to images with at most three channels of information, e.g., RGB images. Linear filtering of color images by representing color pixels as multi-vectors embedded in a geometric algebra is presented. This multi-vector representation has similar convolution and Fourier transforms as the quaternion based filters, but provides an avenue for multi-spectral images composed of more than three channels.
{"title":"Multi-Vector Color-Image Filters","authors":"T. Ell","doi":"10.1109/ICIP.2007.4379811","DOIUrl":"https://doi.org/10.1109/ICIP.2007.4379811","url":null,"abstract":"The linear filtering of color images using hypercomplex convolution and Fourier transforms provides a holistic treatment of color by representing pixels as 3-space vector quantities within the quaternion algebra. But, this technique is limited to images with at most three channels of information, e.g., RGB images. Linear filtering of color images by representing color pixels as multi-vectors embedded in a geometric algebra is presented. This multi-vector representation has similar convolution and Fourier transforms as the quaternion based filters, but provides an avenue for multi-spectral images composed of more than three channels.","PeriodicalId":131177,"journal":{"name":"2007 IEEE International Conference on Image Processing","volume":"301 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123464712","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}