Pub Date: 2002-11-07 DOI: 10.1109/ICDSP.2002.1028153
P. Campisi, A. Neri, L. Sorgi
Due to the enormous amount of information contained in multimedia databases, automatic tools for content-based analysis, browsing, and retrieval are of paramount importance. We present an algorithm tailored to the detection of editing effects such as dissolves and fades, which are widely used in television and movie production. The algorithm is correlation-based and computationally inexpensive yet effective. The experimental results highlight the effectiveness of the proposed method.
Title: Automatic dissolve and fade detection for video sequences
Published in: 2002 14th International Conference on Digital Signal Processing Proceedings. DSP 2002 (Cat. No.02TH8628)
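The abstract does not spell out the algorithm, so the following is only a minimal sketch of why a correlation measure can expose a dissolve. It assumes a dissolve is a linear cross-fade between two shots and tracks the Pearson correlation of each frame with the first frame of the window. The function names (`pearson`, `synth_dissolve`, `correlation_profile`) and the synthetic random "shots" are hypothetical and are not taken from the paper.

```python
import math
import random


def pearson(u, v):
    """Pearson correlation coefficient between two equal-length pixel lists."""
    n = len(u)
    mu, mv = sum(u) / n, sum(v) / n
    cov = sum((a - mu) * (b - mv) for a, b in zip(u, v))
    var_u = sum((a - mu) ** 2 for a in u)
    var_v = sum((b - mv) ** 2 for b in v)
    if var_u == 0 or var_v == 0:
        return 0.0  # a flat frame (e.g. fully black) carries no structure
    return cov / math.sqrt(var_u * var_v)


def synth_dissolve(shot_a, shot_b, steps):
    """Assumed model: linear cross-fade from shot_a to shot_b over `steps` frames."""
    frames = []
    for k in range(steps):
        alpha = k / (steps - 1)
        frames.append([(1 - alpha) * a + alpha * b for a, b in zip(shot_a, shot_b)])
    return frames


def correlation_profile(frames):
    """Correlation of every frame with the first frame of the window."""
    return [pearson(frames[0], f) for f in frames]


# Two unrelated synthetic "shots", flattened to 400 pixels each.
rng = random.Random(0)
shot_a = [rng.random() for _ in range(400)]
shot_b = [rng.random() for _ in range(400)]
profile = correlation_profile(synth_dissolve(shot_a, shot_b, 9))
```

Under this model the profile starts at 1 and falls steadily towards the (small) correlation between the two shots, whereas an abrupt cut would show up as a single-frame drop. A practical detector would still need thresholds and a way to handle motion, neither of which is attempted here.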
Pub Date: 2002-11-07 DOI: 10.1109/ICDSP.2002.1028142
A. López, R. Molina, A. Katsaggelos
In this work we develop a Bayesian reconstruction method for SPECT (single photon emission computed tomography) images that uses generalized Gaussian Markov random field (GGMRF) distributions as the prior and estimates the scale hyperparameter by evidence analysis. Preconditioning methods are used to estimate this hyperparameter, and the approximations used are compared on synthetic images.
Title: Scale hyperparameter estimation for GGMRF prior models with application to SPECT images
Pub Date: 2002-11-07 DOI: 10.1109/ICDSP.2002.1028218
E. Jung, A. Schwarzbacher, R. Lawlor
Traditionally, interest in voice gender conversion was more theoretical than founded in real-life applications. However, with the increase in mobile communication and the resulting limitation in transmission bandwidth, new approaches to minimising data rates have to be developed. Here voice gender normalisation (VGN) presents a novel method of achieving higher compression rates: the VGN algorithm removes all gender-specific components of a speech signal, leaving only the information content to be transmitted. A second application for VGN is in the field of speech-controlled systems, where current speech recognition algorithms have to deal with the voice characteristics of a speaker as well as the information content. Here again VGN can remove the speaker's voice characteristics, leaving only the pure information. Such a system would therefore be capable of achieving much higher recognition rates while being independent of the speaker. This paper presents the theory of a gender removal system based on VGN and outlines an efficient real-time hardware implementation for use in portable communications equipment.
Title: Implementation of real-time AMDF pitch-detection for voice gender normalisation
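The title names AMDF (average magnitude difference function) pitch detection, but the abstract gives no details of the hardware design. As a rough software illustration only, the textbook AMDF can be sketched as below: the pitch period is taken as the lag at which the mean absolute difference between the signal and its delayed copy is smallest. The function names, the search range (`f_min`, `f_max`) and the test tone are assumptions made for this example.

```python
import math


def amdf(signal, max_lag):
    """Average magnitude difference function for lags 0..max_lag."""
    n = len(signal) - max_lag  # same number of terms for every lag
    return [
        sum(abs(signal[i] - signal[i + lag]) for i in range(n)) / n
        for lag in range(max_lag + 1)
    ]


def estimate_pitch(signal, sample_rate, f_min=125.0, f_max=400.0):
    """Pick the lag with the deepest AMDF valley inside [f_min, f_max]."""
    lag_min = int(sample_rate / f_max)
    lag_max = int(sample_rate / f_min)
    d = amdf(signal, lag_max)
    best_lag = min(range(lag_min, lag_max + 1), key=lambda lag: d[lag])
    return sample_rate / best_lag


# Synthetic voiced-like tone: 200 Hz fundamental plus its second harmonic,
# sampled at 8 kHz, so one period is exactly 40 samples.
sample_rate = 8000
f0 = 200.0
tone = [
    math.sin(2 * math.pi * f0 * t / sample_rate)
    + 0.5 * math.sin(2 * math.pi * 2 * f0 * t / sample_rate)
    for t in range(800)
]
pitch = estimate_pitch(tone, sample_rate)
```

The search range is deliberately narrower than one octave below the expected pitch, because the AMDF is also near zero at integer multiples of the true period. A real-time implementation would need some rule for choosing among those valleys; none is shown here.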
Pub Date: 2002-11-07 DOI: 10.1109/ICDSP.2002.1028212
S. Billings, G. Newsam
Sharpening and edge-enhancement filters are often applied to geophysical data that were collected with an uneven sample spacing and varying sample density. In this paper we propose an alternative methodology to the usual gridding/digital filtering processing pathway. We first fit a continuous global surface (CGS) to the data and then implicitly apply Fourier domain filtering to the entire surface. The CGS is constructed to optimize some property of the surface (e.g. smoothness). We find that the best approach is to optimize the properties of the filtered surface rather than the surface that fits the data. Otherwise certain filters cause the transformed surface to have singularities at the data points. We demonstrate the viability of the methodology in the computation of the second vertical derivative of a gravity survey.
Title: Fourier filtering of continuous global surfaces
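As background for the second-vertical-derivative example, the standard Fourier-domain form of that filter can be written out as follows. This is the usual potential-field result, assuming the measured field g is harmonic (satisfies Laplace's equation) above the sources. The transform convention is chosen for this sketch and is not necessarily the paper's.

```latex
% Assumption: g(x, y, z) is harmonic in the source-free region above the ground.
\nabla^2 g = 0
\;\Longrightarrow\;
\frac{\partial^2 g}{\partial z^2}
  = -\left(\frac{\partial^2 g}{\partial x^2} + \frac{\partial^2 g}{\partial y^2}\right)

% Horizontal Fourier transform convention (an assumption of this sketch):
\hat{g}(k_x, k_y) = \iint g(x, y)\, e^{-i (k_x x + k_y y)}\, dx\, dy

% Each horizontal derivative contributes a factor (i k)^2 = -k^2, so
\widehat{\frac{\partial^2 g}{\partial z^2}}(k_x, k_y)
  = \left(k_x^2 + k_y^2\right) \hat{g}(k_x, k_y)
```

The filter therefore multiplies the spectrum by the squared wavenumber, which strongly amplifies short-wavelength content. This is consistent with the abstract grouping it with sharpening and edge-enhancement filters.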
Pub Date: 2002-11-07 DOI: 10.1109/ICDSP.2002.1028143
C. Loizou, C. Demetriou, C. Pattichis, R. Istepanian, M. Pantziaris, A. Nicolaides
The objective of this work was to develop six speckle-reduction filtering techniques and evaluate them, together with texture analysis, in the assessment of 240 ultrasound images of the carotid artery. The de-speckle filters are based on anisotropic diffusion, local statistics with higher moments, and geometric filtering. Results showed some improvement in class separation (between symptomatic and asymptomatic plaques) after de-speckle filtering.
Title: Speckle reduction in ultrasound images of atherosclerotic carotid plaque
Pub Date: 2002-11-07 DOI: 10.1109/ICDSP.2002.1028158
Anssi Klapuri, J. Astola
An algorithm is proposed which calculates a computationally efficient approximation of a certain physiologically-motivated representation for sound, called the summary autocorrelation function. This representation has been found very useful in several tasks, such as sound separation, multiple period estimation, and computational auditory scene analysis. However, it has been computationally too complex for most practical applications. The relatively fast algorithm described here yields only an approximation of the summary autocorrelation function, but the achieved precision is likely to be good enough for most applications.
Title: Efficient calculation of a physiologically-motivated representation for sound
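For orientation, the direct (slow) form of the summary autocorrelation function, as it is commonly described, can be sketched as follows: each auditory channel is half-wave rectified, autocorrelated, and the per-channel autocorrelations are summed. This is not the fast approximation proposed in the paper. The auditory filterbank is omitted, and three pure harmonics stand in for its outputs, which is an assumption of this example.

```python
import math


def autocorrelation(x, max_lag):
    """Biased time-domain autocorrelation for lags 0..max_lag."""
    n = len(x)
    return [
        sum(x[i] * x[i + lag] for i in range(n - lag)) / n
        for lag in range(max_lag + 1)
    ]


def summary_autocorrelation(channels, max_lag):
    """Half-wave rectify each channel, autocorrelate, and sum across channels."""
    summary = [0.0] * (max_lag + 1)
    for channel in channels:
        rectified = [max(v, 0.0) for v in channel]
        for lag, value in enumerate(autocorrelation(rectified, max_lag)):
            summary[lag] += value
    return summary


# Hypothetical stand-in for filterbank outputs: the first three harmonics of
# a 100 Hz tone sampled at 8 kHz (common period: 80 samples).
sample_rate = 8000
channels = [
    [math.sin(2 * math.pi * 100 * h * t / sample_rate) for t in range(1600)]
    for h in (1, 2, 3)
]
sacf = summary_autocorrelation(channels, 120)
period = max(range(40, 121), key=lambda lag: sacf[lag])
```

Only at the common period do all channels peak together, which is why the summed function is useful for period estimation. The cost of the direct form grows with the number of channels times the number of lags times the frame length, which suggests why a faster approximation is attractive.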
Pub Date: 2002-11-07 DOI: 10.1109/ICDSP.2002.1028326
S. J. Park, D. Youn, S. H. Park
For hands-free mobile terminals, it is desirable to implement an acoustic interference cancellation system comprising echo cancellation and noise reduction. In the present paper, an approach to integrate acoustic echo cancellation and noise reduction in hands-free communication is described. For enhanced performance of the system, a method of canceling the residual acoustic interference after echo cancellation is treated. A parameter originating from the cross-correlation property is proposed to control the integrated system.
Title: Acoustic interference cancellation for hands-free terminals
Pub Date: 2002-11-07 DOI: 10.1109/ICDSP.2002.1028164
P. Maragos, T. Loupas, Vassilis Pitsikalis
This paper deals with improving the time-frequency resolution of Doppler ultrasound spectroscopy, applied to blood flow analysis, by developing robust nonstationary spectrum estimation techniques based on Gabor (1946) filterbanks and multiband AM-FM demodulation that uses an instantaneous energy separation algorithm.
Title: Improving Doppler ultrasound spectroscopy with multiband instantaneous energy separation
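The abstract does not say which energy separation variant is used. As one well-known member of that family, DESA-2 is sketched below for a single band: the Teager-Kaiser energy operator is applied to the signal and to its symmetric difference, and the two energies are combined into instantaneous amplitude and frequency estimates. The Gabor filterbank stage is omitted, and the function names and test signal are assumptions of this example.

```python
import math


def teager_energy(x, n):
    """Discrete Teager-Kaiser energy operator at sample n."""
    return x[n] ** 2 - x[n - 1] * x[n + 1]


def desa2(x, n):
    """DESA-2 estimate of (amplitude, frequency in rad/sample) at sample n.

    Needs samples n-2 .. n+2 and assumes 0 < frequency < pi/2.
    """
    # Symmetric difference y[m] = x[m+1] - x[m-1], evaluated at n-1, n, n+1.
    y = [x[m + 1] - x[m - 1] for m in (n - 1, n, n + 1)]
    psi_x = teager_energy(x, n)
    psi_y = y[1] ** 2 - y[0] * y[2]
    omega = 0.5 * math.acos(1.0 - psi_y / (2.0 * psi_x))
    amplitude = 2.0 * psi_x / math.sqrt(psi_y)
    return amplitude, omega


# Constant-amplitude, constant-frequency cosine: the estimates are exact here.
true_amp, true_omega = 1.5, 0.3
signal = [true_amp * math.cos(true_omega * n + 0.7) for n in range(32)]
amp, omega = desa2(signal, 10)
```

For a pure cosine the operator returns the squared amplitude times the squared sine of the frequency, so both quantities can be recovered from five samples. For real AM-FM signals the estimates are approximations that hold when the modulations vary slowly compared with the carrier.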
Pub Date: 2002-11-07 DOI: 10.1109/ICDSP.2002.1028202
Christos Papathanassiou, M. Petrou
We propose a modification of the cost function that is optimised by the independent component analysis (ICA) algorithm, so that prior knowledge about some statistical properties of one of the components is incorporated in the optimisation process and this component is identified first. Once this component is removed from the recordings, the remaining components may be identified in the usual way. We demonstrate our idea using simulated data, for which the true mixing coefficients are known. We show that incorporating the constraint broadens the region around the true mixing values from which the algorithm, if started there, converges to the desired solution.
Title: Incorporating prior knowledge in ICA
Pub Date: 2002-11-07 DOI: 10.1109/ICDSP.2002.1028154
Vladimir Zlokolica, W. Philips, D. Ville
Noise removal techniques, such as the K-nearest neighbour filter and the α-trimmed mean filter, are known to be very robust in still image noise removal, but they have not been exploited in video processing. We investigate their 3D-extension for use in video sequence noise removal. We also determine the optimal balance between temporal and spatial window size, the optimal values of the other parameters and finally we investigate the artefacts introduced by the filters. The results show that the new video K-nearest neighbour filter outperforms the video version of the α-trimmed mean and the state-of-the-art rational filter by G. Ramponi from both a PSNR and a visual quality point of view.
Title: Robust non-linear filtering for video processing
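The abstract gives neither the exact filter definitions nor the window sizes, so the following is a sketch of the textbook forms of the two filters applied to a spatio-temporal window. It assumes a video stored as nested lists `video[t][y][x]`, an α that denotes the fraction trimmed from each end of the sorted window, and a K-nearest neighbour filter that averages the K samples closest in value to the centre pixel. All names and parameters are hypothetical.

```python
def window_values(video, t, y, x, rt, rs):
    """Collect pixel values from a (2*rt+1) x (2*rs+1) x (2*rs+1) window,
    clipped at the sequence and frame borders."""
    values = []
    for tt in range(max(0, t - rt), min(len(video), t + rt + 1)):
        frame = video[tt]
        for yy in range(max(0, y - rs), min(len(frame), y + rs + 1)):
            row = frame[yy]
            for xx in range(max(0, x - rs), min(len(row), x + rs + 1)):
                values.append(row[xx])
    return values


def alpha_trimmed_mean(values, alpha):
    """Drop the floor(alpha * n) smallest and largest samples, average the rest.
    Assumes 0 <= alpha < 0.5."""
    ordered = sorted(values)
    trim = int(alpha * len(ordered))
    kept = ordered[trim:len(ordered) - trim]
    return sum(kept) / len(kept)


def knn_mean(values, centre, k):
    """Average the k window samples closest in value to the centre pixel."""
    nearest = sorted(values, key=lambda v: abs(v - centre))[:k]
    return sum(nearest) / len(nearest)


# Three flat 3x3 frames with a single impulse in the middle frame.
video = [[[10] * 3 for _ in range(3)] for _ in range(3)]
video[1][1][1] = 255
values = window_values(video, 1, 1, 1, rt=1, rs=1)
trimmed = alpha_trimmed_mean(values, 0.1)
edge_preserved = knn_mean([10, 12, 11, 200, 205, 198, 9], centre=11, k=3)
```

In this toy case the trimmed mean discards the impulse entirely, while the K-nearest neighbour average ignores samples from the far side of an edge. The trade-off between the temporal radius `rt` and the spatial radius `rs` is the kind of balance the abstract says the authors optimise. No attempt is made here to reproduce their settings.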