Pub Date : 1995-05-09DOI: 10.1109/ICASSP.1995.480560
T. Cooklev, A. Nishihara, Toshiyuki Yoshida, M. Sablatash
Design methods for regular multidimensional perfect reconstruction (PR) filter banks are described. A systematic method for two-channel and four-channel filter banks is presented. The main novelties are: (1) the filters have impulse response with square support, rather than diamond support, (2) regular designs that have rectangular support are also presented, which are highly efficient in practice, since expensive memory is saved; (3) in addition to new diamond filters for the two-channel case, hexagonally-symmetric filters are also derived; and (4) novel 3-D filter banks are also designed. In all cases the filters are linear phase, achieve arbitrarily high regularity and can be used to obtain biorthogonal wavelet bases. The filter banks can be implemented in a structurally perfect-reconstruction manner.
{"title":"Regular multidimensional linear phase FIR digital filter banks and wavelet bases","authors":"T. Cooklev, A. Nishihara, Toshiyuki Yoshida, M. Sablatash","doi":"10.1109/ICASSP.1995.480560","DOIUrl":"https://doi.org/10.1109/ICASSP.1995.480560","url":null,"abstract":"Design methods for regular multidimensional perfect reconstruction (PR) filter banks are described. A systematic method for two-channel and four-channel filter banks is presented. The main novelties are: (1) the filters have impulse response with square support, rather than diamond support, (2) regular designs that have rectangular support are also presented, which are highly efficient in practice, since expensive memory is saved; (3) in addition to new diamond filters for the two-channel case, hexagonally-symmetric filters are also derived; and (4) novel 3-D filter banks are also designed. In all cases the filters are linear phase, achieve arbitrarily high regularity and can be used to obtain biorthogonal wavelet bases. The filter banks can be implemented in a structurally perfect-reconstruction manner.","PeriodicalId":300119,"journal":{"name":"1995 International Conference on Acoustics, Speech, and Signal Processing","volume":"44 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1995-05-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128801167","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 1995-05-09DOI: 10.1109/ICASSP.1995.479384
L. Neumeyer, M. Weintraub
This paper compares three techniques for recognizing continuous speech in the presence of additive car noise: (1) transforming the noisy acoustic features using a mapping algorithm, (2) adaptation of the hidden Markov models (HMMs), and (3) combination of mapping and adaptation. To make the signal processing robust to additive noise, we apply a technique called probabilistic optimum filtering. We show that at low signal-to-noise ratio (SNR) levels, compensating in the feature and model domains yields similar performance. We also show that adapting the HMMs with the mapped features produces the best performance. The algorithms were implemented using SRI's DECIPHER speech recognition system and were tested on the 1994 ARPA-sponsored CSR evaluation test spoke 10.
{"title":"Robust speech recognition in noise using adaptation and mapping techniques","authors":"L. Neumeyer, M. Weintraub","doi":"10.1109/ICASSP.1995.479384","DOIUrl":"https://doi.org/10.1109/ICASSP.1995.479384","url":null,"abstract":"This paper compares three techniques for recognizing continuous speech in the presence of additive car noise: (1) transforming the noisy acoustic features using a mapping algorithm, (2) adaptation of the hidden Markov models (HMMs), and (3) combination of mapping and adaptation. To make the signal processing robust to additive noise, we apply a technique called probabilistic optimum filtering. We show that at low signal-to-noise ratio (SNR) levels, compensating in the feature and model domains yields similar performance. We also show that adapting the HMMs with the mapped features produces the best performance. The algorithms were implemented using SRI's DECIPHER speech recognition system and were tested on the 1994 ARPA-sponsored CSR evaluation test spoke 10.","PeriodicalId":300119,"journal":{"name":"1995 International Conference on Acoustics, Speech, and Signal Processing","volume":"56 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1995-05-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128679669","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 1995-05-09DOI: 10.1109/ICASSP.1995.479531
Tanja Schultz, I. Rogina
Several improvements of our speech-to-speech translation system JANUS on spontaneous human-to-human dialogs are presented. Common phenomena in spontaneous speech are described, followed by a classification of different types of noise. To handle the variety of spontaneous effects in human-to-human dialogs, special noise models are introduced representing both human and nonhuman noise, as well as word fragments. It is shown that both the acoustic and the language modeling of the noise increase the recognition performance significantly. In the experiments, a clustering of the noise classes is performed and the resulting cluster variants are compared, thus allowing one to determine the best tradeoff between the sensitivity and trainability of the models.
{"title":"Acoustic and language modeling of human and nonhuman noises for human-to-human spontaneous speech recognition","authors":"Tanja Schultz, I. Rogina","doi":"10.1109/ICASSP.1995.479531","DOIUrl":"https://doi.org/10.1109/ICASSP.1995.479531","url":null,"abstract":"Several improvements of our speech-to-speech translation system JANUS on spontaneous human-to-human dialogs are presented. Common phenomena in spontaneous speech are described, followed by a classification of different types of noise. To handle the variety of spontaneous effects in human-to-human dialogs, special noise models are introduced representing both human and nonhuman noise, as well as word fragments. It is shown that both the acoustic and the language modeling of the noise increase the recognition performance significantly. In the experiments, a clustering of the noise classes is performed and the resulting cluster variants are compared, thus allowing one to determine the best tradeoff between the sensitivity and trainability of the models.","PeriodicalId":300119,"journal":{"name":"1995 International Conference on Acoustics, Speech, and Signal Processing","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1995-05-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129163709","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 1995-05-09DOI: 10.1109/ICASSP.1995.479898
Jiang Hui, Chao-Huan Hou
An algorithm to process the synthetic aperture radar (SAR) data in wavenumber domain, is discussed. Generated from the seismic migration technique in the area of geophysics, this imaging method has been modified and developed in the area of SAR imaging. The basic idea of original seismic migration technique is analysed and modified to suit the feature of spaceborne SAR imaging. The result shows that the new algorithm will improve the image quality without additional computational time because the correction can be realized by phase shifting and fusion to pulse compression in the azimuth direction.
{"title":"Spaceborne SAR imaging in wavenumber domain","authors":"Jiang Hui, Chao-Huan Hou","doi":"10.1109/ICASSP.1995.479898","DOIUrl":"https://doi.org/10.1109/ICASSP.1995.479898","url":null,"abstract":"An algorithm to process the synthetic aperture radar (SAR) data in wavenumber domain, is discussed. Generated from the seismic migration technique in the area of geophysics, this imaging method has been modified and developed in the area of SAR imaging. The basic idea of original seismic migration technique is analysed and modified to suit the feature of spaceborne SAR imaging. The result shows that the new algorithm will improve the image quality without additional computational time because the correction can be realized by phase shifting and fusion to pulse compression in the azimuth direction.","PeriodicalId":300119,"journal":{"name":"1995 International Conference on Acoustics, Speech, and Signal Processing","volume":"16 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1995-05-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124596985","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 1995-05-09DOI: 10.1109/ICASSP.1995.480470
Paolo Gentili, F. Piazza, A. Uncini
This paper presents an efficient genetic approach to the design of digital finite impulse response (FIR) filters with coefficients constrained to be sums of power-of-two terms. To obtain such efficiency, i.e. a reduction of computational costs and an improvement in performance, a specific filter coefficient coding scheme has been studied and implemented. The resulting genetic algorithm (GA) is explained and compared experimentally with other state-of-the-art design techniques on several power-of-two FIR filter design cases. It can be seen that the proposed genetic technique is able to attain results as good as or better than the other methods. Moreover it can be easily implemented on parallel hardware.
{"title":"Efficient genetic algorithm design for power-of-two FIR filters","authors":"Paolo Gentili, F. Piazza, A. Uncini","doi":"10.1109/ICASSP.1995.480470","DOIUrl":"https://doi.org/10.1109/ICASSP.1995.480470","url":null,"abstract":"This paper presents an efficient genetic approach to the design of digital finite impulse response (FIR) filters with coefficients constrained to be sums of power-of-two terms. To obtain such efficiency, i.e. a reduction of computational costs and an improvement in performance, a specific filter coefficient coding scheme has been studied and implemented. The resulting genetic algorithm (GA) is explained and compared experimentally with other state-of-the-art design techniques on several power-of-two FIR filter design cases. It can be seen that the proposed genetic technique is able to attain results as good as or better than the other methods. Moreover it can be easily implemented on parallel hardware.","PeriodicalId":300119,"journal":{"name":"1995 International Conference on Acoustics, Speech, and Signal Processing","volume":"67 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1995-05-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124960264","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 1995-05-09DOI: 10.1109/ICASSP.1995.479980
M. Comer, E. Delp
In this paper we present a new algorithm for segmentation of noisy or textured images using a multiresolution Bayesian approach. Our algorithm is different from previously proposed multiresolution segmentation techniques in that we use a multiresolution Gaussian autoregressive (AR) model for the pyramid representation of the observed image. Our algorithm also approximates the "maximization of the posterior marginals" (MPM) estimate of the pixel class labels at each resolution, from coarsest to finest, unlike previously proposed techniques, which have been based on MAP estimation. Experimental results are presented to demonstrate the performance of the new algorithm.
{"title":"Multiresolution image segmentation","authors":"M. Comer, E. Delp","doi":"10.1109/ICASSP.1995.479980","DOIUrl":"https://doi.org/10.1109/ICASSP.1995.479980","url":null,"abstract":"In this paper we present a new algorithm for segmentation of noisy or textured images using a multiresolution Bayesian approach. Our algorithm is different from previously proposed multiresolution segmentation techniques in that we use a multiresolution Gaussian autoregressive (AR) model for the pyramid representation of the observed image. Our algorithm also approximates the \"maximization of the posterior marginals\" (MPM) estimate of the pixel class labels at each resolution, from coarsest to finest, unlike previously proposed techniques, which have been based on MAP estimation. Experimental results are presented to demonstrate the performance of the new algorithm.","PeriodicalId":300119,"journal":{"name":"1995 International Conference on Acoustics, Speech, and Signal Processing","volume":"4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1995-05-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129535950","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 1995-05-09DOI: 10.1109/ICASSP.1995.480412
J. LeBlanc, I. Fijalkow, Birkett Huber, C. Johnson
CMA fractionally spaced equalizers (CMA-FSEs) have been shown, under certain conditions, to be globally asymptotically convergent to a setting which provides perfect equalization. Such a result relies heavily on the assumptions of a white source and no channel noise (as is the case in much of the literature's analysis of CMA). Herein, we relax the white source assumption and examine the effect of source correlation on CMA. Analytic results are meshed with examples showing CMA-FSE source correlation effects. Techniques for finding all stationary and saddle points on the CMA-FSE error surface are presented using recent developments in the algebraic-geometry community.
{"title":"Fractionally spaced CMA equalizers under periodic and correlated inputs","authors":"J. LeBlanc, I. Fijalkow, Birkett Huber, C. Johnson","doi":"10.1109/ICASSP.1995.480412","DOIUrl":"https://doi.org/10.1109/ICASSP.1995.480412","url":null,"abstract":"CMA fractionally spaced equalizers (CMA-FSEs) have been shown, under certain conditions, to be globally asymptotically convergent to a setting which provides perfect equalization. Such a result relies heavily on the assumptions of a white source and no channel noise (as is the case in much of the literature's analysis of CMA). Herein, we relax the white source assumption and examine the effect of source correlation on CMA. Analytic results are meshed with examples showing CMA-FSE source correlation effects. Techniques for finding all stationary and saddle points on the CMA-FSE error surface are presented using recent developments in the algebraic-geometry community.","PeriodicalId":300119,"journal":{"name":"1995 International Conference on Acoustics, Speech, and Signal Processing","volume":"51 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1995-05-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126706605","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 1995-05-09DOI: 10.1109/ICASSP.1995.480488
T. Frison, H. Abarbanel, L. Tsimring
We extend methods for processing chaotic time series to two-dimensional images. The motivation is to develop new tools for understanding physical systems that can be imaged, such as the ocean surface. The novel issue addressed is the computation of the average mutual information for images. The average mutual information provides insights into energy transport and information loss rates in the underlying system. It also provides a crucial parameter needed for further processing with chaotic methods.
{"title":"Chaotic methods for image processing","authors":"T. Frison, H. Abarbanel, L. Tsimring","doi":"10.1109/ICASSP.1995.480488","DOIUrl":"https://doi.org/10.1109/ICASSP.1995.480488","url":null,"abstract":"We extend methods for processing chaotic time series to two-dimensional images. The motivation is to develop new tools for understanding physical systems that can be imaged, such as the ocean surface. The novel issue addressed is the computation of the average mutual information for images. The average mutual information provides insights into energy transport and information loss rates in the underlying system. It also provides a crucial parameter needed for further processing with chaotic methods.","PeriodicalId":300119,"journal":{"name":"1995 International Conference on Acoustics, Speech, and Signal Processing","volume":"6 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1995-05-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129073968","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 1995-05-09DOI: 10.1109/ICASSP.1995.479866
Jean-Marc Le Caillec, R. Garello, B. Chapron
Higher order moments have been, for the last decade, an important field of interest, but generally limited to one dimensional signal cases. We introduce in this paper a 2D bispectrum to detect nonlinearity in the SAR image mapping process, by using the bicoherency of a 2D signal which is theoretically flat over all frequencies, if the process is linear. Two bicoherency estimators are developed, the first one using a direct method for bispectral estimation and a periodogram for the spectral estimator, and the second one based on an indirect method and correlogram. In order to validate our nonlinearity detection method, we have tested it first on two simulated images, a linear one and a nonlinear one, and then on two SAR images, one seeming to be nonlinear. Conclusions are drawn by comparing the SAR image and the simulated image bicoherencies.
{"title":"Detection of nonlinearity in sea surface SAR imaging process using bispectrum method estimation","authors":"Jean-Marc Le Caillec, R. Garello, B. Chapron","doi":"10.1109/ICASSP.1995.479866","DOIUrl":"https://doi.org/10.1109/ICASSP.1995.479866","url":null,"abstract":"Higher order moments have been, for the last decade, an important field of interest, but generally limited to one dimensional signal cases. We introduce in this paper a 2D bispectrum to detect nonlinearity in the SAR image mapping process, by using the bicoherency of a 2D signal which is theoretically flat over all frequencies, if the process is linear. Two bicoherency estimators are developed, the first one using a direct method for bispectral estimation and a periodogram for the spectral estimator, and the second one based on an indirect method and correlogram. In order to validate our nonlinearity detection method, we have tested it first on two simulated images, a linear one and a nonlinear one, and then on two SAR images, one seeming to be nonlinear. Conclusions are drawn by comparing the SAR image and the simulated image bicoherencies.","PeriodicalId":300119,"journal":{"name":"1995 International Conference on Acoustics, Speech, and Signal Processing","volume":"42 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1995-05-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130619913","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 1995-05-09DOI: 10.1109/ICASSP.1995.479447
D. Etter, G. Orsak, Don H. Johnson
Competitive pressures in the global marketplace have forced companies to form teams from the best talent available irrespective of their geographical location. As it comes online, the National information Infrastructure will be increasingly used to support such interactions. American companies are far ahead of the universities in realizing systems to support such geographically distributed interactions. Universities must catch up by exposing their students to such design environments. In addition, universities should help define and evaluate network-based information dissemination systems by serving as testbeds for new interactive strategies. The paper presents initial results in a distance teaming experiment at the University of Colorado, George Mason University, and Rice University.
{"title":"A distance learning laboratory design experiment in undergraduate digital signal processing","authors":"D. Etter, G. Orsak, Don H. Johnson","doi":"10.1109/ICASSP.1995.479447","DOIUrl":"https://doi.org/10.1109/ICASSP.1995.479447","url":null,"abstract":"Competitive pressures in the global marketplace have forced companies to form teams from the best talent available irrespective of their geographical location. As it comes online, the National information Infrastructure will be increasingly used to support such interactions. American companies are far ahead of the universities in realizing systems to support such geographically distributed interactions. Universities must catch up by exposing their students to such design environments. In addition, universities should help define and evaluate network-based information dissemination systems by serving as testbeds for new interactive strategies. The paper presents initial results in a distance teaming experiment at the University of Colorado, George Mason University, and Rice University.","PeriodicalId":300119,"journal":{"name":"1995 International Conference on Acoustics, Speech, and Signal Processing","volume":"5 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1995-05-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130654488","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}