Pub Date : 1995-10-10DOI: 10.1109/ICASSP.1995.479742
S. Kadambe, J. Hieronymus
A task independent spoken language identification (LID) system which uses phonological and lexical models to distinguish languages is described. We demonstrate that the performance of an LID system which is based only on acoustic models can be improved by incorporating higher level linguistic knowledge in the form of trigram phonemotatics and lexical matching. We also present the performance of our LID system for four languages (English, German, Mandarin and Spanish).
{"title":"Language identification with phonological and lexical models","authors":"S. Kadambe, J. Hieronymus","doi":"10.1109/ICASSP.1995.479742","DOIUrl":"https://doi.org/10.1109/ICASSP.1995.479742","url":null,"abstract":"A task independent spoken language identification (LID) system which uses phonological and lexical models to distinguish languages is described. We demonstrate that the performance of an LID system which is based only on acoustic models can be improved by incorporating higher level linguistic knowledge in the form of trigram phonemotatics and lexical matching. We also present the performance of our LID system for four languages (English, German, Mandarin and Spanish).","PeriodicalId":300119,"journal":{"name":"1995 International Conference on Acoustics, Speech, and Signal Processing","volume":"5 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1995-10-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130117526","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 1995-05-09DOI: 10.1109/ICASSP.1995.478492
J. Sheinvald, M. Wax
A new technique for localisation of multiple signals is presented. Unlike existing techniques which require that the whole array be sampled simultaneously and consequently require many receivers, our technique allows us to sample arbitrary subarrays sequentially and consequently significantly reduces the required number of receivers. The estimation method we use in conjunction with this sampling scheme is based on approximating the corresponding maximum likelihood estimator by a computationally simpler generalized least squares (GLS) estimator that is proved to be both consistent and efficient.
{"title":"Localization of multiple signals using subarrays data","authors":"J. Sheinvald, M. Wax","doi":"10.1109/ICASSP.1995.478492","DOIUrl":"https://doi.org/10.1109/ICASSP.1995.478492","url":null,"abstract":"A new technique for localisation of multiple signals is presented. Unlike existing techniques which require that the whole array be sampled simultaneously and consequently require many receivers, our technique allows us to sample arbitrary subarrays sequentially and consequently significantly reduces the required number of receivers. The estimation method we use in conjunction with this sampling scheme is based on approximating the corresponding maximum likelihood estimator by a computationally simpler generalized least squares (GLS) estimator that is proved to be both consistent and efficient.","PeriodicalId":300119,"journal":{"name":"1995 International Conference on Acoustics, Speech, and Signal Processing","volume":"17 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1995-05-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115415730","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 1995-05-09DOI: 10.1109/ICASSP.1995.479674
Karen Ward, D. Novick
In this study we examined prosodic characteristics of a word used in several distinct senses in a task-oriented corpus of spontaneous speech. We compared the pitch characteristics of the word "right" used in three different senses: as an acknowledgment, as a direction, and as an affirmative answer to a question. Significant differences in intonation for different classes of usage were found, although the differences are not reliable enough to allow systems to use prosody alone to distinguish between usages. These results suggest that pitch change as reported by a pitch tracker could serve as a confirming cue when analyzing ambiguous speech recognizer output, or could serve as input to a probabilistic parser to aid in disambiguating senses of homonyms.
{"title":"Prosodic cues to word usage","authors":"Karen Ward, D. Novick","doi":"10.1109/ICASSP.1995.479674","DOIUrl":"https://doi.org/10.1109/ICASSP.1995.479674","url":null,"abstract":"In this study we examined prosodic characteristics of a word used in several distinct senses in a task-oriented corpus of spontaneous speech. We compared the pitch characteristics of the word \"right\" used in three different senses: as an acknowledgment, as a direction, and as an affirmative answer to a question. Significant differences in intonation for different classes of usage were found, although the differences are not reliable enough to allow systems to use prosody alone to distinguish between usages. These results suggest that pitch change as reported by a pitch tracker could serve as a confirming cue when analyzing ambiguous speech recognizer output, or could serve as input to a probabilistic parser to aid in disambiguating senses of homonyms.","PeriodicalId":300119,"journal":{"name":"1995 International Conference on Acoustics, Speech, and Signal Processing","volume":"14 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1995-05-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123047633","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 1995-05-09DOI: 10.1109/ICASSP.1995.479430
J. Molyneux
Presents a survey of theory and practice in tomographic imaging of subsurface features utilizing ground penetrating radar (GPR). The discussion includes: (i) A brief review of the equations governing radar scattering culminating in the Lippmann Schwinger (LS) equation. (ii) A summary of approximations often made in developing GPR imaging algorithms based on inversion of the LS equation: the scalar wave approximation, the two dimensional scattering model, and neglect of multiple scattering. (iii) A description of certain inversion algorithms used in GPR work. This includes a short discussion of ill-posed problems utilizing a one dimensional model equation, a review of difficulties characteristic of tomographic inversion, a summary of results based on linearized inversion methods such as Devaney's (1992) generalized projection slice theorem, and finally, a brief overview of the some numerical algorithms. (iv) A discussion of a field study using GPR to image shallow targets.
{"title":"Ground penetrating radar tomography","authors":"J. Molyneux","doi":"10.1109/ICASSP.1995.479430","DOIUrl":"https://doi.org/10.1109/ICASSP.1995.479430","url":null,"abstract":"Presents a survey of theory and practice in tomographic imaging of subsurface features utilizing ground penetrating radar (GPR). The discussion includes: (i) A brief review of the equations governing radar scattering culminating in the Lippmann Schwinger (LS) equation. (ii) A summary of approximations often made in developing GPR imaging algorithms based on inversion of the LS equation: the scalar wave approximation, the two dimensional scattering model, and neglect of multiple scattering. (iii) A description of certain inversion algorithms used in GPR work. This includes a short discussion of ill-posed problems utilizing a one dimensional model equation, a review of difficulties characteristic of tomographic inversion, a summary of results based on linearized inversion methods such as Devaney's (1992) generalized projection slice theorem, and finally, a brief overview of the some numerical algorithms. (iv) A discussion of a field study using GPR to image shallow targets.","PeriodicalId":300119,"journal":{"name":"1995 International Conference on Acoustics, Speech, and Signal Processing","volume":"133 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1995-05-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124276270","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 1995-05-09DOI: 10.1109/ICASSP.1995.480439
Wen-Hsien Fang, Yang-Lung Hwang
This paper is concerned with the development of new split algorithms for the design of linear least squares prediction filters with linear phase. The proposed fast algorithm, which fully expresses the inherent symmetry of the problem, requires lower computational complexity than other existing ones. Moreover, unlike other existing ones, the new recurrences involve only the order updates, which lend themselves to more efficient hardware implementations. For parallelization consideration, a new split Schur-like algorithm is also proposed to overcome the nonparallelizable inner product. Some numerical simulation results are provided to verify the proposed fast algorithms and highlight possible applications.
{"title":"New split algorithms for linear least squares prediction filters with linear phase","authors":"Wen-Hsien Fang, Yang-Lung Hwang","doi":"10.1109/ICASSP.1995.480439","DOIUrl":"https://doi.org/10.1109/ICASSP.1995.480439","url":null,"abstract":"This paper is concerned with the development of new split algorithms for the design of linear least squares prediction filters with linear phase. The proposed fast algorithm, which fully expresses the inherent symmetry of the problem, requires lower computational complexity than other existing ones. Moreover, unlike other existing ones, the new recurrences involve only the order updates, which lend themselves to more efficient hardware implementations. For parallelization consideration, a new split Schur-like algorithm is also proposed to overcome the nonparallelizable inner product. Some numerical simulation results are provided to verify the proposed fast algorithms and highlight possible applications.","PeriodicalId":300119,"journal":{"name":"1995 International Conference on Acoustics, Speech, and Signal Processing","volume":"63 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1995-05-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124280339","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 1995-05-09DOI: 10.1109/ICASSP.1995.480314
C. Heinrich, J. Bercher, G. L. Besnerais, G. Demoment
Discusses the restoration of spiky sequences distorted by a linear system and corrupted by additive noise. A (now) classical way of coping with this problem is to use a Bayesian approach with a Bernoulli-Gaussian (BG) prior model of the sequence. The authors refine this method using a Bernoulli-Gaussian plus Gaussian (BCG) prior model. This estimation method requires maximization of a posterior probability distribution, which cannot be performed optimally. Thus the authors propose a new non-Bayesian estimation scheme, derived from the Kullback-Leibler information or cross-entropy. This quite general method, called the maximum entropy on the mean method (MEMM) in Gamboa (1989) and le Besnerais (1995) is firmly based on convex analysis and yields a unique solution which can be efficiently calculated in practice, and which is, in this sense, truly optimal. As a conclusion, the authors present results obtained with both methods on a synthetic case
{"title":"Restoration of spiky signals: a new optimal estimate and a comparison","authors":"C. Heinrich, J. Bercher, G. L. Besnerais, G. Demoment","doi":"10.1109/ICASSP.1995.480314","DOIUrl":"https://doi.org/10.1109/ICASSP.1995.480314","url":null,"abstract":"Discusses the restoration of spiky sequences distorted by a linear\u0000system and corrupted by additive noise. A (now) classical way of coping\u0000with this problem is to use a Bayesian approach with a\u0000Bernoulli-Gaussian (BG) prior model of the sequence. The authors refine\u0000this method using a Bernoulli-Gaussian plus Gaussian (BCG) prior model.\u0000This estimation method requires maximization of a posterior probability\u0000distribution, which cannot be performed optimally. Thus the authors\u0000propose a new non-Bayesian estimation scheme, derived from the\u0000Kullback-Leibler information or cross-entropy. This quite general\u0000method, called the maximum entropy on the mean method (MEMM) in Gamboa\u0000(1989) and le Besnerais (1995) is firmly based on convex analysis and\u0000yields a unique solution which can be efficiently calculated in\u0000practice, and which is, in this sense, truly optimal. As a conclusion,\u0000the authors present results obtained with both methods on a synthetic\u0000case","PeriodicalId":300119,"journal":{"name":"1995 International Conference on Acoustics, Speech, and Signal Processing","volume":"21 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1995-05-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117050756","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 1995-05-09DOI: 10.1109/ICASSP.1995.480453
N. Al-Dhahir, J. Cioffi
A bandwidth-optimized and equalized multicarrier transceiver that achieves near-optimum performance at a practical complexity level is described. The equalizer used is a relatively short FIR filter whose taps and delay are set to optimize the performance of the multicarrier transceiver. Simulation results on a set of carrier-serving-area subscriber loops are also presented to demonstrate the separate and joint effects of bandwidth optimization and equalization on performance. Finally, the intriguing idea of using a pole-zero equalizer to achieve the high performance of infinite-complexity FIR equalizers at a much lower implementation cost is investigated.
{"title":"The combination of finite-length geometric equalization and bandwidth optimization for multicarrier transceivers","authors":"N. Al-Dhahir, J. Cioffi","doi":"10.1109/ICASSP.1995.480453","DOIUrl":"https://doi.org/10.1109/ICASSP.1995.480453","url":null,"abstract":"A bandwidth-optimized and equalized multicarrier transceiver that achieves near-optimum performance at a practical complexity level is described. The equalizer used is a relatively short FIR filter whose taps and delay are set to optimize the performance of the multicarrier transceiver. Simulation results on a set of carrier-serving-area subscriber loops are also presented to demonstrate the separate and joint effects of bandwidth optimization and equalization on performance. Finally, the intriguing idea of using a pole-zero equalizer to achieve the high performance of infinite-complexity FIR equalizers at a much lower implementation cost is investigated.","PeriodicalId":300119,"journal":{"name":"1995 International Conference on Acoustics, Speech, and Signal Processing","volume":"51 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1995-05-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117137995","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 1995-05-09DOI: 10.1109/ICASSP.1995.480421
S. D. Marco, P. Heller, J. Weiss
This paper develops a two-dimensional M-band translation-invariant wavelet transform (2-D MTI). Use of the MTI overcomes the shift-variance of the wavelet transform by applying a cost function over M shifts of the input signal. The new transform is proven to be translation-invariant. Use of M-band wavelets enables a finer frequency partitioning and greater energy compaction in the transform representation. Examples are presented which show that the translation-invariant transforms provide superior energy concentration compared to the corresponding nominal wavelet transforms. Examples are also presented comparing the energy concentration capability of M-band wavelets and the modulated lapped transform (MLT). We explored the MTI as a tool for image processing by using it to represent several different images.
{"title":"An M-band, 2-dimensional translation-invariant wavelet transform and applications","authors":"S. D. Marco, P. Heller, J. Weiss","doi":"10.1109/ICASSP.1995.480421","DOIUrl":"https://doi.org/10.1109/ICASSP.1995.480421","url":null,"abstract":"This paper develops a two-dimensional M-band translation-invariant wavelet transform (2-D MTI). Use of the MTI overcomes the shift-variance of the wavelet transform by applying a cost function over M shifts of the input signal. The new transform is proven to be translation-invariant. Use of M-band wavelets enables a finer frequency partitioning and greater energy compaction in the transform representation. Examples are presented which show that the translation-invariant transforms provide superior energy concentration compared to the corresponding nominal wavelet transforms. Examples are also presented comparing the energy concentration capability of M-band wavelets and the modulated lapped transform (MLT). We explored the MTI as a tool for image processing by using it to represent several different images.","PeriodicalId":300119,"journal":{"name":"1995 International Conference on Acoustics, Speech, and Signal Processing","volume":"13 8","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1995-05-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"120859752","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 1995-05-09DOI: 10.1109/ICASSP.1995.480581
P. Yip, Yifeng Zhou
A self-calibration DOA estimation algorithm for cyclostationary source signals is presented in which the effects of the sensor gain and phase shift uncertainty have been eliminated. The uniqueness conditions and the asymptotic consistency of the estimates are discussed. An alternating projecting optimization algorithm is provided which lessens the computational load involved in the nonlinear multivariate optimization problem. A numerical example is presented to show the effectiveness of the algorithm.
{"title":"A self-calibration algorithm for cyclostationary signals and its uniqueness analysis","authors":"P. Yip, Yifeng Zhou","doi":"10.1109/ICASSP.1995.480581","DOIUrl":"https://doi.org/10.1109/ICASSP.1995.480581","url":null,"abstract":"A self-calibration DOA estimation algorithm for cyclostationary source signals is presented in which the effects of the sensor gain and phase shift uncertainty have been eliminated. The uniqueness conditions and the asymptotic consistency of the estimates are discussed. An alternating projecting optimization algorithm is provided which lessens the computational load involved in the nonlinear multivariate optimization problem. A numerical example is presented to show the effectiveness of the algorithm.","PeriodicalId":300119,"journal":{"name":"1995 International Conference on Acoustics, Speech, and Signal Processing","volume":"16 5","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1995-05-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"120931921","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 1995-05-09DOI: 10.1109/ICASSP.1995.479773
G. Serebryakov, D. V. Sidorovich, C. Mecklenbräuker
We present an analysis of the interference coherence effects on the performance of an antenna array processor, which maximizes the output signal-to-noise ratio (SNR) for a coherent wave. The analytical expression for the optimal weight vector, directional pattern and output SNR is derived as a function of the coherence coefficient, interference power and parameters of the adaptive processor. We show how different models of interference coherence affect the array performance. Finally, to illustrate the theory, several numerical results are given.
{"title":"Coherence effects of the interference on the performance of optimum/adaptive beamformer","authors":"G. Serebryakov, D. V. Sidorovich, C. Mecklenbräuker","doi":"10.1109/ICASSP.1995.479773","DOIUrl":"https://doi.org/10.1109/ICASSP.1995.479773","url":null,"abstract":"We present an analysis of the interference coherence effects on the performance of an antenna array processor, which maximizes the output signal-to-noise ratio (SNR) for a coherent wave. The analytical expression for the optimal weight vector, directional pattern and output SNR is derived as a function of the coherence coefficient, interference power and parameters of the adaptive processor. We show how different models of interference coherence affect the array performance. Finally, to illustrate the theory, several numerical results are given.","PeriodicalId":300119,"journal":{"name":"1995 International Conference on Acoustics, Speech, and Signal Processing","volume":"63 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1995-05-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121079700","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}