首页 > 最新文献

2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100)最新文献

英文 中文
Approximation of the minmax interpolator 最小最大插值器的近似
L. Ying, D. Munson
We consider approximation of the optimal Yen algorithm (1956) for interpolation from a nonuniformly-spaced grid. Although the Yen interpolator is optimal in many senses, it suffers from severe numerical ill conditioning. We suggest a tradeoff between accuracy in computing the interpolator and accuracy in performing the interpolation. A new interpolator is proposed using Choi's expression (1998) for interpolation error. A strategy is suggested to control the error tradeoff. We also generalize the new interpolator to multiple dimensions. The newly designed sinc-kernel interpolator is compared with the Yen, Choi, and usual sinc interpolator with Jacobian weighting using simulations in both one and two dimensions. We show that the new interpolator is robust. It performs similarly to the Yen algorithm when noise is small and similarly to the Choi algorithm when noise is large.
我们考虑从非均匀间隔网格插值的最优Yen算法(1956)的近似。虽然Yen插值器在许多意义上是最优的,但它受到严重的数值病态的影响。我们建议在计算插补器的精度和执行插补的精度之间进行权衡。利用Choi的插值误差表达式(1998)提出了一种新的插值器。提出了一种控制误差权衡的策略。我们还将新的插值器推广到多个维度。通过一维和二维仿真,将新设计的自核插值器与Yen、Choi和常用的具有雅可比加权的自核插值器进行了比较。结果表明,该插值器具有良好的鲁棒性。当噪声较小时,它的性能与Yen算法相似,当噪声较大时,它与Choi算法相似。
{"title":"Approximation of the minmax interpolator","authors":"L. Ying, D. Munson","doi":"10.1109/ICASSP.2000.861962","DOIUrl":"https://doi.org/10.1109/ICASSP.2000.861962","url":null,"abstract":"We consider approximation of the optimal Yen algorithm (1956) for interpolation from a nonuniformly-spaced grid. Although the Yen interpolator is optimal in many senses, it suffers from severe numerical ill conditioning. We suggest a tradeoff between accuracy in computing the interpolator and accuracy in performing the interpolation. A new interpolator is proposed using Choi's expression (1998) for interpolation error. A strategy is suggested to control the error tradeoff. We also generalize the new interpolator to multiple dimensions. The newly designed sinc-kernel interpolator is compared with the Yen, Choi, and usual sinc interpolator with Jacobian weighting using simulations in both one and two dimensions. We show that the new interpolator is robust. It performs similarly to the Yen algorithm when noise is small and similarly to the Choi algorithm when noise is large.","PeriodicalId":164817,"journal":{"name":"2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100)","volume":"219 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2000-06-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114682659","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Retrieving sinusoids in colored Rayleigh noise by a cumulant-based FBLP approach 基于累积量的FBLP方法提取有色瑞利噪声中的正弦波
R. R. Ghauieb, Y. Horita, T. Murai
It is known that sinusoids generate lines in their spectra but false lines may appear due to additive colored noise. Employing the fourth-order cumulant of the noisy sinusoids for retrieving the sinusoids has become an approach to handling Gaussian noise either white or colored. But the assumption that the noise is Gaussian does not exist in some applications. This paper presents a new investigation of employing cumulants for retrieving sinusoids in colored non-Gaussian noise. It is concerned with estimating the frequencies and spectrum of the sinusoids and no attention is given to the amplitudes. It is shown theoretically and experimentally that employing cumulants is an attractive approach to handling colored Rayleigh noise. Results of a presented cumulant-based forward-backward linear prediction (CBFBLP) approach are compared with that of a correlation-based counterpart.
众所周知,正弦波在其光谱中产生线,但由于加性有色噪声可能出现假线。利用噪声正弦波的四阶累积量提取正弦波已成为处理高斯白噪声或有色噪声的一种方法。但在某些应用中,不存在高斯噪声的假设。本文提出了一种利用累积量在有色非高斯噪声中提取正弦波的新方法。它关注的是估计正弦波的频率和频谱,而没有注意到振幅。理论和实验表明,采用累积量是处理有色瑞利噪声的一种有效方法。提出的基于累积量的前向后线性预测方法(CBFBLP)的结果与基于相关性的对应方法进行了比较。
{"title":"Retrieving sinusoids in colored Rayleigh noise by a cumulant-based FBLP approach","authors":"R. R. Ghauieb, Y. Horita, T. Murai","doi":"10.1109/ICASSP.2000.859066","DOIUrl":"https://doi.org/10.1109/ICASSP.2000.859066","url":null,"abstract":"It is known that sinusoids generate lines in their spectra but false lines may appear due to additive colored noise. Employing the fourth-order cumulant of the noisy sinusoids for retrieving the sinusoids has become an approach to handling Gaussian noise either white or colored. But the assumption that the noise is Gaussian does not exist in some applications. This paper presents a new investigation of employing cumulants for retrieving sinusoids in colored non-Gaussian noise. It is concerned with estimating the frequencies and spectrum of the sinusoids and no attention is given to the amplitudes. It is shown theoretically and experimentally that employing cumulants is an attractive approach to handling colored Rayleigh noise. Results of a presented cumulant-based forward-backward linear prediction (CBFBLP) approach are compared with that of a correlation-based counterpart.","PeriodicalId":164817,"journal":{"name":"2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100)","volume":"12 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2000-06-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114747627","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Evaluation of the feedforward neural network covariance matrix error 前馈神经网络协方差矩阵误差的评价
S. Abid, F. Fnaiech, M. Najim
This paper presents a theoretical approach for the evaluation of a feedforward neural network covariance output error matrix. It is shown how the input signals errors and the different weights errors are linked together and spread over the neural network to form the output covariance matrix error which could may be used to determine an error bound. The formulas of the output covariance matrix error is derived arising the sensitivity of the additive weight perturbations or input perturbations. The analytical formulas is validated via simulation of a function approximation example showing that the theoretical result is in agreement with simulation result.
本文提出了一种评估前馈神经网络协方差输出误差矩阵的理论方法。说明了输入信号误差和不同权重误差是如何联系在一起,并在神经网络中扩散,形成输出协方差矩阵误差,该矩阵误差可用于确定误差界。导出了考虑加性权扰动和输入扰动敏感性的输出协方差矩阵误差计算公式。通过一个函数逼近算例的仿真验证了解析公式的正确性,理论结果与仿真结果吻合较好。
{"title":"Evaluation of the feedforward neural network covariance matrix error","authors":"S. Abid, F. Fnaiech, M. Najim","doi":"10.1109/ICASSP.2000.860151","DOIUrl":"https://doi.org/10.1109/ICASSP.2000.860151","url":null,"abstract":"This paper presents a theoretical approach for the evaluation of a feedforward neural network covariance output error matrix. It is shown how the input signals errors and the different weights errors are linked together and spread over the neural network to form the output covariance matrix error which could may be used to determine an error bound. The formulas of the output covariance matrix error is derived arising the sensitivity of the additive weight perturbations or input perturbations. The analytical formulas is validated via simulation of a function approximation example showing that the theoretical result is in agreement with simulation result.","PeriodicalId":164817,"journal":{"name":"2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100)","volume":"44 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2000-06-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114849798","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 13
Transmit adaptive array without user-specific pilot for 3G CDMA 无用户专用导频的3G CDMA传输自适应阵列
B. Raghothaman, R. T. Derryberry, G. Mandyam
The transmit adaptive array (TxAA) is one of the promising closed-loop downlink diversity schemes being considered for the third generation wireless systems based on code division multiple access (CDMA). The TxAA technique originally proposed, requires a user-specific auxiliary pilot for coherent demodulation. This affects the capacity of the system due to additional power being used by this pilot. The mobile receiver requires additional hardware correlators for demodulating the pilot. Since different channels in spread spectrum systems are distinguished by their spreading sequences, it also uses up an additional Walsh code per user. This paper proposes a decision directed method for demodulation which does not require a user-specific pilot. Our simulations demonstrate that the performance of the technique is comparable to that of the method based on a user specific pilot.
发射自适应阵列(TxAA)是基于码分多址(CDMA)的第三代无线系统中考虑的一种很有前途的闭环下行分集方案。最初提出的TxAA技术需要用户特定的辅助导频进行相干解调。由于该导频器使用了额外的功率,这将影响系统的容量。移动接收机需要额外的硬件相关器来解调导频。由于扩频系统中不同的信道是通过它们的扩频序列来区分的,它也会消耗每个用户额外的沃尔什码。本文提出了一种不需要用户专用导频的定向决策解调方法。我们的仿真表明,该技术的性能与基于用户特定导频的方法相当。
{"title":"Transmit adaptive array without user-specific pilot for 3G CDMA","authors":"B. Raghothaman, R. T. Derryberry, G. Mandyam","doi":"10.1109/ICASSP.2000.861168","DOIUrl":"https://doi.org/10.1109/ICASSP.2000.861168","url":null,"abstract":"The transmit adaptive array (TxAA) is one of the promising closed-loop downlink diversity schemes being considered for the third generation wireless systems based on code division multiple access (CDMA). The TxAA technique originally proposed, requires a user-specific auxiliary pilot for coherent demodulation. This affects the capacity of the system due to additional power being used by this pilot. The mobile receiver requires additional hardware correlators for demodulating the pilot. Since different channels in spread spectrum systems are distinguished by their spreading sequences, it also uses up an additional Walsh code per user. This paper proposes a decision directed method for demodulation which does not require a user-specific pilot. Our simulations demonstrate that the performance of the technique is comparable to that of the method based on a user specific pilot.","PeriodicalId":164817,"journal":{"name":"2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100)","volume":"349 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2000-06-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124302812","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 10
A local-field extrapolation algorithm for improving the spatial resolution in magnetic resonance dynamic imaging 一种提高磁共振动态成像空间分辨率的局部场外推算法
A. Fahmy, Bassel S. Tawfik, Y. Kadah
In magnetic resonance imaging (MRI), data are collected as spectrum samples. The acquisition time is proportional to the number of the spectrum lines. Therefore, only few lines of the data space may be required in order to track rapid changes of an object. In the current techniques, the missed lines may be zeroed or replaced by the corresponding lines in a reference image, which is acquired a priori for the same anatomical cross-section. However, this always comes at the expense of the spatial-resolution. In this study, we propose an extrapolation iterative algorithm to provide an improved estimate of the missed lines. Additional spatial and spatial-frequency constraints of the reference image are incorporated to enhance the convergence and obtain a better estimate of the initial conditions of the iterations. Results from simulated data verify the theory and indicate that the algorithm may provide better reconstruction in dynamic imaging studies.
在磁共振成像(MRI)中,数据以频谱样本的形式收集。采集时间与谱线数成正比。因此,为了跟踪对象的快速变化,可能只需要几行数据空间。在目前的技术中,缺失的线条可以被归零或替换为参考图像中的相应线条,这些线条是对相同的解剖截面先验获取的。然而,这总是以牺牲空间分辨率为代价。在这项研究中,我们提出了一个外推迭代算法来提供一个改进的估计缺失线。为了提高收敛性并更好地估计迭代的初始条件,引入了参考图像的附加空间和空间频率约束。模拟数据的结果验证了该理论,并表明该算法可以在动态成像研究中提供更好的重建。
{"title":"A local-field extrapolation algorithm for improving the spatial resolution in magnetic resonance dynamic imaging","authors":"A. Fahmy, Bassel S. Tawfik, Y. Kadah","doi":"10.1109/ICASSP.2000.859290","DOIUrl":"https://doi.org/10.1109/ICASSP.2000.859290","url":null,"abstract":"In magnetic resonance imaging (MRI), data are collected as spectrum samples. The acquisition time is proportional to the number of the spectrum lines. Therefore, only few lines of the data space may be required in order to track rapid changes of an object. In the current techniques, the missed lines may be zeroed or replaced by the corresponding lines in a reference image, which is acquired a priori for the same anatomical cross-section. However, this always comes at the expense of the spatial-resolution. In this study, we propose an extrapolation iterative algorithm to provide an improved estimate of the missed lines. Additional spatial and spatial-frequency constraints of the reference image are incorporated to enhance the convergence and obtain a better estimate of the initial conditions of the iterations. Results from simulated data verify the theory and indicate that the algorithm may provide better reconstruction in dynamic imaging studies.","PeriodicalId":164817,"journal":{"name":"2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100)","volume":"13 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2000-06-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124189436","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Detection of subspace waveforms in subspace interference and noise 子空间干扰与噪声中子空间波形的检测
J. A. Gubner, L. Scharf
The natural models of multi-access communication and modern radar and sonar systems involve infinite-dimensional waveform spaces. A common problem in these systems is the detection of subspace signals measured in the presence of subspace interference and broadband noise. By and large, the existing theory for such problems has been developed for finite-dimensional measurement spaces rather than the infinite-dimensional waveform spaces needed here. In this paper the log of the generalized likelihood ratio detector for the waveform problem is derived and is shown to have a certain chi-squared distribution, depending on the hypothesis.
多址通信和现代雷达、声纳系统的自然模型涉及无限维的波形空间。这些系统中的一个共同问题是在存在子空间干扰和宽带噪声的情况下检测子空间信号。总的来说,这类问题的现有理论都是针对有限维的测量空间发展起来的,而不是针对这里需要的无限维的波形空间。本文推导了波形问题的广义似然比检测器的对数,并证明了它有一定的卡方分布,取决于假设。
{"title":"Detection of subspace waveforms in subspace interference and noise","authors":"J. A. Gubner, L. Scharf","doi":"10.1109/ICASSP.2000.861955","DOIUrl":"https://doi.org/10.1109/ICASSP.2000.861955","url":null,"abstract":"The natural models of multi-access communication and modern radar and sonar systems involve infinite-dimensional waveform spaces. A common problem in these systems is the detection of subspace signals measured in the presence of subspace interference and broadband noise. By and large, the existing theory for such problems has been developed for finite-dimensional measurement spaces rather than the infinite-dimensional waveform spaces needed here. In this paper the log of the generalized likelihood ratio detector for the waveform problem is derived and is shown to have a certain chi-squared distribution, depending on the hypothesis.","PeriodicalId":164817,"journal":{"name":"2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100)","volume":"25 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2000-06-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127755920","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
The use of sub-band cepstrum in speaker verification 子带倒频谱在说话人验证中的应用
P. Sivakumaran, A. Ariyaeeinia
This paper focuses on the spectral representation of the sub-band cepstrum in relation to that of the full-band cepstrum. Through theoretical analysis it is shown that the net spectral information content of the cepstral coefficients with the same index in different sub-bands is only comparable to that of a full-band cepstral parameter whose quefrency is given by the product of that specific index with the number of sub-bands. A new method is proposed to tackle this deficiency of the sub-band cepstrum when it is used in the context of text-dependent speaker verification. The experimental investigations have clearly demonstrated the effectiveness of this method in speaker verification.
本文重点研究了子带倒谱与全带倒谱的频谱表示。通过理论分析表明,具有相同指标的倒谱系数在不同子带中的净谱信息含量仅与频率由该指标与子带数乘积给出的全带倒谱参数的净谱信息含量相当。针对子带倒频谱在文本依赖的说话人验证中存在的不足,提出了一种新的子带倒频谱验证方法。实验验证了该方法在说话人验证中的有效性。
{"title":"The use of sub-band cepstrum in speaker verification","authors":"P. Sivakumaran, A. Ariyaeeinia","doi":"10.1109/ICASSP.2000.859149","DOIUrl":"https://doi.org/10.1109/ICASSP.2000.859149","url":null,"abstract":"This paper focuses on the spectral representation of the sub-band cepstrum in relation to that of the full-band cepstrum. Through theoretical analysis it is shown that the net spectral information content of the cepstral coefficients with the same index in different sub-bands is only comparable to that of a full-band cepstral parameter whose quefrency is given by the product of that specific index with the number of sub-bands. A new method is proposed to tackle this deficiency of the sub-band cepstrum when it is used in the context of text-dependent speaker verification. The experimental investigations have clearly demonstrated the effectiveness of this method in speaker verification.","PeriodicalId":164817,"journal":{"name":"2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100)","volume":"7 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2000-06-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127775454","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
On minimizing hierarchical mesh coding overhead: (HASM) hierarchical adaptive structured mesh approach 最小化分层网格编码开销:(HASM)分层自适应结构网格方法
Wael Badawy, M. Bayoumi
This paper presents an efficient mesh coding technique suitable for MPEG-4 video applications. The proposed technique significantly reduces the number of bits that are used to describe the mesh topology. It uses an adaptive structured mesh from coarse to fine, which can be coded as a count of splitting instead of nodes' locations. In the case of the quadtree, less than one bit per node can be achieved. This reduction induces an improvement of either the image quality or the global bit rate.
本文提出了一种适用于MPEG-4视频应用的高效网格编码技术。所提出的技术显著减少了用于描述网格拓扑的比特数。它使用自适应结构网格,从粗到细,可以编码为分裂计数,而不是节点的位置。在四叉树的情况下,每个节点可以实现不到1位。这种减少导致图像质量或全局比特率的改善。
{"title":"On minimizing hierarchical mesh coding overhead: (HASM) hierarchical adaptive structured mesh approach","authors":"Wael Badawy, M. Bayoumi","doi":"10.1109/ICASSP.2000.859205","DOIUrl":"https://doi.org/10.1109/ICASSP.2000.859205","url":null,"abstract":"This paper presents an efficient mesh coding technique suitable for MPEG-4 video applications. The proposed technique significantly reduces the number of bits that are used to describe the mesh topology. It uses an adaptive structured mesh from coarse to fine, which can be coded as a count of splitting instead of nodes' locations. In the case of the quadtree, less than one bit per node can be achieved. This reduction induces an improvement of either the image quality or the global bit rate.","PeriodicalId":164817,"journal":{"name":"2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100)","volume":"85 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2000-06-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126302557","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 31
Simplified path metric updating in the M algorithm for VLSI implementation VLSI实现中M算法路径度量更新的简化
L. González, E. Boutillon
A VLSI structure for path metric updating in the M algorithm is presented. The architecture is based on the combination of a modified Batcher's (1968) odd-even merging network and a bitonic selection procedure. A feature of the trellis structure allows to replace an existing solution based on two 2M-item sorting operations by three M-item sorting operations with an additional one-layer bitonic merge. These three sorting networks and the bitonic merging procedure permit a reduction of up to 50% in hardware complexity.
提出了一种用于M算法中路径度量更新的VLSI结构。该结构是基于改进的Batcher's(1968)奇偶合并网络和双音选择程序的结合。网格结构的一个特性允许将基于两个2m项目排序操作的现有解决方案替换为三个m项目排序操作,并附带一个额外的单层bitonic合并。这三种排序网络和bitonic合并过程允许将硬件复杂性降低多达50%。
{"title":"Simplified path metric updating in the M algorithm for VLSI implementation","authors":"L. González, E. Boutillon","doi":"10.1109/ICASSP.2000.860125","DOIUrl":"https://doi.org/10.1109/ICASSP.2000.860125","url":null,"abstract":"A VLSI structure for path metric updating in the M algorithm is presented. The architecture is based on the combination of a modified Batcher's (1968) odd-even merging network and a bitonic selection procedure. A feature of the trellis structure allows to replace an existing solution based on two 2M-item sorting operations by three M-item sorting operations with an additional one-layer bitonic merge. These three sorting networks and the bitonic merging procedure permit a reduction of up to 50% in hardware complexity.","PeriodicalId":164817,"journal":{"name":"2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100)","volume":"10 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2000-06-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126389971","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
A hidden Markov model based visual speech synthesizer 基于隐马尔可夫模型的视觉语音合成器
J. J. Williams, A. Katsaggelos, M. Randolph
This paper describes a hidden Markov model (HMM) based visual synthesizer designed to assist persons with impaired hearing. This synthesizer builds on results in the area of audio-visual speech recognition. We describe how a correlation HMM can be used to integrate independent acoustic and visual HMMs for speech-to-visual synthesis. Our results show that an HMM correlating model can significantly improve synchronization errors versus techniques which compensate for rate differences through scaling.
本文设计了一种基于隐马尔可夫模型的视觉合成器,用于帮助听力障碍者。这个合成器建立在视听语音识别领域的成果之上。我们描述了如何使用相关HMM来整合独立的声学和视觉HMM,以实现语音到视觉的合成。我们的研究结果表明,与通过缩放来补偿速率差异的技术相比,HMM相关模型可以显著改善同步误差。
{"title":"A hidden Markov model based visual speech synthesizer","authors":"J. J. Williams, A. Katsaggelos, M. Randolph","doi":"10.1109/ICASSP.2000.859323","DOIUrl":"https://doi.org/10.1109/ICASSP.2000.859323","url":null,"abstract":"This paper describes a hidden Markov model (HMM) based visual synthesizer designed to assist persons with impaired hearing. This synthesizer builds on results in the area of audio-visual speech recognition. We describe how a correlation HMM can be used to integrate independent acoustic and visual HMMs for speech-to-visual synthesis. Our results show that an HMM correlating model can significantly improve synchronization errors versus techniques which compensate for rate differences through scaling.","PeriodicalId":164817,"journal":{"name":"2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100)","volume":"27 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2000-06-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126412336","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
期刊
2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1