首页 > 最新文献

2011 International Conference on Communications and Signal Processing最新文献

英文 中文
Predictor performance of decision-directed channel estimation in 3GPP MIMO channels 决策导向的3GPP MIMO信道估计的预测器性能
Pub Date : 2011-03-24 DOI: 10.1109/ICCSP.2011.5739327
P. Beinschob, U. Zolzer
OFDM-based multi antenna systems suffer from inaccurate channel estimates in time variant channels which lead eventually to performance degradation. Decision-directed channel estimation (DDCE) schemes have been proposed to reduce the number of reference symbols. A prediction of channel coefficients is necessary for MIMO detection, whose output is used to acquire new channel estimates recursively. Three approaches to predict channel coefficients are investigated in this work. For validation simulation were conducted with the 3GPP spatial channel model. Channel estimation and prediction accuracy for mobile terminals in a range of velocities and Signal to Noise Ratios were evaluated. Even though the predictors vary strong in computational complexity it is shown in this work the performance benefit of complicated approaches remains small in a variety of channel states.
基于ofdm的多天线系统在时变信道中存在信道估计不准确的问题,最终导致系统性能下降。为了减少参考符号的数量,提出了决策导向信道估计(DDCE)方案。信道系数的预测是MIMO检测的必要条件,其输出用于递归地获取新的信道估计。本文研究了三种预测通道系数的方法。为了验证这一点,采用3GPP空间信道模型进行了仿真。对移动终端在一定速度和信噪比范围内的信道估计和预测精度进行了评价。尽管预测器的计算复杂度变化很大,但在本研究中表明,复杂方法的性能优势在各种通道状态下仍然很小。
{"title":"Predictor performance of decision-directed channel estimation in 3GPP MIMO channels","authors":"P. Beinschob, U. Zolzer","doi":"10.1109/ICCSP.2011.5739327","DOIUrl":"https://doi.org/10.1109/ICCSP.2011.5739327","url":null,"abstract":"OFDM-based multi antenna systems suffer from inaccurate channel estimates in time variant channels which lead eventually to performance degradation. Decision-directed channel estimation (DDCE) schemes have been proposed to reduce the number of reference symbols. A prediction of channel coefficients is necessary for MIMO detection, whose output is used to acquire new channel estimates recursively. Three approaches to predict channel coefficients are investigated in this work. For validation simulation were conducted with the 3GPP spatial channel model. Channel estimation and prediction accuracy for mobile terminals in a range of velocities and Signal to Noise Ratios were evaluated. Even though the predictors vary strong in computational complexity it is shown in this work the performance benefit of complicated approaches remains small in a variety of channel states.","PeriodicalId":408736,"journal":{"name":"2011 International Conference on Communications and Signal Processing","volume":"170 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-03-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124244882","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Determination of position of detonation of artillery shells 炮弹起爆位置的测定
Pub Date : 2011-03-24 DOI: 10.1109/ICCSP.2011.5739304
J. Jena, T. Kishore Kumar
The aim of this paper is development and implementation of a unconventional method to determine the position of artillery shells using acoustic signature. Hence the authors developed an algorithm to achieve the same. Analysis has been done to determine the error introduced by the various sources on the final position determined by the algorithm.
本文的目的是开发和实现一种利用声学特征确定炮弹位置的非常规方法。因此,作者开发了一种算法来实现同样的目标。通过分析确定了各种源对算法确定的最终位置所带来的误差。
{"title":"Determination of position of detonation of artillery shells","authors":"J. Jena, T. Kishore Kumar","doi":"10.1109/ICCSP.2011.5739304","DOIUrl":"https://doi.org/10.1109/ICCSP.2011.5739304","url":null,"abstract":"The aim of this paper is development and implementation of a unconventional method to determine the position of artillery shells using acoustic signature. Hence the authors developed an algorithm to achieve the same. Analysis has been done to determine the error introduced by the various sources on the final position determined by the algorithm.","PeriodicalId":408736,"journal":{"name":"2011 International Conference on Communications and Signal Processing","volume":"24 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-03-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122825345","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Free breathing cardiac perfusion MRI reconstruction using a sparse and low rank model: Validation with the Physiologically Improved NCAT phantom 使用稀疏和低秩模型的自由呼吸心脏灌注MRI重建:与生理性改进的NCAT幻影验证
Pub Date : 2011-03-24 DOI: 10.1109/ICCSP.2011.5739310
Sajan Goud, M. Jacob
We recently proposed an accelerated dynamic magnetic resonance imaging (MRI) reconstruction algorithm that exploits the underlying low rank and sparse properties of the data to achieve highly accelerated reconstructions. In this paper, we validate our algorithm in the context of dynamic free breathing cardiac Perfusion MRI on the Physiologically Improved Non Uniform Cardiac Torso Phantom, PINCAT phantom. The practical utilities of our scheme in providing significantly better reconstructions at higher accelerations in comparison to existing methods are studied. We demonstrate that our scheme do not have trade offs with accurate temporal modeling and spatial quality unlike the existing low rank based schemes. Our results also show the capability of our scheme to achieve better reconstruction qualities at high accelerations in comparison to using only the low rank or sparsity properties individually. We argue that the speed up obtained by our scheme could be capitalized in perfusion imaging to provide better spatio-temporal resolutions and volume coverage while the subject is freely breathing.
我们最近提出了一种加速的动态磁共振成像(MRI)重建算法,该算法利用数据的底层低秩和稀疏特性来实现高度加速的重建。在本文中,我们在动态自由呼吸心脏灌注MRI的背景下对生理改进的非均匀心脏躯干幻影(PINCAT)幻影验证了我们的算法。研究了与现有方法相比,我们的方案在高加速度下提供更好的重建效果的实际效用。我们证明,与现有的低秩方案不同,我们的方案不需要在精确的时间建模和空间质量之间进行权衡。我们的结果还表明,与单独使用低秩或稀疏性属性相比,我们的方案能够在高加速度下实现更好的重建质量。我们认为,通过我们的方案获得的速度可以在灌注成像中得到利用,以提供更好的时空分辨率和体积覆盖,而受试者是自由呼吸。
{"title":"Free breathing cardiac perfusion MRI reconstruction using a sparse and low rank model: Validation with the Physiologically Improved NCAT phantom","authors":"Sajan Goud, M. Jacob","doi":"10.1109/ICCSP.2011.5739310","DOIUrl":"https://doi.org/10.1109/ICCSP.2011.5739310","url":null,"abstract":"We recently proposed an accelerated dynamic magnetic resonance imaging (MRI) reconstruction algorithm that exploits the underlying low rank and sparse properties of the data to achieve highly accelerated reconstructions. In this paper, we validate our algorithm in the context of dynamic free breathing cardiac Perfusion MRI on the Physiologically Improved Non Uniform Cardiac Torso Phantom, PINCAT phantom. The practical utilities of our scheme in providing significantly better reconstructions at higher accelerations in comparison to existing methods are studied. We demonstrate that our scheme do not have trade offs with accurate temporal modeling and spatial quality unlike the existing low rank based schemes. Our results also show the capability of our scheme to achieve better reconstruction qualities at high accelerations in comparison to using only the low rank or sparsity properties individually. We argue that the speed up obtained by our scheme could be capitalized in perfusion imaging to provide better spatio-temporal resolutions and volume coverage while the subject is freely breathing.","PeriodicalId":408736,"journal":{"name":"2011 International Conference on Communications and Signal Processing","volume":"32 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-03-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134043737","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Artificial bandwidth extension of narrowband speech using Gaussian Mixture Model 基于高斯混合模型的窄带语音人工带宽扩展
Pub Date : 2011-03-24 DOI: 10.1109/ICCSP.2011.5739348
D. Murali Mohan, Dileep B. Karpur, M. Narayan, J. Kishore
Spectrum of speech signals have frequency components from 50Hz to 7 kHz (Wideband speech). However, due to historical reasons speech is band-pass filtered between 300 Hz-3.4 kHz in PSTN networks and this speech is referred to as narrowband speech. The missing bandwidth in narrow band speech contributes to speech quality and intelligibility. This paper addresses the problem of artificial bandwidth extension of narrowband speech to wideband speech. The proposed method for bandwidth extension is based on statistical recovery using Gaussian Mixture Model (GMM) for spectral envelope parameters and spectral shifting method is used for excitation extension.
语音信号频谱的频率成分从50Hz到7khz(宽带语音)。然而,由于历史原因,在PSTN网络中,语音在300 Hz-3.4 kHz之间是带通滤波的,这种语音被称为窄带语音。窄带语音中的带宽缺失会影响语音质量和清晰度。本文研究了窄带语音到宽带语音的人工带宽扩展问题。提出了基于高斯混合模型(GMM)的频谱包络参数统计恢复的带宽扩展方法,采用谱移法进行激励扩展。
{"title":"Artificial bandwidth extension of narrowband speech using Gaussian Mixture Model","authors":"D. Murali Mohan, Dileep B. Karpur, M. Narayan, J. Kishore","doi":"10.1109/ICCSP.2011.5739348","DOIUrl":"https://doi.org/10.1109/ICCSP.2011.5739348","url":null,"abstract":"Spectrum of speech signals have frequency components from 50Hz to 7 kHz (Wideband speech). However, due to historical reasons speech is band-pass filtered between 300 Hz-3.4 kHz in PSTN networks and this speech is referred to as narrowband speech. The missing bandwidth in narrow band speech contributes to speech quality and intelligibility. This paper addresses the problem of artificial bandwidth extension of narrowband speech to wideband speech. The proposed method for bandwidth extension is based on statistical recovery using Gaussian Mixture Model (GMM) for spectral envelope parameters and spectral shifting method is used for excitation extension.","PeriodicalId":408736,"journal":{"name":"2011 International Conference on Communications and Signal Processing","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-03-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134088620","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 13
Full wave analysis of a novel multifractal multiband antenna using 3D-FDTD approach 一种新型多重分形多波段天线的三维时域有限差分全波分析
Pub Date : 2011-03-24 DOI: 10.1109/ICCSP.2011.5739315
Vivek Dhoot, Sanjeev Gupta
In this paper, a novel multifractal cantor based multiband monopole antenna is proposed and analyzed using 3-Dimensional Finite Difference Time Domain Method (3D-FDTD). The proposed antenna has multiband characteristics covering several wireless applications in Ultra Wideband (UWB) including WLAN 2.4 GHz and 5.8 GHz, GSM, PCS and DCS applications. A program based on 3D-FDTD method is written and utilized for observing return loss of the proposed antenna.
本文提出了一种新的基于多重分形康托的多波段单极天线,并利用三维时域有限差分法(3D-FDTD)对其进行了分析。所提出的天线具有多频段特性,涵盖超宽带(UWB)中的几种无线应用,包括WLAN 2.4 GHz和5.8 GHz, GSM, PCS和DCS应用。编写了基于三维时域有限差分法的程序,用于观测天线的回波损耗。
{"title":"Full wave analysis of a novel multifractal multiband antenna using 3D-FDTD approach","authors":"Vivek Dhoot, Sanjeev Gupta","doi":"10.1109/ICCSP.2011.5739315","DOIUrl":"https://doi.org/10.1109/ICCSP.2011.5739315","url":null,"abstract":"In this paper, a novel multifractal cantor based multiband monopole antenna is proposed and analyzed using 3-Dimensional Finite Difference Time Domain Method (3D-FDTD). The proposed antenna has multiband characteristics covering several wireless applications in Ultra Wideband (UWB) including WLAN 2.4 GHz and 5.8 GHz, GSM, PCS and DCS applications. A program based on 3D-FDTD method is written and utilized for observing return loss of the proposed antenna.","PeriodicalId":408736,"journal":{"name":"2011 International Conference on Communications and Signal Processing","volume":"50 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-03-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132534471","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Dielectric loss computation of multilayer Coplanar Waveguide 多层共面波导介质损耗计算
Pub Date : 2011-03-24 DOI: 10.1109/ICCSP.2011.5739352
Paramjeet Singh, A. K. Verma
The combined Quasi Static Spectral Domain Approach (SDA) method and Single Layer Reduction (SLR) technique is presented to compute dielectric loss of multilayer Coplanar Waveguide (CPW). The Green's function for the multilayer structure is derived from Transverse Transmission Line (TTL) method. Quasi static SDA method is used to compute effective relative permittivity of the multilayer CPW. The Single Layer Reduction (SLR) technique converts multilayer CPW structure to an equivalent single layer CPW structure. The dielectric loss is computed for the equivalent CPW structure.
提出了准静态谱域法(SDA)和单层还原法(SLR)相结合的多层共面波导介质损耗计算方法。多层结构的格林函数是由横向传输线(TTL)法导出的。采用准静态SDA方法计算多层CPW的有效相对介电常数。单层还原(SLR)技术将多层CPW结构转化为等效的单层CPW结构。计算了等效CPW结构的介电损耗。
{"title":"Dielectric loss computation of multilayer Coplanar Waveguide","authors":"Paramjeet Singh, A. K. Verma","doi":"10.1109/ICCSP.2011.5739352","DOIUrl":"https://doi.org/10.1109/ICCSP.2011.5739352","url":null,"abstract":"The combined Quasi Static Spectral Domain Approach (SDA) method and Single Layer Reduction (SLR) technique is presented to compute dielectric loss of multilayer Coplanar Waveguide (CPW). The Green's function for the multilayer structure is derived from Transverse Transmission Line (TTL) method. Quasi static SDA method is used to compute effective relative permittivity of the multilayer CPW. The Single Layer Reduction (SLR) technique converts multilayer CPW structure to an equivalent single layer CPW structure. The dielectric loss is computed for the equivalent CPW structure.","PeriodicalId":408736,"journal":{"name":"2011 International Conference on Communications and Signal Processing","volume":"30 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-03-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132668601","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Low power Viterbi Decoder by modified ACSU architecture and clock gating method 采用改进ACSU结构和时钟门控方法的低功耗维特比解码器
Pub Date : 2011-03-24 DOI: 10.1109/ICCSP.2011.5739371
Sunil P. Joshi, R. Paily
The use of error-correcting codes has proven to be an effective way to overcome data corruption in digital wireless communication channels, enabling reliable transmission over noisy and fading channel. This requires low power decoders as they consume lot of power. Power reduction in any system can be achieved at device level, at circuit level or at architectural level. In this paper, power reduction is achieved at architecture level. A Viterbi Decoder (VD) with architectural modification for Add-Compare-Select Unit (ACSU) and clock gated Survivor Memory Unit (SMU) are designed for low power wireless applications. A decoder system with code rate of k/n=1/2 with constraint length K=7 has been implemented with 130nm technology. It is synthesized using design compiler of Synopsys and its power is estimated with power compiler. A throughput of 125 Mbps is achieved satisfying the requirement for wireless applications. Bit error rate of proposed system is same as that of modified register exchange VD. Around 66% power is reduced with clock gating technique.
使用纠错码已被证明是克服数字无线通信信道中数据损坏的有效方法,可以在噪声和衰落信道中实现可靠的传输。这需要低功率解码器,因为它们消耗大量的功率。任何系统的功耗降低都可以在器件级、电路级或架构级实现。在本文中,功耗降低是在体系结构级别实现的。Viterbi解码器(VD)的架构修改为添加比较选择单元(ACSU)和时钟门控幸存者内存单元(SMU)是专为低功耗无线应用。采用130nm技术实现了码率为k/n=1/2、约束长度为k =7的译码系统。利用Synopsys的设计编译器对其进行综合,并利用功率编译器对其功率进行估算。达到了125mbps的吞吐量,满足了无线应用的要求。该系统的误码率与改进的寄存器交换VD相同。时钟门控技术降低了大约66%的功率。
{"title":"Low power Viterbi Decoder by modified ACSU architecture and clock gating method","authors":"Sunil P. Joshi, R. Paily","doi":"10.1109/ICCSP.2011.5739371","DOIUrl":"https://doi.org/10.1109/ICCSP.2011.5739371","url":null,"abstract":"The use of error-correcting codes has proven to be an effective way to overcome data corruption in digital wireless communication channels, enabling reliable transmission over noisy and fading channel. This requires low power decoders as they consume lot of power. Power reduction in any system can be achieved at device level, at circuit level or at architectural level. In this paper, power reduction is achieved at architecture level. A Viterbi Decoder (VD) with architectural modification for Add-Compare-Select Unit (ACSU) and clock gated Survivor Memory Unit (SMU) are designed for low power wireless applications. A decoder system with code rate of k/n=1/2 with constraint length K=7 has been implemented with 130nm technology. It is synthesized using design compiler of Synopsys and its power is estimated with power compiler. A throughput of 125 Mbps is achieved satisfying the requirement for wireless applications. Bit error rate of proposed system is same as that of modified register exchange VD. Around 66% power is reduced with clock gating technique.","PeriodicalId":408736,"journal":{"name":"2011 International Conference on Communications and Signal Processing","volume":"27 3","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-03-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132900938","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 9
Timing consideration in synchronous system level design 同步系统级设计中的时序考虑
Pub Date : 2011-03-24 DOI: 10.1109/ICCSP.2011.5739379
S. Siddamal, R. Banakar, B. C. Jinaga
This paper describes the architecture of system level design for the analysis of fiber parameters for one simulation step considering the synchronous and timing issues. The challenge in realizing these systems is not only the hardware but also complex control design that marshals the data flow. In a well-thought-out system level design approach it is necessary in splitting the design into several sub-modules, each addressing the specific timing and synchronizing issues. For the split step Fourier algorithm a system level model is designed considering the data path and control architecture. The timing and synchronizing are considering in RTL validation using Xilinx device XC5VLX30TFF655 with speed grade −3.
本文介绍了考虑同步和时序问题的单步仿真光纤参数分析的系统级设计体系结构。实现这些系统的挑战不仅在于硬件,还在于处理数据流的复杂控制设计。在深思熟虑的系统关卡设计方法中,有必要将设计分成几个子模块,每个子模块处理特定的时间和同步问题。对于分步傅里叶算法,考虑了数据路径和控制体系结构,设计了系统级模型。RTL验证采用Xilinx XC5VLX30TFF655器件,速度等级为−3。
{"title":"Timing consideration in synchronous system level design","authors":"S. Siddamal, R. Banakar, B. C. Jinaga","doi":"10.1109/ICCSP.2011.5739379","DOIUrl":"https://doi.org/10.1109/ICCSP.2011.5739379","url":null,"abstract":"This paper describes the architecture of system level design for the analysis of fiber parameters for one simulation step considering the synchronous and timing issues. The challenge in realizing these systems is not only the hardware but also complex control design that marshals the data flow. In a well-thought-out system level design approach it is necessary in splitting the design into several sub-modules, each addressing the specific timing and synchronizing issues. For the split step Fourier algorithm a system level model is designed considering the data path and control architecture. The timing and synchronizing are considering in RTL validation using Xilinx device XC5VLX30TFF655 with speed grade −3.","PeriodicalId":408736,"journal":{"name":"2011 International Conference on Communications and Signal Processing","volume":"9 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-03-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127691207","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Speaker independent continuous speech and isolated digit recognition using VQ and HMM 基于VQ和HMM的独立于说话人的连续语音和隔离数字识别
Pub Date : 2011-03-24 DOI: 10.1109/ICCSP.2011.5739300
A. Revathi, Y. Venkataramani
The main objective of this paper is to explore the effectiveness of perceptual features for performing isolated digits and continuous speech recognition. The proposed perceptual features are captured and code book indices are extracted. Expectation maximization algorithm is used to generate HMM models for the speeches. Speech recognition system is evaluated on clean test speeches and the experimental results reveal the performance of the proposed algorithm in recognizing isolated digits and continuous speeches based on maximum log likelihood value between test features and HMM models for each speech. Performance of these features is tested on speeches randomly chosen from “TI Digits_1”, “TI Digits_2” and “TIMIT” databases. This algorithm is tested for VQ and combination of VQ and HMM speech modeling techniques. Perceptual linear predictive cepstrum yields the accuracy of 86% and 93% for speaker independent isolated digit recognition using VQ and combination of VQ & HMM speech models respectively. This feature also gives 99% and 100% accuracy for speaker independent continuous speech recognition by using VQ and the combination of VQ & HMM speech modeling techniques.
本文的主要目的是探索感知特征在执行孤立数字和连续语音识别中的有效性。所提出的感知特征被捕获,代码本索引被提取。使用期望最大化算法对演讲生成HMM模型。在干净的测试语音上对语音识别系统进行了评估,实验结果表明,基于每个语音的测试特征和HMM模型之间的最大对数似然值,该算法在识别孤立数字和连续语音方面具有良好的性能。这些特征的性能在从“TI Digits_1”、“TI Digits_2”和“TIMIT”数据库中随机选择的演讲上进行测试。对该算法进行了VQ测试,并结合VQ和HMM语音建模技术进行了测试。使用VQ和VQ和HMM语音模型的组合,感知线性预测倒谱分别获得了86%和93%的独立于说话人的孤立数字识别准确率。该特性还通过使用VQ以及VQ和HMM语音建模技术的结合,为独立于说话者的连续语音识别提供99%和100%的准确率。
{"title":"Speaker independent continuous speech and isolated digit recognition using VQ and HMM","authors":"A. Revathi, Y. Venkataramani","doi":"10.1109/ICCSP.2011.5739300","DOIUrl":"https://doi.org/10.1109/ICCSP.2011.5739300","url":null,"abstract":"The main objective of this paper is to explore the effectiveness of perceptual features for performing isolated digits and continuous speech recognition. The proposed perceptual features are captured and code book indices are extracted. Expectation maximization algorithm is used to generate HMM models for the speeches. Speech recognition system is evaluated on clean test speeches and the experimental results reveal the performance of the proposed algorithm in recognizing isolated digits and continuous speeches based on maximum log likelihood value between test features and HMM models for each speech. Performance of these features is tested on speeches randomly chosen from “TI Digits_1”, “TI Digits_2” and “TIMIT” databases. This algorithm is tested for VQ and combination of VQ and HMM speech modeling techniques. Perceptual linear predictive cepstrum yields the accuracy of 86% and 93% for speaker independent isolated digit recognition using VQ and combination of VQ & HMM speech models respectively. This feature also gives 99% and 100% accuracy for speaker independent continuous speech recognition by using VQ and the combination of VQ & HMM speech modeling techniques.","PeriodicalId":408736,"journal":{"name":"2011 International Conference on Communications and Signal Processing","volume":"63 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-03-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116278919","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 33
Implementation of speaker verification system using Fuzzy Wavelet Network 用模糊小波网络实现说话人验证系统
Pub Date : 2011-03-24 DOI: 10.1109/ICCSP.2011.5739361
P. Shanmugapriya, Y. Venkataramani
A Fuzzy Wavelet network (FWN) is proposed to model the characteristics of a speaker in an automatic speaker verification system in this paper. The neural network using wavelet as activation function is wavelet network (Wavenet). Wavenet has the ability to extract the distinguishable and essential features in frequency rich signals. This is required in classification and identification problems such as speaker verification. Nonlinearity and structured knowledge representation with human perception of fuzzy inference system makes it to be a suitable model for speaker verification when combined with the wavelet network. In this approach, the wavelet theory is combined with the fuzzy based neural network theory which leads to construction of Fuzzy Wavelet Network (FWN). The advantage of fuzzy wavelet network is that the membership functions can be easily merged or divided using the multi resolution properties and the rules can be evaluated during learning. The performance of the proposed speaker verification system is evaluated with TIMIT database. A comparison is made between the proposed system and the system using state of the art model (GMM). Compared with GMM and WNN, FWN provides better verification performance.
本文提出了一种模糊小波网络(FWN)来对说话人自动验证系统中的说话人特征进行建模。以小波为激活函数的神经网络称为小波网络(Wavenet)。小波网络具有从频率丰富的信号中提取可区分的本质特征的能力。这在分类和识别问题(如说话人验证)中是必需的。模糊推理系统的非线性和具有人感知的结构化知识表示使其与小波网络相结合成为一种适合于说话人验证的模型。该方法将小波理论与基于模糊的神经网络理论相结合,构造了模糊小波网络。模糊小波网络的优点是利用模糊小波网络的多分辨率特性可以方便地对隶属函数进行合并或分割,并且可以在学习过程中对规则进行评估。利用TIMIT数据库对该说话人验证系统的性能进行了评价。将所提出的系统与采用最先进模型(GMM)的系统进行了比较。与GMM和WNN相比,FWN具有更好的验证性能。
{"title":"Implementation of speaker verification system using Fuzzy Wavelet Network","authors":"P. Shanmugapriya, Y. Venkataramani","doi":"10.1109/ICCSP.2011.5739361","DOIUrl":"https://doi.org/10.1109/ICCSP.2011.5739361","url":null,"abstract":"A Fuzzy Wavelet network (FWN) is proposed to model the characteristics of a speaker in an automatic speaker verification system in this paper. The neural network using wavelet as activation function is wavelet network (Wavenet). Wavenet has the ability to extract the distinguishable and essential features in frequency rich signals. This is required in classification and identification problems such as speaker verification. Nonlinearity and structured knowledge representation with human perception of fuzzy inference system makes it to be a suitable model for speaker verification when combined with the wavelet network. In this approach, the wavelet theory is combined with the fuzzy based neural network theory which leads to construction of Fuzzy Wavelet Network (FWN). The advantage of fuzzy wavelet network is that the membership functions can be easily merged or divided using the multi resolution properties and the rules can be evaluated during learning. The performance of the proposed speaker verification system is evaluated with TIMIT database. A comparison is made between the proposed system and the system using state of the art model (GMM). Compared with GMM and WNN, FWN provides better verification performance.","PeriodicalId":408736,"journal":{"name":"2011 International Conference on Communications and Signal Processing","volume":"43 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-03-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125189890","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
期刊
2011 International Conference on Communications and Signal Processing
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1