首页 > 最新文献

2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA)最新文献

英文 中文
Learning ordered word representations with γ-decay dropout 用γ衰减dropout学习有序词表征
Aiting Liu, Chao Xing, Yang Feng, Dong Wang
Learning distributed word representations (word embeddings) has gained much popularity recently. Current learning approaches usually treat all dimensions of the embeddings as homogeneous, which leads to non-structured representations where the dimensions are neither interpretable nor comparable. This paper proposes a method to generate ordered word embed-dings where the significance of the dimensions is in descending order. The ordering mechanism may benefit a wide range of applications such as fast search, vector tailor, and so on. Our method employs a γ-decay dropout algorithm to make sure in the learning process the lower dimensions are more likely to be updated than the higher dimensions so that the lower dimensions can encode more information. The experimental results on the WordSimilarity-353, MEN3000, SCWS and SimLex-999 tasks show that compared to the non-ordered counterparts the proposed method indeed produced more meaningful ordered embeddings and achieved better performance.
学习分布式词表示(词嵌入)最近得到了广泛的应用。当前的学习方法通常将嵌入的所有维度视为同构的,这导致维度既不可解释也不可比较的非结构化表示。本文提出了一种生成有序词嵌入的方法,其中维度的重要性按降序排列。排序机制可能有利于广泛的应用程序,如快速搜索、矢量裁剪等。该方法采用γ衰减dropout算法,确保在学习过程中低维比高维更容易被更新,从而使低维可以编码更多的信息。在WordSimilarity-353、MEN3000、SCWS和SimLex-999任务上的实验结果表明,与无序任务相比,所提出的方法确实产生了更有意义的有序嵌入,并取得了更好的性能。
{"title":"Learning ordered word representations with γ-decay dropout","authors":"Aiting Liu, Chao Xing, Yang Feng, Dong Wang","doi":"10.1109/APSIPA.2016.7820839","DOIUrl":"https://doi.org/10.1109/APSIPA.2016.7820839","url":null,"abstract":"Learning distributed word representations (word embeddings) has gained much popularity recently. Current learning approaches usually treat all dimensions of the embeddings as homogeneous, which leads to non-structured representations where the dimensions are neither interpretable nor comparable. This paper proposes a method to generate ordered word embed-dings where the significance of the dimensions is in descending order. The ordering mechanism may benefit a wide range of applications such as fast search, vector tailor, and so on. Our method employs a γ-decay dropout algorithm to make sure in the learning process the lower dimensions are more likely to be updated than the higher dimensions so that the lower dimensions can encode more information. The experimental results on the WordSimilarity-353, MEN3000, SCWS and SimLex-999 tasks show that compared to the non-ordered counterparts the proposed method indeed produced more meaningful ordered embeddings and achieved better performance.","PeriodicalId":409448,"journal":{"name":"2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA)","volume":"24 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126731972","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Intra block copy hash reduction for HEVC screen content coding 块内复制哈希减少HEVC屏幕内容编码
Che-Wei Kuo, H. Hang, Chun-Liang Chien
To meet a wide range of needs for video applications such as remote desktop, video conference, distance education, and cloud gaming, the ISO/ITU Joint Collaborative Team on Video Coding (JCT-VC) committee is recently specifying the Screen Content Coding (SCC) standard, as one of the extensions of High Efficiency Video Coding (HEVC). In this paper, the hash search method of the standard adopted Intra Block Copy (IBC) coding tool for SCC is investigated. We collect the coded data using the current hash table and examine their efficiency and explore possible ways for further improvement. A low complexity scheme of selecting effective hash nodes and a modified hash key generation method are presented. Experimental results show that the proposed method reduces on the average 37% or at most 70% hash table memory usage but it preserves the similar BD-rate savings and encoding complexity when integrated into the SCM-3.0 test model.
为了满足远程桌面、视频会议、远程教育和云游戏等视频应用的广泛需求,ISO/ITU视频编码联合协作小组(JCT-VC)委员会最近指定了屏幕内容编码(SCC)标准,作为高效视频编码(HEVC)的扩展之一。本文研究了SCC标准采用的IBC (Intra Block Copy)编码工具的hash搜索方法。我们使用当前哈希表收集编码数据,并检查其效率并探索进一步改进的可能方法。提出了一种选择有效哈希节点的低复杂度方案和一种改进的哈希密钥生成方法。实验结果表明,该方法平均减少了37%或最多70%的哈希表内存使用,但在集成到SCM-3.0测试模型时,它保留了类似的BD-rate节省和编码复杂度。
{"title":"Intra block copy hash reduction for HEVC screen content coding","authors":"Che-Wei Kuo, H. Hang, Chun-Liang Chien","doi":"10.1109/APSIPA.2016.7820766","DOIUrl":"https://doi.org/10.1109/APSIPA.2016.7820766","url":null,"abstract":"To meet a wide range of needs for video applications such as remote desktop, video conference, distance education, and cloud gaming, the ISO/ITU Joint Collaborative Team on Video Coding (JCT-VC) committee is recently specifying the Screen Content Coding (SCC) standard, as one of the extensions of High Efficiency Video Coding (HEVC). In this paper, the hash search method of the standard adopted Intra Block Copy (IBC) coding tool for SCC is investigated. We collect the coded data using the current hash table and examine their efficiency and explore possible ways for further improvement. A low complexity scheme of selecting effective hash nodes and a modified hash key generation method are presented. Experimental results show that the proposed method reduces on the average 37% or at most 70% hash table memory usage but it preserves the similar BD-rate savings and encoding complexity when integrated into the SCM-3.0 test model.","PeriodicalId":409448,"journal":{"name":"2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA)","volume":"87 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126335324","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Ensemble based speaker verification using adapted score fusion in noisy reverberant environments 在嘈杂混响环境中使用自适应分数融合的基于合奏的扬声器验证
Ryosuke Nakanishi, Sayaka Shiota, H. Kiya
This paper proposes an ensemble based automatic speaker recognition (ASV) using adapted score fusion in noisy reverberant environment. It is well known that background noise and reverberation affect the performance of the ASV systems. Various techniques have been reported to improve the robustness against noise and reverberation, and an ensemble based method is one of the effective techniques in the noisy environment. The ensemble based method uses a combination of several weak learners to achieve higher performance than a single learner method. However, since the performance is depended on the fusion weights, the adequate weight estimation method is required. The proposed weight estimation method is based a supervised adaptation and the evolutionary update algorithm. The QUT-NOISE-SRE protocol, which has been published recently, is used for simulating the reverberation of the clean speech in our experiments. The experimental results report the characteristics of the QUT-NOISE-SRE protocol and the effectiveness of the proposed method in noisy reverberant environment.
提出了一种基于合奏的混响环境下自适应乐谱融合的自动说话人识别方法。众所周知,背景噪声和混响会影响ASV系统的性能。为了提高系统对噪声和混响的鲁棒性,已有多种方法被报道,其中基于集成的方法是在噪声环境下的有效方法之一。基于集成的方法使用多个弱学习器的组合来获得比单个学习器方法更高的性能。然而,由于其性能取决于融合权值,因此需要适当的权值估计方法。提出了一种基于监督自适应和进化更新算法的权重估计方法。在我们的实验中,我们使用了最近发表的QUT-NOISE-SRE协议来模拟干净语音的混响。实验结果表明了QUT-NOISE-SRE协议的特点以及该方法在噪声混响环境下的有效性。
{"title":"Ensemble based speaker verification using adapted score fusion in noisy reverberant environments","authors":"Ryosuke Nakanishi, Sayaka Shiota, H. Kiya","doi":"10.1109/APSIPA.2016.7820794","DOIUrl":"https://doi.org/10.1109/APSIPA.2016.7820794","url":null,"abstract":"This paper proposes an ensemble based automatic speaker recognition (ASV) using adapted score fusion in noisy reverberant environment. It is well known that background noise and reverberation affect the performance of the ASV systems. Various techniques have been reported to improve the robustness against noise and reverberation, and an ensemble based method is one of the effective techniques in the noisy environment. The ensemble based method uses a combination of several weak learners to achieve higher performance than a single learner method. However, since the performance is depended on the fusion weights, the adequate weight estimation method is required. The proposed weight estimation method is based a supervised adaptation and the evolutionary update algorithm. The QUT-NOISE-SRE protocol, which has been published recently, is used for simulating the reverberation of the clean speech in our experiments. The experimental results report the characteristics of the QUT-NOISE-SRE protocol and the effectiveness of the proposed method in noisy reverberant environment.","PeriodicalId":409448,"journal":{"name":"2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126282812","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Beamforming networks using spatial covariance features for far-field speech recognition 基于空间协方差特征的远场语音识别波束形成网络
Xiong Xiao, Shinji Watanabe, Chng Eng Siong, Haizhou Li
Recently, a deep beamforming (BF) network was proposed to predict BF weights from phase-carrying features, such as generalized cross correlation (GCC). The BF network is trained jointly with the acoustic model to minimize automatic speech recognition (ASR) cost function. In this paper, we propose to replace GCC with features derived from input signals' spatial covariance matrices (SCM), which contain the phase information of individual frequency bands. Experimental results on the AMI meeting transcription task shows that the BF network using SCM features significantly reduces the word error rate to 44.1% from 47.9% obtained with the conventional ASR pipeline using delay-and-sum BF. Also compared with GCC features, we have observed small but steady gain by 0.6% absolutely. The use of SCM features also facilitate the implementation of more advanced BF methods within a deep learning framework, such as minimum variance distortionless response BF that requires the speech and noise SCM.
近年来,提出了一种深度波束形成(BF)网络,利用广义互相关(GCC)等带相特征来预测BF权重。为了最小化自动语音识别(ASR)的代价函数,将BF网络与声学模型联合训练。在本文中,我们提出用输入信号的空间协方差矩阵(SCM)的特征来代替GCC,其中包含了各个频段的相位信息。AMI会议转录任务的实验结果表明,基于SCM特征的BF网络将单词错误率从传统的基于延迟和BF的ASR管道的47.9%显著降低到44.1%。同样,与GCC的特性相比,我们观察到小幅但稳定的增长,绝对增长了0.6%。单片机特性的使用也有助于在深度学习框架内实现更高级的BF方法,例如需要语音和噪声单片机的最小方差无失真响应BF。
{"title":"Beamforming networks using spatial covariance features for far-field speech recognition","authors":"Xiong Xiao, Shinji Watanabe, Chng Eng Siong, Haizhou Li","doi":"10.1109/APSIPA.2016.7820724","DOIUrl":"https://doi.org/10.1109/APSIPA.2016.7820724","url":null,"abstract":"Recently, a deep beamforming (BF) network was proposed to predict BF weights from phase-carrying features, such as generalized cross correlation (GCC). The BF network is trained jointly with the acoustic model to minimize automatic speech recognition (ASR) cost function. In this paper, we propose to replace GCC with features derived from input signals' spatial covariance matrices (SCM), which contain the phase information of individual frequency bands. Experimental results on the AMI meeting transcription task shows that the BF network using SCM features significantly reduces the word error rate to 44.1% from 47.9% obtained with the conventional ASR pipeline using delay-and-sum BF. Also compared with GCC features, we have observed small but steady gain by 0.6% absolutely. The use of SCM features also facilitate the implementation of more advanced BF methods within a deep learning framework, such as minimum variance distortionless response BF that requires the speech and noise SCM.","PeriodicalId":409448,"journal":{"name":"2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA)","volume":"25 5","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"120895555","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 17
SMT-based lexicon expansion for broadcast transcription 基于smt的广播转录词典扩展
Manon Ichiki, Aiko Hagiwara, Hitoshi Ito, K. Onoe, Shoei Sato, A. Kobayashi
We describe a method of lexicon expansion to tackle variations of spontaneous speech. The variations of utterances are found widely in the programs such as conversations talk shows and are typically observed as unintelligible utterances with a high speech-rate. Unlike read speech in news programs, these variations often severely degrade automatic speech recognition (ASR) performance. Then, these variations are considered as new versions of original entries in the ASR lexicon. The new entries are generated based on the SMT approach, in which translation models are trained from corpus translating phoneme sequence in a lexicon into the sequence obtained by phoneme recognition. We introduce a new method in which unreliable entries are removed from the lexicon. Our SMT-based approach achieved a 0.1 % WER reduction for a variety of broadcasting programs.
我们描述了一种词汇扩展的方法来处理自发语音的变化。话语变异在谈话、脱口秀等节目中广泛存在,通常是高语速的难以理解话语。与新闻节目中的读语音不同,这些变化通常会严重降低自动语音识别(ASR)的性能。然后,这些变体被认为是ASR词典中原始条目的新版本。新条目的生成基于SMT方法,该方法通过语料库将词汇中的音素序列翻译成音素识别得到的序列来训练翻译模型。我们提出了一种从词典中删除不可靠条目的新方法。我们基于smt的方法使各种广播节目的WER降低了0.1%。
{"title":"SMT-based lexicon expansion for broadcast transcription","authors":"Manon Ichiki, Aiko Hagiwara, Hitoshi Ito, K. Onoe, Shoei Sato, A. Kobayashi","doi":"10.1109/APSIPA.2016.7820682","DOIUrl":"https://doi.org/10.1109/APSIPA.2016.7820682","url":null,"abstract":"We describe a method of lexicon expansion to tackle variations of spontaneous speech. The variations of utterances are found widely in the programs such as conversations talk shows and are typically observed as unintelligible utterances with a high speech-rate. Unlike read speech in news programs, these variations often severely degrade automatic speech recognition (ASR) performance. Then, these variations are considered as new versions of original entries in the ASR lexicon. The new entries are generated based on the SMT approach, in which translation models are trained from corpus translating phoneme sequence in a lexicon into the sequence obtained by phoneme recognition. We introduce a new method in which unreliable entries are removed from the lexicon. Our SMT-based approach achieved a 0.1 % WER reduction for a variety of broadcasting programs.","PeriodicalId":409448,"journal":{"name":"2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA)","volume":"75 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115216605","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Robust scalable video multicast using triangular network coding in LTE/LTE-Advanced 在LTE/LTE- advanced中使用三角形网络编码的鲁棒可扩展视频多播
Phuc Chau, Yong-woo Lee, T. Bui, Jitae Shin, J. Jeong
The recent research studies showed that inter-layered network coding is a promising approach to provide the unequal error protection for scalable video multicast under the channel heterogeneity. The selection of the optimal transmission distribution performed at eNB increases the system performance with the cost of time and computational complexities. In this paper, we propose an optimal transmission strategy for the scalable video multicast using triangular network coding at the application layer in LTE/LTE-Advanced networks. The proposed transmission strategy comprises of two optimization phases: space reduction and performance maximization. The first optimization reduces the number of searching steps in the dictionary of possible transmission distributions by using a proposed performance predictive algorithm. The following optimization not only maximizes the average number of successfully decoded layers among receivers but also maximizes the number of receivers decoding the video base layer successfully in the second phase. We evaluate the proposed transmission strategy through various simulations with the performance metrics regarding the average number of successfully decoded layers among receivers in a multicast group, throughput, and video quality measurement. The simulation results show that our proposed scheme outperforms other recent studies and adapts well with the variable streaming rates of the video under the extreme time constraints.
近年来的研究表明,层间网络编码是一种很有前途的方法,可以在信道异构的情况下为可扩展的视频组播提供不相等错误保护。在eNB上进行的最优传输分配的选择以时间和计算复杂性为代价提高了系统性能。在LTE/LTE- advanced网络中,我们提出了一种基于应用层三角形网络编码的可扩展视频组播的最佳传输策略。所提出的传输策略包括空间最小化和性能最大化两个优化阶段。第一个优化通过使用提出的性能预测算法减少了可能传输分布字典中的搜索步骤数。以下优化不仅使接收器之间成功解码的平均层数最大化,而且使第二阶段成功解码视频基础层的接收器数量最大化。我们通过各种模拟来评估所提出的传输策略,这些模拟的性能指标涉及多播组中接收器之间成功解码的平均层数、吞吐量和视频质量测量。仿真结果表明,本文提出的方案优于目前已有的研究,能够很好地适应极端时间约束下视频的可变流速率。
{"title":"Robust scalable video multicast using triangular network coding in LTE/LTE-Advanced","authors":"Phuc Chau, Yong-woo Lee, T. Bui, Jitae Shin, J. Jeong","doi":"10.1109/APSIPA.2016.7820711","DOIUrl":"https://doi.org/10.1109/APSIPA.2016.7820711","url":null,"abstract":"The recent research studies showed that inter-layered network coding is a promising approach to provide the unequal error protection for scalable video multicast under the channel heterogeneity. The selection of the optimal transmission distribution performed at eNB increases the system performance with the cost of time and computational complexities. In this paper, we propose an optimal transmission strategy for the scalable video multicast using triangular network coding at the application layer in LTE/LTE-Advanced networks. The proposed transmission strategy comprises of two optimization phases: space reduction and performance maximization. The first optimization reduces the number of searching steps in the dictionary of possible transmission distributions by using a proposed performance predictive algorithm. The following optimization not only maximizes the average number of successfully decoded layers among receivers but also maximizes the number of receivers decoding the video base layer successfully in the second phase. We evaluate the proposed transmission strategy through various simulations with the performance metrics regarding the average number of successfully decoded layers among receivers in a multicast group, throughput, and video quality measurement. The simulation results show that our proposed scheme outperforms other recent studies and adapts well with the variable streaming rates of the video under the extreme time constraints.","PeriodicalId":409448,"journal":{"name":"2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA)","volume":"67 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115982907","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Prior information directed stage-wise measurement matrix design for block compressed image sensing 基于先验信息的分块压缩图像感知阶段测量矩阵设计
Mei Zhao, Anhong Wang, Zhiwei Xing, Peihao Li
A key issue for compressed Sensing (CS) is to design the measurement matrix. However, the traditional measurement matrix is not optimal due to its non-adaptability without showing discrimination to different components. In this paper, a prior information directed stage-wise measurement matrix is proposed for block compressed image sensing, leading to a st-BCS method. In the first stage, the measurement matrix only takes measurements of the important low frequency components directed by the prior structure information, and then it is updated stage by stage according to the prior information obtained at the decoder side via a feedback. Experimental results show that our st-BCS achieves significant performance improvement over the state-of-art BCS scheme which uses the non-adaptive random matrix.
压缩感知的一个关键问题是测量矩阵的设计。然而,传统的测量矩阵由于其对不同分量的不区分而不具有自适应性,因此不是最优的。本文提出了一种面向先验信息的分段测量矩阵,用于块压缩图像感知,从而得到st-BCS方法。在第一阶段,测量矩阵仅对先验结构信息指导下的重要低频分量进行测量,然后根据解码侧获得的先验信息通过反馈逐级更新。实验结果表明,我们的st-BCS方案比目前使用非自适应随机矩阵的BCS方案性能有显著提高。
{"title":"Prior information directed stage-wise measurement matrix design for block compressed image sensing","authors":"Mei Zhao, Anhong Wang, Zhiwei Xing, Peihao Li","doi":"10.1109/APSIPA.2016.7820678","DOIUrl":"https://doi.org/10.1109/APSIPA.2016.7820678","url":null,"abstract":"A key issue for compressed Sensing (CS) is to design the measurement matrix. However, the traditional measurement matrix is not optimal due to its non-adaptability without showing discrimination to different components. In this paper, a prior information directed stage-wise measurement matrix is proposed for block compressed image sensing, leading to a st-BCS method. In the first stage, the measurement matrix only takes measurements of the important low frequency components directed by the prior structure information, and then it is updated stage by stage according to the prior information obtained at the decoder side via a feedback. Experimental results show that our st-BCS achieves significant performance improvement over the state-of-art BCS scheme which uses the non-adaptive random matrix.","PeriodicalId":409448,"journal":{"name":"2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA)","volume":"67 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124956453","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A modified FSLMS algorithm for nonlinear ANC 非线性ANC的改进FSLMS算法
Lei Luo, Jinwei Sun, Boyan Huang, Xiangbin Jiang
By analyzing the theory of functional link artificial neural network (FLANN) structure based on filtered-s least mean square (FSLMS) algorithm which is usually used in the nonlinear active noise control (NANC) system, it can be found that the controller coefficients of nonlinear parts are multiple related, this problem causes much unbalance to calculate these coefficients and restraints the performance of FSLMS algorithm. To solve this issue, a modified FSLMS (MFSLMS) algorithm is proposed in this paper, it can weaken these multiple relationships greatly by adding a corrective filter before trigonometric expansion. Compared with conventional FSLMS algorithm and its other improved versions, MFSLMS algorithm not only performs better on noise cancellation, but also has less computational complexity. Extensive simulations are conducted to demonstrate the effectiveness of the proposed algorithm.
通过分析非线性有源噪声控制(NANC)系统中常用的基于滤波最小均方(FSLMS)算法的功能链接人工神经网络(FLANN)结构的原理,可以发现非线性部分的控制器系数是多重相关的,这一问题给这些系数的计算带来很大的不平衡性,制约了FSLMS算法的性能。为了解决这一问题,本文提出了一种改进的FSLMS (MFSLMS)算法,该算法通过在三角展开前添加校正滤波器,大大削弱了这些多重关系。与传统的FSLMS算法及其改进版本相比,MFSLMS算法不仅具有更好的降噪性能,而且具有更低的计算复杂度。大量的仿真验证了该算法的有效性。
{"title":"A modified FSLMS algorithm for nonlinear ANC","authors":"Lei Luo, Jinwei Sun, Boyan Huang, Xiangbin Jiang","doi":"10.1109/APSIPA.2016.7820859","DOIUrl":"https://doi.org/10.1109/APSIPA.2016.7820859","url":null,"abstract":"By analyzing the theory of functional link artificial neural network (FLANN) structure based on filtered-s least mean square (FSLMS) algorithm which is usually used in the nonlinear active noise control (NANC) system, it can be found that the controller coefficients of nonlinear parts are multiple related, this problem causes much unbalance to calculate these coefficients and restraints the performance of FSLMS algorithm. To solve this issue, a modified FSLMS (MFSLMS) algorithm is proposed in this paper, it can weaken these multiple relationships greatly by adding a corrective filter before trigonometric expansion. Compared with conventional FSLMS algorithm and its other improved versions, MFSLMS algorithm not only performs better on noise cancellation, but also has less computational complexity. Extensive simulations are conducted to demonstrate the effectiveness of the proposed algorithm.","PeriodicalId":409448,"journal":{"name":"2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA)","volume":"40 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116572378","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Pricing based resource allocation scheme for video multicast service in LTE networks LTE网络中基于定价的视频组播业务资源分配方案
Chung-Nan Lee, Han-Ting Lai
How to efficiently use the limited wireless bandwidth is of paramount important in the radio wireless network area. To use the wireless bandwidth in an efficiently way, one can consider a pricing model to improve users received video layers. In this paper, the users are divided into three classes. For different class users, they pay different price and enjoy different QoS. In a pricing model, more video layers are allocated to those users who pay a higher price. We propose a Pricing Class based Resource Allocation Scheme (PCRAS) that considers the price of multicast group and uses channel quality indicator to compute resource allocation priority to dispatch resource blocks efficiently. Experimental results shows that the proposed scheme can increase users' received video layers and users between different classes will receive different level of services. In comparison to other existing scheduling schemes, the proposed scheme can improve users' video experience.
如何有效地利用有限的无线带宽是无线网络领域的头等大事。为了有效地利用无线带宽,可以考虑一种定价模型来提高用户接收的视频层数。本文将用户分为三类。对于不同类别的用户,他们支付不同的价格,享受不同的QoS。在定价模型中,支付更高价格的用户会被分配更多的视频层。提出了一种基于定价类的资源分配方案(PCRAS),该方案考虑组播组的价格,利用信道质量指标计算资源分配优先级,从而有效地调度资源块。实验结果表明,该方案可以增加用户接收的视频层数,不同类别的用户接收到的服务水平也不同。与现有的调度方案相比,该方案可以提高用户的视频体验。
{"title":"Pricing based resource allocation scheme for video multicast service in LTE networks","authors":"Chung-Nan Lee, Han-Ting Lai","doi":"10.1109/APSIPA.2016.7820910","DOIUrl":"https://doi.org/10.1109/APSIPA.2016.7820910","url":null,"abstract":"How to efficiently use the limited wireless bandwidth is of paramount important in the radio wireless network area. To use the wireless bandwidth in an efficiently way, one can consider a pricing model to improve users received video layers. In this paper, the users are divided into three classes. For different class users, they pay different price and enjoy different QoS. In a pricing model, more video layers are allocated to those users who pay a higher price. We propose a Pricing Class based Resource Allocation Scheme (PCRAS) that considers the price of multicast group and uses channel quality indicator to compute resource allocation priority to dispatch resource blocks efficiently. Experimental results shows that the proposed scheme can increase users' received video layers and users between different classes will receive different level of services. In comparison to other existing scheduling schemes, the proposed scheme can improve users' video experience.","PeriodicalId":409448,"journal":{"name":"2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA)","volume":"46 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125099423","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
Iterative closest points method based on photometric weight for 3D object reconstruction 基于光度加权的迭代最近邻法用于三维物体重建
Dong-Won Shin, Yo-Sung Ho
Interests on 3D object reconstruction digitizing the shape and color of an object from the real world are getting popular. 3D object reconstruction consists of various steps such as image acquisition, image refinement, point cloud generation, iterative closest points, bundle adjustment and model surface representation. Among them, iterative closest points method is critical to calculate the accurate initial value for the optimization in the following bundle adjustment step. There is the object drift problem in the existing iterative closest points method due to the accumulated trajectory error as time flows. In this paper, we performed a more accurate registration between point clouds by SIFT features and the weighting on them. We found the proposed method decreases the absolute trajectory error and reduces the object drift problem in the reconstructed 3D object model.
将现实世界中的物体的形状和颜色数字化的三维物体重建的兴趣越来越受欢迎。三维物体重建包括图像采集、图像细化、点云生成、迭代最近点、束调整和模型表面表示等多个步骤。其中,迭代最近邻法对于计算出精确的初始值对于接下来的束平差步骤的优化至关重要。现有的迭代最近点法由于轨迹误差随时间的推移而累积,存在目标漂移问题。在本文中,我们通过SIFT特征及其加权对点云进行了更精确的配准。结果表明,该方法减小了三维物体模型重建过程中的绝对轨迹误差和物体漂移问题。
{"title":"Iterative closest points method based on photometric weight for 3D object reconstruction","authors":"Dong-Won Shin, Yo-Sung Ho","doi":"10.1109/APSIPA.2016.7820707","DOIUrl":"https://doi.org/10.1109/APSIPA.2016.7820707","url":null,"abstract":"Interests on 3D object reconstruction digitizing the shape and color of an object from the real world are getting popular. 3D object reconstruction consists of various steps such as image acquisition, image refinement, point cloud generation, iterative closest points, bundle adjustment and model surface representation. Among them, iterative closest points method is critical to calculate the accurate initial value for the optimization in the following bundle adjustment step. There is the object drift problem in the existing iterative closest points method due to the accumulated trajectory error as time flows. In this paper, we performed a more accurate registration between point clouds by SIFT features and the weighting on them. We found the proposed method decreases the absolute trajectory error and reduces the object drift problem in the reconstructed 3D object model.","PeriodicalId":409448,"journal":{"name":"2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA)","volume":"26 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125118641","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
期刊
2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1