首页 > 最新文献

2018 IEEE 23rd International Conference on Digital Signal Processing (DSP)最新文献

英文 中文
A Study on Improving Acoustic Model for Robust and Far-Field Speech Recognition 鲁棒远场语音识别声学模型改进研究
Pub Date : 2018-11-01 DOI: 10.1109/ICDSP.2018.8631862
Shaofei Xue, Zhijie Yan, Tao Yu, Zhang Liu
Far-field speech recognition is an essential technique for man-machine interactions. It aims to enable smart devices to recognize distant human speech. This technology is applied to many scenarios such as smart home appliances (smart loudspeaker, smart TV) and meeting transcription. Despite the significant advancement made in robust and far-field speech recognition after the introduction of deep neural network based acoustic models, the far-field speech recognition remains a challenging task due to various factors such as background noise, reverberation and even human voice interference. In this paper, we describe several technical advances for improving the performance of large-scale far-field speech recognition, including simulated data generation, improvements on front-end modules and neural network based acoustic models. Experimental results on several Mandarin Chinese speech recognition tasks have demonstrated that the combination of these technical advances can significantly outperform the conventional models.
远场语音识别是人机交互的一项重要技术。它的目标是使智能设备能够识别远距离的人类语音。该技术应用于智能家电(智能扬声器、智能电视)、会议转录等多个场景。尽管引入基于深度神经网络的声学模型后,在鲁棒性和远场语音识别方面取得了重大进展,但由于各种因素,如背景噪声、混响甚至人声干扰,远场语音识别仍然是一项具有挑战性的任务。在本文中,我们描述了提高大规模远场语音识别性能的几个技术进展,包括模拟数据生成、前端模块的改进和基于神经网络的声学模型。在几个普通话语音识别任务上的实验结果表明,这些技术进步的结合可以显著优于传统模型。
{"title":"A Study on Improving Acoustic Model for Robust and Far-Field Speech Recognition","authors":"Shaofei Xue, Zhijie Yan, Tao Yu, Zhang Liu","doi":"10.1109/ICDSP.2018.8631862","DOIUrl":"https://doi.org/10.1109/ICDSP.2018.8631862","url":null,"abstract":"Far-field speech recognition is an essential technique for man-machine interactions. It aims to enable smart devices to recognize distant human speech. This technology is applied to many scenarios such as smart home appliances (smart loudspeaker, smart TV) and meeting transcription. Despite the significant advancement made in robust and far-field speech recognition after the introduction of deep neural network based acoustic models, the far-field speech recognition remains a challenging task due to various factors such as background noise, reverberation and even human voice interference. In this paper, we describe several technical advances for improving the performance of large-scale far-field speech recognition, including simulated data generation, improvements on front-end modules and neural network based acoustic models. Experimental results on several Mandarin Chinese speech recognition tasks have demonstrated that the combination of these technical advances can significantly outperform the conventional models.","PeriodicalId":218806,"journal":{"name":"2018 IEEE 23rd International Conference on Digital Signal Processing (DSP)","volume":"24 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128166426","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
DCH-Net: Densely Connected Highway Convolution Neural Network for Environmental Sound Classification DCH-Net:用于环境声音分类的密集连接公路卷积神经网络
Pub Date : 2018-11-01 DOI: 10.1109/ICDSP.2018.8631632
Xiaohu Zhang, Yuexian Zou
Environmental Sound Classification (ESC) plays a vital role in the field of machine auditory scene. Recently, the Highway Network CNN model has achieved the state-of-art results via solving the vanishing-gradient problem of much deeper CNN. However, carefully analyzing the Highway Network model shows that the Highway Network model lacks ability to maximize information flow between layers, which is essentially benefits the discriminative representation of acoustic events. Besides, the Highway Network model size is larger than 20MB for ESC task, which is still large for mobile applications. Regarding to these two issues, in this study, we propose a novel Densely Connected Highway Convolutional Network (DCH-Net) model for ESC task. Specifically, a densely highway module is developed which is able to ensure the maximum information flow between layers by connecting all layers directly with each other. Besides, to reduce the model size, a global average pooling layer is designed which replaces the traditional fully connection layers and the parameters of the model is greatly reduced. Experimental results show that our DCH-Net ESC model achieves accuracy of 69% and 90% on ESC50 and ESCIO dataset respectively, which is 2% and 10% higher than that of Highway Network based Highway networks ESC model. Meanwhile our model size is only 2MB.
环境声分类(ESC)在机器听觉场景领域中起着至关重要的作用。最近,高速公路网CNN模型通过解决更深层的CNN的梯度消失问题,取得了最先进的结果。然而,仔细分析公路网模型表明,公路网模型缺乏最大化层间信息流的能力,而这本质上有利于声学事件的判别表示。此外,高速公路网络模型大小大于20MB的ESC任务,这仍然是大的移动应用程序。针对这两个问题,在本研究中,我们提出了一种用于ESC任务的新型密集连接公路卷积网络(DCH-Net)模型。具体来说,开发了一个密集的高速公路模块,通过各层之间的直接连接,保证了各层之间最大程度的信息流。此外,为了减小模型尺寸,设计了一个全局平均池化层,取代了传统的全连接层,大大减少了模型的参数。实验结果表明,DCH-Net ESC模型在ESC50和ESCIO数据集上的准确率分别达到69%和90%,比基于公路网的ESC模型分别提高了2%和10%。同时我们的模型大小只有2MB。
{"title":"DCH-Net: Densely Connected Highway Convolution Neural Network for Environmental Sound Classification","authors":"Xiaohu Zhang, Yuexian Zou","doi":"10.1109/ICDSP.2018.8631632","DOIUrl":"https://doi.org/10.1109/ICDSP.2018.8631632","url":null,"abstract":"Environmental Sound Classification (ESC) plays a vital role in the field of machine auditory scene. Recently, the Highway Network CNN model has achieved the state-of-art results via solving the vanishing-gradient problem of much deeper CNN. However, carefully analyzing the Highway Network model shows that the Highway Network model lacks ability to maximize information flow between layers, which is essentially benefits the discriminative representation of acoustic events. Besides, the Highway Network model size is larger than 20MB for ESC task, which is still large for mobile applications. Regarding to these two issues, in this study, we propose a novel Densely Connected Highway Convolutional Network (DCH-Net) model for ESC task. Specifically, a densely highway module is developed which is able to ensure the maximum information flow between layers by connecting all layers directly with each other. Besides, to reduce the model size, a global average pooling layer is designed which replaces the traditional fully connection layers and the parameters of the model is greatly reduced. Experimental results show that our DCH-Net ESC model achieves accuracy of 69% and 90% on ESC50 and ESCIO dataset respectively, which is 2% and 10% higher than that of Highway Network based Highway networks ESC model. Meanwhile our model size is only 2MB.","PeriodicalId":218806,"journal":{"name":"2018 IEEE 23rd International Conference on Digital Signal Processing (DSP)","volume":"100 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129933731","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
WiFi-Based Adaptive Indoor Passive Intrusion Detection 基于wifi的自适应室内被动入侵检测
Pub Date : 2018-11-01 DOI: 10.1109/ICDSP.2018.8631613
Z. Tian, Yong Li, Mu Zhou, Ze Li
Passive intrusion detection, which is an emerging technique to detect whether there exists any intruders in monitored area, is widely used in home security and smart home, etc. Up to now, various indoor fine-grained passive human intrusion detection systems using WiFi signals have been proposed. However, those existing detection systems mostly rely on elaborate off-line training process, which hampers fast deployment of wireless devices and also reduces system robustness. To response those problems, in this paper, we propose APID, a system for adaptive indoor passive intrusion detection, which enables adaptive, device-free human intrusion detection in indoor environments using channel state information (CSI) of WiFi signals. Firstly, APID evaluates dispersion of CSI amplitude, which is not affected by the mean amplitude. Secondly, APID extracts CSI amplitude dispersion ratio between two adjacent time windows as sensitive metrics for intrusion detection. Then, the hypothesis testing is utilized to achieve no-calibration human motion detection. Finally, we implement APID on the commodity WiFi devices and evaluate it in two typical indoor scenarios. The experimental results show that APID can achieve an average detection accuracy of more than 96%.
被动入侵检测是一种新兴的检测被监控区域是否存在入侵者的技术,广泛应用于家庭安防、智能家居等领域。到目前为止,已经提出了各种利用WiFi信号的室内细粒度被动人为入侵检测系统。然而,这些现有的检测系统大多依赖于复杂的离线训练过程,这阻碍了无线设备的快速部署,也降低了系统的鲁棒性。为了解决这些问题,本文提出了一种自适应室内被动入侵检测系统APID,该系统利用WiFi信号的信道状态信息(CSI)在室内环境中实现自适应、无设备的人类入侵检测。首先,APID评估CSI振幅的离散性,不受平均振幅的影响。其次,APID提取两个相邻时间窗之间的CSI振幅色散比作为入侵检测的敏感指标。然后,利用假设检验实现无标定人体运动检测。最后,我们在商品WiFi设备上实现了APID,并在两个典型的室内场景下进行了评估。实验结果表明,APID的平均检测准确率达到96%以上。
{"title":"WiFi-Based Adaptive Indoor Passive Intrusion Detection","authors":"Z. Tian, Yong Li, Mu Zhou, Ze Li","doi":"10.1109/ICDSP.2018.8631613","DOIUrl":"https://doi.org/10.1109/ICDSP.2018.8631613","url":null,"abstract":"Passive intrusion detection, which is an emerging technique to detect whether there exists any intruders in monitored area, is widely used in home security and smart home, etc. Up to now, various indoor fine-grained passive human intrusion detection systems using WiFi signals have been proposed. However, those existing detection systems mostly rely on elaborate off-line training process, which hampers fast deployment of wireless devices and also reduces system robustness. To response those problems, in this paper, we propose APID, a system for adaptive indoor passive intrusion detection, which enables adaptive, device-free human intrusion detection in indoor environments using channel state information (CSI) of WiFi signals. Firstly, APID evaluates dispersion of CSI amplitude, which is not affected by the mean amplitude. Secondly, APID extracts CSI amplitude dispersion ratio between two adjacent time windows as sensitive metrics for intrusion detection. Then, the hypothesis testing is utilized to achieve no-calibration human motion detection. Finally, we implement APID on the commodity WiFi devices and evaluate it in two typical indoor scenarios. The experimental results show that APID can achieve an average detection accuracy of more than 96%.","PeriodicalId":218806,"journal":{"name":"2018 IEEE 23rd International Conference on Digital Signal Processing (DSP)","volume":"25 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128901287","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 12
Interference Analysis in the LTE and NB-IoT Uplink Multiple Access with RF impairments 具有射频损伤的LTE和NB-IoT上行多址干扰分析
Pub Date : 2018-11-01 DOI: 10.1109/ICDSP.2018.8631794
Gustavo J. González, F. Gregorio, J. Cousseau
Narrowband internet of things (NB-IoT) considers the connection of thousands of devices to a single LTE base station (BS). To make possible the coexistence with classic LTE user equipements (UE)s, the BS allocates several IoT UEs into special physical resource blocks (PRB)s. These special PRBs reduce the IoT transmitter complexity but make the LTE signal interfere with the IoT PRBs. IoT nodes are in general low-cost and therefore prone to suffer from RF impairments. The LTE interference and the RF impairments compromise the performance of IoT nodes. In this paper, we analyze the coexistence of LTE and IoT in the multiple access uplink, considering RF impairments. We analyze the use of guard bands to reduce the interference from LTE in IoT. Also, we evaluate the allowable carrier frequency offset (CFO) and I/Q imbalance levels that ensures a reasonable system performance.
窄带物联网(NB-IoT)考虑将数千台设备连接到单个LTE基站(BS)。为了与传统的LTE用户设备(UE)共存,BS将多个物联网终端分配到特殊的物理资源块(PRB)中。这些特殊的prb降低了物联网发射机的复杂性,但使LTE信号干扰物联网prb。物联网节点通常成本较低,因此容易受到射频损伤。LTE干扰和射频损伤会影响物联网节点的性能。在本文中,我们分析了LTE和IoT在多址上行链路中共存的情况,并考虑了射频损伤。我们分析了在物联网中使用保护带来减少LTE的干扰。此外,我们还评估了允许的载波频率偏移(CFO)和I/Q不平衡水平,以确保合理的系统性能。
{"title":"Interference Analysis in the LTE and NB-IoT Uplink Multiple Access with RF impairments","authors":"Gustavo J. González, F. Gregorio, J. Cousseau","doi":"10.1109/ICDSP.2018.8631794","DOIUrl":"https://doi.org/10.1109/ICDSP.2018.8631794","url":null,"abstract":"Narrowband internet of things (NB-IoT) considers the connection of thousands of devices to a single LTE base station (BS). To make possible the coexistence with classic LTE user equipements (UE)s, the BS allocates several IoT UEs into special physical resource blocks (PRB)s. These special PRBs reduce the IoT transmitter complexity but make the LTE signal interfere with the IoT PRBs. IoT nodes are in general low-cost and therefore prone to suffer from RF impairments. The LTE interference and the RF impairments compromise the performance of IoT nodes. In this paper, we analyze the coexistence of LTE and IoT in the multiple access uplink, considering RF impairments. We analyze the use of guard bands to reduce the interference from LTE in IoT. Also, we evaluate the allowable carrier frequency offset (CFO) and I/Q imbalance levels that ensures a reasonable system performance.","PeriodicalId":218806,"journal":{"name":"2018 IEEE 23rd International Conference on Digital Signal Processing (DSP)","volume":"26 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130527633","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 9
An Improved Lateral Vibration Suppression Strategy of the High-speed Train Using Repetitive Learning Control 基于重复学习控制的高速列车横向振动抑制改进策略
Pub Date : 2018-11-01 DOI: 10.1109/ICDSP.2018.8631552
Chunrong Chen, Duo Zhao, Deqing Huang, Qichao Tang
This paper tends to use a novel perspective to suppress the lateral vibration of high-speed trains (HST), i.e., making use of the periodicity of lateral dynamics. First, the dynamics of a quarter-vehicle model are analysed and modelled. Next, a backstepping controller is designed to suppress the lateral vibration of car body based on a 3-degree-of-freedom (3-DOF) simulation model. And lateral ride comfort improvements are achieved by implementing such control strategy in comparison with passive system. Finally, under the framework of back-stepping design, a repetitive learning control (RLC) scheme is presented to reduce the lateral vibration by periodic tracking control. The learning convergence is proved rigorously in a Lyapunov way and the simulation results demonstrate the control superiority compared with the backstepping controller.
本文试图从一个新的角度来抑制高速列车的横向振动,即利用横向动力学的周期性。首先,对四分之一车辆动力学模型进行了分析和建模。其次,基于3自由度仿真模型,设计了抑制车身横向振动的反步控制器。与被动控制系统相比,实现了横向平顺性的改善。最后,在退步设计框架下,提出了一种重复学习控制(RLC)方案,通过周期跟踪控制来减少横向振动。用李雅普诺夫方法严格证明了该控制器的学习收敛性,仿真结果表明该控制器与反步控制器相比具有控制优势。
{"title":"An Improved Lateral Vibration Suppression Strategy of the High-speed Train Using Repetitive Learning Control","authors":"Chunrong Chen, Duo Zhao, Deqing Huang, Qichao Tang","doi":"10.1109/ICDSP.2018.8631552","DOIUrl":"https://doi.org/10.1109/ICDSP.2018.8631552","url":null,"abstract":"This paper tends to use a novel perspective to suppress the lateral vibration of high-speed trains (HST), i.e., making use of the periodicity of lateral dynamics. First, the dynamics of a quarter-vehicle model are analysed and modelled. Next, a backstepping controller is designed to suppress the lateral vibration of car body based on a 3-degree-of-freedom (3-DOF) simulation model. And lateral ride comfort improvements are achieved by implementing such control strategy in comparison with passive system. Finally, under the framework of back-stepping design, a repetitive learning control (RLC) scheme is presented to reduce the lateral vibration by periodic tracking control. The learning convergence is proved rigorously in a Lyapunov way and the simulation results demonstrate the control superiority compared with the backstepping controller.","PeriodicalId":218806,"journal":{"name":"2018 IEEE 23rd International Conference on Digital Signal Processing (DSP)","volume":"24 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130580781","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Energy Harvesting Modeling and Prediction during Walking Gait for a Sliding Shoe 滑动鞋行走步态能量收集建模与预测
Pub Date : 2018-11-01 DOI: 10.1109/ICDSP.2018.8631849
P. Shull, H. Xia
Harvesting energy during walking is a promising potential power source to decrease size or even eliminate batteries for wearable devices. While sliding shoes may offer a method for harvesting energy during gait, it is important to know their influence on the expected energy harvesting rate and metabolic cost rate. In this paper, we develop two multivariate linear regression models based on subject height, weight, and walking speed to predict energy harvesting rate and metabolic cost rate for walking with custom energy harvesting sliding shoes. Eight healthy subjects performed 200 meter overground walking trials at normal and fast speeds while wearing the custom sliding shoes to harvest energy and a portable gas analysis system to measure metabolic cost. The metabolic cost rate model performed well with only 6.9% error, while the energy harvesting rate model was less accurate with 29.9% error. Future research should focus on improving the models by adding additional features such as step frequency, speed of sliding and length of sliding to capture more of the variance. These findings could help to serve as a foundation to facilitate widespread adoption of wearable devices by reducing the required amount of onboard energy storage.
在行走过程中收集能量是一种很有前途的潜在电源,可以减少可穿戴设备的尺寸,甚至消除电池。虽然滑动鞋可以提供一种在步态中收集能量的方法,但重要的是要知道它们对预期能量收集率和代谢成本率的影响。本文基于受试者身高、体重和步行速度建立了两个多元线性回归模型,预测了穿着定制能量收集滑鞋行走的能量收集率和代谢成本率。8名健康受试者在正常和快速速度下进行200米的地面行走试验,同时穿着定制的滑动鞋来收集能量,并使用便携式气体分析系统来测量代谢成本。代谢成本率模型的准确性较好,误差仅为6.9%,而能量收集率模型的准确性较差,误差为29.9%。未来的研究应侧重于通过增加步进频率、滑动速度和滑动长度等额外特征来改进模型,以捕获更多的方差。这些发现可以通过减少所需的机载能量存储量,为促进可穿戴设备的广泛采用奠定基础。
{"title":"Energy Harvesting Modeling and Prediction during Walking Gait for a Sliding Shoe","authors":"P. Shull, H. Xia","doi":"10.1109/ICDSP.2018.8631849","DOIUrl":"https://doi.org/10.1109/ICDSP.2018.8631849","url":null,"abstract":"Harvesting energy during walking is a promising potential power source to decrease size or even eliminate batteries for wearable devices. While sliding shoes may offer a method for harvesting energy during gait, it is important to know their influence on the expected energy harvesting rate and metabolic cost rate. In this paper, we develop two multivariate linear regression models based on subject height, weight, and walking speed to predict energy harvesting rate and metabolic cost rate for walking with custom energy harvesting sliding shoes. Eight healthy subjects performed 200 meter overground walking trials at normal and fast speeds while wearing the custom sliding shoes to harvest energy and a portable gas analysis system to measure metabolic cost. The metabolic cost rate model performed well with only 6.9% error, while the energy harvesting rate model was less accurate with 29.9% error. Future research should focus on improving the models by adding additional features such as step frequency, speed of sliding and length of sliding to capture more of the variance. These findings could help to serve as a foundation to facilitate widespread adoption of wearable devices by reducing the required amount of onboard energy storage.","PeriodicalId":218806,"journal":{"name":"2018 IEEE 23rd International Conference on Digital Signal Processing (DSP)","volume":"17 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132200810","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Multitask Learning With Enhanced Modules 多任务学习与增强模块
Pub Date : 2018-11-01 DOI: 10.1109/ICDSP.2018.8631696
Zishuo Zheng, Yadong Wei, Zixu Zhao, Xindi Wu, Zhengcheng Li, Pengju Ren
In multitask learning (MTL) paradigm, modularity is an effective way to achieve component and parameter reuse as well as system extensibility. In this work, we introduce two enhanced modules named res-fire module (RF) and dimension reduction module(DR) to improve the performance of modular MTL network – PathNet. In addition, in order to further improve the transfer ability of the network, we apply learnable scale parameters to merge the outputs of the modules in the same layer and then scatter to the next layer. Experiments on MNIST, CIFAR, SVHN and MiniImageNet demonstrate that, with the similar scale as PathNet, our architecture achieves remarkable improvement in both transfer ability and expression ability. Our design used x5.23 fewer generations to achieve 99% accuracy on a source-to-target MNIST classification task compared with DeepMind’s PathNet. We also increase the accuracy of CIFARSVHN transfer task by x1.9. Also we get 70.75% accuracy on miniImageNet.
在多任务学习(MTL)范式中,模块化是实现组件和参数重用以及系统可扩展性的有效途径。在这项工作中,我们引入了两个增强模块,即res-fire模块(RF)和降维模块(DR),以提高模块化MTL网络PathNet的性能。此外,为了进一步提高网络的传递能力,我们采用可学习尺度参数对同一层模块的输出进行合并,然后分散到下一层。在MNIST、CIFAR、SVHN和MiniImageNet上的实验表明,在与PathNet相似的规模下,我们的架构在传输能力和表达能力上都取得了显著的提高。与DeepMind的PathNet相比,我们的设计减少了x5.23代,在源到目标的MNIST分类任务上实现了99%的准确率。我们还将CIFARSVHN传输任务的准确率提高了x1.9。我们在miniImageNet上得到70.75%的准确率。
{"title":"Multitask Learning With Enhanced Modules","authors":"Zishuo Zheng, Yadong Wei, Zixu Zhao, Xindi Wu, Zhengcheng Li, Pengju Ren","doi":"10.1109/ICDSP.2018.8631696","DOIUrl":"https://doi.org/10.1109/ICDSP.2018.8631696","url":null,"abstract":"In multitask learning (MTL) paradigm, modularity is an effective way to achieve component and parameter reuse as well as system extensibility. In this work, we introduce two enhanced modules named res-fire module (RF) and dimension reduction module(DR) to improve the performance of modular MTL network – PathNet. In addition, in order to further improve the transfer ability of the network, we apply learnable scale parameters to merge the outputs of the modules in the same layer and then scatter to the next layer. Experiments on MNIST, CIFAR, SVHN and MiniImageNet demonstrate that, with the similar scale as PathNet, our architecture achieves remarkable improvement in both transfer ability and expression ability. Our design used x5.23 fewer generations to achieve 99% accuracy on a source-to-target MNIST classification task compared with DeepMind’s PathNet. We also increase the accuracy of CIFARSVHN transfer task by x1.9. Also we get 70.75% accuracy on miniImageNet.","PeriodicalId":218806,"journal":{"name":"2018 IEEE 23rd International Conference on Digital Signal Processing (DSP)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130963304","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Speech Enhancement Exploiting Probabilistic Approach Using Maximum A Posterior 基于最大A后验概率方法的语音增强
Pub Date : 2018-11-01 DOI: 10.1109/ICDSP.2018.8631802
Xizhong Shen, Su Chenying
We examine spectral subtraction with both amplitude and phase spectra for improved speech enhancement performance by the method of maximum a posterior. Spectral subtraction is a very valid and direct denoising algorithm, but it has a vital problem, i.e., it may generate 'musical noise'. An adaptive harmonic model is utilized. Maximum a posterior is considered to derive the phase estimator, which is extra applied to amplitude spectral subtraction. Different from others, the extra parameters in our algorithm are considered as random variables, and the main extra parameters are amplitude and phase. The phase of the speech signal is assumed to have von Mises circular distribution, and the amplitude is to have normal distribution. The assumptions are applied to Bayesian theory, and we derived the update formulae of the parameters of the speech model, that is, phase estimator and amplitude estimator. Thus, we obtained the phase and amplitude of each harmonic. Simulation results show the further improvement of spectral subtraction.
我们通过最大后验方法研究了振幅和相位谱的谱减法对语音增强性能的改善。谱减法是一种非常有效和直接的去噪算法,但它有一个重要的问题,即它可能产生“音乐噪声”。采用自适应谐波模型。考虑极大后验来推导相位估计量,并将其应用于振幅谱减法。与其他算法不同的是,我们的算法将额外参数视为随机变量,主要额外参数为振幅和相位。假设语音信号的相位服从von Mises圆形分布,幅度服从正态分布。将这些假设应用到贝叶斯理论中,推导出语音模型参数的更新公式,即相位估计量和幅度估计量。因此,我们得到了每个谐波的相位和振幅。仿真结果表明了谱减法的进一步改进。
{"title":"Speech Enhancement Exploiting Probabilistic Approach Using Maximum A Posterior","authors":"Xizhong Shen, Su Chenying","doi":"10.1109/ICDSP.2018.8631802","DOIUrl":"https://doi.org/10.1109/ICDSP.2018.8631802","url":null,"abstract":"We examine spectral subtraction with both amplitude and phase spectra for improved speech enhancement performance by the method of maximum a posterior. Spectral subtraction is a very valid and direct denoising algorithm, but it has a vital problem, i.e., it may generate 'musical noise'. An adaptive harmonic model is utilized. Maximum a posterior is considered to derive the phase estimator, which is extra applied to amplitude spectral subtraction. Different from others, the extra parameters in our algorithm are considered as random variables, and the main extra parameters are amplitude and phase. The phase of the speech signal is assumed to have von Mises circular distribution, and the amplitude is to have normal distribution. The assumptions are applied to Bayesian theory, and we derived the update formulae of the parameters of the speech model, that is, phase estimator and amplitude estimator. Thus, we obtained the phase and amplitude of each harmonic. Simulation results show the further improvement of spectral subtraction.","PeriodicalId":218806,"journal":{"name":"2018 IEEE 23rd International Conference on Digital Signal Processing (DSP)","volume":"11 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127860380","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An FPGA-Based Liquid Association Calculator for Genome-Wide Co-Expression Analysis 基于fpga的全基因组共表达分析液体关联计算器
Pub Date : 2018-11-01 DOI: 10.1109/ICDSP.2018.8631683
Chien-An Wang, Sheng-Jui Huang, Yu-Cheng Li, Yi-Chang Lu
In this paper, we implement a Liquid Association calculator on an Altera Stratix V FPGA. Using data from high throughput microarrays, the hardware is capable of analyzing whether the presence of a third gene can affect the correlation between the existing two genes. The runtime and power consumption of our FPGA implementation is only 45.5% and 8.14%, respectively, of those required by the GPU version.
在本文中,我们在Altera Stratix V FPGA上实现了一个液体关联计算器。利用来自高通量微阵列的数据,该硬件能够分析第三个基因的存在是否会影响现有两个基因之间的相关性。FPGA实现的运行时间和功耗分别仅为GPU版本所需的45.5%和8.14%。
{"title":"An FPGA-Based Liquid Association Calculator for Genome-Wide Co-Expression Analysis","authors":"Chien-An Wang, Sheng-Jui Huang, Yu-Cheng Li, Yi-Chang Lu","doi":"10.1109/ICDSP.2018.8631683","DOIUrl":"https://doi.org/10.1109/ICDSP.2018.8631683","url":null,"abstract":"In this paper, we implement a Liquid Association calculator on an Altera Stratix V FPGA. Using data from high throughput microarrays, the hardware is capable of analyzing whether the presence of a third gene can affect the correlation between the existing two genes. The runtime and power consumption of our FPGA implementation is only 45.5% and 8.14%, respectively, of those required by the GPU version.","PeriodicalId":218806,"journal":{"name":"2018 IEEE 23rd International Conference on Digital Signal Processing (DSP)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128803375","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
ℓ1/2-Regularization Based Sparse Channel Estimation for MmWave Massive MIMO Systems 基于1/2正则化的毫米波海量MIMO系统稀疏信道估计
Pub Date : 2018-11-01 DOI: 10.1109/ICDSP.2018.8631630
Zhenyue Zhang, Guan Gui, Yan Liang
In the millimeter-wave (mmWave) massive MIMO system, the accuracy of channel estimation directly affects the performance of precoding at the transmitter and detection at the receiver. Hence, it is very important to obtain accurate channel state information (CSI). Considering the channel sparsity of mmWave massive MIMO with hybrid precoding, this paper proposes a ℓ_{1/2}-regularization based sparse channel estimation scheme. The basic idea of the proposed method is to formulate the sparse channel estimation to a compressed sensing problem. Specifically, the scheme firstly constructs an objective function, which is a weighted sum of the ℓ_{1/2}-regularization and the data fitting error. Then optimizes it by means of the gradient descent method iteratively and the weight parameter in the function is also updated each time. Different from the conventional schemes, our proposed scheme can avoid the quantization error and finally achieve super-resolution performance. Simulation results verify that the proposed algorithm can achieve better performance than some recently proposed algorithms.
在毫米波(mmWave)大规模MIMO系统中,信道估计的精度直接影响到发送端预编码和接收端的检测性能。因此,获取准确的信道状态信息(CSI)是非常重要的。针对混合预编码毫米波海量MIMO的信道稀疏性,提出了一种基于1/2正则化的稀疏信道估计方案。该方法的基本思想是将稀疏信道估计表述为压缩感知问题。具体而言,该方案首先构造一个目标函数,该目标函数是1 _{1/2}正则化和数据拟合误差的加权和。然后采用梯度下降法进行迭代优化,每次更新函数中的权值参数。与传统方案不同的是,我们提出的方案可以避免量化误差,最终达到超分辨性能。仿真结果表明,该算法比现有算法具有更好的性能。
{"title":"ℓ1/2-Regularization Based Sparse Channel Estimation for MmWave Massive MIMO Systems","authors":"Zhenyue Zhang, Guan Gui, Yan Liang","doi":"10.1109/ICDSP.2018.8631630","DOIUrl":"https://doi.org/10.1109/ICDSP.2018.8631630","url":null,"abstract":"In the millimeter-wave (mmWave) massive MIMO system, the accuracy of channel estimation directly affects the performance of precoding at the transmitter and detection at the receiver. Hence, it is very important to obtain accurate channel state information (CSI). Considering the channel sparsity of mmWave massive MIMO with hybrid precoding, this paper proposes a ℓ_{1/2}-regularization based sparse channel estimation scheme. The basic idea of the proposed method is to formulate the sparse channel estimation to a compressed sensing problem. Specifically, the scheme firstly constructs an objective function, which is a weighted sum of the ℓ_{1/2}-regularization and the data fitting error. Then optimizes it by means of the gradient descent method iteratively and the weight parameter in the function is also updated each time. Different from the conventional schemes, our proposed scheme can avoid the quantization error and finally achieve super-resolution performance. Simulation results verify that the proposed algorithm can achieve better performance than some recently proposed algorithms.","PeriodicalId":218806,"journal":{"name":"2018 IEEE 23rd International Conference on Digital Signal Processing (DSP)","volume":"327 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125426854","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
期刊
2018 IEEE 23rd International Conference on Digital Signal Processing (DSP)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1