首页 > 最新文献

2017 10th International Congress on Image and Signal Processing, BioMedical Engineering and Informatics (CISP-BMEI)最新文献

英文 中文
Comparison of non-negative matrix factorization and convolution kernel compensation in surface electromyograms of forearm muscles 非负矩阵分解与卷积核补偿在前臂肌表面电图上的比较
M. Šavc, V. Glaser, A. Holobar, I. Cikajlo, Z. Matjačić
This contribution compares performances of nonnegative matrix factorization and high-density surface electromyogram (EMG) decomposition on EMG signals recoded from forearm muscles of young healthy subjects. During the EMG measurements, subjects performed dynamic wrist extensions and flexions and universal haptic device robot was used to oppose their movements and to measure wrist kinematics and excreted muscle forces. Recoded EMG signals were independently decomposed by Convolution Kernel Compensation technique and by alternating least squares non-negative matrix factorization. The identified motor unit discharge patterns were summed into cumulative spike trains and compared with non-negative components in each measurement. The results demonstrated good match (average correlation coefficient of 0.92 ± 0.06), but several discrepancies between the identified components have also been observed. In particular, when limiting the time support of identified components to active EMG signal portions only, the average correlation coefficient dropped to 0.72 ±0.20.
本研究比较了非负矩阵分解和高密度表面肌电图(EMG)分解对年轻健康受试者前臂肌肉肌电信号的处理效果。在肌电图测量过程中,受试者进行动态手腕伸展和屈曲,并使用通用触觉装置机器人来反对他们的运动,并测量手腕的运动学和排泄的肌肉力。采用卷积核补偿技术和交替最小二乘非负矩阵分解技术对编码后的肌电信号进行独立分解。识别的运动单元放电模式被总结成累积尖峰序列,并在每次测量中与非负分量进行比较。结果吻合良好(平均相关系数为0.92±0.06),但鉴定组分之间也存在一些差异。特别是,当仅将识别分量的时间支持限制在活动肌电信号部分时,平均相关系数降至0.72±0.20。
{"title":"Comparison of non-negative matrix factorization and convolution kernel compensation in surface electromyograms of forearm muscles","authors":"M. Šavc, V. Glaser, A. Holobar, I. Cikajlo, Z. Matjačić","doi":"10.1109/CISP-BMEI.2017.8302216","DOIUrl":"https://doi.org/10.1109/CISP-BMEI.2017.8302216","url":null,"abstract":"This contribution compares performances of nonnegative matrix factorization and high-density surface electromyogram (EMG) decomposition on EMG signals recoded from forearm muscles of young healthy subjects. During the EMG measurements, subjects performed dynamic wrist extensions and flexions and universal haptic device robot was used to oppose their movements and to measure wrist kinematics and excreted muscle forces. Recoded EMG signals were independently decomposed by Convolution Kernel Compensation technique and by alternating least squares non-negative matrix factorization. The identified motor unit discharge patterns were summed into cumulative spike trains and compared with non-negative components in each measurement. The results demonstrated good match (average correlation coefficient of 0.92 ± 0.06), but several discrepancies between the identified components have also been observed. In particular, when limiting the time support of identified components to active EMG signal portions only, the average correlation coefficient dropped to 0.72 ±0.20.","PeriodicalId":6474,"journal":{"name":"2017 10th International Congress on Image and Signal Processing, BioMedical Engineering and Informatics (CISP-BMEI)","volume":"1 1","pages":"1-5"},"PeriodicalIF":0.0,"publicationDate":"2017-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"76287933","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Object classification using basic-level categories 使用基本级别类别的对象分类
Mariusz Mulka, Wojciech A. Lorkiewicz, R. Katarzyniak
This paper introduces a computational solution allowing an artificial system to organise large datasets into a set of known basic-level categories. Following cognitive computing paradigm we present an approach towards category-based internal organisation of cognitive agent's semantic memory. In particular, assuming a given set of basic-level categories (predefined or developed) we provide a concise introduction to two perceptron-based computational models allowing an artificial system to classify objects into basic-level categories. Utilising results from other disciplines (psychology, linguistics and cognitive science) we take advantage of the notion of cue validity and incorporate it as underlying weights of input features. Finally, using real bird species dataset we highlight simulation results of classification's precision and recall measures.
本文介绍了一种计算解决方案,允许人工系统将大型数据集组织成一组已知的基本级别类别。遵循认知计算范式,我们提出了一种基于类别的认知主体语义记忆内部组织方法。特别是,假设给定一组基本级别类别(预定义或开发),我们提供了两个基于感知器的计算模型的简要介绍,允许人工系统将对象分类为基本级别类别。利用其他学科(心理学、语言学和认知科学)的结果,我们利用线索有效性的概念,并将其作为输入特征的潜在权重。最后,利用真实的鸟类物种数据,对分类精度和召回率进行了仿真。
{"title":"Object classification using basic-level categories","authors":"Mariusz Mulka, Wojciech A. Lorkiewicz, R. Katarzyniak","doi":"10.1109/CISP-BMEI.2017.8302327","DOIUrl":"https://doi.org/10.1109/CISP-BMEI.2017.8302327","url":null,"abstract":"This paper introduces a computational solution allowing an artificial system to organise large datasets into a set of known basic-level categories. Following cognitive computing paradigm we present an approach towards category-based internal organisation of cognitive agent's semantic memory. In particular, assuming a given set of basic-level categories (predefined or developed) we provide a concise introduction to two perceptron-based computational models allowing an artificial system to classify objects into basic-level categories. Utilising results from other disciplines (psychology, linguistics and cognitive science) we take advantage of the notion of cue validity and incorporate it as underlying weights of input features. Finally, using real bird species dataset we highlight simulation results of classification's precision and recall measures.","PeriodicalId":6474,"journal":{"name":"2017 10th International Congress on Image and Signal Processing, BioMedical Engineering and Informatics (CISP-BMEI)","volume":"41 1","pages":"1-6"},"PeriodicalIF":0.0,"publicationDate":"2017-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"73150666","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Online dynamic magnetic resonance imaging based on pseudo-polar sampling and GPU acceleration 基于伪极坐标采样和GPU加速的在线动态磁共振成像
Qiushi Meng, Zhaoyang Jin
Most of the online dynamic magnetic resonance imaging (dMRI) techniques are developed based on Cartesian trajectories. Recently, radial trajectories have been proposed to acquire image data for online dMRI. Compared with Cartesian trajectories, radial trajectories cover densely at k-space center and are more incoherent. When using compressed sensing technique to reconstruct dynamic images with under-sampling radial k-space data, the regridding procedure is employed, however it is usually time consuming and introduces numerical errors. In this study, a novel radial-like pseudo-polar (PP) trajectory was used for online dMRI. PP trajectory can avoid regridding and inverse-regridding operation by using a pseudopolar FFT (PPFFT) operation without interpolation. In the reconstructiongraphics processing unit (GPU) is used to further decrease the reconstruction time and achieve real-time online effect. In this simulation study, cardiac k-space dataset was fully acquired and using as a reference dataset. The PP trajectory was used to retrospectively under-sample k-space data with 12.5% and 25% coverage. The reconstruction results show that, the image quality of online dMRI based on PP under-sampling is higher than that of radial under-sampling based method. The reconstruction time was significantly shorten by using GPU acceleration, for the tested case, it is more than 20 times faster than the CPU computing.
大多数在线动态磁共振成像(dMRI)技术都是基于笛卡尔轨迹开发的。最近,径向轨迹被提出用于在线dMRI获取图像数据。与笛卡儿轨迹相比,径向轨迹在k空间中心覆盖密集,不相干性更强。当使用压缩感知技术重建欠采样径向k空间数据的动态图像时,采用了重网格过程,但它通常耗时且引入数值误差。在这项研究中,一种新的放射状伪极(PP)轨迹被用于在线dMRI。PP轨迹采用不加插值的伪极FFT (PPFFT)运算,避免了重格和反重格操作。在重建中,采用图形处理器(GPU)进一步缩短了重建时间,达到了实时在线的效果。在本仿真研究中,充分获取了心脏k空间数据集,并将其作为参考数据集。PP轨迹用于回顾性样本下k空间数据,覆盖率分别为12.5%和25%。重建结果表明,基于PP欠采样的在线dMRI图像质量高于基于径向欠采样的在线dMRI图像质量。使用GPU加速后,重构时间明显缩短,对于测试用例,重构速度比CPU计算快20倍以上。
{"title":"Online dynamic magnetic resonance imaging based on pseudo-polar sampling and GPU acceleration","authors":"Qiushi Meng, Zhaoyang Jin","doi":"10.1109/CISP-BMEI.2017.8302180","DOIUrl":"https://doi.org/10.1109/CISP-BMEI.2017.8302180","url":null,"abstract":"Most of the online dynamic magnetic resonance imaging (dMRI) techniques are developed based on Cartesian trajectories. Recently, radial trajectories have been proposed to acquire image data for online dMRI. Compared with Cartesian trajectories, radial trajectories cover densely at k-space center and are more incoherent. When using compressed sensing technique to reconstruct dynamic images with under-sampling radial k-space data, the regridding procedure is employed, however it is usually time consuming and introduces numerical errors. In this study, a novel radial-like pseudo-polar (PP) trajectory was used for online dMRI. PP trajectory can avoid regridding and inverse-regridding operation by using a pseudopolar FFT (PPFFT) operation without interpolation. In the reconstructiongraphics processing unit (GPU) is used to further decrease the reconstruction time and achieve real-time online effect. In this simulation study, cardiac k-space dataset was fully acquired and using as a reference dataset. The PP trajectory was used to retrospectively under-sample k-space data with 12.5% and 25% coverage. The reconstruction results show that, the image quality of online dMRI based on PP under-sampling is higher than that of radial under-sampling based method. The reconstruction time was significantly shorten by using GPU acceleration, for the tested case, it is more than 20 times faster than the CPU computing.","PeriodicalId":6474,"journal":{"name":"2017 10th International Congress on Image and Signal Processing, BioMedical Engineering and Informatics (CISP-BMEI)","volume":"137 1","pages":"1-5"},"PeriodicalIF":0.0,"publicationDate":"2017-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"73229150","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Power waveforms analysis of GNSS-R delay signals from beidou GEO satellite 北斗GEO卫星GNSS-R延迟信号功率波形分析
Mohammad Shohidul Islam, Dong-Kai Yang, Muhammad Abdul Alim Sikder
GNSS remote sensing can play an important rule for monitoring the earth's surface. For this why, this technique is proposed to retrieve some information from water surface. Specifically, scattered signals from the ocean surface can provide significant information of water surface and its current condition. In this paper, direct and scattered power waveforms of GEO Satellite of Beidou are mainly focused and analyzed. Due to delay of reflected signal, the waveforms pattern & characteristics are clearly different from direct signals. These power waveforms are determined for different coherent integration times, different wind speeds and different satellite elevation angles. Z-V scattering model and Tanos Elfouhaily wave spectrum model are utilized to present the power waveforms of reflected signals and these waveforms are compared with the power waveforms of reflected & direct signals of real data. It is observed that these waveforms are sensitive to integration time, wind speed and satellite elevation angle. It is also found that the waveforms are directly related to these three parameters (time, wind speed & elevation angle). As a result, the model waveforms and real data waveforms have performed a good matching each other.
GNSS遥感可以在地球表面监测中发挥重要作用。为此,提出了一种从水面提取信息的方法。具体来说,海面散射信号可以提供重要的海面信息和当前状况。本文主要对北斗GEO卫星的直接功率波形和散射功率波形进行了研究和分析。由于反射信号的延迟,其波形模式和特征与直接信号明显不同。这些功率波形是在不同的相干积分时间、不同的风速和不同的卫星仰角下确定的。利用Z-V散射模型和Tanos Elfouhaily波谱模型给出了反射信号的功率波形,并将这些波形与实际数据的反射和直接信号的功率波形进行了比较。结果表明,这些波形对积分时间、风速和卫星仰角都很敏感。研究还发现,波形与时间、风速和仰角这三个参数直接相关。结果表明,模型波形与实际数据波形吻合良好。
{"title":"Power waveforms analysis of GNSS-R delay signals from beidou GEO satellite","authors":"Mohammad Shohidul Islam, Dong-Kai Yang, Muhammad Abdul Alim Sikder","doi":"10.1109/CISP-BMEI.2017.8302134","DOIUrl":"https://doi.org/10.1109/CISP-BMEI.2017.8302134","url":null,"abstract":"GNSS remote sensing can play an important rule for monitoring the earth's surface. For this why, this technique is proposed to retrieve some information from water surface. Specifically, scattered signals from the ocean surface can provide significant information of water surface and its current condition. In this paper, direct and scattered power waveforms of GEO Satellite of Beidou are mainly focused and analyzed. Due to delay of reflected signal, the waveforms pattern & characteristics are clearly different from direct signals. These power waveforms are determined for different coherent integration times, different wind speeds and different satellite elevation angles. Z-V scattering model and Tanos Elfouhaily wave spectrum model are utilized to present the power waveforms of reflected signals and these waveforms are compared with the power waveforms of reflected & direct signals of real data. It is observed that these waveforms are sensitive to integration time, wind speed and satellite elevation angle. It is also found that the waveforms are directly related to these three parameters (time, wind speed & elevation angle). As a result, the model waveforms and real data waveforms have performed a good matching each other.","PeriodicalId":6474,"journal":{"name":"2017 10th International Congress on Image and Signal Processing, BioMedical Engineering and Informatics (CISP-BMEI)","volume":"24 1","pages":"1-6"},"PeriodicalIF":0.0,"publicationDate":"2017-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"74619347","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Single-color image motion deblurring using MTV model 使用MTV模型的单色图像运动去模糊
Hong Zhang, Guodong Wang, Nan Wu, Guojia Hou, Zhimei Zhang
Image blind restoration has been a significant subject in various application fields. In the paper, we mainly studied the color image. In the process of converting color image into gray image will result in the loss of information because color image has different channels. In order to solve blind deconvolution of color image effectively, we present a method that estimates kernel result from three channels of color image directly based on multiscale framework. And then we ues the Multichannel Total Variation (MTV) model to protect image edges. By using normalized hyper Laplacian prior term, our method can converge to the real solution. The final clear image can be gotten. Although the MTV model will increase the complexity of computation, The correctness of algorithm and the feasibility of methods are proved by experiments.
图像盲恢复已成为各个应用领域的重要课题。本文主要研究的是彩色图像。在将彩色图像转换为灰度图像的过程中,由于彩色图像具有不同的通道,会造成信息的丢失。为了有效地解决彩色图像的盲反卷积问题,提出了一种基于多尺度框架的彩色图像三通道核结果直接估计方法。然后利用多通道全变分(MTV)模型对图像边缘进行保护。通过使用归一化超拉普拉斯先验项,我们的方法收敛到实解。最终得到清晰的图像。虽然MTV模型会增加计算复杂度,但实验证明了算法的正确性和方法的可行性。
{"title":"Single-color image motion deblurring using MTV model","authors":"Hong Zhang, Guodong Wang, Nan Wu, Guojia Hou, Zhimei Zhang","doi":"10.1109/CISP-BMEI.2017.8301961","DOIUrl":"https://doi.org/10.1109/CISP-BMEI.2017.8301961","url":null,"abstract":"Image blind restoration has been a significant subject in various application fields. In the paper, we mainly studied the color image. In the process of converting color image into gray image will result in the loss of information because color image has different channels. In order to solve blind deconvolution of color image effectively, we present a method that estimates kernel result from three channels of color image directly based on multiscale framework. And then we ues the Multichannel Total Variation (MTV) model to protect image edges. By using normalized hyper Laplacian prior term, our method can converge to the real solution. The final clear image can be gotten. Although the MTV model will increase the complexity of computation, The correctness of algorithm and the feasibility of methods are proved by experiments.","PeriodicalId":6474,"journal":{"name":"2017 10th International Congress on Image and Signal Processing, BioMedical Engineering and Informatics (CISP-BMEI)","volume":"8 1","pages":"1-5"},"PeriodicalIF":0.0,"publicationDate":"2017-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"73980359","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A novel method of text representation on hybrid neural networks 一种基于混合神经网络的文本表示方法
Yanbu Guo, Chen Jin, Weihua Li, Chen Ji, Yuanye Fang, Yunhao Duan
Text representation is one of the fundamental problems in text analysis tasks. The key of text representation is to extract and express the semantic and syntax feature of texts. The order-sensitive sequence models based on neural networks have achieved great progress in text representation. Bidirectional Long Short-Term Memory (BiLSTM) Neural Networks, as an extension of Recurrent Neural Networks (RNN), not only can deal with variable-length texts, capture the long-term dependencies in texts, but also model the forward and backward sequence contexts. Moreover, typical neural networks, Convolutional Neural Networks (CNN), can extract more semantic and structural information from texts, because of their convolution and pooling operations. The paper proposes a hybrid model, which combines the BiLSTM with 2-dimensial convolution and 1-dimensial pooling operations. In other words, the model firstly captures the abstract representation vector of texts by the BiLSTM, and then extracts text semantic features by 2-dimensial convolutional and 1-dimensial pooling operations. Experiments on text classification tasks show that our method obtains preferable performances compared with the state-of-the-art models when applied on the MR1 sentence polarity dataset.
文本表示是文本分析任务中的基本问题之一。文本表示的关键是提取和表达文本的语义和语法特征。基于神经网络的顺序敏感序列模型在文本表示方面取得了很大进展。双向长短期记忆(BiLSTM)神经网络作为递归神经网络(RNN)的扩展,不仅可以处理变长度文本,捕获文本中的长期依赖关系,还可以对前后序列上下文进行建模。此外,典型的神经网络卷积神经网络(CNN)由于其卷积和池化操作,可以从文本中提取更多的语义和结构信息。本文提出了一种将BiLSTM与二维卷积和一维池化操作相结合的混合模型。也就是说,该模型首先通过BiLSTM捕获文本的抽象表示向量,然后通过二维卷积和一维池化操作提取文本的语义特征。文本分类任务实验表明,该方法在MR1句子极性数据集上取得了比现有模型更好的性能。
{"title":"A novel method of text representation on hybrid neural networks","authors":"Yanbu Guo, Chen Jin, Weihua Li, Chen Ji, Yuanye Fang, Yunhao Duan","doi":"10.1109/CISP-BMEI.2017.8302099","DOIUrl":"https://doi.org/10.1109/CISP-BMEI.2017.8302099","url":null,"abstract":"Text representation is one of the fundamental problems in text analysis tasks. The key of text representation is to extract and express the semantic and syntax feature of texts. The order-sensitive sequence models based on neural networks have achieved great progress in text representation. Bidirectional Long Short-Term Memory (BiLSTM) Neural Networks, as an extension of Recurrent Neural Networks (RNN), not only can deal with variable-length texts, capture the long-term dependencies in texts, but also model the forward and backward sequence contexts. Moreover, typical neural networks, Convolutional Neural Networks (CNN), can extract more semantic and structural information from texts, because of their convolution and pooling operations. The paper proposes a hybrid model, which combines the BiLSTM with 2-dimensial convolution and 1-dimensial pooling operations. In other words, the model firstly captures the abstract representation vector of texts by the BiLSTM, and then extracts text semantic features by 2-dimensial convolutional and 1-dimensial pooling operations. Experiments on text classification tasks show that our method obtains preferable performances compared with the state-of-the-art models when applied on the MR1 sentence polarity dataset.","PeriodicalId":6474,"journal":{"name":"2017 10th International Congress on Image and Signal Processing, BioMedical Engineering and Informatics (CISP-BMEI)","volume":"43 1","pages":"1-6"},"PeriodicalIF":0.0,"publicationDate":"2017-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"74114597","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Collaborative filtering denoising algorithm based on the nonlocal centralized sparse representation model 基于非局部集中稀疏表示模型的协同滤波去噪算法
Jing Liu, Ruijiao Liu, Jinlei Chen, Yajie Yang, Douli Ma
An improved image denoising algorithm based on block-matching and 3D collaborative filtering (BM3D) is proposed in this manuscript. Instead of using the same filtering model for all patches in an image, we employ two different nonlocal filtering models in edge and smooth regions, respectively. We realize it by using the nonlocal centralized sparse representation (NCSR) to capture both local sparsity of wavelet coefficients and nonlocal similarity of grouped blocks. Experimental results demonstrate that the proposed method outperforms several state-of-the-art denoising methods in terms of objective metrics and visual quality.
提出了一种基于块匹配和三维协同滤波(BM3D)的图像去噪算法。我们不是对图像中的所有斑块使用相同的滤波模型,而是分别在边缘和光滑区域使用两种不同的非局部滤波模型。我们利用非局部集中稀疏表示(NCSR)来捕获小波系数的局部稀疏性和分组块的非局部相似度。实验结果表明,该方法在客观度量和视觉质量方面优于几种最先进的去噪方法。
{"title":"Collaborative filtering denoising algorithm based on the nonlocal centralized sparse representation model","authors":"Jing Liu, Ruijiao Liu, Jinlei Chen, Yajie Yang, Douli Ma","doi":"10.1109/CISP-BMEI.2017.8301951","DOIUrl":"https://doi.org/10.1109/CISP-BMEI.2017.8301951","url":null,"abstract":"An improved image denoising algorithm based on block-matching and 3D collaborative filtering (BM3D) is proposed in this manuscript. Instead of using the same filtering model for all patches in an image, we employ two different nonlocal filtering models in edge and smooth regions, respectively. We realize it by using the nonlocal centralized sparse representation (NCSR) to capture both local sparsity of wavelet coefficients and nonlocal similarity of grouped blocks. Experimental results demonstrate that the proposed method outperforms several state-of-the-art denoising methods in terms of objective metrics and visual quality.","PeriodicalId":6474,"journal":{"name":"2017 10th International Congress on Image and Signal Processing, BioMedical Engineering and Informatics (CISP-BMEI)","volume":"84 1","pages":"1-6"},"PeriodicalIF":0.0,"publicationDate":"2017-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"79358964","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Efficient deep learning for stereo matching with larger image patches 基于深度学习的大图像块立体匹配
Yiliu Feng, Zhengfa Liang, Hengzhu Liu
Stereo matching plays an important role in many applications, such as Advanced Driver Assistance Systems, 3D reconstruction, navigation, etc. However it is still an open problem with many difficult. Most difficult are often occlusions, object boundaries, and low or repetitive textures. In this paper, we propose a method for processing the stereo matching problem. We propose an efficient convolutional neural network to measure how likely the two patches matched or not and use the similarity as their stereo matching cost. Then the cost is refined by stereo methods, such as semiglobal maching, subpixel interpolation, median filter, etc. Our architecture uses large image patches which makes the results more robust to texture-less or repetitive textures areas. We experiment our approach on the KITTI2015 dataset which obtain an error rate of 4.42% and only needs 0.8 second for each image pairs.
立体匹配在高级驾驶辅助系统、三维重建、导航等诸多应用中发挥着重要作用。然而,这仍然是一个悬而未决的问题,有许多困难。最困难的通常是遮挡,物体边界和低或重复的纹理。本文提出了一种立体匹配问题的处理方法。我们提出了一个有效的卷积神经网络来衡量两个补丁匹配或不匹配的可能性,并使用相似度作为它们的立体匹配成本。然后采用半全局加工、亚像素插值、中值滤波等立体方法对代价进行细化。我们的架构使用大的图像补丁,这使得结果对纹理较少或重复的纹理区域更加健壮。我们在KITTI2015数据集上进行了实验,得到的错误率为4.42%,每对图像只需要0.8秒。
{"title":"Efficient deep learning for stereo matching with larger image patches","authors":"Yiliu Feng, Zhengfa Liang, Hengzhu Liu","doi":"10.1109/CISP-BMEI.2017.8301999","DOIUrl":"https://doi.org/10.1109/CISP-BMEI.2017.8301999","url":null,"abstract":"Stereo matching plays an important role in many applications, such as Advanced Driver Assistance Systems, 3D reconstruction, navigation, etc. However it is still an open problem with many difficult. Most difficult are often occlusions, object boundaries, and low or repetitive textures. In this paper, we propose a method for processing the stereo matching problem. We propose an efficient convolutional neural network to measure how likely the two patches matched or not and use the similarity as their stereo matching cost. Then the cost is refined by stereo methods, such as semiglobal maching, subpixel interpolation, median filter, etc. Our architecture uses large image patches which makes the results more robust to texture-less or repetitive textures areas. We experiment our approach on the KITTI2015 dataset which obtain an error rate of 4.42% and only needs 0.8 second for each image pairs.","PeriodicalId":6474,"journal":{"name":"2017 10th International Congress on Image and Signal Processing, BioMedical Engineering and Informatics (CISP-BMEI)","volume":"16 1","pages":"1-5"},"PeriodicalIF":0.0,"publicationDate":"2017-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"84342981","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 14
Three class emotions recognition based on deep learning using staked autoencoder 基于深度学习的三类情绪识别
Banghua Yang, Xu Han, Jianzhen Tang
Emotion recognition is a hot spot in advanced humancomputer interaction system, which is of great significance in artificial intelligence, health care, distance education, military field and so on. The paper builds a stacked autoencoder deep learning classification network consist of an input layer, two autoencoder hidden layers and a softmax classifier output layer based on SJTU Emotion EEG Dataset (SEED). Pretrain the first autoencoder employed L-BFGS to optimize the cost function. Then pretrain the second autoencoder with the output of first autoencoder. Finally send to the softmax classifier. Pretrain each autoencoder in forward propagation, then fine-tuning the whole network in back propagation. The well-trained network is used to classify three emotion states including happy, neural and grief. The raw inputs are differential entropy of EEG signal in five rhythmic frequencies band and the differential entropy of whole EEG signal. Fourteen experiments are performed with 5-fold cross validation, the average classification accuracy of three class emotion states is 59.6%, 66.27%, 71.97%, 78.48%, 82.56% and 85.5%. The result shows the higher frequency band differential entropy like Gamma band is more relative to emotion reaction.
情感识别是先进人机交互系统中的一个热点,在人工智能、医疗卫生、远程教育、军事等领域具有重要意义。基于上海交通大学情绪脑电图数据集(SEED),构建了一个由一个输入层、两个自编码器隐藏层和一个softmax分类器输出层组成的堆叠式自编码器深度学习分类网络。采用L-BFGS对第一个自编码器进行预训练,优化代价函数。然后用第一自编码器的输出对第二自编码器进行预训练。最后发送给softmax分类器。在前向传播中预训练每个自编码器,然后在反向传播中微调整个网络。这个训练有素的网络被用来对三种情绪状态进行分类,包括快乐、神经和悲伤。原始输入是脑电信号在五个节奏频带的微分熵和整个脑电信号的微分熵。共进行了14次5重交叉验证实验,3类情绪状态的平均分类准确率分别为59.6%、66.27%、71.97%、78.48%、82.56%和85.5%。结果表明,伽马波段等较高频带的微分熵与情绪反应的关系更大。
{"title":"Three class emotions recognition based on deep learning using staked autoencoder","authors":"Banghua Yang, Xu Han, Jianzhen Tang","doi":"10.1109/CISP-BMEI.2017.8302098","DOIUrl":"https://doi.org/10.1109/CISP-BMEI.2017.8302098","url":null,"abstract":"Emotion recognition is a hot spot in advanced humancomputer interaction system, which is of great significance in artificial intelligence, health care, distance education, military field and so on. The paper builds a stacked autoencoder deep learning classification network consist of an input layer, two autoencoder hidden layers and a softmax classifier output layer based on SJTU Emotion EEG Dataset (SEED). Pretrain the first autoencoder employed L-BFGS to optimize the cost function. Then pretrain the second autoencoder with the output of first autoencoder. Finally send to the softmax classifier. Pretrain each autoencoder in forward propagation, then fine-tuning the whole network in back propagation. The well-trained network is used to classify three emotion states including happy, neural and grief. The raw inputs are differential entropy of EEG signal in five rhythmic frequencies band and the differential entropy of whole EEG signal. Fourteen experiments are performed with 5-fold cross validation, the average classification accuracy of three class emotion states is 59.6%, 66.27%, 71.97%, 78.48%, 82.56% and 85.5%. The result shows the higher frequency band differential entropy like Gamma band is more relative to emotion reaction.","PeriodicalId":6474,"journal":{"name":"2017 10th International Congress on Image and Signal Processing, BioMedical Engineering and Informatics (CISP-BMEI)","volume":"21 1","pages":"1-5"},"PeriodicalIF":0.0,"publicationDate":"2017-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"84477602","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 15
Deep feature selection and projection for cross-age face retrieval 跨年龄人脸检索的深度特征选择与投影
Kaihua Tang, Xiao-Nan Hou, Zhiwen Shao, Lizhuang Ma
While traditional PIE (pose, illumination and expression) face variations have been well settled by latest methods, a new kind of variation, cross-age variation, is drawing attention from researchers. Most of the existing methods fail to maintain the effectiveness in real world applications that contain significant gap of age. Cross-age variation is caused by the shape deformation and texture changing of human faces while people getting old. It will result in tremendous intra-personal changes of face feature that deteriorate the performance of algorithms. This paper proposed a deep feature based framework for face retrieval problem. Our framework uses deep CNNs feature descriptor and two well designed post-processing methods to achieve age-invariance. To the best of our knowledge, this is the first deep feature based method in cross-age face retrieval problem. The deep CNNs model we use is firstly trained on traditional PIE datasets and then fine-tuned by cross-age dataset. The feature selection and projection post-processing we propose is also proved to be very effective in eliminating cross-age variation of deep CNNs feature. The experiments conducted on Cross-Age Celebrity Dataset (CACD), which is the largest public dataset containing cross-age variation, show that our framework outperforms previous state-of-the-art methods.
虽然传统的PIE(姿势、光照和表情)面部变化已经被最新的方法很好地解决了,但一种新的面部变化——跨年龄变化正在引起研究者的注意。大多数现有的方法在实际应用中都不能保持其有效性,因为实际应用中存在明显的年龄差距。跨年龄变异是人随着年龄的增长,面部的形状变形和纹理变化所引起的。这将导致人脸特征的巨大的个人内部变化,从而降低算法的性能。提出了一种基于深度特征的人脸检索框架。我们的框架使用深度cnn特征描述符和两种精心设计的后处理方法来实现年龄不变性。据我们所知,这是第一个基于深度特征的跨年龄人脸检索方法。我们使用的深度cnn模型首先在传统的PIE数据集上进行训练,然后通过跨年龄数据集进行微调。我们提出的特征选择和投影后处理在消除深度cnn特征的跨年龄变化方面也被证明是非常有效的。跨年龄名人数据集(CACD)是包含跨年龄变化的最大公共数据集,在该数据集上进行的实验表明,我们的框架优于以前最先进的方法。
{"title":"Deep feature selection and projection for cross-age face retrieval","authors":"Kaihua Tang, Xiao-Nan Hou, Zhiwen Shao, Lizhuang Ma","doi":"10.1109/CISP-BMEI.2017.8301986","DOIUrl":"https://doi.org/10.1109/CISP-BMEI.2017.8301986","url":null,"abstract":"While traditional PIE (pose, illumination and expression) face variations have been well settled by latest methods, a new kind of variation, cross-age variation, is drawing attention from researchers. Most of the existing methods fail to maintain the effectiveness in real world applications that contain significant gap of age. Cross-age variation is caused by the shape deformation and texture changing of human faces while people getting old. It will result in tremendous intra-personal changes of face feature that deteriorate the performance of algorithms. This paper proposed a deep feature based framework for face retrieval problem. Our framework uses deep CNNs feature descriptor and two well designed post-processing methods to achieve age-invariance. To the best of our knowledge, this is the first deep feature based method in cross-age face retrieval problem. The deep CNNs model we use is firstly trained on traditional PIE datasets and then fine-tuned by cross-age dataset. The feature selection and projection post-processing we propose is also proved to be very effective in eliminating cross-age variation of deep CNNs feature. The experiments conducted on Cross-Age Celebrity Dataset (CACD), which is the largest public dataset containing cross-age variation, show that our framework outperforms previous state-of-the-art methods.","PeriodicalId":6474,"journal":{"name":"2017 10th International Congress on Image and Signal Processing, BioMedical Engineering and Informatics (CISP-BMEI)","volume":"52 1","pages":"1-7"},"PeriodicalIF":0.0,"publicationDate":"2017-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"84581452","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
期刊
2017 10th International Congress on Image and Signal Processing, BioMedical Engineering and Informatics (CISP-BMEI)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1