首页 > 最新文献

2017 IEEE 27th International Workshop on Machine Learning for Signal Processing (MLSP)最新文献

英文 中文
Human fall detection using segment-level cnn features and sparse dictionary learning 基于片段级cnn特征和稀疏字典学习的人体跌倒检测
C. Ge, I. Gu, Jie Yang
This paper addresses issues in human fall detection from videos. Unlike using handcrafted features in the conventional machine learning, we extract features from Convolutional Neural Networks (CNNs) for human fall detection. Similar to many existing work using two stream inputs, we use a spatial CNN stream with raw image difference and a temporal CNN stream with optical flow as the inputs of CNN. Different from conventional two stream action recognition work, we exploit sparse representation with residual-based pooling on the CNN extracted features, for obtaining more discriminative feature codes. For characterizing the sequential information in video activity, we use the code vector from long-range dynamic feature representation by concatenating codes in segment-levels as the input to a SVM classifier. Experiments have been conducted on two public video databases for fall detection. Comparisons with six existing methods show the effectiveness of the proposed method.
本文讨论了从视频中检测人体跌倒的问题。与在传统机器学习中使用手工制作的特征不同,我们从卷积神经网络(cnn)中提取特征用于人体跌倒检测。与许多使用两种流输入的现有工作类似,我们使用具有原始图像差分的空间CNN流和光流的时间CNN流作为CNN输入。与传统的两流动作识别工作不同,我们在CNN提取的特征上利用基于残差池化的稀疏表示,以获得更具判别性的特征代码。为了表征视频活动中的顺序信息,我们使用远程动态特征表示的代码向量,通过在段级别上连接代码作为支持向量机分类器的输入。在两个公共视频数据库上进行了跌落检测实验。与已有的六种方法进行了比较,结果表明了该方法的有效性。
{"title":"Human fall detection using segment-level cnn features and sparse dictionary learning","authors":"C. Ge, I. Gu, Jie Yang","doi":"10.1109/MLSP.2017.8168185","DOIUrl":"https://doi.org/10.1109/MLSP.2017.8168185","url":null,"abstract":"This paper addresses issues in human fall detection from videos. Unlike using handcrafted features in the conventional machine learning, we extract features from Convolutional Neural Networks (CNNs) for human fall detection. Similar to many existing work using two stream inputs, we use a spatial CNN stream with raw image difference and a temporal CNN stream with optical flow as the inputs of CNN. Different from conventional two stream action recognition work, we exploit sparse representation with residual-based pooling on the CNN extracted features, for obtaining more discriminative feature codes. For characterizing the sequential information in video activity, we use the code vector from long-range dynamic feature representation by concatenating codes in segment-levels as the input to a SVM classifier. Experiments have been conducted on two public video databases for fall detection. Comparisons with six existing methods show the effectiveness of the proposed method.","PeriodicalId":6542,"journal":{"name":"2017 IEEE 27th International Workshop on Machine Learning for Signal Processing (MLSP)","volume":"17 1","pages":"1-6"},"PeriodicalIF":0.0,"publicationDate":"2017-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"77713093","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 10
Blind channel equalization of encoded data over galois fields 伽罗瓦域上编码数据的盲信道均衡
D. Fantinato, A. Neves, D. G. Silva, R. Attux
In communication systems, the study of elements and structures defined over Galois fields are generally limited to data coding. However, in this work, a novel perspective that combines data coding and channel equalization is considered to compose a simplified communication system over the field. Besides the coding advantages, this framework is able to restore distortions or malfunctioning processes, and can be potentially applied in network coding models. Interestingly, the operation of the equalizer is possible from a blind standpoint through the exploration of the redundant information introduced by the encoder. More specifically, we define a blind equalization criterion based on the matching of probability mass functions (PMFs) via the Kullback-Leibler divergence. Simulations involving the main aspects of the equalizer and the criterion are performed, including the use of a genetic algorithm to aid the search for the solution, with promising results.
在通信系统中,对伽罗瓦域上定义的元素和结构的研究通常局限于数据编码。然而,在这项工作中,结合数据编码和信道均衡的新观点被认为是组成一个简化的通信系统。除了编码优势之外,该框架还能够恢复扭曲或故障过程,并且可以潜在地应用于网络编码模型。有趣的是,通过探索编码器引入的冗余信息,均衡器的操作可以从盲的角度来看。更具体地说,我们通过Kullback-Leibler散度定义了一个基于概率质量函数(pmf)匹配的盲均衡准则。进行了涉及均衡器和判据主要方面的模拟,包括使用遗传算法来帮助寻找解决方案,结果很有希望。
{"title":"Blind channel equalization of encoded data over galois fields","authors":"D. Fantinato, A. Neves, D. G. Silva, R. Attux","doi":"10.1109/MLSP.2017.8168135","DOIUrl":"https://doi.org/10.1109/MLSP.2017.8168135","url":null,"abstract":"In communication systems, the study of elements and structures defined over Galois fields are generally limited to data coding. However, in this work, a novel perspective that combines data coding and channel equalization is considered to compose a simplified communication system over the field. Besides the coding advantages, this framework is able to restore distortions or malfunctioning processes, and can be potentially applied in network coding models. Interestingly, the operation of the equalizer is possible from a blind standpoint through the exploration of the redundant information introduced by the encoder. More specifically, we define a blind equalization criterion based on the matching of probability mass functions (PMFs) via the Kullback-Leibler divergence. Simulations involving the main aspects of the equalizer and the criterion are performed, including the use of a genetic algorithm to aid the search for the solution, with promising results.","PeriodicalId":6542,"journal":{"name":"2017 IEEE 27th International Workshop on Machine Learning for Signal Processing (MLSP)","volume":"158 1","pages":"1-6"},"PeriodicalIF":0.0,"publicationDate":"2017-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"86347607","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The time series cluster kernel 时间序列簇核
Karl Øyvind Mikalsen, F. Bianchi, C. Soguero-Ruíz, R. Jenssen
This paper presents the time series cluster kernel (TCK) for multivariate time series with missing data. Our approach leverages the missing data handling properties of Gaussian mixture models (GMM) augmented with empirical prior distributions. Further, we exploit an ensemble learning approach to ensure robustness to parameters by combining the clustering results of many GMM to form the final kernel. In comparative experiments, we demonstrate that the TCK is robust to parameter choices and illustrate its capabilities of dealing with multivariate time series, both with and without missing data.
提出了多变量缺失数据时间序列的聚类核算法。我们的方法利用了高斯混合模型(GMM)与经验先验分布增强的缺失数据处理特性。此外,我们利用集成学习方法通过组合多个GMM的聚类结果形成最终核来确保对参数的鲁棒性。在对比实验中,我们证明了TCK对参数选择具有鲁棒性,并说明了它处理多变量时间序列的能力,无论有无丢失数据。
{"title":"The time series cluster kernel","authors":"Karl Øyvind Mikalsen, F. Bianchi, C. Soguero-Ruíz, R. Jenssen","doi":"10.1109/MLSP.2017.8168187","DOIUrl":"https://doi.org/10.1109/MLSP.2017.8168187","url":null,"abstract":"This paper presents the time series cluster kernel (TCK) for multivariate time series with missing data. Our approach leverages the missing data handling properties of Gaussian mixture models (GMM) augmented with empirical prior distributions. Further, we exploit an ensemble learning approach to ensure robustness to parameters by combining the clustering results of many GMM to form the final kernel. In comparative experiments, we demonstrate that the TCK is robust to parameter choices and illustrate its capabilities of dealing with multivariate time series, both with and without missing data.","PeriodicalId":6542,"journal":{"name":"2017 IEEE 27th International Workshop on Machine Learning for Signal Processing (MLSP)","volume":"58 1","pages":"1-6"},"PeriodicalIF":0.0,"publicationDate":"2017-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"80008304","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
A deep neural network witharestricted noisy channel for identification of functional introns 基于受限噪声通道的深度神经网络识别功能内含子
Alan Joseph Bekker, M. Chorev, L. Carmel, J. Goldberger
An appreciable fraction of introns is thought to be involved in cellular functions, but there is no obvious way to predict which specific intron is likely to be functional. For each intron we are given a feature representation that is based on its evolutionary patterns. For a small subsets of introns we are also given an indication that they are functional. For all other introns it is not known whether they are functional or not. Our task is to estimate what fraction of introns are functional and, how likely it is that each individual intron is functional. We define a probabilistic classification model that treats the given functionality labels as noisy versions of labels created by a Deep Neural Network model. The maximum-likelihood model parameters are found by utilizing the Expectation-Maximization algorithm. We show that roughly 80% of the functional introns are still not recognized as such, and that roughly a third of all introns are functional.
相当一部分内含子被认为参与细胞功能,但没有明显的方法来预测哪个特定的内含子可能是功能性的。对于每个内含子,我们给出了基于其进化模式的特征表示。对于一小部分内含子,我们也得到了它们是功能性的指示。对于所有其他的内含子,我们不知道它们是否有功能。我们的任务是估计多少内含子是功能性的,以及每个内含子是功能性的可能性有多大。我们定义了一个概率分类模型,该模型将给定的功能标签视为由深度神经网络模型创建的标签的噪声版本。利用期望最大化算法找到最大似然模型参数。我们发现,大约80%的功能性内含子仍未被识别,大约三分之一的内含子是功能性的。
{"title":"A deep neural network witharestricted noisy channel for identification of functional introns","authors":"Alan Joseph Bekker, M. Chorev, L. Carmel, J. Goldberger","doi":"10.1109/MLSP.2017.8168186","DOIUrl":"https://doi.org/10.1109/MLSP.2017.8168186","url":null,"abstract":"An appreciable fraction of introns is thought to be involved in cellular functions, but there is no obvious way to predict which specific intron is likely to be functional. For each intron we are given a feature representation that is based on its evolutionary patterns. For a small subsets of introns we are also given an indication that they are functional. For all other introns it is not known whether they are functional or not. Our task is to estimate what fraction of introns are functional and, how likely it is that each individual intron is functional. We define a probabilistic classification model that treats the given functionality labels as noisy versions of labels created by a Deep Neural Network model. The maximum-likelihood model parameters are found by utilizing the Expectation-Maximization algorithm. We show that roughly 80% of the functional introns are still not recognized as such, and that roughly a third of all introns are functional.","PeriodicalId":6542,"journal":{"name":"2017 IEEE 27th International Workshop on Machine Learning for Signal Processing (MLSP)","volume":"40 1","pages":"1-6"},"PeriodicalIF":0.0,"publicationDate":"2017-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"77600545","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Noise reduction in low-dose ct using a 3D multiscale sparse denoising autoencoder 基于三维多尺度稀疏去噪自编码器的低剂量ct降噪研究
K. Mentl, B. Mailhé, Florin C. Ghesu, Frank Schebesch, T. Haderlein, A. Maier, M. Nadar
This article presents a novel neural network-based approach for enhancement of 3D medical image data. The proposed networks learn a sparse representation basis by mapping the corrupted input data to corresponding optimal targets. To reinforce the adjustment of the network to the given data, the threshold values are also adaptively learned. In order to capture important image features on various scales and be able to process large computed tomography (CT) volumes in a reasonable time, a multiscale approach is applied. Recursively downsampled versions of the input are used and denoising operator of constant size are learnt at each scale. The networks are trained end-to-end from a database of real highdose acquisitions with synthetic additional noise to simulate the corresponding low-dose scans. Both 2D and 3D networks are evaluated on CT volumes and compared to the block-matching and 3D filtering (BM3D) algorithm. The presented methods achieve an increase of 4% to 11% in the SSIM and of 2.4 to 2.8 dB in the PSNR with respect to the ground truth, outperform BM3D in quantitative comparisions and present no visible texture artifacts. By exploiting volumetric information, 3D networks achieve superior results over 2D networks.
本文提出了一种新的基于神经网络的三维医学图像数据增强方法。该网络通过将损坏的输入数据映射到相应的最优目标来学习稀疏表示基础。为了加强网络对给定数据的调整,阈值也是自适应学习的。为了捕获不同尺度的重要图像特征,并能够在合理的时间内处理大的计算机断层扫描(CT)体积,应用了多尺度方法。使用递归下采样版本的输入,并在每个尺度上学习恒定大小的去噪算子。这些网络端到端从真实高剂量采集的数据库中进行训练,其中含有合成的附加噪声,以模拟相应的低剂量扫描。在CT体积上评估2D和3D网络,并与块匹配和3D滤波(BM3D)算法进行比较。所提出的方法在SSIM上实现了4%至11%的提高,在PSNR上实现了2.4至2.8 dB的提高,在定量比较中优于BM3D,并且没有出现可见的纹理伪影。通过利用体积信息,3D网络比2D网络获得更好的结果。
{"title":"Noise reduction in low-dose ct using a 3D multiscale sparse denoising autoencoder","authors":"K. Mentl, B. Mailhé, Florin C. Ghesu, Frank Schebesch, T. Haderlein, A. Maier, M. Nadar","doi":"10.1109/MLSP.2017.8168176","DOIUrl":"https://doi.org/10.1109/MLSP.2017.8168176","url":null,"abstract":"This article presents a novel neural network-based approach for enhancement of 3D medical image data. The proposed networks learn a sparse representation basis by mapping the corrupted input data to corresponding optimal targets. To reinforce the adjustment of the network to the given data, the threshold values are also adaptively learned. In order to capture important image features on various scales and be able to process large computed tomography (CT) volumes in a reasonable time, a multiscale approach is applied. Recursively downsampled versions of the input are used and denoising operator of constant size are learnt at each scale. The networks are trained end-to-end from a database of real highdose acquisitions with synthetic additional noise to simulate the corresponding low-dose scans. Both 2D and 3D networks are evaluated on CT volumes and compared to the block-matching and 3D filtering (BM3D) algorithm. The presented methods achieve an increase of 4% to 11% in the SSIM and of 2.4 to 2.8 dB in the PSNR with respect to the ground truth, outperform BM3D in quantitative comparisions and present no visible texture artifacts. By exploiting volumetric information, 3D networks achieve superior results over 2D networks.","PeriodicalId":6542,"journal":{"name":"2017 IEEE 27th International Workshop on Machine Learning for Signal Processing (MLSP)","volume":"6 1","pages":"1-6"},"PeriodicalIF":0.0,"publicationDate":"2017-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"87680203","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Upper bound performance of semi-definite programming for localisation in inhomogeneous media 非均匀介质中半确定规划的上界性能
E. Nadimi, V. Blanes-Vidal
In this paper, we regarded an absorbing inhomogeneous medium as an assembly of thin layers having different propagation properties. We derived a stochastic model for the refractive index and formulated the localisation problem given noisy distance measurements using graph realisation problem. We relaxed the problem using semi-definite programming (SDP) approach in lp realisation domain and derived upper bounds that follow Edmundson-Madansky bound of order 6p (EM6p) on the SDP objective function to provide an estimation of the techniques' localisation accuracy. Our results showed that the inhomogeneity of the media and the choice of lp norm have significant impact on the ratio of the expected value of the localisation error to the upper bound for the expected optimal SDP objective value. The tightest ratio was derived when l∞ norm was used.
本文将非均匀吸收介质看作具有不同传播特性的薄层的集合。我们推导了折射率的随机模型,并利用图形实现问题提出了给定噪声距离测量的定位问题。我们在lp实现域使用半确定规划(SDP)方法放宽问题,并在SDP目标函数上推导出遵循6p阶Edmundson-Madansky界(EM6p)的上界,以提供对技术定位精度的估计。我们的研究结果表明,介质的非均匀性和lp范数的选择对定位误差期望值与期望最优SDP目标值上界的比值有显著影响。采用l∞范数时,得到最紧比值。
{"title":"Upper bound performance of semi-definite programming for localisation in inhomogeneous media","authors":"E. Nadimi, V. Blanes-Vidal","doi":"10.1109/MLSP.2017.8168145","DOIUrl":"https://doi.org/10.1109/MLSP.2017.8168145","url":null,"abstract":"In this paper, we regarded an absorbing inhomogeneous medium as an assembly of thin layers having different propagation properties. We derived a stochastic model for the refractive index and formulated the localisation problem given noisy distance measurements using graph realisation problem. We relaxed the problem using semi-definite programming (SDP) approach in lp realisation domain and derived upper bounds that follow Edmundson-Madansky bound of order 6p (EM6p) on the SDP objective function to provide an estimation of the techniques' localisation accuracy. Our results showed that the inhomogeneity of the media and the choice of lp norm have significant impact on the ratio of the expected value of the localisation error to the upper bound for the expected optimal SDP objective value. The tightest ratio was derived when l∞ norm was used.","PeriodicalId":6542,"journal":{"name":"2017 IEEE 27th International Workshop on Machine Learning for Signal Processing (MLSP)","volume":"1 1","pages":"1-6"},"PeriodicalIF":0.0,"publicationDate":"2017-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"90588609","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A two-stage training deep neural network for small pedestrian detection 一种用于小型行人检测的两阶段训练深度神经网络
Tran Duy Linh, Masayuki Arai
In the present paper, we propose a deep network architecture in order to improve the accuracy of pedestrian detection. The proposed method contains a proposal network and a classification network that are trained separately. We use a single shot multibox detector (SSD) as a proposal network to generate the set of pedestrian proposals. The proposal network is fine-tuned from a pre-trained network by several pedestrian data sets of large input size (512 × 512 pixels) in order to improve detection accuracy of small pedestrians. Then, we use a classification network to classify pedestrian proposals. We then combine the scores from the proposal network and the classification network to obtain better final detection scores. Experiments were evaluated using the Caltech test set, and, compared to other state-of-the-art methods of pedestrian detection task, the proposed method obtains better results for small pedestrians (30 to 50 pixels in height) with an average miss rate of 42%.
在本文中,我们提出了一种深度网络架构,以提高行人检测的准确性。该方法包含单独训练的提议网络和分类网络。我们使用单镜头多盒检测器(SSD)作为提议网络来生成行人提议集。为了提高对小型行人的检测精度,该网络通过多个大输入尺寸(512 × 512像素)的行人数据集对预训练网络进行微调。然后,我们使用分类网络对行人建议进行分类。然后,我们将提议网络和分类网络的分数结合起来,得到更好的最终检测分数。使用加州理工学院的测试集对实验进行了评估,与其他最先进的行人检测任务方法相比,该方法在小行人(高度为30 ~ 50像素)中获得了更好的结果,平均缺失率为42%。
{"title":"A two-stage training deep neural network for small pedestrian detection","authors":"Tran Duy Linh, Masayuki Arai","doi":"10.1109/MLSP.2017.8168106","DOIUrl":"https://doi.org/10.1109/MLSP.2017.8168106","url":null,"abstract":"In the present paper, we propose a deep network architecture in order to improve the accuracy of pedestrian detection. The proposed method contains a proposal network and a classification network that are trained separately. We use a single shot multibox detector (SSD) as a proposal network to generate the set of pedestrian proposals. The proposal network is fine-tuned from a pre-trained network by several pedestrian data sets of large input size (512 × 512 pixels) in order to improve detection accuracy of small pedestrians. Then, we use a classification network to classify pedestrian proposals. We then combine the scores from the proposal network and the classification network to obtain better final detection scores. Experiments were evaluated using the Caltech test set, and, compared to other state-of-the-art methods of pedestrian detection task, the proposed method obtains better results for small pedestrians (30 to 50 pixels in height) with an average miss rate of 42%.","PeriodicalId":6542,"journal":{"name":"2017 IEEE 27th International Workshop on Machine Learning for Signal Processing (MLSP)","volume":"44 1","pages":"1-6"},"PeriodicalIF":0.0,"publicationDate":"2017-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"85674323","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Joint learning of deep multi-scale features and diversified metrics for hyperspectral image classification 基于深度多尺度特征和多样化指标的高光谱图像分类联合学习
Z. Gong, P. Zhong, Yang Yu, Jiaxin Shan, W. Hu
Due to the high spectral resolution and the similarity of some spectrums between different classes, hyperspectral image classification turns out to be an important but challenging task. Researches show the powerful ability of deep learning for hyperspectral image classification. However, the lack of training samples makes it difficult to extract discriminative features and achieve performance as expected. To solve the problem, a multi-scale CNN which can extract multi-scale features is designed for hyperspectral image classification. Furthermore, D-DSML, a diversified metric, is proposed to further improve the representational ability of deep methods. In this paper, a D-DSML-MSCNN method, which jointly learns deep multi-scale features and diversified metrics for hyperspectral image classification, is proposed to take both advantages of D-DSML and MSCNN. Experiments are conducted on Pavia University data to show the effectiveness of our method for hyperspectral image classification. The results show the advantage of our method when compared with other recent results.
由于高光谱图像具有较高的光谱分辨率,且不同类别之间的某些光谱具有相似性,因此高光谱图像分类是一项重要而又具有挑战性的任务。研究表明,深度学习在高光谱图像分类中具有强大的能力。然而,缺乏训练样本使得难以提取判别特征并达到预期的性能。为解决这一问题,设计了一种可提取多尺度特征的多尺度CNN用于高光谱图像分类。此外,为了进一步提高深度方法的表征能力,提出了一种多样化的度量D-DSML。本文结合D-DSML和MSCNN的优点,提出了一种D-DSML-MSCNN方法,该方法联合学习深度多尺度特征和多样化度量用于高光谱图像分类。在帕维亚大学的数据上进行了实验,验证了该方法对高光谱图像分类的有效性。结果表明,与其他最近的研究结果相比,我们的方法具有优势。
{"title":"Joint learning of deep multi-scale features and diversified metrics for hyperspectral image classification","authors":"Z. Gong, P. Zhong, Yang Yu, Jiaxin Shan, W. Hu","doi":"10.1109/MLSP.2017.8168157","DOIUrl":"https://doi.org/10.1109/MLSP.2017.8168157","url":null,"abstract":"Due to the high spectral resolution and the similarity of some spectrums between different classes, hyperspectral image classification turns out to be an important but challenging task. Researches show the powerful ability of deep learning for hyperspectral image classification. However, the lack of training samples makes it difficult to extract discriminative features and achieve performance as expected. To solve the problem, a multi-scale CNN which can extract multi-scale features is designed for hyperspectral image classification. Furthermore, D-DSML, a diversified metric, is proposed to further improve the representational ability of deep methods. In this paper, a D-DSML-MSCNN method, which jointly learns deep multi-scale features and diversified metrics for hyperspectral image classification, is proposed to take both advantages of D-DSML and MSCNN. Experiments are conducted on Pavia University data to show the effectiveness of our method for hyperspectral image classification. The results show the advantage of our method when compared with other recent results.","PeriodicalId":6542,"journal":{"name":"2017 IEEE 27th International Workshop on Machine Learning for Signal Processing (MLSP)","volume":"67 1","pages":"1-6"},"PeriodicalIF":0.0,"publicationDate":"2017-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"80954721","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Online function minimization with convex random relu expansions 凸随机relu展开的在线函数最小化
Laurens Bliek, M. Verhaegen, S. Wahls
We propose CDONE, a convex version of the DONE algorithm. DONE is a derivative-free online optimization algorithm that uses surrogate modeling with noisy measurements to find a minimum of objective functions that are expensive to evaluate. Inspired by their success in deep learning, CDONE makes use of rectified linear units, together with a nonnegativity constraint to enforce convexity of the surrogate model. This leads to a sparse and cheap to evaluate surrogate model of the unknown optimization objective that is still accurate and that can be minimized with convex optimization algorithms. The CDONE algorithm is demonstrated on a toy example and on the problem of hyper-parameter optimization for a deep learning example on handwritten digit classification.
我们提出了CDONE, DONE算法的凸版本。DONE是一种无导数的在线优化算法,它使用带有噪声测量的代理建模来找到评估代价昂贵的目标函数的最小值。受他们在深度学习方面的成功启发,CDONE利用整流线性单元,以及非负性约束来增强代理模型的凸性。这导致了一个稀疏和廉价的评估未知优化目标的代理模型,它仍然是准确的,并且可以用凸优化算法最小化。通过一个简单的例子和一个手写体数字分类的深度学习实例,对CDONE算法进行了验证。
{"title":"Online function minimization with convex random relu expansions","authors":"Laurens Bliek, M. Verhaegen, S. Wahls","doi":"10.1109/MLSP.2017.8168109","DOIUrl":"https://doi.org/10.1109/MLSP.2017.8168109","url":null,"abstract":"We propose CDONE, a convex version of the DONE algorithm. DONE is a derivative-free online optimization algorithm that uses surrogate modeling with noisy measurements to find a minimum of objective functions that are expensive to evaluate. Inspired by their success in deep learning, CDONE makes use of rectified linear units, together with a nonnegativity constraint to enforce convexity of the surrogate model. This leads to a sparse and cheap to evaluate surrogate model of the unknown optimization objective that is still accurate and that can be minimized with convex optimization algorithms. The CDONE algorithm is demonstrated on a toy example and on the problem of hyper-parameter optimization for a deep learning example on handwritten digit classification.","PeriodicalId":6542,"journal":{"name":"2017 IEEE 27th International Workshop on Machine Learning for Signal Processing (MLSP)","volume":"17 1","pages":"1-6"},"PeriodicalIF":0.0,"publicationDate":"2017-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"90240796","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Rényi entropy based mutual information for semi-supervised bird vocalization segmentation 基于rsamnyi熵的互信息半监督鸟类发声分割
Anshul Thakur, V. Abrol, Pulkit Sharma, Padmanabhan Rajan
In this paper we describe a semi-supervised algorithm to segment bird vocalizations using matrix factorization and Rényi entropy based mutual information. Singular value decomposition (SVD) is applied on pooled time-frequency representations of bird vocalizations to learn basis vectors. By utilizing only a few of the bases, a compact feature representation is obtained for input test data. Rényi entropy based mutual information is calculated between feature representations of consecutive frames. After some simple post-processing, a threshold is used to reliably distinguish bird vocalizations from other sounds. The algorithm is evaluated on the field recordings of different bird species and different SNR conditions. The results highlight the effectiveness of the proposed method in all SNR conditions, improvements over other methods, and its generality.
本文描述了一种基于矩阵分解和基于r尼米熵互信息的半监督算法来分割鸟类发声。将奇异值分解(SVD)应用于鸟叫声的时频混合表示,学习基向量。仅利用其中的几个基,就得到了输入测试数据的一个紧凑的特征表示。在连续帧的特征表示之间计算基于r熵的互信息。经过一些简单的后处理后,使用阈值来可靠地区分鸟类的叫声和其他声音。在不同鸟类的野外记录和不同信噪比条件下,对该算法进行了评价。结果突出了所提出方法在所有信噪比条件下的有效性,与其他方法相比的改进,以及其通用性。
{"title":"Rényi entropy based mutual information for semi-supervised bird vocalization segmentation","authors":"Anshul Thakur, V. Abrol, Pulkit Sharma, Padmanabhan Rajan","doi":"10.1109/MLSP.2017.8168118","DOIUrl":"https://doi.org/10.1109/MLSP.2017.8168118","url":null,"abstract":"In this paper we describe a semi-supervised algorithm to segment bird vocalizations using matrix factorization and Rényi entropy based mutual information. Singular value decomposition (SVD) is applied on pooled time-frequency representations of bird vocalizations to learn basis vectors. By utilizing only a few of the bases, a compact feature representation is obtained for input test data. Rényi entropy based mutual information is calculated between feature representations of consecutive frames. After some simple post-processing, a threshold is used to reliably distinguish bird vocalizations from other sounds. The algorithm is evaluated on the field recordings of different bird species and different SNR conditions. The results highlight the effectiveness of the proposed method in all SNR conditions, improvements over other methods, and its generality.","PeriodicalId":6542,"journal":{"name":"2017 IEEE 27th International Workshop on Machine Learning for Signal Processing (MLSP)","volume":"101 ","pages":"1-6"},"PeriodicalIF":0.0,"publicationDate":"2017-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"91553228","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 11
期刊
2017 IEEE 27th International Workshop on Machine Learning for Signal Processing (MLSP)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1