2019 18th IEEE International Conference On Machine Learning And Applications (ICMLA)最新文献

英文中文

Weakly Supervised Deep Learning for Detecting and Counting Dead Cells in Microscopy Images 弱监督深度学习用于显微镜图像中死亡细胞的检测和计数

2019 18th IEEE International Conference On Machine Learning And Applications (ICMLA)

Pub Date : 2019-12-01 DOI: 10.1109/ICMLA.2019.00282

Siteng Chen, Ao Li, Kathleen Lasick, Julie M. Huynh, Linda S. Powers, Janet Roveda, A. Paek

Counting dead cells is a key step in evaluating the performance of chemotherapy treatment and drug screening. Deep convolutional neural networks (CNNs) can learn complex visual features, but require massive ground truth annotations which is expensive in biomedical experiments. Counting cells, especially dead cells, with very few ground truth annotations remains unexplored. In this paper, we automate dead cell counting using a weakly supervised strategy. We took advantage of the fact that cell death is low before chemotherapy treatment and increases after treatment. Motivated by the contrast, we first design image level supervised only classification neural networks to detect dead cells. Based on the class response map in classification networks, we calculate a Dead Confidence Map (DCM) to specify confidence of each dead cell. Associated with peak clustering, local maximums in the DCM are used to count the number of dead cells. In addition, a biological experiment based weakly supervised data preparation strategy is proposed to minimize human intervention. We show classification performance compared to general purpose and cell classification networks, and report results for the image-level supervised counting task.

死亡细胞计数是评估化疗效果和药物筛选的关键步骤。深度卷积神经网络(cnn)可以学习复杂的视觉特征，但需要大量的基础真值注释，这在生物医学实验中是昂贵的。计数细胞，特别是死亡细胞，很少的基础真理注释仍未被探索。在本文中，我们使用弱监督策略自动计数死细胞。我们利用了化疗前细胞死亡率低而化疗后细胞死亡率增加的事实。基于这种对比，我们首先设计了图像级监督分类神经网络来检测死细胞。基于分类网络中的类响应图，我们计算了一个Dead Confidence map (DCM)来指定每个Dead cell的置信度。与峰值聚类相关联，DCM中的局部最大值用于计算死细胞的数量。此外，提出了一种基于生物实验的弱监督数据准备策略，以减少人为干预。我们展示了与通用和细胞分类网络相比的分类性能，并报告了图像级监督计数任务的结果。

{"title":"Weakly Supervised Deep Learning for Detecting and Counting Dead Cells in Microscopy Images","authors":"Siteng Chen, Ao Li, Kathleen Lasick, Julie M. Huynh, Linda S. Powers, Janet Roveda, A. Paek","doi":"10.1109/ICMLA.2019.00282","DOIUrl":"https://doi.org/10.1109/ICMLA.2019.00282","url":null,"abstract":"Counting dead cells is a key step in evaluating the performance of chemotherapy treatment and drug screening. Deep convolutional neural networks (CNNs) can learn complex visual features, but require massive ground truth annotations which is expensive in biomedical experiments. Counting cells, especially dead cells, with very few ground truth annotations remains unexplored. In this paper, we automate dead cell counting using a weakly supervised strategy. We took advantage of the fact that cell death is low before chemotherapy treatment and increases after treatment. Motivated by the contrast, we first design image level supervised only classification neural networks to detect dead cells. Based on the class response map in classification networks, we calculate a Dead Confidence Map (DCM) to specify confidence of each dead cell. Associated with peak clustering, local maximums in the DCM are used to count the number of dead cells. In addition, a biological experiment based weakly supervised data preparation strategy is proposed to minimize human intervention. We show classification performance compared to general purpose and cell classification networks, and report results for the image-level supervised counting task.","PeriodicalId":436714,"journal":{"name":"2019 18th IEEE International Conference On Machine Learning And Applications (ICMLA)","volume":"106 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114998904","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 2

Fusing Visual and Textual Information to Determine Content Safety 融合视觉和文本信息以确定内容安全性

2019 18th IEEE International Conference On Machine Learning And Applications (ICMLA)

Pub Date : 2019-12-01 DOI: 10.1109/ICMLA.2019.00324

Rodrigo Leonardo, Amber Hu, M. Uzair, Qiujing Lu, Iris Fu, Keishin Nishiyama, Sooraj Mangalath Subrahmannian, D. Ravichandran

In advertising, identifying the content safety of web pages is a significant concern since advertisers do not want brands to be associated with threatening content. At the same time, publishers would like to maximize the number of web pages on which they can place ads. Thus, a fine balance must be achieved while classifying content safety in order to satisfy both advertisers and publishers. In this paper, we propose a multimodal machine learning framework that fuses visual and textual information from web pages to improve current predictions of content safety. The primary focus is on late fusion, which involves combining final model outputs of separate modalities, such as images and text, to arrive at a single decision. This paper presents a fully automated machine learning framework that performs binary and multilabel classification using late fusion techniques. We also introduce additional work in early fusion, which involves extracting and fusing intermediate features from the two separate models. Our algorithms are applied to data extracted from relevant web pages in the advertising industry. Both of our late and early fusion methods obtain significant improvements over algorithms currently in use.

在广告中，识别网页内容的安全性是一个重要的问题，因为广告商不希望品牌与威胁内容联系在一起。与此同时，发布商希望最大限度地增加他们可以投放广告的网页数量。因此，在对内容安全进行分类的同时，必须实现一个微妙的平衡，以满足广告商和发布商的要求。在本文中，我们提出了一个多模态机器学习框架，该框架融合了来自网页的视觉和文本信息，以改进当前对内容安全的预测。主要的焦点是后期融合，这涉及到将独立模式(如图像和文本)的最终模型输出结合起来，以得出一个单一的决策。本文提出了一个全自动机器学习框架，该框架使用后期融合技术执行二进制和多标签分类。我们还介绍了早期融合的额外工作，包括从两个独立的模型中提取和融合中间特征。我们的算法应用于从广告行业的相关网页中提取的数据。我们的晚期和早期融合方法都比目前使用的算法有了显著的改进。

{"title":"Fusing Visual and Textual Information to Determine Content Safety","authors":"Rodrigo Leonardo, Amber Hu, M. Uzair, Qiujing Lu, Iris Fu, Keishin Nishiyama, Sooraj Mangalath Subrahmannian, D. Ravichandran","doi":"10.1109/ICMLA.2019.00324","DOIUrl":"https://doi.org/10.1109/ICMLA.2019.00324","url":null,"abstract":"In advertising, identifying the content safety of web pages is a significant concern since advertisers do not want brands to be associated with threatening content. At the same time, publishers would like to maximize the number of web pages on which they can place ads. Thus, a fine balance must be achieved while classifying content safety in order to satisfy both advertisers and publishers. In this paper, we propose a multimodal machine learning framework that fuses visual and textual information from web pages to improve current predictions of content safety. The primary focus is on late fusion, which involves combining final model outputs of separate modalities, such as images and text, to arrive at a single decision. This paper presents a fully automated machine learning framework that performs binary and multilabel classification using late fusion techniques. We also introduce additional work in early fusion, which involves extracting and fusing intermediate features from the two separate models. Our algorithms are applied to data extracted from relevant web pages in the advertising industry. Both of our late and early fusion methods obtain significant improvements over algorithms currently in use.","PeriodicalId":436714,"journal":{"name":"2019 18th IEEE International Conference On Machine Learning And Applications (ICMLA)","volume":"410 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115231710","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 2

Multiscale Geometric Data Analysis via Laplacian Eigenvector Cascading 基于拉普拉斯特征向量级联的多尺度几何数据分析

2019 18th IEEE International Conference On Machine Learning And Applications (ICMLA)

Pub Date : 2019-12-01 DOI: 10.1109/ICMLA.2019.00183

Joshua L. Mike, Jose A. Perea

We develop here an algorithmic framework for constructing consistent multiscale Laplacian eigenfunctions (vectors) on data. Consequently, we address the unsupervised machine learning task of finding scalar functions capturing consistent structure across scales in data, in a way that encodes intrinsic geometric and topological features. This is accomplished by two algorithms for eigenvector cascading. We show via examples that cascading accelerates the computation of graph Laplacian eigenvectors, and more importantly, that one obtains consistent bases of the associated eigenspaces across scales. Finally, we present an application to TDA mapper, showing that our multiscale Laplacian eigenvectors identify stable flair-like structures in mapper graphs of varying granularity.

我们开发了一个算法框架，用于在数据上构造一致的多尺度拉普拉斯特征函数(向量)。因此，我们解决了无监督机器学习任务，即以编码内在几何和拓扑特征的方式，在数据中寻找捕获跨尺度一致结构的标量函数。这是通过两种特征向量级联算法来实现的。我们通过实例表明，级联加速了图拉普拉斯特征向量的计算，更重要的是，人们获得了跨尺度的相关特征空间的一致基。最后，我们给出了一个在TDA映射器上的应用，证明了我们的多尺度拉普拉斯特征向量在不同粒度的映射图中识别出稳定的类形结构。

引用次数: 1

Efficient Evolutionary Architecture Search for CNN Optimization on GTSRB 基于GTSRB的CNN优化的高效进化架构搜索

2019 18th IEEE International Conference On Machine Learning And Applications (ICMLA)

Pub Date : 2019-12-01 DOI: 10.1109/ICMLA.2019.00018

Fabio Marco Johner, J. Wassner

Neural network inference on embedded devices has to meet accuracy and latency requirements under tight resource constraints. The design of suitable network architectures is a challenging and time-consuming task. Therefore, automatic discovery and optimization of neural networks is considered important for continuing the trend of moving classification tasks from cloud to edge computing. This paper presents an evolutionary method to optimize a convolutional neural network (CNN) architecture for classification tasks. The method runs efficiently on a single GPU-workstation and provides simple means to direct the tradeoff between complexity and accuracy of the evolved network. Using this method, we achieved a 11x reduction in the number of multiply-accumulate (MAC) operations of the winning network for the German Traffic Sign Recognition Benchmark (GTSRB) without accuracy reduction. An ensemble of four of our evolved networks competes the winning ensemble with a 0.1% lower accuracy but 70x reduction in MACs and 14x reduction in parameters.

嵌入式设备上的神经网络推理必须在有限的资源约束下满足精度和延迟要求。设计合适的网络体系结构是一项具有挑战性且耗时的任务。因此，神经网络的自动发现和优化对于继续将分类任务从云计算转移到边缘计算的趋势非常重要。本文提出了一种优化卷积神经网络(CNN)分类结构的进化方法。该方法在单个gpu工作站上有效地运行，并提供了一种简单的方法来指导进化网络的复杂性和准确性之间的权衡。使用该方法，我们在不降低精度的情况下，将德国交通标志识别基准(GTSRB)的获胜网络的乘法累积(MAC)操作次数减少了11倍。我们进化的四个网络的集成与获胜的集成竞争，精度降低了0.1%，但mac减少了70倍，参数减少了14倍。

引用次数: 9

Fuzzy-Rough Cognitive Networks: Building Blocks and Their Contribution to Performance 模糊-粗糙认知网络:构建模块及其对性能的贡献

2019 18th IEEE International Conference On Machine Learning And Applications (ICMLA)

Pub Date : 2019-12-01 DOI: 10.1109/ICMLA.2019.00159

M. Vanloffelt, G. Nápoles, K. Vanhoof

Pattern classification is a popular research field within the Machine Learning discipline. Black-box models have proven to be potent classifiers in this particular field. However, their inability to provide a transparent decision mechanism is often regarded as an undesirable feature. Fuzzy-Rough Cognitive Networks are granular classifiers that have proven competitive and effective in such tasks. In this paper, we examine the contribution of the FRCN's main building blocks, being the causal weight matrix and the activation values of the neurons, to the model's average performance. Noise injection is employed to this end. Our findings suggest that optimising the weight matrix might not be as beneficial to the model's performance as suggested in previous research. Furthermore, we found that a powerful activation of the neurons included in the model topology is crucial to performance, as expected. Further research should as such focus on finding more powerful ways to activate these neurons, rather than focus on optimising the causal weight matrix.

模式分类是机器学习学科中的一个热门研究领域。在这个特定领域，黑盒模型已经被证明是有效的分类器。然而，它们不能提供透明的决策机制通常被认为是一个不受欢迎的特性。模糊-粗糙认知网络是颗粒分类器，在这类任务中已被证明具有竞争力和有效性。在本文中，我们研究了FRCN的主要构建块(因果权重矩阵和神经元的激活值)对模型平均性能的贡献。为此，采用了噪声注入。我们的研究结果表明，优化权重矩阵可能不会像以前的研究中建议的那样有利于模型的性能。此外，我们发现，正如预期的那样，模型拓扑中包含的神经元的强大激活对性能至关重要。因此，进一步的研究应该专注于寻找更有效的方法来激活这些神经元，而不是专注于优化因果权重矩阵。

引用次数: 2

An Analysis of Univariate and Multivariate Electrocardiography Signal Classification 单因素和多因素心电图信号分类分析

2019 18th IEEE International Conference On Machine Learning And Applications (ICMLA)

Pub Date : 2019-12-01 DOI: 10.1109/ICMLA.2019.00074

Nelly Elsayed, A. Maida, M. Bayoumi

Heart diseases are mainly diagnosed by the electrocardiogram (ECG) or (EKG). The correct classification of ECG signals helps in diagnosing heart diseases. In this paper, we study and analyze the univariate and multivariate ECG signal classification problems to find the optimal classifier for ECG signals from existing state-of-the-art time series classification models.

心脏病的诊断主要依靠心电图(ECG)或心电图(EKG)。心电信号的正确分类有助于心脏病的诊断。在本文中，我们研究和分析了单变量和多变量心电信号的分类问题，从现有的最先进的时间序列分类模型中找到最优的心电信号分类器。

引用次数: 5

MobileNet-Tiny: A Deep Neural Network-Based Real-Time Object Detection for Rasberry Pi MobileNet-Tiny:基于深度神经网络的Rasberry Pi实时目标检测

2019 18th IEEE International Conference On Machine Learning And Applications (ICMLA)

Pub Date : 2019-12-01 DOI: 10.1109/ICMLA.2019.00118

Nithesh Singh Sanjay, A. Ahmadinia

In this paper, we present a new neural network architecture, MobileNet-Tiny that can be used to harness the power of GPU based real-time object detection in raspberry-pi and also in devices with the absence of a GPU and limited graphic processing capabilities such as mobile phones, laptops, etc. MobileNet-Tiny trained on COCO dataset running on a non-Gpu laptop dell XPS 13, achieves an accuracy of 19.0 mAP and a speed of 19.4 FPS which is 3 times as fast as MobileNetV2, and when running on a raspberry pi, it achieves a speed of 4.5 FPS which is up to 7 times faster than MobileNetV2. MobileNet-Tiny was modeled to offer a compact, quick, and well-balanced object detection solution to a variety of GPU restricted devices.

在本文中，我们提出了一种新的神经网络架构，MobileNet-Tiny，它可以用于利用基于GPU的树莓派实时目标检测的能力，也可以用于没有GPU和图形处理能力有限的设备，如手机，笔记本电脑等。在COCO数据集上训练的MobileNet-Tiny在非gpu笔记本电脑戴尔XPS 13上运行，达到了19.0 mAP的精度和19.4 FPS的速度，是MobileNetV2的3倍，在树莓派上运行时，它达到了4.5 FPS的速度，比MobileNetV2快了7倍。MobileNet-Tiny旨在为各种受GPU限制的设备提供紧凑、快速、平衡良好的目标检测解决方案。

引用次数: 6

Radar-Based Non-intrusive Fall Motion Recognition using Deformable Convolutional Neural Network 基于雷达的可变形卷积神经网络非侵入式跌倒运动识别

2019 18th IEEE International Conference On Machine Learning And Applications (ICMLA)

Pub Date : 2019-12-01 DOI: 10.1109/ICMLA.2019.00279

Y. Shankar, Souvik Hazra, Avik Santra

Radar is an attractive sensing technology for remote and non-intrusive human health monitoring and elderly fall detection due to its ability to work in low lighting conditions, its invariance to the environment, and its ability to operate through obstacles. Radar reflections from humans produce unique micro-Doppler signatures that can be used for classifying human activities and fall motion. However, radar-based elderly fall detection need to handle the indistinctive inter-class differences and large intra-class variations of human fall-motion in a real-world situation. Further, the radar placement in the room and varying aspect angle of the falling subject could result in differing radar micro-Doppler signature of human fall-motion. In this paper, we use a compact short-range 60-GHz frequency modulated continuous wave radar for detecting human fall motion using a novel deformable deep convolutional neural network with novel 1-class contrastive loss function in conjunction to focus loss to recognize elderly fall and address several of these signal processing system challenges. We demonstrate the performance of our proposed system in laboratory conditions under staged fall motion.

雷达是一种有吸引力的遥感技术，用于远程和非侵入式人体健康监测和老年人跌倒检测，因为它能够在低光照条件下工作，对环境具有不变性，并且能够穿越障碍物。来自人类的雷达反射产生独特的微多普勒特征，可用于对人类活动和下落运动进行分类。然而，基于雷达的老年人跌倒检测需要处理现实世界中人类跌倒运动的类间差异和类内变化。此外，雷达在房间中的放置位置和落体物体的不同角度可能导致人体落体运动的不同雷达微多普勒特征。在本文中，我们使用紧凑型近距离60 ghz调频连续波雷达检测人体跌倒运动，使用新颖的可变形深度卷积神经网络与新颖的1级对比损失函数结合聚焦损失来识别老年人跌倒，并解决了这些信号处理系统的几个挑战。我们在实验室条件下演示了我们提出的系统在分阶段下落运动下的性能。

{"title":"Radar-Based Non-intrusive Fall Motion Recognition using Deformable Convolutional Neural Network","authors":"Y. Shankar, Souvik Hazra, Avik Santra","doi":"10.1109/ICMLA.2019.00279","DOIUrl":"https://doi.org/10.1109/ICMLA.2019.00279","url":null,"abstract":"Radar is an attractive sensing technology for remote and non-intrusive human health monitoring and elderly fall detection due to its ability to work in low lighting conditions, its invariance to the environment, and its ability to operate through obstacles. Radar reflections from humans produce unique micro-Doppler signatures that can be used for classifying human activities and fall motion. However, radar-based elderly fall detection need to handle the indistinctive inter-class differences and large intra-class variations of human fall-motion in a real-world situation. Further, the radar placement in the room and varying aspect angle of the falling subject could result in differing radar micro-Doppler signature of human fall-motion. In this paper, we use a compact short-range 60-GHz frequency modulated continuous wave radar for detecting human fall motion using a novel deformable deep convolutional neural network with novel 1-class contrastive loss function in conjunction to focus loss to recognize elderly fall and address several of these signal processing system challenges. We demonstrate the performance of our proposed system in laboratory conditions under staged fall motion.","PeriodicalId":436714,"journal":{"name":"2019 18th IEEE International Conference On Machine Learning And Applications (ICMLA)","volume":"12 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125894594","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 10

A Fast and Light Weight Deep Convolution Neural Network Model for Cancer Disease Identification in Human Lung(s) 一种用于人体肺部肿瘤疾病识别的快速轻量级深度卷积神经网络模型

2019 18th IEEE International Conference On Machine Learning And Applications (ICMLA)

Pub Date : 2019-12-01 DOI: 10.1109/ICMLA.2019.00225

Siva Skandha Sanagala, S. Gupta, V. K. Koppula, M. Agarwal

In the proposed work, a convolution neural network (CNN) based model has been used to identify the cancer disease in human lung(s). Moreover, this approach identifies the single or multi-module in lungs by analyzing the Computer Tomography (CT) scan. For the purpose of the experiment, publicly available dataset named as Early Lung Cancer Action Program (ELCAP) has been used. Moreover, the performance of proposed CNN model has been compared with traditional machine learning approaches i.e. support vector machine, k-NN, Decision Tree, Random Forest, etc under various parameters i.e. accuracy, precision, recall, Cohen Kappa. The performance of proposed model is also compared with famous CNN models i.e. VGG16, Inception V3 in terms of accuracy, storage space and inference time. The experimental results show the efficacy of proposed algorithms over traditional machine learning and pre-trained models by achieving the accuracy of 99.5%

在提出的工作中，基于卷积神经网络(CNN)的模型已被用于识别人类肺部的癌症疾病。此外，该方法通过分析计算机断层扫描(CT)来识别肺部的单个或多个模块。为了实验的目的，使用了名为早期肺癌行动计划(ELCAP)的公开数据集。此外，在准确率、精度、召回率、科恩卡帕等参数下，将本文提出的CNN模型与传统的机器学习方法(支持向量机、k-NN、决策树、随机森林等)的性能进行了比较。并与著名的CNN模型VGG16、盗梦空间V3在准确率、存储空间、推理时间等方面进行了比较。实验结果表明，与传统机器学习和预训练模型相比，所提算法的准确率达到99.5%

引用次数: 8

Encoding in Neural Networks 神经网络编码

2019 18th IEEE International Conference On Machine Learning And Applications (ICMLA)

Pub Date : 2019-12-01 DOI: 10.1109/ICMLA.2019.00065

S. Bharitkar

Data transforms, parameter re-normalization, and activation functions have gained significant attention in the neural network community over the past several years for improving convergence speed. The results in the literature are for computer vision applications, with batch-normalization (BN) and the Rectified Linear Unit (ReLU) activation attracting attention. In this paper, we present a new approach in data transformation in the context of regression during the synthesis of Head-related Transfer Functions (HRTFs) in the field of audio. The encoding technique whitens the real-valued input data delivered to the first hidden layer of a fully-connected neural network (FCNN) thereby providing the training speedup. The experimental results demonstrate, in a statistically significant way, that the presented data encoding approach outperforms other forms of normalization in terms of convergence speed, lower mean-square error, and robustness to network parameter initialization. Towards this, we used some popular first-and second-order gradient techniques such as scaled conjugate gradient, Extreme Learning Machine (ELM), and stochastic gradient descent with momentum and batch normalization. The improvements, as shown through t-SNE based depiction and analysis on the input covariance matrix, confirm the reduction in the condition number of the input covariance matrix (a process similar to whitening).

数据变换、参数再归一化和激活函数在过去几年中在神经网络社区中获得了显著的关注，以提高收敛速度。文献中的结果是用于计算机视觉应用的，批量归一化(BN)和整流线性单元(ReLU)激活引起了人们的注意。本文提出了一种基于回归的音频领域头部相关传递函数(hrtf)合成过程中数据转换的新方法。编码技术将实值输入数据白化到全连接神经网络(FCNN)的第一隐层，从而提供训练加速。实验结果以统计显著的方式表明，所提出的数据编码方法在收敛速度、更低的均方误差和对网络参数初始化的鲁棒性方面优于其他形式的归一化。为此，我们使用了一些流行的一阶和二阶梯度技术，如缩放共轭梯度、极限学习机(ELM)和带动量和批归一化的随机梯度下降。通过对输入协方差矩阵的基于t-SNE的描述和分析所显示的改进，证实了输入协方差矩阵条件个数的减少(类似于白化过程)。

{"title":"Encoding in Neural Networks","authors":"S. Bharitkar","doi":"10.1109/ICMLA.2019.00065","DOIUrl":"https://doi.org/10.1109/ICMLA.2019.00065","url":null,"abstract":"Data transforms, parameter re-normalization, and activation functions have gained significant attention in the neural network community over the past several years for improving convergence speed. The results in the literature are for computer vision applications, with batch-normalization (BN) and the Rectified Linear Unit (ReLU) activation attracting attention. In this paper, we present a new approach in data transformation in the context of regression during the synthesis of Head-related Transfer Functions (HRTFs) in the field of audio. The encoding technique whitens the real-valued input data delivered to the first hidden layer of a fully-connected neural network (FCNN) thereby providing the training speedup. The experimental results demonstrate, in a statistically significant way, that the presented data encoding approach outperforms other forms of normalization in terms of convergence speed, lower mean-square error, and robustness to network parameter initialization. Towards this, we used some popular first-and second-order gradient techniques such as scaled conjugate gradient, Extreme Learning Machine (ELM), and stochastic gradient descent with momentum and batch normalization. The improvements, as shown through t-SNE based depiction and analysis on the input covariance matrix, confirm the reduction in the condition number of the input covariance matrix (a process similar to whitening).","PeriodicalId":436714,"journal":{"name":"2019 18th IEEE International Conference On Machine Learning And Applications (ICMLA)","volume":"19 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116124697","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 1

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

2019 18th IEEE International Conference On Machine Learning And Applications (ICMLA)

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀