2018 17th IEEE International Conference on Machine Learning and Applications (ICMLA)最新文献

英文中文

iCASSTLE : Imbalanced Classification Algorithm for Semi Supervised Text Learning 半监督文本学习的不平衡分类算法

2018 17th IEEE International Conference on Machine Learning and Applications (ICMLA)

Pub Date : 2018-12-01 DOI: 10.1109/ICMLA.2018.00165

Debanjan Banerjee, Gyan Prabhat, Riyanka Bhowal

Information in the form of text can be found in abundance in the web today, which can be mined to solve multifarious problems. Customer reviews, for instance, flow in across multiple sources in thousands per day which can be leveraged to obtain several insights. Our goal is to extract cases of a rare event e.g., recall of products, allegations of ethics or, legal concerns or, threats to product-safety, etc. from this enormous amount of data. Manual identification of such cases to be reported is extremely labour-intensive as well as time-sensitive, but failure to do so can have fatal impact on the industry's overall health and dependability; missing out on even a single case may lead to huge penalties in terms of customer experience, product liability and industry reputation. In this paper, we will discuss classification through Positive and Unlabeled data, PU classification, where the only class, for which instances are available, is a rare event. In iCASSTLE, we propose a two-staged approach where Stage I leverages three unique components of text mining to procure representative training data containing instances of both classes in the right proportion, and Stage II uses results from Stage I to run a semi-supervised classification. We applied this to multiple datasets differing in nature of Product Safety as well as nature of imbalance and iCASSTLE is proven to perform better than the state-of-the-art methods for the relevant use-cases.

如今，在网络上可以找到大量的文本形式的信息，这些信息可以通过挖掘来解决各种各样的问题。例如，每天有成千上万的客户评论从多个来源流入，这可以用来获得一些见解。我们的目标是从海量的数据中提取罕见事件的案例，例如产品召回，道德或法律问题的指控，对产品安全的威胁等。手工确定要报告的这类病例极其耗费人力，而且时间敏感，但如果不这样做，可能对该行业的整体健康和可靠性产生致命影响;即使遗漏一个案例，也可能在客户体验、产品责任和行业声誉方面招致巨额罚款。在本文中，我们将讨论通过Positive和Unlabeled数据的分类，PU分类，其中唯一的类，其实例是可用的，是一个罕见的事件。在iCASSTLE中，我们提出了一种两阶段的方法，其中第一阶段利用文本挖掘的三个独特组件来获取包含适当比例的两类实例的代表性训练数据，第二阶段使用第一阶段的结果来运行半监督分类。我们将其应用于产品安全性质不同的多个数据集，以及不平衡的性质，iCASSTLE被证明比相关用例的最先进方法表现得更好。

{"title":"iCASSTLE : Imbalanced Classification Algorithm for Semi Supervised Text Learning","authors":"Debanjan Banerjee, Gyan Prabhat, Riyanka Bhowal","doi":"10.1109/ICMLA.2018.00165","DOIUrl":"https://doi.org/10.1109/ICMLA.2018.00165","url":null,"abstract":"Information in the form of text can be found in abundance in the web today, which can be mined to solve multifarious problems. Customer reviews, for instance, flow in across multiple sources in thousands per day which can be leveraged to obtain several insights. Our goal is to extract cases of a rare event e.g., recall of products, allegations of ethics or, legal concerns or, threats to product-safety, etc. from this enormous amount of data. Manual identification of such cases to be reported is extremely labour-intensive as well as time-sensitive, but failure to do so can have fatal impact on the industry's overall health and dependability; missing out on even a single case may lead to huge penalties in terms of customer experience, product liability and industry reputation. In this paper, we will discuss classification through Positive and Unlabeled data, PU classification, where the only class, for which instances are available, is a rare event. In iCASSTLE, we propose a two-staged approach where Stage I leverages three unique components of text mining to procure representative training data containing instances of both classes in the right proportion, and Stage II uses results from Stage I to run a semi-supervised classification. We applied this to multiple datasets differing in nature of Product Safety as well as nature of imbalance and iCASSTLE is proven to perform better than the state-of-the-art methods for the relevant use-cases.","PeriodicalId":6533,"journal":{"name":"2018 17th IEEE International Conference on Machine Learning and Applications (ICMLA)","volume":"12 1","pages":"1012-1016"},"PeriodicalIF":0.0,"publicationDate":"2018-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"75821059","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 4

SEDAT: Sentiment and Emotion Detection in Arabic Text Using CNN-LSTM Deep Learning SEDAT:使用CNN-LSTM深度学习的阿拉伯语文本的情绪和情感检测

2018 17th IEEE International Conference on Machine Learning and Applications (ICMLA)

Pub Date : 2018-12-01 DOI: 10.1109/ICMLA.2018.00134

Malak Abdullah, M. Hadzikadic, Samira Shaikh

Social media is growing as a communication medium where people can express online their feelings and opinions on a variety of topics in ways they rarely do in person. Detecting sentiments and emotions in text have gained considerable amount of attention in the last few years. The significant role of the Arab region in international politics and in the global economy have led to the investigation of sentiments and emotions in Arabic. This paper describes our system - SEDAT, to detect sentiments and emotions in Arabic tweets. We use word and document embeddings and a set of semantic features and apply CNN-LSTM and a fully connected neural network architectures to obtain performance results that show substantial improvements in Spearman correlation scores over the baseline models.

社交媒体正在成为一种交流媒介，人们可以在网上以很少面对面的方式表达他们对各种话题的感受和观点。在过去的几年里，检测文本中的情绪和情感已经获得了相当多的关注。阿拉伯地区在国际政治和全球经济中的重要作用导致了对阿拉伯语的情绪和情绪的调查。本文描述了我们的系统SEDAT，用于检测阿拉伯语推文中的情绪和情绪。我们使用单词和文档嵌入以及一组语义特征，并应用CNN-LSTM和一个完全连接的神经网络架构来获得性能结果，该结果显示，与基线模型相比，Spearman相关分数有了实质性的提高。

引用次数: 60

An Attention-Based Air Quality Forecasting Method 一种基于注意力的空气质量预测方法

2018 17th IEEE International Conference on Machine Learning and Applications (ICMLA)

Pub Date : 2018-12-01 DOI: 10.1109/ICMLA.2018.00115

Bo Liu, Shuo Yan, Jianqiang Li, Guangzhi Qu, Yong Li, Jianlei Lang, Rentao Gu

Air pollution is threatening human's health since the industrial revolution, but there are not efficient ways to solve air pollution, so forecasting air quality has become an efficient measure to prevent citizens from hurting of heavy air pollution. In this paper, we proposed an advanced Seq2Seq (Sequence to Sequence) model called attention-based air quality forecasting model (ABAFM) whose RNN encoder is replaced by pure attention mechanism with position embedding. This improvement not only reduces the training time of Seq2Seq model with attention but also enhances the robustness of Seq2Seq models. We implemented ABAFM in Olympic center and Dongsi monitoring stations in Beijing to forecast PM2.5 in future 24 hours. The experimental results showed that the proposed model outperformed the related arts, especially in sudden changes.

自工业革命以来，空气污染一直威胁着人类的健康，但没有有效的方法来解决空气污染，因此空气质量预测成为防止市民遭受重污染伤害的有效措施。本文提出了一种改进的Seq2Seq (Sequence to Sequence)模型，即基于注意力的空气质量预测模型(ABAFM)，该模型将RNN编码器替换为具有位置嵌入的纯注意力机制。这种改进不仅减少了Seq2Seq模型的训练时间，而且增强了Seq2Seq模型的鲁棒性。在北京奥运中心和东四监测站实施ABAFM，预测未来24小时PM2.5。实验结果表明，该模型在突发性变化情况下的表现优于相关算法。

引用次数: 5

A Multi-objective Rule Optimizer with an Application to Risk Management 多目标规则优化器及其在风险管理中的应用

2018 17th IEEE International Conference on Machine Learning and Applications (ICMLA)

Pub Date : 2018-12-01 DOI: 10.1109/ICMLA.2018.00018

P. Pulkkinen, Neetesh Tiwari, Akhil Kumar, Christopher Jones

Managing risk is important to any E-commerce merchant. Various machine learning (ML) models combined with a rule set as the decision layer is a common practice to manage the risks. Unlike the ML models that can be automatically refreshed periodically based on new risk patterns, rules are generally static and rely on manual updates. To tackle that, this paper presents a data-driven and automated rule optimization method that generates multiple Pareto-optimal rule sets representing different trade-offs between business objectives. This enables business owners to make informed decisions when choosing between optimized rule sets for changing business needs and risks. Furthermore, manual work in rule management is greatly reduced. For scalability this method leverages Apache Spark and runs either on a single host or in a distributed environment in the cloud. This allows us to perform the optimization in a distributed fashion using millions of transactions, hundreds of variables and hundreds of rules during the training. The proposed method is general but we used it for optimizing real-world E-commerce (Amazon) risk rule sets. It could also be used in other fields such as finance and medicine.

管理风险对任何电子商务商家来说都很重要。将各种机器学习(ML)模型与规则集相结合作为决策层是管理风险的常用实践。与可以根据新的风险模式定期自动刷新的ML模型不同，规则通常是静态的，依赖于手动更新。为了解决这个问题，本文提出了一种数据驱动的自动规则优化方法，该方法生成多个pareto最优规则集，表示业务目标之间的不同权衡。这使得业务所有者在为不断变化的业务需求和风险选择优化的规则集时能够做出明智的决策。此外，大大减少了规则管理中的手工工作。对于可伸缩性，这种方法利用Apache Spark，可以在单个主机上运行，也可以在云中的分布式环境中运行。这允许我们在训练期间使用数百万个事务、数百个变量和数百条规则以分布式方式执行优化。所提出的方法是通用的，但我们将其用于优化现实世界的电子商务(Amazon)风险规则集。它也可以用于其他领域，如金融和医药。

{"title":"A Multi-objective Rule Optimizer with an Application to Risk Management","authors":"P. Pulkkinen, Neetesh Tiwari, Akhil Kumar, Christopher Jones","doi":"10.1109/ICMLA.2018.00018","DOIUrl":"https://doi.org/10.1109/ICMLA.2018.00018","url":null,"abstract":"Managing risk is important to any E-commerce merchant. Various machine learning (ML) models combined with a rule set as the decision layer is a common practice to manage the risks. Unlike the ML models that can be automatically refreshed periodically based on new risk patterns, rules are generally static and rely on manual updates. To tackle that, this paper presents a data-driven and automated rule optimization method that generates multiple Pareto-optimal rule sets representing different trade-offs between business objectives. This enables business owners to make informed decisions when choosing between optimized rule sets for changing business needs and risks. Furthermore, manual work in rule management is greatly reduced. For scalability this method leverages Apache Spark and runs either on a single host or in a distributed environment in the cloud. This allows us to perform the optimization in a distributed fashion using millions of transactions, hundreds of variables and hundreds of rules during the training. The proposed method is general but we used it for optimizing real-world E-commerce (Amazon) risk rule sets. It could also be used in other fields such as finance and medicine.","PeriodicalId":6533,"journal":{"name":"2018 17th IEEE International Conference on Machine Learning and Applications (ICMLA)","volume":"22 1","pages":"66-72"},"PeriodicalIF":0.0,"publicationDate":"2018-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"78914130","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 2

Implementation of a Modified Nesterov's Accelerated Quasi-Newton Method on Tensorflow 一种改进的Nesterov加速拟牛顿方法在Tensorflow上的实现

2018 17th IEEE International Conference on Machine Learning and Applications (ICMLA)

Pub Date : 2018-12-01 DOI: 10.1109/ICMLA.2018.00185

S. Indrapriyadarsini, Shahrzad Mahboubi, H. Ninomiya, H. Asai

Recent studies incorporate Nesterov's accelerated gradient method for the acceleration of gradient based training. The Nesterov's Accelerated Quasi-Newton (NAQ) method has shown to drastically improve the convergence speed compared to the conventional quasi-Newton method. This paper implements NAQ for non-convex optimization on Tensorflow. Two modifications have been proposed to the original NAQ algorithm to ensure global convergence and eliminate linesearch. The performance of the proposed algorithm - mNAQ is evaluated on standard non-convex function approximation benchmark problems and microwave circuit modelling problems. The results show that the improved algorithm converges better and faster compared to first order optimizers such as AdaGrad, RMSProp, Adam, and the second order methods such as the quasi-Newton method.

最近的研究将Nesterov的加速梯度方法用于基于梯度的加速训练。与传统的拟牛顿方法相比，Nesterov的加速拟牛顿(NAQ)方法大大提高了收敛速度。本文在Tensorflow上实现了NAQ算法的非凸优化。对原NAQ算法进行了两处改进，以保证全局收敛并消除线研究。在标准非凸函数逼近基准问题和微波电路建模问题上对该算法的性能进行了评价。结果表明，与AdaGrad、RMSProp、Adam等一阶优化器和拟牛顿法等二阶优化器相比，改进后的算法收敛速度更快、性能更好。

引用次数: 7

Iterative Deep Learning Based Unbiased Stereology with Human-in-the-Loop 基于迭代深度学习的人在环无偏立体学

2018 17th IEEE International Conference on Machine Learning and Applications (ICMLA)

Pub Date : 2018-12-01 DOI: 10.1109/ICMLA.2018.00106

Saeed S. Alahmari, Dmitry Goldgof, L. Hall, P. Dave, H. A. Phoulady, P. Mouton

Lack of enough labeled data is a major problem in building machine learning based models when the manual annotation (labeling) is error-prone, expensive, tedious, and time-consuming. In this paper, we introduce an iterative deep learning based method to improve segmentation and counting of cells based on unbiased stereology applied to regions of interest of extended depth of field (EDF) images. This method uses an existing machine learning algorithm called the adaptive segmentation algorithm (ASA) to generate masks (verified by a user) for EDF images to train deep learning models. Then an iterative deep learning approach is used to feed newly predicted and accepted deep learning masks/images (verified by a user) to the training set of the deep learning model. The error rate in unbiased stereology count of cells on an unseen test set reduced from about 3 % to less than 1 % after 5 iterations of the iterative deep learning based unbiased stereology process.

在构建基于机器学习的模型时，缺乏足够的标记数据是一个主要问题，因为手动标注(标记)容易出错、昂贵、繁琐且耗时。在本文中，我们介绍了一种基于迭代深度学习的方法来改进基于无偏立体学的细胞分割和计数，该方法应用于扩展景深(EDF)图像的感兴趣区域。该方法使用一种称为自适应分割算法(ASA)的现有机器学习算法，为EDF图像生成遮罩(由用户验证)，以训练深度学习模型。然后使用迭代深度学习方法将新预测和接受的深度学习掩码/图像(由用户验证)馈送到深度学习模型的训练集。在基于迭代深度学习的无偏立体学过程的5次迭代后，在未见测试集上无偏立体学细胞计数的错误率从约3%降至不到1%。

引用次数: 13

Detailed Identification of Fingerprints Using Convolutional Neural Networks 基于卷积神经网络的指纹详细识别

2018 17th IEEE International Conference on Machine Learning and Applications (ICMLA)

Pub Date : 2018-12-01 DOI: 10.1109/ICMLA.2018.00187

Y. Shehu, Ariel Ruiz-Garcia, V. Palade, Anne James

Fingerprints, as one of the most widely used biometric modalities, can be used to identify and distinguish between genders. Gender classification is very important in reducing the time when investigating criminal offenders and gender impersonation. In this work, we use deep Convolutional Neural Networks (CNNs) to not only classify fingerprints by gender, but also identify individual hands and fingers. Transfer learning is employed to speed up the training of the CNN. The CNN achieves an accuracy of 75.2%, 93.5%, and 76.72% for the classification of gender, hand, and fingers, respectively. These results obtained using our publicly available Sokoto Coventry Fingerprint Dataset (SOCOFing) serve as benchmark classification results on this dataset.

指纹作为一种应用最广泛的生物识别方式，可以用于性别的识别和区分。性别分类对于减少刑事犯罪和性别假冒案件的侦办时间具有十分重要的意义。在这项工作中，我们使用深度卷积神经网络(cnn)不仅可以根据性别对指纹进行分类，还可以识别单个手和手指。采用迁移学习来加快CNN的训练速度。CNN对性别、手和手指的分类准确率分别达到了75.2%、93.5%和76.72%。这些结果使用我们公开的索科托考文垂指纹数据集(SOCOFing)获得，作为该数据集的基准分类结果。

引用次数: 15

Detecting and Classifying Fetal Brain Abnormalities Using Machine Learning Techniques 利用机器学习技术检测和分类胎儿大脑异常

2018 17th IEEE International Conference on Machine Learning and Applications (ICMLA)

Pub Date : 2018-12-01 DOI: 10.1109/ICMLA.2018.00223

Omneya Attallah, Heba Gadelkarim, M. Sharkas

Detecting and classifying fetal brain abnormalities from magnetic resonance imaging (MRI) is important, as approximately 3 in 1000 women are pregnant with a fetal of abnormal brain. Early detection of fetal brain abnormalities using machine learning techniques can improve the quality of diagnosis and treatment planning. The literature has shown that most of the work made to classify brain abnormalities in very early age is for preterm infants and neonates not fetal. However, research papers that studied fetal brain MRI images have mapped these images with the neonates MRI images to classify an abnormal behavior in newborns not fetal. In this paper, a pipeline process is proposed for fetal brain classification (FBC) which uses machine learning techniques. The main contribution of this paper is the classification of fetal brain abnormalities in early stage, before the fetal is born. The proposed algorithm is capable of detecting and classifying a variety of abnormalities from MRI images with a wide range of fetal gestational age (GA) (from 16 to 39 weeks) using a flexible and simple method with low computational cost. The novel proposed method consists of four phases; segmentation, enhancement, feature extraction and classification. The results have shown that the proposed method has an area under ROC curve (AUC) of 84%, 86%, 80% and 84.5% for, Linear discriminate analysis (LDA), support vector machine (SVM), K-nearest neighbor (KNN), and Ensemble Subspace Discriminates classifiers respectively. This shows that our proposed has successfully classified fetal brain abnormalities with images of different fetal GA. The results are promising. Future work will be done to improve classification results and increase the dataset.

通过磁共振成像(MRI)检测和分类胎儿大脑异常非常重要，因为大约每1000名妇女中就有3名怀孕的胎儿大脑异常。使用机器学习技术早期检测胎儿大脑异常可以提高诊断和治疗计划的质量。文献表明，大多数对早期大脑异常进行分类的工作是针对早产儿和新生儿而不是胎儿。然而，研究胎儿脑MRI图像的研究论文已经将这些图像与新生儿MRI图像进行了映射，以分类新生儿非胎儿的异常行为。本文提出了一种利用机器学习技术进行胎儿脑分类的流水线过程。本文的主要贡献在于胎儿出生前早期胎儿脑异常的分类。该算法采用灵活、简单、计算成本低的方法，能够从大范围胎龄(16 ~ 39周)的MRI图像中检测和分类各种异常。该方法分为四个阶段;分割、增强、特征提取和分类。结果表明，该方法在线性判别分析(LDA)、支持向量机(SVM)、k近邻(KNN)和集合子空间判别分类器上的ROC曲线下面积(AUC)分别为84%、86%、80%和84.5%。这表明我们的方法已经成功地用不同的胎儿GA图像对胎儿脑异常进行了分类。结果是有希望的。未来的工作将改进分类结果和增加数据集。

{"title":"Detecting and Classifying Fetal Brain Abnormalities Using Machine Learning Techniques","authors":"Omneya Attallah, Heba Gadelkarim, M. Sharkas","doi":"10.1109/ICMLA.2018.00223","DOIUrl":"https://doi.org/10.1109/ICMLA.2018.00223","url":null,"abstract":"Detecting and classifying fetal brain abnormalities from magnetic resonance imaging (MRI) is important, as approximately 3 in 1000 women are pregnant with a fetal of abnormal brain. Early detection of fetal brain abnormalities using machine learning techniques can improve the quality of diagnosis and treatment planning. The literature has shown that most of the work made to classify brain abnormalities in very early age is for preterm infants and neonates not fetal. However, research papers that studied fetal brain MRI images have mapped these images with the neonates MRI images to classify an abnormal behavior in newborns not fetal. In this paper, a pipeline process is proposed for fetal brain classification (FBC) which uses machine learning techniques. The main contribution of this paper is the classification of fetal brain abnormalities in early stage, before the fetal is born. The proposed algorithm is capable of detecting and classifying a variety of abnormalities from MRI images with a wide range of fetal gestational age (GA) (from 16 to 39 weeks) using a flexible and simple method with low computational cost. The novel proposed method consists of four phases; segmentation, enhancement, feature extraction and classification. The results have shown that the proposed method has an area under ROC curve (AUC) of 84%, 86%, 80% and 84.5% for, Linear discriminate analysis (LDA), support vector machine (SVM), K-nearest neighbor (KNN), and Ensemble Subspace Discriminates classifiers respectively. This shows that our proposed has successfully classified fetal brain abnormalities with images of different fetal GA. The results are promising. Future work will be done to improve classification results and increase the dataset.","PeriodicalId":6533,"journal":{"name":"2018 17th IEEE International Conference on Machine Learning and Applications (ICMLA)","volume":"1 1","pages":"1371-1376"},"PeriodicalIF":0.0,"publicationDate":"2018-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"73147683","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 14

Adam Induces Implicit Weight Sparsity in Rectifier Neural Networks Adam在整流神经网络中引入隐式权稀疏性

2018 17th IEEE International Conference on Machine Learning and Applications (ICMLA)

Pub Date : 2018-12-01 DOI: 10.1109/ICMLA.2018.00054

A. Yaguchi, Taiji Suzuki, Wataru Asano, Shuhei Nitta, Y. Sakata, A. Tanizawa

In recent years, deep neural networks (DNNs) have been applied to various machine leaning tasks, including image recognition, speech recognition, and machine translation. However, large DNN models are needed to achieve state-of-the-art performance, exceeding the capabilities of edge devices. Model reduction is thus needed for practical use. In this paper, we point out that deep learning automatically induces group sparsity of weights, in which all weights connected to an output channel (node) are zero, when training DNNs under the following three conditions: (1) rectified-linear-unit (ReLU) activations, (2) an L2-regularized objective function, and (3) the Adam optimizer. Next, we analyze this behavior both theoretically and experimentally, and propose a simple model reduction method: eliminate the zero weights after training the DNN. In experiments on MNIST and CIFAR-10 datasets, we demonstrate the sparsity with various training setups. Finally, we show that our method can efficiently reduce the model size and performs well relative to methods that use a sparsity-inducing regularizer.

近年来，深度神经网络(dnn)已被应用于各种机器学习任务，包括图像识别、语音识别和机器翻译。然而，需要大型深度神经网络模型来实现最先进的性能，超过边缘设备的能力。因此，在实际应用中需要模型简化。在本文中，我们指出深度学习在以下三种条件下训练dnn(1)整流线性单元(ReLU)激活，(2)l2正则化目标函数，(3)Adam优化器时，自动诱导权值的组稀疏性，其中连接到输出通道(节点)的所有权值为零。接下来，我们从理论和实验两方面分析了这种行为，并提出了一种简单的模型约简方法:在训练DNN后消除零权值。在MNIST和CIFAR-10数据集的实验中，我们展示了不同训练设置的稀疏性。最后，我们证明了我们的方法可以有效地减小模型大小，并且相对于使用稀疏性诱导正则化器的方法表现良好。

{"title":"Adam Induces Implicit Weight Sparsity in Rectifier Neural Networks","authors":"A. Yaguchi, Taiji Suzuki, Wataru Asano, Shuhei Nitta, Y. Sakata, A. Tanizawa","doi":"10.1109/ICMLA.2018.00054","DOIUrl":"https://doi.org/10.1109/ICMLA.2018.00054","url":null,"abstract":"In recent years, deep neural networks (DNNs) have been applied to various machine leaning tasks, including image recognition, speech recognition, and machine translation. However, large DNN models are needed to achieve state-of-the-art performance, exceeding the capabilities of edge devices. Model reduction is thus needed for practical use. In this paper, we point out that deep learning automatically induces group sparsity of weights, in which all weights connected to an output channel (node) are zero, when training DNNs under the following three conditions: (1) rectified-linear-unit (ReLU) activations, (2) an L2-regularized objective function, and (3) the Adam optimizer. Next, we analyze this behavior both theoretically and experimentally, and propose a simple model reduction method: eliminate the zero weights after training the DNN. In experiments on MNIST and CIFAR-10 datasets, we demonstrate the sparsity with various training setups. Finally, we show that our method can efficiently reduce the model size and performs well relative to methods that use a sparsity-inducing regularizer.","PeriodicalId":6533,"journal":{"name":"2018 17th IEEE International Conference on Machine Learning and Applications (ICMLA)","volume":"37 1","pages":"318-325"},"PeriodicalIF":0.0,"publicationDate":"2018-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"86364857","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 17

The Semantic Shapes of Popular Music Lyrics: Graph-Based Representation, Analysis, and Interpretation of Popular Music Lyrics in Semantic Natural Language Embedding Space 流行音乐歌词的语义形态:语义自然语言嵌入空间中流行音乐歌词的图表示、分析与解释

2018 17th IEEE International Conference on Machine Learning and Applications (ICMLA)

Pub Date : 2018-12-01 DOI: 10.1109/ICMLA.2018.00203

M. Ogihara, Daniel Galarraga, Gang Ren, T. Tavares

Popular music lyrics are usually brief in length yet sophisticated in narrative content, emotional expression, and structural aesthetics. In this paper, we propose a graph-based analysis and interpretation framework for popular music lyrics using the sematic word embedding representation. This framework explores the temporal and structural information in music lyrics, such as word sequential pattern, lyric format pattern, and predominate song forms, to enhance the understanding of the interaction between the semantic and structural properties of music lyrics. Our proposed analysis and interpretation framework provides extensive tools for representing various properties of music lyrics as graph structural elements and then we implemented feature extraction tools for a comprehensive characterization of the lyric graph using graph analysis or complex network methodologies. The empirical studies based on contrasting music genres are then presented to illustrate the usage of the proposed tools and to demonstrate its modeling and analysis capabilities.

流行音乐歌词通常短小精悍，但在叙事内容、情感表达和结构美学上都很复杂。在本文中，我们提出了一个基于图形的流行音乐歌词分析和解释框架，该框架使用语义词嵌入表示。该框架探讨了音乐歌词中的时间和结构信息，如词序模式、歌词格式模式和主要歌曲形式，以增强对音乐歌词语义和结构属性之间相互作用的理解。我们提出的分析和解释框架提供了广泛的工具，用于将音乐歌词的各种属性表示为图形结构元素，然后我们实现了特征提取工具，用于使用图形分析或复杂网络方法对歌词图形进行全面表征。然后提出了基于对比音乐类型的实证研究，以说明所提出的工具的使用，并展示了其建模和分析能力。

{"title":"The Semantic Shapes of Popular Music Lyrics: Graph-Based Representation, Analysis, and Interpretation of Popular Music Lyrics in Semantic Natural Language Embedding Space","authors":"M. Ogihara, Daniel Galarraga, Gang Ren, T. Tavares","doi":"10.1109/ICMLA.2018.00203","DOIUrl":"https://doi.org/10.1109/ICMLA.2018.00203","url":null,"abstract":"Popular music lyrics are usually brief in length yet sophisticated in narrative content, emotional expression, and structural aesthetics. In this paper, we propose a graph-based analysis and interpretation framework for popular music lyrics using the sematic word embedding representation. This framework explores the temporal and structural information in music lyrics, such as word sequential pattern, lyric format pattern, and predominate song forms, to enhance the understanding of the interaction between the semantic and structural properties of music lyrics. Our proposed analysis and interpretation framework provides extensive tools for representing various properties of music lyrics as graph structural elements and then we implemented feature extraction tools for a comprehensive characterization of the lyric graph using graph analysis or complex network methodologies. The empirical studies based on contrasting music genres are then presented to illustrate the usage of the proposed tools and to demonstrate its modeling and analysis capabilities.","PeriodicalId":6533,"journal":{"name":"2018 17th IEEE International Conference on Machine Learning and Applications (ICMLA)","volume":"132 1","pages":"1249-1254"},"PeriodicalIF":0.0,"publicationDate":"2018-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"86605275","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

2018 17th IEEE International Conference on Machine Learning and Applications (ICMLA)

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀