首页 > 最新文献

International Conference on Machine Learning and Computing最新文献

英文 中文
Evolutionary Convolutional Neural Networks Using ABC 基于ABC的进化卷积神经网络
Pub Date : 2019-02-22 DOI: 10.1145/3318299.3318301
Wenbo Zhu, Weichang Yeh, Jianwen Chen, Dafeng Chen, Aiyuan Li, Yangyang Lin
Convolutional neural networks (CNNs) have been used over the past years to solve many different artificial intelligence (AI) problems, providing significant advances in some domains and leading to state-of-the-art results. Nonetheless, the design of CNNs architecture remains to be a meticulous and cumbersome process that requires the participation of specialists in the field. In this work, we have explored the neuro-evolution application to the automatic design of CNN topologies, developing a novel solution based on Artificial Bee Colony (ABC). The MNIST dataset is used to evaluate the proposed method, which is proved being highly competitive with the state-of-the-art.
卷积神经网络(cnn)在过去几年中被用于解决许多不同的人工智能(AI)问题,在某些领域取得了重大进展,并产生了最先进的结果。尽管如此,cnn架构的设计仍然是一个细致而繁琐的过程,需要该领域专家的参与。在这项工作中,我们探索了神经进化在CNN拓扑自动设计中的应用,开发了一种基于人工蜂群(Artificial Bee Colony, ABC)的新解决方案。使用MNIST数据集对所提出的方法进行了评估,证明该方法与最先进的方法具有很强的竞争力。
{"title":"Evolutionary Convolutional Neural Networks Using ABC","authors":"Wenbo Zhu, Weichang Yeh, Jianwen Chen, Dafeng Chen, Aiyuan Li, Yangyang Lin","doi":"10.1145/3318299.3318301","DOIUrl":"https://doi.org/10.1145/3318299.3318301","url":null,"abstract":"Convolutional neural networks (CNNs) have been used over the past years to solve many different artificial intelligence (AI) problems, providing significant advances in some domains and leading to state-of-the-art results. Nonetheless, the design of CNNs architecture remains to be a meticulous and cumbersome process that requires the participation of specialists in the field. In this work, we have explored the neuro-evolution application to the automatic design of CNN topologies, developing a novel solution based on Artificial Bee Colony (ABC). The MNIST dataset is used to evaluate the proposed method, which is proved being highly competitive with the state-of-the-art.","PeriodicalId":164987,"journal":{"name":"International Conference on Machine Learning and Computing","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-02-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129516085","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 22
Research on Optimization of CAPTCHA Recognition Algorithm Based on SVM 基于SVM的CAPTCHA识别算法优化研究
Pub Date : 2019-02-22 DOI: 10.1145/3318299.3318355
Li Wei, Xiang Li, Tingrong Cao, Quan Zhang, LiangQi Zhou, Wenli Wang
Image verification code is the main mode of security verification in current network applications. The identification efficiency of verification code is a difficult problem that affects network data crawling, which has realistic research significance and value. This paper proposed an SVM (Support Vector Machine)-based recognition method. On the basis of fully analyzing the existing SVM recognition mechanism process, the image is first binarized and filtered for character image preprocessing, then feature extraction, and then classification model is established for recognition. It is proved that this method can achieve fast and accurate results by training and comparing characters in many categories such as adhesion and rotation.
图像验证码是当前网络应用中主要的安全验证方式。验证码的识别效率是影响网络数据爬行的一个难题,具有现实的研究意义和价值。提出了一种基于支持向量机的图像识别方法。在充分分析现有SVM识别机制流程的基础上,首先对图像进行二值化和滤波,进行字符图像预处理,然后进行特征提取,最后建立分类模型进行识别。通过对粘附、旋转等多个类别的特征进行训练和比较,证明了该方法可以获得快速准确的结果。
{"title":"Research on Optimization of CAPTCHA Recognition Algorithm Based on SVM","authors":"Li Wei, Xiang Li, Tingrong Cao, Quan Zhang, LiangQi Zhou, Wenli Wang","doi":"10.1145/3318299.3318355","DOIUrl":"https://doi.org/10.1145/3318299.3318355","url":null,"abstract":"Image verification code is the main mode of security verification in current network applications. The identification efficiency of verification code is a difficult problem that affects network data crawling, which has realistic research significance and value. This paper proposed an SVM (Support Vector Machine)-based recognition method. On the basis of fully analyzing the existing SVM recognition mechanism process, the image is first binarized and filtered for character image preprocessing, then feature extraction, and then classification model is established for recognition. It is proved that this method can achieve fast and accurate results by training and comparing characters in many categories such as adhesion and rotation.","PeriodicalId":164987,"journal":{"name":"International Conference on Machine Learning and Computing","volume":"131 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-02-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124571708","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Evaluating Acoustic Feature Maps in 2D-CNN for Speaker Identification 基于2D-CNN声学特征映射的说话人识别
Pub Date : 2019-02-22 DOI: 10.1145/3318299.3318386
Ali Shariq Imran, Vetle Haflan, Abdolreza Sabzi Shahrebabaki, Negar Olfati, T. Svendsen
This paper presents a study evaluating different acoustic feature map representations in two-dimensional convolutional neural networks (2D-CNN) on the speech dataset for various speech-related activities. Specifically, the task involves identifying useful 2D-CNN input feature maps for enhancing speaker identification with an ultimate goal to improve speaker authentication and enabling voice as a biometric feature. Voice in contrast to fingerprints and image-based biometrics is a natural choice for hands-free communication systems where touch interfaces are inconvenient or dangerous to use. Effective input feature map representation may help CNN exploit intrinsic voice features that not only can address the instability issues of voice as an identifier for textindependent speaker authentication while preserving privacy but can also assist in developing efficacious voice-enabled interfaces. Three different acoustic features with three possible feature map representations are evaluated in this study. Results obtained on three speech corpora shows that an interpolated baseline spectrogram performs best compared to Mel frequency spectral coefficients (MFSC) and Mel frequency cepstral coefficient (MFCC) when tested on a 5-fold cross-validation method using 2D-CNN. On both textdependent and text-independent datasets, raw spectrogram accuracy is 4% better than the traditional acoustic features.
本文提出了一项研究,评估了二维卷积神经网络(2D-CNN)在各种语音相关活动的语音数据集上的不同声学特征映射表示。具体来说,该任务包括识别有用的2D-CNN输入特征映射,以增强说话人识别,最终目标是改善说话人身份验证,并使语音成为一种生物特征。与指纹和基于图像的生物识别技术相比,语音是免提通信系统的自然选择,因为触摸界面使用起来不方便或有危险。有效的输入特征图表示可以帮助CNN利用固有的语音特征,不仅可以解决语音作为文本独立说话人身份验证标识符的不稳定性问题,同时保护隐私,还可以帮助开发有效的语音支持界面。本研究评估了三种不同的声学特征和三种可能的特征映射表示。在3个语音语料库上的结果表明,在2D-CNN的5倍交叉验证方法上,与Mel频谱系数(MFSC)和Mel频率倒谱系数(MFCC)相比,插值后的基线谱图表现最好。在文本依赖和文本独立的数据集上,原始频谱图的精度都比传统声学特征高4%。
{"title":"Evaluating Acoustic Feature Maps in 2D-CNN for Speaker Identification","authors":"Ali Shariq Imran, Vetle Haflan, Abdolreza Sabzi Shahrebabaki, Negar Olfati, T. Svendsen","doi":"10.1145/3318299.3318386","DOIUrl":"https://doi.org/10.1145/3318299.3318386","url":null,"abstract":"This paper presents a study evaluating different acoustic feature map representations in two-dimensional convolutional neural networks (2D-CNN) on the speech dataset for various speech-related activities. Specifically, the task involves identifying useful 2D-CNN input feature maps for enhancing speaker identification with an ultimate goal to improve speaker authentication and enabling voice as a biometric feature. Voice in contrast to fingerprints and image-based biometrics is a natural choice for hands-free communication systems where touch interfaces are inconvenient or dangerous to use. Effective input feature map representation may help CNN exploit intrinsic voice features that not only can address the instability issues of voice as an identifier for textindependent speaker authentication while preserving privacy but can also assist in developing efficacious voice-enabled interfaces. Three different acoustic features with three possible feature map representations are evaluated in this study. Results obtained on three speech corpora shows that an interpolated baseline spectrogram performs best compared to Mel frequency spectral coefficients (MFSC) and Mel frequency cepstral coefficient (MFCC) when tested on a 5-fold cross-validation method using 2D-CNN. On both textdependent and text-independent datasets, raw spectrogram accuracy is 4% better than the traditional acoustic features.","PeriodicalId":164987,"journal":{"name":"International Conference on Machine Learning and Computing","volume":"130 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-02-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115892784","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Deep Attention Network for Chinese Word Segment 汉语分词的深度注意网络
Pub Date : 2019-02-22 DOI: 10.1145/3318299.3318351
Lanxin Li, Ping Gong, L. Ji
Character-level sequence label tagging is the most efficient way to solve unknown words problem for Chinese word segment. But the most widely used model, Conditional Random Fields (CRF), needs a large amount of manual design features. So it is appropriate to combine CRF and neural networks such as recurrent neural network (RNN), which is adopted in many natural language processing (NLP) tasks. However, RNN is rather slow because of the timing dependence between computations and not good at capturing local information of the sentence. In order to solve this problem, we introduce a self-attention mechanism, which completes the calculation between the different positions of the sentence with the same distance, into CWS. And we propose a deep neural network, which combines convolution neural networks and self-attention mechanism. Then, we evaluate the model on the PKU dataset and the MSR dataset. The results show that our model perform much better.
字符级序列标注是解决汉语分词中未知词问题的最有效方法。但是使用最广泛的条件随机场(CRF)模型需要大量的手工设计特征。因此,将CRF与神经网络(如递归神经网络(RNN))相结合是非常合适的,而递归神经网络在许多自然语言处理(NLP)任务中都得到了应用。然而,由于计算之间的时间依赖,RNN的速度很慢,并且不擅长捕获句子的局部信息。为了解决这一问题,我们引入自注意机制,在CWS中完成相同距离句子的不同位置之间的计算。并提出了一种将卷积神经网络与自注意机制相结合的深度神经网络。然后,我们在PKU数据集和MSR数据集上对模型进行了评估。结果表明,该模型的性能要好得多。
{"title":"A Deep Attention Network for Chinese Word Segment","authors":"Lanxin Li, Ping Gong, L. Ji","doi":"10.1145/3318299.3318351","DOIUrl":"https://doi.org/10.1145/3318299.3318351","url":null,"abstract":"Character-level sequence label tagging is the most efficient way to solve unknown words problem for Chinese word segment. But the most widely used model, Conditional Random Fields (CRF), needs a large amount of manual design features. So it is appropriate to combine CRF and neural networks such as recurrent neural network (RNN), which is adopted in many natural language processing (NLP) tasks. However, RNN is rather slow because of the timing dependence between computations and not good at capturing local information of the sentence. In order to solve this problem, we introduce a self-attention mechanism, which completes the calculation between the different positions of the sentence with the same distance, into CWS. And we propose a deep neural network, which combines convolution neural networks and self-attention mechanism. Then, we evaluate the model on the PKU dataset and the MSR dataset. The results show that our model perform much better.","PeriodicalId":164987,"journal":{"name":"International Conference on Machine Learning and Computing","volume":"38 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-02-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127215503","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
A Robust Mature Tomato Detection in Greenhouse Scenes Using Machine Learning and Color Analysis 基于机器学习和颜色分析的温室场景成熟番茄鲁棒检测
Pub Date : 2019-02-22 DOI: 10.1145/3318299.3318338
Guoxu Liu, Shuyi Mao, Hui Jin, J. H. Kim
A new algorithm for automatic tomato detection in regular color images is proposed, which can reduce the influence of illumination, color similarity as well as suppress the effect of occlusion. The method uses a Support Vector Machine (SVM) with Histograms of Oriented Gradients (HOG) to detect the tomatoes, followed by a color analysis method for false positive removal. And the Non-Maximum Suppression Method (NMS) is employed to merge the detection results. Finally, a total of 144 images were used for the experiment. The results showed that the recall and precision of the classifier were 96.67% and 98.64% on the test set. Compared with other methods developed in recent years, the proposed algorithm shows substantial improvement for tomato detection.
提出了一种在规则彩色图像中自动检测番茄的新算法,该算法可以降低光照和颜色相似度的影响,并抑制遮挡的影响。该方法采用支持向量机(SVM)和定向梯度直方图(HOG)对番茄进行检测,然后采用颜色分析方法去除假阳性。采用非最大抑制法(NMS)对检测结果进行合并。最后,实验共使用了144张图像。结果表明,该分类器的查全率和查准率分别为96.67%和98.64%。与近年来开发的其他方法相比,该算法在番茄检测方面有了较大的改进。
{"title":"A Robust Mature Tomato Detection in Greenhouse Scenes Using Machine Learning and Color Analysis","authors":"Guoxu Liu, Shuyi Mao, Hui Jin, J. H. Kim","doi":"10.1145/3318299.3318338","DOIUrl":"https://doi.org/10.1145/3318299.3318338","url":null,"abstract":"A new algorithm for automatic tomato detection in regular color images is proposed, which can reduce the influence of illumination, color similarity as well as suppress the effect of occlusion. The method uses a Support Vector Machine (SVM) with Histograms of Oriented Gradients (HOG) to detect the tomatoes, followed by a color analysis method for false positive removal. And the Non-Maximum Suppression Method (NMS) is employed to merge the detection results. Finally, a total of 144 images were used for the experiment. The results showed that the recall and precision of the classifier were 96.67% and 98.64% on the test set. Compared with other methods developed in recent years, the proposed algorithm shows substantial improvement for tomato detection.","PeriodicalId":164987,"journal":{"name":"International Conference on Machine Learning and Computing","volume":"125 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-02-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116845929","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
Multi-Path and Multi-Loss Network for Person Re-Identification 基于多路径多损失的人员再识别网络
Pub Date : 2019-02-22 DOI: 10.1145/3318299.3318331
Jiabao Wang, Shanshan Jiao, Yang Li, Zhuang Miao
In person re-identification (re-ID), most state-of-the-art models extract features by convolutional neural networks to do similarity comparison. Feature representation becomes the key task for person re-ID. However, the learned features are not good enough based on a single-path and single-loss network because the learned objective only achieves one of the multiple minima. To improve feature representation, we propose a multi-path and multi-loss network (MPMLN) and concatenate multi-path features to represent pedestrian. Subsequently, we design MPMLN based on ResNet-50 and construct an end-to-end architecture. The backbone of our proposed network shares the local parameters for multiple paths and multiple losses. It has fewer parameters than multiple independent networks. Experimental results show that our MPMLN achieves the state-of-the-art performance on the public Market1501, DukeMTMC-reID and CUHK03 person re-ID benchmarks.
在人的再识别(re-ID)中,大多数最先进的模型都是通过卷积神经网络提取特征进行相似性比较。特征表示成为人员身份识别的关键任务。然而,基于单路径单损失网络,学习到的特征不够好,因为学习目标只能达到多个最小值中的一个。为了改进特征表示,我们提出了一种多路径多损失网络(MPMLN),并将多路径特征连接起来表示行人。随后,我们设计了基于ResNet-50的MPMLN,并构建了端到端架构。我们提出的网络的主干共享多个路径和多个损失的本地参数。它具有比多个独立网络更少的参数。实验结果表明,我们的MPMLN在公开市场1501,DukeMTMC-reID和CUHK03人重新id基准上达到了最先进的性能。
{"title":"Multi-Path and Multi-Loss Network for Person Re-Identification","authors":"Jiabao Wang, Shanshan Jiao, Yang Li, Zhuang Miao","doi":"10.1145/3318299.3318331","DOIUrl":"https://doi.org/10.1145/3318299.3318331","url":null,"abstract":"In person re-identification (re-ID), most state-of-the-art models extract features by convolutional neural networks to do similarity comparison. Feature representation becomes the key task for person re-ID. However, the learned features are not good enough based on a single-path and single-loss network because the learned objective only achieves one of the multiple minima. To improve feature representation, we propose a multi-path and multi-loss network (MPMLN) and concatenate multi-path features to represent pedestrian. Subsequently, we design MPMLN based on ResNet-50 and construct an end-to-end architecture. The backbone of our proposed network shares the local parameters for multiple paths and multiple losses. It has fewer parameters than multiple independent networks. Experimental results show that our MPMLN achieves the state-of-the-art performance on the public Market1501, DukeMTMC-reID and CUHK03 person re-ID benchmarks.","PeriodicalId":164987,"journal":{"name":"International Conference on Machine Learning and Computing","volume":"26 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-02-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114925832","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An Exact Penalty Method for Top-k Multiclass Classification Based on L0 Norm Minimization 基于L0范数最小化的Top-k多类分类的精确惩罚方法
Pub Date : 2019-02-22 DOI: 10.1145/3318299.3318335
Haoxian Tan
Top-k multiclass classification is one of the central problems in machine learning and computer vision. It aims at classifying instances into one of three or more classes and allows k guesses to evaluate classifiers based on the top-k error. However, existing solutions either simply ignore the specific structure of the top-k error by directly solving the top-1 error minimization problem or consider convex relaxation approximation for the original top-k error model. In this paper, we formulate the top-k multiclass classification problem as an ℓ0 norm minimization problem. Although the model is NP-hard, we reformulate it as an equivalent mathematical program with equilibrium constraints and consider an exact penalty method to solve it. The algorithm solves a sequence of convex optimization problems to find a desirable solution for the original nonconvex problem. Finally, we demonstrate the efficacy of our method on some real-world data sets. As a result, our method achieves state-of-the-art performance in term of accuracy
Top-k多类分类是机器学习和计算机视觉领域的核心问题之一。它旨在将实例分类为三个或更多类中的一个,并允许k次猜测来评估基于top-k错误的分类器。然而,现有的解决方案要么直接解决top-1误差最小化问题,直接忽略top-k误差的具体结构,要么考虑对原top-k误差模型进行凸松弛逼近。本文将top-k多类分类问题表述为一个l0范数最小化问题。虽然该模型是np困难的,但我们将其重新表述为具有均衡约束的等效数学规划,并考虑了一种精确惩罚方法来求解它。该算法通过求解一系列凸优化问题来寻找原非凸问题的理想解。最后,我们在一些真实世界的数据集上证明了我们的方法的有效性。因此,我们的方法在准确性方面达到了最先进的性能
{"title":"An Exact Penalty Method for Top-k Multiclass Classification Based on L0 Norm Minimization","authors":"Haoxian Tan","doi":"10.1145/3318299.3318335","DOIUrl":"https://doi.org/10.1145/3318299.3318335","url":null,"abstract":"Top-k multiclass classification is one of the central problems in machine learning and computer vision. It aims at classifying instances into one of three or more classes and allows k guesses to evaluate classifiers based on the top-k error. However, existing solutions either simply ignore the specific structure of the top-k error by directly solving the top-1 error minimization problem or consider convex relaxation approximation for the original top-k error model. In this paper, we formulate the top-k multiclass classification problem as an ℓ0 norm minimization problem. Although the model is NP-hard, we reformulate it as an equivalent mathematical program with equilibrium constraints and consider an exact penalty method to solve it. The algorithm solves a sequence of convex optimization problems to find a desirable solution for the original nonconvex problem. Finally, we demonstrate the efficacy of our method on some real-world data sets. As a result, our method achieves state-of-the-art performance in term of accuracy","PeriodicalId":164987,"journal":{"name":"International Conference on Machine Learning and Computing","volume":"20 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-02-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126091011","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Detecting Climate Change Deniers on Twitter Using a Deep Neural Network 使用深度神经网络检测推特上的气候变化否认者
Pub Date : 2019-02-22 DOI: 10.1145/3318299.3318382
Xingyu Chen, L. Zou, Bo Zhao
Climate change or global warming is a global threat to both human communities and natural systems. In recent years, there is an increasingly public debate on the existence of climate change or global warming, but data describing such discussions are difficult to access. Social media provide a new data source to survey public perceptions and attitudes toward such topics. However, enabling computers to automatically determine users' attitudes towards climate change based on social media contents is still challenging. Taking Twitter data as an example, this study analyzed public discussions about climate change and global warming in year 2016. The objectives are: (1) to develop an optimized Deep Neural Network (DNN) classifier to identify users who are climate change deniers based on tweet contents; (2) to examine the temporal patterns of climate change discussions on Twitter and its driving factors. Results demonstrate that the developed DNN model successfully identified climate change deniers based on tweet contents with an overall accuracy of 88%. There are more climate change discussions during September to December 2016, whereas the percentages of climate change deniers were lower in the same period. Public interests and attitudes on climate change were driven by extreme weather events and environmental policy changes. The developed methodology will shed lights on the utility of deep learning in natural language processing, while the results provide improved understanding of factors affecting public attitudes on climate change.
气候变化或全球变暖是对人类社区和自然系统的全球性威胁。近年来,关于气候变化或全球变暖是否存在的公开辩论越来越多,但描述这种讨论的数据很难获得。社交媒体为调查公众对这些话题的看法和态度提供了新的数据来源。然而,让计算机根据社交媒体内容自动判断用户对气候变化的态度仍然具有挑战性。本研究以Twitter数据为例,分析了2016年公众对气候变化和全球变暖的讨论。目标是:(1)开发优化的深度神经网络(DNN)分类器,根据推文内容识别否认气候变化的用户;(2)研究Twitter上气候变化讨论的时间格局及其驱动因素。结果表明,所开发的DNN模型成功地识别了基于tweet内容的气候变化否认者,总体准确率为88%。2016年9月至12月期间有更多关于气候变化的讨论,而同期否认气候变化的比例较低。公众对气候变化的兴趣和态度受到极端天气事件和环境政策变化的影响。所开发的方法将阐明深度学习在自然语言处理中的效用,而结果则有助于更好地理解影响公众对气候变化态度的因素。
{"title":"Detecting Climate Change Deniers on Twitter Using a Deep Neural Network","authors":"Xingyu Chen, L. Zou, Bo Zhao","doi":"10.1145/3318299.3318382","DOIUrl":"https://doi.org/10.1145/3318299.3318382","url":null,"abstract":"Climate change or global warming is a global threat to both human communities and natural systems. In recent years, there is an increasingly public debate on the existence of climate change or global warming, but data describing such discussions are difficult to access. Social media provide a new data source to survey public perceptions and attitudes toward such topics. However, enabling computers to automatically determine users' attitudes towards climate change based on social media contents is still challenging. Taking Twitter data as an example, this study analyzed public discussions about climate change and global warming in year 2016. The objectives are: (1) to develop an optimized Deep Neural Network (DNN) classifier to identify users who are climate change deniers based on tweet contents; (2) to examine the temporal patterns of climate change discussions on Twitter and its driving factors. Results demonstrate that the developed DNN model successfully identified climate change deniers based on tweet contents with an overall accuracy of 88%. There are more climate change discussions during September to December 2016, whereas the percentages of climate change deniers were lower in the same period. Public interests and attitudes on climate change were driven by extreme weather events and environmental policy changes. The developed methodology will shed lights on the utility of deep learning in natural language processing, while the results provide improved understanding of factors affecting public attitudes on climate change.","PeriodicalId":164987,"journal":{"name":"International Conference on Machine Learning and Computing","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-02-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125770304","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 17
Application of ConvLSTM Network in Numerical Temperature Prediction Interpretation ConvLSTM网络在数值温度预报解释中的应用
Pub Date : 2019-02-22 DOI: 10.1145/3318299.3318381
Hong Lin, Yunzi Hua, Leiming Ma, Lei Chen
The application research of machine learning methods has attracted attention in the meteorological field. In this paper, we establish a spatiotemporal temperature deviation prediction model (PredTemp) based on convolution and long short-term memory network (ConvLSTM). The model is trained with numerical weatherprediction (NWP), and the prediction results are used to correct the temperature prediction in NWP. Exploring the influence of the weather elements added to the model on the prediction results is also the focus of this paper. Two datasets are constructed for this purpose: the temperature forecast deviationdataset (dataset1) is constructed by using the temperature forecastsand the analysis field in NWP, a precipitation forecast dataset (dataset2) was constructed using the precipitation forecasts in NWP. The experimental results show that the model is effective. Using dataset1 as dataset for training, the accuracy rate of temperaturecorrected by PredTempwas increased by 3%compared to NWP; using dataset1 and dataset2 as dataset for training, the accuracy rate of temperature corrected by PredTemp was increased by 4%. The addition of precipitation elements has played a positive role in improving the accuracy of the model prediction.
机器学习方法在气象领域的应用研究备受关注。本文建立了基于卷积和长短期记忆网络(ConvLSTM)的时空温度偏差预测模型PredTemp。利用数值天气预报(NWP)对模型进行训练,并利用预报结果对数值天气预报中的温度预报进行校正。探讨模型中加入的气象要素对预测结果的影响也是本文研究的重点。为此构建了两个数据集:利用NWP的温度预报和分析场构建温度预报偏差数据集(dataset1),利用NWP的降水预报构建降水预报数据集(dataset2)。实验结果表明,该模型是有效的。使用dataset1作为训练数据集,predtemp的温度校正准确率比NWP提高了3%;使用dataset1和dataset2作为训练数据集,PredTemp对温度校正的准确率提高了4%。降水要素的加入对提高模式预测精度起到了积极的作用。
{"title":"Application of ConvLSTM Network in Numerical Temperature Prediction Interpretation","authors":"Hong Lin, Yunzi Hua, Leiming Ma, Lei Chen","doi":"10.1145/3318299.3318381","DOIUrl":"https://doi.org/10.1145/3318299.3318381","url":null,"abstract":"The application research of machine learning methods has attracted attention in the meteorological field. In this paper, we establish a spatiotemporal temperature deviation prediction model (PredTemp) based on convolution and long short-term memory network (ConvLSTM). The model is trained with numerical weatherprediction (NWP), and the prediction results are used to correct the temperature prediction in NWP. Exploring the influence of the weather elements added to the model on the prediction results is also the focus of this paper. Two datasets are constructed for this purpose: the temperature forecast deviationdataset (dataset1) is constructed by using the temperature forecastsand the analysis field in NWP, a precipitation forecast dataset (dataset2) was constructed using the precipitation forecasts in NWP. The experimental results show that the model is effective. Using dataset1 as dataset for training, the accuracy rate of temperaturecorrected by PredTempwas increased by 3%compared to NWP; using dataset1 and dataset2 as dataset for training, the accuracy rate of temperature corrected by PredTemp was increased by 4%. The addition of precipitation elements has played a positive role in improving the accuracy of the model prediction.","PeriodicalId":164987,"journal":{"name":"International Conference on Machine Learning and Computing","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-02-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134353159","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 9
Minimizing the Misclassification Rate of the Nearest Neighbor Rule Using a Two-stage Method 用两阶段法最小化最近邻规则的误分类率
Pub Date : 2019-02-22 DOI: 10.1145/3318299.3318339
Yunlong Gao, Si-Zhe Luo, Jinyan Pan, Baihua Chen, Peng Gao
The kNN classification performance entirely depends on the selected neighbors. In the past, many nearest neighbor (NN)-based methods mainly focus on learning distance measure metrics so that a neighborhood of an approximately constant posteriori probability can be produced, whereas limited works are performed to study the influences of the distribution characteristics of each neighbor. In this paper, we point out why the best distance measurement (BDM) is sensitive to malicious samples, and then a robust best distance measurement (RBDM) is suggested to solve this problem. Moreover, we also investigated the influences of the distribution characteristics of each neighbor for the classification performance, so that a two-stage method, called weighted robust best distance measurement kNN method (WRBDMkNN), is proposed aiming to minimize the misclassification rate of the nearest neighbor rule. Extensive experiments on diversity datasets indicate that the proposed method can achieve more encouraging results compared with some state-of-the-art NN-based methods.
kNN分类性能完全取决于所选择的邻居。过去,许多基于最近邻(NN)的方法主要集中在学习距离度量度量,从而产生近似恒定后验概率的邻域,而对每个邻域分布特征的影响研究较少。本文指出了最佳距离测量(BDM)对恶意样本敏感的原因,并提出了一种鲁棒最佳距离测量(RBDM)来解决这一问题。此外,我们还研究了每个邻居的分布特征对分类性能的影响,从而提出了一种两阶段方法,称为加权鲁棒最佳距离测量kNN方法(WRBDMkNN),旨在最大限度地降低最近邻规则的误分类率。在多样性数据集上的大量实验表明,与一些最先进的基于神经网络的方法相比,该方法可以获得更令人鼓舞的结果。
{"title":"Minimizing the Misclassification Rate of the Nearest Neighbor Rule Using a Two-stage Method","authors":"Yunlong Gao, Si-Zhe Luo, Jinyan Pan, Baihua Chen, Peng Gao","doi":"10.1145/3318299.3318339","DOIUrl":"https://doi.org/10.1145/3318299.3318339","url":null,"abstract":"The kNN classification performance entirely depends on the selected neighbors. In the past, many nearest neighbor (NN)-based methods mainly focus on learning distance measure metrics so that a neighborhood of an approximately constant posteriori probability can be produced, whereas limited works are performed to study the influences of the distribution characteristics of each neighbor. In this paper, we point out why the best distance measurement (BDM) is sensitive to malicious samples, and then a robust best distance measurement (RBDM) is suggested to solve this problem. Moreover, we also investigated the influences of the distribution characteristics of each neighbor for the classification performance, so that a two-stage method, called weighted robust best distance measurement kNN method (WRBDMkNN), is proposed aiming to minimize the misclassification rate of the nearest neighbor rule. Extensive experiments on diversity datasets indicate that the proposed method can achieve more encouraging results compared with some state-of-the-art NN-based methods.","PeriodicalId":164987,"journal":{"name":"International Conference on Machine Learning and Computing","volume":"17 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-02-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133476594","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
International Conference on Machine Learning and Computing
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1