首页 > 最新文献

International Conference on Machine Learning and Computing最新文献

英文 中文
Facial Expression Recognition Algorithm Based on the Combination of CNN and K-Means 基于CNN和K-Means相结合的面部表情识别算法
Pub Date : 2019-02-22 DOI: 10.1145/3318299.3318344
Tongtong Cao, Ming Li
Aiming at the problems of low recognition rate and slow training speed of facial expression recognition method in the background of complex images, an improved facial expression recognition algorithm based on convolutional neural networks is proposed. The proposed algorithm introduces K-Means clustering idea and SVM classifier in the framework of convolutional neural network. Firstly, the algorithm trains the K-Means clustering model by using the label-free expression images, and selects the K-means clustering centers with good data characteristics, which are used as the initial value of the convolution kernel of the CNN model to extract features. Secondly, using the feature extraction processing of the convolutional neural network, the extracted features are fed to the multi-class SVM classifier. The experimental results show that the proposed method reduces the training time of the model overall, improves the accuracies of facial expression recognition under the background of complex images, and has a certain robustness.
针对复杂图像背景下面部表情识别方法识别率低、训练速度慢的问题,提出了一种改进的基于卷积神经网络的面部表情识别算法。该算法在卷积神经网络框架中引入k均值聚类思想和SVM分类器。该算法首先利用无标签表达图像训练K-Means聚类模型,选取数据特征较好的K-Means聚类中心作为CNN模型卷积核的初始值进行特征提取;其次,利用卷积神经网络的特征提取处理,将提取的特征输入到多类SVM分类器中;实验结果表明,该方法总体上减少了模型的训练时间,提高了复杂图像背景下面部表情识别的准确率,并具有一定的鲁棒性。
{"title":"Facial Expression Recognition Algorithm Based on the Combination of CNN and K-Means","authors":"Tongtong Cao, Ming Li","doi":"10.1145/3318299.3318344","DOIUrl":"https://doi.org/10.1145/3318299.3318344","url":null,"abstract":"Aiming at the problems of low recognition rate and slow training speed of facial expression recognition method in the background of complex images, an improved facial expression recognition algorithm based on convolutional neural networks is proposed. The proposed algorithm introduces K-Means clustering idea and SVM classifier in the framework of convolutional neural network. Firstly, the algorithm trains the K-Means clustering model by using the label-free expression images, and selects the K-means clustering centers with good data characteristics, which are used as the initial value of the convolution kernel of the CNN model to extract features. Secondly, using the feature extraction processing of the convolutional neural network, the extracted features are fed to the multi-class SVM classifier. The experimental results show that the proposed method reduces the training time of the model overall, improves the accuracies of facial expression recognition under the background of complex images, and has a certain robustness.","PeriodicalId":164987,"journal":{"name":"International Conference on Machine Learning and Computing","volume":"92 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-02-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133888993","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Learning a Deep Convolutional Network for Speckle Noise Reduction in Underwater Sonar Images 基于深度卷积网络的水下声纳图像散斑降噪研究
Pub Date : 2019-02-22 DOI: 10.1145/3318299.3318358
Yuxu Lu, R. W. Liu, Fenge Chen, Liang Xie
Underwater sonar imaging system has been widely utilized to detect and identify the submerged objects of interest. However, imaging quality often suffers from the undesirable signal-dependent speckle noise during signal acquisition and transmission. The speckle noise will restrict the practical applications, such as object detection, tracking and recognition, etc. To enhance the sonar imaging performance, we propose a deep learning approach to directly estimate the speckle noise in logarithmic domain based on the convolutional neural network. Once the speckle noise is obtained, the latent sharp image can then be easily calculated according to the image degradation model. The patch-based loss function, i.e., structural similarity metric, is adopted to preserve the important geometrical structures during speckle noise reduction. Experiments have been implemented on different noise levels to demonstrate the effectiveness of the proposed deep learning approach. Experimental results have illustrated that it outperforms several widely-used speckle noise reduction methods.
水下声纳成像系统已被广泛用于探测和识别水下目标。然而,在信号采集和传输过程中,成像质量经常受到信号相关散斑噪声的影响。散斑噪声的存在将制约其在目标检测、跟踪和识别等方面的实际应用。为了提高声纳成像性能,提出了一种基于卷积神经网络的深度学习方法,在对数域直接估计散斑噪声。一旦获得散斑噪声,就可以根据图像退化模型轻松地计算出潜在的尖锐图像。在散斑降噪过程中,采用基于patch的损失函数即结构相似性度量来保留重要的几何结构。在不同的噪声水平下进行了实验,以证明所提出的深度学习方法的有效性。实验结果表明,该方法优于几种常用的散斑降噪方法。
{"title":"Learning a Deep Convolutional Network for Speckle Noise Reduction in Underwater Sonar Images","authors":"Yuxu Lu, R. W. Liu, Fenge Chen, Liang Xie","doi":"10.1145/3318299.3318358","DOIUrl":"https://doi.org/10.1145/3318299.3318358","url":null,"abstract":"Underwater sonar imaging system has been widely utilized to detect and identify the submerged objects of interest. However, imaging quality often suffers from the undesirable signal-dependent speckle noise during signal acquisition and transmission. The speckle noise will restrict the practical applications, such as object detection, tracking and recognition, etc. To enhance the sonar imaging performance, we propose a deep learning approach to directly estimate the speckle noise in logarithmic domain based on the convolutional neural network. Once the speckle noise is obtained, the latent sharp image can then be easily calculated according to the image degradation model. The patch-based loss function, i.e., structural similarity metric, is adopted to preserve the important geometrical structures during speckle noise reduction. Experiments have been implemented on different noise levels to demonstrate the effectiveness of the proposed deep learning approach. Experimental results have illustrated that it outperforms several widely-used speckle noise reduction methods.","PeriodicalId":164987,"journal":{"name":"International Conference on Machine Learning and Computing","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-02-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130515940","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 9
DAD-MCNN: DDoS Attack Detection via Multi-channel CNN DAD-MCNN:通过多通道CNN检测DDoS攻击
Pub Date : 2019-02-22 DOI: 10.1145/3318299.3318329
Jinyin Chen, Yitao Yang, Keke Hu, Hai-bin Zheng, Zhen Wang
With the continuous development of web services, the web security becomes more and more important. Distributed Denial of Service (DDoS) attack as one of the most common form of attacks, has produced serious economic damages. DDoS attack detection as one of main defense methods is suffered increasing attention by researchers. Most of them use machine learning methods to make good detection performance. However, there are still gaps between real detection rate and expected one, conventional machine learning methods are limited compared with deep learning. In this paper, we propose DAD-MCNN, a multi-channel CNN(MC-CNN) based DDoS attack detection framework, which can fully utilize information from a huge amount of network packages and set up an earlier warning system. Our contributions can be summarized as follows: (1) we propose a new preprocessing method for the network dataset. (2) MC-CNN is applied to detect DDoS attack and the detection result is decided by data in respective channels. (3) We use incremental training method to optimize training procedures and time in MC-CNN. (4) The experiment result shows that MC-CNN has the highest accuracy compared with conventional machine learning methods. The result also proves that our approach has performed well not only in DDoS attack detection but also in other anomaly attack detection.
随着web服务的不断发展,web安全变得越来越重要。分布式拒绝服务(DDoS)攻击作为一种最常见的攻击形式,已经造成了严重的经济损失。DDoS攻击检测作为主要的防御手段之一,越来越受到研究人员的重视。它们大多使用机器学习方法来获得良好的检测性能。然而,实际检测率与预期检测率之间仍然存在差距,传统的机器学习方法与深度学习相比存在局限性。本文提出了一种基于多通道CNN(MC-CNN)的DDoS攻击检测框架DAD-MCNN,该框架可以充分利用大量网络数据包中的信息,并建立早期预警系统。我们的贡献可以概括如下:(1)我们提出了一种新的网络数据集预处理方法。(2)采用MC-CNN对DDoS攻击进行检测,检测结果由各通道数据决定。(3)采用增量训练方法优化MC-CNN的训练过程和时间。(4)实验结果表明,与传统的机器学习方法相比,MC-CNN具有最高的准确率。结果表明,该方法不仅适用于DDoS攻击检测,也适用于其他异常攻击检测。
{"title":"DAD-MCNN: DDoS Attack Detection via Multi-channel CNN","authors":"Jinyin Chen, Yitao Yang, Keke Hu, Hai-bin Zheng, Zhen Wang","doi":"10.1145/3318299.3318329","DOIUrl":"https://doi.org/10.1145/3318299.3318329","url":null,"abstract":"With the continuous development of web services, the web security becomes more and more important. Distributed Denial of Service (DDoS) attack as one of the most common form of attacks, has produced serious economic damages. DDoS attack detection as one of main defense methods is suffered increasing attention by researchers. Most of them use machine learning methods to make good detection performance. However, there are still gaps between real detection rate and expected one, conventional machine learning methods are limited compared with deep learning. In this paper, we propose DAD-MCNN, a multi-channel CNN(MC-CNN) based DDoS attack detection framework, which can fully utilize information from a huge amount of network packages and set up an earlier warning system. Our contributions can be summarized as follows: (1) we propose a new preprocessing method for the network dataset. (2) MC-CNN is applied to detect DDoS attack and the detection result is decided by data in respective channels. (3) We use incremental training method to optimize training procedures and time in MC-CNN. (4) The experiment result shows that MC-CNN has the highest accuracy compared with conventional machine learning methods. The result also proves that our approach has performed well not only in DDoS attack detection but also in other anomaly attack detection.","PeriodicalId":164987,"journal":{"name":"International Conference on Machine Learning and Computing","volume":"59 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-02-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115209731","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 25
Roadside Traffic Sign Detection Based on Faster R-CNN 基于更快R-CNN的路边交通标志检测
Pub Date : 2019-02-22 DOI: 10.1145/3318299.3318348
Xingyu Fu, Bin Fang, Jiye Qian, Zhenni Wu, Jiajie Zhu, Tongxin Du
This paper presents an improved traffic sign detection method based on Faster R-CNN with dataset augmentation and subcategory detection scheme. Firstly, we extract natural scene frames from given videos and determine 20 categories of traffic signs. Secondly, we extend the image dataset and extract regions of interest, then manually annotate all categories. Thirdly, we train the Faster R-CNN model based on TensorFlow, then test the model and obtain the following evaluation indexes: the mean average precision is 99.07%, the recall rate is 99.66%, and the precision rate is 97.54%. Finally, we add the subcategory detection scheme to determine traffic light states, and we get the following evaluation indexes: the mean average precision is 99.50%, the recall rate is 100%, and the precision rate is 94.40%. Our experiments prove the robustness and accuracy for both traffic sign detection and subcategory detection of traffic light.
本文提出了一种基于Faster R-CNN的改进交通标志检测方法,并结合数据集增强和子类别检测方案。首先,我们从给定的视频中提取自然场景帧,并确定20类交通标志。其次,我们扩展图像数据集并提取感兴趣的区域,然后手动标注所有类别。第三,基于TensorFlow对Faster R-CNN模型进行训练,并对模型进行测试,得到了平均准确率为99.07%,召回率为99.66%,准确率为97.54%的评价指标。最后,我们加入了子类别检测方案来确定交通灯状态,得到了以下评价指标:平均准确率为99.50%,召回率为100%,准确率为94.40%。实验证明了该方法在交通标志检测和交通信号灯子类别检测方面的鲁棒性和准确性。
{"title":"Roadside Traffic Sign Detection Based on Faster R-CNN","authors":"Xingyu Fu, Bin Fang, Jiye Qian, Zhenni Wu, Jiajie Zhu, Tongxin Du","doi":"10.1145/3318299.3318348","DOIUrl":"https://doi.org/10.1145/3318299.3318348","url":null,"abstract":"This paper presents an improved traffic sign detection method based on Faster R-CNN with dataset augmentation and subcategory detection scheme. Firstly, we extract natural scene frames from given videos and determine 20 categories of traffic signs. Secondly, we extend the image dataset and extract regions of interest, then manually annotate all categories. Thirdly, we train the Faster R-CNN model based on TensorFlow, then test the model and obtain the following evaluation indexes: the mean average precision is 99.07%, the recall rate is 99.66%, and the precision rate is 97.54%. Finally, we add the subcategory detection scheme to determine traffic light states, and we get the following evaluation indexes: the mean average precision is 99.50%, the recall rate is 100%, and the precision rate is 94.40%. Our experiments prove the robustness and accuracy for both traffic sign detection and subcategory detection of traffic light.","PeriodicalId":164987,"journal":{"name":"International Conference on Machine Learning and Computing","volume":"12 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-02-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121291336","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Distributed Machine Learning over Directed Network with Fixed Communication Delays 具有固定通信延迟的定向网络上的分布式机器学习
Pub Date : 2019-02-22 DOI: 10.1145/3318299.3318340
Guo Zhenning
In this paper, we present a distributed machine learning algorithm over a network with fixed-delay tolerance. The network is directed and strongly connected. The training dataset is distributed to all agents in the network. We combine the distributed convex optimization (which utilizes double linear iterations) and corresponding machine learning algorithm. Each agent can only access its own local dataset. Suppose the delay between any pair of agents is time-invariant. The simulation shows that our algorithm is able to work under delayed transmission, in the sense that over time at each agent t the ratio of the estimate value xi(t) and scaling variable yi(t) can converge to the optimal point of the global cost function corresponding to the machine learning problem.
在本文中,我们提出了一种基于固定延迟容忍网络的分布式机器学习算法。网络是定向的,紧密相连的。训练数据集被分发给网络中的所有代理。我们将分布式凸优化(利用双线性迭代)和相应的机器学习算法相结合。每个代理只能访问自己的本地数据集。假设任意一对智能体之间的延迟是定常的。仿真表明,我们的算法能够在延迟传输下工作,即随着时间的推移,在每个代理t处,估估值xi(t)与缩放变量yi(t)的比值可以收敛到机器学习问题对应的全局代价函数的最优点。
{"title":"Distributed Machine Learning over Directed Network with Fixed Communication Delays","authors":"Guo Zhenning","doi":"10.1145/3318299.3318340","DOIUrl":"https://doi.org/10.1145/3318299.3318340","url":null,"abstract":"In this paper, we present a distributed machine learning algorithm over a network with fixed-delay tolerance. The network is directed and strongly connected. The training dataset is distributed to all agents in the network. We combine the distributed convex optimization (which utilizes double linear iterations) and corresponding machine learning algorithm. Each agent can only access its own local dataset. Suppose the delay between any pair of agents is time-invariant. The simulation shows that our algorithm is able to work under delayed transmission, in the sense that over time at each agent t the ratio of the estimate value xi(t) and scaling variable yi(t) can converge to the optimal point of the global cost function corresponding to the machine learning problem.","PeriodicalId":164987,"journal":{"name":"International Conference on Machine Learning and Computing","volume":"29 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-02-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121362752","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Predicting Personality Using Facebook Status Based on Semi-supervised Learning 基于半监督学习的Facebook状态预测人格
Pub Date : 2019-02-22 DOI: 10.1145/3318299.3318363
Heci Zheng, Chunhua Wu
Personality analysis on social media is a research hotspot due to the importance of personality research in psychology as well as the rapid development of social media. Many studies have used social media status to analyze user's personality, but most of them are conducted on inadequate label data and linguistic features. In this paper, to explore the usage of unlabeled data on personality analysis, a personality analysis framework based on semi-supervised learning is introduced. Besides, for making full use of the language information in social media status, the well-known n-gram model is adopted to extract linguistic features. The experimental results demonstrate the semi-supervised learning can take advantage of unlabeled data and improve the accuracy of prediction model.
由于人格研究在心理学中的重要性以及社交媒体的快速发展,社交媒体上的人格分析成为一个研究热点。许多研究使用社交媒体状态来分析用户的个性,但大多是在标签数据和语言特征不足的情况下进行的。为了探索未标记数据在人格分析中的应用,本文提出了一种基于半监督学习的人格分析框架。此外,为了充分利用社交媒体状态中的语言信息,我们采用了众所周知的n-gram模型来提取语言特征。实验结果表明,半监督学习可以充分利用未标记数据,提高预测模型的准确性。
{"title":"Predicting Personality Using Facebook Status Based on Semi-supervised Learning","authors":"Heci Zheng, Chunhua Wu","doi":"10.1145/3318299.3318363","DOIUrl":"https://doi.org/10.1145/3318299.3318363","url":null,"abstract":"Personality analysis on social media is a research hotspot due to the importance of personality research in psychology as well as the rapid development of social media. Many studies have used social media status to analyze user's personality, but most of them are conducted on inadequate label data and linguistic features. In this paper, to explore the usage of unlabeled data on personality analysis, a personality analysis framework based on semi-supervised learning is introduced. Besides, for making full use of the language information in social media status, the well-known n-gram model is adopted to extract linguistic features. The experimental results demonstrate the semi-supervised learning can take advantage of unlabeled data and improve the accuracy of prediction model.","PeriodicalId":164987,"journal":{"name":"International Conference on Machine Learning and Computing","volume":"29 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-02-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123568795","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 18
Ensemble-Initialized k-Means Clustering Ensemble-Initialized k-Means聚类
Pub Date : 2019-02-22 DOI: 10.1145/3318299.3318308
Shasha Xu, Dong Huang
As one of the most classical clustering techniques, the k-means clustering has been widely used in various areas over the past few decades. Despite its significant success, there are still several challenging issues in the k-means clustering research, one of which lies in its high sensitivity to the selection of the initial cluster centers. In this paper, we propose a new cluster center initialization method for k-means based on ensemble learning. Specifically, an ensemble of base clusterings are first constructed by using multiple k-means clusterers with random initializations. Then, a co-association matrix is computed for the base clusterings, upon which the agglomerative clustering algorithm can thereby be performed to build a pre-clustering result. From the pre-clustering, the set of initial cluster centers are obtained and then used for the final k-means clustering process. Experiments on multiple real-world datasets have demonstrated the superiority of the proposed method.
k-均值聚类作为最经典的聚类技术之一,在过去的几十年里被广泛应用于各个领域。尽管k-means聚类研究取得了显著的成功,但仍存在一些具有挑战性的问题,其中之一是其对初始聚类中心选择的高度敏感性。本文提出了一种基于集成学习的k-means聚类中心初始化方法。具体而言,首先使用随机初始化的多个k-means聚类构造一个基本聚类集合。然后,计算基本聚类的协关联矩阵,在此基础上进行聚类算法,构建预聚类结果。从预聚类中获得初始聚类中心集合,然后用于最终的k-means聚类过程。在多个真实数据集上的实验证明了该方法的优越性。
{"title":"Ensemble-Initialized k-Means Clustering","authors":"Shasha Xu, Dong Huang","doi":"10.1145/3318299.3318308","DOIUrl":"https://doi.org/10.1145/3318299.3318308","url":null,"abstract":"As one of the most classical clustering techniques, the k-means clustering has been widely used in various areas over the past few decades. Despite its significant success, there are still several challenging issues in the k-means clustering research, one of which lies in its high sensitivity to the selection of the initial cluster centers. In this paper, we propose a new cluster center initialization method for k-means based on ensemble learning. Specifically, an ensemble of base clusterings are first constructed by using multiple k-means clusterers with random initializations. Then, a co-association matrix is computed for the base clusterings, upon which the agglomerative clustering algorithm can thereby be performed to build a pre-clustering result. From the pre-clustering, the set of initial cluster centers are obtained and then used for the final k-means clustering process. Experiments on multiple real-world datasets have demonstrated the superiority of the proposed method.","PeriodicalId":164987,"journal":{"name":"International Conference on Machine Learning and Computing","volume":"57 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-02-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124630921","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
ALBS: An Active Learning Framework Based on Syncretic Sample Selection Strategy 基于混合样本选择策略的主动学习框架
Pub Date : 2019-02-22 DOI: 10.1145/3318299.3318362
Longfei Pan, Xiaojun Wang
Machine learning has achieved outstanding performance in many fields, but its success heavily relies on the large number of annotated training samples. However, for many professional fields, data annotation is not only tedious and time consuming, but also demanding specialty-oriented knowledge and skills, which are not easily accessible. To significantly reduce the cost of annotation, we propose a novel active learning framework called ALBS. ALBS uses the syncretic strategy which incorporates "most discriminative" and "most representative" to seek "worthy" samples from unlabeled dataset and update the model incrementally to enhance the performance continuously. We have evaluated our method on two different audio datasets, demonstrating that the syncretic strategy can makes the promotion of model model's performance more robust and faster than the other strategies, and subsampling the historical labeled dataset can reduce unnecessary computing costs and storage space.
机器学习在许多领域都取得了出色的表现,但它的成功很大程度上依赖于大量带注释的训练样本。然而,对于许多专业领域来说,数据标注不仅繁琐、耗时,而且对专业知识和技能的要求也很高,不易获取。为了显著降低标注成本,我们提出了一种新的主动学习框架ALBS。ALBS采用“最具判别性”和“最具代表性”的融合策略,从未标记的数据集中寻找“有价值”的样本,并对模型进行增量更新,不断提高性能。我们在两个不同的音频数据集上对我们的方法进行了评估,结果表明,混合策略可以使模型模型性能的提升比其他策略更鲁棒和更快,并且对历史标记数据集进行子采样可以减少不必要的计算成本和存储空间。
{"title":"ALBS: An Active Learning Framework Based on Syncretic Sample Selection Strategy","authors":"Longfei Pan, Xiaojun Wang","doi":"10.1145/3318299.3318362","DOIUrl":"https://doi.org/10.1145/3318299.3318362","url":null,"abstract":"Machine learning has achieved outstanding performance in many fields, but its success heavily relies on the large number of annotated training samples. However, for many professional fields, data annotation is not only tedious and time consuming, but also demanding specialty-oriented knowledge and skills, which are not easily accessible. To significantly reduce the cost of annotation, we propose a novel active learning framework called ALBS. ALBS uses the syncretic strategy which incorporates \"most discriminative\" and \"most representative\" to seek \"worthy\" samples from unlabeled dataset and update the model incrementally to enhance the performance continuously. We have evaluated our method on two different audio datasets, demonstrating that the syncretic strategy can makes the promotion of model model's performance more robust and faster than the other strategies, and subsampling the historical labeled dataset can reduce unnecessary computing costs and storage space.","PeriodicalId":164987,"journal":{"name":"International Conference on Machine Learning and Computing","volume":"29 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-02-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116736660","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Convolutional Neural Network Based Ensemble Method for Cancer Prediction Using DNA Methylation Data 基于卷积神经网络的DNA甲基化数据癌症预测集成方法
Pub Date : 2019-02-22 DOI: 10.1145/3318299.3318372
Chao Xia, Yawen Xiao, Jun Wu, Xiaodong Zhao, Hua Li
Cancer is a deadly disease all over the world and its morbidity is increasing at an alarming rate in recent years. With the rapid development of computer science and machine learning technologies, computer-aid cancer prediction has achieved increasingly progress. DNA methylation, as an important epigenetic modification, plays a vital role in the formation and progression of cancer, and therefore can be used as a feature for cancer identification. In this study, we introduce a convolutional neural network based multi-model ensemble method for cancer prediction using DNA methylation data. We first choose five basic machine learning methods as the first stage classifiers and conduct prediction individually. Then, a convolutional neural network is used to find the high-level features among the classifiers and gives a credible prediction result. Experimental results on three DNA methylation datasets of Lung Adenocarcinoma, Liver Hepatocellular Carcinoma and Kidney Clear Cell Carcinoma show the proposed ensemble method can uncover the intricate relationship among the classifiers automatically and achieve better performances.
癌症是世界范围内的一种致命疾病,近年来其发病率正以惊人的速度增长。随着计算机科学和机器学习技术的飞速发展,计算机辅助癌症预测取得了越来越大的进展。DNA甲基化作为一种重要的表观遗传修饰,在癌症的形成和发展中起着至关重要的作用,因此可以作为癌症鉴定的一个特征。在这项研究中,我们引入了一种基于卷积神经网络的多模型集成方法,用于DNA甲基化数据的癌症预测。我们首先选择五种基本的机器学习方法作为第一阶段分类器,分别进行预测。然后,使用卷积神经网络在分类器中寻找高级特征,并给出可信的预测结果。在肺腺癌、肝细胞癌和肾透明细胞癌三个DNA甲基化数据集上的实验结果表明,所提出的集成方法可以自动揭示分类器之间复杂的关系,并取得较好的分类效果。
{"title":"A Convolutional Neural Network Based Ensemble Method for Cancer Prediction Using DNA Methylation Data","authors":"Chao Xia, Yawen Xiao, Jun Wu, Xiaodong Zhao, Hua Li","doi":"10.1145/3318299.3318372","DOIUrl":"https://doi.org/10.1145/3318299.3318372","url":null,"abstract":"Cancer is a deadly disease all over the world and its morbidity is increasing at an alarming rate in recent years. With the rapid development of computer science and machine learning technologies, computer-aid cancer prediction has achieved increasingly progress. DNA methylation, as an important epigenetic modification, plays a vital role in the formation and progression of cancer, and therefore can be used as a feature for cancer identification. In this study, we introduce a convolutional neural network based multi-model ensemble method for cancer prediction using DNA methylation data. We first choose five basic machine learning methods as the first stage classifiers and conduct prediction individually. Then, a convolutional neural network is used to find the high-level features among the classifiers and gives a credible prediction result. Experimental results on three DNA methylation datasets of Lung Adenocarcinoma, Liver Hepatocellular Carcinoma and Kidney Clear Cell Carcinoma show the proposed ensemble method can uncover the intricate relationship among the classifiers automatically and achieve better performances.","PeriodicalId":164987,"journal":{"name":"International Conference on Machine Learning and Computing","volume":"12 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-02-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117012563","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
Classification of EEG Signal by Training Neural Network with Swarm Optimization for Identification of Epilepsy 基于群优化训练神经网络的脑电信号分类癫痫识别
Pub Date : 2019-02-22 DOI: 10.1145/3318299.3318374
Iqra Tahir, Usman Qamar, Hassan Abbas, Babar Zeb, Sana Abid
EEG signal classification is a pivotal task for identification of different brain related disorders. The paper is about classification of EEG signal presenting a novel approach for the identification of whether the seizure is epileptic or normal that technique is based on training of neural network with having improved simplified swarm optimization algorithm. Our proposed methodology is evaluated with different parameters and testing accuracy of 94 % is reported for a publicly available dataset.
脑电信号分类是识别不同脑相关疾病的关键任务。本文对脑电图信号进行分类,提出了一种基于神经网络训练的识别癫痫发作与正常发作的新方法,该方法采用改进的简化群优化算法。我们提出的方法用不同的参数进行了评估,并且在公开可用的数据集上报告了94%的测试精度。
{"title":"Classification of EEG Signal by Training Neural Network with Swarm Optimization for Identification of Epilepsy","authors":"Iqra Tahir, Usman Qamar, Hassan Abbas, Babar Zeb, Sana Abid","doi":"10.1145/3318299.3318374","DOIUrl":"https://doi.org/10.1145/3318299.3318374","url":null,"abstract":"EEG signal classification is a pivotal task for identification of different brain related disorders. The paper is about classification of EEG signal presenting a novel approach for the identification of whether the seizure is epileptic or normal that technique is based on training of neural network with having improved simplified swarm optimization algorithm. Our proposed methodology is evaluated with different parameters and testing accuracy of 94 % is reported for a publicly available dataset.","PeriodicalId":164987,"journal":{"name":"International Conference on Machine Learning and Computing","volume":"360 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-02-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121717901","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
期刊
International Conference on Machine Learning and Computing
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1