首页 > 最新文献

2021 13th International Conference on Machine Learning and Computing最新文献

英文 中文
One-Shot Face Recognition Based on Multiple Classifiers Training 基于多分类器训练的一次性人脸识别
Pub Date : 2021-02-26 DOI: 10.1145/3457682.3457748
Vuliem Khong, Ziyu Zeng, Lu Fang, Shengjin Wang
One-shot face recognition is a challenging problem which requires recognizing novel identities from only one seen face image. One-shot classes are simply neglected because of the lack of training samples. Therefore, these classes contribute less to the improvement of face recognition performance. The main goal of one-shot face recognition task is to use the novel face samples to enhance the ability of network not only in close-set classify, but also in open-set face verification. In this paper, Base data and Novel data is trained separately with two classifiers to reduce the impact of data imbalance. We propose Confidence Constrain Loss to train classifiers in parallel and get better classifiers fusion in the test phase. Besides, we use data augmentation with 3D face reconstruction to obtain a variety of oneshot set's training samples. Thus, our method can effectively increase the recognition accuracy in the novel set without reducing recognition accuracy in base set. Experiments on MS-celeb-1M low-shot dataset demonstrate that our method achieve state-of-the-art which has 98.90% coverage at precision=99% without using external data.
一次性人脸识别是一个具有挑战性的问题,它要求仅从一张人脸图像中识别新的身份。由于缺乏训练样本,一次性类被简单地忽略了。因此,这些类对人脸识别性能的提高贡献较小。一次性人脸识别任务的主要目标是利用新的人脸样本来增强网络的闭集分类能力和开集人脸验证能力。本文采用两个分类器分别对Base数据和Novel数据进行训练,以减少数据不平衡的影响。我们提出了置信度约束损失来并行训练分类器,并在测试阶段得到了更好的分类器融合。此外,我们使用数据增强和三维人脸重建来获得各种单一集的训练样本。因此,我们的方法可以在不降低基集识别精度的前提下,有效地提高新集的识别精度。在MS-celeb-1M低镜头数据集上的实验表明,该方法在不使用外部数据的情况下达到了98.90%的精度=99%的覆盖率。
{"title":"One-Shot Face Recognition Based on Multiple Classifiers Training","authors":"Vuliem Khong, Ziyu Zeng, Lu Fang, Shengjin Wang","doi":"10.1145/3457682.3457748","DOIUrl":"https://doi.org/10.1145/3457682.3457748","url":null,"abstract":"One-shot face recognition is a challenging problem which requires recognizing novel identities from only one seen face image. One-shot classes are simply neglected because of the lack of training samples. Therefore, these classes contribute less to the improvement of face recognition performance. The main goal of one-shot face recognition task is to use the novel face samples to enhance the ability of network not only in close-set classify, but also in open-set face verification. In this paper, Base data and Novel data is trained separately with two classifiers to reduce the impact of data imbalance. We propose Confidence Constrain Loss to train classifiers in parallel and get better classifiers fusion in the test phase. Besides, we use data augmentation with 3D face reconstruction to obtain a variety of oneshot set's training samples. Thus, our method can effectively increase the recognition accuracy in the novel set without reducing recognition accuracy in base set. Experiments on MS-celeb-1M low-shot dataset demonstrate that our method achieve state-of-the-art which has 98.90% coverage at precision=99% without using external data.","PeriodicalId":142045,"journal":{"name":"2021 13th International Conference on Machine Learning and Computing","volume":"22 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-02-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133923097","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Research on Deep Sound Source Separation 深声源分离技术研究
Pub Date : 2021-02-26 DOI: 10.1145/3457682.3457741
Yunuo Yang, Honghui Li
The cocktail party effect is a fundamental problem in sound source separation, and many researchers have worked to solve this problem. In recent years, the most popular algorithms to solve the problem of sound source separation are Support Vector Machine (SVM), Gaussian Mixture Model (GMM), non-negative matrix factorization (NMF), and Variational Autoencoder (VAE). Especially VAE model showed excellent ability in dealing with the problem of sound separation. In this paper, the β-VAE model, combined with a weakly supervised classification proposed by Karamatlı et al., was first reproduced. Since Karamatlı's experiment only completed the connection between sound and words, in order to learn more information about the speaker, this model is used to learn a mapping between sounds and individual speakers and a mapping between sounds and gender. It turns out that the separation results could be obtained by retraining the model after the establishment of the new 'male' and 'female' labels. his result lays a foundation for the future study of the mapping between individuals and words. When the tag is specific to an individual, more data is needed to support this experiment, and the more data available for training, the better result the model will get.
鸡尾酒会效应是声源分离中的一个基本问题,许多研究者都在努力解决这个问题。近年来,解决声源分离问题最流行的算法是支持向量机(SVM)、高斯混合模型(GMM)、非负矩阵分解(NMF)和变分自编码器(VAE)。特别是VAE模型在处理声分离问题上表现出了出色的能力。本文首先再现了β-VAE模型,并结合karamatlati等人提出的弱监督分类。由于karamatlar的实验只完成了声音和单词之间的联系,为了了解更多关于说话人的信息,这个模型被用来学习声音和说话人个体之间的映射,以及声音和性别之间的映射。结果表明,在建立新的“男性”和“女性”标签后,可以通过重新训练模型来获得分离结果。他的研究结果为今后研究个体与词汇之间的映射关系奠定了基础。当标签是针对个体的时候,需要更多的数据来支持这个实验,训练的数据越多,模型得到的结果就越好。
{"title":"Research on Deep Sound Source Separation","authors":"Yunuo Yang, Honghui Li","doi":"10.1145/3457682.3457741","DOIUrl":"https://doi.org/10.1145/3457682.3457741","url":null,"abstract":"The cocktail party effect is a fundamental problem in sound source separation, and many researchers have worked to solve this problem. In recent years, the most popular algorithms to solve the problem of sound source separation are Support Vector Machine (SVM), Gaussian Mixture Model (GMM), non-negative matrix factorization (NMF), and Variational Autoencoder (VAE). Especially VAE model showed excellent ability in dealing with the problem of sound separation. In this paper, the β-VAE model, combined with a weakly supervised classification proposed by Karamatlı et al., was first reproduced. Since Karamatlı's experiment only completed the connection between sound and words, in order to learn more information about the speaker, this model is used to learn a mapping between sounds and individual speakers and a mapping between sounds and gender. It turns out that the separation results could be obtained by retraining the model after the establishment of the new 'male' and 'female' labels. his result lays a foundation for the future study of the mapping between individuals and words. When the tag is specific to an individual, more data is needed to support this experiment, and the more data available for training, the better result the model will get.","PeriodicalId":142045,"journal":{"name":"2021 13th International Conference on Machine Learning and Computing","volume":"126 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-02-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124209903","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Novel Spec-CNN-CTC Model for End-to-End Speech Recognition 端到端语音识别的新型Spec-CNN-CTC模型
Pub Date : 2021-02-26 DOI: 10.1145/3457682.3457703
Jing Xue, Jun Zhang
This paper discusses the application of a special data augmentation approach for end-to-end phone recognition system on the Deep Neural Networks. The system improves the performance of phone recognition and alleviates overfitting during training. Also, it offers a solution to the problem of few public datasets annotated at the phone level. And we propose the CNN-CTC structure as a baseline model. The model is based on Convolutional Neural Networks (CNNs) and Connectionist Temporal Classification (CTC) objective function. Which is an end-to-end structure, and there is no need to force alignment each frame of audio. The SpecAugment approach directly processes the feature of audio, such as the log Mel-spectrogram. In our experiment, the Spec-CNN-CTC system achieves a phone error rate of 16.11% on TIMIT corpus with no prior linguistic information. Which is outperforming the previous work Acoustic-State-Transition Model (ASTM) by 27.63%, the DNN-HMM with MFCC + IFCC features by 16.8%, the RNN-CRF model by 17.3% and the DBM-DNN model by 22.62%.
本文讨论了一种特殊的数据增强方法在深度神经网络端到端手机识别系统中的应用。该系统提高了手机识别的性能,缓解了训练过程中的过拟合问题。此外,它还解决了在电话级别上标注的公共数据集较少的问题。我们提出了CNN-CTC结构作为基线模型。该模型基于卷积神经网络(cnn)和连接时间分类(CTC)目标函数。这是一个端到端的结构,不需要强制对齐每一帧音频。SpecAugment方法直接处理音频的特征,如对数梅尔谱图。在我们的实验中,Spec-CNN-CTC系统在没有先验语言信息的TIMIT语料库上实现了16.11%的电话错误率。它比之前的声学状态转换模型(ASTM)高27.63%,比具有MFCC + IFCC特征的DNN-HMM高16.8%,比RNN-CRF模型高17.3%,比DBM-DNN模型高22.62%。
{"title":"A Novel Spec-CNN-CTC Model for End-to-End Speech Recognition","authors":"Jing Xue, Jun Zhang","doi":"10.1145/3457682.3457703","DOIUrl":"https://doi.org/10.1145/3457682.3457703","url":null,"abstract":"This paper discusses the application of a special data augmentation approach for end-to-end phone recognition system on the Deep Neural Networks. The system improves the performance of phone recognition and alleviates overfitting during training. Also, it offers a solution to the problem of few public datasets annotated at the phone level. And we propose the CNN-CTC structure as a baseline model. The model is based on Convolutional Neural Networks (CNNs) and Connectionist Temporal Classification (CTC) objective function. Which is an end-to-end structure, and there is no need to force alignment each frame of audio. The SpecAugment approach directly processes the feature of audio, such as the log Mel-spectrogram. In our experiment, the Spec-CNN-CTC system achieves a phone error rate of 16.11% on TIMIT corpus with no prior linguistic information. Which is outperforming the previous work Acoustic-State-Transition Model (ASTM) by 27.63%, the DNN-HMM with MFCC + IFCC features by 16.8%, the RNN-CRF model by 17.3% and the DBM-DNN model by 22.62%.","PeriodicalId":142045,"journal":{"name":"2021 13th International Conference on Machine Learning and Computing","volume":"17 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-02-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121212512","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Ensemble Learning in Stock Market Prediction 股票市场预测中的集成学习
Pub Date : 2021-02-26 DOI: 10.1145/3457682.3457727
Hassan Ezzeddine, Roger Achkar
In recent years, the increasing influence of machine learning in different industries had inspired many traders to benefit from it in the world of finance, stock trading is one of the most important activities. Predicting the direction of stock prices is a widely studied subject in many fields including trading, finance, statistics and computer science. The main concern for Investors is to maximize their profit if they determine when to buy/sell an investment they apply Analytical methods that makes use of different sources ranging from news to price data, all aiming at predicting the company's future stock price ML applications have presented investors with something new. A combination of technologies that could entirely reshape the way they make investment decisions. The purpose of this thesis is to leverage the aggregation of technical, fundamental, and sentiment analysis with stacked machine learning models capable of predicting profitable actions to be executed.
近年来,机器学习在不同行业的影响力越来越大,激发了许多交易者从中受益,在金融领域,股票交易是最重要的活动之一。预测股票价格的走向是一个在许多领域广泛研究的课题,包括交易、金融、统计和计算机科学。投资者主要关心的是,如果他们决定何时买入/卖出投资,他们应用分析方法,利用从新闻到价格数据等不同来源,所有这些方法都旨在预测公司未来的股价,ML应用程序为投资者提供了一些新的东西。这些技术的组合可能会完全重塑他们做出投资决策的方式。本文的目的是利用技术、基础和情绪分析的聚合,以及堆叠的机器学习模型,能够预测将要执行的有利可图的操作。
{"title":"Ensemble Learning in Stock Market Prediction","authors":"Hassan Ezzeddine, Roger Achkar","doi":"10.1145/3457682.3457727","DOIUrl":"https://doi.org/10.1145/3457682.3457727","url":null,"abstract":"In recent years, the increasing influence of machine learning in different industries had inspired many traders to benefit from it in the world of finance, stock trading is one of the most important activities. Predicting the direction of stock prices is a widely studied subject in many fields including trading, finance, statistics and computer science. The main concern for Investors is to maximize their profit if they determine when to buy/sell an investment they apply Analytical methods that makes use of different sources ranging from news to price data, all aiming at predicting the company's future stock price ML applications have presented investors with something new. A combination of technologies that could entirely reshape the way they make investment decisions. The purpose of this thesis is to leverage the aggregation of technical, fundamental, and sentiment analysis with stacked machine learning models capable of predicting profitable actions to be executed.","PeriodicalId":142045,"journal":{"name":"2021 13th International Conference on Machine Learning and Computing","volume":"39 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-02-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129293304","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Tracking Ground Targets with Road Constraints Using a JMS-GM-PHD Filter 利用JMS-GM-PHD滤波器跟踪道路约束下的地面目标
Pub Date : 2021-02-26 DOI: 10.1145/3457682.3457768
Jihong Zheng, He He, Longteng Cong
The probability hypothesis density filter with linear Gaussian jump Markov system multi-target models is an attractive approach to tracking multiple maneuvering targets in the presence of data association uncertainty, clutter, noise, and detection uncertainty. However, these models are not precise enough to describe moving targets on road networks in ground target tracking scenario. In this paper, the road map information is integrated into the jump Markov system Gaussian mixture probability hypothesis density (JMS-GM-PHD) filter, and a road-constraint JMS-GM-PHD filter for ground target tracking is proposed. In addition, we then derive the recursive equation of the proposed filter. Simulation results show that the proposed road-constrained JMS-GM-PHD filter is effective in tracking ground moving targets.
线性高斯跳变马尔可夫系统多目标模型的概率假设密度滤波是在存在数据关联不确定性、杂波、噪声和检测不确定性的情况下跟踪多个机动目标的有效方法。然而,在地面目标跟踪场景中,这些模型对道路网络上运动目标的描述不够精确。本文将道路地图信息集成到跳跃马尔可夫系统高斯混合概率假设密度(JMS-GM-PHD)滤波器中,提出了一种道路约束的JMS-GM-PHD滤波器用于地面目标跟踪。此外,我们还推导了该滤波器的递推方程。仿真结果表明,所提出的道路约束JMS-GM-PHD滤波器能够有效地跟踪地面运动目标。
{"title":"Tracking Ground Targets with Road Constraints Using a JMS-GM-PHD Filter","authors":"Jihong Zheng, He He, Longteng Cong","doi":"10.1145/3457682.3457768","DOIUrl":"https://doi.org/10.1145/3457682.3457768","url":null,"abstract":"The probability hypothesis density filter with linear Gaussian jump Markov system multi-target models is an attractive approach to tracking multiple maneuvering targets in the presence of data association uncertainty, clutter, noise, and detection uncertainty. However, these models are not precise enough to describe moving targets on road networks in ground target tracking scenario. In this paper, the road map information is integrated into the jump Markov system Gaussian mixture probability hypothesis density (JMS-GM-PHD) filter, and a road-constraint JMS-GM-PHD filter for ground target tracking is proposed. In addition, we then derive the recursive equation of the proposed filter. Simulation results show that the proposed road-constrained JMS-GM-PHD filter is effective in tracking ground moving targets.","PeriodicalId":142045,"journal":{"name":"2021 13th International Conference on Machine Learning and Computing","volume":"30 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-02-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116347745","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Corpus Construction and Entity Recognition for the Field of Industrial Robot Fault Diagnosis 面向工业机器人故障诊断领域的语料库构建与实体识别
Pub Date : 2021-02-26 DOI: 10.1145/3457682.3457745
Jiale Zhou, Tao Wang, Jianfeng Deng
The fault logs record the fault information generated during the operation process of industrial robots. It contains a large amount of fault knowledge and solution information. It is necessary to extract this information and build the fault diagnosis knowledge graph of industrial robots, which can support remote fault diagnosis of industrial robots without human help. At present, the research of fault diagnosis knowledge graph is still relatively scarce. In this paper, we propose a method of named entity recognition for extracting the knowledge of industrial robot fault diagnosis. The contribution of our paper is to establish the fault field dataset Fault-Data, propose the ontology concept of the fault diagnosis field, and obtain a good field recognition effect through the verification of the entity recognition model of fault diagnosis. Experimental results show that the F value of named entity recognition reaches 91.99%, which provides a certain reference significance for subsequent knowledge extraction and knowledge graph construction.
故障日志记录了工业机器人在运行过程中产生的故障信息。它包含了大量的故障知识和解决方案信息。对这些信息进行提取,构建工业机器人故障诊断知识图谱,支持工业机器人在不需要人工帮助的情况下进行远程故障诊断。目前,对故障诊断知识图谱的研究还比较匮乏。本文提出了一种用于工业机器人故障诊断知识提取的命名实体识别方法。本文的贡献在于建立了故障场数据集fault - data,提出了故障诊断领域的本体概念,并通过对故障诊断实体识别模型的验证获得了良好的领域识别效果。实验结果表明,命名实体识别的F值达到91.99%,为后续的知识提取和知识图构建提供了一定的参考意义。
{"title":"Corpus Construction and Entity Recognition for the Field of Industrial Robot Fault Diagnosis","authors":"Jiale Zhou, Tao Wang, Jianfeng Deng","doi":"10.1145/3457682.3457745","DOIUrl":"https://doi.org/10.1145/3457682.3457745","url":null,"abstract":"The fault logs record the fault information generated during the operation process of industrial robots. It contains a large amount of fault knowledge and solution information. It is necessary to extract this information and build the fault diagnosis knowledge graph of industrial robots, which can support remote fault diagnosis of industrial robots without human help. At present, the research of fault diagnosis knowledge graph is still relatively scarce. In this paper, we propose a method of named entity recognition for extracting the knowledge of industrial robot fault diagnosis. The contribution of our paper is to establish the fault field dataset Fault-Data, propose the ontology concept of the fault diagnosis field, and obtain a good field recognition effect through the verification of the entity recognition model of fault diagnosis. Experimental results show that the F value of named entity recognition reaches 91.99%, which provides a certain reference significance for subsequent knowledge extraction and knowledge graph construction.","PeriodicalId":142045,"journal":{"name":"2021 13th International Conference on Machine Learning and Computing","volume":"67 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-02-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114812374","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
GCN2-NAA: Two-stage Graph Convolutional Networks with Node-Aware Attention for Joint Entity and Relation Extraction GCN2-NAA:节点感知关注的两阶段图卷积网络,用于联合实体和关系提取
Pub Date : 2021-02-26 DOI: 10.1145/3457682.3457765
WeiCai Niu, Quan Chen, Weiwen Zhang, Jianwen Ma, Zhongqiang Hu
Joint extraction of entities and relations is critical for many tasks of Natural Language Processing (NLP), which aims to extract all triplets in the text. However, the huge challenge is that a sentence usually contains overlapping triplets. In this paper, we propose a joint extraction framework named GCN2-NAA based on a two-stage Graph Convolutional Neural networks (GCN) and Node-Aware Attention mechanism. We obtain multi-granularity representations and regional features of words by stacking multiple feature encoders and 1st-phase GCN. Besides, the node-aware attention mechanism and 2nd-phase GCN to capture the soft attention correlation matrix between all words in each relation type. Based on the constructed soft attention correlation matrix, we utilize GCN to further obtain the interaction between entities, relations, and triplets. Experiment results show that GCN2-NAA outperforms baseline models by 6.5% and 11.4% in terms of F1 score on NYT and WebNLG datasets, respectively.
实体和关系的联合提取对于自然语言处理(NLP)的许多任务至关重要,NLP的目标是提取文本中的所有三元组。然而,一个巨大的挑战是一个句子通常包含重叠的三联体。本文提出了一种基于两阶段图卷积神经网络(GCN)和节点感知注意机制的联合提取框架GCN2-NAA。通过叠加多个特征编码器和第一阶段GCN,得到词的多粒度表示和区域特征。此外,利用节点感知注意机制和第二阶段GCN获取各关系类型中所有词之间的软注意关联矩阵。在构建软注意关联矩阵的基础上,利用GCN进一步获得实体、关系和三元组之间的交互关系。实验结果表明,GCN2-NAA在NYT和WebNLG数据集上的F1得分分别比基线模型高6.5%和11.4%。
{"title":"GCN2-NAA: Two-stage Graph Convolutional Networks with Node-Aware Attention for Joint Entity and Relation Extraction","authors":"WeiCai Niu, Quan Chen, Weiwen Zhang, Jianwen Ma, Zhongqiang Hu","doi":"10.1145/3457682.3457765","DOIUrl":"https://doi.org/10.1145/3457682.3457765","url":null,"abstract":"Joint extraction of entities and relations is critical for many tasks of Natural Language Processing (NLP), which aims to extract all triplets in the text. However, the huge challenge is that a sentence usually contains overlapping triplets. In this paper, we propose a joint extraction framework named GCN2-NAA based on a two-stage Graph Convolutional Neural networks (GCN) and Node-Aware Attention mechanism. We obtain multi-granularity representations and regional features of words by stacking multiple feature encoders and 1st-phase GCN. Besides, the node-aware attention mechanism and 2nd-phase GCN to capture the soft attention correlation matrix between all words in each relation type. Based on the constructed soft attention correlation matrix, we utilize GCN to further obtain the interaction between entities, relations, and triplets. Experiment results show that GCN2-NAA outperforms baseline models by 6.5% and 11.4% in terms of F1 score on NYT and WebNLG datasets, respectively.","PeriodicalId":142045,"journal":{"name":"2021 13th International Conference on Machine Learning and Computing","volume":"45 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-02-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114822200","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Factors Affecting Accuracy of Genotype Imputation Using Neural Networks in Deep Learning 影响深度学习中神经网络基因型输入准确性的因素
Pub Date : 2021-02-26 DOI: 10.1145/3457682.3457688
Tianfeng Shi, Jing Peng
The genotype imputation is an important topic in the field of genomics. Many genome analyses require data without missing values, which requires to impute the missing data. In recent years, deep learning has become hot, and it is more suitable for text sequence type problems, which may fit with the genotype imputation problem. Based on the recurrent neural network and convolutional neural network in deep learning, our study proposes and constructs five model combinations, imputes and compares the results under different missing rate scenarios. And on the basis of the basic model, a higher imputation accuracy is obtained by tuning the model hyperparameters. The results indicated that on all the data sets with various levels of missing rates, the CNN1D-RNNM with tuned hyperparameters well has obtained the best results. The combination of a one-dimensional convolutional neural network and a recurrent neural network with tuned hyperparameters can beat a single convolutional network or a recurrent network at various levels of missing rates. This research provides new solutions for genotype imputation by using the deep learning to build complex neural networks.
基因型插补是基因组学领域的一个重要课题。许多基因组分析需要没有缺失值的数据,这就需要对缺失的数据进行计算。近年来,深度学习成为热门,它更适合于文本序列类型问题,这可能适合基因型归算问题。本研究基于深度学习中的递归神经网络和卷积神经网络,提出并构建了五种模型组合,并对不同缺失率情景下的结果进行了估算和比较。在基本模型的基础上,通过对模型超参数的调整,获得了更高的插补精度。结果表明,在不同缺失率水平的数据集上,超参数调优的CNN1D-RNNM获得了最好的效果。一维卷积神经网络和具有调谐超参数的递归神经网络的组合可以在不同的缺失率水平上击败单个卷积网络或递归网络。本研究利用深度学习构建复杂神经网络,为基因型插补提供了新的解决方案。
{"title":"Factors Affecting Accuracy of Genotype Imputation Using Neural Networks in Deep Learning","authors":"Tianfeng Shi, Jing Peng","doi":"10.1145/3457682.3457688","DOIUrl":"https://doi.org/10.1145/3457682.3457688","url":null,"abstract":"The genotype imputation is an important topic in the field of genomics. Many genome analyses require data without missing values, which requires to impute the missing data. In recent years, deep learning has become hot, and it is more suitable for text sequence type problems, which may fit with the genotype imputation problem. Based on the recurrent neural network and convolutional neural network in deep learning, our study proposes and constructs five model combinations, imputes and compares the results under different missing rate scenarios. And on the basis of the basic model, a higher imputation accuracy is obtained by tuning the model hyperparameters. The results indicated that on all the data sets with various levels of missing rates, the CNN1D-RNNM with tuned hyperparameters well has obtained the best results. The combination of a one-dimensional convolutional neural network and a recurrent neural network with tuned hyperparameters can beat a single convolutional network or a recurrent network at various levels of missing rates. This research provides new solutions for genotype imputation by using the deep learning to build complex neural networks.","PeriodicalId":142045,"journal":{"name":"2021 13th International Conference on Machine Learning and Computing","volume":"21 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-02-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127045806","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
MNGAN: Detecting Anomalies with Memorized Normal Patterns MNGAN:利用记忆正常模式检测异常情况
Pub Date : 2021-02-26 DOI: 10.1145/3457682.3457764
Zijian Huang, Changqing Xu
Anomaly detection is an significant problem in machine learning and has been well-studied in a wide range of applications. To model complex and high dimensional data distributions, existing methods, trained with an auto-encoder architecture either directly or indirectly, usually attempt to produce higher reconstruction error for anomalies than normal samples. However, lacking constrains on the latent representation of input data results in an unexpected performance that anomalies can be reconstructed well too, leading to the “missed alarm”. In this work, we propose to reconstruct input data with typical patterns of normal data learned through adversarial networks. Our approach, called MNGAN, which employs an encoder-decoder-encoder architecture with a memory network, learns to memorize prototypical patterns of normal data and simultaneously preserve details of data style for better reconstruction. In test phase, given a input data, the model will reconstruct it with the most relevant memory item, which indicates one normal pattern. Thus, reconstructions of anomalous data are similar to normal samples, resulting in effective detection for anomalies due to the high reconstruction error. Experiments over several benchmark datasets, from varying domains, shows that our proposed method outperforms previous state-of-the-art anomaly detection approaches.
异常检测是机器学习中的一个重要问题,在广泛的应用中得到了深入研究。为了对复杂的高维数据分布进行建模,现有方法直接或间接地使用自动编码器架构进行训练,通常试图对异常数据产生比正常样本更高的重构误差。然而,由于缺乏对输入数据潜在表示的约束,异常数据也能被很好地重建,从而导致 "漏报"。在这项工作中,我们提出利用通过对抗网络学习到的正常数据的典型模式来重建输入数据。我们的方法被称为 MNGAN,它采用了带有记忆网络的编码器-解码器-编码器架构,可以学习记忆正常数据的原型模式,同时保留数据风格的细节,以获得更好的重构效果。在测试阶段,给定一个输入数据,模型将用最相关的记忆项进行重构,该记忆项表示一种正常模式。因此,异常数据的重构与正常样本相似,由于重构误差大,因此能有效检测异常数据。在不同领域的多个基准数据集上进行的实验表明,我们提出的方法优于以往最先进的异常检测方法。
{"title":"MNGAN: Detecting Anomalies with Memorized Normal Patterns","authors":"Zijian Huang, Changqing Xu","doi":"10.1145/3457682.3457764","DOIUrl":"https://doi.org/10.1145/3457682.3457764","url":null,"abstract":"Anomaly detection is an significant problem in machine learning and has been well-studied in a wide range of applications. To model complex and high dimensional data distributions, existing methods, trained with an auto-encoder architecture either directly or indirectly, usually attempt to produce higher reconstruction error for anomalies than normal samples. However, lacking constrains on the latent representation of input data results in an unexpected performance that anomalies can be reconstructed well too, leading to the “missed alarm”. In this work, we propose to reconstruct input data with typical patterns of normal data learned through adversarial networks. Our approach, called MNGAN, which employs an encoder-decoder-encoder architecture with a memory network, learns to memorize prototypical patterns of normal data and simultaneously preserve details of data style for better reconstruction. In test phase, given a input data, the model will reconstruct it with the most relevant memory item, which indicates one normal pattern. Thus, reconstructions of anomalous data are similar to normal samples, resulting in effective detection for anomalies due to the high reconstruction error. Experiments over several benchmark datasets, from varying domains, shows that our proposed method outperforms previous state-of-the-art anomaly detection approaches.","PeriodicalId":142045,"journal":{"name":"2021 13th International Conference on Machine Learning and Computing","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-02-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129916706","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Maneuvering Target Tracking Based on Neural Network and Error Self-correction Technology
Pub Date : 2021-02-26 DOI: 10.1145/3457682.3457708
Lisi Chen, Changcheng Wang, Jiale Huang
Neural network has strong nonlinear data characterization ability and solves many complex problems successfully. Trajectory estimation and prediction is time series forecasting but different from convention problems such as time video analysis. A method based on neural network and error self-correction technology achieving trajectory estimation and prediction is proposed in this paper. The method needs neural network without additional filtering algorithm, so the maneuver models and noise characteristics are not needed. According to the information of the previous moments before the investigated time, the information of the next moment or a specified time later can be obtained. For tracking a simple maneuvering target model with unknown parameters and noise characteristics, numerical simulation results show that FNN achieves filtering and it achieves a higher prediction accuracy than the Least Squares filtering. For tracking a complex maneuvering target model with strong nonlinearity, RNN combining with FNN is employed. For the measurement error with D standard deviation 2m, azimuth angle and the altitude angle measurement errors standard deviation with 2mil, the angle predicting error standard deviation is less than 1.3mil, which shows RNN combing with error self-correction technology has high accuracy. It meets the technical requirements for maneuvering target tracking as well as various similar applications.
神经网络具有很强的非线性数据表征能力,成功地解决了许多复杂的问题。轨迹估计与预测是一种时间序列预测,但不同于时间视频分析等常规问题。提出了一种基于神经网络和误差自校正技术实现弹道估计和预测的方法。该方法使用神经网络,不需要额外的滤波算法,因此不需要机动模型和噪声特性。根据被调查时间之前的前一时刻的信息,可以得到下一时刻或指定时间之后的信息。对于一个参数未知、噪声特性未知的简单机动目标模型,数值仿真结果表明,FNN实现了滤波,并且比最小二乘滤波具有更高的预测精度。针对具有强非线性的复杂机动目标模型,采用RNN与FNN相结合的方法进行跟踪。对于D测量误差标准差为2m,方位角和高度角测量误差标准差为2mil,角度预测误差标准差小于1.3mil,表明RNN结合误差自校正技术具有较高的精度。满足机动目标跟踪以及各种类似应用的技术要求。
{"title":"Maneuvering Target Tracking Based on Neural Network and Error Self-correction Technology","authors":"Lisi Chen, Changcheng Wang, Jiale Huang","doi":"10.1145/3457682.3457708","DOIUrl":"https://doi.org/10.1145/3457682.3457708","url":null,"abstract":"Neural network has strong nonlinear data characterization ability and solves many complex problems successfully. Trajectory estimation and prediction is time series forecasting but different from convention problems such as time video analysis. A method based on neural network and error self-correction technology achieving trajectory estimation and prediction is proposed in this paper. The method needs neural network without additional filtering algorithm, so the maneuver models and noise characteristics are not needed. According to the information of the previous moments before the investigated time, the information of the next moment or a specified time later can be obtained. For tracking a simple maneuvering target model with unknown parameters and noise characteristics, numerical simulation results show that FNN achieves filtering and it achieves a higher prediction accuracy than the Least Squares filtering. For tracking a complex maneuvering target model with strong nonlinearity, RNN combining with FNN is employed. For the measurement error with D standard deviation 2m, azimuth angle and the altitude angle measurement errors standard deviation with 2mil, the angle predicting error standard deviation is less than 1.3mil, which shows RNN combing with error self-correction technology has high accuracy. It meets the technical requirements for maneuvering target tracking as well as various similar applications.","PeriodicalId":142045,"journal":{"name":"2021 13th International Conference on Machine Learning and Computing","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-02-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131071881","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
2021 13th International Conference on Machine Learning and Computing
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1