首页 > 最新文献

2021 13th International Conference on Machine Learning and Computing最新文献

英文 中文
A Gradient heatmap based Table Structure Recognition 基于梯度热图的表结构识别
Pub Date : 2021-02-26 DOI: 10.1145/3457682.3457752
Lingjun Kong, Yunchao Bao, Qianwen Wang, Lijun Cao, Shengmei Zhao
Most methods to recognize the structure of a table are to use the object detection approach to directly locate each cell in the table or to segment the table line based on the fully convolutional network (FCN). The problem of the former is that it is laborious to recognize the distorted table, while the problem of the latter is that the sample imbalance makes it difficult to train the model. In this paper, a gradient heatmap based table structure recognition method is proposed, by exploring the gradient heatmaps of the vertical lines and horizontal lines in the table. Specifically, the pixels of the vertical lines of the table are obtained according to the gradient heatmap, then the pixels of the horizontal lines are obtained using the same method, and finally the table structure is restored by using the connected domain search method. Compared with the Single Shot MultiBox Detector (SSD) and Faster RCNN that directly detects cells, our Average Precision (AP) value reached up to 99.5%, which is much higher than the above models. Additionally, we demonstrate that the AP values of the proposed models are reduced almost negligibly when the IoU threshold increased from 0.5 to 0.75, while the AP value of the fast RCNN and SSD model decreased significantly.
大多数识别表结构的方法是使用目标检测方法直接定位表中的每个单元格或基于全卷积网络(FCN)对表行进行分割。前者的问题是识别扭曲的表很费力,而后者的问题是样本不平衡导致模型训练困难。本文提出了一种基于梯度热图的表格结构识别方法,通过探索表格中垂直线和水平线的梯度热图。具体来说,首先根据梯度热图获得表格的垂直线像素,然后使用相同的方法获得水平线像素,最后使用连通域搜索方法恢复表格结构。与直接检测细胞的Single Shot MultiBox Detector (SSD)和Faster RCNN相比,我们的Average Precision (AP)值高达99.5%,大大高于上述模型。此外,我们证明,当IoU阈值从0.5增加到0.75时,所提出模型的AP值降低几乎可以忽略不计,而快速RCNN和SSD模型的AP值显著降低。
{"title":"A Gradient heatmap based Table Structure Recognition","authors":"Lingjun Kong, Yunchao Bao, Qianwen Wang, Lijun Cao, Shengmei Zhao","doi":"10.1145/3457682.3457752","DOIUrl":"https://doi.org/10.1145/3457682.3457752","url":null,"abstract":"Most methods to recognize the structure of a table are to use the object detection approach to directly locate each cell in the table or to segment the table line based on the fully convolutional network (FCN). The problem of the former is that it is laborious to recognize the distorted table, while the problem of the latter is that the sample imbalance makes it difficult to train the model. In this paper, a gradient heatmap based table structure recognition method is proposed, by exploring the gradient heatmaps of the vertical lines and horizontal lines in the table. Specifically, the pixels of the vertical lines of the table are obtained according to the gradient heatmap, then the pixels of the horizontal lines are obtained using the same method, and finally the table structure is restored by using the connected domain search method. Compared with the Single Shot MultiBox Detector (SSD) and Faster RCNN that directly detects cells, our Average Precision (AP) value reached up to 99.5%, which is much higher than the above models. Additionally, we demonstrate that the AP values of the proposed models are reduced almost negligibly when the IoU threshold increased from 0.5 to 0.75, while the AP value of the fast RCNN and SSD model decreased significantly.","PeriodicalId":142045,"journal":{"name":"2021 13th International Conference on Machine Learning and Computing","volume":"85 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-02-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125948020","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Research on Deep Sound Source Separation 深声源分离技术研究
Pub Date : 2021-02-26 DOI: 10.1145/3457682.3457741
Yunuo Yang, Honghui Li
The cocktail party effect is a fundamental problem in sound source separation, and many researchers have worked to solve this problem. In recent years, the most popular algorithms to solve the problem of sound source separation are Support Vector Machine (SVM), Gaussian Mixture Model (GMM), non-negative matrix factorization (NMF), and Variational Autoencoder (VAE). Especially VAE model showed excellent ability in dealing with the problem of sound separation. In this paper, the β-VAE model, combined with a weakly supervised classification proposed by Karamatlı et al., was first reproduced. Since Karamatlı's experiment only completed the connection between sound and words, in order to learn more information about the speaker, this model is used to learn a mapping between sounds and individual speakers and a mapping between sounds and gender. It turns out that the separation results could be obtained by retraining the model after the establishment of the new 'male' and 'female' labels. his result lays a foundation for the future study of the mapping between individuals and words. When the tag is specific to an individual, more data is needed to support this experiment, and the more data available for training, the better result the model will get.
鸡尾酒会效应是声源分离中的一个基本问题,许多研究者都在努力解决这个问题。近年来,解决声源分离问题最流行的算法是支持向量机(SVM)、高斯混合模型(GMM)、非负矩阵分解(NMF)和变分自编码器(VAE)。特别是VAE模型在处理声分离问题上表现出了出色的能力。本文首先再现了β-VAE模型,并结合karamatlati等人提出的弱监督分类。由于karamatlar的实验只完成了声音和单词之间的联系,为了了解更多关于说话人的信息,这个模型被用来学习声音和说话人个体之间的映射,以及声音和性别之间的映射。结果表明,在建立新的“男性”和“女性”标签后,可以通过重新训练模型来获得分离结果。他的研究结果为今后研究个体与词汇之间的映射关系奠定了基础。当标签是针对个体的时候,需要更多的数据来支持这个实验,训练的数据越多,模型得到的结果就越好。
{"title":"Research on Deep Sound Source Separation","authors":"Yunuo Yang, Honghui Li","doi":"10.1145/3457682.3457741","DOIUrl":"https://doi.org/10.1145/3457682.3457741","url":null,"abstract":"The cocktail party effect is a fundamental problem in sound source separation, and many researchers have worked to solve this problem. In recent years, the most popular algorithms to solve the problem of sound source separation are Support Vector Machine (SVM), Gaussian Mixture Model (GMM), non-negative matrix factorization (NMF), and Variational Autoencoder (VAE). Especially VAE model showed excellent ability in dealing with the problem of sound separation. In this paper, the β-VAE model, combined with a weakly supervised classification proposed by Karamatlı et al., was first reproduced. Since Karamatlı's experiment only completed the connection between sound and words, in order to learn more information about the speaker, this model is used to learn a mapping between sounds and individual speakers and a mapping between sounds and gender. It turns out that the separation results could be obtained by retraining the model after the establishment of the new 'male' and 'female' labels. his result lays a foundation for the future study of the mapping between individuals and words. When the tag is specific to an individual, more data is needed to support this experiment, and the more data available for training, the better result the model will get.","PeriodicalId":142045,"journal":{"name":"2021 13th International Conference on Machine Learning and Computing","volume":"126 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-02-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124209903","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Novel Spec-CNN-CTC Model for End-to-End Speech Recognition 端到端语音识别的新型Spec-CNN-CTC模型
Pub Date : 2021-02-26 DOI: 10.1145/3457682.3457703
Jing Xue, Jun Zhang
This paper discusses the application of a special data augmentation approach for end-to-end phone recognition system on the Deep Neural Networks. The system improves the performance of phone recognition and alleviates overfitting during training. Also, it offers a solution to the problem of few public datasets annotated at the phone level. And we propose the CNN-CTC structure as a baseline model. The model is based on Convolutional Neural Networks (CNNs) and Connectionist Temporal Classification (CTC) objective function. Which is an end-to-end structure, and there is no need to force alignment each frame of audio. The SpecAugment approach directly processes the feature of audio, such as the log Mel-spectrogram. In our experiment, the Spec-CNN-CTC system achieves a phone error rate of 16.11% on TIMIT corpus with no prior linguistic information. Which is outperforming the previous work Acoustic-State-Transition Model (ASTM) by 27.63%, the DNN-HMM with MFCC + IFCC features by 16.8%, the RNN-CRF model by 17.3% and the DBM-DNN model by 22.62%.
本文讨论了一种特殊的数据增强方法在深度神经网络端到端手机识别系统中的应用。该系统提高了手机识别的性能,缓解了训练过程中的过拟合问题。此外,它还解决了在电话级别上标注的公共数据集较少的问题。我们提出了CNN-CTC结构作为基线模型。该模型基于卷积神经网络(cnn)和连接时间分类(CTC)目标函数。这是一个端到端的结构,不需要强制对齐每一帧音频。SpecAugment方法直接处理音频的特征,如对数梅尔谱图。在我们的实验中,Spec-CNN-CTC系统在没有先验语言信息的TIMIT语料库上实现了16.11%的电话错误率。它比之前的声学状态转换模型(ASTM)高27.63%,比具有MFCC + IFCC特征的DNN-HMM高16.8%,比RNN-CRF模型高17.3%,比DBM-DNN模型高22.62%。
{"title":"A Novel Spec-CNN-CTC Model for End-to-End Speech Recognition","authors":"Jing Xue, Jun Zhang","doi":"10.1145/3457682.3457703","DOIUrl":"https://doi.org/10.1145/3457682.3457703","url":null,"abstract":"This paper discusses the application of a special data augmentation approach for end-to-end phone recognition system on the Deep Neural Networks. The system improves the performance of phone recognition and alleviates overfitting during training. Also, it offers a solution to the problem of few public datasets annotated at the phone level. And we propose the CNN-CTC structure as a baseline model. The model is based on Convolutional Neural Networks (CNNs) and Connectionist Temporal Classification (CTC) objective function. Which is an end-to-end structure, and there is no need to force alignment each frame of audio. The SpecAugment approach directly processes the feature of audio, such as the log Mel-spectrogram. In our experiment, the Spec-CNN-CTC system achieves a phone error rate of 16.11% on TIMIT corpus with no prior linguistic information. Which is outperforming the previous work Acoustic-State-Transition Model (ASTM) by 27.63%, the DNN-HMM with MFCC + IFCC features by 16.8%, the RNN-CRF model by 17.3% and the DBM-DNN model by 22.62%.","PeriodicalId":142045,"journal":{"name":"2021 13th International Conference on Machine Learning and Computing","volume":"17 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-02-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121212512","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Ensemble Learning in Stock Market Prediction 股票市场预测中的集成学习
Pub Date : 2021-02-26 DOI: 10.1145/3457682.3457727
Hassan Ezzeddine, Roger Achkar
In recent years, the increasing influence of machine learning in different industries had inspired many traders to benefit from it in the world of finance, stock trading is one of the most important activities. Predicting the direction of stock prices is a widely studied subject in many fields including trading, finance, statistics and computer science. The main concern for Investors is to maximize their profit if they determine when to buy/sell an investment they apply Analytical methods that makes use of different sources ranging from news to price data, all aiming at predicting the company's future stock price ML applications have presented investors with something new. A combination of technologies that could entirely reshape the way they make investment decisions. The purpose of this thesis is to leverage the aggregation of technical, fundamental, and sentiment analysis with stacked machine learning models capable of predicting profitable actions to be executed.
近年来,机器学习在不同行业的影响力越来越大,激发了许多交易者从中受益,在金融领域,股票交易是最重要的活动之一。预测股票价格的走向是一个在许多领域广泛研究的课题,包括交易、金融、统计和计算机科学。投资者主要关心的是,如果他们决定何时买入/卖出投资,他们应用分析方法,利用从新闻到价格数据等不同来源,所有这些方法都旨在预测公司未来的股价,ML应用程序为投资者提供了一些新的东西。这些技术的组合可能会完全重塑他们做出投资决策的方式。本文的目的是利用技术、基础和情绪分析的聚合,以及堆叠的机器学习模型,能够预测将要执行的有利可图的操作。
{"title":"Ensemble Learning in Stock Market Prediction","authors":"Hassan Ezzeddine, Roger Achkar","doi":"10.1145/3457682.3457727","DOIUrl":"https://doi.org/10.1145/3457682.3457727","url":null,"abstract":"In recent years, the increasing influence of machine learning in different industries had inspired many traders to benefit from it in the world of finance, stock trading is one of the most important activities. Predicting the direction of stock prices is a widely studied subject in many fields including trading, finance, statistics and computer science. The main concern for Investors is to maximize their profit if they determine when to buy/sell an investment they apply Analytical methods that makes use of different sources ranging from news to price data, all aiming at predicting the company's future stock price ML applications have presented investors with something new. A combination of technologies that could entirely reshape the way they make investment decisions. The purpose of this thesis is to leverage the aggregation of technical, fundamental, and sentiment analysis with stacked machine learning models capable of predicting profitable actions to be executed.","PeriodicalId":142045,"journal":{"name":"2021 13th International Conference on Machine Learning and Computing","volume":"39 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-02-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129293304","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Tracking Ground Targets with Road Constraints Using a JMS-GM-PHD Filter 利用JMS-GM-PHD滤波器跟踪道路约束下的地面目标
Pub Date : 2021-02-26 DOI: 10.1145/3457682.3457768
Jihong Zheng, He He, Longteng Cong
The probability hypothesis density filter with linear Gaussian jump Markov system multi-target models is an attractive approach to tracking multiple maneuvering targets in the presence of data association uncertainty, clutter, noise, and detection uncertainty. However, these models are not precise enough to describe moving targets on road networks in ground target tracking scenario. In this paper, the road map information is integrated into the jump Markov system Gaussian mixture probability hypothesis density (JMS-GM-PHD) filter, and a road-constraint JMS-GM-PHD filter for ground target tracking is proposed. In addition, we then derive the recursive equation of the proposed filter. Simulation results show that the proposed road-constrained JMS-GM-PHD filter is effective in tracking ground moving targets.
线性高斯跳变马尔可夫系统多目标模型的概率假设密度滤波是在存在数据关联不确定性、杂波、噪声和检测不确定性的情况下跟踪多个机动目标的有效方法。然而,在地面目标跟踪场景中,这些模型对道路网络上运动目标的描述不够精确。本文将道路地图信息集成到跳跃马尔可夫系统高斯混合概率假设密度(JMS-GM-PHD)滤波器中,提出了一种道路约束的JMS-GM-PHD滤波器用于地面目标跟踪。此外,我们还推导了该滤波器的递推方程。仿真结果表明,所提出的道路约束JMS-GM-PHD滤波器能够有效地跟踪地面运动目标。
{"title":"Tracking Ground Targets with Road Constraints Using a JMS-GM-PHD Filter","authors":"Jihong Zheng, He He, Longteng Cong","doi":"10.1145/3457682.3457768","DOIUrl":"https://doi.org/10.1145/3457682.3457768","url":null,"abstract":"The probability hypothesis density filter with linear Gaussian jump Markov system multi-target models is an attractive approach to tracking multiple maneuvering targets in the presence of data association uncertainty, clutter, noise, and detection uncertainty. However, these models are not precise enough to describe moving targets on road networks in ground target tracking scenario. In this paper, the road map information is integrated into the jump Markov system Gaussian mixture probability hypothesis density (JMS-GM-PHD) filter, and a road-constraint JMS-GM-PHD filter for ground target tracking is proposed. In addition, we then derive the recursive equation of the proposed filter. Simulation results show that the proposed road-constrained JMS-GM-PHD filter is effective in tracking ground moving targets.","PeriodicalId":142045,"journal":{"name":"2021 13th International Conference on Machine Learning and Computing","volume":"30 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-02-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116347745","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Corpus Construction and Entity Recognition for the Field of Industrial Robot Fault Diagnosis 面向工业机器人故障诊断领域的语料库构建与实体识别
Pub Date : 2021-02-26 DOI: 10.1145/3457682.3457745
Jiale Zhou, Tao Wang, Jianfeng Deng
The fault logs record the fault information generated during the operation process of industrial robots. It contains a large amount of fault knowledge and solution information. It is necessary to extract this information and build the fault diagnosis knowledge graph of industrial robots, which can support remote fault diagnosis of industrial robots without human help. At present, the research of fault diagnosis knowledge graph is still relatively scarce. In this paper, we propose a method of named entity recognition for extracting the knowledge of industrial robot fault diagnosis. The contribution of our paper is to establish the fault field dataset Fault-Data, propose the ontology concept of the fault diagnosis field, and obtain a good field recognition effect through the verification of the entity recognition model of fault diagnosis. Experimental results show that the F value of named entity recognition reaches 91.99%, which provides a certain reference significance for subsequent knowledge extraction and knowledge graph construction.
故障日志记录了工业机器人在运行过程中产生的故障信息。它包含了大量的故障知识和解决方案信息。对这些信息进行提取,构建工业机器人故障诊断知识图谱,支持工业机器人在不需要人工帮助的情况下进行远程故障诊断。目前,对故障诊断知识图谱的研究还比较匮乏。本文提出了一种用于工业机器人故障诊断知识提取的命名实体识别方法。本文的贡献在于建立了故障场数据集fault - data,提出了故障诊断领域的本体概念,并通过对故障诊断实体识别模型的验证获得了良好的领域识别效果。实验结果表明,命名实体识别的F值达到91.99%,为后续的知识提取和知识图构建提供了一定的参考意义。
{"title":"Corpus Construction and Entity Recognition for the Field of Industrial Robot Fault Diagnosis","authors":"Jiale Zhou, Tao Wang, Jianfeng Deng","doi":"10.1145/3457682.3457745","DOIUrl":"https://doi.org/10.1145/3457682.3457745","url":null,"abstract":"The fault logs record the fault information generated during the operation process of industrial robots. It contains a large amount of fault knowledge and solution information. It is necessary to extract this information and build the fault diagnosis knowledge graph of industrial robots, which can support remote fault diagnosis of industrial robots without human help. At present, the research of fault diagnosis knowledge graph is still relatively scarce. In this paper, we propose a method of named entity recognition for extracting the knowledge of industrial robot fault diagnosis. The contribution of our paper is to establish the fault field dataset Fault-Data, propose the ontology concept of the fault diagnosis field, and obtain a good field recognition effect through the verification of the entity recognition model of fault diagnosis. Experimental results show that the F value of named entity recognition reaches 91.99%, which provides a certain reference significance for subsequent knowledge extraction and knowledge graph construction.","PeriodicalId":142045,"journal":{"name":"2021 13th International Conference on Machine Learning and Computing","volume":"67 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-02-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114812374","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
GCN2-NAA: Two-stage Graph Convolutional Networks with Node-Aware Attention for Joint Entity and Relation Extraction GCN2-NAA:节点感知关注的两阶段图卷积网络,用于联合实体和关系提取
Pub Date : 2021-02-26 DOI: 10.1145/3457682.3457765
WeiCai Niu, Quan Chen, Weiwen Zhang, Jianwen Ma, Zhongqiang Hu
Joint extraction of entities and relations is critical for many tasks of Natural Language Processing (NLP), which aims to extract all triplets in the text. However, the huge challenge is that a sentence usually contains overlapping triplets. In this paper, we propose a joint extraction framework named GCN2-NAA based on a two-stage Graph Convolutional Neural networks (GCN) and Node-Aware Attention mechanism. We obtain multi-granularity representations and regional features of words by stacking multiple feature encoders and 1st-phase GCN. Besides, the node-aware attention mechanism and 2nd-phase GCN to capture the soft attention correlation matrix between all words in each relation type. Based on the constructed soft attention correlation matrix, we utilize GCN to further obtain the interaction between entities, relations, and triplets. Experiment results show that GCN2-NAA outperforms baseline models by 6.5% and 11.4% in terms of F1 score on NYT and WebNLG datasets, respectively.
实体和关系的联合提取对于自然语言处理(NLP)的许多任务至关重要,NLP的目标是提取文本中的所有三元组。然而,一个巨大的挑战是一个句子通常包含重叠的三联体。本文提出了一种基于两阶段图卷积神经网络(GCN)和节点感知注意机制的联合提取框架GCN2-NAA。通过叠加多个特征编码器和第一阶段GCN,得到词的多粒度表示和区域特征。此外,利用节点感知注意机制和第二阶段GCN获取各关系类型中所有词之间的软注意关联矩阵。在构建软注意关联矩阵的基础上,利用GCN进一步获得实体、关系和三元组之间的交互关系。实验结果表明,GCN2-NAA在NYT和WebNLG数据集上的F1得分分别比基线模型高6.5%和11.4%。
{"title":"GCN2-NAA: Two-stage Graph Convolutional Networks with Node-Aware Attention for Joint Entity and Relation Extraction","authors":"WeiCai Niu, Quan Chen, Weiwen Zhang, Jianwen Ma, Zhongqiang Hu","doi":"10.1145/3457682.3457765","DOIUrl":"https://doi.org/10.1145/3457682.3457765","url":null,"abstract":"Joint extraction of entities and relations is critical for many tasks of Natural Language Processing (NLP), which aims to extract all triplets in the text. However, the huge challenge is that a sentence usually contains overlapping triplets. In this paper, we propose a joint extraction framework named GCN2-NAA based on a two-stage Graph Convolutional Neural networks (GCN) and Node-Aware Attention mechanism. We obtain multi-granularity representations and regional features of words by stacking multiple feature encoders and 1st-phase GCN. Besides, the node-aware attention mechanism and 2nd-phase GCN to capture the soft attention correlation matrix between all words in each relation type. Based on the constructed soft attention correlation matrix, we utilize GCN to further obtain the interaction between entities, relations, and triplets. Experiment results show that GCN2-NAA outperforms baseline models by 6.5% and 11.4% in terms of F1 score on NYT and WebNLG datasets, respectively.","PeriodicalId":142045,"journal":{"name":"2021 13th International Conference on Machine Learning and Computing","volume":"45 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-02-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114822200","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Factors Affecting Accuracy of Genotype Imputation Using Neural Networks in Deep Learning 影响深度学习中神经网络基因型输入准确性的因素
Pub Date : 2021-02-26 DOI: 10.1145/3457682.3457688
Tianfeng Shi, Jing Peng
The genotype imputation is an important topic in the field of genomics. Many genome analyses require data without missing values, which requires to impute the missing data. In recent years, deep learning has become hot, and it is more suitable for text sequence type problems, which may fit with the genotype imputation problem. Based on the recurrent neural network and convolutional neural network in deep learning, our study proposes and constructs five model combinations, imputes and compares the results under different missing rate scenarios. And on the basis of the basic model, a higher imputation accuracy is obtained by tuning the model hyperparameters. The results indicated that on all the data sets with various levels of missing rates, the CNN1D-RNNM with tuned hyperparameters well has obtained the best results. The combination of a one-dimensional convolutional neural network and a recurrent neural network with tuned hyperparameters can beat a single convolutional network or a recurrent network at various levels of missing rates. This research provides new solutions for genotype imputation by using the deep learning to build complex neural networks.
基因型插补是基因组学领域的一个重要课题。许多基因组分析需要没有缺失值的数据,这就需要对缺失的数据进行计算。近年来,深度学习成为热门,它更适合于文本序列类型问题,这可能适合基因型归算问题。本研究基于深度学习中的递归神经网络和卷积神经网络,提出并构建了五种模型组合,并对不同缺失率情景下的结果进行了估算和比较。在基本模型的基础上,通过对模型超参数的调整,获得了更高的插补精度。结果表明,在不同缺失率水平的数据集上,超参数调优的CNN1D-RNNM获得了最好的效果。一维卷积神经网络和具有调谐超参数的递归神经网络的组合可以在不同的缺失率水平上击败单个卷积网络或递归网络。本研究利用深度学习构建复杂神经网络,为基因型插补提供了新的解决方案。
{"title":"Factors Affecting Accuracy of Genotype Imputation Using Neural Networks in Deep Learning","authors":"Tianfeng Shi, Jing Peng","doi":"10.1145/3457682.3457688","DOIUrl":"https://doi.org/10.1145/3457682.3457688","url":null,"abstract":"The genotype imputation is an important topic in the field of genomics. Many genome analyses require data without missing values, which requires to impute the missing data. In recent years, deep learning has become hot, and it is more suitable for text sequence type problems, which may fit with the genotype imputation problem. Based on the recurrent neural network and convolutional neural network in deep learning, our study proposes and constructs five model combinations, imputes and compares the results under different missing rate scenarios. And on the basis of the basic model, a higher imputation accuracy is obtained by tuning the model hyperparameters. The results indicated that on all the data sets with various levels of missing rates, the CNN1D-RNNM with tuned hyperparameters well has obtained the best results. The combination of a one-dimensional convolutional neural network and a recurrent neural network with tuned hyperparameters can beat a single convolutional network or a recurrent network at various levels of missing rates. This research provides new solutions for genotype imputation by using the deep learning to build complex neural networks.","PeriodicalId":142045,"journal":{"name":"2021 13th International Conference on Machine Learning and Computing","volume":"21 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-02-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127045806","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
MNGAN: Detecting Anomalies with Memorized Normal Patterns MNGAN:利用记忆正常模式检测异常情况
Pub Date : 2021-02-26 DOI: 10.1145/3457682.3457764
Zijian Huang, Changqing Xu
Anomaly detection is an significant problem in machine learning and has been well-studied in a wide range of applications. To model complex and high dimensional data distributions, existing methods, trained with an auto-encoder architecture either directly or indirectly, usually attempt to produce higher reconstruction error for anomalies than normal samples. However, lacking constrains on the latent representation of input data results in an unexpected performance that anomalies can be reconstructed well too, leading to the “missed alarm”. In this work, we propose to reconstruct input data with typical patterns of normal data learned through adversarial networks. Our approach, called MNGAN, which employs an encoder-decoder-encoder architecture with a memory network, learns to memorize prototypical patterns of normal data and simultaneously preserve details of data style for better reconstruction. In test phase, given a input data, the model will reconstruct it with the most relevant memory item, which indicates one normal pattern. Thus, reconstructions of anomalous data are similar to normal samples, resulting in effective detection for anomalies due to the high reconstruction error. Experiments over several benchmark datasets, from varying domains, shows that our proposed method outperforms previous state-of-the-art anomaly detection approaches.
异常检测是机器学习中的一个重要问题,在广泛的应用中得到了深入研究。为了对复杂的高维数据分布进行建模,现有方法直接或间接地使用自动编码器架构进行训练,通常试图对异常数据产生比正常样本更高的重构误差。然而,由于缺乏对输入数据潜在表示的约束,异常数据也能被很好地重建,从而导致 "漏报"。在这项工作中,我们提出利用通过对抗网络学习到的正常数据的典型模式来重建输入数据。我们的方法被称为 MNGAN,它采用了带有记忆网络的编码器-解码器-编码器架构,可以学习记忆正常数据的原型模式,同时保留数据风格的细节,以获得更好的重构效果。在测试阶段,给定一个输入数据,模型将用最相关的记忆项进行重构,该记忆项表示一种正常模式。因此,异常数据的重构与正常样本相似,由于重构误差大,因此能有效检测异常数据。在不同领域的多个基准数据集上进行的实验表明,我们提出的方法优于以往最先进的异常检测方法。
{"title":"MNGAN: Detecting Anomalies with Memorized Normal Patterns","authors":"Zijian Huang, Changqing Xu","doi":"10.1145/3457682.3457764","DOIUrl":"https://doi.org/10.1145/3457682.3457764","url":null,"abstract":"Anomaly detection is an significant problem in machine learning and has been well-studied in a wide range of applications. To model complex and high dimensional data distributions, existing methods, trained with an auto-encoder architecture either directly or indirectly, usually attempt to produce higher reconstruction error for anomalies than normal samples. However, lacking constrains on the latent representation of input data results in an unexpected performance that anomalies can be reconstructed well too, leading to the “missed alarm”. In this work, we propose to reconstruct input data with typical patterns of normal data learned through adversarial networks. Our approach, called MNGAN, which employs an encoder-decoder-encoder architecture with a memory network, learns to memorize prototypical patterns of normal data and simultaneously preserve details of data style for better reconstruction. In test phase, given a input data, the model will reconstruct it with the most relevant memory item, which indicates one normal pattern. Thus, reconstructions of anomalous data are similar to normal samples, resulting in effective detection for anomalies due to the high reconstruction error. Experiments over several benchmark datasets, from varying domains, shows that our proposed method outperforms previous state-of-the-art anomaly detection approaches.","PeriodicalId":142045,"journal":{"name":"2021 13th International Conference on Machine Learning and Computing","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-02-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129916706","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Maneuvering Target Tracking Based on Neural Network and Error Self-correction Technology
Pub Date : 2021-02-26 DOI: 10.1145/3457682.3457708
Lisi Chen, Changcheng Wang, Jiale Huang
Neural network has strong nonlinear data characterization ability and solves many complex problems successfully. Trajectory estimation and prediction is time series forecasting but different from convention problems such as time video analysis. A method based on neural network and error self-correction technology achieving trajectory estimation and prediction is proposed in this paper. The method needs neural network without additional filtering algorithm, so the maneuver models and noise characteristics are not needed. According to the information of the previous moments before the investigated time, the information of the next moment or a specified time later can be obtained. For tracking a simple maneuvering target model with unknown parameters and noise characteristics, numerical simulation results show that FNN achieves filtering and it achieves a higher prediction accuracy than the Least Squares filtering. For tracking a complex maneuvering target model with strong nonlinearity, RNN combining with FNN is employed. For the measurement error with D standard deviation 2m, azimuth angle and the altitude angle measurement errors standard deviation with 2mil, the angle predicting error standard deviation is less than 1.3mil, which shows RNN combing with error self-correction technology has high accuracy. It meets the technical requirements for maneuvering target tracking as well as various similar applications.
神经网络具有很强的非线性数据表征能力,成功地解决了许多复杂的问题。轨迹估计与预测是一种时间序列预测,但不同于时间视频分析等常规问题。提出了一种基于神经网络和误差自校正技术实现弹道估计和预测的方法。该方法使用神经网络,不需要额外的滤波算法,因此不需要机动模型和噪声特性。根据被调查时间之前的前一时刻的信息,可以得到下一时刻或指定时间之后的信息。对于一个参数未知、噪声特性未知的简单机动目标模型,数值仿真结果表明,FNN实现了滤波,并且比最小二乘滤波具有更高的预测精度。针对具有强非线性的复杂机动目标模型,采用RNN与FNN相结合的方法进行跟踪。对于D测量误差标准差为2m,方位角和高度角测量误差标准差为2mil,角度预测误差标准差小于1.3mil,表明RNN结合误差自校正技术具有较高的精度。满足机动目标跟踪以及各种类似应用的技术要求。
{"title":"Maneuvering Target Tracking Based on Neural Network and Error Self-correction Technology","authors":"Lisi Chen, Changcheng Wang, Jiale Huang","doi":"10.1145/3457682.3457708","DOIUrl":"https://doi.org/10.1145/3457682.3457708","url":null,"abstract":"Neural network has strong nonlinear data characterization ability and solves many complex problems successfully. Trajectory estimation and prediction is time series forecasting but different from convention problems such as time video analysis. A method based on neural network and error self-correction technology achieving trajectory estimation and prediction is proposed in this paper. The method needs neural network without additional filtering algorithm, so the maneuver models and noise characteristics are not needed. According to the information of the previous moments before the investigated time, the information of the next moment or a specified time later can be obtained. For tracking a simple maneuvering target model with unknown parameters and noise characteristics, numerical simulation results show that FNN achieves filtering and it achieves a higher prediction accuracy than the Least Squares filtering. For tracking a complex maneuvering target model with strong nonlinearity, RNN combining with FNN is employed. For the measurement error with D standard deviation 2m, azimuth angle and the altitude angle measurement errors standard deviation with 2mil, the angle predicting error standard deviation is less than 1.3mil, which shows RNN combing with error self-correction technology has high accuracy. It meets the technical requirements for maneuvering target tracking as well as various similar applications.","PeriodicalId":142045,"journal":{"name":"2021 13th International Conference on Machine Learning and Computing","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-02-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131071881","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
2021 13th International Conference on Machine Learning and Computing
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1