首页 > 最新文献

Proceedings of the 2020 3rd International Conference on Algorithms, Computing and Artificial Intelligence最新文献

英文 中文
A Novel Interactive Recurrent Attention Network for Emotion-Cause Pair Extraction 一种用于情绪-原因对提取的交互式循环注意网络
Xiangyu Jia, Xinhai Chen, Qian Wan, Jie Liu
Unlike Emotion Cause Extraction (ECE) task which consists of pre-annotate emotions and passage, emotion-cause pair extraction (ECPE) aims at extracting potential emotions and corresponding causes in the document without the need for pre-annotations. Traditional ECPE solutions divide the extracting emotions and causes operation into two separate parts. However, separating the bidirectional dependence between emotion and cause may lose a lot of potentially useful information. In this paper, we propose a novel interactive recurrent attention network (IRAN). Our approach focuses on the bidirectional impact between emotions and causes, and extracts emotions and causes simultaneously. The information in the document can be fully exploited through multiple modeling and information extraction. Our emotion-specific transformation and distance fusion correlation can adaptively focus on the emotions and the distance, gracefully incorporate them into a distinguishable neural network attention framework. The experimental results show that our proposed model achieves better performance than other widely-used models on the ECPE corpus.
与情绪原因抽取(ECE)任务不同,情绪原因对抽取(ECPE)任务的目的是在不需要预先标注的情况下提取文档中潜在的情绪和相应的原因。传统的ECPE解决方案将情感提取和原因操作分为两个独立的部分。然而,分离情感和原因之间的双向依赖可能会失去很多潜在的有用信息。本文提出了一种新的交互式循环注意网络(IRAN)。我们的方法关注情绪和原因之间的双向影响,同时提取情绪和原因。通过多次建模和信息提取,可以充分利用文档中的信息。我们的情感特异性转换和距离融合关联可以自适应地关注情感和距离,并将它们优雅地融合到一个可识别的神经网络注意框架中。实验结果表明,该模型在ECPE语料库上取得了比其他常用模型更好的性能。
{"title":"A Novel Interactive Recurrent Attention Network for Emotion-Cause Pair Extraction","authors":"Xiangyu Jia, Xinhai Chen, Qian Wan, Jie Liu","doi":"10.1145/3446132.3446195","DOIUrl":"https://doi.org/10.1145/3446132.3446195","url":null,"abstract":"Unlike Emotion Cause Extraction (ECE) task which consists of pre-annotate emotions and passage, emotion-cause pair extraction (ECPE) aims at extracting potential emotions and corresponding causes in the document without the need for pre-annotations. Traditional ECPE solutions divide the extracting emotions and causes operation into two separate parts. However, separating the bidirectional dependence between emotion and cause may lose a lot of potentially useful information. In this paper, we propose a novel interactive recurrent attention network (IRAN). Our approach focuses on the bidirectional impact between emotions and causes, and extracts emotions and causes simultaneously. The information in the document can be fully exploited through multiple modeling and information extraction. Our emotion-specific transformation and distance fusion correlation can adaptively focus on the emotions and the distance, gracefully incorporate them into a distinguishable neural network attention framework. The experimental results show that our proposed model achieves better performance than other widely-used models on the ECPE corpus.","PeriodicalId":125388,"journal":{"name":"Proceedings of the 2020 3rd International Conference on Algorithms, Computing and Artificial Intelligence","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-12-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129705855","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Sentimental Analysis based on hybrid approach of Latent Dirichlet Allocation and Machine Learning for Large-Scale of Imbalanced Twitter Data 基于潜在Dirichlet分配和机器学习混合方法的大规模不平衡Twitter数据情感分析
Nasir Jamal, Xianqiao Chen, Junaid Hussain Abro, Doniyor Tukhtakhunov
Emotions classification in large amount of Twitter's data is very effective to analyze the users’ mood about a concerned product, news, topic, and so on. However, it is really a challenging task to extract meaningful features from a burst of raw tweets as emotions are subjective with limited fuzzy boundaries. These subjective features can be expressed in different terminologies and perceptions. In this paper, we proposed a hybrid approach of LDA and machine learning to predict emotions for large scale of imbalanced tweets. First, the raw tweets are preprocessed using tokenization method for capturing useful features without noisy information. Second, the local and global feature's importance is estimated by applying TFIDF statistical technique. Third, the Latent Dirichlet Allocation (LDA) topic modeling method is used to extract topics from these features. These topics explain concepts of related tweet which is really helpful for classification. Fourth, the Adaptive Synthetic (ADASYN) class balancing technique is applied to oversample the data and balance each class of topic. Finally, the K-Nearest Neighbor (KNN) machine learning algorithm is applied to predict the emotions in extracted topics. The class balancing method increase the significance of minor classes and solve the problem of class imbalance. The proposed approach is evaluated on two different Twitters’ emotions datasets. It is proved that, this methodology outperformed as compared to the popular state of the art methods in terms of precision, recall, f-measure and classification accuracy.
在大量的Twitter数据中进行情绪分类,对于分析用户对关注的产品、新闻、话题等的情绪是非常有效的。然而,从大量原始tweet中提取有意义的特征确实是一项具有挑战性的任务,因为情绪是主观的,具有有限的模糊界限。这些主观特征可以用不同的术语和感知来表达。在本文中,我们提出了一种LDA和机器学习的混合方法来预测大规模不平衡推文的情绪。首先,使用标记化方法对原始tweet进行预处理,以捕获无噪声信息的有用特征。其次,利用TFIDF统计技术估计局部和全局特征的重要性。第三,利用潜狄利克雷分配(Latent Dirichlet Allocation, LDA)主题建模方法从这些特征中提取主题。这些主题解释了相关tweet的概念,这对分类非常有帮助。第四,采用自适应合成(ADASYN)类平衡技术对数据进行过采样,平衡各类主题。最后,应用k -最近邻(KNN)机器学习算法对提取的主题进行情绪预测。班级平衡法提高了辅修班级的重要性,解决了班级失衡问题。该方法在两个不同的twitter情绪数据集上进行了评估。事实证明,该方法在精度,召回率,f-measure和分类精度方面优于流行的最新方法。
{"title":"Sentimental Analysis based on hybrid approach of Latent Dirichlet Allocation and Machine Learning for Large-Scale of Imbalanced Twitter Data","authors":"Nasir Jamal, Xianqiao Chen, Junaid Hussain Abro, Doniyor Tukhtakhunov","doi":"10.1145/3446132.3446413","DOIUrl":"https://doi.org/10.1145/3446132.3446413","url":null,"abstract":"Emotions classification in large amount of Twitter's data is very effective to analyze the users’ mood about a concerned product, news, topic, and so on. However, it is really a challenging task to extract meaningful features from a burst of raw tweets as emotions are subjective with limited fuzzy boundaries. These subjective features can be expressed in different terminologies and perceptions. In this paper, we proposed a hybrid approach of LDA and machine learning to predict emotions for large scale of imbalanced tweets. First, the raw tweets are preprocessed using tokenization method for capturing useful features without noisy information. Second, the local and global feature's importance is estimated by applying TFIDF statistical technique. Third, the Latent Dirichlet Allocation (LDA) topic modeling method is used to extract topics from these features. These topics explain concepts of related tweet which is really helpful for classification. Fourth, the Adaptive Synthetic (ADASYN) class balancing technique is applied to oversample the data and balance each class of topic. Finally, the K-Nearest Neighbor (KNN) machine learning algorithm is applied to predict the emotions in extracted topics. The class balancing method increase the significance of minor classes and solve the problem of class imbalance. The proposed approach is evaluated on two different Twitters’ emotions datasets. It is proved that, this methodology outperformed as compared to the popular state of the art methods in terms of precision, recall, f-measure and classification accuracy.","PeriodicalId":125388,"journal":{"name":"Proceedings of the 2020 3rd International Conference on Algorithms, Computing and Artificial Intelligence","volume":"134 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-12-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124642614","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
A unified framework for attribute extraction in electronic medical records 电子病历中属性提取的统一框架
Ming Du, Wenkun Wang, Sufen Wang, Bo Xu
Electronic medical records(EMR) contain a lot of medical diagnosis information; In order to mine the value of data, it is necessary to extract the attributes of the electronic medical record. The deep learning method has been widely used in attribute extraction tasks and has achieved remarkable results in general text datasets. However, in specific medical fields, such as our electronic medical record extraction task, attribute extraction often lacks a lot of high-quality annotation data; besides, the attributes in the corpus can be divided into two types: discriminative attribute and extractive attribute, there is a strong correlation between some attributes. Independent Modeling each attribute cannot use this information, which will lead to insufficient information that the model can learn. This paper proposes a unified framework for medical record attribute extraction based on ALBERT, uses a large amount of general corpus as external knowledge for pre-training and fine-tuning, and adopts multi-task learning to make all attributes share the underlying cod-ing and train. Experiments show that this framework is greatly improved than the traditional LSTM-CRF model; it performs better in practical application scenarios.
电子病历(EMR)包含大量的医疗诊断信息;为了挖掘数据的价值,需要对电子病历的属性进行提取。深度学习方法在属性提取任务中得到了广泛的应用,并在一般文本数据集中取得了显著的效果。然而,在特定的医疗领域,比如我们的电子病历提取任务中,属性提取往往缺乏大量高质量的标注数据;此外,语料库中的属性可分为判别属性和抽取属性两种类型,部分属性之间存在较强的相关性。独立建模的每个属性不能使用这些信息,这将导致模型可以学习的信息不足。本文提出了一种基于ALBERT的病案属性提取的统一框架,利用大量通用语料库作为外部知识进行预训练和微调,并采用多任务学习使所有属性共享底层编码和训练。实验表明,该框架比传统的LSTM-CRF模型有很大的改进;它在实际应用场景中表现更好。
{"title":"A unified framework for attribute extraction in electronic medical records","authors":"Ming Du, Wenkun Wang, Sufen Wang, Bo Xu","doi":"10.1145/3446132.3446410","DOIUrl":"https://doi.org/10.1145/3446132.3446410","url":null,"abstract":"Electronic medical records(EMR) contain a lot of medical diagnosis information; In order to mine the value of data, it is necessary to extract the attributes of the electronic medical record. The deep learning method has been widely used in attribute extraction tasks and has achieved remarkable results in general text datasets. However, in specific medical fields, such as our electronic medical record extraction task, attribute extraction often lacks a lot of high-quality annotation data; besides, the attributes in the corpus can be divided into two types: discriminative attribute and extractive attribute, there is a strong correlation between some attributes. Independent Modeling each attribute cannot use this information, which will lead to insufficient information that the model can learn. This paper proposes a unified framework for medical record attribute extraction based on ALBERT, uses a large amount of general corpus as external knowledge for pre-training and fine-tuning, and adopts multi-task learning to make all attributes share the underlying cod-ing and train. Experiments show that this framework is greatly improved than the traditional LSTM-CRF model; it performs better in practical application scenarios.","PeriodicalId":125388,"journal":{"name":"Proceedings of the 2020 3rd International Conference on Algorithms, Computing and Artificial Intelligence","volume":"33 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-12-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132369081","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Deep Factorization Machines network with Non-linear interaction for Recommender System 推荐系统的非线性交互深度分解机器网络
Chuchu Yu, Xinmei Yang, Han Jiang
In recent years, leveraging the characteristics of users’ historical behavior to predict click-through rates (CTRs) has become a key point of interest in studies of recommender systems. Although theoretical and experimental investigations of CTR models have increased substantially, most models focus on linear feature interaction; however, crucial user characteristics in the real world are discovered implicitly by non-linear features. In this paper, we propose a novel model that integrates the advantages of linear and non-linear feature interaction. Our deep factorization machines network with non-linear interaction for recommend systems (DFNR) model identifies non-linear feature interactions by designing a new Non-linear interaction (NL-interaction) layer. We also incorporate a deeper multilayer perceptron (MLP) than other CTR models, which yields more accurate information about higher-order feature interactions. The MLP in the proposed model is unique because we use the residual structure to correct problems caused by a deeper network structure. Findings show that our DFNR model performs better on a CTR prediction task compared to other models. Results demonstrate the effective-ness of our model based on its non-linear interaction layer and deeper neural network architecture.
近年来,利用用户的历史行为特征来预测点击率已经成为推荐系统研究的一个重点。尽管CTR模型的理论和实验研究已经大量增加,但大多数模型都集中在线性特征相互作用上;然而,现实世界中的关键用户特征是通过非线性特征隐含地发现的。在本文中,我们提出了一个新的模型,集成了线性和非线性特征交互的优点。我们的推荐系统(DFNR)模型通过设计一个新的非线性交互(nl -交互)层来识别非线性特征交互。我们还结合了一个比其他CTR模型更深的多层感知器(MLP),它产生了关于高阶特征相互作用的更准确的信息。所提模型中的MLP是独特的,因为我们使用残差结构来纠正由更深层次的网络结构引起的问题。研究结果表明,与其他模型相比,DFNR模型在CTR预测任务上表现更好。结果表明,基于非线性交互层和深层神经网络结构的模型是有效的。
{"title":"Deep Factorization Machines network with Non-linear interaction for Recommender System","authors":"Chuchu Yu, Xinmei Yang, Han Jiang","doi":"10.1145/3446132.3446134","DOIUrl":"https://doi.org/10.1145/3446132.3446134","url":null,"abstract":"In recent years, leveraging the characteristics of users’ historical behavior to predict click-through rates (CTRs) has become a key point of interest in studies of recommender systems. Although theoretical and experimental investigations of CTR models have increased substantially, most models focus on linear feature interaction; however, crucial user characteristics in the real world are discovered implicitly by non-linear features. In this paper, we propose a novel model that integrates the advantages of linear and non-linear feature interaction. Our deep factorization machines network with non-linear interaction for recommend systems (DFNR) model identifies non-linear feature interactions by designing a new Non-linear interaction (NL-interaction) layer. We also incorporate a deeper multilayer perceptron (MLP) than other CTR models, which yields more accurate information about higher-order feature interactions. The MLP in the proposed model is unique because we use the residual structure to correct problems caused by a deeper network structure. Findings show that our DFNR model performs better on a CTR prediction task compared to other models. Results demonstrate the effective-ness of our model based on its non-linear interaction layer and deeper neural network architecture.","PeriodicalId":125388,"journal":{"name":"Proceedings of the 2020 3rd International Conference on Algorithms, Computing and Artificial Intelligence","volume":"48 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-12-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132895638","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
The Design and Construction of the Corpus of China English 《中国英语》语料库的设计与构建
L. Xia, Yun Xia
The paper describes the development a corpus of an English variety, i.e. China English, in or-der to provide a linguistic resource for researchers in the field of China English. The Corpus of China English (CCE) was built with due consideration given to its representativeness and authenticity. It was composed of more than 13,962,102 tokens in 15,333 texts evenly divided between the following four genres: newspapers, magazines, fiction and academic writings. The texts cover a wide range of domains, such as news, financial, politics, environment, social, culture, technology, sports, education, philosophy, literary, etc. It is a helpful resource for research on China English, computational linguistics, natural language processing, corpus linguistics and English language education.
本文介绍了中国英语语料库的开发情况,以期为中国英语研究领域的研究者提供语言资源。中国英语语料库的构建充分考虑了语料库的代表性和真实性。它由超过13,962,102个符号组成,分布在15,333个文本中,平均分为以下四种类型:报纸,杂志,小说和学术著作。这些文本涵盖了广泛的领域,如新闻、金融、政治、环境、社会、文化、科技、体育、教育、哲学、文学等。它是中国英语、计算语言学、自然语言处理、语料库语言学和英语教育研究的有益资源。
{"title":"The Design and Construction of the Corpus of China English","authors":"L. Xia, Yun Xia","doi":"10.1145/3446132.3446398","DOIUrl":"https://doi.org/10.1145/3446132.3446398","url":null,"abstract":"The paper describes the development a corpus of an English variety, i.e. China English, in or-der to provide a linguistic resource for researchers in the field of China English. The Corpus of China English (CCE) was built with due consideration given to its representativeness and authenticity. It was composed of more than 13,962,102 tokens in 15,333 texts evenly divided between the following four genres: newspapers, magazines, fiction and academic writings. The texts cover a wide range of domains, such as news, financial, politics, environment, social, culture, technology, sports, education, philosophy, literary, etc. It is a helpful resource for research on China English, computational linguistics, natural language processing, corpus linguistics and English language education.","PeriodicalId":125388,"journal":{"name":"Proceedings of the 2020 3rd International Conference on Algorithms, Computing and Artificial Intelligence","volume":"8 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-12-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124045150","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Underwater Object Detection Based on Improved Single Shot MultiBox Detector 基于改进单镜头多盒探测器的水下目标检测
Zhongyun Jiang, Rong-Sheng Wang
Underwater optical images are scarce, and there are varying degrees of blur and color distortion, which brings great challenges to the detection of underwater objects. In view of the shortcomings of the original Single Shot MultiBox Detector (SSD), in this paper, a shallow object detection layer is added to the original SSD model to improve the network's ability to detect small objects. At the same time, this article improves the confidence loss to narrow the ability of SSD to detect different types of objects. Using the Multi-Scale Retinex with Color Restoration (MSRCR) algorithm to process the original images, enhance the feature information of the objects in the underwater images. Training the improved SSD network through transfer learning to overcome the limitations of insufficient underwater images. Experimental results show that the algorithm proposed in this paper has better detection performance than the original SSD, YOLO v3 and other algorithms, which is of great significance to the realization of underwater object detection.
水下光学图像稀缺,并且存在不同程度的模糊和色彩失真,这给水下物体的检测带来了很大的挑战。针对原有单射多盒检测器(Single Shot MultiBox Detector, SSD)存在的不足,本文在原有的SSD模型上增加了一个浅层的目标检测层,以提高网络对小目标的检测能力。同时,本文通过改进置信度损失来缩小SSD检测不同类型对象的能力。利用多尺度Retinex with Color Restoration (MSRCR)算法对原始图像进行处理,增强水下图像中物体的特征信息。通过迁移学习训练改进后的SSD网络,克服水下图像不足的局限性。实验结果表明,本文提出的算法比原有的SSD、YOLO v3等算法具有更好的检测性能,对实现水下目标检测具有重要意义。
{"title":"Underwater Object Detection Based on Improved Single Shot MultiBox Detector","authors":"Zhongyun Jiang, Rong-Sheng Wang","doi":"10.1145/3446132.3446170","DOIUrl":"https://doi.org/10.1145/3446132.3446170","url":null,"abstract":"Underwater optical images are scarce, and there are varying degrees of blur and color distortion, which brings great challenges to the detection of underwater objects. In view of the shortcomings of the original Single Shot MultiBox Detector (SSD), in this paper, a shallow object detection layer is added to the original SSD model to improve the network's ability to detect small objects. At the same time, this article improves the confidence loss to narrow the ability of SSD to detect different types of objects. Using the Multi-Scale Retinex with Color Restoration (MSRCR) algorithm to process the original images, enhance the feature information of the objects in the underwater images. Training the improved SSD network through transfer learning to overcome the limitations of insufficient underwater images. Experimental results show that the algorithm proposed in this paper has better detection performance than the original SSD, YOLO v3 and other algorithms, which is of great significance to the realization of underwater object detection.","PeriodicalId":125388,"journal":{"name":"Proceedings of the 2020 3rd International Conference on Algorithms, Computing and Artificial Intelligence","volume":"120 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-12-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"120960456","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
An application of LSTM prediction model based on keystroke data 基于按键数据的LSTM预测模型的应用
O. Min, Zhang Wei, Zhou Nian, Xie Su
Based on the subject's keyboard typing time series dataset, an long short term (LSTM) network model was developed to predict the early-stage Parkinson's disease. The training and test results show that the area under ROC curve (AUC) is 0.82, accuracy rate (ACC) is 0.84, precision (PRE) is 0.85, recall rate (REC) is 0.98, and F1 score is 0.90. This indicates that the LSTM prediction model can botain high accuracy, precision and sensitivity results by automatically extracting keyboard typing time series characteristics of keyboard typing time series data.
基于受试者的键盘输入时间序列数据集,建立了一个长期短期(LSTM)网络模型来预测早期帕金森病。训练和测试结果表明,该方法的ROC曲线下面积(AUC)为0.82,准确率(ACC)为0.84,准确率(PRE)为0.85,召回率(REC)为0.98,F1得分为0.90。这表明LSTM预测模型通过自动提取键盘输入时间序列数据的键盘输入时间序列特征,可以获得较高的准确度、精度和灵敏度。
{"title":"An application of LSTM prediction model based on keystroke data","authors":"O. Min, Zhang Wei, Zhou Nian, Xie Su","doi":"10.1145/3446132.3446191","DOIUrl":"https://doi.org/10.1145/3446132.3446191","url":null,"abstract":"Based on the subject's keyboard typing time series dataset, an long short term (LSTM) network model was developed to predict the early-stage Parkinson's disease. The training and test results show that the area under ROC curve (AUC) is 0.82, accuracy rate (ACC) is 0.84, precision (PRE) is 0.85, recall rate (REC) is 0.98, and F1 score is 0.90. This indicates that the LSTM prediction model can botain high accuracy, precision and sensitivity results by automatically extracting keyboard typing time series characteristics of keyboard typing time series data.","PeriodicalId":125388,"journal":{"name":"Proceedings of the 2020 3rd International Conference on Algorithms, Computing and Artificial Intelligence","volume":"16 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-12-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115646282","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Domain Adaptation for Tibetan-Chinese Neural Machine Translation 藏汉神经机器翻译的领域自适应
Maoxian Zhou, Jia Secha, Rangjia Cai
The meaning of the same word or sentence is likely to change in different semantic contexts, which challenges general-purpose translation system to maintain stable performance across different domains. Therefore, domain adaptation is an essential researching topic in Neural Machine Translation practice. In order to efficiently train translation models for different domains, in this work we take the Tibetan-Chinese general translation model as the parent model, and obtain two domain-specific Tibetan-Chinese translation models with small-scale in-domain data. The empirical results indicate that the method provides a positive approach for domain adaptation in low-resource scenarios, resulting in better bleu metrics as well as faster training speed over our general baseline models.
在不同的语义语境中,同一个词或句子的意义可能会发生变化,这给通用翻译系统在不同领域保持稳定的性能带来了挑战。因此,领域自适应是神经网络机器翻译实践中的一个重要研究课题。为了有效地训练不同领域的翻译模型,本工作以藏汉通用翻译模型为父模型,获得了两个具有小尺度域内数据的特定领域的藏汉翻译模型。实证结果表明,该方法为低资源场景下的域适应提供了积极的途径,从而获得了更好的bleu指标,并且比我们的一般基线模型更快的训练速度。
{"title":"Domain Adaptation for Tibetan-Chinese Neural Machine Translation","authors":"Maoxian Zhou, Jia Secha, Rangjia Cai","doi":"10.1145/3446132.3446404","DOIUrl":"https://doi.org/10.1145/3446132.3446404","url":null,"abstract":"The meaning of the same word or sentence is likely to change in different semantic contexts, which challenges general-purpose translation system to maintain stable performance across different domains. Therefore, domain adaptation is an essential researching topic in Neural Machine Translation practice. In order to efficiently train translation models for different domains, in this work we take the Tibetan-Chinese general translation model as the parent model, and obtain two domain-specific Tibetan-Chinese translation models with small-scale in-domain data. The empirical results indicate that the method provides a positive approach for domain adaptation in low-resource scenarios, resulting in better bleu metrics as well as faster training speed over our general baseline models.","PeriodicalId":125388,"journal":{"name":"Proceedings of the 2020 3rd International Conference on Algorithms, Computing and Artificial Intelligence","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-12-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132668561","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Robustness analysis of noise network link prediction 噪声网络链路预测的鲁棒性分析
Cunchao Zhu, Guangquan Cheng, Yang Ma, Jiuyao Jiang, M. Wang, Tingfei Huang
Link prediction is an important application in complex networks. It predicts existing but undiscovered associations or possible future relationships in the network. However, networks in real life have much noise. The networks we observe are incomplete or redundant which interfere with the effect of link prediction. This paper summarizes and constructs four kinds of common noises in social networks, then analyzes the robustness of traditional link prediction methods and methods based on network representation under the influence of different kinds and different degrees of noises on multiple social networks. The experimental results confirm that algorithms using local network properties have higher link accuracy, while methods based on the global properties have higher robustness. CCS CONCEPTS • Networks∼Network performance evaluation∼Network performance analysis • Networks∼Network performance evaluation∼Network experimentation • Networks∼Network performance evaluation∼Network performance modeling
链路预测是复杂网络中的一个重要应用。它预测网络中现有但尚未发现的关联或可能的未来关系。然而,现实生活中的网络有很多噪音。我们观察到的网络是不完整的或冗余的,这干扰了链路预测的效果。本文总结并构建了社交网络中常见的四种噪声,分析了传统的链接预测方法和基于网络表示的方法在多个社交网络中不同种类、不同程度的噪声影响下的鲁棒性。实验结果表明,基于局部网络属性的算法具有更高的链路精度,而基于全局属性的方法具有更高的鲁棒性。CCS概念•网络~网络性能评估~网络性能分析•网络~网络性能评估~网络实验•网络~网络性能评估~网络性能建模
{"title":"Robustness analysis of noise network link prediction","authors":"Cunchao Zhu, Guangquan Cheng, Yang Ma, Jiuyao Jiang, M. Wang, Tingfei Huang","doi":"10.1145/3446132.3446143","DOIUrl":"https://doi.org/10.1145/3446132.3446143","url":null,"abstract":"Link prediction is an important application in complex networks. It predicts existing but undiscovered associations or possible future relationships in the network. However, networks in real life have much noise. The networks we observe are incomplete or redundant which interfere with the effect of link prediction. This paper summarizes and constructs four kinds of common noises in social networks, then analyzes the robustness of traditional link prediction methods and methods based on network representation under the influence of different kinds and different degrees of noises on multiple social networks. The experimental results confirm that algorithms using local network properties have higher link accuracy, while methods based on the global properties have higher robustness. CCS CONCEPTS • Networks∼Network performance evaluation∼Network performance analysis • Networks∼Network performance evaluation∼Network experimentation • Networks∼Network performance evaluation∼Network performance modeling","PeriodicalId":125388,"journal":{"name":"Proceedings of the 2020 3rd International Conference on Algorithms, Computing and Artificial Intelligence","volume":"26 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-12-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130705716","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Context Event Features and Event Embedding Enhanced Event Detection 事件特性和事件嵌入增强的事件检测
Xin Shi, Xiaoyang Zeng, Jie Wu, Mengshu Hou, Hao Zhu
Extracting valuable information from text has always been a hot point for research and event detection is an essential subtask of information extraction. Most existing methods of event detection only focus on sentence-level information and do not consider the correlation between different event types. To address these problems, in this paper, we propose a novel pre-trained language model based event detection framework named CFEE that utilizes document-level information and event correlation to enhance the event detection task. To obtain event correlation, we project all event types into a shared semantic space through a Skip-gram model, where the event correlation can be represented as the distance between event embeddings. In order to capture document-level information, we utilize a bidirectional recurrent neural network to fuse the context information. Experiments on the ACE2005 dataset demonstrate that our proposed model is better than most existing methods, and also demonstrate the effectiveness of event correlation and document-level information.
从文本中提取有价值的信息一直是研究的热点,事件检测是信息提取的重要子任务。大多数现有的事件检测方法只关注句子级信息,没有考虑不同事件类型之间的相关性。为了解决这些问题,本文提出了一种新的基于预训练语言模型的事件检测框架CFEE,该框架利用文档级信息和事件相关性来增强事件检测任务。为了获得事件相关性,我们通过Skip-gram模型将所有事件类型投影到共享语义空间中,其中事件相关性可以表示为事件嵌入之间的距离。为了捕获文档级信息,我们利用双向递归神经网络融合上下文信息。在ACE2005数据集上的实验表明,我们的模型优于大多数现有的方法,也证明了事件关联和文档级信息的有效性。
{"title":"Context Event Features and Event Embedding Enhanced Event Detection","authors":"Xin Shi, Xiaoyang Zeng, Jie Wu, Mengshu Hou, Hao Zhu","doi":"10.1145/3446132.3446397","DOIUrl":"https://doi.org/10.1145/3446132.3446397","url":null,"abstract":"Extracting valuable information from text has always been a hot point for research and event detection is an essential subtask of information extraction. Most existing methods of event detection only focus on sentence-level information and do not consider the correlation between different event types. To address these problems, in this paper, we propose a novel pre-trained language model based event detection framework named CFEE that utilizes document-level information and event correlation to enhance the event detection task. To obtain event correlation, we project all event types into a shared semantic space through a Skip-gram model, where the event correlation can be represented as the distance between event embeddings. In order to capture document-level information, we utilize a bidirectional recurrent neural network to fuse the context information. Experiments on the ACE2005 dataset demonstrate that our proposed model is better than most existing methods, and also demonstrate the effectiveness of event correlation and document-level information.","PeriodicalId":125388,"journal":{"name":"Proceedings of the 2020 3rd International Conference on Algorithms, Computing and Artificial Intelligence","volume":"14 2","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-12-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131751366","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Proceedings of the 2020 3rd International Conference on Algorithms, Computing and Artificial Intelligence
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1