基于多通道卷积神经网络模型的跨域情感分析

A. Rozie, Andria Arisal, D. Munandar
{"title":"基于多通道卷积神经网络模型的跨域情感分析","authors":"A. Rozie, Andria Arisal, D. Munandar","doi":"10.1109/ISRITI48646.2019.9034599","DOIUrl":null,"url":null,"abstract":"Analyzing sentiment analysis with deep learning requires massive labeled datasets where such data is not always available. The annotation process is also time-consuming and tedious. Further, even after we train the sentiment analysis, it creates another problem. Because this model is domain-dependent, the performance in another domain estimated to perform poorly. In this paper, we present the transfer learning approach to transfer knowledge gained from the source dataset into the target dataset with the expectation to improve the target model. Multichannel Convolutional Neural Network deploys different n-grams as the input channel in a single CNN model to grasp meaningful features from the text. This method has proven to perform well in sentiment analysis problems. We train our three datasets with different domains using this method as the baseline. The largest dataset then becomes the source model for transfer learning and other datasets as the target. Fine-tuning our source model also needed when retraining it into the target dataset. From the evaluation, we show that several transfer learning strategies outperform the domain-specific model, even when the data is imbalanced. We also highlight certain failing strategies that inflict lousy results on the target model performance.","PeriodicalId":367363,"journal":{"name":"2019 International Seminar on Research of Information Technology and Intelligent Systems (ISRITI)","volume":"48 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Transferring Multi-Channel Convolutional Neural Network Model for Cross-Domain Sentiment Analysis\",\"authors\":\"A. Rozie, Andria Arisal, D. Munandar\",\"doi\":\"10.1109/ISRITI48646.2019.9034599\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Analyzing sentiment analysis with deep learning requires massive labeled datasets where such data is not always available. The annotation process is also time-consuming and tedious. Further, even after we train the sentiment analysis, it creates another problem. Because this model is domain-dependent, the performance in another domain estimated to perform poorly. In this paper, we present the transfer learning approach to transfer knowledge gained from the source dataset into the target dataset with the expectation to improve the target model. Multichannel Convolutional Neural Network deploys different n-grams as the input channel in a single CNN model to grasp meaningful features from the text. This method has proven to perform well in sentiment analysis problems. We train our three datasets with different domains using this method as the baseline. The largest dataset then becomes the source model for transfer learning and other datasets as the target. Fine-tuning our source model also needed when retraining it into the target dataset. From the evaluation, we show that several transfer learning strategies outperform the domain-specific model, even when the data is imbalanced. We also highlight certain failing strategies that inflict lousy results on the target model performance.\",\"PeriodicalId\":367363,\"journal\":{\"name\":\"2019 International Seminar on Research of Information Technology and Intelligent Systems (ISRITI)\",\"volume\":\"48 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-12-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2019 International Seminar on Research of Information Technology and Intelligent Systems (ISRITI)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ISRITI48646.2019.9034599\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 International Seminar on Research of Information Technology and Intelligent Systems (ISRITI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISRITI48646.2019.9034599","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

使用深度学习进行情感分析需要大量标记数据集,而这些数据并不总是可用的。注释过程也很耗时和繁琐。此外,即使在我们训练了情感分析之后,它也会产生另一个问题。因为这个模型是领域相关的,所以在另一个领域的性能估计会很差。在本文中,我们提出了一种迁移学习方法,将从源数据集中获得的知识迁移到目标数据集中,以期改进目标模型。多通道卷积神经网络在单个CNN模型中部署不同的n-gram作为输入通道,从文本中抓取有意义的特征。该方法已被证明在情感分析问题中表现良好。我们使用该方法作为基线,对三个不同域的数据集进行训练。然后最大的数据集成为迁移学习的源模型,其他数据集作为目标。在将源模型重新训练到目标数据集时也需要对其进行微调。从评估中,我们发现即使在数据不平衡的情况下,几种迁移学习策略也优于特定领域模型。我们还强调了某些失败的策略,这些策略会对目标模型的性能造成糟糕的结果。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Transferring Multi-Channel Convolutional Neural Network Model for Cross-Domain Sentiment Analysis
Analyzing sentiment analysis with deep learning requires massive labeled datasets where such data is not always available. The annotation process is also time-consuming and tedious. Further, even after we train the sentiment analysis, it creates another problem. Because this model is domain-dependent, the performance in another domain estimated to perform poorly. In this paper, we present the transfer learning approach to transfer knowledge gained from the source dataset into the target dataset with the expectation to improve the target model. Multichannel Convolutional Neural Network deploys different n-grams as the input channel in a single CNN model to grasp meaningful features from the text. This method has proven to perform well in sentiment analysis problems. We train our three datasets with different domains using this method as the baseline. The largest dataset then becomes the source model for transfer learning and other datasets as the target. Fine-tuning our source model also needed when retraining it into the target dataset. From the evaluation, we show that several transfer learning strategies outperform the domain-specific model, even when the data is imbalanced. We also highlight certain failing strategies that inflict lousy results on the target model performance.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
TrendiTex: An Intelligent Fashion Designer Pair Extraction of Aspect and Implicit Opinion Word based on its Co-occurrence in Corpus of Bahasa Indonesia Parameter Tuning of G-mapping SLAM (Simultaneous Localization and Mapping) on Mobile Robot with Laser-Range Finder 360° Sensor ISRITI 2019 Committees Network Architecture Design of Indonesia Research and Education Network (IDREN)
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1