A comparative study of deep learning approaches for Chinese Sentence Classification

Zhu Zeng
{"title":"A comparative study of deep learning approaches for Chinese Sentence Classification","authors":"Zhu Zeng","doi":"10.1109/ICCEAI52939.2021.00045","DOIUrl":null,"url":null,"abstract":"One of the most commonly used natural language processing technologies is text classification. Spam detection, news text classification, information retrieval, emotion analysis, and intention judgment, among other applications, are all popular text classification applications [25]. Text classification is the process of assigning pre-defined class labels to text documents in order to shape semantic classes. Engineering, medical science, life science, social sciences and humanities, marketing, and government are only a few of the real-world applications. Machine learning and deep learning algorithms have recently become common and efficient methods for dealing with text classification problems involving labeled data [26]. The primary goal of text classification is to automatically assign texts to pre-defined categories based on their content. In this study, we will conduct a comparative study of the accuracies of different deep learning methods that include Bidirectional Encoder Representations from Transformers (BERT), Recurrent Neural Network (RNN), Convolutional Neural Network (CNN), and Region-based convolutional neural networks and compare the effectiveness of these deep-learning approaches in classifying Chinese news title text using the THUCNews dataset.","PeriodicalId":331409,"journal":{"name":"2021 International Conference on Computer Engineering and Artificial Intelligence (ICCEAI)","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2021-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 International Conference on Computer Engineering and Artificial Intelligence (ICCEAI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCEAI52939.2021.00045","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

One of the most commonly used natural language processing technologies is text classification. Spam detection, news text classification, information retrieval, emotion analysis, and intention judgment, among other applications, are all popular text classification applications [25]. Text classification is the process of assigning pre-defined class labels to text documents in order to shape semantic classes. Engineering, medical science, life science, social sciences and humanities, marketing, and government are only a few of the real-world applications. Machine learning and deep learning algorithms have recently become common and efficient methods for dealing with text classification problems involving labeled data [26]. The primary goal of text classification is to automatically assign texts to pre-defined categories based on their content. In this study, we will conduct a comparative study of the accuracies of different deep learning methods that include Bidirectional Encoder Representations from Transformers (BERT), Recurrent Neural Network (RNN), Convolutional Neural Network (CNN), and Region-based convolutional neural networks and compare the effectiveness of these deep-learning approaches in classifying Chinese news title text using the THUCNews dataset.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
中文句子分类的深度学习方法比较研究
文本分类是最常用的自然语言处理技术之一。垃圾邮件检测、新闻文本分类、信息检索、情感分析、意图判断等都是比较流行的文本分类应用[25]。文本分类是将预定义的类标签分配给文本文档以形成语义类的过程。工程、医学、生命科学、社会科学和人文科学、市场营销和政府只是实际应用中的一小部分。机器学习和深度学习算法最近已经成为处理涉及标记数据的文本分类问题的常见而有效的方法[26]。文本分类的主要目标是根据文本的内容自动将文本分配到预定义的类别中。在这项研究中,我们将对不同深度学习方法的准确性进行比较研究,包括来自变形器的双向编码器表示(BERT)、循环神经网络(RNN)、卷积神经网络(CNN)和基于区域的卷积神经网络,并比较这些深度学习方法在使用THUCNews数据集分类中文新闻标题文本方面的有效性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Inventory sharing based on supplier-led inventory transshipment Nursing intervention of postoperative hypoglycemia in elderly patients with endometrial cancer and diabetes mellitus Improved Deeplabv3 For Better Road Segmentation In Remote Sensing Images A Literature Review of Innovation and Corporate Social Responsibilities Heart sound recognition method of congenital heart disease based on improved cepstrum coefficient features
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1