A comparative study of deep learning approaches for Chinese Sentence Classification

2021 International Conference on Computer Engineering and Artificial Intelligence (ICCEAI) Pub Date : 2021-08-01 DOI:10.1109/ICCEAI52939.2021.00045

Zhu Zeng

{"title":"A comparative study of deep learning approaches for Chinese Sentence Classification","authors":"Zhu Zeng","doi":"10.1109/ICCEAI52939.2021.00045","DOIUrl":null,"url":null,"abstract":"One of the most commonly used natural language processing technologies is text classification. Spam detection, news text classification, information retrieval, emotion analysis, and intention judgment, among other applications, are all popular text classification applications [25]. Text classification is the process of assigning pre-defined class labels to text documents in order to shape semantic classes. Engineering, medical science, life science, social sciences and humanities, marketing, and government are only a few of the real-world applications. Machine learning and deep learning algorithms have recently become common and efficient methods for dealing with text classification problems involving labeled data [26]. The primary goal of text classification is to automatically assign texts to pre-defined categories based on their content. In this study, we will conduct a comparative study of the accuracies of different deep learning methods that include Bidirectional Encoder Representations from Transformers (BERT), Recurrent Neural Network (RNN), Convolutional Neural Network (CNN), and Region-based convolutional neural networks and compare the effectiveness of these deep-learning approaches in classifying Chinese news title text using the THUCNews dataset.","PeriodicalId":331409,"journal":{"name":"2021 International Conference on Computer Engineering and Artificial Intelligence (ICCEAI)","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2021-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 International Conference on Computer Engineering and Artificial Intelligence (ICCEAI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCEAI52939.2021.00045","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

One of the most commonly used natural language processing technologies is text classification. Spam detection, news text classification, information retrieval, emotion analysis, and intention judgment, among other applications, are all popular text classification applications [25]. Text classification is the process of assigning pre-defined class labels to text documents in order to shape semantic classes. Engineering, medical science, life science, social sciences and humanities, marketing, and government are only a few of the real-world applications. Machine learning and deep learning algorithms have recently become common and efficient methods for dealing with text classification problems involving labeled data [26]. The primary goal of text classification is to automatically assign texts to pre-defined categories based on their content. In this study, we will conduct a comparative study of the accuracies of different deep learning methods that include Bidirectional Encoder Representations from Transformers (BERT), Recurrent Neural Network (RNN), Convolutional Neural Network (CNN), and Region-based convolutional neural networks and compare the effectiveness of these deep-learning approaches in classifying Chinese news title text using the THUCNews dataset.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

中文句子分类的深度学习方法比较研究

文本分类是最常用的自然语言处理技术之一。垃圾邮件检测、新闻文本分类、信息检索、情感分析、意图判断等都是比较流行的文本分类应用[25]。文本分类是将预定义的类标签分配给文本文档以形成语义类的过程。工程、医学、生命科学、社会科学和人文科学、市场营销和政府只是实际应用中的一小部分。机器学习和深度学习算法最近已经成为处理涉及标记数据的文本分类问题的常见而有效的方法[26]。文本分类的主要目标是根据文本的内容自动将文本分配到预定义的类别中。在这项研究中，我们将对不同深度学习方法的准确性进行比较研究，包括来自变形器的双向编码器表示(BERT)、循环神经网络(RNN)、卷积神经网络(CNN)和基于区域的卷积神经网络，并比较这些深度学习方法在使用THUCNews数据集分类中文新闻标题文本方面的有效性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

2021 International Conference on Computer Engineering and Artificial Intelligence (ICCEAI)

自引率

0.00%

发文量