Transformer and Graph Convolutional Network for Text Classification

IF 2.9 4区计算机科学 International Journal of Computational Intelligence Systems Pub Date : 2023-10-04 DOI:10.1007/s44196-023-00337-z

Boting Liu, Weili Guan, Changjin Yang, Zhijie Fang, Zhiheng Lu

引用次数: 0

Abstract

Abstract Graph convolutional network (GCN) is an effective tool for feature clustering. However, in the text classification task, the traditional TextGCN (GCN for Text Classification) ignores the context word order of the text. In addition, TextGCN constructs the text graph only according to the context relationship, so it is difficult for the word nodes to learn an effective semantic representation. Based on this, this paper proposes a text classification method that combines Transformer and GCN. To improve the semantic accuracy of word node features, we add a part of speech (POS) to the word-document graph and build edges between words based on POS. In the layer-to-layer of GCN, the Transformer is used to extract the contextual and sequential information of the text. We conducted the experiment on five representative datasets. The results show that our method can effectively improve the accuracy of text classification and is better than the comparison method.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

用于文本分类的变压器和图卷积网络

图卷积网络(GCN)是一种有效的特征聚类工具。然而，在文本分类任务中，传统的textcn (GCN for text classification)忽略了文本的上下文词序。此外，TextGCN仅根据上下文关系构建文本图，因此单词节点很难学习到有效的语义表示。在此基础上，本文提出了一种结合Transformer和GCN的文本分类方法。为了提高词节点特征的语义准确性，我们在词-文档图中加入词性(POS)，并基于词性(POS)在词与词之间建立边缘。在分层GCN中，使用Transformer提取文本的上下文信息和顺序信息。我们在五个有代表性的数据集上进行了实验。结果表明，该方法能有效提高文本分类的准确率，优于比较法。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

International Journal of Computational Intelligence Systems 工程技术-计算机：跨学科应用

自引率

3.40%

发文量

期刊介绍： The International Journal of Computational Intelligence Systems publishes original research on all aspects of applied computational intelligence, especially targeting papers demonstrating the use of techniques and methods originating from computational intelligence theory. The core theories of computational intelligence are fuzzy logic, neural networks, evolutionary computation and probabilistic reasoning. The journal publishes only articles related to the use of computational intelligence and broadly covers the following topics: -Autonomous reasoning- Bio-informatics- Cloud computing- Condition monitoring- Data science- Data mining- Data visualization- Decision support systems- Fault diagnosis- Intelligent information retrieval- Human-machine interaction and interfaces- Image processing- Internet and networks- Noise analysis- Pattern recognition- Prediction systems- Power (nuclear) safety systems- Process and system control- Real-time systems- Risk analysis and safety-related issues- Robotics- Signal and image processing- IoT and smart environments- Systems integration- System control- System modelling and optimization- Telecommunications- Time series prediction- Warning systems- Virtual reality- Web intelligence- Deep learning