Cao Yu-kun, Wei Zi-yue, Tang Yi-jia, Jin Cheng-kun
{"title":"Hierarchical Label Text Classification Method with Deep-Level Label-Assisted Classification","authors":"Cao Yu-kun, Wei Zi-yue, Tang Yi-jia, Jin Cheng-kun","doi":"10.1109/DDCLS58216.2023.10166293","DOIUrl":null,"url":null,"abstract":"Hierarchical label text classification is a challenging task in the field of natural language processing, where each document needs to be correctly classified into multiple labels with hierarchical structure. However, in the label set, due to the insufficient semantic information contained in the labels and the small number of documents classified under deep-level labels, the training of deep-level labels is insufficient, leading to a significant imbalance in label training. To address this, a hierarchical label text classification method with deep-level label-assisted classification (DLAC) is proposed. The method proposes a deep-level label-assisted classifier, which effectively utilizes text features and rich features of shallow label nodes corresponding to deep label nodes (i.e., shallow label's rich features) on the basis of enhanced label semantics to enhance the classification performance of deep labels. The comparison experiment results with eleven algorithms on three datasets show that the model can effectively improve the classification performance of deep-level labels and achieve good results.","PeriodicalId":415532,"journal":{"name":"2023 IEEE 12th Data Driven Control and Learning Systems Conference (DDCLS)","volume":"6 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-05-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 IEEE 12th Data Driven Control and Learning Systems Conference (DDCLS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/DDCLS58216.2023.10166293","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Hierarchical label text classification is a challenging task in the field of natural language processing, where each document needs to be correctly classified into multiple labels with hierarchical structure. However, in the label set, due to the insufficient semantic information contained in the labels and the small number of documents classified under deep-level labels, the training of deep-level labels is insufficient, leading to a significant imbalance in label training. To address this, a hierarchical label text classification method with deep-level label-assisted classification (DLAC) is proposed. The method proposes a deep-level label-assisted classifier, which effectively utilizes text features and rich features of shallow label nodes corresponding to deep label nodes (i.e., shallow label's rich features) on the basis of enhanced label semantics to enhance the classification performance of deep labels. The comparison experiment results with eleven algorithms on three datasets show that the model can effectively improve the classification performance of deep-level labels and achieve good results.