Neural Architecture Search Based on Bipartite Graphs for Text Classification
Authors: Xueming Yan; Han Huang; Yaochu Jin; Zilong Wang; Zhifeng Hao
DOI: 10.1109/TNNLS.2024.3514708
Journal: IEEE Transactions on Neural Networks and Learning Systems, vol. 36, no. 6, pp. 10749-10763 (JCR Q1, Computer Science, Artificial Intelligence)
Published: 2024-12-30 (Journal Article)
URL: https://ieeexplore.ieee.org/document/10817777/
Citations: 0
Abstract
Neural architecture search (NAS) is crucial for text representation in natural language processing (NLP); however, far less work has addressed NAS for text classification than NAS for computer vision. As with NAS for vision tasks, most existing work relies on a manually designed search space defined by a directed acyclic graph (DAG), resulting in limited generalization capability and high computational complexity. In text classification, the topological order of the NAS operators is essential for enhancing generalization, yet it cannot be accurately represented by a DAG. To address this issue, we propose a bipartite graph-based NAS (BGNAS) for text classification, which converts a DAG into a dual graph and then into a bipartite graph. This transformation makes it possible to accurately capture the topological order using multi-bigraph matching. In addition, we formulate NAS as a problem of identifying the lower bound of a submodular function, theoretically ensuring that optimal architectures in a bipartite graph-based search space can be identified using fewer search operators. Reduction of the search space is achieved by eliminating ineffective associated matching rules among search operators with a pruning strategy. As a result, the bipartite graph-based search space becomes more compact and less dependent on the complex contextual semantics of text data. Experimental results on public benchmark problems demonstrate that BGNAS achieves better performance than state-of-the-art NAS algorithms and is computationally more efficient. We also demonstrate that the bipartite graph search space can more effectively capture contextual semantics, thereby enhancing the generalization capability.
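The abstract's pipeline of converting a DAG into a dual graph and then matching over a bipartite graph can be illustrated with a toy sketch. Note this is only a minimal illustration under our own assumptions: the paper's actual dual-graph construction, multi-bigraph matching, and operator set are not specified in the abstract, and the operator names ("conv3", "pool", "attn", "dense") are hypothetical. Here each DAG edge becomes a dual-graph node, dual nodes are linked head-to-tail so topological order is preserved, and a maximum bipartite matching is computed with Kuhn's augmenting-path algorithm.

```python
from itertools import combinations

# Hypothetical toy DAG of NAS operators: each edge points from an
# earlier operator to a later one in the architecture.
dag_edges = [("conv3", "pool"), ("conv3", "attn"),
             ("pool", "dense"), ("attn", "dense")]

# Dual (line) graph: every DAG edge becomes a node; two dual nodes are
# linked when the head of one edge is the tail of the other, so the
# links encode the topological order of the original operators.
dual_edges = [(e1, e2) for e1, e2 in combinations(dag_edges, 2)
              if e1[1] == e2[0]]

# Bipartite view: predecessor edges on the left, successor edges on the right.
left = sorted({a for a, _ in dual_edges})
right = sorted({b for _, b in dual_edges})
adj = {}
for a, b in dual_edges:
    adj.setdefault(a, []).append(b)

def max_bipartite_matching(left_nodes, adjacency):
    """Kuhn's augmenting-path algorithm for maximum bipartite matching."""
    match = {}  # right node -> matched left node

    def try_augment(u, visited):
        for v in adjacency.get(u, []):
            if v in visited:
                continue
            visited.add(v)
            # v is free, or its current partner can be re-matched elsewhere.
            if v not in match or try_augment(match[v], visited):
                match[v] = u
                return True
        return False

    for u in left_nodes:
        try_augment(u, set())
    return match

matching = max_bipartite_matching(left, adj)
```

In this toy instance the matching pairs each predecessor edge with its unique successor edge, recovering the operator ordering conv3 → {pool, attn} → dense; BGNAS's actual multi-bigraph matching and submodular pruning operate over a far richer search space than this sketch.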
About the Journal
The focus of IEEE Transactions on Neural Networks and Learning Systems is to present scholarly articles discussing the theory, design, and applications of neural networks as well as other learning systems. The journal primarily highlights technical and scientific research in this domain.