Efficient and Flexible Method for Reducing Moderate-Size Deep Neural Networks with Condensation

IF 2 3区物理与天体物理 Q2 PHYSICS, MULTIDISCIPLINARY Entropy Pub Date : 2024-06-30 DOI:10.3390/e26070567

Tianyi Chen, Zhi-Qin John Xu

{"title":"Efficient and Flexible Method for Reducing Moderate-Size Deep Neural Networks with Condensation","authors":"Tianyi Chen, Zhi-Qin John Xu","doi":"10.3390/e26070567","DOIUrl":null,"url":null,"abstract":"Neural networks have been extensively applied to a variety of tasks, achieving astounding results. Applying neural networks in the scientific field is an important research direction that is gaining increasing attention. In scientific applications, the scale of neural networks is generally moderate size, mainly to ensure the speed of inference during application. Additionally, comparing neural networks to traditional algorithms in scientific applications is inevitable. These applications often require rapid computations, making the reduction in neural network sizes increasingly important. Existing work has found that the powerful capabilities of neural networks are primarily due to their nonlinearity. Theoretical work has discovered that under strong nonlinearity, neurons in the same layer tend to behave similarly, a phenomenon known as condensation. Condensation offers an opportunity to reduce the scale of neural networks to a smaller subnetwork with a similar performance. In this article, we propose a condensation reduction method to verify the feasibility of this idea in practical problems, thereby validating existing theories. Our reduction method can currently be applied to both fully connected networks and convolutional networks, achieving positive results. In complex combustion acceleration tasks, we reduced the size of the neural network to 41.7% of its original scale while maintaining prediction accuracy. In the CIFAR10 image classification task, we reduced the network size to 11.5% of the original scale, still maintaining a satisfactory validation accuracy. Our method can be applied to most trained neural networks, reducing computational pressure and improving inference speed.","PeriodicalId":11694,"journal":{"name":"Entropy","volume":"10 1","pages":""},"PeriodicalIF":2.0000,"publicationDate":"2024-06-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Entropy","FirstCategoryId":"101","ListUrlMain":"https://doi.org/10.3390/e26070567","RegionNum":3,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"PHYSICS, MULTIDISCIPLINARY","Score":null,"Total":0}

引用次数: 0

Abstract

Neural networks have been extensively applied to a variety of tasks, achieving astounding results. Applying neural networks in the scientific field is an important research direction that is gaining increasing attention. In scientific applications, the scale of neural networks is generally moderate size, mainly to ensure the speed of inference during application. Additionally, comparing neural networks to traditional algorithms in scientific applications is inevitable. These applications often require rapid computations, making the reduction in neural network sizes increasingly important. Existing work has found that the powerful capabilities of neural networks are primarily due to their nonlinearity. Theoretical work has discovered that under strong nonlinearity, neurons in the same layer tend to behave similarly, a phenomenon known as condensation. Condensation offers an opportunity to reduce the scale of neural networks to a smaller subnetwork with a similar performance. In this article, we propose a condensation reduction method to verify the feasibility of this idea in practical problems, thereby validating existing theories. Our reduction method can currently be applied to both fully connected networks and convolutional networks, achieving positive results. In complex combustion acceleration tasks, we reduced the size of the neural network to 41.7% of its original scale while maintaining prediction accuracy. In the CIFAR10 image classification task, we reduced the network size to 11.5% of the original scale, still maintaining a satisfactory validation accuracy. Our method can be applied to most trained neural networks, reducing computational pressure and improving inference speed.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

高效灵活的凝缩式适度规模深度神经网络方法

神经网络已被广泛应用于各种任务，并取得了惊人的成果。将神经网络应用于科学领域是一个重要的研究方向，越来越受到人们的关注。在科学应用中，神经网络的规模一般适中，主要是为了保证应用过程中的推理速度。此外，在科学应用中，将神经网络与传统算法进行比较是不可避免的。这些应用往往需要快速计算，因此缩小神经网络的规模变得越来越重要。现有研究发现，神经网络的强大功能主要归功于其非线性特性。理论研究发现，在强烈的非线性条件下，同一层中的神经元往往会表现出相似的行为，这种现象被称为凝聚。凝聚现象为缩小神经网络的规模，使其成为具有类似性能的更小的子网络提供了机会。在本文中，我们提出了一种浓缩缩减方法，以验证这一想法在实际问题中的可行性，从而验证现有理论。目前，我们的缩减方法既可用于全连接网络，也可用于卷积网络，并取得了积极的效果。在复杂的燃烧加速任务中，我们将神经网络的规模缩小到原来的 41.7%，同时保持了预测精度。在 CIFAR10 图像分类任务中，我们将网络规模缩小到原始规模的 11.5%，仍然保持了令人满意的验证精度。我们的方法可以应用于大多数训练有素的神经网络，从而减轻计算压力，提高推理速度。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Entropy PHYSICS, MULTIDISCIPLINARY-

CiteScore

4.90

自引率

11.10%

发文量

1580

审稿时长

21.05 days

期刊介绍： Entropy (ISSN 1099-4300), an international and interdisciplinary journal of entropy and information studies, publishes reviews, regular research papers and short notes. Our aim is to encourage scientists to publish as much as possible their theoretical and experimental details. There is no restriction on the length of the papers. If there are computation and the experiment, the details must be provided so that the results can be reproduced.