Ieee-Caa Journal of Automatica Sinica最新文献

英文中文

Neural Tucker Factorization

IF 15.3 1区计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS

Ieee-Caa Journal of Automatica Sinica

Pub Date : 2025-01-20 DOI: 10.1109/JAS.2024.124977

Peng Tang;Xin Luo

This letter presents a novel latent factorization model for high dimensional and incomplete (HDI) tensor, namely the neural Tucker factorization (NeuTucF), which is a generic neural network-based latent-factorization-of-tensors model under the Tucker decomposition framework. It first interprets the traditional Tucker framework into a neural network with embeddings for different tensor modes. Afterwards, a Tucker interaction layer is innovatively built to accurately represent the complex spatiotemporal feature interactions among different tensor modes. Experiments on real-world datasets demonstrate that the proposed NeuTucF model significantly outperforms several state-of-the-art models in terms of estimation accuracy to missing data in an HDI tensor, owing to its ability of accurately representing an HDI tensor via modeling the complex interaction among different input modes. Interestingly, the results also indicate that our model has a certain level of implicit regularization.

引用次数: 0

Penalty Function-Based Distributed Primal-Dual Algorithm for Nonconvex Optimization Problem

IF 15.3 1区计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS

Ieee-Caa Journal of Automatica Sinica

Pub Date : 2025-01-20 DOI: 10.1109/JAS.2024.124935

Xiasheng Shi;Changyin Sun

This paper addresses the distributed nonconvex optimization problem, where both the global cost function and local inequality constraint function are nonconvex. To tackle this issue, the p-power transformation and penalty function techniques are introduced to reframe the nonconvex optimization problem. This ensures that the Hessian matrix of the augmented Lagrangian function becomes local positive definite by choosing appropriate control parameters. A multi-timescale primal-dual method is then devised based on the Karush-Kuhn-Tucker (KKT) point of the reformulated nonconvex problem to attain convergence. The Lyapunov theory guarantees the model's stability in the presence of an undirected and connected communication network. Finally, two nonconvex optimization problems are presented to demonstrate the efficacy of the previously developed method.

引用次数: 0

Residential Energy Scheduling With Solar Energy Based on Dyna Adaptive Dynamic Programming

IF 15.3 1区计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS

Ieee-Caa Journal of Automatica Sinica

Pub Date : 2025-01-20 DOI: 10.1109/JAS.2024.124809

Kang Xiong;Qinglai Wei;Hongyang Li

Learning-based methods have become mainstream for solving residential energy scheduling problems. In order to improve the learning efficiency of existing methods and increase the utilization of renewable energy, we propose the Dyna action-dependent heuristic dynamic programming (Dyna-ADHDP) method, which incorporates the ideas of learning and planning from the Dyna framework in action-dependent heuristic dynamic programming. This method defines a continuous action space for precise control of an energy storage system and allows online optimization of algorithm performance during the real-time operation of the residential energy model. Meanwhile, the target network is introduced during the training process to make the training smoother and more efficient. We conducted experimental comparisons with the benchmark method using simulated and real data to verify its applicability and performance. The results confirm the method's excellent performance and generalization capabilities, as well as its excellence in increasing renewable energy utilization and extending equipment life.

引用次数: 0

Soft Resource Slicing for Industrial Mixed Traffic in 5G Networks

IF 15.3 1区计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS

Ieee-Caa Journal of Automatica Sinica

Pub Date : 2025-01-20 DOI: 10.1109/JAS.2024.124761

Jingfang Ding;Meng Zheng;Haibin Yu

This letter proposes a dynamic switching soft slicing strategy for industrial mixed traffic in 5G networks. Considering two types of traffic, periodic delay-sensitive (PDS) traffic and sporadic delay-tolerant (SDT) traffic, we design a dynamic switching strategy based on a traffic-QoS-aware soft slicing (TQASS) scheme and a resource-efficiency-aware soft slicing (REASS) scheme. The proposed strategy ensures the reliability of PDS traffic under delay constraints, while dynamically allocating remaining resources to SDT traffic. Simulation results show that the proposed soft slicing strategy out-performs existing works in meeting the strict QoS requirements of industrial mixed traffic.

引用次数: 0

DI-YOLOv5: An Improved Dual-Wavelet-Based YOLOv5 for Dense Small Object Detection

IF 15.3 1区计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS

Ieee-Caa Journal of Automatica Sinica

Pub Date : 2025-01-20 DOI: 10.1109/JAS.2024.124368

Zi-Xin Li;Yu-Long Wang;Fei Wang

This letter focuses on the fact that small objects with few pixels disappear in feature maps with large receptive fields, as the network deepens, in object detection tasks. Therefore, the detection of dense small objects is challenging. A DI-YOLOv5 object detection algorithm is proposed. Specifically, a dual-wavelet convolution module (DWCM), which contains DWT_Conv and IWT_Conv, is proposed to reduce the loss of feature map information while obtaining feature maps with a large receptive field. The DWT _ Conv and IWT _ Conv can be used as replacements for downsampling and upsampling operations. Moreover, in the process of information transmission to the deep layer, a CSPCoA module is proposed to further capture the location information and information dependencies in different spatial directions. DWCM and CSPCoA are single, generic, plug-and-play units. We propose DI-YOLOv5 with YOLOv5 [1] as the baseline, and extensively evaluate the performance of these two modules on small object detection. Experiments demonstrate that DI-YOLOv5 can effectively improve the accuracy of object detection.

{"title":"DI-YOLOv5: An Improved Dual-Wavelet-Based YOLOv5 for Dense Small Object Detection","authors":"Zi-Xin Li;Yu-Long Wang;Fei Wang","doi":"10.1109/JAS.2024.124368","DOIUrl":"https://doi.org/10.1109/JAS.2024.124368","url":null,"abstract":"This letter focuses on the fact that small objects with few pixels disappear in feature maps with large receptive fields, as the network deepens, in object detection tasks. Therefore, the detection of dense small objects is challenging. A DI-YOLOv5 object detection algorithm is proposed. Specifically, a dual-wavelet convolution module (DWCM), which contains DWT_Conv and IWT_Conv, is proposed to reduce the loss of feature map information while obtaining feature maps with a large receptive field. The DWT _ Conv and IWT _ Conv can be used as replacements for downsampling and upsampling operations. Moreover, in the process of information transmission to the deep layer, a CSPCoA module is proposed to further capture the location information and information dependencies in different spatial directions. DWCM and CSPCoA are single, generic, plug-and-play units. We propose DI-YOLOv5 with YOLOv5 [1] as the baseline, and extensively evaluate the performance of these two modules on small object detection. Experiments demonstrate that DI-YOLOv5 can effectively improve the accuracy of object detection.","PeriodicalId":54230,"journal":{"name":"Ieee-Caa Journal of Automatica Sinica","volume":"12 2","pages":"457-459"},"PeriodicalIF":15.3,"publicationDate":"2025-01-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10846924","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143106574","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Decentralized Federated Learning Algorithm Under Adversary Eavesdropping

IF 15.3 1区计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS

Ieee-Caa Journal of Automatica Sinica

Pub Date : 2025-01-20 DOI: 10.1109/JAS.2024.125079

Lei Xu;Danya Xu;Xinlei Yi;Chao Deng;Tianyou Chai;Tao Yang

In this paper, we study the decentralized federated learning problem, which involves the collaborative training of a global model among multiple devices while ensuring data privacy. In classical federated learning, the communication channel between the devices poses a potential risk of compromising private information. To reduce the risk of adversary eavesdropping in the communication channel, we propose TRADE (transmit difference weight) concept. This concept replaces the decentralized federated learning algorithm's transmitted weight parameters with differential weight parameters, enhancing the privacy data against eavesdropping. Subsequently, by integrating the TRADE concept with the primal-dual stochastic gradient descent (SGD) algorithm, we propose a decentralized TRADE primal-dual SGD algorithm. We demonstrate that our proposed algorithm's convergence properties are the same as those of the primal-dual SGD algorithm while providing enhanced privacy protection. We validate the algorithm's performance on fault diagnosis task using the Case Western Reserve University dataset, and image classification tasks using the CIFAR-10 and CIFAR-100 datasets, revealing model accuracy comparable to centralized federated learning. Additionally, the experiments confirm the algorithm's privacy protection capability.

{"title":"Decentralized Federated Learning Algorithm Under Adversary Eavesdropping","authors":"Lei Xu;Danya Xu;Xinlei Yi;Chao Deng;Tianyou Chai;Tao Yang","doi":"10.1109/JAS.2024.125079","DOIUrl":"https://doi.org/10.1109/JAS.2024.125079","url":null,"abstract":"In this paper, we study the decentralized federated learning problem, which involves the collaborative training of a global model among multiple devices while ensuring data privacy. In classical federated learning, the communication channel between the devices poses a potential risk of compromising private information. To reduce the risk of adversary eavesdropping in the communication channel, we propose TRADE (transmit difference weight) concept. This concept replaces the decentralized federated learning algorithm's transmitted weight parameters with differential weight parameters, enhancing the privacy data against eavesdropping. Subsequently, by integrating the TRADE concept with the primal-dual stochastic gradient descent (SGD) algorithm, we propose a decentralized TRADE primal-dual SGD algorithm. We demonstrate that our proposed algorithm's convergence properties are the same as those of the primal-dual SGD algorithm while providing enhanced privacy protection. We validate the algorithm's performance on fault diagnosis task using the Case Western Reserve University dataset, and image classification tasks using the CIFAR-10 and CIFAR-100 datasets, revealing model accuracy comparable to centralized federated learning. Additionally, the experiments confirm the algorithm's privacy protection capability.","PeriodicalId":54230,"journal":{"name":"Ieee-Caa Journal of Automatica Sinica","volume":"12 2","pages":"448-456"},"PeriodicalIF":15.3,"publicationDate":"2025-01-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143106699","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

PromptFusion: Harmonized Semantic Prompt Learning for Infrared and Visible Image Fusion

IF 15.3 1区计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS

Ieee-Caa Journal of Automatica Sinica

Pub Date : 2024-12-24 DOI: 10.1109/JAS.2024.124878

Jinyuan Liu;Xingyuan Li;Zirui Wang;Zhiying Jiang;Wei Zhong;Wei Fan;Bin Xu

The goal of infrared and visible image fusion (IVIF) is to integrate the unique advantages of both modalities to achieve a more comprehensive understanding of a scene. However, existing methods struggle to effectively handle modal disparities, resulting in visual degradation of the details and prominent targets of the fused images. To address these challenges, we introduce PromptFusion, a prompt-based approach that harmoniously combines multi-modality images under the guidance of semantic prompts. Firstly, to better characterize the features of different modalities, a contourlet autoencoder is designed to separate and extract the high-/low-frequency components of different modalities, thereby improving the extraction of fine details and textures. We also introduce a prompt learning mechanism using positive and negative prompts, leveraging Vision-Language Models to improve the fusion model's understanding and identification of targets in multi-modality images, leading to improved performance in downstream tasks. Furthermore, we employ bi-level asymptotic convergence optimization. This approach simplifies the intricate non-singleton non-convex bi-level problem into a series of convergent and differentiable single optimization problems that can be effectively resolved through gradient descent. Our approach advances the state-of-the-art, delivering superior fusion quality and boosting the performance of related downstream tasks. Project page: https://github.com/hey-it-s-me/PromptFusion.

{"title":"PromptFusion: Harmonized Semantic Prompt Learning for Infrared and Visible Image Fusion","authors":"Jinyuan Liu;Xingyuan Li;Zirui Wang;Zhiying Jiang;Wei Zhong;Wei Fan;Bin Xu","doi":"10.1109/JAS.2024.124878","DOIUrl":"https://doi.org/10.1109/JAS.2024.124878","url":null,"abstract":"The goal of infrared and visible image fusion (IVIF) is to integrate the unique advantages of both modalities to achieve a more comprehensive understanding of a scene. However, existing methods struggle to effectively handle modal disparities, resulting in visual degradation of the details and prominent targets of the fused images. To address these challenges, we introduce PromptFusion, a prompt-based approach that harmoniously combines multi-modality images under the guidance of semantic prompts. Firstly, to better characterize the features of different modalities, a contourlet autoencoder is designed to separate and extract the high-/low-frequency components of different modalities, thereby improving the extraction of fine details and textures. We also introduce a prompt learning mechanism using positive and negative prompts, leveraging Vision-Language Models to improve the fusion model's understanding and identification of targets in multi-modality images, leading to improved performance in downstream tasks. Furthermore, we employ bi-level asymptotic convergence optimization. This approach simplifies the intricate non-singleton non-convex bi-level problem into a series of convergent and differentiable single optimization problems that can be effectively resolved through gradient descent. Our approach advances the state-of-the-art, delivering superior fusion quality and boosting the performance of related downstream tasks. Project page: https://github.com/hey-it-s-me/PromptFusion.","PeriodicalId":54230,"journal":{"name":"Ieee-Caa Journal of Automatica Sinica","volume":"12 3","pages":"502-515"},"PeriodicalIF":15.3,"publicationDate":"2024-12-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143535534","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

An Online Exploratory Maximum Likelihood Estimation Approach to Adaptive Kalman Filtering 一种自适应卡尔曼滤波的在线探索性极大似然估计方法

IF 15.3 1区计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS

Ieee-Caa Journal of Automatica Sinica

Pub Date : 2024-12-24 DOI: 10.1109/JAS.2024.125001

Jiajun Cheng;Haonan Chen;Zhirui Xue;Yulong Huang;Yonggang Zhang

Over the past few decades, numerous adaptive Kalman filters (AKFs) have been proposed. However, achieving online estimation with both high estimation accuracy and fast convergence speed is challenging, especially when both the process noise and measurement noise covariance matrices are relatively inaccurate. Maximum likelihood estimation (MLE) possesses the potential to achieve this goal, since its theoretical accuracy is guaranteed by asymptotic optimality and the convergence speed is fast due to weak dependence on accurate state estimation. Unfortunately, the maximum likelihood cost function is so intricate that the existing MLE methods can only simply ignore all historical measurement information to achieve online estimation, which cannot adequately realize the potential of MLE. In order to design online MLE-based AKFs with high estimation accuracy and fast convergence speed, an online exploratory MLE approach is proposed, based on which a mini-batch coordinate descent noise covariance matrix estimation framework is developed. In this framework, the maximum likelihood cost function is simplified for online estimation with fewer and simpler terms which are selected in a mini-batch and calculated with a backtracking method. This maximum likelihood cost function is sidestepped and solved by exploring possible estimated noise covariance matrices adaptively while the historical measurement information is adequately utilized. Furthermore, four specific algorithms are derived under this framework to meet different practical requirements in terms of convergence speed, estimation accuracy, and calculation load. Abundant simulations and experiments are carried out to verify the validity and superiority of the proposed algorithms as compared with existing state-of-the-art AKFs.

在过去的几十年里，人们提出了许多自适应卡尔曼滤波器（akf）。然而，实现高估计精度和快速收敛速度的在线估计是具有挑战性的，特别是当过程噪声和测量噪声协方差矩阵都相对不准确时。极大似然估计（MLE）具有渐近最优性保证理论精度和对精确状态估计依赖性弱收敛速度快的特点，具有实现这一目标的潜力。遗憾的是，极大似然代价函数过于复杂，现有的最大似然代价函数方法只能简单地忽略所有历史测量信息来实现在线估计，无法充分发挥最大似然代价函数的潜力。为了设计在线估计精度高、收敛速度快的基于MLE的akf，提出了一种在线探索性MLE方法，并在此基础上开发了小批量坐标下降噪声协方差矩阵估计框架。在该框架中，最大似然代价函数简化为在线估计，在小批量中选择更少和更简单的项，并使用回溯方法计算。在充分利用历史测量信息的情况下，通过自适应探索可能估计的噪声协方差矩阵来回避最大似然代价函数。在此框架下，推导出四种具体的算法，以满足不同的收敛速度、估计精度和计算量的实际要求。进行了大量的仿真和实验，以验证所提出算法与现有最先进的akf相比的有效性和优越性。

{"title":"An Online Exploratory Maximum Likelihood Estimation Approach to Adaptive Kalman Filtering","authors":"Jiajun Cheng;Haonan Chen;Zhirui Xue;Yulong Huang;Yonggang Zhang","doi":"10.1109/JAS.2024.125001","DOIUrl":"https://doi.org/10.1109/JAS.2024.125001","url":null,"abstract":"Over the past few decades, numerous adaptive Kalman filters (AKFs) have been proposed. However, achieving online estimation with both high estimation accuracy and fast convergence speed is challenging, especially when both the process noise and measurement noise covariance matrices are relatively inaccurate. Maximum likelihood estimation (MLE) possesses the potential to achieve this goal, since its theoretical accuracy is guaranteed by asymptotic optimality and the convergence speed is fast due to weak dependence on accurate state estimation. Unfortunately, the maximum likelihood cost function is so intricate that the existing MLE methods can only simply ignore all historical measurement information to achieve online estimation, which cannot adequately realize the potential of MLE. In order to design online MLE-based AKFs with high estimation accuracy and fast convergence speed, an online exploratory MLE approach is proposed, based on which a mini-batch coordinate descent noise covariance matrix estimation framework is developed. In this framework, the maximum likelihood cost function is simplified for online estimation with fewer and simpler terms which are selected in a mini-batch and calculated with a backtracking method. This maximum likelihood cost function is sidestepped and solved by exploring possible estimated noise covariance matrices adaptively while the historical measurement information is adequately utilized. Furthermore, four specific algorithms are derived under this framework to meet different practical requirements in terms of convergence speed, estimation accuracy, and calculation load. Abundant simulations and experiments are carried out to verify the validity and superiority of the proposed algorithms as compared with existing state-of-the-art AKFs.","PeriodicalId":54230,"journal":{"name":"Ieee-Caa Journal of Automatica Sinica","volume":"12 1","pages":"228-254"},"PeriodicalIF":15.3,"publicationDate":"2024-12-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142993536","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Time-Varying Formation Tracking Control of Heterogeneous Multi-Agent Systems With Intermittent Communications and Directed Switching Networks 具有间歇通信和定向交换网络的异构多智能体系统时变队形跟踪控制

IF 15.3 1区计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS

Ieee-Caa Journal of Automatica Sinica

Pub Date : 2024-12-24 DOI: 10.1109/JAS.2023.123924

Yuhan Wang;Zhuping Wang;Hao Zhang;Huaicheng Yan

Dear Editor, This letter is concerned with the problem of time-varying formation tracking for heterogeneous multiagent systems (MASs) under directed switching networks. For this purpose, our first step is to present some sufficient conditions for the exponential stability of a particular category of switched systems. Then, we apply the theoretical results to design a distributed observer for reference leader under directed switching topologies. Based on the above designed observer, a novel event-triggered distributed control protocol is proposed for each follower to achieve the desired formation. Finally, we demonstrate the effectiveness of our proposed results through numerical simulations.

这封信是关于有向交换网络下异构多智能体系统（MASs）的时变编队跟踪问题。为此，我们的第一步是给出一类特定切换系统指数稳定性的几个充分条件。然后，我们应用理论结果设计了有向交换拓扑下参考先导的分布式观测器。在上述观测器的基础上，提出了一种新的事件触发分布式控制协议，以实现对每个follower的期望编队。最后，通过数值模拟验证了所提结果的有效性。

引用次数: 0

Cas-FNE: Cascaded Face Normal Estimation Cas-FNE：级联人脸法线估计

IF 15.3 1区计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS

Ieee-Caa Journal of Automatica Sinica

Pub Date : 2024-11-21 DOI: 10.1109/JAS.2024.124899

Meng Wang;Jiawan Zhang;Jiayi Ma;Xiaojie Guo

Capturing high-fidelity normals from single face images plays a core role in numerous computer vision and graphics applications. Though significant progress has been made in recent years, how to effectively and efficiently explore normal priors remains challenging. Most existing approaches depend on the development of intricate network architectures and complex calculations for in-the-wild face images. To overcome the above issue, we propose a simple yet effective cascaded neural network, called Cas-Fne, which progressively boosts the quality of predicted normals with marginal model parameters and computational cost. Meanwhile, it can mitigate the imbalance issue between training data and real-world face images due to the progressive refinement mechanism, and thus boost the generalization ability of the model. Specifically, in the training phase, our model relies solely on a small amount of labeled data. The earlier prediction serves as guidance for following refinement. In addition, our shared-parameter cascaded block employs a recurrent mechanism, allowing it to be applied multiple times for optimization without increasing network parameters. Quantitative and qualitative evaluations on benchmark datasets are conducted to show that our Cas-FNE can faithfully maintain facial details and reveal its superiority over state-of-the-art methods. The code is available at https://github.com/AutoHDR/CasFNE.git.

从单张人脸图像中捕捉高保真法线在众多计算机视觉和图形应用中发挥着核心作用。尽管近年来已取得了重大进展，但如何有效、高效地探索法线先验仍然充满挑战。现有的大多数方法都依赖于开发复杂的网络架构和对野生人脸图像进行复杂的计算。为了克服上述问题，我们提出了一种简单而有效的级联神经网络，称为 Cas-Fne，它能在模型参数和计算成本微不足道的情况下逐步提高预测法线的质量。同时，由于渐进细化机制，它可以缓解训练数据与真实世界人脸图像之间的不平衡问题，从而提高模型的泛化能力。具体来说，在训练阶段，我们的模型仅依赖于少量的标记数据。先前的预测为后续的细化提供了指导。此外，我们的共享参数级联块采用了递归机制，允许在不增加网络参数的情况下多次应用于优化。在基准数据集上进行的定量和定性评估表明，我们的 Cas-FNE 可以忠实地保留面部细节，并显示出它优于最先进方法的地方。代码可在 https://github.com/AutoHDR/CasFNE.git 上获取。

{"title":"Cas-FNE: Cascaded Face Normal Estimation","authors":"Meng Wang;Jiawan Zhang;Jiayi Ma;Xiaojie Guo","doi":"10.1109/JAS.2024.124899","DOIUrl":"https://doi.org/10.1109/JAS.2024.124899","url":null,"abstract":"Capturing high-fidelity normals from single face images plays a core role in numerous computer vision and graphics applications. Though significant progress has been made in recent years, how to effectively and efficiently explore normal priors remains challenging. Most existing approaches depend on the development of intricate network architectures and complex calculations for in-the-wild face images. To overcome the above issue, we propose a simple yet effective cascaded neural network, called Cas-Fne, which progressively boosts the quality of predicted normals with marginal model parameters and computational cost. Meanwhile, it can mitigate the imbalance issue between training data and real-world face images due to the progressive refinement mechanism, and thus boost the generalization ability of the model. Specifically, in the training phase, our model relies solely on a small amount of labeled data. The earlier prediction serves as guidance for following refinement. In addition, our shared-parameter cascaded block employs a recurrent mechanism, allowing it to be applied multiple times for optimization without increasing network parameters. Quantitative and qualitative evaluations on benchmark datasets are conducted to show that our Cas-FNE can faithfully maintain facial details and reveal its superiority over state-of-the-art methods. The code is available at https://github.com/AutoHDR/CasFNE.git.","PeriodicalId":54230,"journal":{"name":"Ieee-Caa Journal of Automatica Sinica","volume":"11 12","pages":"2423-2434"},"PeriodicalIF":15.3,"publicationDate":"2024-11-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142679310","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

首页上一页

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

Ieee-Caa Journal of Automatica Sinica

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀