ViTAE-SL: A vision transformer-based autoencoder and spatial interpolation learner for field reconstruction

IF 3.4 2区物理与天体物理 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Computer Physics Communications Pub Date : 2024-12-16 DOI:10.1016/j.cpc.2024.109464

Hongwei Fan , Sibo Cheng , Audrey J. de Nazelle , Rossella Arcucci

{"title":"ViTAE-SL: A vision transformer-based autoencoder and spatial interpolation learner for field reconstruction","authors":"Hongwei Fan , Sibo Cheng , Audrey J. de Nazelle , Rossella Arcucci","doi":"10.1016/j.cpc.2024.109464","DOIUrl":null,"url":null,"abstract":"<div><div>Reliable and accurate reconstruction for large-scale and complex physical fields in real-time from limited observations has been a longstanding challenge. In recent years, sensors have been increasingly deployed in numerous physical systems. However, the locations of these sensors can shift over time, such as with mobile sensors, or when sensors are deployed and removed. These sparse and randomly located sensors further exacerbate the difficulty of reconstructing the physical field. In this paper, we present a new deep learning model called Vision Transformer-based Autoencoder (ViTAE) for reconstructing large-scale and complex fields. The proposed network structure is based on a novel core design: vision transformer encoder and Convolutional Neural Network (CNN) decoder. First, we split a two-dimensional field into patches and developed a vision transformer encoder to transfer patches into latent representations. We then reshape the linear latent representations to patches before concatenation, along with a CNN decoder, to reconstruct the field. The proposed model is tested in four different numerical experiments, using generated synthetic data, spatially distributed PM2.5 data, Computational Fluid Dynamics (CFD) simulation data and National Oceanic and Atmospheric Administration (NOAA) sea surface temperature data. The numerical results highlight the strength of ViTAE-SL compared to Kriging and state-of-the-art deep-learning models with significantly higher reconstruction accuracy, computational efficiency, and robust scaling behavior.</div></div>","PeriodicalId":285,"journal":{"name":"Computer Physics Communications","volume":"308 ","pages":"Article 109464"},"PeriodicalIF":3.4000,"publicationDate":"2024-12-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computer Physics Communications","FirstCategoryId":"101","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0010465524003874","RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}

引用次数: 0

Abstract

Reliable and accurate reconstruction for large-scale and complex physical fields in real-time from limited observations has been a longstanding challenge. In recent years, sensors have been increasingly deployed in numerous physical systems. However, the locations of these sensors can shift over time, such as with mobile sensors, or when sensors are deployed and removed. These sparse and randomly located sensors further exacerbate the difficulty of reconstructing the physical field. In this paper, we present a new deep learning model called Vision Transformer-based Autoencoder (ViTAE) for reconstructing large-scale and complex fields. The proposed network structure is based on a novel core design: vision transformer encoder and Convolutional Neural Network (CNN) decoder. First, we split a two-dimensional field into patches and developed a vision transformer encoder to transfer patches into latent representations. We then reshape the linear latent representations to patches before concatenation, along with a CNN decoder, to reconstruct the field. The proposed model is tested in four different numerical experiments, using generated synthetic data, spatially distributed PM2.5 data, Computational Fluid Dynamics (CFD) simulation data and National Oceanic and Atmospheric Administration (NOAA) sea surface temperature data. The numerical results highlight the strength of ViTAE-SL compared to Kriging and state-of-the-art deep-learning models with significantly higher reconstruction accuracy, computational efficiency, and robust scaling behavior.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

基于视觉变换的自编码器和空间插值学习器

从有限的观测数据中实时可靠、准确地重建大规模、复杂的物理场一直是一个长期存在的挑战。近年来，传感器越来越多地部署在许多物理系统中。然而，这些传感器的位置可以随着时间的推移而变化，例如移动传感器，或者当传感器被部署和移除时。这些稀疏且随机分布的传感器进一步加剧了物理场重建的难度。在本文中，我们提出了一种新的深度学习模型，称为基于视觉变换的自编码器（ViTAE），用于重建大规模和复杂的场。该网络结构基于一种新颖的核心设计：视觉变压器编码器和卷积神经网络（CNN）解码器。首先，我们将二维场分割成小块，并开发了视觉转换编码器，将小块转换成潜在表征。然后，我们在拼接之前将线性潜在表示重塑为补丁，并使用CNN解码器重建场。采用生成的合成数据、空间分布的PM2.5数据、计算流体动力学（CFD）模拟数据和美国国家海洋和大气管理局（NOAA）的海面温度数据，对所提出的模型进行了四种不同的数值实验。数值结果表明，与Kriging和最先进的深度学习模型相比，ViTAE-SL具有更高的重建精度、计算效率和鲁棒的缩放行为。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Computer Physics Communications 物理-计算机：跨学科应用

CiteScore

12.10

自引率

3.20%

发文量

287

审稿时长

5.3 months

期刊介绍： The focus of CPC is on contemporary computational methods and techniques and their implementation, the effectiveness of which will normally be evidenced by the author(s) within the context of a substantive problem in physics. Within this setting CPC publishes two types of paper. Computer Programs in Physics (CPiP) These papers describe significant computer programs to be archived in the CPC Program Library which is held in the Mendeley Data repository. The submitted software must be covered by an approved open source licence. Papers and associated computer programs that address a problem of contemporary interest in physics that cannot be solved by current software are particularly encouraged. Computational Physics Papers (CP) These are research papers in, but are not limited to, the following themes across computational physics and related disciplines. mathematical and numerical methods and algorithms; computational models including those associated with the design, control and analysis of experiments; and algebraic computation. Each will normally include software implementation and performance details. The software implementation should, ideally, be available via GitHub, Zenodo or an institutional repository.In addition, research papers on the impact of advanced computer architecture and special purpose computers on computing in the physical sciences and software topics related to, and of importance in, the physical sciences may be considered.