ViTAE-SL: A vision transformer-based autoencoder and spatial interpolation learner for field reconstruction

IF 3.4 2区 物理与天体物理 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Computer Physics Communications Pub Date : 2024-12-16 DOI:10.1016/j.cpc.2024.109464
Hongwei Fan , Sibo Cheng , Audrey J. de Nazelle , Rossella Arcucci
{"title":"ViTAE-SL: A vision transformer-based autoencoder and spatial interpolation learner for field reconstruction","authors":"Hongwei Fan ,&nbsp;Sibo Cheng ,&nbsp;Audrey J. de Nazelle ,&nbsp;Rossella Arcucci","doi":"10.1016/j.cpc.2024.109464","DOIUrl":null,"url":null,"abstract":"<div><div>Reliable and accurate reconstruction for large-scale and complex physical fields in real-time from limited observations has been a longstanding challenge. In recent years, sensors have been increasingly deployed in numerous physical systems. However, the locations of these sensors can shift over time, such as with mobile sensors, or when sensors are deployed and removed. These sparse and randomly located sensors further exacerbate the difficulty of reconstructing the physical field. In this paper, we present a new deep learning model called Vision Transformer-based Autoencoder (ViTAE) for reconstructing large-scale and complex fields. The proposed network structure is based on a novel core design: vision transformer encoder and Convolutional Neural Network (CNN) decoder. First, we split a two-dimensional field into patches and developed a vision transformer encoder to transfer patches into latent representations. We then reshape the linear latent representations to patches before concatenation, along with a CNN decoder, to reconstruct the field. The proposed model is tested in four different numerical experiments, using generated synthetic data, spatially distributed PM2.5 data, Computational Fluid Dynamics (CFD) simulation data and National Oceanic and Atmospheric Administration (NOAA) sea surface temperature data. The numerical results highlight the strength of ViTAE-SL compared to Kriging and state-of-the-art deep-learning models with significantly higher reconstruction accuracy, computational efficiency, and robust scaling behavior.</div></div>","PeriodicalId":285,"journal":{"name":"Computer Physics Communications","volume":"308 ","pages":"Article 109464"},"PeriodicalIF":3.4000,"publicationDate":"2024-12-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computer Physics Communications","FirstCategoryId":"101","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0010465524003874","RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
引用次数: 0

Abstract

Reliable and accurate reconstruction for large-scale and complex physical fields in real-time from limited observations has been a longstanding challenge. In recent years, sensors have been increasingly deployed in numerous physical systems. However, the locations of these sensors can shift over time, such as with mobile sensors, or when sensors are deployed and removed. These sparse and randomly located sensors further exacerbate the difficulty of reconstructing the physical field. In this paper, we present a new deep learning model called Vision Transformer-based Autoencoder (ViTAE) for reconstructing large-scale and complex fields. The proposed network structure is based on a novel core design: vision transformer encoder and Convolutional Neural Network (CNN) decoder. First, we split a two-dimensional field into patches and developed a vision transformer encoder to transfer patches into latent representations. We then reshape the linear latent representations to patches before concatenation, along with a CNN decoder, to reconstruct the field. The proposed model is tested in four different numerical experiments, using generated synthetic data, spatially distributed PM2.5 data, Computational Fluid Dynamics (CFD) simulation data and National Oceanic and Atmospheric Administration (NOAA) sea surface temperature data. The numerical results highlight the strength of ViTAE-SL compared to Kriging and state-of-the-art deep-learning models with significantly higher reconstruction accuracy, computational efficiency, and robust scaling behavior.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于视觉变换的自编码器和空间插值学习器
从有限的观测数据中实时可靠、准确地重建大规模、复杂的物理场一直是一个长期存在的挑战。近年来,传感器越来越多地部署在许多物理系统中。然而,这些传感器的位置可以随着时间的推移而变化,例如移动传感器,或者当传感器被部署和移除时。这些稀疏且随机分布的传感器进一步加剧了物理场重建的难度。在本文中,我们提出了一种新的深度学习模型,称为基于视觉变换的自编码器(ViTAE),用于重建大规模和复杂的场。该网络结构基于一种新颖的核心设计:视觉变压器编码器和卷积神经网络(CNN)解码器。首先,我们将二维场分割成小块,并开发了视觉转换编码器,将小块转换成潜在表征。然后,我们在拼接之前将线性潜在表示重塑为补丁,并使用CNN解码器重建场。采用生成的合成数据、空间分布的PM2.5数据、计算流体动力学(CFD)模拟数据和美国国家海洋和大气管理局(NOAA)的海面温度数据,对所提出的模型进行了四种不同的数值实验。数值结果表明,与Kriging和最先进的深度学习模型相比,ViTAE-SL具有更高的重建精度、计算效率和鲁棒的缩放行为。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Computer Physics Communications
Computer Physics Communications 物理-计算机:跨学科应用
CiteScore
12.10
自引率
3.20%
发文量
287
审稿时长
5.3 months
期刊介绍: The focus of CPC is on contemporary computational methods and techniques and their implementation, the effectiveness of which will normally be evidenced by the author(s) within the context of a substantive problem in physics. Within this setting CPC publishes two types of paper. Computer Programs in Physics (CPiP) These papers describe significant computer programs to be archived in the CPC Program Library which is held in the Mendeley Data repository. The submitted software must be covered by an approved open source licence. Papers and associated computer programs that address a problem of contemporary interest in physics that cannot be solved by current software are particularly encouraged. Computational Physics Papers (CP) These are research papers in, but are not limited to, the following themes across computational physics and related disciplines. mathematical and numerical methods and algorithms; computational models including those associated with the design, control and analysis of experiments; and algebraic computation. Each will normally include software implementation and performance details. The software implementation should, ideally, be available via GitHub, Zenodo or an institutional repository.In addition, research papers on the impact of advanced computer architecture and special purpose computers on computing in the physical sciences and software topics related to, and of importance in, the physical sciences may be considered.
期刊最新文献
DTLreactingFoam: An efficient CFD tool for laminar reacting flow simulations using detailed chemistry and transport with time-correlated thermophysical properties A three-dimensional multi-phase-field vesicles model and its practical finite difference solver Local reduced-order modeling for electrostatic plasmas by physics-informed solution manifold decomposition Scalable neural network driven molecular dynamics simulation Quantum lattice boltzmann method for multiple time steps without reinitialization for linear advection-Diffusion problems
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1