Seq2seq modelling for cross-site temporal forecasting of urban air pollutant concentrations leveraging sensor data

IF 7.6 1区 工程技术 Q1 CONSTRUCTION & BUILDING TECHNOLOGY Building and Environment Pub Date : 2025-02-01 DOI:10.1016/j.buildenv.2024.112463
Jiading Zhong, Jianlin Liu
{"title":"Seq2seq modelling for cross-site temporal forecasting of urban air pollutant concentrations leveraging sensor data","authors":"Jiading Zhong,&nbsp;Jianlin Liu","doi":"10.1016/j.buildenv.2024.112463","DOIUrl":null,"url":null,"abstract":"<div><div>Urban air pollution presents significant health risks, requiring effective monitoring, forecasting and controlling strategies. Comprehensive monitoring is often hindered by the limited availability of measurement data. This study introduces a seq2seq model designed to perform operational forecasting of air pollutant concentrations at an unmonitored site using an upwind sensor. The effectiveness of seq2seq model is systematically evaluated through test cases that aim to explore effects of several influencing factors, including network architecture, embedding method, model complexity, and sensor placement. The test cases involve 252 seq2seq model candidates, which are trained and tested on a synthetic dataset established using a validated large eddy simulation (LES) model for the typical street canyon urban setting, ensuring controlled conditions and reproducibility. Additionally, scheduled sampling is used during model training to mitigate error accumulation. Results demonstrate that a decoder-only model is only capable of making flatline predictions, while a well-tuned seq2seq model informed by a strategically placed upwind sensor provides reasonable operational predictions. Despite its simplicity, the linear network, using the positional embedding, a two-layer structure, and the sensor placed 0.17<em>H</em> above the ground, exhibits the best performance among seq2seq models. The study also challenges a priori belief that favors higher sensor locations based on statistical similarity measures, as the sensor at 0.17<em>H</em> enables the best performing model. These findings underscore the potential of seq2seq models to enhance urban air quality monitoring, offering a robust scientific basis for informed urban planning and pollution management strategies.</div></div>","PeriodicalId":9273,"journal":{"name":"Building and Environment","volume":"269 ","pages":"Article 112463"},"PeriodicalIF":7.6000,"publicationDate":"2025-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Building and Environment","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0360132324013040","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CONSTRUCTION & BUILDING TECHNOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

Urban air pollution presents significant health risks, requiring effective monitoring, forecasting and controlling strategies. Comprehensive monitoring is often hindered by the limited availability of measurement data. This study introduces a seq2seq model designed to perform operational forecasting of air pollutant concentrations at an unmonitored site using an upwind sensor. The effectiveness of seq2seq model is systematically evaluated through test cases that aim to explore effects of several influencing factors, including network architecture, embedding method, model complexity, and sensor placement. The test cases involve 252 seq2seq model candidates, which are trained and tested on a synthetic dataset established using a validated large eddy simulation (LES) model for the typical street canyon urban setting, ensuring controlled conditions and reproducibility. Additionally, scheduled sampling is used during model training to mitigate error accumulation. Results demonstrate that a decoder-only model is only capable of making flatline predictions, while a well-tuned seq2seq model informed by a strategically placed upwind sensor provides reasonable operational predictions. Despite its simplicity, the linear network, using the positional embedding, a two-layer structure, and the sensor placed 0.17H above the ground, exhibits the best performance among seq2seq models. The study also challenges a priori belief that favors higher sensor locations based on statistical similarity measures, as the sensor at 0.17H enables the best performing model. These findings underscore the potential of seq2seq models to enhance urban air quality monitoring, offering a robust scientific basis for informed urban planning and pollution management strategies.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
利用传感器数据进行城市空气污染物浓度跨站点时间预测的Seq2seq模型
城市空气污染带来重大健康风险,需要有效的监测、预测和控制战略。测量数据的有限可用性往往阻碍了全面监测。本研究介绍了一个seq2seq模型,该模型旨在使用逆风传感器对未监测站点的空气污染物浓度进行操作预测。通过测试用例系统地评估了seq2seq模型的有效性,这些测试用例旨在探讨网络架构、嵌入方法、模型复杂性和传感器放置等几个影响因素的影响。测试用例涉及252个seq2seq候选模型,这些模型在一个合成数据集上进行训练和测试,该数据集使用经过验证的大涡模拟(LES)模型建立,用于典型的街道峡谷城市环境,以确保受控条件和可重复性。此外,在模型训练期间使用预定采样来减少误差积累。结果表明,只有解码器的模型只能做出平坦线预测,而经过精心调整的seq2seq模型,由战略放置的逆风传感器提供合理的操作预测。线性网络采用位置嵌入,采用两层结构,传感器放置在离地面0.17H的位置,虽然简单,但在seq2seq模型中表现出最好的性能。该研究还挑战了一种先验的信念,即基于统计相似性度量的更高传感器位置,因为0.17H的传感器使模型表现最佳。这些发现强调了seq2seq模型在加强城市空气质量监测方面的潜力,为明智的城市规划和污染管理策略提供了强有力的科学依据。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Building and Environment
Building and Environment 工程技术-工程:环境
CiteScore
12.50
自引率
23.00%
发文量
1130
审稿时长
27 days
期刊介绍: Building and Environment, an international journal, is dedicated to publishing original research papers, comprehensive review articles, editorials, and short communications in the fields of building science, urban physics, and human interaction with the indoor and outdoor built environment. The journal emphasizes innovative technologies and knowledge verified through measurement and analysis. It covers environmental performance across various spatial scales, from cities and communities to buildings and systems, fostering collaborative, multi-disciplinary research with broader significance.
期刊最新文献
A Kriging-Markov hybrid method for real-time spatiotemporal indoor daylight prediction under data-sparse and sensor-limited conditions toward adaptive daylight control applications Temporal myopia in building Life Cycle Assessment? Granular versus coarse dynamics in climate change and grid decarbonisation A novel compatible fluid connector for liquid cooling system in data center BIM-based wind-driven rain modelling using ISO 15927-3 and geometry-based exposure Ten questions concerning participant engagement in building energy research
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1