Online monitoring of air quality using PCA-based sequential learning

Xiulin Xie, Nicole Qian, Peihua Qiu
{"title":"Online monitoring of air quality using PCA-based sequential learning","authors":"Xiulin Xie, Nicole Qian, Peihua Qiu","doi":"10.1214/23-aoas1803","DOIUrl":null,"url":null,"abstract":"Air pollution surveillance is critically important for public health. One air pollutant, ozone, is extremely challenging to analyze properly, as it is a secondary pollutant caused by complex chemical reactions in the air, and does not emit directly into the atmosphere. Numerous environmental studies confirm that ozone concentration levels are associated with meteorological conditions, and long-term exposure to high ozone concentration levels is associated with the incidence of many diseases, including asthma, respiratory, and cardiovascular diseases. Thus, it is important to develop an air pollution surveillance system to collect both air pollution and meteorological data and monitor the data continuously over time. To this end, statistical process control (SPC) charts provide a major statistical tool. But, most existing SPC charts are designed for cases when the in-control (IC) process observations at different times are assumed to be independent and identically distributed. The air pollution and meteorological data would not satisfy these conditions due to serial data correlation, high dimensionality, seasonality, and other complex data structure. Motivated by an application to monitor the ground ozone concentration levels in the Houston-Galveston-Brazoria (HGB) area, we developed a new process monitoring method using principal component analysis and sequential learning. The new method can accommodate high dimensionality, time-varying IC process distribution, serial data correlation, and non-parametric data distribution. It is shown to be a reliable analytic tool for on-line monitoring of air quality.","PeriodicalId":188068,"journal":{"name":"The Annals of Applied Statistics","volume":"121 23","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"The Annals of Applied Statistics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1214/23-aoas1803","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Air pollution surveillance is critically important for public health. One air pollutant, ozone, is extremely challenging to analyze properly, as it is a secondary pollutant caused by complex chemical reactions in the air, and does not emit directly into the atmosphere. Numerous environmental studies confirm that ozone concentration levels are associated with meteorological conditions, and long-term exposure to high ozone concentration levels is associated with the incidence of many diseases, including asthma, respiratory, and cardiovascular diseases. Thus, it is important to develop an air pollution surveillance system to collect both air pollution and meteorological data and monitor the data continuously over time. To this end, statistical process control (SPC) charts provide a major statistical tool. But, most existing SPC charts are designed for cases when the in-control (IC) process observations at different times are assumed to be independent and identically distributed. The air pollution and meteorological data would not satisfy these conditions due to serial data correlation, high dimensionality, seasonality, and other complex data structure. Motivated by an application to monitor the ground ozone concentration levels in the Houston-Galveston-Brazoria (HGB) area, we developed a new process monitoring method using principal component analysis and sequential learning. The new method can accommodate high dimensionality, time-varying IC process distribution, serial data correlation, and non-parametric data distribution. It is shown to be a reliable analytic tool for on-line monitoring of air quality.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
利用基于 PCA 的序列学习对空气质量进行在线监测
空气污染监测对公众健康至关重要。有一种空气污染物--臭氧--极难进行正确分析,因为它是由空气中复杂的化学反应引起的二次污染物,不会直接排放到大气中。大量环境研究证实,臭氧浓度水平与气象条件有关,长期暴露在高浓度臭氧环境中与多种疾病的发病率有关,包括哮喘、呼吸道疾病和心血管疾病。因此,必须开发一个空气污染监测系统,收集空气污染和气象数据,并对数据进行长期连续监测。为此,统计过程控制 (SPC) 图表提供了一个重要的统计工具。但是,现有的大多数 SPC 图表都是针对假定不同时间的在控 (IC) 过程观测值是独立且同分布的情况而设计的。由于串行数据相关性、高维度、季节性和其他复杂的数据结构,空气污染和气象数据无法满足这些条件。受侯斯顿-加尔维斯顿-布拉佐里亚(HGB)地区地面臭氧浓度水平监测应用的启发,我们利用主成分分析和序列学习开发了一种新的过程监测方法。新方法可以适应高维度、时变 IC 过程分布、序列数据相关性和非参数数据分布。结果表明,它是在线监测空气质量的可靠分析工具。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
A hierarchical spline model for correcting and hindcasting temperature data Risk-aware restricted outcome learning for individualized treatment regimes of schizophrenia A framework for analysing longitudinal data involving time-varying covariates How are PreLaunch online movie reviews related to box office revenues? Tensor regression for incomplete observations with application to longitudinal studies
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1