{"title":"A zero-inflated model for spatiotemporal count data with extra zeros: application to 1950–2015 tornado data in Kansas","authors":"Hong-Ding Yang, Audrey Chang, Wei-Wen Hsu, Chun-Shu Chen","doi":"10.1007/s10651-023-00586-3","DOIUrl":null,"url":null,"abstract":"<p>In many tornado climate studies, the number of tornado touchdowns is often the primary outcome of interest. These outcome measures are usually generated under a spatiotemporal correlation structure and contains many zeros due to the rarity of tornado occurrence at a specific location and time interval. To model the spatiotemporal count data with excess zeros, we propose a spatiotemporal zero-inflated Poisson (ZIP) model, which lends itself to ease of interpretation and computational simplicity. Technically, we embed a modified conditional autoregressive model in the ZIP model to describe the spatial and temporal correlations, where the probability of a pure zero in the ZIP is purposely designed to depend on locations but independent of time. Illustrated with the longitudinal tornado touchdown data in the state of Kansas from 1950 to 2015, our model suggests that the spatial correlation among the counties and the corresponding temperature are significant factors attributed to the tornado touchdowns. Through the model, we can also estimate the probabilities of no tornado touchdowns for each county over time. These estimated probabilities substantially help us understand the pattern of touchdowns and further identify the risk areas across Kansas. Moreover, these estimates can be iteratively updated when more current touchdown data are available. The final model for Kansas tornado touchdown data is evaluated using more recent data.</p>","PeriodicalId":50519,"journal":{"name":"Environmental and Ecological Statistics","volume":"78 1","pages":""},"PeriodicalIF":3.0000,"publicationDate":"2023-12-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Environmental and Ecological Statistics","FirstCategoryId":"93","ListUrlMain":"https://doi.org/10.1007/s10651-023-00586-3","RegionNum":4,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENVIRONMENTAL SCIENCES","Score":null,"Total":0}
引用次数: 0
Abstract
In many tornado climate studies, the number of tornado touchdowns is often the primary outcome of interest. These outcome measures are usually generated under a spatiotemporal correlation structure and contains many zeros due to the rarity of tornado occurrence at a specific location and time interval. To model the spatiotemporal count data with excess zeros, we propose a spatiotemporal zero-inflated Poisson (ZIP) model, which lends itself to ease of interpretation and computational simplicity. Technically, we embed a modified conditional autoregressive model in the ZIP model to describe the spatial and temporal correlations, where the probability of a pure zero in the ZIP is purposely designed to depend on locations but independent of time. Illustrated with the longitudinal tornado touchdown data in the state of Kansas from 1950 to 2015, our model suggests that the spatial correlation among the counties and the corresponding temperature are significant factors attributed to the tornado touchdowns. Through the model, we can also estimate the probabilities of no tornado touchdowns for each county over time. These estimated probabilities substantially help us understand the pattern of touchdowns and further identify the risk areas across Kansas. Moreover, these estimates can be iteratively updated when more current touchdown data are available. The final model for Kansas tornado touchdown data is evaluated using more recent data.
在许多龙卷风气候研究中,龙卷风触地次数往往是主要的研究结果。这些结果度量通常是在时空相关结构下产生的,由于龙卷风在特定地点和时间间隔内发生的罕见性,这些结果度量包含许多零。为了对零点过多的时空计数数据建模,我们提出了时空零膨胀泊松(ZIP)模型,该模型易于解释,计算简单。从技术上讲,我们在 ZIP 模型中嵌入了一个修正的条件自回归模型,以描述空间和时间相关性,其中 ZIP 中出现纯零的概率特意设计为与地点相关,但与时间无关。以堪萨斯州 1950 年至 2015 年的龙卷风触地纵向数据为例,我们的模型表明,各县之间的空间相关性和相应的温度是导致龙卷风触地的重要因素。通过该模型,我们还可以估算出每个县在不同时期没有龙卷风触地的概率。这些估算出的概率极大地帮助我们了解龙卷风触地的模式,并进一步确定堪萨斯州的风险区域。此外,当获得更多最新触地数据时,这些估计值还可以不断更新。堪萨斯州龙卷风触地数据的最终模型将使用更新的数据进行评估。
期刊介绍:
Environmental and Ecological Statistics publishes papers on practical applications of statistics and related quantitative methods to environmental science addressing contemporary issues.
Emphasis is on applied mathematical statistics, statistical methodology, and data interpretation and improvement for future use, with a view to advance statistics for environment, ecology and environmental health, and to advance environmental theory and practice using valid statistics.
Besides clarity of exposition, a single most important criterion for publication is the appropriateness of the statistical method to the particular environmental problem. The Journal covers all aspects of the collection, analysis, presentation and interpretation of environmental data for research, policy and regulation. The Journal is cross-disciplinary within the context of contemporary environmental issues and the associated statistical tools, concepts and methods. The Journal broadly covers theory and methods, case studies and applications, environmental change and statistical ecology, environmental health statistics and stochastics, and related areas. Special features include invited discussion papers; research communications; technical notes and consultation corner; mini-reviews; letters to the Editor; news, views and announcements; hardware and software reviews; data management etc.