A Machine Learning Approach to Improve the Usability of Severe Thunderstorm Wind Reports

IF 6.9 1区 地球科学 Q1 METEOROLOGY & ATMOSPHERIC SCIENCES Bulletin of the American Meteorological Society Pub Date : 2024-02-23 DOI:10.1175/bams-d-22-0268.1
Elizabeth Tirone, Subrata Pal, William A Gallus, Somak Dutta, Ranjan Maitra, Jennifer Newman, Eric Weber, Israel Jirak
{"title":"A Machine Learning Approach to Improve the Usability of Severe Thunderstorm Wind Reports","authors":"Elizabeth Tirone, Subrata Pal, William A Gallus, Somak Dutta, Ranjan Maitra, Jennifer Newman, Eric Weber, Israel Jirak","doi":"10.1175/bams-d-22-0268.1","DOIUrl":null,"url":null,"abstract":"Abstract Many concerns are known to exist with thunderstorm wind reports in the National Center for Environmental Information Storm Events Database, including the overestimation of wind speed, changes in report frequency due to population density, and differences in reporting due to damage tracers. These concerns are especially pronounced with reports that are not associated with a wind speed measurement, but are estimated, which make up almost 90% of the database. We have used machine learning to predict the probability that a severe wind report was caused by severe intensity wind, or wind ≥ 50 kt. A total of six machine learning models were trained on 11 years of measured thunderstorm wind reports, along with meteorological parameters, population density, and elevation. Objective skill metrics such as the area under the ROC curve (AUC), Brier score, and reliability curves suggest that the best performing model is the stacked generalized linear model, which has an AUC around 0.9 and a Brier score around 0.1. The outputs from these models have many potential uses such as forecast verification and quality control for implementation in forecast tools. Our tool was evaluated favorably at the Hazardous Weather Testbed Spring Forecasting Experiments in 2020, 2021, and 2022.","PeriodicalId":9464,"journal":{"name":"Bulletin of the American Meteorological Society","volume":null,"pages":null},"PeriodicalIF":6.9000,"publicationDate":"2024-02-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Bulletin of the American Meteorological Society","FirstCategoryId":"89","ListUrlMain":"https://doi.org/10.1175/bams-d-22-0268.1","RegionNum":1,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"METEOROLOGY & ATMOSPHERIC SCIENCES","Score":null,"Total":0}
引用次数: 0

Abstract

Abstract Many concerns are known to exist with thunderstorm wind reports in the National Center for Environmental Information Storm Events Database, including the overestimation of wind speed, changes in report frequency due to population density, and differences in reporting due to damage tracers. These concerns are especially pronounced with reports that are not associated with a wind speed measurement, but are estimated, which make up almost 90% of the database. We have used machine learning to predict the probability that a severe wind report was caused by severe intensity wind, or wind ≥ 50 kt. A total of six machine learning models were trained on 11 years of measured thunderstorm wind reports, along with meteorological parameters, population density, and elevation. Objective skill metrics such as the area under the ROC curve (AUC), Brier score, and reliability curves suggest that the best performing model is the stacked generalized linear model, which has an AUC around 0.9 and a Brier score around 0.1. The outputs from these models have many potential uses such as forecast verification and quality control for implementation in forecast tools. Our tool was evaluated favorably at the Hazardous Weather Testbed Spring Forecasting Experiments in 2020, 2021, and 2022.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
提高强雷暴大风报告可用性的机器学习方法
摘要 众所周知,国家环境信息中心风暴事件数据库中的雷暴风报告存在许多问题,包括高估风速、人口密度导致的报告频率变化以及损害追踪器导致的报告差异。这些问题在没有相关风速测量数据,而是估计风速的报告中尤为突出,而这些报告几乎占了数据库的 90%。我们使用机器学习来预测严重风灾报告由严重强度风或风力≥ 50 kt 引起的概率。我们在 11 年的雷雨大风实测报告以及气象参数、人口密度和海拔高度的基础上,共训练了六个机器学习模型。ROC 曲线下面积 (AUC)、Brier 分数和可靠性曲线等客观技能指标表明,性能最好的模型是堆叠广义线性模型,其 AUC 约为 0.9,Brier 分数约为 0.1。这些模型的输出结果有许多潜在用途,如预测验证和质量控制,以便在预测工具中实施。我们的工具在 2020 年、2021 年和 2022 年的危险天气试验台春季预报实验中获得了良好的评价。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
CiteScore
9.80
自引率
6.20%
发文量
231
审稿时长
6-12 weeks
期刊介绍: The Bulletin of the American Meteorological Society (BAMS) is the flagship magazine of AMS and publishes articles of interest and significance for the weather, water, and climate community as well as news, editorials, and reviews for AMS members.
期刊最新文献
Can expert prediction markets forecast climate-related risks? Bridging the COSMOS: How the inclusion of and collaboration with Faith-based Understandings and Indigenous Knowledges can transform the Weather, Water, and Climate Enterprise Peering into cloud physics using ultra-fine resolution radar and lidar systems Advancing Marine Arctic Science Through Facilitating International Collaborations Quasi-linear convective systems and derechoes across Europe: climatology, accompanying hazards and societal impacts
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1