S. Gentili , G.D. Chiappetta , G. Petrillo , P. Brondi , J. Zhuang
{"title":"Forecasting strong subsequent earthquakes in Japan using an improved version of NESTORE machine learning algorithm","authors":"S. Gentili , G.D. Chiappetta , G. Petrillo , P. Brondi , J. Zhuang","doi":"10.1016/j.gsf.2025.102016","DOIUrl":null,"url":null,"abstract":"<div><div>In this study, the advanced machine learning algorithm NESTORE (Next STrOng Related Earthquake) was applied to the Japan Meteorological Agency catalog (1973–2024). It calculates the probability that the aftershocks will reach or exceed a magnitude equal to the magnitude of the mainshock minus one and classifies the clusters as type A or type B, depending on whether this condition is met or not. It has been shown useful in the tests in Italy, western Slovenia, Greece, and California. Due to Japan’s high and complex seismic activity, new algorithms were developed to complement NESTORE: a hybrid cluster identification method, which uses both ETAS-based stochastic declustering and deterministic graph-based selection, and REPENESE (RElevant features, class imbalance PErcentage, NEighbour detection, SElection), an algorithm for detecting outliers in skewed class distributions, which takes in account if one class has a larger number of samples with respect to the other (class imbalance).</div><div>Trained with data from 1973 to 2004 (7 type A and 43 type B clusters) and tested from 2005 to 2023 (4 type A and 27 type B clusters), the method correctly forecasted 75% of A clusters and 96% of B clusters, achieving a precision of 0.75 and an accuracy of 0.94 six hours after the mainshock. It accurately classified the 2011 Tōhoku event cluster. Near-real-time forecasting was applied to the sequence after the April 17, 2024 M6.6 earthquake in Shikoku, correctly classifying it as a “Type B cluster”. These results highlight the potential for the forecasting of strong aftershocks in regions with high seismicity and class imbalance, as evidenced by the high recall, precision and accuracy values achieved in the test phase.</div></div>","PeriodicalId":12711,"journal":{"name":"Geoscience frontiers","volume":"16 3","pages":"Article 102016"},"PeriodicalIF":8.5000,"publicationDate":"2025-02-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Geoscience frontiers","FirstCategoryId":"89","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1674987125000167","RegionNum":1,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"GEOSCIENCES, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0
摘要
本研究将先进的机器学习算法 NESTORE(Next STrOng Related Earthquake)应用于日本气象厅的地震目录(1973-2024 年)。该算法计算余震达到或超过等于主震震级减一的概率,并根据是否满足这一条件将震群划分为 A 类或 B 类。在意大利、斯洛文尼亚西部、希腊和加利福尼亚州进行的测试表明,该方法非常有用。由于日本的地震活动频繁而复杂,我们开发了新的算法来补充 NESTORE:一种混合聚类识别方法,同时使用基于 ETAS 的随机去聚类和基于确定性图的选择;以及 REPENESE(RElevant features, class imbalance PErcentage, NEighbour detection, SElection),一种在偏斜类分布中检测异常值的算法,它考虑了一个类相对于另一个类是否有更多的样本(类不平衡)。该方法使用 1973 年至 2004 年的数据(7 个 A 类集群和 43 个 B 类集群)进行了训练,并对 2005 年至 2023 年的数据(4 个 A 类集群和 27 个 B 类集群)进行了测试,结果正确预测了 75% 的 A 类集群和 96% 的 B 类集群,在主震发生六小时后达到了 0.75 的精度和 0.94 的准确度。该方法准确地对 2011 年东北事件群进行了分类。对 2024 年 4 月 17 日四国 M6.6 级地震后的序列进行了近实时预报,并正确地将其归类为 "B 类地震群"。这些结果凸显了在地震频发和等级不平衡地区预报强余震的潜力,测试阶段取得的高召回率、高精确度和高准确度值证明了这一点。
Forecasting strong subsequent earthquakes in Japan using an improved version of NESTORE machine learning algorithm
In this study, the advanced machine learning algorithm NESTORE (Next STrOng Related Earthquake) was applied to the Japan Meteorological Agency catalog (1973–2024). It calculates the probability that the aftershocks will reach or exceed a magnitude equal to the magnitude of the mainshock minus one and classifies the clusters as type A or type B, depending on whether this condition is met or not. It has been shown useful in the tests in Italy, western Slovenia, Greece, and California. Due to Japan’s high and complex seismic activity, new algorithms were developed to complement NESTORE: a hybrid cluster identification method, which uses both ETAS-based stochastic declustering and deterministic graph-based selection, and REPENESE (RElevant features, class imbalance PErcentage, NEighbour detection, SElection), an algorithm for detecting outliers in skewed class distributions, which takes in account if one class has a larger number of samples with respect to the other (class imbalance).
Trained with data from 1973 to 2004 (7 type A and 43 type B clusters) and tested from 2005 to 2023 (4 type A and 27 type B clusters), the method correctly forecasted 75% of A clusters and 96% of B clusters, achieving a precision of 0.75 and an accuracy of 0.94 six hours after the mainshock. It accurately classified the 2011 Tōhoku event cluster. Near-real-time forecasting was applied to the sequence after the April 17, 2024 M6.6 earthquake in Shikoku, correctly classifying it as a “Type B cluster”. These results highlight the potential for the forecasting of strong aftershocks in regions with high seismicity and class imbalance, as evidenced by the high recall, precision and accuracy values achieved in the test phase.
Geoscience frontiersEarth and Planetary Sciences-General Earth and Planetary Sciences
CiteScore
17.80
自引率
3.40%
发文量
147
审稿时长
35 days
期刊介绍:
Geoscience Frontiers (GSF) is the Journal of China University of Geosciences (Beijing) and Peking University. It publishes peer-reviewed research articles and reviews in interdisciplinary fields of Earth and Planetary Sciences. GSF covers various research areas including petrology and geochemistry, lithospheric architecture and mantle dynamics, global tectonics, economic geology and fuel exploration, geophysics, stratigraphy and paleontology, environmental and engineering geology, astrogeology, and the nexus of resources-energy-emissions-climate under Sustainable Development Goals. The journal aims to bridge innovative, provocative, and challenging concepts and models in these fields, providing insights on correlations and evolution.