{"title":"利用统计技术分析多源温度数据集","authors":"Vishal Sharma, Sanjay Kumar Ghosh","doi":"10.1007/s12647-024-00756-z","DOIUrl":null,"url":null,"abstract":"<div><p>Temperature is considered as one of the important weather parameters so a qualified study for multi-source dataset is carried out by inspecting different parameters. The maximum temperature (<i>T</i><sub><i>max</i></sub>) and minimum temperature (<i>T</i><sub><i>min</i></sub>) for the Haridwar district are taken from two distinct sources, including data from the Indian Meteorology Department (IMD) and the National Aeronautics and Space Administration (NASA). Initially, NASA Power Larc and IMD datasets are compared using Standard Anomaly graphical representation. Afterwards, different indices were evaluated for Maximum and Minimum Temperature such as Bias, mean absolute error, mean square error, root mean square error. Further, correlation coefficient analysis & Wilcoxon–Mann–Whiney test is carried out to test the equivalence of both datasets. The outcome shows that the yearly standard anomalies of the Power Larc data and the annual standard anomalies of the IMD data both follow the same trend. According to the various indices assessed, MAE, MSE, and RMSE all remain in one standard deviation of the data being observed. Moreover, Power Larc data is well correlated and shows equivalence with IMD dataset. It is observed that both datasets show very strong (0.87) to weak correlation (0.37) for minimum temperature on seasonal scale and strong correlation (0.71) on annual scale. For maximum temperature both datasets show a correlation range of 0.41–0.83 on seasonal scale and weak (0.37) correlation on annual scale. From the correlation values it is observed that both datasets are identical on monthly and seasonal scale. Therefore, it may be concluded that Power Larc dataset is reliable dataset and may be used in place of IMD dataset.</p></div>","PeriodicalId":689,"journal":{"name":"MAPAN","volume":"39 4","pages":"799 - 813"},"PeriodicalIF":1.0000,"publicationDate":"2024-06-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"An Analysis of Multi-Source Temperature Datasets using Statistical Techniques\",\"authors\":\"Vishal Sharma, Sanjay Kumar Ghosh\",\"doi\":\"10.1007/s12647-024-00756-z\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>Temperature is considered as one of the important weather parameters so a qualified study for multi-source dataset is carried out by inspecting different parameters. The maximum temperature (<i>T</i><sub><i>max</i></sub>) and minimum temperature (<i>T</i><sub><i>min</i></sub>) for the Haridwar district are taken from two distinct sources, including data from the Indian Meteorology Department (IMD) and the National Aeronautics and Space Administration (NASA). Initially, NASA Power Larc and IMD datasets are compared using Standard Anomaly graphical representation. Afterwards, different indices were evaluated for Maximum and Minimum Temperature such as Bias, mean absolute error, mean square error, root mean square error. Further, correlation coefficient analysis & Wilcoxon–Mann–Whiney test is carried out to test the equivalence of both datasets. The outcome shows that the yearly standard anomalies of the Power Larc data and the annual standard anomalies of the IMD data both follow the same trend. According to the various indices assessed, MAE, MSE, and RMSE all remain in one standard deviation of the data being observed. Moreover, Power Larc data is well correlated and shows equivalence with IMD dataset. It is observed that both datasets show very strong (0.87) to weak correlation (0.37) for minimum temperature on seasonal scale and strong correlation (0.71) on annual scale. For maximum temperature both datasets show a correlation range of 0.41–0.83 on seasonal scale and weak (0.37) correlation on annual scale. From the correlation values it is observed that both datasets are identical on monthly and seasonal scale. Therefore, it may be concluded that Power Larc dataset is reliable dataset and may be used in place of IMD dataset.</p></div>\",\"PeriodicalId\":689,\"journal\":{\"name\":\"MAPAN\",\"volume\":\"39 4\",\"pages\":\"799 - 813\"},\"PeriodicalIF\":1.0000,\"publicationDate\":\"2024-06-07\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"MAPAN\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://link.springer.com/article/10.1007/s12647-024-00756-z\",\"RegionNum\":4,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q4\",\"JCRName\":\"INSTRUMENTS & INSTRUMENTATION\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"MAPAN","FirstCategoryId":"5","ListUrlMain":"https://link.springer.com/article/10.1007/s12647-024-00756-z","RegionNum":4,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"INSTRUMENTS & INSTRUMENTATION","Score":null,"Total":0}
引用次数: 0
摘要
温度被认为是重要的天气参数之一,因此我们通过检测不同的参数,对多来源数据集进行了合格的研究。哈里德瓦尔地区的最高气温(Tmax)和最低气温(Tmin)来自两个不同的来源,包括印度气象局(IMD)和美国国家航空航天局(NASA)的数据。首先,使用标准异常图形对 NASA Power Larc 数据集和 IMD 数据集进行比较。然后,对最高气温和最低气温的不同指数进行评估,如偏差、平均绝对误差、均方误差、均方根误差。此外,还进行了相关系数分析 & Wilcoxon-Mann-Whiney 检验,以测试两个数据集的等效性。结果表明,Power Larc 数据的年度标准异常值和 IMD 数据的年度标准异常值都遵循相同的趋势。根据各种评估指标,MAE、MSE 和 RMSE 都保持在所观测数据的一个标准差之内。此外,Power Larc 数据与 IMD 数据集具有良好的相关性和等效性。据观察,这两个数据集在季节尺度上的最低气温相关性从强(0.87)到弱(0.37)不等,在年度尺度上的相关性很强(0.71)。在最高气温方面,两个数据集在季节尺度上的相关性范围为 0.41-0.83,在年度尺度上的相关性范围较弱(0.37)。从相关值可以看出,两个数据集在月尺度和季节尺度上是相同的。因此,可以断定 Power Larc 数据集是可靠的数据集,可以用来替代 IMD 数据集。
An Analysis of Multi-Source Temperature Datasets using Statistical Techniques
Temperature is considered as one of the important weather parameters so a qualified study for multi-source dataset is carried out by inspecting different parameters. The maximum temperature (Tmax) and minimum temperature (Tmin) for the Haridwar district are taken from two distinct sources, including data from the Indian Meteorology Department (IMD) and the National Aeronautics and Space Administration (NASA). Initially, NASA Power Larc and IMD datasets are compared using Standard Anomaly graphical representation. Afterwards, different indices were evaluated for Maximum and Minimum Temperature such as Bias, mean absolute error, mean square error, root mean square error. Further, correlation coefficient analysis & Wilcoxon–Mann–Whiney test is carried out to test the equivalence of both datasets. The outcome shows that the yearly standard anomalies of the Power Larc data and the annual standard anomalies of the IMD data both follow the same trend. According to the various indices assessed, MAE, MSE, and RMSE all remain in one standard deviation of the data being observed. Moreover, Power Larc data is well correlated and shows equivalence with IMD dataset. It is observed that both datasets show very strong (0.87) to weak correlation (0.37) for minimum temperature on seasonal scale and strong correlation (0.71) on annual scale. For maximum temperature both datasets show a correlation range of 0.41–0.83 on seasonal scale and weak (0.37) correlation on annual scale. From the correlation values it is observed that both datasets are identical on monthly and seasonal scale. Therefore, it may be concluded that Power Larc dataset is reliable dataset and may be used in place of IMD dataset.
期刊介绍:
MAPAN-Journal Metrology Society of India is a quarterly publication. It is exclusively devoted to Metrology (Scientific, Industrial or Legal). It has been fulfilling an important need of Metrologists and particularly of quality practitioners by publishing exclusive articles on scientific, industrial and legal metrology.
The journal publishes research communication or technical articles of current interest in measurement science; original work, tutorial or survey papers in any metrology related area; reviews and analytical studies in metrology; case studies on reliability, uncertainty in measurements; and reports and results of intercomparison and proficiency testing.