{"title":"从真实网站和二手数据源收集天气数据用于降雨预测","authors":"Deepak Sharma, Dr. Priti Sharma","doi":"10.54105/ijdm.b1626.113223","DOIUrl":null,"url":null,"abstract":"The field of data mining and machine learning has been grown many folds from the last two decades. Almost every other problem can be solved using data mining and this becomes the most tempting part of it for the scientist and researchers all over the world. Data mining can be viewed as a process of discovering knowledge. This discovery of knowledge starts with the collection of data and ends with the acquired knowledge in the form of patterns. Data collection lays the foundation for the process of knowledge discovery. In this paper, various secondary data sources from where data can be collected for rainfall prediction are deeply studied and analyzed. Some of these authentic websites and secondary data sources are NCDC (National climate data center), Kaggle, Datahub.io, UCI machine learning repository, Earth Data etc. The data collected from these secondary data sources for rainfall prediction have been critically analyzed and compared on the parameters of Accuracy, Completeness, reliability, relevance, and timeliness.","PeriodicalId":375116,"journal":{"name":"Indian Journal of Data Mining","volume":"65 12","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2023-11-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Collection of Weather Data from Authentic Websites and Secondary Data Sources for Rainfall Prediction\",\"authors\":\"Deepak Sharma, Dr. Priti Sharma\",\"doi\":\"10.54105/ijdm.b1626.113223\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The field of data mining and machine learning has been grown many folds from the last two decades. Almost every other problem can be solved using data mining and this becomes the most tempting part of it for the scientist and researchers all over the world. Data mining can be viewed as a process of discovering knowledge. This discovery of knowledge starts with the collection of data and ends with the acquired knowledge in the form of patterns. Data collection lays the foundation for the process of knowledge discovery. In this paper, various secondary data sources from where data can be collected for rainfall prediction are deeply studied and analyzed. Some of these authentic websites and secondary data sources are NCDC (National climate data center), Kaggle, Datahub.io, UCI machine learning repository, Earth Data etc. The data collected from these secondary data sources for rainfall prediction have been critically analyzed and compared on the parameters of Accuracy, Completeness, reliability, relevance, and timeliness.\",\"PeriodicalId\":375116,\"journal\":{\"name\":\"Indian Journal of Data Mining\",\"volume\":\"65 12\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-11-30\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Indian Journal of Data Mining\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.54105/ijdm.b1626.113223\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Indian Journal of Data Mining","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.54105/ijdm.b1626.113223","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
摘要
过去二十年来,数据挖掘和机器学习领域成倍增长。几乎所有其他问题都可以通过数据挖掘来解决,这也成为全世界科学家和研究人员最感兴趣的部分。数据挖掘可以看作是一个发现知识的过程。这种知识发现始于数据收集,终于以模式的形式获得知识。数据收集为知识发现过程奠定了基础。本文深入研究和分析了可用于收集降雨预测数据的各种二手数据源。其中一些真实的网站和二手数据源包括 NCDC(国家气候数据中心)、Kaggle、Datahub.io、UCI 机器学习资源库、Earth Data 等。从这些二手数据源收集到的降雨预测数据在准确性、完整性、可靠性、相关性和及时性等参数上进行了严格的分析和比较。
Collection of Weather Data from Authentic Websites and Secondary Data Sources for Rainfall Prediction
The field of data mining and machine learning has been grown many folds from the last two decades. Almost every other problem can be solved using data mining and this becomes the most tempting part of it for the scientist and researchers all over the world. Data mining can be viewed as a process of discovering knowledge. This discovery of knowledge starts with the collection of data and ends with the acquired knowledge in the form of patterns. Data collection lays the foundation for the process of knowledge discovery. In this paper, various secondary data sources from where data can be collected for rainfall prediction are deeply studied and analyzed. Some of these authentic websites and secondary data sources are NCDC (National climate data center), Kaggle, Datahub.io, UCI machine learning repository, Earth Data etc. The data collected from these secondary data sources for rainfall prediction have been critically analyzed and compared on the parameters of Accuracy, Completeness, reliability, relevance, and timeliness.