{"title":"The mass data mining research based on the information platform of Internet of Things","authors":"Juan Li, Xuan Luo, Fengqi Hao","doi":"10.1109/ICAIT.2017.8388923","DOIUrl":null,"url":null,"abstract":"This paper analyzes the key technologies of the distributed data, and proposes the solution of the mass data processing based on the Information platform of Internet of Things. The solution uses Hadoop as the open source framework to realize the distributed computing system, and uses Mahout as the data mining algorithm library to realize the parallelization of k-means clustering algorithm. This will achieve the high efficiency and the large expansibility through the mass data processing. The mass data source used in this project is from the intelligent agricultural Information service platform. The system takes the deployment test on the mass sensing information, and it optimizes and parallelizes the K-means algorithm to realize the mass data processing based on the Information platform of Internet of Things. It can make the statistical analysis, and provide fine management and other services. the algorithm is used to improve the efficiency and the accuracy of the platform with the supercomputing and the reliable storage ability.","PeriodicalId":376884,"journal":{"name":"2017 9th International Conference on Advanced Infocomm Technology (ICAIT)","volume":"22 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 9th International Conference on Advanced Infocomm Technology (ICAIT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICAIT.2017.8388923","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
This paper analyzes the key technologies of the distributed data, and proposes the solution of the mass data processing based on the Information platform of Internet of Things. The solution uses Hadoop as the open source framework to realize the distributed computing system, and uses Mahout as the data mining algorithm library to realize the parallelization of k-means clustering algorithm. This will achieve the high efficiency and the large expansibility through the mass data processing. The mass data source used in this project is from the intelligent agricultural Information service platform. The system takes the deployment test on the mass sensing information, and it optimizes and parallelizes the K-means algorithm to realize the mass data processing based on the Information platform of Internet of Things. It can make the statistical analysis, and provide fine management and other services. the algorithm is used to improve the efficiency and the accuracy of the platform with the supercomputing and the reliable storage ability.