{"title":"Analysis and Research on the Use Situation of Public Bicycles Based on Spark Machine Learning","authors":"Chengang Li, Yu Liu, Chengcheng Li","doi":"10.1145/3335656.3335704","DOIUrl":null,"url":null,"abstract":"Public bicycles are a healthy and environmentally friendly means of transportation that facilitates people's travel. However, due to the uncertainty of urban travel, especially the tidal phenomenon, public bicycles often \"difficult to borrow a car\" and \"return the car\". This will result in unreasonable distribution of the site during the operation of the public bicycle system, unbalanced bicycle processes at various sites during peak hours, and unbalanced operation and management, which restricts the development of public bicycles. This paper uses the data of the San Francisco Bay Area as the experimental data of this paper, using Spark SQL and Spark Dataframe to analyze the use of public bicycle users and sites, according to the impact of different user types on the use of public bicycles, using K-means clustering algorithm Analyze the use of the site. Based on the Spark MLlib machine learning library, the gradient usage algorithm is used to predict daily usage.","PeriodicalId":396772,"journal":{"name":"Proceedings of the 2019 International Conference on Data Mining and Machine Learning","volume":"6 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-04-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2019 International Conference on Data Mining and Machine Learning","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3335656.3335704","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
Public bicycles are a healthy and environmentally friendly means of transportation that facilitates people's travel. However, due to the uncertainty of urban travel, especially the tidal phenomenon, public bicycles often "difficult to borrow a car" and "return the car". This will result in unreasonable distribution of the site during the operation of the public bicycle system, unbalanced bicycle processes at various sites during peak hours, and unbalanced operation and management, which restricts the development of public bicycles. This paper uses the data of the San Francisco Bay Area as the experimental data of this paper, using Spark SQL and Spark Dataframe to analyze the use of public bicycle users and sites, according to the impact of different user types on the use of public bicycles, using K-means clustering algorithm Analyze the use of the site. Based on the Spark MLlib machine learning library, the gradient usage algorithm is used to predict daily usage.