{"title":"基于“可解释”机器学习模型的蓝领工人出行行为建模——以卡塔尔为例","authors":"A. AlKhereibi, A. Abuzaid, T. Wakjira","doi":"10.29117/quarfe.2021.0198","DOIUrl":null,"url":null,"abstract":"This paper presents a novel study on the examination of explainable machine learning (ML) technique to predict the mode choice for communities with a majority of blue-collared workers. A total of 4875 trip records for 1050 blue-collared workers have been used to predict their travel mode choices based on 11 trips and socio-economic attributes. The data used in this paper are obtained from the Ministry of Transportation and Communication (MoTC), which targeted blue-collared workers as they represent 89% of the total population in the State of Qatar. A total of four ML models are evaluated to propose the best predictive model. The four models were examined using different performance metrics. The models’ prediction results showed that the random forest (RF) model had the highest accuracy with a predictive accuracy of 0.97. Moreover, SHapley Additive exPlanation (SHAP) approach is used to investigate the significance of the input features and explain the output of the RF model. The results of SHAP analysis revealed that occupation level is the most significant feature that influences the mode choice followed by occupation section, arrival time, and arrival municipality.","PeriodicalId":9295,"journal":{"name":"Building Resilience at Universities: Role of Innovation and Entrepreneurship","volume":"43 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2021-10-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Blue-collared Workers’ Travel Behavior Modeling using “exPlainable” Machine Learning Model: The Case of Qatar\",\"authors\":\"A. AlKhereibi, A. Abuzaid, T. Wakjira\",\"doi\":\"10.29117/quarfe.2021.0198\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper presents a novel study on the examination of explainable machine learning (ML) technique to predict the mode choice for communities with a majority of blue-collared workers. A total of 4875 trip records for 1050 blue-collared workers have been used to predict their travel mode choices based on 11 trips and socio-economic attributes. The data used in this paper are obtained from the Ministry of Transportation and Communication (MoTC), which targeted blue-collared workers as they represent 89% of the total population in the State of Qatar. A total of four ML models are evaluated to propose the best predictive model. The four models were examined using different performance metrics. The models’ prediction results showed that the random forest (RF) model had the highest accuracy with a predictive accuracy of 0.97. Moreover, SHapley Additive exPlanation (SHAP) approach is used to investigate the significance of the input features and explain the output of the RF model. The results of SHAP analysis revealed that occupation level is the most significant feature that influences the mode choice followed by occupation section, arrival time, and arrival municipality.\",\"PeriodicalId\":9295,\"journal\":{\"name\":\"Building Resilience at Universities: Role of Innovation and Entrepreneurship\",\"volume\":\"43 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-10-20\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Building Resilience at Universities: Role of Innovation and Entrepreneurship\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.29117/quarfe.2021.0198\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Building Resilience at Universities: Role of Innovation and Entrepreneurship","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.29117/quarfe.2021.0198","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Blue-collared Workers’ Travel Behavior Modeling using “exPlainable” Machine Learning Model: The Case of Qatar
This paper presents a novel study on the examination of explainable machine learning (ML) technique to predict the mode choice for communities with a majority of blue-collared workers. A total of 4875 trip records for 1050 blue-collared workers have been used to predict their travel mode choices based on 11 trips and socio-economic attributes. The data used in this paper are obtained from the Ministry of Transportation and Communication (MoTC), which targeted blue-collared workers as they represent 89% of the total population in the State of Qatar. A total of four ML models are evaluated to propose the best predictive model. The four models were examined using different performance metrics. The models’ prediction results showed that the random forest (RF) model had the highest accuracy with a predictive accuracy of 0.97. Moreover, SHapley Additive exPlanation (SHAP) approach is used to investigate the significance of the input features and explain the output of the RF model. The results of SHAP analysis revealed that occupation level is the most significant feature that influences the mode choice followed by occupation section, arrival time, and arrival municipality.