基于“可解释”机器学习模型的蓝领工人出行行为建模——以卡塔尔为例

A. AlKhereibi, A. Abuzaid, T. Wakjira
{"title":"基于“可解释”机器学习模型的蓝领工人出行行为建模——以卡塔尔为例","authors":"A. AlKhereibi, A. Abuzaid, T. Wakjira","doi":"10.29117/quarfe.2021.0198","DOIUrl":null,"url":null,"abstract":"This paper presents a novel study on the examination of explainable machine learning (ML) technique to predict the mode choice for communities with a majority of blue-collared workers. A total of 4875 trip records for 1050 blue-collared workers have been used to predict their travel mode choices based on 11 trips and socio-economic attributes. The data used in this paper are obtained from the Ministry of Transportation and Communication (MoTC), which targeted blue-collared workers as they represent 89% of the total population in the State of Qatar. A total of four ML models are evaluated to propose the best predictive model. The four models were examined using different performance metrics. The models’ prediction results showed that the random forest (RF) model had the highest accuracy with a predictive accuracy of 0.97. Moreover, SHapley Additive exPlanation (SHAP) approach is used to investigate the significance of the input features and explain the output of the RF model. The results of SHAP analysis revealed that occupation level is the most significant feature that influences the mode choice followed by occupation section, arrival time, and arrival municipality.","PeriodicalId":9295,"journal":{"name":"Building Resilience at Universities: Role of Innovation and Entrepreneurship","volume":"43 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2021-10-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Blue-collared Workers’ Travel Behavior Modeling using “exPlainable” Machine Learning Model: The Case of Qatar\",\"authors\":\"A. AlKhereibi, A. Abuzaid, T. Wakjira\",\"doi\":\"10.29117/quarfe.2021.0198\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper presents a novel study on the examination of explainable machine learning (ML) technique to predict the mode choice for communities with a majority of blue-collared workers. A total of 4875 trip records for 1050 blue-collared workers have been used to predict their travel mode choices based on 11 trips and socio-economic attributes. The data used in this paper are obtained from the Ministry of Transportation and Communication (MoTC), which targeted blue-collared workers as they represent 89% of the total population in the State of Qatar. A total of four ML models are evaluated to propose the best predictive model. The four models were examined using different performance metrics. The models’ prediction results showed that the random forest (RF) model had the highest accuracy with a predictive accuracy of 0.97. Moreover, SHapley Additive exPlanation (SHAP) approach is used to investigate the significance of the input features and explain the output of the RF model. The results of SHAP analysis revealed that occupation level is the most significant feature that influences the mode choice followed by occupation section, arrival time, and arrival municipality.\",\"PeriodicalId\":9295,\"journal\":{\"name\":\"Building Resilience at Universities: Role of Innovation and Entrepreneurship\",\"volume\":\"43 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-10-20\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Building Resilience at Universities: Role of Innovation and Entrepreneurship\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.29117/quarfe.2021.0198\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Building Resilience at Universities: Role of Innovation and Entrepreneurship","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.29117/quarfe.2021.0198","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

本文提出了一项关于检验可解释机器学习(ML)技术的新研究,以预测拥有大多数蓝领工人的社区的模式选择。基于11次出行和社会经济属性,1050名蓝领工人共4875次出行记录被用来预测他们的出行方式选择。本文中使用的数据来自交通运输部(MoTC),其目标是蓝领工人,因为他们占卡塔尔总人口的89%。总共评估了四个ML模型,以提出最佳的预测模型。使用不同的性能指标检查了这四种模型。模型预测结果表明,随机森林(RF)模型的预测精度最高,为0.97。此外,使用SHapley加性解释(SHAP)方法来研究输入特征的重要性并解释RF模型的输出。SHAP分析结果表明,职业水平是影响模式选择的最显著特征,其次是职业区域、到达时间和到达城市。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Blue-collared Workers’ Travel Behavior Modeling using “exPlainable” Machine Learning Model: The Case of Qatar
This paper presents a novel study on the examination of explainable machine learning (ML) technique to predict the mode choice for communities with a majority of blue-collared workers. A total of 4875 trip records for 1050 blue-collared workers have been used to predict their travel mode choices based on 11 trips and socio-economic attributes. The data used in this paper are obtained from the Ministry of Transportation and Communication (MoTC), which targeted blue-collared workers as they represent 89% of the total population in the State of Qatar. A total of four ML models are evaluated to propose the best predictive model. The four models were examined using different performance metrics. The models’ prediction results showed that the random forest (RF) model had the highest accuracy with a predictive accuracy of 0.97. Moreover, SHapley Additive exPlanation (SHAP) approach is used to investigate the significance of the input features and explain the output of the RF model. The results of SHAP analysis revealed that occupation level is the most significant feature that influences the mode choice followed by occupation section, arrival time, and arrival municipality.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Investigating the Concomitant Removal of Hydrocarbons and Heavy Metals by highly adapted Bacillus and Pseudomonas strains Exploring QU Health Students' Experiences of Burnout, Anxiety, and Empathy during the COVID-19 Pandemic: A Mixed Method Study Dietary Patterns and Risk of Inflammatory Bowel Disease: Findings from a Case-Control Study Understanding COVID-19-related Burnout in Qatar’s Community Pharmacists using the Job Demands-Resources Theory Experimental Investigations of Gas Kick for Single and Two-Phase Gas-liquid Flow in near Horizontal Wells
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1