Application of machine learning to predict transport modes from GPS, accelerometer, and heart rate data.

IF 3 2区 医学 Q2 PUBLIC, ENVIRONMENTAL & OCCUPATIONAL HEALTH International Journal of Health Geographics Pub Date : 2022-11-16 DOI:10.1186/s12942-022-00319-y
Santosh Giri, Ruben Brondeel, Tarik El Aarbaoui, Basile Chaix
{"title":"Application of machine learning to predict transport modes from GPS, accelerometer, and heart rate data.","authors":"Santosh Giri,&nbsp;Ruben Brondeel,&nbsp;Tarik El Aarbaoui,&nbsp;Basile Chaix","doi":"10.1186/s12942-022-00319-y","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>There has been an increased focus on active transport, but the measurement of active transport is still difficult and error-prone. Sensor data have been used to predict active transport. While heart rate data have very rarely been considered before, this study used random forests (RF) to predict transport modes using Global Positioning System (GPS), accelerometer, and heart rate data and paid attention to methodological issues related to the prediction strategy and post-processing.</p><p><strong>Methods: </strong>The RECORD MultiSensor study collected GPS, accelerometer, and heart rate data over seven days from 126 participants living in the Ile-de-France region. RF models were built to predict transport modes for every minute (ground truth information on modes is from a GPS-based mobility survey), splitting observations between a Training dataset and a Test dataset at the participant level instead at the minute level. Moreover, several window sizes were tested for the post-processing moving average of the predicted transport mode.</p><p><strong>Results: </strong>The minute-level prediction rate of being on trips vs. at a visited location was 90%. Final prediction rates of transport modes ranged from 65% for public transport to 95% for biking. Using minute-level observations from the same participants in the Training and Test sets (as RF spontaneously does) upwardly biases prediction rates. The inclusion of heart rate data improved prediction rates only for biking. A 3 to 5-min bandwidth moving average was optimum for a posteriori homogenization.</p><p><strong>Conclusion: </strong>Heart rate only very slightly contributed to better predictions for specific transport modes. Moreover, our study shows that Training and Test sets must be carefully defined in RF models and that post-processing with carefully chosen moving average windows can improve predictions.</p>","PeriodicalId":48739,"journal":{"name":"International Journal of Health Geographics","volume":"21 1","pages":"19"},"PeriodicalIF":3.0000,"publicationDate":"2022-11-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9667683/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Health Geographics","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1186/s12942-022-00319-y","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"PUBLIC, ENVIRONMENTAL & OCCUPATIONAL HEALTH","Score":null,"Total":0}
引用次数: 0

Abstract

Background: There has been an increased focus on active transport, but the measurement of active transport is still difficult and error-prone. Sensor data have been used to predict active transport. While heart rate data have very rarely been considered before, this study used random forests (RF) to predict transport modes using Global Positioning System (GPS), accelerometer, and heart rate data and paid attention to methodological issues related to the prediction strategy and post-processing.

Methods: The RECORD MultiSensor study collected GPS, accelerometer, and heart rate data over seven days from 126 participants living in the Ile-de-France region. RF models were built to predict transport modes for every minute (ground truth information on modes is from a GPS-based mobility survey), splitting observations between a Training dataset and a Test dataset at the participant level instead at the minute level. Moreover, several window sizes were tested for the post-processing moving average of the predicted transport mode.

Results: The minute-level prediction rate of being on trips vs. at a visited location was 90%. Final prediction rates of transport modes ranged from 65% for public transport to 95% for biking. Using minute-level observations from the same participants in the Training and Test sets (as RF spontaneously does) upwardly biases prediction rates. The inclusion of heart rate data improved prediction rates only for biking. A 3 to 5-min bandwidth moving average was optimum for a posteriori homogenization.

Conclusion: Heart rate only very slightly contributed to better predictions for specific transport modes. Moreover, our study shows that Training and Test sets must be carefully defined in RF models and that post-processing with carefully chosen moving average windows can improve predictions.

Abstract Image

Abstract Image

Abstract Image

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
应用机器学习从GPS、加速度计和心率数据预测运输模式。
背景:人们越来越关注主动输运,但主动输运的测量仍然很困难,而且容易出错。传感器数据已被用于预测主动运输。虽然之前很少考虑心率数据,但本研究使用随机森林(RF)来预测全球定位系统(GPS)、加速度计和心率数据的运输方式,并关注与预测策略和后处理相关的方法问题。方法:RECORD MultiSensor研究收集了居住在法兰西岛地区的126名参与者在7天内的GPS、加速度计和心率数据。RF模型的建立是为了预测每分钟的运输模式(模式的地面真实信息来自基于gps的移动调查),在参与者级别而不是在分钟级别将训练数据集和测试数据集之间的观察结果分开。此外,对预测运输模式的后处理移动平均进行了几种窗口大小的测试。结果:在旅行中与在访问地点的分钟级预测率为90%。交通方式的最终预测率从公共交通的65%到自行车的95%不等。在训练和测试集中使用同一参与者的分钟级观察(就像RF自发地做的那样),预测率向上偏置。心率数据的加入只提高了骑自行车的预测率。3至5分钟带宽移动平均是最佳的后验均匀化。结论:心率对特定运输方式的预测贡献很小。此外,我们的研究表明,在RF模型中必须仔细定义训练集和测试集,并且使用精心选择的移动平均窗口进行后处理可以改进预测。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
International Journal of Health Geographics
International Journal of Health Geographics PUBLIC, ENVIRONMENTAL & OCCUPATIONAL HEALTH -
CiteScore
10.20
自引率
2.00%
发文量
17
审稿时长
12 weeks
期刊介绍: A leader among the field, International Journal of Health Geographics is an interdisciplinary, open access journal publishing internationally significant studies of geospatial information systems and science applications in health and healthcare. With an exceptional author satisfaction rate and a quick time to first decision, the journal caters to readers across an array of healthcare disciplines globally. International Journal of Health Geographics welcomes novel studies in the health and healthcare context spanning from spatial data infrastructure and Web geospatial interoperability research, to research into real-time Geographic Information Systems (GIS)-enabled surveillance services, remote sensing applications, spatial epidemiology, spatio-temporal statistics, internet GIS and cyberspace mapping, participatory GIS and citizen sensing, geospatial big data, healthy smart cities and regions, and geospatial Internet of Things and blockchain.
期刊最新文献
Development of an approach to forecast future takeaway outlet growth around schools and population exposure to takeaways in England. Using spatial video and deep learning for automated mapping of ground-level context in relief camps. The influence of malaria control interventions and climate variability on changes in the geographical distribution of parasite prevalence in Kenya between 2015 and 2020. Understanding Ixodes ricinus occurrence in private yards: influence of yard and landscape features. Accessibility, neighborhood socioeconomic disadvantage and expenditures on electronic gambling machines: a spatial analysis based on player account data.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1