Cho-Chun Chiu, Tuan Nguyen, Ting He, Shiqiang Wang, Beom-Su Kim, Ki-Il Kim
{"title":"基于 WBAN 的健康监测的主动学习","authors":"Cho-Chun Chiu, Tuan Nguyen, Ting He, Shiqiang Wang, Beom-Su Kim, Ki-Il Kim","doi":"arxiv-2408.02849","DOIUrl":null,"url":null,"abstract":"We consider a novel active learning problem motivated by the need of learning\nmachine learning models for health monitoring in wireless body area network\n(WBAN). Due to the limited resources at body sensors, collecting each unlabeled\nsample in WBAN incurs a nontrivial cost. Moreover, training health monitoring\nmodels typically requires labels indicating the patient's health state that\nneed to be generated by healthcare professionals, which cannot be obtained at\nthe same pace as data collection. These challenges make our problem\nfundamentally different from classical active learning, where unlabeled samples\nare free and labels can be queried in real time. To handle these challenges, we\npropose a two-phased active learning method, consisting of an online phase\nwhere a coreset construction algorithm is proposed to select a subset of\nunlabeled samples based on their noisy predictions, and an offline phase where\nthe selected samples are labeled to train the target model. The samples\nselected by our algorithm are proved to yield a guaranteed error in\napproximating the full dataset in evaluating the loss function. Our evaluation\nbased on real health monitoring data and our own experimentation demonstrates\nthat our solution can drastically save the data curation cost without\nsacrificing the quality of the target model.","PeriodicalId":501280,"journal":{"name":"arXiv - CS - Networking and Internet Architecture","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2024-08-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Active Learning for WBAN-based Health Monitoring\",\"authors\":\"Cho-Chun Chiu, Tuan Nguyen, Ting He, Shiqiang Wang, Beom-Su Kim, Ki-Il Kim\",\"doi\":\"arxiv-2408.02849\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"We consider a novel active learning problem motivated by the need of learning\\nmachine learning models for health monitoring in wireless body area network\\n(WBAN). Due to the limited resources at body sensors, collecting each unlabeled\\nsample in WBAN incurs a nontrivial cost. Moreover, training health monitoring\\nmodels typically requires labels indicating the patient's health state that\\nneed to be generated by healthcare professionals, which cannot be obtained at\\nthe same pace as data collection. These challenges make our problem\\nfundamentally different from classical active learning, where unlabeled samples\\nare free and labels can be queried in real time. To handle these challenges, we\\npropose a two-phased active learning method, consisting of an online phase\\nwhere a coreset construction algorithm is proposed to select a subset of\\nunlabeled samples based on their noisy predictions, and an offline phase where\\nthe selected samples are labeled to train the target model. The samples\\nselected by our algorithm are proved to yield a guaranteed error in\\napproximating the full dataset in evaluating the loss function. Our evaluation\\nbased on real health monitoring data and our own experimentation demonstrates\\nthat our solution can drastically save the data curation cost without\\nsacrificing the quality of the target model.\",\"PeriodicalId\":501280,\"journal\":{\"name\":\"arXiv - CS - Networking and Internet Architecture\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-08-05\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"arXiv - CS - Networking and Internet Architecture\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/arxiv-2408.02849\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Networking and Internet Architecture","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2408.02849","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
We consider a novel active learning problem motivated by the need of learning
machine learning models for health monitoring in wireless body area network
(WBAN). Due to the limited resources at body sensors, collecting each unlabeled
sample in WBAN incurs a nontrivial cost. Moreover, training health monitoring
models typically requires labels indicating the patient's health state that
need to be generated by healthcare professionals, which cannot be obtained at
the same pace as data collection. These challenges make our problem
fundamentally different from classical active learning, where unlabeled samples
are free and labels can be queried in real time. To handle these challenges, we
propose a two-phased active learning method, consisting of an online phase
where a coreset construction algorithm is proposed to select a subset of
unlabeled samples based on their noisy predictions, and an offline phase where
the selected samples are labeled to train the target model. The samples
selected by our algorithm are proved to yield a guaranteed error in
approximating the full dataset in evaluating the loss function. Our evaluation
based on real health monitoring data and our own experimentation demonstrates
that our solution can drastically save the data curation cost without
sacrificing the quality of the target model.