PulseSight : A novel method for contactless oxygen saturation (SpO2) monitoring using smartphone cameras, remote photoplethysmography and machine learning
Kazi Zawad Arefin , Kazi Shafiul Alam , Sayed Mashroor Mamun , Nafi Us Sabbir Sabith , Masud Rabbani , Parama Sridevi , Sheikh Iqbal Ahamed
{"title":"PulseSight : A novel method for contactless oxygen saturation (SpO2) monitoring using smartphone cameras, remote photoplethysmography and machine learning","authors":"Kazi Zawad Arefin , Kazi Shafiul Alam , Sayed Mashroor Mamun , Nafi Us Sabbir Sabith , Masud Rabbani , Parama Sridevi , Sheikh Iqbal Ahamed","doi":"10.1016/j.smhl.2025.100542","DOIUrl":null,"url":null,"abstract":"<div><div>Monitoring oxygen saturation (SpO<span><math><msub><mrow></mrow><mrow><mn>2</mn></mrow></msub></math></span>) level is crucial for evaluating the current cardiac and respiratory condition of a person, particularly in medical settings. Conventional pulse oximetry, while efficient, has drawbacks such as the requirement for physical touch and vulnerability to certain environmental influences. In this paper, we propose an innovative approach for estimating SpO<span><math><msub><mrow></mrow><mrow><mn>2</mn></mrow></msub></math></span> levels utilizing smartphone cameras and video-based photoplethysmography (PPG) without physical touch. Our framework consists of an Android mobile application that records 20-second face videos, which a cloud-based backend server then analyzes. The server utilizes deep learning-based facial recognition and signal processing techniques to extract remote photoplethysmography (rPPG) signals from specific facial regions and predict oxygen saturation (SpO<span><math><msub><mrow></mrow><mrow><mn>2</mn></mrow></msub></math></span>) levels using a Support Vector Regression (SVR) Machine learning model. Signal noise and motion artifacts are mitigated by extracting relevant characteristics from the rPPG. The system was validated by experimental studies, which contained 40 sets of videos collected from 10 participants. The study was conducted under different illumination conditions, which showed low RMSE score (1.45 ±0.1) and MAE score (0.92 ±0.01). Also, our system shows high usability, as indicated by the System Usability Scale (SUS) score of 80.5. The results demonstrate that our method offers a dependable and contactless substitute for continuous SpO2 monitoring, with potential uses in telemedicine and remote patient monitoring.</div></div>","PeriodicalId":37151,"journal":{"name":"Smart Health","volume":"36 ","pages":"Article 100542"},"PeriodicalIF":0.0000,"publicationDate":"2025-02-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Smart Health","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2352648325000030","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"Health Professions","Score":null,"Total":0}
引用次数: 0
Abstract
Monitoring oxygen saturation (SpO) level is crucial for evaluating the current cardiac and respiratory condition of a person, particularly in medical settings. Conventional pulse oximetry, while efficient, has drawbacks such as the requirement for physical touch and vulnerability to certain environmental influences. In this paper, we propose an innovative approach for estimating SpO levels utilizing smartphone cameras and video-based photoplethysmography (PPG) without physical touch. Our framework consists of an Android mobile application that records 20-second face videos, which a cloud-based backend server then analyzes. The server utilizes deep learning-based facial recognition and signal processing techniques to extract remote photoplethysmography (rPPG) signals from specific facial regions and predict oxygen saturation (SpO) levels using a Support Vector Regression (SVR) Machine learning model. Signal noise and motion artifacts are mitigated by extracting relevant characteristics from the rPPG. The system was validated by experimental studies, which contained 40 sets of videos collected from 10 participants. The study was conducted under different illumination conditions, which showed low RMSE score (1.45 ±0.1) and MAE score (0.92 ±0.01). Also, our system shows high usability, as indicated by the System Usability Scale (SUS) score of 80.5. The results demonstrate that our method offers a dependable and contactless substitute for continuous SpO2 monitoring, with potential uses in telemedicine and remote patient monitoring.