Chieh-Sin Hsu, Ji-Yan Han, Ying-Hui Lai, Chi-Te Wang
{"title":"Ambulatory Phonation Monitoring Using Wireless Bluetooth Earphones.","authors":"Chieh-Sin Hsu, Ji-Yan Han, Ying-Hui Lai, Chi-Te Wang","doi":"10.1016/j.jvoice.2024.09.010","DOIUrl":null,"url":null,"abstract":"<p><strong>Objective: </strong>Ambulatory phonation monitoring (APM) has a long evolving history. Current devices mostly use a contact microphone or accelerometer over the anterior neck, limiting its general acceptance outside of academic purposes. This study applied wireless Bluetooth earphones to receive voice signals. We also designed a mobile App with personalized AI model to identify phonation segments.</p><p><strong>Study design: </strong>Proof of concept study.</p><p><strong>Setting: </strong>Acoustic laboratory.</p><p><strong>Methods: </strong>The materials comprised 1-hour audio files from seven teachers recorded in the classroom. The first 5minutes were used to train the personalized SpeechDetection models using deep neural networks. Another six segments (30 seconds each) were selected for assessing the accuracy of this APM system using two parameters: (1) speech intensity, which was compared to the gold standard measured by CLIO 12, a professional system for voice recording, and (2) phonation segments, which was compared with manual labeling.</p><p><strong>Results: </strong>The training accuracy of the SpeechDetection model ranged from 91.2% to 98.5%, with a mean of 95.4%. The testing accuracy for detecting phonation segments ranged from 88.4% to 97.0% (mean: 91.5%). The Kappa value of consistency ranged from 0.710 to 0.931 (mean: 0.813, P < 0.001 for all seven participants). After linear calibration, the accuracy of measuring speech intensity ranged from 0.846 to 0.927 (mean: 0.885, P < 0.001, Pearson correlation coefficient).</p><p><strong>Conclusions: </strong>The study results demonstrated that a novel APM system using wireless earphones with mobile apps can accurately measure phonation segments and speech intensity for teachers in the classrooms. Further experiments under different environments with more participants are mandatory before extrapolating this system to real-world use cases.</p><p><strong>Level of evidence: </strong>N/A.</p>","PeriodicalId":49954,"journal":{"name":"Journal of Voice","volume":null,"pages":null},"PeriodicalIF":2.5000,"publicationDate":"2024-10-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Voice","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1016/j.jvoice.2024.09.010","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUDIOLOGY & SPEECH-LANGUAGE PATHOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Objective: Ambulatory phonation monitoring (APM) has a long evolving history. Current devices mostly use a contact microphone or accelerometer over the anterior neck, limiting its general acceptance outside of academic purposes. This study applied wireless Bluetooth earphones to receive voice signals. We also designed a mobile App with personalized AI model to identify phonation segments.
Study design: Proof of concept study.
Setting: Acoustic laboratory.
Methods: The materials comprised 1-hour audio files from seven teachers recorded in the classroom. The first 5minutes were used to train the personalized SpeechDetection models using deep neural networks. Another six segments (30 seconds each) were selected for assessing the accuracy of this APM system using two parameters: (1) speech intensity, which was compared to the gold standard measured by CLIO 12, a professional system for voice recording, and (2) phonation segments, which was compared with manual labeling.
Results: The training accuracy of the SpeechDetection model ranged from 91.2% to 98.5%, with a mean of 95.4%. The testing accuracy for detecting phonation segments ranged from 88.4% to 97.0% (mean: 91.5%). The Kappa value of consistency ranged from 0.710 to 0.931 (mean: 0.813, P < 0.001 for all seven participants). After linear calibration, the accuracy of measuring speech intensity ranged from 0.846 to 0.927 (mean: 0.885, P < 0.001, Pearson correlation coefficient).
Conclusions: The study results demonstrated that a novel APM system using wireless earphones with mobile apps can accurately measure phonation segments and speech intensity for teachers in the classrooms. Further experiments under different environments with more participants are mandatory before extrapolating this system to real-world use cases.
期刊介绍:
The Journal of Voice is widely regarded as the world''s premiere journal for voice medicine and research. This peer-reviewed publication is listed in Index Medicus and is indexed by the Institute for Scientific Information. The journal contains articles written by experts throughout the world on all topics in voice sciences, voice medicine and surgery, and speech-language pathologists'' management of voice-related problems. The journal includes clinical articles, clinical research, and laboratory research. Members of the Foundation receive the journal as a benefit of membership.