Lucheng Fang, Jie Cai, Zilin Huang, Aikebaier Tuohuti, Xiong Chen
{"title":"Assessment of simulated snoring sounds with artificial intelligence for the diagnosis of obstructive sleep apnea.","authors":"Lucheng Fang, Jie Cai, Zilin Huang, Aikebaier Tuohuti, Xiong Chen","doi":"10.1016/j.sleep.2024.11.018","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Performing simulated snoring (SS) is a commonly used method to evaluate the source of snoring in obstructive sleep apnea (OSA). SS sounds is considered as a potential biomarker for OSA. SS sounds can be easily recorded, which is a cost-effective method for prescreening purposes.</p><p><strong>Objective: </strong>This study aimed to validate the performance of artificial intelligence (AI) models using SS sounds for OSA diagnosis.</p><p><strong>Methods: </strong>All participants underwent full-night polysomnography (PSG) monitoring at the sleep center. SS sounds of the participants were recorded during the laryngoscopy procedure. The audio data were processed via Python, and relevant features were extracted. OSA diagnostic models were developed using three machine learning (ML) algorithms and one deep learning (DL) algorithm. The diagnostic performance was evaluated by multiple indicators.</p><p><strong>Results: </strong>A total of 465 participants were included. For the support vector machine algorithm, the accuracy values at apnea-hypopnea index (AHI) levels of 5, 15, and 30 per hour were 0.914, 0.887, and 0.807, respectively. For the K-nearest neighbor algorithm, the accuracy values at AHI levels of 5, 15, and 30 per hour were 0.896, 0.872, and 0.756, respectively. For the random forest algorithm, the accuracy values at AHI levels of 5, 15, and 30 per hour were 0.905, 0.881, and 0.804, respectively. For the audio spectrogram transformer algorithm, the accuracy values at AHI levels of 5, 15, and 30 per hour were 0.926, 0.887, and 0.830, respectively.</p><p><strong>Conclusions: </strong>Our study demonstrates that DL models can effectively screen and identify OSA with commendable performance. In addition, the identification ability of the DL models was better than that of any of the ML models.</p>","PeriodicalId":21874,"journal":{"name":"Sleep medicine","volume":"125 ","pages":"100-107"},"PeriodicalIF":3.8000,"publicationDate":"2024-11-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Sleep medicine","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1016/j.sleep.2024.11.018","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CLINICAL NEUROLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Background: Performing simulated snoring (SS) is a commonly used method to evaluate the source of snoring in obstructive sleep apnea (OSA). SS sounds is considered as a potential biomarker for OSA. SS sounds can be easily recorded, which is a cost-effective method for prescreening purposes.
Objective: This study aimed to validate the performance of artificial intelligence (AI) models using SS sounds for OSA diagnosis.
Methods: All participants underwent full-night polysomnography (PSG) monitoring at the sleep center. SS sounds of the participants were recorded during the laryngoscopy procedure. The audio data were processed via Python, and relevant features were extracted. OSA diagnostic models were developed using three machine learning (ML) algorithms and one deep learning (DL) algorithm. The diagnostic performance was evaluated by multiple indicators.
Results: A total of 465 participants were included. For the support vector machine algorithm, the accuracy values at apnea-hypopnea index (AHI) levels of 5, 15, and 30 per hour were 0.914, 0.887, and 0.807, respectively. For the K-nearest neighbor algorithm, the accuracy values at AHI levels of 5, 15, and 30 per hour were 0.896, 0.872, and 0.756, respectively. For the random forest algorithm, the accuracy values at AHI levels of 5, 15, and 30 per hour were 0.905, 0.881, and 0.804, respectively. For the audio spectrogram transformer algorithm, the accuracy values at AHI levels of 5, 15, and 30 per hour were 0.926, 0.887, and 0.830, respectively.
Conclusions: Our study demonstrates that DL models can effectively screen and identify OSA with commendable performance. In addition, the identification ability of the DL models was better than that of any of the ML models.
期刊介绍:
Sleep Medicine aims to be a journal no one involved in clinical sleep medicine can do without.
A journal primarily focussing on the human aspects of sleep, integrating the various disciplines that are involved in sleep medicine: neurology, clinical neurophysiology, internal medicine (particularly pulmonology and cardiology), psychology, psychiatry, sleep technology, pediatrics, neurosurgery, otorhinolaryngology, and dentistry.
The journal publishes the following types of articles: Reviews (also intended as a way to bridge the gap between basic sleep research and clinical relevance); Original Research Articles; Full-length articles; Brief communications; Controversies; Case reports; Letters to the Editor; Journal search and commentaries; Book reviews; Meeting announcements; Listing of relevant organisations plus web sites.