An end-to-end audio classification framework with diverse features for obstructive sleep apnea-hypopnea syndrome diagnosis

IF 3.4 2区计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Applied Intelligence Pub Date : 2025-02-10 DOI:10.1007/s10489-025-06299-3

Bin Li, Xihe Qiu, Xiaoyu Tan, Long Yang, Jing Tao, Zhijun Fang, Jingjing Huang

{"title":"An end-to-end audio classification framework with diverse features for obstructive sleep apnea-hypopnea syndrome diagnosis","authors":"Bin Li, Xihe Qiu, Xiaoyu Tan, Long Yang, Jing Tao, Zhijun Fang, Jingjing Huang","doi":"10.1007/s10489-025-06299-3","DOIUrl":null,"url":null,"abstract":"<div><p>Obstructive sleep apnea-hypopnea syndrome (OSAHS) is a prevalent chronic disorder that affects sleep quality and general health. The current diagnostic methods, primarily polysomnography (PSG), are laborious. Furthermore, audio-based methods for diagnosing OSAHS face limited sample sizes and neglect patients’ physiological signs and medical histories. To address these challenges, we introduce a data-driven framework called DFNet, which also considers patients’ medical histories and health indicators. DFNet incorporates an automated audio segmentation- and labeling-based preprocessing procedure to reduce expert annotation costs and subjective errors. We employed random convolutional kernels based on receptive fields for audio feature extraction purposes. These kernels captured both local and global features within the input audio. Additionally, for the first time, we introduced a medical language model that utilizes patients’ medical histories and physiological information as covariates to enhance features. We extensively validated DFNet on an OSAHS dataset obtained from a collaborative university hospital. Our framework classified patients into four categories according to their OSAHS severity: normal, mild, moderate, and severe. DFNet achieved state-of-the-art performance, with a four-class accuracy of 84.12%. DFNet offers a large-scale and cost-effective screening approach for diagnosing OSAHS, reducing the labor requirements of diagnosis. Our code is available at https://github.com/testlbin/DFNet.</p></div>","PeriodicalId":8041,"journal":{"name":"Applied Intelligence","volume":"55 6","pages":""},"PeriodicalIF":3.4000,"publicationDate":"2025-02-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Applied Intelligence","FirstCategoryId":"94","ListUrlMain":"https://link.springer.com/article/10.1007/s10489-025-06299-3","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

Abstract

Obstructive sleep apnea-hypopnea syndrome (OSAHS) is a prevalent chronic disorder that affects sleep quality and general health. The current diagnostic methods, primarily polysomnography (PSG), are laborious. Furthermore, audio-based methods for diagnosing OSAHS face limited sample sizes and neglect patients’ physiological signs and medical histories. To address these challenges, we introduce a data-driven framework called DFNet, which also considers patients’ medical histories and health indicators. DFNet incorporates an automated audio segmentation- and labeling-based preprocessing procedure to reduce expert annotation costs and subjective errors. We employed random convolutional kernels based on receptive fields for audio feature extraction purposes. These kernels captured both local and global features within the input audio. Additionally, for the first time, we introduced a medical language model that utilizes patients’ medical histories and physiological information as covariates to enhance features. We extensively validated DFNet on an OSAHS dataset obtained from a collaborative university hospital. Our framework classified patients into four categories according to their OSAHS severity: normal, mild, moderate, and severe. DFNet achieved state-of-the-art performance, with a four-class accuracy of 84.12%. DFNet offers a large-scale and cost-effective screening approach for diagnosing OSAHS, reducing the labor requirements of diagnosis. Our code is available at https://github.com/testlbin/DFNet.

Abstract Image

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

求助全文

约1分钟内获得全文去求助

来源期刊

Applied Intelligence 工程技术-计算机：人工智能

CiteScore

6.60

自引率

20.80%

发文量

1361

审稿时长

5.9 months

期刊介绍： With a focus on research in artificial intelligence and neural networks, this journal addresses issues involving solutions of real-life manufacturing, defense, management, government and industrial problems which are too complex to be solved through conventional approaches and require the simulation of intelligent thought processes, heuristics, applications of knowledge, and distributed and parallel processing. The integration of these multiple approaches in solving complex problems is of particular importance. The journal presents new and original research and technological developments, addressing real and complex issues applicable to difficult problems. It provides a medium for exchanging scientific research and technological achievements accomplished by the international community.