A cascade deep learning model for diagnosing pharyngeal acid reflux episodes using hypopharyngeal multichannel intraluminal Impedance-pH signals
Jachih Fu, Ping-Huan Lee, Chen-Chi Wang, Ying-Cheng Lin, Chun-Yi Chuang, Yung-An Tsou, Yen-Yang Chen, Sheng-Shun Yang, Han-Chung Lien
Intelligence-based medicine, volume 8, Article 100131, published 2023-01-01. DOI: 10.1016/j.ibmed.2023.100131
https://www.sciencedirect.com/science/article/pii/S2666521223000455
Citations: 0
Abstract
Detecting pharyngeal acid reflux (PAR) episodes from 24-h ambulatory hypopharyngeal multichannel intraluminal impedance-pH (HMII-pH) signals is crucial for diagnosing laryngopharyngeal reflux (LPR). Because effective software for PAR episode detection is lacking, the signals must currently be interpreted manually, which is time-consuming and prone to inter-rater variability. This study introduces a deep learning-based artificial intelligence (AI) system for PAR episode detection and diagnosis using HMII-pH signals. Ninety patients with suspected LPR and 28 healthy volunteers underwent HMII-pH testing in three Taiwanese medical centers. Candidate PAR episodes were defined as esophagopharyngeal pH drops exceeding 2 units, with nadir pH below 5 within 30 seconds during esophageal acidification. A consensus review by three experts validated 84 PAR episodes in 17 subjects. Data preprocessing identified 225 candidate PAR episodes (84 PAR episodes and 141 swallows/artifacts), which were divided into training, validation, and test datasets in a 6:2:2 ratio. Three cascade deep learning AI models were trained. Among them, the cascade Multivariate Long Short-Term Memory with Fully Convolutional Network (MLSTM-FCN) model performed best on the test dataset. At the episode level, this model achieved 0.936 accuracy, 0.941 precision, 0.889 recall, 0.966 specificity, 0.914 F1 score, and 0.864 Matthews correlation coefficient (MCC). At the subject level, the corresponding metrics were 0.917 accuracy, 1.000 precision, 0.818 recall, 1.000 specificity, 0.900 F1 score, and 0.842 MCC. In conclusion, the cascade MLSTM-FCN model demonstrates robust accuracy in diagnosing PAR episodes from HMII-pH signals, offering a promising tool for efficient and consistent PAR episode detection in LPR diagnosis.
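The six metrics quoted in the abstract all follow from the four cells of a binary confusion matrix. The Python sketch below shows one way to compute them. The function name `binary_metrics` is hypothetical, and the confusion-matrix counts are assumptions: the abstract does not report them. They were back-calculated as one set of counts that reproduces the reported values to three decimals, so they should be read as illustrative rather than as the authors' data.

```python
import math


def binary_metrics(tp, fp, fn, tn):
    """Compute accuracy, precision, recall, specificity, F1, and MCC
    from confusion-matrix counts (PAR = positive class)."""
    def safe_div(num, den):
        # Return 0.0 when a denominator is empty instead of raising.
        return num / den if den else 0.0

    mcc_den = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    return {
        "accuracy": safe_div(tp + tn, tp + fp + fn + tn),
        "precision": safe_div(tp, tp + fp),
        "recall": safe_div(tp, tp + fn),
        "specificity": safe_div(tn, tn + fp),
        "f1": safe_div(2 * tp, 2 * tp + fp + fn),
        "mcc": safe_div(tp * tn - fp * fn, mcc_den),
    }


# ASSUMED counts, inferred from the reported metrics, not stated in the abstract.
episode_level = binary_metrics(tp=16, fp=1, fn=2, tn=28)
subject_level = binary_metrics(tp=9, fp=0, fn=2, tn=13)

for name, metrics in [("episode", episode_level), ("subject", subject_level)]:
    print(name, {k: round(v, 3) for k, v in metrics.items()})
```

Under these assumed counts, the subject-level pattern (perfect precision and specificity with recall of 0.818) would correspond to no false positives and two missed subjects. MCC is worth reporting alongside accuracy here because the candidate set is imbalanced (84 PAR episodes against 141 swallows/artifacts).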