Camilo L. M. Morais, Kássio M. G. Lima, Andrew W. Dickinson, Tarek Saba, Thomas Bongers, Maneesh N. Singh, Francis L. Martin and Danielle Bury
{"title":"Non-invasive diagnostic test for lung cancer using biospectroscopy and variable selection techniques in saliva samples†","authors":"Camilo L. M. Morais, Kássio M. G. Lima, Andrew W. Dickinson, Tarek Saba, Thomas Bongers, Maneesh N. Singh, Francis L. Martin and Danielle Bury","doi":"10.1039/D4AN00726C","DOIUrl":null,"url":null,"abstract":"<p >Lung cancer is one of the most commonly occurring malignant tumours worldwide. Although some reference methods such as X-ray, computed tomography or bronchoscope are widely used for clinical diagnosis of lung cancer, there is still a need to develop new methods for early detection of lung cancer. Especially needed are approaches that might be non-invasive and fast with high analytical precision and statistically reliable. Herein, we developed a swab “dip” test in saliva whereby swabs were analysed using attenuated total reflection Fourier-transform infrared (ATR-FTIR) spectroscopy harnessed to principal component analysis–quadratic discriminant analysis (QDA) and variable selection techniques employing successive projections algorithm (SPA) and genetic algorithm (GA) for feature selection/extraction combined with QDA. A total of 1944 saliva samples (56 designated as lung-cancer positive and 1888 designed as controls) were obtained in a lung cancer-screening programme being undertaken in North-West England. GA-QDA models achieved, for the test set, sensitivity and specificity values of 100.0% and 99.1%, respectively. Three wavenumbers (1422 cm<small><sup>−1</sup></small>, 1546 cm<small><sup>−1</sup></small> and 1578 cm<small><sup>−1</sup></small>) were identified using the GA-QDA model to distinguish between lung cancer and controls, including ring C–C stretching, C<img>N adenine, Amide II [<em>δ</em>(NH), <em>ν</em>(CN)] and <em>ν</em><small><sub>s</sub></small>(COO<small><sup>−</sup></small>) (polysaccharides, pectin). These findings highlight the potential of using biospectroscopy associated with multivariate classification algorithms to discriminate between benign saliva samples and those with underlying lung cancer.</p>","PeriodicalId":63,"journal":{"name":"Analyst","volume":" 19","pages":" 4851-4861"},"PeriodicalIF":3.3000,"publicationDate":"2024-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Analyst","FirstCategoryId":"92","ListUrlMain":"https://pubs.rsc.org/en/content/articlelanding/2024/an/d4an00726c","RegionNum":3,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"CHEMISTRY, ANALYTICAL","Score":null,"Total":0}
引用次数: 0
Abstract
Lung cancer is one of the most commonly occurring malignant tumours worldwide. Although some reference methods such as X-ray, computed tomography or bronchoscope are widely used for clinical diagnosis of lung cancer, there is still a need to develop new methods for early detection of lung cancer. Especially needed are approaches that might be non-invasive and fast with high analytical precision and statistically reliable. Herein, we developed a swab “dip” test in saliva whereby swabs were analysed using attenuated total reflection Fourier-transform infrared (ATR-FTIR) spectroscopy harnessed to principal component analysis–quadratic discriminant analysis (QDA) and variable selection techniques employing successive projections algorithm (SPA) and genetic algorithm (GA) for feature selection/extraction combined with QDA. A total of 1944 saliva samples (56 designated as lung-cancer positive and 1888 designed as controls) were obtained in a lung cancer-screening programme being undertaken in North-West England. GA-QDA models achieved, for the test set, sensitivity and specificity values of 100.0% and 99.1%, respectively. Three wavenumbers (1422 cm−1, 1546 cm−1 and 1578 cm−1) were identified using the GA-QDA model to distinguish between lung cancer and controls, including ring C–C stretching, CN adenine, Amide II [δ(NH), ν(CN)] and νs(COO−) (polysaccharides, pectin). These findings highlight the potential of using biospectroscopy associated with multivariate classification algorithms to discriminate between benign saliva samples and those with underlying lung cancer.