Camilo de Morais, Kassio Michell Gomes de Lima, Andrew W Dickinson, Tarek Saba, Thomas Bongers, Maneesh N Singh, Francis L Martin, Danielle Bury
{"title":"Non-invasive diagnostic test for lung cancer using biospectroscopy and variable selection techniques in saliva samples","authors":"Camilo de Morais, Kassio Michell Gomes de Lima, Andrew W Dickinson, Tarek Saba, Thomas Bongers, Maneesh N Singh, Francis L Martin, Danielle Bury","doi":"10.1039/d4an00726c","DOIUrl":null,"url":null,"abstract":"Lung cancer is one of the most commonly occurring malignant tumours worldwide. Although some reference methods such as X-ray, computed tomography or bronchoscope are widely used for clinical diagnosis of lung cancer, there is still a need to develop new methods for early detection of lung cancer. Especially needed are approaches that might be non-invasive and fast with high analytical precision and statistically reliable. Herein, we developed a swab “dip” test in saliva whereby swabs were analysed using attenuated total reflection Fourier-transform infrared (ATR-FTIR) spectroscopy harnessed to principal component analysis–quadratic discriminant analysis (QDA) and variable selection techniques employing successive projections algorithm (SPA) and genetic algorithm (GA) for feature selection/extraction combined with QDA. A total of 1944 saliva samples (56 designated as lung-cancer positive and 1888 designed as controls) were obtained in a lung cancer-screening programme being undertaken in North-West England. GA-QDA models achieved, for the test set, sensitivity and specificity values of 100.0% and 99.1%, respectively. Three wavenumbers (1422 cm-1, 1546 cm-1 and 1578 cm-1) were identified using the GA-QDA model to distinguish between lung cancer and controls, including ring C-C stretching, C=N adenine, Amide II [δ(NH), ν(CN)] and νs(COO-) (polysaccharides, pectin). These findings highlight the potential of using biospectroscopy associated with multivariate classification algorithms to discriminate between benign saliva samples and those with underlying lung cancer.","PeriodicalId":63,"journal":{"name":"Analyst","volume":null,"pages":null},"PeriodicalIF":3.6000,"publicationDate":"2024-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Analyst","FirstCategoryId":"92","ListUrlMain":"https://doi.org/10.1039/d4an00726c","RegionNum":3,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"CHEMISTRY, ANALYTICAL","Score":null,"Total":0}
引用次数: 0
Abstract
Lung cancer is one of the most commonly occurring malignant tumours worldwide. Although some reference methods such as X-ray, computed tomography or bronchoscope are widely used for clinical diagnosis of lung cancer, there is still a need to develop new methods for early detection of lung cancer. Especially needed are approaches that might be non-invasive and fast with high analytical precision and statistically reliable. Herein, we developed a swab “dip” test in saliva whereby swabs were analysed using attenuated total reflection Fourier-transform infrared (ATR-FTIR) spectroscopy harnessed to principal component analysis–quadratic discriminant analysis (QDA) and variable selection techniques employing successive projections algorithm (SPA) and genetic algorithm (GA) for feature selection/extraction combined with QDA. A total of 1944 saliva samples (56 designated as lung-cancer positive and 1888 designed as controls) were obtained in a lung cancer-screening programme being undertaken in North-West England. GA-QDA models achieved, for the test set, sensitivity and specificity values of 100.0% and 99.1%, respectively. Three wavenumbers (1422 cm-1, 1546 cm-1 and 1578 cm-1) were identified using the GA-QDA model to distinguish between lung cancer and controls, including ring C-C stretching, C=N adenine, Amide II [δ(NH), ν(CN)] and νs(COO-) (polysaccharides, pectin). These findings highlight the potential of using biospectroscopy associated with multivariate classification algorithms to discriminate between benign saliva samples and those with underlying lung cancer.