使用机器学习算法,利用活体皮肤病变的近红外光谱数据诊断皮肤癌

IF 5.3 2区 医学 Q1 ENGINEERING, BIOMEDICAL Biocybernetics and Biomedical Engineering Pub Date : 2024-10-01 DOI:10.1016/j.bbe.2024.10.001
{"title":"使用机器学习算法,利用活体皮肤病变的近红外光谱数据诊断皮肤癌","authors":"","doi":"10.1016/j.bbe.2024.10.001","DOIUrl":null,"url":null,"abstract":"<div><div>Skin lesions are classified in benign or malignant. Among the malignant, melanoma is a very aggressive cancer and the major cause of deaths. So, early diagnosis of skin cancer is very desired. In the last few years, there is a growing interest in computer aided diagnostic (CAD) of skin lesions. Near-Infrared (NIR) spectroscopy may provide an alternative source of information to automated CAD of skin lesions to be used with the modern techniques of machine learning and deep learning (MDL). One of the main limitations to apply MDL to spectroscopy is the lack of public datasets. Since there is no public dataset of NIR spectral data to skin lesions, as far as we know, an effort has been made and a new dataset named NIR-SC-UFES, has been collected, annotated and analyzed generating the gold-standard for classification of NIR spectral data to skin cancer. Next, the machine learning algorithms XGBoost, CatBoost, LightGBM, 1D-convolutional neural network (1D-CNN) and standard algorithms as SVM and PLS-DA were investigated to classify cancer and non-cancer skin lesions. Experimental results indicate that the best performance was obtained by LightGBM with pre-processing using standard normal variate (SNV), feature extraction and data augmentation with Generative Adversarial Networks (GAN) providing values of 0.839 for balanced accuracy, 0.851 for recall, 0.852 for precision, and 0.850 for F-score. The obtained results indicate the first steps in CAD of skin lesions aiming the automated triage of patients with skin lesions <em>in vivo</em> using NIR spectral data.</div></div>","PeriodicalId":55381,"journal":{"name":"Biocybernetics and Biomedical Engineering","volume":null,"pages":null},"PeriodicalIF":5.3000,"publicationDate":"2024-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Skin cancer diagnosis using NIR spectroscopy data of skin lesions in vivo using machine learning algorithms\",\"authors\":\"\",\"doi\":\"10.1016/j.bbe.2024.10.001\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Skin lesions are classified in benign or malignant. Among the malignant, melanoma is a very aggressive cancer and the major cause of deaths. So, early diagnosis of skin cancer is very desired. In the last few years, there is a growing interest in computer aided diagnostic (CAD) of skin lesions. Near-Infrared (NIR) spectroscopy may provide an alternative source of information to automated CAD of skin lesions to be used with the modern techniques of machine learning and deep learning (MDL). One of the main limitations to apply MDL to spectroscopy is the lack of public datasets. Since there is no public dataset of NIR spectral data to skin lesions, as far as we know, an effort has been made and a new dataset named NIR-SC-UFES, has been collected, annotated and analyzed generating the gold-standard for classification of NIR spectral data to skin cancer. Next, the machine learning algorithms XGBoost, CatBoost, LightGBM, 1D-convolutional neural network (1D-CNN) and standard algorithms as SVM and PLS-DA were investigated to classify cancer and non-cancer skin lesions. Experimental results indicate that the best performance was obtained by LightGBM with pre-processing using standard normal variate (SNV), feature extraction and data augmentation with Generative Adversarial Networks (GAN) providing values of 0.839 for balanced accuracy, 0.851 for recall, 0.852 for precision, and 0.850 for F-score. The obtained results indicate the first steps in CAD of skin lesions aiming the automated triage of patients with skin lesions <em>in vivo</em> using NIR spectral data.</div></div>\",\"PeriodicalId\":55381,\"journal\":{\"name\":\"Biocybernetics and Biomedical Engineering\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":5.3000,\"publicationDate\":\"2024-10-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Biocybernetics and Biomedical Engineering\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0208521624000822\",\"RegionNum\":2,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, BIOMEDICAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Biocybernetics and Biomedical Engineering","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0208521624000822","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, BIOMEDICAL","Score":null,"Total":0}
引用次数: 0

摘要

皮肤病变分为良性和恶性。在恶性肿瘤中,黑色素瘤是一种侵袭性很强的癌症,也是导致死亡的主要原因。因此,早期诊断皮肤癌是非常必要的。最近几年,人们对皮肤病变的计算机辅助诊断(CAD)越来越感兴趣。近红外(NIR)光谱可为皮肤病变的自动计算机辅助诊断提供另一种信息来源,可与现代机器学习和深度学习(MDL)技术结合使用。将 MDL 应用于光谱学的主要限制之一是缺乏公共数据集。据我们所知,目前还没有关于皮肤病变的近红外光谱数据的公共数据集,因此我们努力收集、注释和分析了一个名为 NIR-SC-UFES 的新数据集,该数据集为皮肤癌的近红外光谱数据分类提供了黄金标准。接下来,研究了机器学习算法 XGBoost、CatBoost、LightGBM、一维卷积神经网络(1D-CNN)以及 SVM 和 PLS-DA 等标准算法,以对癌症和非癌症皮肤病变进行分类。实验结果表明,在使用标准正态变异(SNV)进行预处理、特征提取和使用生成式对抗网络(GAN)进行数据增强后,LightGBM 的性能最佳,其平衡准确率为 0.839,召回率为 0.851,精确度为 0.852,F-score 为 0.850。所获得的结果表明,利用近红外光谱数据对体内皮肤病变患者进行自动分诊是皮肤病变计算机辅助诊断的第一步。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Skin cancer diagnosis using NIR spectroscopy data of skin lesions in vivo using machine learning algorithms
Skin lesions are classified in benign or malignant. Among the malignant, melanoma is a very aggressive cancer and the major cause of deaths. So, early diagnosis of skin cancer is very desired. In the last few years, there is a growing interest in computer aided diagnostic (CAD) of skin lesions. Near-Infrared (NIR) spectroscopy may provide an alternative source of information to automated CAD of skin lesions to be used with the modern techniques of machine learning and deep learning (MDL). One of the main limitations to apply MDL to spectroscopy is the lack of public datasets. Since there is no public dataset of NIR spectral data to skin lesions, as far as we know, an effort has been made and a new dataset named NIR-SC-UFES, has been collected, annotated and analyzed generating the gold-standard for classification of NIR spectral data to skin cancer. Next, the machine learning algorithms XGBoost, CatBoost, LightGBM, 1D-convolutional neural network (1D-CNN) and standard algorithms as SVM and PLS-DA were investigated to classify cancer and non-cancer skin lesions. Experimental results indicate that the best performance was obtained by LightGBM with pre-processing using standard normal variate (SNV), feature extraction and data augmentation with Generative Adversarial Networks (GAN) providing values of 0.839 for balanced accuracy, 0.851 for recall, 0.852 for precision, and 0.850 for F-score. The obtained results indicate the first steps in CAD of skin lesions aiming the automated triage of patients with skin lesions in vivo using NIR spectral data.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
CiteScore
16.50
自引率
6.20%
发文量
77
审稿时长
38 days
期刊介绍: Biocybernetics and Biomedical Engineering is a quarterly journal, founded in 1981, devoted to publishing the results of original, innovative and creative research investigations in the field of Biocybernetics and biomedical engineering, which bridges mathematical, physical, chemical and engineering methods and technology to analyse physiological processes in living organisms as well as to develop methods, devices and systems used in biology and medicine, mainly in medical diagnosis, monitoring systems and therapy. The Journal''s mission is to advance scientific discovery into new or improved standards of care, and promotion a wide-ranging exchange between science and its application to humans.
期刊最新文献
Automating synaptic plasticity analysis: A deep learning approach to segmenting hippocampal field potential signal Probabilistic and explainable modeling of Phase–Phase Cross-Frequency Coupling patterns in EEG. Application to dyslexia diagnosis Skin cancer diagnosis using NIR spectroscopy data of skin lesions in vivo using machine learning algorithms Validation of a body sensor network for cardiorespiratory monitoring during dynamic activities Quantitative evaluation of the effect of circle of willis structures on cerebral hyperperfusion: A multi-scale model analysis
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1