Classification Prediction of Lung Cancer Based on Machine Learning Method

Dantong Li, Guixin Li, Shuang Li, Ashley Bang
{"title":"Classification Prediction of Lung Cancer Based on Machine Learning Method","authors":"Dantong Li, Guixin Li, Shuang Li, Ashley Bang","doi":"10.4018/ijhisi.333631","DOIUrl":null,"url":null,"abstract":"The K-nearest neighbor interpolation method was used to fill in missing data of five indicators of coronary heart disease, diabetes, total cholesterol, triglycerides, and albumin;, and the SMOTE algorithm was used to balance the number of variable indicators. The Relief-F algorithm was used to remove 18 variable indicators and retain 42 variable indicators. LASSO and ridge regression algorithms were used to remove eight variable indicators and retain 52 variable indicators; The prediction accuracy, recall, and AUC values of the linear kernel support vector machine model filtered using Relief-F and LASSO features are high, and the prediction results are optimal; The test result of random forest screened by Relief-F and LASSO features is better than that of the support vector machine model. It is concluded that the random forest model screened by Relief-F features is better as a prediction of lung cancer typing. The research results provide theoretical data support for predicting lung cancer classification using machine learning methods.","PeriodicalId":56158,"journal":{"name":"International Journal of Healthcare Information Systems and Informatics","volume":"4623 2 1","pages":""},"PeriodicalIF":0.9000,"publicationDate":"2023-11-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Healthcare Information Systems and Informatics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.4018/ijhisi.333631","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"MEDICAL INFORMATICS","Score":null,"Total":0}
引用次数: 0

Abstract

The K-nearest neighbor interpolation method was used to fill in missing data of five indicators of coronary heart disease, diabetes, total cholesterol, triglycerides, and albumin;, and the SMOTE algorithm was used to balance the number of variable indicators. The Relief-F algorithm was used to remove 18 variable indicators and retain 42 variable indicators. LASSO and ridge regression algorithms were used to remove eight variable indicators and retain 52 variable indicators; The prediction accuracy, recall, and AUC values of the linear kernel support vector machine model filtered using Relief-F and LASSO features are high, and the prediction results are optimal; The test result of random forest screened by Relief-F and LASSO features is better than that of the support vector machine model. It is concluded that the random forest model screened by Relief-F features is better as a prediction of lung cancer typing. The research results provide theoretical data support for predicting lung cancer classification using machine learning methods.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于机器学习方法的肺癌分类预测
采用 K-nearest neighbor 插值法填补冠心病、糖尿病、总胆固醇、甘油三酯和白蛋白 5 个指标的缺失数据;采用 SMOTE 算法平衡变量指标的数量。使用 Relief-F 算法删除了 18 个变量指标,保留了 42 个变量指标。采用 Relief-F 和 LASSO 算法筛选的线性核支持向量机模型的预测准确率、召回率和 AUC 值均较高,预测结果最优;采用 Relief-F 和 LASSO 算法筛选的随机森林的测试结果优于支持向量机模型。由此得出结论,采用 Relief-F 特征筛选的随机森林模型在肺癌分型预测方面效果更好。研究结果为使用机器学习方法预测肺癌分型提供了理论数据支持。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
CiteScore
3.30
自引率
0.00%
发文量
12
期刊最新文献
The Prevention and Nursing Care of Common Injuries in Long-Distance Running of College Students Classification Prediction of Lung Cancer Based on Machine Learning Method Assessing the Alignment Between Existing Finnish Patient Portals and the Newly Implemented Finnish Well-Being Reform Effect of Framing and Feedback Levels on Funding and Emotional Support in Medical Crowdfunding Modeling the Factors That Drive the Need for Inter-Facility Transfers to Downstream Services in US Emergency Departments
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1