PCA的实现使支持向量机使用细胞因子来区分吸烟者和非吸烟者。

Seema Singh Saharan, P. Nagar, K. Creasy, E. Stock, James Feng, M. Malloy, J. Kane
{"title":"PCA的实现使支持向量机使用细胞因子来区分吸烟者和非吸烟者。","authors":"Seema Singh Saharan, P. Nagar, K. Creasy, E. Stock, James Feng, M. Malloy, J. Kane","doi":"10.1109/CSCI54926.2021.00125","DOIUrl":null,"url":null,"abstract":"Presently, the role of cytokines in severe illness like COPD, cancer, cardiac disease associated with smoking is being explored to enable preemptive diagnosis and delivery of treatment interventions. We are investigating the connection between the elevation of inflammatory plasma cytokine in smokers versus nonsmokers. Disease indicator cytokines can be used to monitor the progression of disease which can help in the crucial task of prognosis and definitive diagnosis.Powerful and versatile Machine Learning algorithms can be leveraged to extract insights that cannot be obtained manually. We have applied Support Vector Machine (SVM) on 65 plasma cytokines and other traditional biomarkers to differentiate smokers and nonsmokers. To optimize the classification separability, we have used the following techniques: Principal component analysis (PCA), 10-fold cross validation and variable importance. The primary metric of evaluation is Area Under Receiver Operating Curve (AUROC), though we have additionally recorded and compared prediction accuracy across classifiers.The results are very promising. The AUROC classification accuracy achieved by SVM using the selected predictor feature variables is 89.2% with a 95%CI (85.4%,93.1%). The most prominent cytokines, contributing to the classification, in the order of importance are: I-TAC, Age, TG, G-CSF-CSF-3, MDCCCL22, Eotaxin-3, LIF, IL-2, Eotaxin-2, MIP-3alpha. The AUROC classification accuracy improved to 93% with a 95% CI (90.1%,99.5%) upon choosing the five most prominent cytokines.The versatile prowess of Machine Learning algorithms such as Support Vector Machine can translate pioneering molecular discoveries into actionable insights that can be applied in the field of translational and precision medicine to save life.","PeriodicalId":206881,"journal":{"name":"2021 International Conference on Computational Science and Computational Intelligence (CSCI)","volume":"28 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Implementation of PCA enabled Support Vector Machine using cytokines to differentiate smokers versus nonsmokers.\",\"authors\":\"Seema Singh Saharan, P. Nagar, K. Creasy, E. Stock, James Feng, M. Malloy, J. Kane\",\"doi\":\"10.1109/CSCI54926.2021.00125\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Presently, the role of cytokines in severe illness like COPD, cancer, cardiac disease associated with smoking is being explored to enable preemptive diagnosis and delivery of treatment interventions. We are investigating the connection between the elevation of inflammatory plasma cytokine in smokers versus nonsmokers. Disease indicator cytokines can be used to monitor the progression of disease which can help in the crucial task of prognosis and definitive diagnosis.Powerful and versatile Machine Learning algorithms can be leveraged to extract insights that cannot be obtained manually. We have applied Support Vector Machine (SVM) on 65 plasma cytokines and other traditional biomarkers to differentiate smokers and nonsmokers. To optimize the classification separability, we have used the following techniques: Principal component analysis (PCA), 10-fold cross validation and variable importance. The primary metric of evaluation is Area Under Receiver Operating Curve (AUROC), though we have additionally recorded and compared prediction accuracy across classifiers.The results are very promising. The AUROC classification accuracy achieved by SVM using the selected predictor feature variables is 89.2% with a 95%CI (85.4%,93.1%). The most prominent cytokines, contributing to the classification, in the order of importance are: I-TAC, Age, TG, G-CSF-CSF-3, MDCCCL22, Eotaxin-3, LIF, IL-2, Eotaxin-2, MIP-3alpha. The AUROC classification accuracy improved to 93% with a 95% CI (90.1%,99.5%) upon choosing the five most prominent cytokines.The versatile prowess of Machine Learning algorithms such as Support Vector Machine can translate pioneering molecular discoveries into actionable insights that can be applied in the field of translational and precision medicine to save life.\",\"PeriodicalId\":206881,\"journal\":{\"name\":\"2021 International Conference on Computational Science and Computational Intelligence (CSCI)\",\"volume\":\"28 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-12-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 International Conference on Computational Science and Computational Intelligence (CSCI)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CSCI54926.2021.00125\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 International Conference on Computational Science and Computational Intelligence (CSCI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CSCI54926.2021.00125","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

目前,细胞因子在慢性阻塞性肺病、癌症、心脏病等与吸烟相关的严重疾病中的作用正在被探索,以实现先发制人的诊断和治疗干预。我们正在研究吸烟者与非吸烟者炎症血浆细胞因子升高之间的联系。疾病指示因子可用于监测疾病的进展,有助于预后和明确诊断的关键任务。强大而通用的机器学习算法可以用来提取无法手动获得的见解。我们将支持向量机(SVM)应用于65种血浆细胞因子和其他传统生物标志物上,以区分吸烟者和非吸烟者。为了优化分类可分离性,我们使用了以下技术:主成分分析(PCA), 10倍交叉验证和变量重要性。评估的主要指标是接受者工作曲线下的面积(AUROC),尽管我们还记录并比较了不同分类器的预测精度。结果非常有希望。使用所选择的预测特征变量,SVM的AUROC分类准确率为89.2%,ci为95%(85.4%,93.1%)。在此分类中,最重要的细胞因子按重要性排序为:I-TAC、Age、TG、G-CSF-CSF-3、MDCCCL22、Eotaxin-3、LIF、IL-2、Eotaxin-2、MIP-3alpha。选择五种最突出的细胞因子后,AUROC分类准确率提高到93%,95% CI(90.1%,99.5%)。支持向量机(Support Vector Machine)等机器学习算法的全能能力可以将开创性的分子发现转化为可操作的见解,可应用于转化和精准医学领域,以挽救生命。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Implementation of PCA enabled Support Vector Machine using cytokines to differentiate smokers versus nonsmokers.
Presently, the role of cytokines in severe illness like COPD, cancer, cardiac disease associated with smoking is being explored to enable preemptive diagnosis and delivery of treatment interventions. We are investigating the connection between the elevation of inflammatory plasma cytokine in smokers versus nonsmokers. Disease indicator cytokines can be used to monitor the progression of disease which can help in the crucial task of prognosis and definitive diagnosis.Powerful and versatile Machine Learning algorithms can be leveraged to extract insights that cannot be obtained manually. We have applied Support Vector Machine (SVM) on 65 plasma cytokines and other traditional biomarkers to differentiate smokers and nonsmokers. To optimize the classification separability, we have used the following techniques: Principal component analysis (PCA), 10-fold cross validation and variable importance. The primary metric of evaluation is Area Under Receiver Operating Curve (AUROC), though we have additionally recorded and compared prediction accuracy across classifiers.The results are very promising. The AUROC classification accuracy achieved by SVM using the selected predictor feature variables is 89.2% with a 95%CI (85.4%,93.1%). The most prominent cytokines, contributing to the classification, in the order of importance are: I-TAC, Age, TG, G-CSF-CSF-3, MDCCCL22, Eotaxin-3, LIF, IL-2, Eotaxin-2, MIP-3alpha. The AUROC classification accuracy improved to 93% with a 95% CI (90.1%,99.5%) upon choosing the five most prominent cytokines.The versatile prowess of Machine Learning algorithms such as Support Vector Machine can translate pioneering molecular discoveries into actionable insights that can be applied in the field of translational and precision medicine to save life.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Remote Video Surveillance Effects of Social Distancing Intention, Affective Risk Perception, and Cabin Fever Syndrome on Perceived Value of E-learning : Type of submission: Late Breaking Paper / Most relevant symposium: CSCI-ISED Cybersecurity Integration: Deploying Critical Infrastructure Security and Resilience Topics into the Undergraduate Curriculum Distributed Algorithms for k-Coverage in Mobile Sensor Networks Software Development Methodologies for Virtual Reality
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1