支持向量机在基于癌症生物标志物的患者群体分层中发挥作用吗？

Archives of proteomics and bioinformatics Pub Date : 2021-01-01

Ben Lanza, Deepak Parashar

{"title":"支持向量机在基于癌症生物标志物的患者群体分层中发挥作用吗？","authors":"Ben Lanza, Deepak Parashar","doi":"","DOIUrl":null,"url":null,"abstract":"Biomarkers are known to be the key driver behind targeted cancer therapies by either stratifying the patients into risk categories or identifying patient subgroups most likely to benefit. However, the ability of a biomarker to stratify patients relies heavily on the type of clinical endpoint data being collected. Of particular interest is the scenario when the biomarker involved is a continuous one where the challenge is often to identify cut-offs or thresholds that would stratify the population according to the level of clinical outcome or treatment benefit. On the other hand, there are well-established Machine Learning (ML) methods such as the Support Vector Machines (SVM) that classify data, both linear as well as non-linear, into subgroups in an optimal way. SVMs have proven to be immensely useful in data-centric engineering and recently researchers have also sought its applications in healthcare. Despite their wide applicability, SVMs are not yet in the mainstream of toolkits to be utilised in observational clinical studies or in clinical trials. This research investigates the very role of SVMs in stratifying the patient population based on a continuous biomarker across a variety of datasets. Based on the mathematical framework underlying SVMs, we formulate and fit algorithms in the context of biomarker stratified cancer datasets to evaluate their merits. The analysis reveals their superior performance for certain data-types when compared to other ML methods suggesting that SVMs may have the potential to provide a robust yet simplistic solution to stratify real cancer patients based on continuous biomarkers, and hence accelerate the identification of subgroups for improved clinical outcomes or guide targeted cancer therapies.","PeriodicalId":87222,"journal":{"name":"Archives of proteomics and bioinformatics","volume":"2 1","pages":"20-38"},"PeriodicalIF":0.0000,"publicationDate":"2021-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7611982/pdf/","citationCount":"0","resultStr":"{\"title\":\"Do Support Vector Machines Play a Role in Stratifying Patient Population Based on Cancer Biomarkers?\",\"authors\":\"Ben Lanza, Deepak Parashar\",\"doi\":\"\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Biomarkers are known to be the key driver behind targeted cancer therapies by either stratifying the patients into risk categories or identifying patient subgroups most likely to benefit. However, the ability of a biomarker to stratify patients relies heavily on the type of clinical endpoint data being collected. Of particular interest is the scenario when the biomarker involved is a continuous one where the challenge is often to identify cut-offs or thresholds that would stratify the population according to the level of clinical outcome or treatment benefit. On the other hand, there are well-established Machine Learning (ML) methods such as the Support Vector Machines (SVM) that classify data, both linear as well as non-linear, into subgroups in an optimal way. SVMs have proven to be immensely useful in data-centric engineering and recently researchers have also sought its applications in healthcare. Despite their wide applicability, SVMs are not yet in the mainstream of toolkits to be utilised in observational clinical studies or in clinical trials. This research investigates the very role of SVMs in stratifying the patient population based on a continuous biomarker across a variety of datasets. Based on the mathematical framework underlying SVMs, we formulate and fit algorithms in the context of biomarker stratified cancer datasets to evaluate their merits. The analysis reveals their superior performance for certain data-types when compared to other ML methods suggesting that SVMs may have the potential to provide a robust yet simplistic solution to stratify real cancer patients based on continuous biomarkers, and hence accelerate the identification of subgroups for improved clinical outcomes or guide targeted cancer therapies.\",\"PeriodicalId\":87222,\"journal\":{\"name\":\"Archives of proteomics and bioinformatics\",\"volume\":\"2 1\",\"pages\":\"20-38\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7611982/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Archives of proteomics and bioinformatics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Archives of proteomics and bioinformatics","FirstCategoryId":"1085","ListUrlMain":"","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

已知生物标志物是靶向癌症治疗背后的关键驱动因素，可以将患者分为风险类别或确定最有可能受益的患者亚组。然而，生物标志物对患者进行分层的能力在很大程度上依赖于所收集的临床终点数据的类型。特别令人感兴趣的是，当所涉及的生物标志物是连续的，其中的挑战往往是确定截断或阈值，根据临床结果或治疗益处的水平对人群进行分层。另一方面，有完善的机器学习（ML）方法，如支持向量机（SVM），它以最佳方式将线性和非线性数据分类到子组中。事实证明，svm在以数据为中心的工程中非常有用，最近研究人员也在医疗保健领域寻求其应用。尽管支持向量机具有广泛的适用性，但它尚未成为用于观察性临床研究或临床试验的主流工具包。本研究探讨了支持向量机在基于各种数据集的连续生物标志物对患者群体进行分层中的作用。基于支持向量机的数学框架，我们在生物标志物分层癌症数据集的背景下制定和拟合算法，以评估其优点。分析显示，与其他ML方法相比，支持向量机在某些数据类型上具有优越的性能，这表明支持向量机可能有潜力提供一种强大而简单的解决方案，根据连续的生物标志物对真实的癌症患者进行分层，从而加速识别亚组，以改善临床结果或指导靶向癌症治疗。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

摘要图片

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Do Support Vector Machines Play a Role in Stratifying Patient Population Based on Cancer Biomarkers?

Biomarkers are known to be the key driver behind targeted cancer therapies by either stratifying the patients into risk categories or identifying patient subgroups most likely to benefit. However, the ability of a biomarker to stratify patients relies heavily on the type of clinical endpoint data being collected. Of particular interest is the scenario when the biomarker involved is a continuous one where the challenge is often to identify cut-offs or thresholds that would stratify the population according to the level of clinical outcome or treatment benefit. On the other hand, there are well-established Machine Learning (ML) methods such as the Support Vector Machines (SVM) that classify data, both linear as well as non-linear, into subgroups in an optimal way. SVMs have proven to be immensely useful in data-centric engineering and recently researchers have also sought its applications in healthcare. Despite their wide applicability, SVMs are not yet in the mainstream of toolkits to be utilised in observational clinical studies or in clinical trials. This research investigates the very role of SVMs in stratifying the patient population based on a continuous biomarker across a variety of datasets. Based on the mathematical framework underlying SVMs, we formulate and fit algorithms in the context of biomarker stratified cancer datasets to evaluate their merits. The analysis reveals their superior performance for certain data-types when compared to other ML methods suggesting that SVMs may have the potential to provide a robust yet simplistic solution to stratify real cancer patients based on continuous biomarkers, and hence accelerate the identification of subgroups for improved clinical outcomes or guide targeted cancer therapies.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Archives of proteomics and bioinformatics

自引率

0.00%

发文量