Seungjun Ahn, Eun Jeong Oh, Matthew I Saleem, Tristan Tham
{"title":"口咽癌延长放疗分类中的机器学习方法:全国癌症数据库","authors":"Seungjun Ahn, Eun Jeong Oh, Matthew I Saleem, Tristan Tham","doi":"10.1002/ohn.926","DOIUrl":null,"url":null,"abstract":"<p><strong>Objective: </strong>To investigate the accuracy of machine learning (ML) algorithms in stratifying risk of prolonged radiation treatment duration (RTD), defined as greater than 50 days, for patients with oropharyngeal squamous cell carcinoma (OPSCC).</p><p><strong>Study design: </strong>Retrospective cohort study.</p><p><strong>Setting: </strong>National Cancer Database (NCDB).</p><p><strong>Methods: </strong>The NCDB was queried between 2004 to 2016 for patients with OPSCC treated with radiation therapy (RT) or chemoradiation as primary treatment. To predict risk of prolonged RTD, 8 different ML algorithms were compared against traditional logistic regression using various performance metrics. Data was split into a distribution of 70% for training and 30% for testing.</p><p><strong>Results: </strong>A total of 3152 patients were included (1928 prolonged RT, 1224 not prolonged RT). As a whole, based on performance metrics, random forest (RF) was found to most accurately predict prolonged RTD compared to both other ML methods and traditional logistic regression.</p><p><strong>Conclusion: </strong>Our assessment of various ML techniques showed that RF was superior to traditional logistic regression at classifying OPSCC patients at risk of prolonged RTD. Application of such algorithms may have potential to identify high risk patients and enable early interventions to improve survival.</p>","PeriodicalId":19707,"journal":{"name":"Otolaryngology- Head and Neck Surgery","volume":" ","pages":"1764-1772"},"PeriodicalIF":2.6000,"publicationDate":"2024-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Machine Learning Methods in Classification of Prolonged Radiation Therapy in Oropharyngeal Cancer: National Cancer Database.\",\"authors\":\"Seungjun Ahn, Eun Jeong Oh, Matthew I Saleem, Tristan Tham\",\"doi\":\"10.1002/ohn.926\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Objective: </strong>To investigate the accuracy of machine learning (ML) algorithms in stratifying risk of prolonged radiation treatment duration (RTD), defined as greater than 50 days, for patients with oropharyngeal squamous cell carcinoma (OPSCC).</p><p><strong>Study design: </strong>Retrospective cohort study.</p><p><strong>Setting: </strong>National Cancer Database (NCDB).</p><p><strong>Methods: </strong>The NCDB was queried between 2004 to 2016 for patients with OPSCC treated with radiation therapy (RT) or chemoradiation as primary treatment. To predict risk of prolonged RTD, 8 different ML algorithms were compared against traditional logistic regression using various performance metrics. Data was split into a distribution of 70% for training and 30% for testing.</p><p><strong>Results: </strong>A total of 3152 patients were included (1928 prolonged RT, 1224 not prolonged RT). As a whole, based on performance metrics, random forest (RF) was found to most accurately predict prolonged RTD compared to both other ML methods and traditional logistic regression.</p><p><strong>Conclusion: </strong>Our assessment of various ML techniques showed that RF was superior to traditional logistic regression at classifying OPSCC patients at risk of prolonged RTD. Application of such algorithms may have potential to identify high risk patients and enable early interventions to improve survival.</p>\",\"PeriodicalId\":19707,\"journal\":{\"name\":\"Otolaryngology- Head and Neck Surgery\",\"volume\":\" \",\"pages\":\"1764-1772\"},\"PeriodicalIF\":2.6000,\"publicationDate\":\"2024-12-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Otolaryngology- Head and Neck Surgery\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1002/ohn.926\",\"RegionNum\":3,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2024/7/31 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"Q1\",\"JCRName\":\"OTORHINOLARYNGOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Otolaryngology- Head and Neck Surgery","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1002/ohn.926","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/7/31 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"OTORHINOLARYNGOLOGY","Score":null,"Total":0}
引用次数: 0
摘要
目的调查机器学习(ML)算法对口咽鳞状细胞癌(OPSCC)患者放疗时间延长(RTD)(定义为超过50天)风险分层的准确性:研究设计:回顾性队列研究:国家癌症数据库(NCDB):2004年至2016年期间,对NCDB中以放疗(RT)或化放疗作为主要治疗手段的OPSCC患者进行了查询。为了预测RTD延长的风险,使用各种性能指标将8种不同的ML算法与传统的逻辑回归进行了比较。数据被分成70%用于训练,30%用于测试:共纳入了 3152 例患者(1928 例长期 RT,1224 例非长期 RT)。总体而言,根据性能指标,随机森林(RF)与其他 ML 方法和传统的逻辑回归相比,能最准确地预测延长的 RTD:我们对各种ML技术的评估表明,RF在对有延长RTD风险的OPSCC患者进行分类方面优于传统的逻辑回归。应用这种算法有可能识别出高风险患者,并进行早期干预以提高生存率。
Machine Learning Methods in Classification of Prolonged Radiation Therapy in Oropharyngeal Cancer: National Cancer Database.
Objective: To investigate the accuracy of machine learning (ML) algorithms in stratifying risk of prolonged radiation treatment duration (RTD), defined as greater than 50 days, for patients with oropharyngeal squamous cell carcinoma (OPSCC).
Study design: Retrospective cohort study.
Setting: National Cancer Database (NCDB).
Methods: The NCDB was queried between 2004 to 2016 for patients with OPSCC treated with radiation therapy (RT) or chemoradiation as primary treatment. To predict risk of prolonged RTD, 8 different ML algorithms were compared against traditional logistic regression using various performance metrics. Data was split into a distribution of 70% for training and 30% for testing.
Results: A total of 3152 patients were included (1928 prolonged RT, 1224 not prolonged RT). As a whole, based on performance metrics, random forest (RF) was found to most accurately predict prolonged RTD compared to both other ML methods and traditional logistic regression.
Conclusion: Our assessment of various ML techniques showed that RF was superior to traditional logistic regression at classifying OPSCC patients at risk of prolonged RTD. Application of such algorithms may have potential to identify high risk patients and enable early interventions to improve survival.
期刊介绍:
Otolaryngology–Head and Neck Surgery (OTO-HNS) is the official peer-reviewed publication of the American Academy of Otolaryngology–Head and Neck Surgery Foundation. The mission of Otolaryngology–Head and Neck Surgery is to publish contemporary, ethical, clinically relevant information in otolaryngology, head and neck surgery (ear, nose, throat, head, and neck disorders) that can be used by otolaryngologists, clinicians, scientists, and specialists to improve patient care and public health.