机器学习算法与过采样技术在前列腺癌放疗后尿毒性预测中的比较

E. Mylona, Clement Lebreton, P. Fontaine, S. Supiot, N. Magné, G. Créhange, R. Crevoisier, O. Acosta
{"title":"机器学习算法与过采样技术在前列腺癌放疗后尿毒性预测中的比较","authors":"E. Mylona, Clement Lebreton, P. Fontaine, S. Supiot, N. Magné, G. Créhange, R. Crevoisier, O. Acosta","doi":"10.1109/BIBE.2019.00180","DOIUrl":null,"url":null,"abstract":"Prostate cancer radiotherapy unavoidably involves the irradiation not only of the target volume, but also of healthy organs-at-risk, neighboring the prostate, likely causing adverse, toxicity-related side-effects. Specifically, in the case of urinary toxicity, these side effects might be associated with a variety of dosimetric, clinical and genetic factors, making its prediction particularly challenging. Given the inconsistency of available data concerning radiation-induced toxicity, it is crucial to develop robust models with superior predictive performance in order to perform tailored treatments. Machine Learning techniques emerge as appealing in this context, nevertheless without any consensus on the best algorithms to be used. This work proposes a comparison of several machine-learning strategies together with different minority class oversampling techniques for prediction of urinary toxicity following prostate cancer radiotherapy using dosimetric and clinical data. The performance of these classifiers was evaluated on the original dataset and using four different synthetic oversampling techniques. The area under the ROC curve (AUC) and the F-measure were employed to evaluate their performance. Results suggest that, regardless of the technique, oversampling always increases the prediction performance of the models (p=0.004). Overall, oversampling with Synthetic Minority Oversampling Technique (SMOTE) followed by Edited Nearest Neighbour algorithm (ENN) together with Regularized Discriminant Analysis (RDA) classifier provide the best performance (AUC=0.71).","PeriodicalId":318819,"journal":{"name":"2019 IEEE 19th International Conference on Bioinformatics and Bioengineering (BIBE)","volume":"26 2 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"Comparison of Machine Learning Algorithms and Oversampling Techniques for Urinary Toxicity Prediction After Prostate Cancer Radiotherapy\",\"authors\":\"E. Mylona, Clement Lebreton, P. Fontaine, S. Supiot, N. Magné, G. Créhange, R. Crevoisier, O. Acosta\",\"doi\":\"10.1109/BIBE.2019.00180\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Prostate cancer radiotherapy unavoidably involves the irradiation not only of the target volume, but also of healthy organs-at-risk, neighboring the prostate, likely causing adverse, toxicity-related side-effects. Specifically, in the case of urinary toxicity, these side effects might be associated with a variety of dosimetric, clinical and genetic factors, making its prediction particularly challenging. Given the inconsistency of available data concerning radiation-induced toxicity, it is crucial to develop robust models with superior predictive performance in order to perform tailored treatments. Machine Learning techniques emerge as appealing in this context, nevertheless without any consensus on the best algorithms to be used. This work proposes a comparison of several machine-learning strategies together with different minority class oversampling techniques for prediction of urinary toxicity following prostate cancer radiotherapy using dosimetric and clinical data. The performance of these classifiers was evaluated on the original dataset and using four different synthetic oversampling techniques. The area under the ROC curve (AUC) and the F-measure were employed to evaluate their performance. Results suggest that, regardless of the technique, oversampling always increases the prediction performance of the models (p=0.004). Overall, oversampling with Synthetic Minority Oversampling Technique (SMOTE) followed by Edited Nearest Neighbour algorithm (ENN) together with Regularized Discriminant Analysis (RDA) classifier provide the best performance (AUC=0.71).\",\"PeriodicalId\":318819,\"journal\":{\"name\":\"2019 IEEE 19th International Conference on Bioinformatics and Bioengineering (BIBE)\",\"volume\":\"26 2 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-10-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2019 IEEE 19th International Conference on Bioinformatics and Bioengineering (BIBE)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/BIBE.2019.00180\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 IEEE 19th International Conference on Bioinformatics and Bioengineering (BIBE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/BIBE.2019.00180","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3

摘要

前列腺癌放疗不可避免地不仅涉及靶体积的照射,还涉及邻近前列腺的健康危险器官的照射,可能引起不良的毒性相关副作用。具体来说,在尿毒性的情况下,这些副作用可能与各种剂量学、临床和遗传因素有关,这使得其预测特别具有挑战性。鉴于有关辐射毒性的现有数据不一致,开发具有卓越预测性能的稳健模型以实施量身定制的治疗至关重要。在这种背景下,机器学习技术显得很有吸引力,然而,对于使用的最佳算法没有达成任何共识。这项工作提出了几种机器学习策略的比较,以及使用剂量学和临床数据预测前列腺癌放疗后尿毒性的不同少数类过采样技术。这些分类器的性能在原始数据集上进行了评估,并使用了四种不同的合成过采样技术。采用ROC曲线下面积(AUC)和f值来评价其疗效。结果表明,无论采用何种技术,过采样总是能提高模型的预测性能(p=0.004)。总体而言,使用合成少数过采样技术(SMOTE)进行过采样,然后使用编辑近邻算法(ENN)和正则化判别分析(RDA)分类器进行过采样,提供了最佳性能(AUC=0.71)。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Comparison of Machine Learning Algorithms and Oversampling Techniques for Urinary Toxicity Prediction After Prostate Cancer Radiotherapy
Prostate cancer radiotherapy unavoidably involves the irradiation not only of the target volume, but also of healthy organs-at-risk, neighboring the prostate, likely causing adverse, toxicity-related side-effects. Specifically, in the case of urinary toxicity, these side effects might be associated with a variety of dosimetric, clinical and genetic factors, making its prediction particularly challenging. Given the inconsistency of available data concerning radiation-induced toxicity, it is crucial to develop robust models with superior predictive performance in order to perform tailored treatments. Machine Learning techniques emerge as appealing in this context, nevertheless without any consensus on the best algorithms to be used. This work proposes a comparison of several machine-learning strategies together with different minority class oversampling techniques for prediction of urinary toxicity following prostate cancer radiotherapy using dosimetric and clinical data. The performance of these classifiers was evaluated on the original dataset and using four different synthetic oversampling techniques. The area under the ROC curve (AUC) and the F-measure were employed to evaluate their performance. Results suggest that, regardless of the technique, oversampling always increases the prediction performance of the models (p=0.004). Overall, oversampling with Synthetic Minority Oversampling Technique (SMOTE) followed by Edited Nearest Neighbour algorithm (ENN) together with Regularized Discriminant Analysis (RDA) classifier provide the best performance (AUC=0.71).
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Stability Investigation Using Hydrogen Bonds for Different Mutations and Drug Resistance in Non-Small Cell Lung Cancer Patients A Temporal Convolution Network Solution for EEG Motor Imagery Classification Evaluation of a Serious Game Promoting Nutrition and Food Literacy: Experiment Design and Preliminary Results Towards a Robust and Accurate Screening Tool for Dyslexia with Data Augmentation using GANs Exploring Fibrotic Disease Networks to Identify Common Molecular Mechanisms with IPF
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1