基于数据挖掘系统LERS的黑色素瘤预测

J. P. Grzymala-Busse, J. Grzymala-Busse, Z. Hippe
{"title":"基于数据挖掘系统LERS的黑色素瘤预测","authors":"J. P. Grzymala-Busse, J. Grzymala-Busse, Z. Hippe","doi":"10.1109/CMPSAC.2001.960676","DOIUrl":null,"url":null,"abstract":"One of the important tools for early diagnosis of malignant melanoma is the total dermatoscopy score (TDS), computed using the ABCD (asymmetry, border, color, diameter) formula. Our primary objective was to check whether the ABCD formula is optimal. Using a data set containing 276 cases of melanoma and the LERS (Learning from Examples based on Rough Sets) data mining system, we checked more than 20,000 modified formulas for ABCD, computing the predicted error rate of melanoma diagnosis using 10-fold cross-validation for every modified formula. As a result, we found the optimal ABCD formula for our setup: discretization based on cluster analysis, the LEM2 (Learning from Examples Module, version 2) algorithm (one of the four LERS algorithms for rule induction) and the standard LERS classification scheme. The error rate for the standard ABCD formula was 10.21 %, while for the optimal ABCD formula the error rate was reduced to 6.04%. Some research in melanoma diagnosis shows that the use of the ABCD formula does not improve the error rate. Our research shows that the ABCD formula is useful, since, for our data set, the error rate without the use of the ABCD formula was higher (13.73%).","PeriodicalId":269568,"journal":{"name":"25th Annual International Computer Software and Applications Conference. COMPSAC 2001","volume":"3 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2001-10-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"48","resultStr":"{\"title\":\"Melanoma prediction using data mining system LERS\",\"authors\":\"J. P. Grzymala-Busse, J. Grzymala-Busse, Z. Hippe\",\"doi\":\"10.1109/CMPSAC.2001.960676\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"One of the important tools for early diagnosis of malignant melanoma is the total dermatoscopy score (TDS), computed using the ABCD (asymmetry, border, color, diameter) formula. Our primary objective was to check whether the ABCD formula is optimal. Using a data set containing 276 cases of melanoma and the LERS (Learning from Examples based on Rough Sets) data mining system, we checked more than 20,000 modified formulas for ABCD, computing the predicted error rate of melanoma diagnosis using 10-fold cross-validation for every modified formula. As a result, we found the optimal ABCD formula for our setup: discretization based on cluster analysis, the LEM2 (Learning from Examples Module, version 2) algorithm (one of the four LERS algorithms for rule induction) and the standard LERS classification scheme. The error rate for the standard ABCD formula was 10.21 %, while for the optimal ABCD formula the error rate was reduced to 6.04%. Some research in melanoma diagnosis shows that the use of the ABCD formula does not improve the error rate. Our research shows that the ABCD formula is useful, since, for our data set, the error rate without the use of the ABCD formula was higher (13.73%).\",\"PeriodicalId\":269568,\"journal\":{\"name\":\"25th Annual International Computer Software and Applications Conference. COMPSAC 2001\",\"volume\":\"3 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2001-10-08\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"48\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"25th Annual International Computer Software and Applications Conference. COMPSAC 2001\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CMPSAC.2001.960676\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"25th Annual International Computer Software and Applications Conference. COMPSAC 2001","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CMPSAC.2001.960676","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 48

摘要

早期诊断恶性黑色素瘤的重要工具之一是使用ABCD(不对称、边界、颜色、直径)公式计算的皮肤镜总评分(TDS)。我们的主要目标是检查ABCD公式是否是最优的。使用包含276例黑色素瘤病例的数据集和LERS(基于粗糙集的学习示例)数据挖掘系统,我们检查了20,000多个ABCD修改公式,对每个修改公式使用10倍交叉验证计算黑色素瘤诊断的预测错误率。因此,我们为我们的设置找到了最优的ABCD公式:基于聚类分析的离散化,LEM2 (Learning from Examples Module, version 2)算法(用于规则归纳的四种LERS算法之一)和标准的LERS分类方案。ABCD标准配方的误差率为10.21%,最佳ABCD配方的误差率为6.04%。一些黑色素瘤诊断的研究表明,使用ABCD公式并没有提高错误率。我们的研究表明,ABCD公式是有用的,因为对于我们的数据集,不使用ABCD公式的错误率更高(13.73%)。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Melanoma prediction using data mining system LERS
One of the important tools for early diagnosis of malignant melanoma is the total dermatoscopy score (TDS), computed using the ABCD (asymmetry, border, color, diameter) formula. Our primary objective was to check whether the ABCD formula is optimal. Using a data set containing 276 cases of melanoma and the LERS (Learning from Examples based on Rough Sets) data mining system, we checked more than 20,000 modified formulas for ABCD, computing the predicted error rate of melanoma diagnosis using 10-fold cross-validation for every modified formula. As a result, we found the optimal ABCD formula for our setup: discretization based on cluster analysis, the LEM2 (Learning from Examples Module, version 2) algorithm (one of the four LERS algorithms for rule induction) and the standard LERS classification scheme. The error rate for the standard ABCD formula was 10.21 %, while for the optimal ABCD formula the error rate was reduced to 6.04%. Some research in melanoma diagnosis shows that the use of the ABCD formula does not improve the error rate. Our research shows that the ABCD formula is useful, since, for our data set, the error rate without the use of the ABCD formula was higher (13.73%).
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Using SOAP to clean up configuration management An application of data warehouse technology to the measurement system for UML based artifacts InfoSleuth: agent-based system for data integration and analysis Formal and use-case driven requirement analysis in UML A long and winding road (Progress on the road to a software engineering profession)
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1