Boosting medical diagnostics with a novel gradient-based sample selection method

IF 7 2区 医学 Q1 BIOLOGY Computers in biology and medicine Pub Date : 2024-09-24 DOI:10.1016/j.compbiomed.2024.109165
{"title":"Boosting medical diagnostics with a novel gradient-based sample selection method","authors":"","doi":"10.1016/j.compbiomed.2024.109165","DOIUrl":null,"url":null,"abstract":"<div><div>In the rapidly expanding landscape of medical data, the need for innovative approaches to maximize classification performance has become increasingly critical. As data volumes grow, ensuring that diagnostic systems work with accurate and relevant data is paramount for effective and generalizable classification. This study introduces a novel gradient-based sample selection method, the first of its kind in the literature, specifically designed to enhance classification accuracy by removing redundant and non-informative data. Unlike traditional methods that focus solely on feature selection, this approach integrates an advanced sample selection technique to optimize the input data, leading to more accurate and efficient diagnostics. The method is validated on multiple disease datasets, including the Wisconsin Diagnostic Breast Cancer (WDBC) dataset and the Cleveland Coronary Artery Disease (CAD) dataset, demonstrating its broad applicability and effectiveness. To address dataset imbalance, the Adaptive Synthetic Sampling (ADASYN) method is employed, followed by Particle Swarm Optimization (PSO) for feature selection. The refined datasets are then classified using a Support Vector Machine (SVM), showing that even traditional classifiers can achieve substantial improvements when enhanced with advanced sample selection. The results underscore the critical importance of precise sample selection in boosting classification performance, setting a new standard for computer-aided diagnostics and paving the way for future innovations in handling large and complex medical datasets.</div></div>","PeriodicalId":10578,"journal":{"name":"Computers in biology and medicine","volume":null,"pages":null},"PeriodicalIF":7.0000,"publicationDate":"2024-09-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computers in biology and medicine","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0010482524012502","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BIOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

In the rapidly expanding landscape of medical data, the need for innovative approaches to maximize classification performance has become increasingly critical. As data volumes grow, ensuring that diagnostic systems work with accurate and relevant data is paramount for effective and generalizable classification. This study introduces a novel gradient-based sample selection method, the first of its kind in the literature, specifically designed to enhance classification accuracy by removing redundant and non-informative data. Unlike traditional methods that focus solely on feature selection, this approach integrates an advanced sample selection technique to optimize the input data, leading to more accurate and efficient diagnostics. The method is validated on multiple disease datasets, including the Wisconsin Diagnostic Breast Cancer (WDBC) dataset and the Cleveland Coronary Artery Disease (CAD) dataset, demonstrating its broad applicability and effectiveness. To address dataset imbalance, the Adaptive Synthetic Sampling (ADASYN) method is employed, followed by Particle Swarm Optimization (PSO) for feature selection. The refined datasets are then classified using a Support Vector Machine (SVM), showing that even traditional classifiers can achieve substantial improvements when enhanced with advanced sample selection. The results underscore the critical importance of precise sample selection in boosting classification performance, setting a new standard for computer-aided diagnostics and paving the way for future innovations in handling large and complex medical datasets.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
利用基于梯度的新型样本选择方法提高医疗诊断水平
在医疗数据迅速增长的形势下,采用创新方法最大限度地提高分类性能的需求变得越来越迫切。随着数据量的增长,确保诊断系统使用准确、相关的数据对于有效、可推广的分类至关重要。本研究介绍了一种新颖的基于梯度的样本选择方法,这在文献中尚属首次,专门用于通过去除冗余和非信息数据来提高分类准确性。与只关注特征选择的传统方法不同,这种方法整合了先进的样本选择技术,以优化输入数据,从而提高诊断的准确性和效率。该方法在多个疾病数据集上进行了验证,包括威斯康星诊断乳腺癌(WDBC)数据集和克利夫兰冠状动脉疾病(CAD)数据集,证明了其广泛的适用性和有效性。为解决数据集的不平衡问题,采用了自适应合成采样(ADASYN)方法,然后用粒子群优化(PSO)方法进行特征选择。然后使用支持向量机(SVM)对改进后的数据集进行分类,结果表明,即使是传统的分类器,在使用先进的样本选择技术后也能实现大幅改进。这些结果强调了精确样本选择在提高分类性能方面的极端重要性,为计算机辅助诊断设定了新标准,并为未来处理大型复杂医疗数据集的创新铺平了道路。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Computers in biology and medicine
Computers in biology and medicine 工程技术-工程:生物医学
CiteScore
11.70
自引率
10.40%
发文量
1086
审稿时长
74 days
期刊介绍: Computers in Biology and Medicine is an international forum for sharing groundbreaking advancements in the use of computers in bioscience and medicine. This journal serves as a medium for communicating essential research, instruction, ideas, and information regarding the rapidly evolving field of computer applications in these domains. By encouraging the exchange of knowledge, we aim to facilitate progress and innovation in the utilization of computers in biology and medicine.
期刊最新文献
Lightweight medical image segmentation network with multi-scale feature-guided fusion. Shuffled ECA-Net for stress detection from multimodal wearable sensor data. Stacking based ensemble learning framework for identification of nitrotyrosine sites. Two-stage deep learning framework for occlusal crown depth image generation. A joint analysis proposal of nonlinear longitudinal and time-to-event right-, interval-censored data for modeling pregnancy miscarriage.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1