The More, the Better? Evaluating the Role of EEG Preprocessing for Deep Learning Applications

IF 4.8 2区医学 Q2 ENGINEERING, BIOMEDICAL IEEE Transactions on Neural Systems and Rehabilitation Engineering Pub Date : 2025-03-03 DOI:10.1109/TNSRE.2025.3547616

Federico Del Pup;Andrea Zanola;Louis Fabrice Tshimanga;Alessandra Bertoldo;Manfredo Atzori

{"title":"The More, the Better? Evaluating the Role of EEG Preprocessing for Deep Learning Applications","authors":"Federico Del Pup;Andrea Zanola;Louis Fabrice Tshimanga;Alessandra Bertoldo;Manfredo Atzori","doi":"10.1109/TNSRE.2025.3547616","DOIUrl":null,"url":null,"abstract":"The last decade has witnessed a notable surge in deep learning applications for electroencephalography (EEG) data analysis, showing promising improvements over conventional statistical techniques. However, deep learning models can underperform if trained with bad processed data. Preprocessing is crucial for EEG data analysis, yet there is no consensus on the optimal strategies in deep learning scenarios, leading to uncertainty about the extent of preprocessing required for optimal results. This study is the first to thoroughly investigate the effects of EEG preprocessing in deep learning applications, drafting guidelines for future research. It evaluates the effects of varying preprocessing levels, from raw and minimally filtered data to complex pipelines with automated artifact removal algorithms. Six classification tasks (eye blinking, motor imagery, Parkinson’s, Alzheimer’s disease, sleep deprivation, and first episode psychosis) and four established EEG architectures were considered for the evaluation. The analysis of 4800 trained models revealed statistical differences between preprocessing pipelines at the intra-task level for each model and at the inter-task level for the largest model. Models trained on raw data consistently performed poorly, always ranking last in average scores. In addition, models seem to benefit more from minimal pipelines without artifact handling methods. These findings suggest that EEG artifacts may affect the performance and generalizability of deep neural networks.","PeriodicalId":13419,"journal":{"name":"IEEE Transactions on Neural Systems and Rehabilitation Engineering","volume":"33 ","pages":"1061-1070"},"PeriodicalIF":4.8000,"publicationDate":"2025-03-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10909332","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Neural Systems and Rehabilitation Engineering","FirstCategoryId":"5","ListUrlMain":"https://ieeexplore.ieee.org/document/10909332/","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENGINEERING, BIOMEDICAL","Score":null,"Total":0}

引用次数: 0

Abstract

The last decade has witnessed a notable surge in deep learning applications for electroencephalography (EEG) data analysis, showing promising improvements over conventional statistical techniques. However, deep learning models can underperform if trained with bad processed data. Preprocessing is crucial for EEG data analysis, yet there is no consensus on the optimal strategies in deep learning scenarios, leading to uncertainty about the extent of preprocessing required for optimal results. This study is the first to thoroughly investigate the effects of EEG preprocessing in deep learning applications, drafting guidelines for future research. It evaluates the effects of varying preprocessing levels, from raw and minimally filtered data to complex pipelines with automated artifact removal algorithms. Six classification tasks (eye blinking, motor imagery, Parkinson’s, Alzheimer’s disease, sleep deprivation, and first episode psychosis) and four established EEG architectures were considered for the evaluation. The analysis of 4800 trained models revealed statistical differences between preprocessing pipelines at the intra-task level for each model and at the inter-task level for the largest model. Models trained on raw data consistently performed poorly, always ranking last in average scores. In addition, models seem to benefit more from minimal pipelines without artifact handling methods. These findings suggest that EEG artifacts may affect the performance and generalizability of deep neural networks.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

过去十年间，深度学习在脑电图（EEG）数据分析方面的应用明显激增，与传统统计技术相比取得了可喜的进步。然而，如果使用处理不当的数据进行训练，深度学习模型可能会表现不佳。预处理对于脑电图数据分析至关重要，但对于深度学习场景中的最佳策略还没有达成共识，这导致对获得最佳结果所需的预处理程度存在不确定性。本研究首次深入研究了深度学习应用中脑电图预处理的效果，为未来研究起草了指导原则。它评估了不同预处理水平的效果，从原始数据和最小过滤数据到带有自动人工痕迹去除算法的复杂管道。评估考虑了六种分类任务（眨眼、运动图像、帕金森病、阿尔茨海默病、睡眠剥夺和首次发作的精神病）和四种成熟的脑电图架构。对 4800 个训练有素的模型进行的分析表明，预处理管道在每个模型的任务内级别和最大模型的任务间级别上存在统计差异。在原始数据上训练的模型一直表现不佳，平均得分总是排在最后。此外，模型似乎更受益于没有人工痕迹处理方法的最小管道。这些发现表明，脑电图伪影可能会影响深度神经网络的性能和泛化能力。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

IEEE Transactions on Neural Systems and Rehabilitation Engineering 医学-工程：生物医学

CiteScore

8.60

自引率

8.20%

发文量

479

审稿时长

6-12 weeks

期刊介绍： Rehabilitative and neural aspects of biomedical engineering, including functional electrical stimulation, acoustic dynamics, human performance measurement and analysis, nerve stimulation, electromyography, motor control and stimulation; and hardware and software applications for rehabilitation engineering and assistive devices.