首页 > 最新文献

Brain Informatics最新文献

英文 中文
Advancing Alzheimer's disease detection: a novel convolutional neural network based framework leveraging EEG data and segment length analysis. 推进阿尔茨海默病检测:利用脑电图数据和片段长度分析的新型卷积神经网络框架。
Q1 Computer Science Pub Date : 2025-06-04 DOI: 10.1186/s40708-025-00260-3
Md Nurul Ahad Tawhid, Siuly Siuly, Enamul Kabir, Yan Li

Alzheimer's disease (AD) is a progressive neurodegenerative disorder that primarily affects memory, thinking, and behavior, leading to dementia, a severe cognitive decline. While no cure currently exists, recent advancements in preventive drug trials and therapeutic management have increased interest in developing clinical algorithms for early detection and biomarker identification. Electroencephalography (EEG) is noninvasive, cost-effective, and has high temporal resolution, making it a promising tool for automated AD detection. However, conventional machine learning approaches often fall short in accurately detecting AD due to their limited architectures. We also need to investigate the impact of EEG signal segment length on classification accuracy. To address these issues, a deep learning-based framework is proposed to detect AD using EEG data, focusing on determining the optimal segment length for classification. This framework contains EEG data collection, pre-processing for noise removal, temporal segmentation, convolutional neural network (CNN) model training and classification, and finally, evaluation. We have tested different segment lengths to test the impact on AD detection. We have used both 10-fold and leave-one-out cross-validation techniques and obtained accuracy of 97.08% and 96.90%, respectively, on a publicly available dataset from AHEPA General University Hospital of Thessaloniki. We have also tested the generalizability of the proposed model by testing it to detect frontotemporal dementia and obtained better results than existing studies. Furthermore, we have validated our proposed CNN model using several ablation studies and layer-wise extracted feature visualization. This study will establish a pioneering direction for future researchers and technology experts in the field of neurodiseases.

阿尔茨海默病(AD)是一种进行性神经退行性疾病,主要影响记忆、思维和行为,导致痴呆,一种严重的认知能力下降。虽然目前还没有治愈方法,但预防性药物试验和治疗管理的最新进展增加了人们对开发早期检测和生物标志物识别的临床算法的兴趣。脑电图(EEG)具有无创、低成本和高时间分辨率的特点,是一种很有前途的自动化AD检测工具。然而,传统的机器学习方法由于其有限的架构,往往无法准确检测AD。我们还需要研究脑电信号片段长度对分类精度的影响。为了解决这些问题,提出了一种基于深度学习的框架,利用脑电图数据检测AD,重点是确定用于分类的最佳片段长度。该框架包括脑电数据采集、去噪预处理、时间分割、卷积神经网络(CNN)模型训练和分类,最后进行评估。我们测试了不同的段长度来测试对AD检测的影响。我们使用了10倍交叉验证技术和留一交叉验证技术,在塞萨洛尼基AHEPA综合大学医院的公开数据集上分别获得了97.08%和96.90%的准确率。我们还通过测试其检测额颞叶痴呆来测试所提出模型的泛化性,并获得比现有研究更好的结果。此外,我们还使用几个消融研究和分层提取的特征可视化验证了我们提出的CNN模型。这项研究将为未来神经疾病领域的研究人员和技术专家建立一个开创性的方向。
{"title":"Advancing Alzheimer's disease detection: a novel convolutional neural network based framework leveraging EEG data and segment length analysis.","authors":"Md Nurul Ahad Tawhid, Siuly Siuly, Enamul Kabir, Yan Li","doi":"10.1186/s40708-025-00260-3","DOIUrl":"10.1186/s40708-025-00260-3","url":null,"abstract":"<p><p>Alzheimer's disease (AD) is a progressive neurodegenerative disorder that primarily affects memory, thinking, and behavior, leading to dementia, a severe cognitive decline. While no cure currently exists, recent advancements in preventive drug trials and therapeutic management have increased interest in developing clinical algorithms for early detection and biomarker identification. Electroencephalography (EEG) is noninvasive, cost-effective, and has high temporal resolution, making it a promising tool for automated AD detection. However, conventional machine learning approaches often fall short in accurately detecting AD due to their limited architectures. We also need to investigate the impact of EEG signal segment length on classification accuracy. To address these issues, a deep learning-based framework is proposed to detect AD using EEG data, focusing on determining the optimal segment length for classification. This framework contains EEG data collection, pre-processing for noise removal, temporal segmentation, convolutional neural network (CNN) model training and classification, and finally, evaluation. We have tested different segment lengths to test the impact on AD detection. We have used both 10-fold and leave-one-out cross-validation techniques and obtained accuracy of 97.08% and 96.90%, respectively, on a publicly available dataset from AHEPA General University Hospital of Thessaloniki. We have also tested the generalizability of the proposed model by testing it to detect frontotemporal dementia and obtained better results than existing studies. Furthermore, we have validated our proposed CNN model using several ablation studies and layer-wise extracted feature visualization. This study will establish a pioneering direction for future researchers and technology experts in the field of neurodiseases.</p>","PeriodicalId":37465,"journal":{"name":"Brain Informatics","volume":"12 1","pages":"13"},"PeriodicalIF":0.0,"publicationDate":"2025-06-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12137833/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144217156","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Treatment journey clustering with a novel kernel k-means machine learning algorithm: a retrospective analysis of insurance claims in bipolar I disorder. 一种新型核k-均值机器学习算法的治疗过程聚类:双相I型障碍保险索赔的回顾性分析。
Q1 Computer Science Pub Date : 2025-05-22 DOI: 10.1186/s40708-025-00258-x
Matthew Littman, Huy-Binh Nguyen, Joanna Campbell, Katelyn R Keyloun

In real-world psychiatric practice, patients may experience complex treatment journeys, including various diagnoses and lines of therapy. Insurance claims databases could potentially provide insight into outcomes of psychiatric treatment processes, but the diversity of event sequences restricts analyses with currently available methods. Here, we developed a novel kernel k-means clustering algorithm for event sequences that can accommodate highly diverse event types and sequence lengths. The approach, Divisive Optimized Clustering using Kernel K-means for Event Sequences (DOCKKES), also leverages a novel performance metric, the transition score, which measures sequence coherence in individual clusters. The performance of DOCKKES was evaluated in the context of bipolar I disorder, which is characterized by heterogeneous treatment journeys. We conducted a retrospective, observational analysis of a large sample (n = 31,578) of patients with bipolar I disorder from the MarketScan® Commercial Database. Using insurance claims, bipolar episode diagnoses and mental health-related lines of therapy were identified as events of interest for patient clustering. The dataset included 202,122 events; 75% of the cohort experienced unique treatment journeys. Based on an optimal run, DOCKKES identified 16 treatment journey clusters, which were evenly split for initial manic/mixed or depressive episodes (8 clusters each) and varied in sequence length and early lines of therapy. Variability across clusters was also observed for demographics, comorbidities, and mental health-related healthcare resource utilization and cost. This proof-of-concept study demonstrated the use of DOCKKES for integrating information from large datasets, enabling comparisons between patient clusters and evaluation of real-world treatment journeys in the context of evidence-based guidelines.

在现实世界的精神病学实践中,患者可能会经历复杂的治疗过程,包括各种诊断和治疗方法。保险索赔数据库可以潜在地提供对精神病治疗过程结果的洞察,但是事件序列的多样性限制了当前可用方法的分析。在这里,我们为事件序列开发了一种新的核k-均值聚类算法,该算法可以适应高度不同的事件类型和序列长度。该方法,分裂优化聚类使用核k -均值事件序列(DOCKKES),也利用了一种新的性能指标,过渡分数,衡量序列一致性在单个集群。DOCKKES的表现在双相I型障碍的背景下进行了评估,其特点是异质性的治疗过程。我们对来自MarketScan®商业数据库的大样本(n = 31,578)双相I型障碍患者进行了回顾性观察性分析。使用保险索赔,双相情感障碍发作诊断和精神健康相关的治疗线被确定为患者聚类感兴趣的事件。该数据集包括202,122个事件;75%的队列经历了独特的治疗旅程。基于最佳运行,DOCKKES确定了16个治疗旅程集群,这些集群平均分配给初始躁狂/混合性或抑郁发作(每个8个集群),并且序列长度和早期治疗线不同。在人口统计学、合并症和精神健康相关的医疗资源利用和成本方面,也观察到不同集群之间的差异。这项概念验证研究展示了DOCKKES用于整合来自大型数据集的信息,能够在循证指南的背景下比较患者群和评估现实世界的治疗过程。
{"title":"Treatment journey clustering with a novel kernel k-means machine learning algorithm: a retrospective analysis of insurance claims in bipolar I disorder.","authors":"Matthew Littman, Huy-Binh Nguyen, Joanna Campbell, Katelyn R Keyloun","doi":"10.1186/s40708-025-00258-x","DOIUrl":"10.1186/s40708-025-00258-x","url":null,"abstract":"<p><p>In real-world psychiatric practice, patients may experience complex treatment journeys, including various diagnoses and lines of therapy. Insurance claims databases could potentially provide insight into outcomes of psychiatric treatment processes, but the diversity of event sequences restricts analyses with currently available methods. Here, we developed a novel kernel k-means clustering algorithm for event sequences that can accommodate highly diverse event types and sequence lengths. The approach, Divisive Optimized Clustering using Kernel K-means for Event Sequences (DOCKKES), also leverages a novel performance metric, the transition score, which measures sequence coherence in individual clusters. The performance of DOCKKES was evaluated in the context of bipolar I disorder, which is characterized by heterogeneous treatment journeys. We conducted a retrospective, observational analysis of a large sample (n = 31,578) of patients with bipolar I disorder from the MarketScan® Commercial Database. Using insurance claims, bipolar episode diagnoses and mental health-related lines of therapy were identified as events of interest for patient clustering. The dataset included 202,122 events; 75% of the cohort experienced unique treatment journeys. Based on an optimal run, DOCKKES identified 16 treatment journey clusters, which were evenly split for initial manic/mixed or depressive episodes (8 clusters each) and varied in sequence length and early lines of therapy. Variability across clusters was also observed for demographics, comorbidities, and mental health-related healthcare resource utilization and cost. This proof-of-concept study demonstrated the use of DOCKKES for integrating information from large datasets, enabling comparisons between patient clusters and evaluation of real-world treatment journeys in the context of evidence-based guidelines.</p>","PeriodicalId":37465,"journal":{"name":"Brain Informatics","volume":"12 1","pages":"12"},"PeriodicalIF":0.0,"publicationDate":"2025-05-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12098244/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144120486","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
HoRNS-CNN model: an energy-efficient fully homomorphic residue number system convolutional neural network model for privacy-preserving classification of dyslexia neural-biomarkers. 霍恩- cnn模型:一种节能的全同态残数系统卷积神经网络模型,用于失读症神经生物标记物的隐私保护分类。
Q1 Computer Science Pub Date : 2025-04-30 DOI: 10.1186/s40708-025-00256-z
Opeyemi Lateef Usman, Ravie Chandren Muniyandi, Khairuddin Omar, Mazlyfarina Mohamad, Ayoade Akeem Owoade, Morufat Adebola Kareem

Recent advancements in cloud-based machine learning (ML) now allow for the rapid and remote identification of neural-biomarkers associated with common neuro-developmental disorders from neuroimaging datasets. Due to the sensitive nature of these datasets, secure deep learning (DL) algorithms are essential. Although, fully homomorphic encryption (FHE)-based methods have been proposed to maintain data confidentiality and privacy, however, existing FHE deep convolutional neural network (CNN) models still face some issues such as low accuracy, high encryption/decryption latency, energy inefficiency, long feature extraction times, and significant cipher-image expansion. To address these issues, this study introduces the HoRNS-CNN model, which integrates the energy-efficient features of the residue number system FHE scheme (RNS-FHE scheme) with the high accuracy of pre-trained deep CNN models in the cloud for efficient, privacy-preserving predictions and provide some proofs of its energy efficiency and homomorphism. The RNS-FHE scheme's FPGA implementation includes embedded RNS pixel-bitstream homomorphic encoder/decoder circuits for encrypting 8-bit grayscale pixels, with cloud CNN models performing remote classification on the encrypted images. In the HoRNS-CNN architecture, the ReLU activation functions of deep CNNs were initially trained for stability and later adapted for homomorphic computations using a Taylor polynomial approximation of degree 3 and batch normalization to achieve high accuracy. The findings show that the HoRNS-CNN model effectively manages cipher-image expansion with an asymptotic complexity of O n 3 , offering better performance and faster feature extraction compared to its peers. The model can predict 400,000 neural-biomarker features in one hour, providing an effective tool for analyzing neuroimages while ensuring privacy and security.

基于云的机器学习(ML)的最新进展现在允许从神经成像数据集中快速和远程识别与常见神经发育障碍相关的神经生物标志物。由于这些数据集的敏感性,安全的深度学习(DL)算法至关重要。虽然已经提出了基于完全同态加密(FHE)的方法来保持数据的机密性和隐私性,但现有的FHE深度卷积神经网络(CNN)模型仍然存在准确率低、加解密延迟高、能量低、特征提取时间长、密码图像扩展大等问题。为了解决这些问题,本研究引入了HoRNS-CNN模型,该模型将残数系统FHE方案(RNS-FHE方案)的高能效特征与云端预训练深度CNN模型的高精度相结合,以实现高效、隐私保护的预测,并提供了一些能效和同态的证明。RNS- fhe方案的FPGA实现包括嵌入式RNS像素-比特流同态编码器/解码器电路,用于加密8位灰度像素,云CNN模型对加密图像进行远程分类。在HoRNS-CNN架构中,深度cnn的ReLU激活函数首先进行稳定性训练,然后使用3度的泰勒多项式近似和批处理归一化进行同态计算,以实现高精度。研究结果表明,HoRNS-CNN模型有效地管理了密码图像扩展,其渐近复杂度为O n 3,与同类模型相比,具有更好的性能和更快的特征提取速度。该模型可以在一小时内预测40万个神经生物标志物特征,为分析神经图像提供了有效的工具,同时确保了隐私和安全。
{"title":"HoRNS-CNN model: an energy-efficient fully homomorphic residue number system convolutional neural network model for privacy-preserving classification of dyslexia neural-biomarkers.","authors":"Opeyemi Lateef Usman, Ravie Chandren Muniyandi, Khairuddin Omar, Mazlyfarina Mohamad, Ayoade Akeem Owoade, Morufat Adebola Kareem","doi":"10.1186/s40708-025-00256-z","DOIUrl":"https://doi.org/10.1186/s40708-025-00256-z","url":null,"abstract":"<p><p>Recent advancements in cloud-based machine learning (ML) now allow for the rapid and remote identification of neural-biomarkers associated with common neuro-developmental disorders from neuroimaging datasets. Due to the sensitive nature of these datasets, secure deep learning (DL) algorithms are essential. Although, fully homomorphic encryption (FHE)-based methods have been proposed to maintain data confidentiality and privacy, however, existing FHE deep convolutional neural network (CNN) models still face some issues such as low accuracy, high encryption/decryption latency, energy inefficiency, long feature extraction times, and significant cipher-image expansion. To address these issues, this study introduces the HoRNS-CNN model, which integrates the energy-efficient features of the residue number system FHE scheme (RNS-FHE scheme) with the high accuracy of pre-trained deep CNN models in the cloud for efficient, privacy-preserving predictions and provide some proofs of its energy efficiency and homomorphism. The RNS-FHE scheme's FPGA implementation includes embedded RNS pixel-bitstream homomorphic encoder/decoder circuits for encrypting 8-bit grayscale pixels, with cloud CNN models performing remote classification on the encrypted images. In the HoRNS-CNN architecture, the ReLU activation functions of deep CNNs were initially trained for stability and later adapted for homomorphic computations using a Taylor polynomial approximation of degree 3 and batch normalization to achieve high accuracy. The findings show that the HoRNS-CNN model effectively manages cipher-image expansion with an asymptotic complexity of <math><mrow><mi>O</mi> <mfenced> <msup><mrow><mi>n</mi></mrow> <mn>3</mn></msup> </mfenced> </mrow> </math> , offering better performance and faster feature extraction compared to its peers. The model can predict 400,000 neural-biomarker features in one hour, providing an effective tool for analyzing neuroimages while ensuring privacy and security.</p>","PeriodicalId":37465,"journal":{"name":"Brain Informatics","volume":"12 1","pages":"11"},"PeriodicalIF":0.0,"publicationDate":"2025-04-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12044152/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144056981","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Explainable CNN for brain tumor detection and classification through XAI based key features identification. 基于XAI关键特征识别的可解释CNN脑肿瘤检测与分类。
Q1 Computer Science Pub Date : 2025-04-30 DOI: 10.1186/s40708-025-00257-y
Shagufta Iftikhar, Nadeem Anjum, Abdul Basit Siddiqui, Masood Ur Rehman, Naeem Ramzan

Despite significant advancements in brain tumor classification, many existing models suffer from complex structures that make them difficult to interpret. This complexity can hinder the transparency of the decision-making process, causing models to rely on irrelevant features or normal soft tissues. Besides, these models often include additional layers and parameters, which further complicate the classification process. Our work addresses these limitations by introducing a novel methodology that combines Explainable AI (XAI) techniques with a Convolutional Neural Network (CNN) architecture. The major contribution of this paper is ensuring that the model focuses on the most relevant features for tumor detection and classification, while simultaneously reducing complexity, by minimizing the number of layers. This approach enhances the model's transparency and robustness, giving clear insights into its decision-making process through XAI techniques such as Gradient-weighted Class Activation Mapping (Grad-Cam), Shapley Additive explanations (Shap), and Local Interpretable Model-agnostic Explanations (LIME). Additionally, the approach demonstrates better performance, achieving 99% accuracy on seen data and 95% on unseen data, highlighting its generalizability and reliability. This balance of simplicity, interpretability, and high accuracy represents a significant advancement in the classification of brain tumor.

尽管在脑肿瘤分类方面取得了重大进展,但许多现有模型的结构复杂,难以解释。这种复杂性会阻碍决策过程的透明度,导致模型依赖于不相关的特征或正常的软组织。此外,这些模型通常包含额外的层和参数,这进一步使分类过程复杂化。我们的工作通过引入一种将可解释人工智能(XAI)技术与卷积神经网络(CNN)架构相结合的新方法来解决这些限制。本文的主要贡献是确保模型专注于肿瘤检测和分类的最相关特征,同时通过最小化层数来降低复杂性。这种方法增强了模型的透明度和鲁棒性,通过XAI技术,如梯度加权类激活映射(gradcam)、Shapley加性解释(Shap)和局部可解释的模型不可知解释(LIME),对其决策过程提供了清晰的见解。此外,该方法还展示了更好的性能,对可见数据的准确率达到99%,对未见数据的准确率达到95%,突出了其通用性和可靠性。这种简单性、可解释性和高精度的平衡代表了脑肿瘤分类的重大进步。
{"title":"Explainable CNN for brain tumor detection and classification through XAI based key features identification.","authors":"Shagufta Iftikhar, Nadeem Anjum, Abdul Basit Siddiqui, Masood Ur Rehman, Naeem Ramzan","doi":"10.1186/s40708-025-00257-y","DOIUrl":"https://doi.org/10.1186/s40708-025-00257-y","url":null,"abstract":"<p><p>Despite significant advancements in brain tumor classification, many existing models suffer from complex structures that make them difficult to interpret. This complexity can hinder the transparency of the decision-making process, causing models to rely on irrelevant features or normal soft tissues. Besides, these models often include additional layers and parameters, which further complicate the classification process. Our work addresses these limitations by introducing a novel methodology that combines Explainable AI (XAI) techniques with a Convolutional Neural Network (CNN) architecture. The major contribution of this paper is ensuring that the model focuses on the most relevant features for tumor detection and classification, while simultaneously reducing complexity, by minimizing the number of layers. This approach enhances the model's transparency and robustness, giving clear insights into its decision-making process through XAI techniques such as Gradient-weighted Class Activation Mapping (Grad-Cam), Shapley Additive explanations (Shap), and Local Interpretable Model-agnostic Explanations (LIME). Additionally, the approach demonstrates better performance, achieving 99% accuracy on seen data and 95% on unseen data, highlighting its generalizability and reliability. This balance of simplicity, interpretability, and high accuracy represents a significant advancement in the classification of brain tumor.</p>","PeriodicalId":37465,"journal":{"name":"Brain Informatics","volume":"12 1","pages":"10"},"PeriodicalIF":0.0,"publicationDate":"2025-04-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12044100/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144037051","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Breakdown of the compositional data approach in psychometric Likert scale big data analysis: about the loss of statistical power of two-sample t-tests applied to heavy-tailed big data. 心理测量学Likert量表大数据分析中成分数据方法的分解:关于双样本t检验应用于重尾大数据的统计效力的丧失
Q1 Computer Science Pub Date : 2025-04-07 DOI: 10.1186/s40708-025-00253-2
René Lehmann, Bodo Vogt

Bipolar psychometric scale data play a crucial role in psychological healthcare and health economics, such as in psychotherapeutic profiling and setting standards. Creating an accurate psychological profile not only benefits the patient but also saves time and costs. The quality of psychotherapeutic measures directly impacts grant funding decisions, influencing managerial choices. Moreover, the accuracy of consumer data analyses affects costs, profits, and the long-term sustainability of decisions. Considering psychometric bipolar scale data as compositional data can enhance the statistical power of well-known paired and unpaired two-sample t-tests, supporting managerial decision-making and the development or implementation of health interventions. This increase in statistical power is observed when the central limit theorem (CLT) holds true in statistics. Through stochastic simulation, this study explores the impact of violating the CLT on statistical power of the unpaired t-test under heavy-tailed data generating processes (DGPs) with finite variance. The findings reveal a reduction in statistical power based on specific parameters like the psychometric limit of quantification, the number of items in a questionnaire, the response scale used, and the dispersion of the DGP.

双相心理测量量表数据在心理保健和健康经济学中发挥着至关重要的作用,例如在心理治疗分析和设定标准方面。创建准确的心理档案不仅有利于患者,而且节省了时间和成本。心理治疗措施的质量直接影响拨款决策,影响管理选择。此外,消费者数据分析的准确性影响成本、利润和决策的长期可持续性。将心理测量双相量表数据作为组成数据,可以增强众所周知的配对和非配对双样本t检验的统计能力,支持管理决策和健康干预措施的制定或实施。当中心极限定理(CLT)在统计学中成立时,可以观察到统计能力的增加。通过随机模拟,本研究探讨了在有限方差的重尾数据生成过程(DGPs)下,违反CLT对非配对t检验统计能力的影响。研究结果显示,基于特定参数(如量化的心理测量极限、问卷中的项目数量、使用的反应量表和DGP的离散度),统计能力有所降低。
{"title":"Breakdown of the compositional data approach in psychometric Likert scale big data analysis: about the loss of statistical power of two-sample t-tests applied to heavy-tailed big data.","authors":"René Lehmann, Bodo Vogt","doi":"10.1186/s40708-025-00253-2","DOIUrl":"10.1186/s40708-025-00253-2","url":null,"abstract":"<p><p>Bipolar psychometric scale data play a crucial role in psychological healthcare and health economics, such as in psychotherapeutic profiling and setting standards. Creating an accurate psychological profile not only benefits the patient but also saves time and costs. The quality of psychotherapeutic measures directly impacts grant funding decisions, influencing managerial choices. Moreover, the accuracy of consumer data analyses affects costs, profits, and the long-term sustainability of decisions. Considering psychometric bipolar scale data as compositional data can enhance the statistical power of well-known paired and unpaired two-sample t-tests, supporting managerial decision-making and the development or implementation of health interventions. This increase in statistical power is observed when the central limit theorem (CLT) holds true in statistics. Through stochastic simulation, this study explores the impact of violating the CLT on statistical power of the unpaired t-test under heavy-tailed data generating processes (DGPs) with finite variance. The findings reveal a reduction in statistical power based on specific parameters like the psychometric limit of quantification, the number of items in a questionnaire, the response scale used, and the dispersion of the DGP.</p>","PeriodicalId":37465,"journal":{"name":"Brain Informatics","volume":"12 1","pages":"9"},"PeriodicalIF":0.0,"publicationDate":"2025-04-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11977074/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143804412","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Machine-learning models for Alzheimer's disease diagnosis using neuroimaging data: survey, reproducibility, and generalizability evaluation. 使用神经成像数据诊断阿尔茨海默病的机器学习模型:调查、可重复性和概括性评估。
Q1 Computer Science Pub Date : 2025-03-21 DOI: 10.1186/s40708-025-00252-3
Maryam Akhavan Aghdam, Serdar Bozdag, Fahad Saeed

Clinical diagnosis of Alzheimer's disease (AD) is usually made after symptoms such as short-term memory loss are exhibited, which minimizes the intervention and treatment options. The existing screening techniques cannot distinguish between stable MCI (sMCI) cases (i.e., patients who do not convert to AD for at least three years) and progressive MCI (pMCI) cases (i.e., patients who convert to AD in three years or sooner). Delayed diagnosis of AD also disproportionately affects underrepresented and socioeconomically disadvantaged populations. The significant positive impact of an early diagnosis solution for AD across diverse ethno-racial and demographic groups is well-known and recognized. While advancements in high-throughput technologies have enabled the generation of vast amounts of multimodal clinical, and neuroimaging datasets related to AD, most methods utilizing these data sets for diagnostic purposes have not found their way in clinical settings. To better understand the landscape, we surveyed the major preprocessing, data management, traditional machine-learning (ML), and deep learning (DL) techniques used for diagnosing AD using neuroimaging data such as structural magnetic resonance imaging (sMRI), functional magnetic resonance imaging (fMRI), and positron emission tomography (PET). Once we had a good understanding of the methods available, we conducted a study to assess the reproducibility and generalizability of open-source ML models. Our evaluation shows that existing models show reduced generalizability when different cohorts of the data modality are used while controlling other computational factors. The paper concludes with a discussion of major challenges that plague ML models for AD diagnosis and biomarker discovery.

阿尔茨海默病(AD)的临床诊断通常是在出现短期记忆丧失等症状后做出的,这使干预和治疗选择最小化。现有的筛查技术无法区分稳定型MCI (sMCI)病例(即至少3年内未转化为AD的患者)和进行性MCI (pMCI)病例(即在3年或更短时间内转化为AD的患者)。阿尔茨海默病的延迟诊断也不成比例地影响到代表性不足和社会经济上处于不利地位的人群。在不同民族、种族和人口群体中,AD早期诊断解决方案的显著积极影响是众所周知和公认的。虽然高通量技术的进步已经能够产生大量与AD相关的多模式临床和神经成像数据集,但大多数利用这些数据集进行诊断的方法尚未在临床环境中找到自己的方式。为了更好地了解这一前景,我们调查了用于诊断AD的主要预处理、数据管理、传统机器学习(ML)和深度学习(DL)技术,这些技术使用神经成像数据,如结构磁共振成像(sMRI)、功能磁共振成像(fMRI)和正电子发射断层扫描(PET)。一旦我们对可用的方法有了很好的理解,我们就进行了一项研究,以评估开源ML模型的可重复性和泛化性。我们的评估表明,在控制其他计算因素的情况下,当使用不同数据模式的队列时,现有模型的泛化性降低。本文最后讨论了困扰ML模型用于AD诊断和生物标志物发现的主要挑战。
{"title":"Machine-learning models for Alzheimer's disease diagnosis using neuroimaging data: survey, reproducibility, and generalizability evaluation.","authors":"Maryam Akhavan Aghdam, Serdar Bozdag, Fahad Saeed","doi":"10.1186/s40708-025-00252-3","DOIUrl":"10.1186/s40708-025-00252-3","url":null,"abstract":"<p><p>Clinical diagnosis of Alzheimer's disease (AD) is usually made after symptoms such as short-term memory loss are exhibited, which minimizes the intervention and treatment options. The existing screening techniques cannot distinguish between stable MCI (sMCI) cases (i.e., patients who do not convert to AD for at least three years) and progressive MCI (pMCI) cases (i.e., patients who convert to AD in three years or sooner). Delayed diagnosis of AD also disproportionately affects underrepresented and socioeconomically disadvantaged populations. The significant positive impact of an early diagnosis solution for AD across diverse ethno-racial and demographic groups is well-known and recognized. While advancements in high-throughput technologies have enabled the generation of vast amounts of multimodal clinical, and neuroimaging datasets related to AD, most methods utilizing these data sets for diagnostic purposes have not found their way in clinical settings. To better understand the landscape, we surveyed the major preprocessing, data management, traditional machine-learning (ML), and deep learning (DL) techniques used for diagnosing AD using neuroimaging data such as structural magnetic resonance imaging (sMRI), functional magnetic resonance imaging (fMRI), and positron emission tomography (PET). Once we had a good understanding of the methods available, we conducted a study to assess the reproducibility and generalizability of open-source ML models. Our evaluation shows that existing models show reduced generalizability when different cohorts of the data modality are used while controlling other computational factors. The paper concludes with a discussion of major challenges that plague ML models for AD diagnosis and biomarker discovery.</p>","PeriodicalId":37465,"journal":{"name":"Brain Informatics","volume":"12 1","pages":"8"},"PeriodicalIF":0.0,"publicationDate":"2025-03-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11928716/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143674661","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Exploring multi-granularity balance strategy for class incremental learning via three-way granular computing. 探索基于三向颗粒计算的班级增量学习的多粒度平衡策略。
Q1 Computer Science Pub Date : 2025-03-17 DOI: 10.1186/s40708-025-00255-0
Yan Xian, Hong Yu, Ye Wang, Guoyin Wang

Class incremental learning (CIL) is a specific scenario in incremental learning. It aims to continuously learn new classes from the data stream, which suffers from the challenge of catastrophic forgetting. Inspired by the human hippocampus, the CIL method for replaying episodic memory offers a promising solution. However, the limited buffer budget restricts the number of old class samples that can be stored, resulting in an imbalance between new and old class samples during each incremental learning stage. This imbalance adversely affects the mitigation of catastrophic forgetting. Therefore, we propose a novel CIL method based on multi-granularity balance strategy (MGBCIL), which is inspired by the three-way granular computing in human problem-solving. In order to mitigate the adverse effects of imbalances on catastrophic forgetting at fine-, medium-, and coarse-grained levels during training, MGBCIL introduces specific strategies across the batch, task, and decision stages. Specifically, a weighted cross-entropy loss function with a smoothing factor is proposed for batch processing. In the process of task updating and classification decision, contrastive learning with different anchor point settings is employed to promote local and global separation between new and old classes. Additionally, the knowledge distillation technology is used to preserve knowledge of the old classes. Experimental evaluations on CIFAR-10 and CIFAR-100 datasets show that MGBCIL outperforms other methods in most incremental settings. Specifically, when storing 3 exemplars on CIFAR-10 with Base2 Inc2 setting, the average accuracy is improved by up to 9.59% and the forgetting rate is reduced by up to 25.45%.

类增量学习(Class incremental learning, CIL)是增量学习中的一种特殊场景。它旨在不断地从数据流中学习新的课程,而这些课程面临着灾难性遗忘的挑战。受人类海马体的启发,重现情景记忆的CIL方法提供了一个有希望的解决方案。然而,有限的缓冲预算限制了可以存储的旧类样本的数量,导致在每个增量学习阶段新旧类样本之间的不平衡。这种不平衡对灾难性遗忘的缓解产生了不利影响。因此,我们提出了一种基于多粒度平衡策略(MGBCIL)的新型CIL方法,该方法的灵感来自于人类问题解决中的三向颗粒计算。为了减轻训练期间细粒度、中粒度和粗粒度级别的不平衡对灾难性遗忘的不利影响,MGBCIL在批处理、任务和决策阶段引入了特定的策略。具体来说,提出了一种带平滑因子的加权交叉熵损失函数。在任务更新和分类决策过程中,采用不同锚点设置的对比学习,促进新旧类的局部和全局分离。此外,利用知识蒸馏技术对旧类的知识进行保存。在CIFAR-10和CIFAR-100数据集上的实验评估表明,MGBCIL在大多数增量设置中优于其他方法。具体来说,当在CIFAR-10上存储3个样本时,使用Base2 Inc2设置,平均准确率提高了9.59%,遗忘率降低了25.45%。
{"title":"Exploring multi-granularity balance strategy for class incremental learning via three-way granular computing.","authors":"Yan Xian, Hong Yu, Ye Wang, Guoyin Wang","doi":"10.1186/s40708-025-00255-0","DOIUrl":"10.1186/s40708-025-00255-0","url":null,"abstract":"<p><p>Class incremental learning (CIL) is a specific scenario in incremental learning. It aims to continuously learn new classes from the data stream, which suffers from the challenge of catastrophic forgetting. Inspired by the human hippocampus, the CIL method for replaying episodic memory offers a promising solution. However, the limited buffer budget restricts the number of old class samples that can be stored, resulting in an imbalance between new and old class samples during each incremental learning stage. This imbalance adversely affects the mitigation of catastrophic forgetting. Therefore, we propose a novel CIL method based on multi-granularity balance strategy (MGBCIL), which is inspired by the three-way granular computing in human problem-solving. In order to mitigate the adverse effects of imbalances on catastrophic forgetting at fine-, medium-, and coarse-grained levels during training, MGBCIL introduces specific strategies across the batch, task, and decision stages. Specifically, a weighted cross-entropy loss function with a smoothing factor is proposed for batch processing. In the process of task updating and classification decision, contrastive learning with different anchor point settings is employed to promote local and global separation between new and old classes. Additionally, the knowledge distillation technology is used to preserve knowledge of the old classes. Experimental evaluations on CIFAR-10 and CIFAR-100 datasets show that MGBCIL outperforms other methods in most incremental settings. Specifically, when storing 3 exemplars on CIFAR-10 with Base2 Inc2 setting, the average accuracy is improved by up to 9.59% and the forgetting rate is reduced by up to 25.45%.</p>","PeriodicalId":37465,"journal":{"name":"Brain Informatics","volume":"12 1","pages":"7"},"PeriodicalIF":0.0,"publicationDate":"2025-03-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11914578/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143650755","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Localization of epileptic foci from intracranial EEG using the GRU-GC algorithm. 基于GRU-GC算法的颅内脑电图癫痫病灶定位。
Q1 Computer Science Pub Date : 2025-03-15 DOI: 10.1186/s40708-025-00254-1
Xiaojia Wang, Dayang Wu, Chunfeng Yang

Epilepsy is one of the most common clinical diseases, which is caused by abnormal discharge of brain nerves. Around 30% of patients will develop drug-resistant epilepsy that are hard to be cured by anti-epileptic drug treatment. This patient cohort are ideal candidate for surgical resection of the epileptic focus. For safety and maximum effective rate, the key to success of the operation is to identify the focus area and normal functional area accurately in the preoperative evaluation stage. Intracranial EEG (iEEG) has attracted much attention for its precise capture of the state of rapid brain activity and its strong locality. To automate the process of iEEG inspection and surgical evaluation, this paper propose a Gated Recurrent Unit-Granger Causality (GRU-GC) algorithm to detect effective connectivity between channels and construct a directed graph. From six local features, the top five feature combinations were selected to differentiate between epileptic foci and non-epileptic regions. Experiments indicate that these features are most discriminative during the ictal phase, yielding superior classification accuracy. Compared to traditional time-series-based methods, this study shows that GRU-GC algorithm is efficient in building effective graph model for improving preoperative epilepsy evaluations.

癫痫是临床上最常见的疾病之一,它是由脑神经异常放电引起的。约30%的患者会发展为难以通过抗癫痫药物治疗治愈的耐药癫痫。这组患者是手术切除癫痫病灶的理想候选者。术前评估阶段准确识别病灶区和正常功能区是手术成功的关键,以保证手术的安全性和最大的有效率。颅内脑电图(iEEG)以其对大脑快速活动状态的精确捕捉和强局部性而备受关注。为了实现iEEG检查和手术评估过程的自动化,本文提出了一种门控递归单元-格兰杰因果关系(GRU-GC)算法来检测通道之间的有效连通性并构建有向图。从6个局部特征中,选择前5个特征组合来区分癫痫灶和非癫痫区。实验表明,这些特征在初始阶段最具判别性,分类精度较高。与传统的基于时间序列的方法相比,本研究表明,GRU-GC算法在构建有效图模型以改善癫痫术前评估方面是有效的。
{"title":"Localization of epileptic foci from intracranial EEG using the GRU-GC algorithm.","authors":"Xiaojia Wang, Dayang Wu, Chunfeng Yang","doi":"10.1186/s40708-025-00254-1","DOIUrl":"10.1186/s40708-025-00254-1","url":null,"abstract":"<p><p>Epilepsy is one of the most common clinical diseases, which is caused by abnormal discharge of brain nerves. Around 30% of patients will develop drug-resistant epilepsy that are hard to be cured by anti-epileptic drug treatment. This patient cohort are ideal candidate for surgical resection of the epileptic focus. For safety and maximum effective rate, the key to success of the operation is to identify the focus area and normal functional area accurately in the preoperative evaluation stage. Intracranial EEG (iEEG) has attracted much attention for its precise capture of the state of rapid brain activity and its strong locality. To automate the process of iEEG inspection and surgical evaluation, this paper propose a Gated Recurrent Unit-Granger Causality (GRU-GC) algorithm to detect effective connectivity between channels and construct a directed graph. From six local features, the top five feature combinations were selected to differentiate between epileptic foci and non-epileptic regions. Experiments indicate that these features are most discriminative during the ictal phase, yielding superior classification accuracy. Compared to traditional time-series-based methods, this study shows that GRU-GC algorithm is efficient in building effective graph model for improving preoperative epilepsy evaluations.</p>","PeriodicalId":37465,"journal":{"name":"Brain Informatics","volume":"12 1","pages":"6"},"PeriodicalIF":0.0,"publicationDate":"2025-03-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11910482/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143634674","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Harnessing the synergy of statistics and deep learning for BCI competition 4 dataset 4: a novel approach. 利用统计和深度学习的协同作用为BCI竞赛4数据集4:一种新方法。
Q1 Computer Science Pub Date : 2025-02-15 DOI: 10.1186/s40708-025-00250-5
Gauttam Jangir, Nisheeth Joshi, Gaurav Purohit

Human brain signal processing and finger's movement coordination is a complex mechanism. In this mechanism finger's movement is mostly performed for every day's task. It is well known that to capture such movement EEG or ECoG signals are used. In this order to find the patterns from these signals is important. The BCI competition 4 dataset 4 is one such standard dataset of ECoG signals for individual finger movement provided by University of Washington, USA. In this work, this dataset is, statistically analyzed to understand the nature of data and outliers in it. Effectiveness of pre-processing algorithm is then visualized. The cleaned dataset has dual polarity and gaussian distribution nature which makes Tanh activation function suitable for the neural network BC4D4 model. BC4D4 uses Convolutional neural network for feature extraction, dense neural network for pattern identification and incorporating dropout & regularization making the proposed model more resilient. Our model outperforms the state of the art work on the dataset 4 achieving 0.85 correlation value that is 1.85X (Winner of BCI competition 4, 2012) & 1.25X (Finger Flex model, 2022).

人脑信号处理和手指运动协调是一个复杂的机制。在这一机制中,手指运动主要是为了完成日常任务。众所周知,捕捉这种运动需要使用脑电图或心电图信号。因此,从这些信号中找出规律非常重要。BCI 竞赛 4 数据集 4 就是由美国华盛顿大学提供的单个手指运动 ECoG 信号标准数据集。在这项工作中,将对该数据集进行统计分析,以了解数据的性质和异常值。然后对预处理算法的效果进行可视化。清理后的数据集具有双极性和高斯分布的性质,这使得 Tanh 激活函数适用于神经网络 BC4D4 模型。BC4D4 使用卷积神经网络进行特征提取,使用密集神经网络进行模式识别,并结合了 dropout 和正则化,使所提出的模型更具弹性。我们的模型在数据集 4 上的相关性达到了 0.85 倍(2012 年第 4 届生物识别竞赛优胜者)和 1.25 倍(2022 年 Finger Flex 模型),超过了目前最先进的模型。
{"title":"Harnessing the synergy of statistics and deep learning for BCI competition 4 dataset 4: a novel approach.","authors":"Gauttam Jangir, Nisheeth Joshi, Gaurav Purohit","doi":"10.1186/s40708-025-00250-5","DOIUrl":"10.1186/s40708-025-00250-5","url":null,"abstract":"<p><p>Human brain signal processing and finger's movement coordination is a complex mechanism. In this mechanism finger's movement is mostly performed for every day's task. It is well known that to capture such movement EEG or ECoG signals are used. In this order to find the patterns from these signals is important. The BCI competition 4 dataset 4 is one such standard dataset of ECoG signals for individual finger movement provided by University of Washington, USA. In this work, this dataset is, statistically analyzed to understand the nature of data and outliers in it. Effectiveness of pre-processing algorithm is then visualized. The cleaned dataset has dual polarity and gaussian distribution nature which makes Tanh activation function suitable for the neural network BC4D4 model. BC4D4 uses Convolutional neural network for feature extraction, dense neural network for pattern identification and incorporating dropout & regularization making the proposed model more resilient. Our model outperforms the state of the art work on the dataset 4 achieving 0.85 correlation value that is 1.85X (Winner of BCI competition 4, 2012) & 1.25X (Finger Flex model, 2022).</p>","PeriodicalId":37465,"journal":{"name":"Brain Informatics","volume":"12 1","pages":"5"},"PeriodicalIF":0.0,"publicationDate":"2025-02-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11829870/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143426431","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Novel machine learning-driven comparative analysis of CSP, STFT, and CSP-STFT fusion for EEG data classification across multiple meditation and non-meditation sessions in BCI pipeline. 基于机器学习驱动的CSP、STFT和CSP-STFT融合的脑机接口管道中多个冥想和非冥想时段脑电数据分类的比较分析
Q1 Computer Science Pub Date : 2025-02-08 DOI: 10.1186/s40708-025-00251-4
Nalinda D Liyanagedera, Corinne A Bareham, Heather Kempton, Hans W Guesgen

This study focuses on classifying multiple sessions of loving kindness meditation (LKM) and non-meditation electroencephalography (EEG) data. This novel study focuses on using multiple sessions of EEG data from a single individual to train a machine learning pipeline, and then using a new session data from the same individual for the classification. Here, two meditation techniques, LKM-Self and LKM-Others were compared with non-meditation EEG data for 12 participants. Among many tested, three BCI pipelines we built produced promising results, successfully detecting features in meditation/ non-meditation EEG data. While testing different feature extraction algorithms, a common neural network structure was used as the classification algorithm to compare the performance of the feature extraction algorithms. For two of those pipelines, Common Spatial Patterns (CSP) and Short Time Fourier Transform (STFT) were successfully used as feature extraction algorithms where both these algorithms are significantly new for meditation EEG. As a novel concept, the third BCI pipeline used a feature extraction algorithm that fused the features of CSP and STFT, achieving the highest classification accuracies among all tested pipelines. Analyses were conducted using EEG data of 3, 4 or 5 sessions, totaling 3960 tests on the entire dataset. At the end of the study, when considering all the tests, the overall classification accuracy using SCP alone was 67.1%, and it was 67.8% for STFT alone. The algorithm combining the features of CSP and STFT achieved an overall classification accuracy of 72.9% which is more than 5% higher than the other two pipelines. At the same time, the highest mean classification accuracy for the 12 participants was achieved using the pipeline with the combination of CSP STFT algorithm, reaching 75.5% for LKM-Self/ non-meditation for the case of 5 sessions of data. Additionally, the highest individual classification accuracy of 88.9% was obtained by the participant no. 14. Furthermore, the results showed that the classification accuracies for all three pipelines increased with the number of training sessions increased from 2 to 3 and then to 4. The study was successful in classifying a new session of EEG meditation/ non-meditation data after training machine learning algorithms using a different set of session data, and this achievement will be beneficial in the development of algorithms that support meditation.

本研究的重点是对多时段仁爱冥想(LKM)和非冥想脑电图(EEG)数据进行分类。这项新颖的研究侧重于使用来自单个个体的多会话脑电图数据来训练机器学习管道,然后使用来自同一个体的新会话数据进行分类。本研究将12名参与者的两种冥想方法——LKM-Self和LKM-Others与非冥想的脑电图数据进行了比较。在许多测试中,我们建立的三个脑机接口管道产生了令人鼓舞的结果,成功地检测了冥想/非冥想脑电图数据的特征。在测试不同特征提取算法的同时,采用一种通用的神经网络结构作为分类算法,比较不同特征提取算法的性能。对于其中两个管道,我们成功地使用了公共空间模式(CSP)和短时傅里叶变换(STFT)作为特征提取算法,这两种算法对于冥想脑电图来说都是非常新颖的。作为一个新颖的概念,第三个BCI管道使用了融合CSP和STFT特征的特征提取算法,在所有测试管道中实现了最高的分类准确率。使用3次、4次或5次的脑电图数据进行分析,对整个数据集进行了总共3960次测试。研究结束时,综合考虑所有测试,单独使用SCP的分类准确率为67.1%,单独使用STFT的分类准确率为67.8%。该算法结合了CSP和STFT的特征,总体分类准确率达到72.9%,比其他两种管道高出5%以上。同时,结合CSP STFT算法的流水线对12个参与者的平均分类准确率最高,在5次数据的情况下,LKM-Self/ non-meditation的分类准确率达到75.5%。此外,参与者编号为88.9%的个体分类准确率最高。14. 此外,结果表明,随着训练次数从2次增加到3次,再增加到4次,三种管道的分类准确率都有所增加。在使用不同的会话数据集训练机器学习算法后,该研究成功地对新会话的EEG冥想/非冥想数据进行了分类,这一成就将有助于开发支持冥想的算法。
{"title":"Novel machine learning-driven comparative analysis of CSP, STFT, and CSP-STFT fusion for EEG data classification across multiple meditation and non-meditation sessions in BCI pipeline.","authors":"Nalinda D Liyanagedera, Corinne A Bareham, Heather Kempton, Hans W Guesgen","doi":"10.1186/s40708-025-00251-4","DOIUrl":"10.1186/s40708-025-00251-4","url":null,"abstract":"<p><p>This study focuses on classifying multiple sessions of loving kindness meditation (LKM) and non-meditation electroencephalography (EEG) data. This novel study focuses on using multiple sessions of EEG data from a single individual to train a machine learning pipeline, and then using a new session data from the same individual for the classification. Here, two meditation techniques, LKM-Self and LKM-Others were compared with non-meditation EEG data for 12 participants. Among many tested, three BCI pipelines we built produced promising results, successfully detecting features in meditation/ non-meditation EEG data. While testing different feature extraction algorithms, a common neural network structure was used as the classification algorithm to compare the performance of the feature extraction algorithms. For two of those pipelines, Common Spatial Patterns (CSP) and Short Time Fourier Transform (STFT) were successfully used as feature extraction algorithms where both these algorithms are significantly new for meditation EEG. As a novel concept, the third BCI pipeline used a feature extraction algorithm that fused the features of CSP and STFT, achieving the highest classification accuracies among all tested pipelines. Analyses were conducted using EEG data of 3, 4 or 5 sessions, totaling 3960 tests on the entire dataset. At the end of the study, when considering all the tests, the overall classification accuracy using SCP alone was 67.1%, and it was 67.8% for STFT alone. The algorithm combining the features of CSP and STFT achieved an overall classification accuracy of 72.9% which is more than 5% higher than the other two pipelines. At the same time, the highest mean classification accuracy for the 12 participants was achieved using the pipeline with the combination of CSP STFT algorithm, reaching 75.5% for LKM-Self/ non-meditation for the case of 5 sessions of data. Additionally, the highest individual classification accuracy of 88.9% was obtained by the participant no. 14. Furthermore, the results showed that the classification accuracies for all three pipelines increased with the number of training sessions increased from 2 to 3 and then to 4. The study was successful in classifying a new session of EEG meditation/ non-meditation data after training machine learning algorithms using a different set of session data, and this achievement will be beneficial in the development of algorithms that support meditation.</p>","PeriodicalId":37465,"journal":{"name":"Brain Informatics","volume":"12 1","pages":"4"},"PeriodicalIF":0.0,"publicationDate":"2025-02-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11807047/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143374904","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Brain Informatics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1