Pub Date : 2025-12-01Epub Date: 2025-01-13DOI: 10.1007/s11571-024-10202-0
V Kavitha, R Siva
Autism spectrum disorder (ASD) is one of the complicated neurodevelopmental disorders that impacts the daily functioning and social interactions of individuals. It includes diverse symptoms and severity levels, making it challenging to diagnose and treat efficiently. Various deep learning (DL) based methods have been developed for diagnosing ASD, which rely heavily on behavioral assessment. However, existing techniques have suffered from poor diagnostic outcomes, higher computational complexity, and overfitting issues. To address these challenges, this research work introduces an innovative framework called 3T Dilated Inception Network (3T-DINet) for effective ASD diagnosis using resting-state functional Magnetic Resonance Imaging (rs-fMRI) images. The proposed 3T-DINet technique designs a 3T dilated inception module that incorporates dilated convolutions along with the inception module, allowing it to extract multi-scale features from brain connectivity patterns. The 3T dilated inception module uses three distinct dilation rates (low, medium, and high) in parallel to determine local, mid-level, and global features from the brain. In addition, the proposed approach implements Residual networks (ResNet) to avoid the vanishing gradient problem and enhance the feature extraction ability. The model is further optimized using a Crossover-based Black Widow Optimization (CBWO) algorithm that fine-tunes the hyperparameters thereby enhancing the overall performance of the model. Further, the performance of the 3T-DINet model is evaluated using the five ASD datasets with distinct evaluation parameters. The proposed 3T-DINet technique achieved superior diagnosis results compared to recent previous works. From this simulation validation, it's clear that the 3T-DINet provides an excellent contribution to early ASD diagnosis and enhances patient treatment outcomes.
{"title":"3T dilated inception network for enhanced autism spectrum disorder diagnosis using resting-state fMRI data.","authors":"V Kavitha, R Siva","doi":"10.1007/s11571-024-10202-0","DOIUrl":"10.1007/s11571-024-10202-0","url":null,"abstract":"<p><p>Autism spectrum disorder (ASD) is one of the complicated neurodevelopmental disorders that impacts the daily functioning and social interactions of individuals. It includes diverse symptoms and severity levels, making it challenging to diagnose and treat efficiently. Various deep learning (DL) based methods have been developed for diagnosing ASD, which rely heavily on behavioral assessment. However, existing techniques have suffered from poor diagnostic outcomes, higher computational complexity, and overfitting issues. To address these challenges, this research work introduces an innovative framework called 3T Dilated Inception Network (3T-DINet) for effective ASD diagnosis using resting-state functional Magnetic Resonance Imaging (rs-fMRI) images. The proposed 3T-DINet technique designs a 3T dilated inception module that incorporates dilated convolutions along with the inception module, allowing it to extract multi-scale features from brain connectivity patterns. The 3T dilated inception module uses three distinct dilation rates (low, medium, and high) in parallel to determine local, mid-level, and global features from the brain. In addition, the proposed approach implements Residual networks (ResNet) to avoid the vanishing gradient problem and enhance the feature extraction ability. The model is further optimized using a Crossover-based Black Widow Optimization (CBWO) algorithm that fine-tunes the hyperparameters thereby enhancing the overall performance of the model. Further, the performance of the 3T-DINet model is evaluated using the five ASD datasets with distinct evaluation parameters. The proposed 3T-DINet technique achieved superior diagnosis results compared to recent previous works. From this simulation validation, it's clear that the 3T-DINet provides an excellent contribution to early ASD diagnosis and enhances patient treatment outcomes.</p>","PeriodicalId":10500,"journal":{"name":"Cognitive Neurodynamics","volume":"19 1","pages":"22"},"PeriodicalIF":3.1,"publicationDate":"2025-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11729590/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143001551","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-12-01Epub Date: 2025-01-24DOI: 10.1007/s11571-025-10220-6
Long Chen, Yihao Hu, Zhongpeng Wang, Lei Zhang, Chuxiang Jian, Shengcui Cheng, Dong Ming
Motor planning plays a pivotal role in daily life. Transcutaneous auricular vagus nerve stimulation (taVNS) has been demonstrated to enhance decision-making efficiency, illustrating its potential use in cognitive modulation. However, current research primarily focuses on behavioral and single-modal electrophysiological signal, such as electroencephalography (EEG) and electrocardiography (ECG). To investigate the effect of taVNS on motor planning, a total of 21 subjects were recruited for this study and were divided into two groups: active group (n = 10) and sham group (n = 11). Each subject was required to be involved in a single-blind, sham-controlled, between-subject end-state comfort (ESC) experiment. The study compared behavioral indicators and electrophysiological features before and following taVNS. The results indicated a notable reduction in reaction time and an appreciable increase in the proportion of end-state comfort among the participants following taVNS, accompanied by notable alterations in motor-related cortical potential (MRCP) amplitude, low-frequency power of HRV (LF), and cortico-cardiac coherence, particularly in the parietal and occipital regions. These findings show that taVNS may impact the brain and heart, potentially enhancing their interaction, and improve participants' ability of motor planning.
{"title":"Effects of transcutaneous auricular vagus nerve stimulation (taVNS) on motor planning: a multimodal signal study.","authors":"Long Chen, Yihao Hu, Zhongpeng Wang, Lei Zhang, Chuxiang Jian, Shengcui Cheng, Dong Ming","doi":"10.1007/s11571-025-10220-6","DOIUrl":"10.1007/s11571-025-10220-6","url":null,"abstract":"<p><p>Motor planning plays a pivotal role in daily life. Transcutaneous auricular vagus nerve stimulation (taVNS) has been demonstrated to enhance decision-making efficiency, illustrating its potential use in cognitive modulation. However, current research primarily focuses on behavioral and single-modal electrophysiological signal, such as electroencephalography (EEG) and electrocardiography (ECG). To investigate the effect of taVNS on motor planning, a total of 21 subjects were recruited for this study and were divided into two groups: active group (n = 10) and sham group (n = 11). Each subject was required to be involved in a single-blind, sham-controlled, between-subject end-state comfort (ESC) experiment. The study compared behavioral indicators and electrophysiological features before and following taVNS. The results indicated a notable reduction in reaction time and an appreciable increase in the proportion of end-state comfort among the participants following taVNS, accompanied by notable alterations in motor-related cortical potential (MRCP) amplitude, low-frequency power of HRV (LF), and cortico-cardiac coherence, particularly in the parietal and occipital regions. These findings show that taVNS may impact the brain and heart, potentially enhancing their interaction, and improve participants' ability of motor planning.</p>","PeriodicalId":10500,"journal":{"name":"Cognitive Neurodynamics","volume":"19 1","pages":"35"},"PeriodicalIF":3.1,"publicationDate":"2025-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11759740/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143045764","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Identifying the cognitive state can help educators understand the evolving thought processes of learners, and it is important in promoting the development of higher-order thinking skills (HOTS). Cognitive neuroscience research identifies cognitive states by designing experimental tasks and recording electroencephalography (EEG) signals during task performance. However, most of the previous studies primarily concentrated on extracting features from individual channels in single-type tasks, ignoring the interconnection across channels. In this study, three learning activities (i.e., video watching activity, keyword extracting activity, and essay creating activity) were designed based on a revised Bloom's taxonomy and the Interactive-Constructive-Active-Passive framework and used with 31 college students. The EEG signals were recorded when they were engaged in these activities. First, whole-brain network temporal dynamics were characterized by EEG microstate sequence analysis. Such dynamic changes rely on learning activity and corresponding functional brain systems. Subsequently, phase locking value was used to construct synchrony-based functional brain networks. The network characteristics were extracted to be inputted into different machine learning classifiers: Support Vector Machine, K-Nearest Neighbour, Random Forest, and eXtreme Gradient Boosting (XGBoost). XGBoost showed superior performance in the classification of cognitive states, with an accuracy of 88.07%. Furthermore, SHapley Additive exPlanations (SHAP) was adopted to reveal the connections between different brain regions that contributed to the classification of cognitive state. SHAP analysis reveals that the connections in the frontal, temporal, and central regions are most important for the high cognitive state. Collectively, this study may provide further evidence for educators to design cognitive-guided instructional activities to enhance learners' HOTS.
{"title":"Multi-level cognitive state classification of learners using complex brain networks and interpretable machine learning.","authors":"Xiuling He, Yue Li, Xiong Xiao, Yingting Li, Jing Fang, Ruijie Zhou","doi":"10.1007/s11571-024-10203-z","DOIUrl":"https://doi.org/10.1007/s11571-024-10203-z","url":null,"abstract":"<p><p>Identifying the cognitive state can help educators understand the evolving thought processes of learners, and it is important in promoting the development of higher-order thinking skills (HOTS). Cognitive neuroscience research identifies cognitive states by designing experimental tasks and recording electroencephalography (EEG) signals during task performance. However, most of the previous studies primarily concentrated on extracting features from individual channels in single-type tasks, ignoring the interconnection across channels. In this study, three learning activities (i.e., video watching activity, keyword extracting activity, and essay creating activity) were designed based on a revised Bloom's taxonomy and the Interactive-Constructive-Active-Passive framework and used with 31 college students. The EEG signals were recorded when they were engaged in these activities. First, whole-brain network temporal dynamics were characterized by EEG microstate sequence analysis. Such dynamic changes rely on learning activity and corresponding functional brain systems. Subsequently, phase locking value was used to construct synchrony-based functional brain networks. The network characteristics were extracted to be inputted into different machine learning classifiers: Support Vector Machine, K-Nearest Neighbour, Random Forest, and eXtreme Gradient Boosting (XGBoost). XGBoost showed superior performance in the classification of cognitive states, with an accuracy of 88.07%. Furthermore, SHapley Additive exPlanations (SHAP) was adopted to reveal the connections between different brain regions that contributed to the classification of cognitive state. SHAP analysis reveals that the connections in the frontal, temporal, and central regions are most important for the high cognitive state. Collectively, this study may provide further evidence for educators to design cognitive-guided instructional activities to enhance learners' HOTS.</p>","PeriodicalId":10500,"journal":{"name":"Cognitive Neurodynamics","volume":"19 1","pages":"5"},"PeriodicalIF":3.1,"publicationDate":"2025-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11699182/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142930821","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-12-01Epub Date: 2025-01-09DOI: 10.1007/s11571-024-10190-1
K G Shanthi, A Mary Joy Kinol, S Rukmani Devi, K Kannan
In recent years, Wireless Sensor Networks (WSN) have become vital because of their versatility in numerous applications. Nevertheless, the attain problems like inherent noise, and limited node computation capabilities, result in reduced sensor node lifespan as well as enhanced power consumption. To tackle such problems, this study develops a Modified-Distributed Arithmetic-Offset Binary Coding-based Adaptive Finite Impulse Response (MDA-OBC based AFIR) framework. By leveraging Modified Distributed Arithmetic (MDA) which optimizes arithmetic operations by replacing the multipliers with lookup tables (LUT) hence minimizing energy consumption as well as computational complexity. Offset Binary Coding (OBC) enhanced the efficiency of data transmission by minimizing the data representation overhead. In addition to this, the adaptive strategy is incorporated with the Adaptive Finite Impulse Response (AFIR) framework permitting the filters to dynamically adjust to varying signal characteristics, thus offering high noise suppression and low distortion rates. Comprehensive simulations and comparative analysis validate the effectiveness of the proposed MDA-OBC-based AFIR method. The proposed method attained a lower energy consumption of 1.5 J and 130 W power consumption than the traditional implementations, resulting in significant energy efficiency and data transmission in signal preprocessing and noise suppression in WSNs.
{"title":"Cognitive neurodynamic approaches to adaptive signal processing in wireless sensor networks.","authors":"K G Shanthi, A Mary Joy Kinol, S Rukmani Devi, K Kannan","doi":"10.1007/s11571-024-10190-1","DOIUrl":"10.1007/s11571-024-10190-1","url":null,"abstract":"<p><p>In recent years, Wireless Sensor Networks (WSN) have become vital because of their versatility in numerous applications. Nevertheless, the attain problems like inherent noise, and limited node computation capabilities, result in reduced sensor node lifespan as well as enhanced power consumption. To tackle such problems, this study develops a Modified-Distributed Arithmetic-Offset Binary Coding-based Adaptive Finite Impulse Response (MDA-OBC based AFIR) framework. By leveraging Modified Distributed Arithmetic (MDA) which optimizes arithmetic operations by replacing the multipliers with lookup tables (LUT) hence minimizing energy consumption as well as computational complexity. Offset Binary Coding (OBC) enhanced the efficiency of data transmission by minimizing the data representation overhead. In addition to this, the adaptive strategy is incorporated with the Adaptive Finite Impulse Response (AFIR) framework permitting the filters to dynamically adjust to varying signal characteristics, thus offering high noise suppression and low distortion rates. Comprehensive simulations and comparative analysis validate the effectiveness of the proposed MDA-OBC-based AFIR method. The proposed method attained a lower energy consumption of 1.5 J and 130 W power consumption than the traditional implementations, resulting in significant energy efficiency and data transmission in signal preprocessing and noise suppression in WSNs.</p>","PeriodicalId":10500,"journal":{"name":"Cognitive Neurodynamics","volume":"19 1","pages":"11"},"PeriodicalIF":3.1,"publicationDate":"2025-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11717781/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142969905","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-12-01Epub Date: 2025-01-09DOI: 10.1007/s11571-024-10204-y
Haibin Yin, Xiaojuan Sun, Kai Yang, Yueheng Lan, Zeying Lu
The dentate gyrus (DG) in hippocampus is reported to perform pattern separation, converting similar inputs into different outputs and thus avoiding memory interference. Previous studies have found that human and mice with epilepsy have significant pattern separation defects and a portion of adult-born granule cells (abGCs) migrate abnormally into the hilus, forming hilus ectopic granule cells (HEGCs). For the lack of relevant pathophysiological experiments, how HEGCs affect pattern separation remains unclear. Therefore, in this paper, we will construct the DG neuronal circuit and focus on discussing effects of HEGCs on pattern separation numerically. The obtained results showed that HEGCs impaired pattern separation efficiency since the sparse firing of granule cells (GCs) was destroyed. We provided new insights into the underlining mechanisms of HEGCs impairing pattern separation through analyzing two excitatory circuits: GC-HEGC-GC and GC-Mossy cell (MC)-GC, both of which involve the participation of HEGCs within the DG. It is revealed that the recurrent excitatory circuit GC-HEGC-GC formed by HEGCs mossy fiber sprouting significantly enhanced GCs activity, consequently disrupted pattern separation. However, another excitatory circuit had negligible effects on pattern separation due to the direct and indirect influences of MCs on GCs, which in turn led to the GCs sparse firing. Thus, HEGCs impair DG pattern separation mainly through the GC-HEGC-GC circuit and therefore ablating HEGCs may be one of the effective ways to improve pattern separation in patients with epilepsy.
{"title":"Regulation of dentate gyrus pattern separation by hilus ectopic granule cells.","authors":"Haibin Yin, Xiaojuan Sun, Kai Yang, Yueheng Lan, Zeying Lu","doi":"10.1007/s11571-024-10204-y","DOIUrl":"10.1007/s11571-024-10204-y","url":null,"abstract":"<p><p>The dentate gyrus (DG) in hippocampus is reported to perform pattern separation, converting similar inputs into different outputs and thus avoiding memory interference. Previous studies have found that human and mice with epilepsy have significant pattern separation defects and a portion of adult-born granule cells (abGCs) migrate abnormally into the hilus, forming hilus ectopic granule cells (HEGCs). For the lack of relevant pathophysiological experiments, how HEGCs affect pattern separation remains unclear. Therefore, in this paper, we will construct the DG neuronal circuit and focus on discussing effects of HEGCs on pattern separation numerically. The obtained results showed that HEGCs impaired pattern separation efficiency since the sparse firing of granule cells (GCs) was destroyed. We provided new insights into the underlining mechanisms of HEGCs impairing pattern separation through analyzing two excitatory circuits: GC-HEGC-GC and GC-Mossy cell (MC)-GC, both of which involve the participation of HEGCs within the DG. It is revealed that the recurrent excitatory circuit GC-HEGC-GC formed by HEGCs mossy fiber sprouting significantly enhanced GCs activity, consequently disrupted pattern separation. However, another excitatory circuit had negligible effects on pattern separation due to the direct and indirect influences of MCs on GCs, which in turn led to the GCs sparse firing. Thus, HEGCs impair DG pattern separation mainly through the GC-HEGC-GC circuit and therefore ablating HEGCs may be one of the effective ways to improve pattern separation in patients with epilepsy.</p>","PeriodicalId":10500,"journal":{"name":"Cognitive Neurodynamics","volume":"19 1","pages":"10"},"PeriodicalIF":3.1,"publicationDate":"2025-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11718051/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142969958","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
The increasing adoption of wearable technologies highlights the potential of electroencephalogram (EEG) signals for biometric recognition. However, the intrinsic variability in cross-session EEG data presents substantial challenges in maintaining model stability and reliability. Moreover, the diversity within single-task protocols complicates achieving consistent and generalized model performance. To address these issues, we propose the Joint Disentangled Representation with Domain Adversarial Training (JDR-DAT) framework for EEG-based cross-session biometric recognition within single-task protocols. The JDR-DAT framework disentangles identity-specific features through mutual information estimation and incorporates domain adversarial training to enhance longitudinal robustness. Extensive experiments on longitudinal EEG data from two publicly available single-task protocol datasets-RSVP-based (Rapid Serial Visual Presentation) and MI-based (Motor Imagery)-demonstrate the efficacy of the JDR-DAT framework, with the proposed method achieving average accuracies of 85.83% and 96.72%, respectively.
{"title":"Joint disentangled representation and domain adversarial training for EEG-based cross-session biometric recognition in single-task protocols.","authors":"Honggang Liu, Xuanyu Jin, Dongjun Liu, Wanzeng Kong, Jiajia Tang, Yong Peng","doi":"10.1007/s11571-024-10214-w","DOIUrl":"10.1007/s11571-024-10214-w","url":null,"abstract":"<p><p>The increasing adoption of wearable technologies highlights the potential of electroencephalogram (EEG) signals for biometric recognition. However, the intrinsic variability in cross-session EEG data presents substantial challenges in maintaining model stability and reliability. Moreover, the diversity within single-task protocols complicates achieving consistent and generalized model performance. To address these issues, we propose the Joint Disentangled Representation with Domain Adversarial Training (JDR-DAT) framework for EEG-based cross-session biometric recognition within single-task protocols. The JDR-DAT framework disentangles identity-specific features through mutual information estimation and incorporates domain adversarial training to enhance longitudinal robustness. Extensive experiments on longitudinal EEG data from two publicly available single-task protocol datasets-RSVP-based (Rapid Serial Visual Presentation) and MI-based (Motor Imagery)-demonstrate the efficacy of the JDR-DAT framework, with the proposed method achieving average accuracies of 85.83% and 96.72%, respectively.</p>","PeriodicalId":10500,"journal":{"name":"Cognitive Neurodynamics","volume":"19 1","pages":"31"},"PeriodicalIF":3.1,"publicationDate":"2025-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11757832/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143045783","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
To deploy Electroencephalogram (EEG) based Mental Workload Recognition (MWR) systems in the real world, it is crucial to develop general models that can be applied across subjects. Previous studies have utilized domain adaptation to mitigate inter-subject discrepancies in EEG data distributions. However, they have focused on reducing global domain discrepancy, while neglecting local workload-categorical domain divergence. This degrades the workload-discriminating ability of subject-invariant features. To deal with this problem, we propose a novel joint category-wise and domain-wise alignment Domain Adaptation (cdaDA) algorithm, using bi-classifier learning and domain discriminative adversarial learning. The bi-classifier learning approach is adopted to address the similarities and differences between categories, helping to align EEG data within the same mental workload categories. Additionally, the domain discriminative adversarial learning technique is adopted to consider global domain information by minimizing global domain discrepancy. By integrating both local category information and global domain information, the cdaDA model performs a coarse-to-fine alignment and achieves promising cross-subject MWR results.
{"title":"Cross-subject mental workload recognition using bi-classifier domain adversarial learning.","authors":"Yueying Zhou, Pengpai Wang, Peiliang Gong, Peng Wan, Xuyun Wen, Daoqiang Zhang","doi":"10.1007/s11571-024-10215-9","DOIUrl":"10.1007/s11571-024-10215-9","url":null,"abstract":"<p><p>To deploy Electroencephalogram (EEG) based Mental Workload Recognition (MWR) systems in the real world, it is crucial to develop general models that can be applied across subjects. Previous studies have utilized domain adaptation to mitigate inter-subject discrepancies in EEG data distributions. However, they have focused on reducing global domain discrepancy, while neglecting local workload-categorical domain divergence. This degrades the workload-discriminating ability of subject-invariant features. To deal with this problem, we propose a novel joint category-wise and domain-wise alignment Domain Adaptation (cdaDA) algorithm, using bi-classifier learning and domain discriminative adversarial learning. The bi-classifier learning approach is adopted to address the similarities and differences between categories, helping to align EEG data within the same mental workload categories. Additionally, the domain discriminative adversarial learning technique is adopted to consider global domain information by minimizing global domain discrepancy. By integrating both local category information and global domain information, the cdaDA model performs a coarse-to-fine alignment and achieves promising cross-subject MWR results.</p>","PeriodicalId":10500,"journal":{"name":"Cognitive Neurodynamics","volume":"19 1","pages":"16"},"PeriodicalIF":3.1,"publicationDate":"2025-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11718037/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142969952","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-12-01Epub Date: 2025-01-13DOI: 10.1007/s11571-024-10186-x
Xiaoliang Guo, Shuo Zhai
Enhancing the accuracy of emotion recognition models through multimodal learning is a common approach. However, challenges such as insufficient modal feature learning in multimodal inference and scarcity of sample data continue to pose obstacles that need to be overcome. Therefore, we propose a novel adaptive lightweight multimodal efficient feature inference network (ALME-FIN). We introduce a time-domain lightweight adaptive network (TDLAN) and a two-dimensional dynamic focusing network (TDDFN) for multimodal feature learning. The TDLAN incorporates the denoising process as an integral part of network training, achieving adaptive denoising for each sample through the continuous optimization of the trainable filtering threshold. Simultaneously, it incorporates an interactive convolutional sampling module, enabling lightweight multi-scale feature extraction in the time domain. TDDFN effectively extracts core image features while filtering out redundancies. During the training process, the Multi-network dynamic gradient adjustment framework (MDGAF) dynamically monitors the feature learning efficacy across different modalities. It timely adjusts the training gradients of networks to allocate additional optimization time for under-optimized modalities, thereby maximizing the utilization of multimodal feature information. Moreover, the introduction of a Multi-class relationship interaction module prior to the classifier aids the model in clearly understanding the relationships among different category samples. This approach enables the model to achieve relatively accurate emotion recognition even in scenarios of limited sample availability. Compared to existing multimodal learning techniques, ALME-FIN exhibits a more efficient multimodal feature inference method that can achieve satisfactory emotional recognition performance even with a limited number of samples.
{"title":"A novel adaptive lightweight multimodal efficient feature inference network ALME-FIN for EEG emotion recognition.","authors":"Xiaoliang Guo, Shuo Zhai","doi":"10.1007/s11571-024-10186-x","DOIUrl":"10.1007/s11571-024-10186-x","url":null,"abstract":"<p><p>Enhancing the accuracy of emotion recognition models through multimodal learning is a common approach. However, challenges such as insufficient modal feature learning in multimodal inference and scarcity of sample data continue to pose obstacles that need to be overcome. Therefore, we propose a novel adaptive lightweight multimodal efficient feature inference network (ALME-FIN). We introduce a time-domain lightweight adaptive network (TDLAN) and a two-dimensional dynamic focusing network (TDDFN) for multimodal feature learning. The TDLAN incorporates the denoising process as an integral part of network training, achieving adaptive denoising for each sample through the continuous optimization of the trainable filtering threshold. Simultaneously, it incorporates an interactive convolutional sampling module, enabling lightweight multi-scale feature extraction in the time domain. TDDFN effectively extracts core image features while filtering out redundancies. During the training process, the Multi-network dynamic gradient adjustment framework (MDGAF) dynamically monitors the feature learning efficacy across different modalities. It timely adjusts the training gradients of networks to allocate additional optimization time for under-optimized modalities, thereby maximizing the utilization of multimodal feature information. Moreover, the introduction of a Multi-class relationship interaction module prior to the classifier aids the model in clearly understanding the relationships among different category samples. This approach enables the model to achieve relatively accurate emotion recognition even in scenarios of limited sample availability. Compared to existing multimodal learning techniques, ALME-FIN exhibits a more efficient multimodal feature inference method that can achieve satisfactory emotional recognition performance even with a limited number of samples.</p>","PeriodicalId":10500,"journal":{"name":"Cognitive Neurodynamics","volume":"19 1","pages":"24"},"PeriodicalIF":3.1,"publicationDate":"2025-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11729629/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143000626","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Fatigue-induced incidents in transportation, aerospace, military, and other areas have been on the rise, posing a threat to human life and safety. The determination of fatigue states holds significant importance, especially through reliable and conveniently available physiological indicators. Here, a portable custom-built fNIRS system was used to monitor the fatigue state caused by nap deprivation. fNIRS signals in ten channels at the prefrontal cortex were collected, changes in blood oxygen concentration were analyzed, followed by a deep learning model to classify fatigue states. For the high-dimensionality and multi-channel characteristics of the fNIRS signal data, a novel 1D revised CNN-ResNet network was proposed based on the double-layer channel attenuation residual block. The results showed a 97.78% accuracy in fatigue state classification, significantly superior than several conventional methods. Furthermore, a fatigue-arousal experiment was designed to explore the feasibility of forced arousal of fatigued subjects through exercise stimulation. The fNIRS results showed a significant increase in brain activity with the conduction of exercise. The proposed method serves as a reliable tool for the evaluation of fatigue states, potentially reducing fatigue-induced harms and risks.
{"title":"Monitoring nap deprivation-induced fatigue using fNIRS and deep learning.","authors":"Pei Ma, Chenyang Pan, Huijuan Shen, Wushuang Shen, Hui Chen, Xuedian Zhang, Shuyu Xu, Jingzhou Xu, Tong Su","doi":"10.1007/s11571-025-10219-z","DOIUrl":"10.1007/s11571-025-10219-z","url":null,"abstract":"<p><p>Fatigue-induced incidents in transportation, aerospace, military, and other areas have been on the rise, posing a threat to human life and safety. The determination of fatigue states holds significant importance, especially through reliable and conveniently available physiological indicators. Here, a portable custom-built fNIRS system was used to monitor the fatigue state caused by nap deprivation. fNIRS signals in ten channels at the prefrontal cortex were collected, changes in blood oxygen concentration were analyzed, followed by a deep learning model to classify fatigue states. For the high-dimensionality and multi-channel characteristics of the fNIRS signal data, a novel 1D revised CNN-ResNet network was proposed based on the double-layer channel attenuation residual block. The results showed a 97.78% accuracy in fatigue state classification, significantly superior than several conventional methods. Furthermore, a fatigue-arousal experiment was designed to explore the feasibility of forced arousal of fatigued subjects through exercise stimulation. The fNIRS results showed a significant increase in brain activity with the conduction of exercise. The proposed method serves as a reliable tool for the evaluation of fatigue states, potentially reducing fatigue-induced harms and risks.</p>","PeriodicalId":10500,"journal":{"name":"Cognitive Neurodynamics","volume":"19 1","pages":"30"},"PeriodicalIF":3.1,"publicationDate":"2025-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11757655/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143045786","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-12-01Epub Date: 2025-01-09DOI: 10.1007/s11571-024-10200-2
Dongye Zhao, Bailu Si
Hippocampus in the mammalian brain supports navigation by building a cognitive map of the environment. However, only a few studies have investigated cognitive maps in large-scale arenas. To reveal the computational mechanisms underlying the formation of cognitive maps in large-scale environments, we propose a neural network model of the entorhinal-hippocampal neural circuit that integrates both spatial and non-spatial information. Spatial information is relayed from the grid units in medial entorhinal cortex (MEC) by integrating multimodal sensory-motor signals. Non-spatial, such as object, information is imparted from the visual units in lateral entorhinal cortex (LEC) by encoding visual scenes through a deep neural network. The synaptic weights from the grid units and the visual units to the place units in the hippocampus are learned by a competitive learning rule. We simulated the model in a large box maze. The place units in the model form irregularly-spaced multiple fields across the environment. When the strength of visual inputs is dominant, the responses of place units become conjunctive and egocentric. These results point to the key role of the hippocampus in balancing spatial and non-spatial information relayed via LEC and MEC.
{"title":"Formation of cognitive maps in large-scale environments by sensorimotor integration.","authors":"Dongye Zhao, Bailu Si","doi":"10.1007/s11571-024-10200-2","DOIUrl":"10.1007/s11571-024-10200-2","url":null,"abstract":"<p><p>Hippocampus in the mammalian brain supports navigation by building a cognitive map of the environment. However, only a few studies have investigated cognitive maps in large-scale arenas. To reveal the computational mechanisms underlying the formation of cognitive maps in large-scale environments, we propose a neural network model of the entorhinal-hippocampal neural circuit that integrates both spatial and non-spatial information. Spatial information is relayed from the grid units in medial entorhinal cortex (MEC) by integrating multimodal sensory-motor signals. Non-spatial, such as object, information is imparted from the visual units in lateral entorhinal cortex (LEC) by encoding visual scenes through a deep neural network. The synaptic weights from the grid units and the visual units to the place units in the hippocampus are learned by a competitive learning rule. We simulated the model in a large box maze. The place units in the model form irregularly-spaced multiple fields across the environment. When the strength of visual inputs is dominant, the responses of place units become conjunctive and egocentric. These results point to the key role of the hippocampus in balancing spatial and non-spatial information relayed via LEC and MEC.</p>","PeriodicalId":10500,"journal":{"name":"Cognitive Neurodynamics","volume":"19 1","pages":"19"},"PeriodicalIF":3.1,"publicationDate":"2025-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11717777/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142969953","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}