首页 > 最新文献

IEEE Access最新文献

英文 中文
A Physics-Informed Reinforcement Learning Approach to Vehicle-to-Grid Control With Real-Time Battery Degradation 基于物理信息的车辆-电网实时电池退化控制强化学习方法
IF 3.6 3区 计算机科学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2026-01-12 DOI: 10.1109/ACCESS.2026.3652847
Muhammad Ahsan Niazi;Vikram Kumar;Qazi Sajid;Mohsin Ali Koondhar;Yun-Su Kim;Muhammad Ammirrul Atiqi Mohd Zainuri;Ezzeddine Touti
The Vehicle-to-Grid (V2G) system has been a significant solution for enhancing the power grid’s stability and supporting renewable energy. However, the primary barrier to the practical application of V2G technology has been a fundamental economic conflict: the accelerated battery degradation from aggressive battery cycling required to provide grid services creates a direct trade-off between generating revenue from grid services and preserving the battery asset’s life. Therefore, the objective of this research is to develop an intelligent control framework to optimize both the profit generated from grid services provided and the longevity of the batteries. The proposed Physics-Informed Deep Reinforcement Learning (PI-DRL) framework utilizes a Digital Twin of the electrochemical behavior of the batteries to generate a real-time physics-based cost signal of degradation, which guides the learning of the policy by a deep reinforcement learning agent. Comprehensive VPP simulation results demonstrate that the proposed PI-DRL framework outperforms all benchmark approaches, achieving significant increases in net profitability and drastic reductions in fleet-wide capacity fade. The agent learned sophisticated control strategies, including making proactive deviations from the optimal control trajectory to avoid acute mechanical stress on the battery and optimizing operations across a heterogeneous fleet by using robust chemistries for high-intensity grid service tasks. A key implication of this research is that there is now a viable blueprint for the economically sustainable and equitable provision of V2G services, with an asset-preserving strategy being the most profitable method.
车辆到电网(V2G)系统已成为提高电网稳定性和支持可再生能源的重要解决方案。然而,V2G技术实际应用的主要障碍是一个根本的经济冲突:提供电网服务所需的积极电池循环会加速电池退化,这在从电网服务中获得收入和保护电池资产寿命之间造成了直接的权衡。因此,本研究的目的是开发一种智能控制框架,以优化所提供的电网服务产生的利润和电池的寿命。提出的基于物理的深度强化学习(PI-DRL)框架利用电池电化学行为的数字孪生来生成基于物理的实时退化成本信号,该信号指导深度强化学习代理对策略的学习。综合VPP仿真结果表明,所提出的PI-DRL框架优于所有基准方法,实现了净盈利能力的显著提高,并大幅减少了整个船队的容量衰减。智能体学习了复杂的控制策略,包括主动偏离最优控制轨迹,以避免对电池造成急性机械应力,以及通过使用强大的化学物质来优化异构车队的操作,以完成高强度的电网服务任务。这项研究的一个关键含义是,现在有一个可行的蓝图,经济上可持续和公平地提供V2G服务,其中资产保护策略是最有利可图的方法。
{"title":"A Physics-Informed Reinforcement Learning Approach to Vehicle-to-Grid Control With Real-Time Battery Degradation","authors":"Muhammad Ahsan Niazi;Vikram Kumar;Qazi Sajid;Mohsin Ali Koondhar;Yun-Su Kim;Muhammad Ammirrul Atiqi Mohd Zainuri;Ezzeddine Touti","doi":"10.1109/ACCESS.2026.3652847","DOIUrl":"https://doi.org/10.1109/ACCESS.2026.3652847","url":null,"abstract":"The Vehicle-to-Grid (V2G) system has been a significant solution for enhancing the power grid’s stability and supporting renewable energy. However, the primary barrier to the practical application of V2G technology has been a fundamental economic conflict: the accelerated battery degradation from aggressive battery cycling required to provide grid services creates a direct trade-off between generating revenue from grid services and preserving the battery asset’s life. Therefore, the objective of this research is to develop an intelligent control framework to optimize both the profit generated from grid services provided and the longevity of the batteries. The proposed Physics-Informed Deep Reinforcement Learning (PI-DRL) framework utilizes a Digital Twin of the electrochemical behavior of the batteries to generate a real-time physics-based cost signal of degradation, which guides the learning of the policy by a deep reinforcement learning agent. Comprehensive VPP simulation results demonstrate that the proposed PI-DRL framework outperforms all benchmark approaches, achieving significant increases in net profitability and drastic reductions in fleet-wide capacity fade. The agent learned sophisticated control strategies, including making proactive deviations from the optimal control trajectory to avoid acute mechanical stress on the battery and optimizing operations across a heterogeneous fleet by using robust chemistries for high-intensity grid service tasks. A key implication of this research is that there is now a viable blueprint for the economically sustainable and equitable provision of V2G services, with an asset-preserving strategy being the most profitable method.","PeriodicalId":13079,"journal":{"name":"IEEE Access","volume":"14 ","pages":"7879-7894"},"PeriodicalIF":3.6,"publicationDate":"2026-01-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11345442","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145982151","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Physics-Informed Deep Learning for 2-D Phased Array Beamforming With Imperfection Tolerance 具有缺陷容限的二维相控阵波束形成的物理信息深度学习
IF 3.6 3区 计算机科学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2026-01-12 DOI: 10.1109/ACCESS.2026.3651811
Tarek Sallam;Ahmed M. Attiya
Designing efficient and reliable two-dimensional (2D) beamforming for phased array antennas (PAAs) remains a significant challenge because of the heavy computational demands involved. In this study, we introduce a beamforming framework that adopts a physics-informed deep neural network (PIDNN). The proposed model incorporates physical principles directly into the training process through a customized loss function, which minimizes the mean squared error between the array response and a target reference signal. The beamforming weights produced by the PIDNN are systematically evaluated against the theoretically optimal Wiener solution. To assess robustness, the method is implemented on an $8times 8$ PAA and evaluated against both a shallow architecture—the radial basis function neural network (RBFNN)—and a deeper model, the convolutional neural network (CNN). Moreover, the PIDNN is applied to a large PAA to assess its performance when the number of antenna elements increases. Experimental results demonstrate that the PIDNN closely approximates Wiener-optimal weights while maintaining robustness to array imperfections and achieving this with markedly reduced computational cost.
为相控阵天线(PAAs)设计高效可靠的二维波束形成仍然是一个重大挑战,因为涉及到大量的计算需求。在本研究中,我们引入了一种采用物理信息深度神经网络(PIDNN)的波束形成框架。该模型通过自定义损失函数将物理原理直接融入到训练过程中,使阵列响应与目标参考信号之间的均方误差最小化。根据理论上最优的维纳解,系统地评估了PIDNN产生的波束形成权重。为了评估鲁棒性,该方法在$8 × 8$ PAA上实现,并针对浅结构-径向基函数神经网络(RBFNN)和更深的模型-卷积神经网络(CNN)进行评估。此外,将PIDNN应用于大型PAA中,以评估其在天线单元数量增加时的性能。实验结果表明,PIDNN非常接近维纳最优权重,同时保持对阵列缺陷的鲁棒性,并且显著降低了计算成本。
{"title":"Physics-Informed Deep Learning for 2-D Phased Array Beamforming With Imperfection Tolerance","authors":"Tarek Sallam;Ahmed M. Attiya","doi":"10.1109/ACCESS.2026.3651811","DOIUrl":"https://doi.org/10.1109/ACCESS.2026.3651811","url":null,"abstract":"Designing efficient and reliable two-dimensional (2D) beamforming for phased array antennas (PAAs) remains a significant challenge because of the heavy computational demands involved. In this study, we introduce a beamforming framework that adopts a physics-informed deep neural network (PIDNN). The proposed model incorporates physical principles directly into the training process through a customized loss function, which minimizes the mean squared error between the array response and a target reference signal. The beamforming weights produced by the PIDNN are systematically evaluated against the theoretically optimal Wiener solution. To assess robustness, the method is implemented on an <inline-formula> <tex-math>$8times 8$ </tex-math></inline-formula> PAA and evaluated against both a shallow architecture—the radial basis function neural network (RBFNN)—and a deeper model, the convolutional neural network (CNN). Moreover, the PIDNN is applied to a large PAA to assess its performance when the number of antenna elements increases. Experimental results demonstrate that the PIDNN closely approximates Wiener-optimal weights while maintaining robustness to array imperfections and achieving this with markedly reduced computational cost.","PeriodicalId":13079,"journal":{"name":"IEEE Access","volume":"14 ","pages":"7841-7849"},"PeriodicalIF":3.6,"publicationDate":"2026-01-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11339486","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145982149","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Item Response Time Analysis Using Ex-Gaussian Distribution for Disengagement Detection in Online Low-Stakes Tests 基于前高斯分布的在线低风险测试脱离检测项目反应时间分析
IF 3.6 3区 计算机科学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2026-01-12 DOI: 10.1109/ACCESS.2026.3652353
N. Chotikakamthorn
This study addresses the problem of detecting disengagement in online low-stakes tests used in blended learning within higher education. The detection method was developed based on an analysis of item responses and associated response times. The method applied the ex-Gaussian mixture model to response times, rather than the conventional lognormal model. The mixture component with the smallest Gaussian mean was chosen to represent the response times distribution of early correct responses. The selected mixture component was used to obtain the model’s mode, which then served as the threshold for classifying item responses into early and subsequent response groups. Based on the two classified groups, descriptive statistics and graphical visualizations were introduced to support manual inspection and provide insight into item- and person-level characteristics. A test statistic for disengagement detection was formulated based on the distribution of the number of early responses. Drawing on prior knowledge of the success probabilities associated with disengaged responses, two detection boundaries were defined to classify item-preknowledge and rapid-guessing behaviors. Unlike existing model-based methods for rapid guessing and item preknowledge behavior detections, the proposed non-parametric method does not require prior knowledge of item or person parameters, nor does it involve modeling or estimating such characteristics. The method’s performance was assessed using both real and simulated data, and results for true positive rates and false positive rates were reported under various test conditions. The findings indicate that the method’s performance improves with an increasing number of test items and a higher proportion of disengaged responses. Simulation results further demonstrated the method’s robustness to measurement error and small variations in response times, in contrast to the person-level adaptation of the NT10 and CUMP methods, whose performance varied significantly under the same conditions.
本研究解决了在高等教育混合学习中使用的在线低风险测试中检测脱离的问题。检测方法是基于对项目反应和相关反应时间的分析而开发的。该方法采用前高斯混合模型来计算响应时间,而不是传统的对数正态模型。选取高斯均值最小的混合分量表示早期正确响应的响应时间分布。选择的混合分量用于获得模型的模式,然后作为阈值将项目反应分为早期和随后的反应组。基于这两个分类组,引入了描述性统计和图形可视化来支持人工检查,并提供对项目和个人级别特征的洞察。根据早期反应数的分布,建立了脱离接触检测的检验统计量。利用与不参与反应相关的成功概率的先验知识,定义了两个检测边界来分类项目预知和快速猜测行为。与现有的基于模型的快速猜测和项目预知行为检测方法不同,本文提出的非参数方法不需要对项目或人员参数的先验知识,也不涉及对这些特征的建模或估计。使用真实和模拟数据对该方法的性能进行了评估,并报告了各种测试条件下的真阳性率和假阳性率的结果。研究结果表明,该方法的性能随着测试项目数量的增加和不参与反应比例的提高而提高。仿真结果进一步证明了该方法对测量误差和响应时间变化的鲁棒性,而NT10和CUMP方法的个人水平适应性在相同条件下变化显著。
{"title":"Item Response Time Analysis Using Ex-Gaussian Distribution for Disengagement Detection in Online Low-Stakes Tests","authors":"N. Chotikakamthorn","doi":"10.1109/ACCESS.2026.3652353","DOIUrl":"https://doi.org/10.1109/ACCESS.2026.3652353","url":null,"abstract":"This study addresses the problem of detecting disengagement in online low-stakes tests used in blended learning within higher education. The detection method was developed based on an analysis of item responses and associated response times. The method applied the ex-Gaussian mixture model to response times, rather than the conventional lognormal model. The mixture component with the smallest Gaussian mean was chosen to represent the response times distribution of early correct responses. The selected mixture component was used to obtain the model’s mode, which then served as the threshold for classifying item responses into early and subsequent response groups. Based on the two classified groups, descriptive statistics and graphical visualizations were introduced to support manual inspection and provide insight into item- and person-level characteristics. A test statistic for disengagement detection was formulated based on the distribution of the number of early responses. Drawing on prior knowledge of the success probabilities associated with disengaged responses, two detection boundaries were defined to classify item-preknowledge and rapid-guessing behaviors. Unlike existing model-based methods for rapid guessing and item preknowledge behavior detections, the proposed non-parametric method does not require prior knowledge of item or person parameters, nor does it involve modeling or estimating such characteristics. The method’s performance was assessed using both real and simulated data, and results for true positive rates and false positive rates were reported under various test conditions. The findings indicate that the method’s performance improves with an increasing number of test items and a higher proportion of disengaged responses. Simulation results further demonstrated the method’s robustness to measurement error and small variations in response times, in contrast to the person-level adaptation of the NT10 and CUMP methods, whose performance varied significantly under the same conditions.","PeriodicalId":13079,"journal":{"name":"IEEE Access","volume":"14 ","pages":"7860-7878"},"PeriodicalIF":3.6,"publicationDate":"2026-01-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11343808","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145982150","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Examination of Learning-Augmented Approaches for Initializing Distributed Optimal Power Flow With Consensus ADMM 用一致ADMM初始化分布式最优潮流的学习增强方法的检验
IF 3.6 3区 计算机科学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2026-01-12 DOI: 10.1109/ACCESS.2026.3652328
Woohyeong Lee;Byungkwon Park
The rapid integration of distributed energy resources requires optimization and control of power systems with many controllable devices, driving a growing interest in efficient distributed optimization algorithms. To this end, this paper proposes and examines learning-augmented initialization to enhance the convergence speed of the Alternating Direction Method of Multipliers (ADMM) for solving the distributed DC and AC optimal power flow (OPF) problems. The core concept is to leverage deep learning techniques, designed with feedforward and recurrent neural networks, as auxiliary tools to accelerate the convergence of ADMM. We perform comprehensive numerical case studies and empirically validate the benefits of the proposed methods on the IEEE 14, 118, and 1888-bus test networks in the DC model and IEEE 14, 118, and 2746-bus test networks in the AC model under different loading scenarios. In particular, the proposed method has achieved up to a 78% reduction in the average number of ADMM iterations for the DC-OPF problem and a 48% reduction for the AC-OPF problem. These findings illustrate the significant potential of combining deep learning frameworks with ADMM and possibly other distributed optimization algorithms to enhance the efficiency and reliability of future power system and energy market operations.
分布式能源的快速集成需要对具有许多可控设备的电力系统进行优化和控制,这推动了人们对高效分布式优化算法的兴趣日益浓厚。为此,本文提出并研究了学习增强初始化,以提高乘法器交替方向法(ADMM)求解分布式直流和交流最优潮流(OPF)问题的收敛速度。核心概念是利用深度学习技术,采用前馈和循环神经网络设计,作为辅助工具来加速ADMM的收敛。我们进行了全面的数值案例研究,并在不同负载情况下,在直流模型的IEEE 14、118和1888总线测试网络以及交流模型的IEEE 14、118和2746总线测试网络上验证了所提出方法的有效性。特别是,所提出的方法在DC-OPF问题上实现了高达78%的ADMM迭代平均次数减少,在AC-OPF问题上减少了48%。这些发现表明,将深度学习框架与ADMM以及其他分布式优化算法相结合,以提高未来电力系统和能源市场运营的效率和可靠性,具有巨大的潜力。
{"title":"Examination of Learning-Augmented Approaches for Initializing Distributed Optimal Power Flow With Consensus ADMM","authors":"Woohyeong Lee;Byungkwon Park","doi":"10.1109/ACCESS.2026.3652328","DOIUrl":"https://doi.org/10.1109/ACCESS.2026.3652328","url":null,"abstract":"The rapid integration of distributed energy resources requires optimization and control of power systems with many controllable devices, driving a growing interest in efficient distributed optimization algorithms. To this end, this paper proposes and examines learning-augmented initialization to enhance the convergence speed of the Alternating Direction Method of Multipliers (ADMM) for solving the distributed DC and AC optimal power flow (OPF) problems. The core concept is to leverage deep learning techniques, designed with feedforward and recurrent neural networks, as auxiliary tools to accelerate the convergence of ADMM. We perform comprehensive numerical case studies and empirically validate the benefits of the proposed methods on the IEEE 14, 118, and 1888-bus test networks in the DC model and IEEE 14, 118, and 2746-bus test networks in the AC model under different loading scenarios. In particular, the proposed method has achieved up to a 78% reduction in the average number of ADMM iterations for the DC-OPF problem and a 48% reduction for the AC-OPF problem. These findings illustrate the significant potential of combining deep learning frameworks with ADMM and possibly other distributed optimization algorithms to enhance the efficiency and reliability of future power system and energy market operations.","PeriodicalId":13079,"journal":{"name":"IEEE Access","volume":"14 ","pages":"7600-7615"},"PeriodicalIF":3.6,"publicationDate":"2026-01-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11343750","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145982265","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Bridging Domain Gaps With ProtoAlign: Teacher–Student Few-Shot Prototype Alignment for Cross-Domain Spoken Language Recognition 用ProtoAlign弥合领域差距:跨领域口语识别的师生少镜头原型对齐
IF 3.6 3区 计算机科学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2026-01-12 DOI: 10.1109/ACCESS.2026.3653462
Omkar Vilas Sawant;Anirban Bhowmick
Spoken language recognition in low-resource settings is hindered by domain shift and limited labeled data. We propose ProtoAlign, a teacher–student few-shot prototype alignment framework that learns domain-invariant, language-discriminative representations with minimal target supervision. The student uses a compact transformer-style backbone with Feature Reweighting Layer (FRL). Source-domain class prototypes are maintained as exponential moving averages and serve as stable anchors. A target-to-source Information Noise Contrastive Estimate(InfoNCE) alignment term pulls few-shot target embeddings toward their language-matched source prototypes, while a lightweight knowledge-distillation loss from a source-only teacher preserves source accuracy. Warm-start schedules for the alignment and distillation weights stabilize optimization, and a pairing sampler ensures each batch contains target samples with same-language source counterparts. We evaluate language recognition performance across five heterogeneous domains, namely All India Radio (AIR), Common Voice (CV), Kaggle, the Indian Institute of Technology Hyderabad (IIT-H), and Indic TTS datasets. With at most ten labeled target examples per language, ProtoAlign consistently outperforms a strong transformer baseline in cross-domain tests and produces visibly tighter, more domain-invariant clusters in the embedding space. These results indicate that prototype anchoring combined with gentle teacher guidance provides a simple, scalable, and label-efficient path to robust cross-domain spoken language recognition.
在低资源环境下的口语识别受到领域转移和有限的标记数据的阻碍。我们提出了ProtoAlign,这是一个师生少镜头原型对齐框架,它在最小的目标监督下学习领域不变的语言判别表示。该学生使用具有特征重加权层(FRL)的紧凑变压器式主干。源域类原型作为指数移动平均线进行维护,并作为稳定的锚点。目标到源的信息噪声对比估计(InfoNCE)对齐术语将少数目标嵌入拉向其语言匹配的源原型,而来自仅源的教师的轻量级知识蒸馏损失保留了源的准确性。热启动时间表对准和蒸馏重量稳定优化,配对采样器确保每批包含目标样本与相同的语言源对应。我们评估了五个异构领域的语言识别性能,即全印度广播(AIR),共同声音(CV), Kaggle,海德拉巴印度理工学院(IIT-H)和印度TTS数据集。每种语言最多有10个标记的目标示例,ProtoAlign在跨领域测试中始终优于强大的transformer基线,并在嵌入空间中产生明显更紧密、更不变性的集群。这些结果表明,原型锚定与温和的教师指导相结合,为稳健的跨领域口语识别提供了一个简单、可扩展和标签高效的途径。
{"title":"Bridging Domain Gaps With ProtoAlign: Teacher–Student Few-Shot Prototype Alignment for Cross-Domain Spoken Language Recognition","authors":"Omkar Vilas Sawant;Anirban Bhowmick","doi":"10.1109/ACCESS.2026.3653462","DOIUrl":"https://doi.org/10.1109/ACCESS.2026.3653462","url":null,"abstract":"Spoken language recognition in low-resource settings is hindered by domain shift and limited labeled data. We propose ProtoAlign, a teacher–student few-shot prototype alignment framework that learns domain-invariant, language-discriminative representations with minimal target supervision. The student uses a compact transformer-style backbone with Feature Reweighting Layer (FRL). Source-domain class prototypes are maintained as exponential moving averages and serve as stable anchors. A target-to-source Information Noise Contrastive Estimate(InfoNCE) alignment term pulls few-shot target embeddings toward their language-matched source prototypes, while a lightweight knowledge-distillation loss from a source-only teacher preserves source accuracy. Warm-start schedules for the alignment and distillation weights stabilize optimization, and a pairing sampler ensures each batch contains target samples with same-language source counterparts. We evaluate language recognition performance across five heterogeneous domains, namely All India Radio (AIR), Common Voice (CV), Kaggle, the Indian Institute of Technology Hyderabad (IIT-H), and Indic TTS datasets. With at most ten labeled target examples per language, ProtoAlign consistently outperforms a strong transformer baseline in cross-domain tests and produces visibly tighter, more domain-invariant clusters in the embedding space. These results indicate that prototype anchoring combined with gentle teacher guidance provides a simple, scalable, and label-efficient path to robust cross-domain spoken language recognition.","PeriodicalId":13079,"journal":{"name":"IEEE Access","volume":"14 ","pages":"10635-10653"},"PeriodicalIF":3.6,"publicationDate":"2026-01-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11346933","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146026377","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Generalized ECG Heartbeat Classification Using Time-Series Transformers 基于时间序列变压器的心电心跳广义分类
IF 3.6 3区 计算机科学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2026-01-12 DOI: 10.1109/ACCESS.2026.3651029
Timo de Waele;Jaron Fontaine;Eli de Poorter;Adnan Shahid
This research investigates Time-Series Transformer architectures for Electrocardiogram (ECG) heartbeat classification, particularly focusing on their generalization capabilities towards new patients and varying signal sampling rates, a critical challenge in real-world clinical applications. This study conducts a systematic comparison between Transformers and Convolutional Neural Network (CNN) models using the St. Petersburg Institute of Cardiological Technics 12-lead Arrhythmia Database (INCART). Key aspects explored include the impact of different input modalities (raw ECG, Continuous Wavelet Transform (CWT) scalograms, and their combination), various Positional Encoding (PE) schemes for Transformers, and the effect of integrating expert-derived RR interval features through different feature fusion techniques. Transformers, especially with concatenation-based PE schemes and CWT or combined ECG+CWT inputs, consistently outperformed CNNs in classification accuracy and generalization when expert features were not used. They demonstrated more than 30% better generalization to unseen patients, and 20% better generalization to unseen patients and sampling rates. Ultimately, this study emphasizes that robust ECG classifiers depend heavily on deliberate architectural choices, positional encoding schemes, input representations, and the integration of expert features to handle inter-patient and sampling rate variations. Importantly, it also demonstrates that Time-Series Transformers can achieve strong results even with relatively modest model sizes and datasets.
本研究探讨了用于心电图(ECG)心跳分类的时序变压器架构,特别关注它们对新患者和不同信号采样率的泛化能力,这是现实世界临床应用中的一个关键挑战。本研究使用圣彼得堡心脏病技术研究所12导联心律失常数据库(INCART)对变压器和卷积神经网络(CNN)模型进行了系统比较。研究的关键方面包括不同输入模式(原始心电、连续小波变换(CWT)尺度图及其组合)的影响,变压器的各种位置编码(PE)方案,以及通过不同的特征融合技术整合专家衍生的RR区间特征的影响。当不使用专家特征时,变压器,特别是基于串联的PE方案和CWT或ECG+CWT组合输入时,在分类精度和泛化方面始终优于cnn。他们对未见过的病人的泛化率提高了30%,对未见过的病人的泛化率提高了20%。最后,本研究强调,鲁棒ECG分类器在很大程度上依赖于深思熟虑的架构选择、位置编码方案、输入表示和专家特征的集成,以处理患者之间和采样率的变化。重要的是,它还表明,即使使用相对适度的模型大小和数据集,时间序列变压器也可以获得强有力的结果。
{"title":"Generalized ECG Heartbeat Classification Using Time-Series Transformers","authors":"Timo de Waele;Jaron Fontaine;Eli de Poorter;Adnan Shahid","doi":"10.1109/ACCESS.2026.3651029","DOIUrl":"https://doi.org/10.1109/ACCESS.2026.3651029","url":null,"abstract":"This research investigates Time-Series Transformer architectures for Electrocardiogram (ECG) heartbeat classification, particularly focusing on their generalization capabilities towards new patients and varying signal sampling rates, a critical challenge in real-world clinical applications. This study conducts a systematic comparison between Transformers and Convolutional Neural Network (CNN) models using the St. Petersburg Institute of Cardiological Technics 12-lead Arrhythmia Database (INCART). Key aspects explored include the impact of different input modalities (raw ECG, Continuous Wavelet Transform (CWT) scalograms, and their combination), various Positional Encoding (PE) schemes for Transformers, and the effect of integrating expert-derived RR interval features through different feature fusion techniques. Transformers, especially with concatenation-based PE schemes and CWT or combined ECG+CWT inputs, consistently outperformed CNNs in classification accuracy and generalization when expert features were not used. They demonstrated more than 30% better generalization to unseen patients, and 20% better generalization to unseen patients and sampling rates. Ultimately, this study emphasizes that robust ECG classifiers depend heavily on deliberate architectural choices, positional encoding schemes, input representations, and the integration of expert features to handle inter-patient and sampling rate variations. Importantly, it also demonstrates that Time-Series Transformers can achieve strong results even with relatively modest model sizes and datasets.","PeriodicalId":13079,"journal":{"name":"IEEE Access","volume":"14 ","pages":"7699-7714"},"PeriodicalIF":3.6,"publicationDate":"2026-01-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11347064","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145982132","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Machine Learning for Wi-Fi Intrusion Detection: A Comparative Study of Accuracy, Explainability, and Adversarial Robustness Wi-Fi入侵检测的机器学习:准确性、可解释性和对抗鲁棒性的比较研究
IF 3.6 3区 计算机科学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2026-01-12 DOI: 10.1109/ACCESS.2026.3651992
Anna Ledwoń;Marek Natkaniec
The rapid advancement of artificial intelligence (AI) has sparked extensive discussions regarding its potential to enhance daily life through various applications. Among these, machine learning (ML) has emerged as a promising tool for detecting threats in Wi-Fi networks, a domain increasingly vulnerable to attacks due to the widespread use of wireless communication. This paper addresses the limitations of existing research, which often relies on standard metrics without a comprehensive analysis of model performance, explainability, and robustness. The study aims to provide an in-depth evaluation of ML models for Wi-Fi threat detection by employing metrics such as accuracy, precision, recall, and F1-score, alongside confusion matrices to assess classification effectiveness. Additionally, the research will analyze training and inference times, model sizes, and the impact of features using Shapley Additive Explanations (SHAP) values. Misclassifications will be scrutinized to identify potential errors stemming from dataset properties, emphasizing the necessity of thorough dataset preprocessing for broader applicability. Furthermore, the robustness of the models will be tested against adversarial attacks tailored for Wi-Fi detection. The findings will culminate in a comparative analysis of the models, underscoring the significance of each methodological step and the potential consequences of neglecting critical aspects. This work aims to contribute to the field by enhancing the understanding of ML applications in cybersecurity and promoting the development of more reliable and explainable detection systems.
人工智能(AI)的快速发展引发了关于其通过各种应用改善日常生活的潜力的广泛讨论。其中,机器学习(ML)已成为检测Wi-Fi网络威胁的有前途的工具,由于无线通信的广泛使用,Wi-Fi网络越来越容易受到攻击。本文解决了现有研究的局限性,这些研究通常依赖于标准度量,而没有对模型性能、可解释性和鲁棒性进行全面分析。该研究旨在通过采用准确性、精密度、召回率和f1分数等指标,以及混淆矩阵来评估分类效果,对Wi-Fi威胁检测的ML模型进行深入评估。此外,该研究将使用Shapley加性解释(SHAP)值分析训练和推理时间、模型大小以及特征的影响。错误分类将被仔细审查,以识别源于数据集属性的潜在错误,强调彻底的数据集预处理的必要性,以获得更广泛的适用性。此外,将测试模型的鲁棒性,以抵御针对Wi-Fi检测量身定制的对抗性攻击。最后将对模型进行比较分析,强调每个方法步骤的重要性以及忽视关键方面的潜在后果。这项工作旨在通过增强对机器学习在网络安全中的应用的理解,促进更可靠和可解释的检测系统的开发,从而为该领域做出贡献。
{"title":"Machine Learning for Wi-Fi Intrusion Detection: A Comparative Study of Accuracy, Explainability, and Adversarial Robustness","authors":"Anna Ledwoń;Marek Natkaniec","doi":"10.1109/ACCESS.2026.3651992","DOIUrl":"https://doi.org/10.1109/ACCESS.2026.3651992","url":null,"abstract":"The rapid advancement of artificial intelligence (AI) has sparked extensive discussions regarding its potential to enhance daily life through various applications. Among these, machine learning (ML) has emerged as a promising tool for detecting threats in Wi-Fi networks, a domain increasingly vulnerable to attacks due to the widespread use of wireless communication. This paper addresses the limitations of existing research, which often relies on standard metrics without a comprehensive analysis of model performance, explainability, and robustness. The study aims to provide an in-depth evaluation of ML models for Wi-Fi threat detection by employing metrics such as accuracy, precision, recall, and F1-score, alongside confusion matrices to assess classification effectiveness. Additionally, the research will analyze training and inference times, model sizes, and the impact of features using Shapley Additive Explanations (SHAP) values. Misclassifications will be scrutinized to identify potential errors stemming from dataset properties, emphasizing the necessity of thorough dataset preprocessing for broader applicability. Furthermore, the robustness of the models will be tested against adversarial attacks tailored for Wi-Fi detection. The findings will culminate in a comparative analysis of the models, underscoring the significance of each methodological step and the potential consequences of neglecting critical aspects. This work aims to contribute to the field by enhancing the understanding of ML applications in cybersecurity and promoting the development of more reliable and explainable detection systems.","PeriodicalId":13079,"journal":{"name":"IEEE Access","volume":"14 ","pages":"7582-7599"},"PeriodicalIF":3.6,"publicationDate":"2026-01-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11340615","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145982134","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
From Chat to Academia: Calibrating Formality in Low-Resource Languages 从聊天到学术:低资源语言的正式性校准
IF 3.6 3区 计算机科学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2026-01-12 DOI: 10.1109/ACCESS.2026.3652343
Seda Efendioglu;Huseyin Pehlivan
How formal should a sentence sound? The answer is rarely limited to formal or informal. In natural communication, formality changes gradually from casual conversation to professional writing and highly academic prose. However, for many low-resource languages, this continuum of graded stylistic shifts remains largely unmodeled. Turkish, despite its rich stylistic variation, still lacks a systematic framework for capturing such gradual shifts. This study introduces LyreSense, a framework designed to represent the full spectrum of formality in Turkish. We integrate human-written texts with annotation assisted by large language models (LLMs) and controlled synthetic text generation to construct a stylistically diverse corpus. Building on this, we propose a style-intensity–calibrated triplet loss that adapts its margin to differences in formality, enabling embeddings to disentangle subtle stylistic variation independently of semantic content. To train efficiently while preserving model capacity, we apply Low-Rank Adaptation (LoRA) during fine-tuning. Experiments across four incremental formality classes (informal, neutral, formal, and highly formal) demonstrate that LyreSense achieves Macro-F1 of 0.69. Misclassifications are concentrated between adjacent categories, reflecting the natural continuity of formality, while extreme classes are consistently distinguished. LyreSense is more than a framework for Turkish: it establishes a scalable, language-agnostic pipeline for style-sensitive NLP in low-resource settings. By moving beyond binary style distinctions, it demonstrates how lightweight, efficient models can provide nuanced, human-like style awareness for both research and practical applications.
一个句子听起来应该有多正式?答案很少局限于正式或非正式。在自然交流中,形式从随意的谈话逐渐转变为专业的写作和高度学术的散文。然而,对于许多资源贫乏的语言来说,这种连续的分级风格变化在很大程度上仍然没有建模。土耳其语,尽管其丰富的风格变化,仍然缺乏一个系统的框架来捕捉这种渐进的变化。本研究介绍了LyreSense,一个旨在代表土耳其语中所有正式形式的框架。我们将人工编写的文本与大型语言模型(llm)辅助的注释结合起来,并控制合成文本生成,以构建风格多样的语料库。在此基础上,我们提出了一种风格强度校准的三连音损失,它可以根据形式的差异调整其边际,使嵌入能够独立于语义内容分离微妙的风格变化。为了在保持模型容量的同时有效地训练,我们在微调过程中应用了低秩自适应(LoRA)。在四个增量形式类(非正式、中性、正式和高度正式)中进行的实验表明,LyreSense实现了0.69的Macro-F1。错误的分类集中在相邻的类别之间,反映了形式的自然连续性,而极端的类别则始终被区分开来。LyreSense不仅仅是一个土耳其语框架:它为低资源环境下的风格敏感NLP建立了一个可扩展的、与语言无关的管道。通过超越二元风格的区别,它展示了轻量级、高效的模型如何为研究和实际应用提供细微的、类似人类的风格意识。
{"title":"From Chat to Academia: Calibrating Formality in Low-Resource Languages","authors":"Seda Efendioglu;Huseyin Pehlivan","doi":"10.1109/ACCESS.2026.3652343","DOIUrl":"https://doi.org/10.1109/ACCESS.2026.3652343","url":null,"abstract":"How formal should a sentence sound? The answer is rarely limited to formal or informal. In natural communication, formality changes gradually from casual conversation to professional writing and highly academic prose. However, for many low-resource languages, this continuum of graded stylistic shifts remains largely unmodeled. Turkish, despite its rich stylistic variation, still lacks a systematic framework for capturing such gradual shifts. This study introduces LyreSense, a framework designed to represent the full spectrum of formality in Turkish. We integrate human-written texts with annotation assisted by large language models (LLMs) and controlled synthetic text generation to construct a stylistically diverse corpus. Building on this, we propose a style-intensity–calibrated triplet loss that adapts its margin to differences in formality, enabling embeddings to disentangle subtle stylistic variation independently of semantic content. To train efficiently while preserving model capacity, we apply Low-Rank Adaptation (LoRA) during fine-tuning. Experiments across four incremental formality classes (informal, neutral, formal, and highly formal) demonstrate that LyreSense achieves Macro-F1 of 0.69. Misclassifications are concentrated between adjacent categories, reflecting the natural continuity of formality, while extreme classes are consistently distinguished. LyreSense is more than a framework for Turkish: it establishes a scalable, language-agnostic pipeline for style-sensitive NLP in low-resource settings. By moving beyond binary style distinctions, it demonstrates how lightweight, efficient models can provide nuanced, human-like style awareness for both research and practical applications.","PeriodicalId":13079,"journal":{"name":"IEEE Access","volume":"14 ","pages":"7776-7791"},"PeriodicalIF":3.6,"publicationDate":"2026-01-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11343771","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145982133","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Large-Scale Crop Image Recognition Based on ConvLSTM Spatiotemporal Feature Extraction and LCMST-Net Model 基于ConvLSTM时空特征提取和LCMST-Net模型的大规模农作物图像识别
IF 3.6 3区 计算机科学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2026-01-12 DOI: 10.1109/ACCESS.2026.3653414
Luoyi Feng;Sha Zong
With the advancement of agricultural modernization, the role of large-scale crop image recognition in precision agriculture, pest and disease monitoring, and crop yield prediction has become increasingly significant. To enhance the accuracy and efficiency of large-scale crop image recognition, this study introduces coordinate attention mechanism and ASPP module to improve the ConvLSTM model. It further combines the Localized Convolutional Multi-Scale Temporal Network (LCMST-Net) model to efficiently classify and recognize the extracted spatiotemporal features. In the results, the improved ConvLSTM achieved a 0.979 accuracy and a 0.926 recall. The average classification accuracy of LCMST-Net was 0.982, significantly higher than the control model. LCMST-Net performed well in metrics including MSE and MAE, further validating its advantages in prediction accuracy and classification performance. Research has shown that improved ConvLSTM and LCMST-Net models have significant advantages in feature extraction and classification performance, especially when dealing with complex spatiotemporal features, they can more accurately identify different types of crops. This study contributes to the automation and intelligence of crop growth monitoring, improving agricultural production efficiency and resource utilization efficiency.
随着农业现代化进程的推进,大规模作物图像识别在精准农业、病虫害监测、作物产量预测等方面的作用日益显著。为了提高大规模作物图像识别的准确性和效率,本研究引入了坐标注意机制和ASPP模块对ConvLSTM模型进行改进。结合局部卷积多尺度时间网络(LCMST-Net)模型对提取的时空特征进行有效分类和识别。结果表明,改进后的ConvLSTM准确率为0.979,召回率为0.926。LCMST-Net的平均分类准确率为0.982,显著高于对照模型。LCMST-Net在MSE和MAE等指标上表现良好,进一步验证了其在预测精度和分类性能上的优势。研究表明,改进的ConvLSTM和LCMST-Net模型在特征提取和分类性能上具有显著优势,特别是在处理复杂的时空特征时,可以更准确地识别不同类型的作物。该研究有助于作物生长监测的自动化和智能化,提高农业生产效率和资源利用效率。
{"title":"Large-Scale Crop Image Recognition Based on ConvLSTM Spatiotemporal Feature Extraction and LCMST-Net Model","authors":"Luoyi Feng;Sha Zong","doi":"10.1109/ACCESS.2026.3653414","DOIUrl":"https://doi.org/10.1109/ACCESS.2026.3653414","url":null,"abstract":"With the advancement of agricultural modernization, the role of large-scale crop image recognition in precision agriculture, pest and disease monitoring, and crop yield prediction has become increasingly significant. To enhance the accuracy and efficiency of large-scale crop image recognition, this study introduces coordinate attention mechanism and ASPP module to improve the ConvLSTM model. It further combines the Localized Convolutional Multi-Scale Temporal Network (LCMST-Net) model to efficiently classify and recognize the extracted spatiotemporal features. In the results, the improved ConvLSTM achieved a 0.979 accuracy and a 0.926 recall. The average classification accuracy of LCMST-Net was 0.982, significantly higher than the control model. LCMST-Net performed well in metrics including MSE and MAE, further validating its advantages in prediction accuracy and classification performance. Research has shown that improved ConvLSTM and LCMST-Net models have significant advantages in feature extraction and classification performance, especially when dealing with complex spatiotemporal features, they can more accurately identify different types of crops. This study contributes to the automation and intelligence of crop growth monitoring, improving agricultural production efficiency and resource utilization efficiency.","PeriodicalId":13079,"journal":{"name":"IEEE Access","volume":"14 ","pages":"10719-10734"},"PeriodicalIF":3.6,"publicationDate":"2026-01-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11347006","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146026405","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Ensemble Machine Learning Model for Inner Speech Recognition: A Subject-Specific Investigation 内部语音识别的集成机器学习模型:一个特定主题的研究
IF 3.6 3区 计算机科学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2026-01-06 DOI: 10.1109/ACCESS.2025.3644494
Shahamat Mustavi Tasin;Muhammad E. H. Chowdhury;Shona Pedersen;Malek Chabbouh;Diala Bushnaq;Raghad Aljindi;Saidul Kabir;Anwarul Hasan
Inner speech recognition has gained enormous interest in recent years due to its applications in rehabilitation, developing assistive technology, and cognitive assessment. However, since language and speech productions are a complex process, for which identifying speech components has remained a challenging task. Different approaches were taken previously to reach this goal, but new approaches remain to be explored. Also, a subject-oriented analysis is necessary to understand the underlying brain dynamics during inner speech production, which can bring novel methods to neurological research. A publicly available dataset, “Thinking Out Loud Dataset,” has been used to develop a Machine Learning (ML)-based technique to classify inner speech using 128-channel surface Electroencephalography (EEG) signals. The dataset is collected on a Spanish cohort of ten subjects while uttering four words (“Arriba,” “Abajo,” “Derecha,” and “Izquierda”) by each participant. Statistical methods were employed to detect and remove motion artifacts from the signals. A large number (191 per channel) of time-, frequency- and time-frequency-domain features were extracted. Eight feature selection algorithms are explored, and the best feature selection technique is selected for subsequent evaluations. The performance of six ML algorithms is evaluated, and an ensemble model is proposed. Deep Learning models are also explored, and the results are compared with the classical ML approach. The proposed ensemble model, by stacking the five best logistic regression models, generated a promising result with an overall accuracy of 81.13% and an F1 score of 81.12% in the classification of four inner speech words using surface EEG signals.
近年来,内部语音识别因其在康复、开发辅助技术和认知评估方面的应用而获得了极大的关注。然而,由于语言和语音产生是一个复杂的过程,因此识别语音成分仍然是一项具有挑战性的任务。以前采取了不同的方法来实现这一目标,但新的方法仍有待探索。此外,以主体为导向的分析是了解内在言语产生过程中潜在的大脑动态的必要条件,这可以为神经学研究带来新的方法。一个公开可用的数据集“Thinking Out Loud dataset”已被用于开发一种基于机器学习(ML)的技术,该技术使用128通道表面脑电图(EEG)信号对内部语音进行分类。数据集是在一个西班牙队列中收集的,每个参与者说四个词(“Arriba”,“Abajo”,“Derecha”和“Izquierda”)。采用统计方法检测和去除信号中的运动伪影。提取了大量的时间域、频率域和时频域特征(每通道191个)。探索了八种特征选择算法,并选择了最佳特征选择技术进行后续评价。对六种机器学习算法的性能进行了评价,并提出了一个集成模型。还对深度学习模型进行了探索,并将结果与经典ML方法进行了比较。本文提出的集成模型将5个最佳逻辑回归模型进行叠加,得到了利用表面脑电信号对4个内部语音词进行分类的总体准确率为81.13%,F1得分为81.12%的结果。
{"title":"Ensemble Machine Learning Model for Inner Speech Recognition: A Subject-Specific Investigation","authors":"Shahamat Mustavi Tasin;Muhammad E. H. Chowdhury;Shona Pedersen;Malek Chabbouh;Diala Bushnaq;Raghad Aljindi;Saidul Kabir;Anwarul Hasan","doi":"10.1109/ACCESS.2025.3644494","DOIUrl":"https://doi.org/10.1109/ACCESS.2025.3644494","url":null,"abstract":"Inner speech recognition has gained enormous interest in recent years due to its applications in rehabilitation, developing assistive technology, and cognitive assessment. However, since language and speech productions are a complex process, for which identifying speech components has remained a challenging task. Different approaches were taken previously to reach this goal, but new approaches remain to be explored. Also, a subject-oriented analysis is necessary to understand the underlying brain dynamics during inner speech production, which can bring novel methods to neurological research. A publicly available dataset, “Thinking Out Loud Dataset,” has been used to develop a Machine Learning (ML)-based technique to classify inner speech using 128-channel surface Electroencephalography (EEG) signals. The dataset is collected on a Spanish cohort of ten subjects while uttering four words (“Arriba,” “Abajo,” “Derecha,” and “Izquierda”) by each participant. Statistical methods were employed to detect and remove motion artifacts from the signals. A large number (191 per channel) of time-, frequency- and time-frequency-domain features were extracted. Eight feature selection algorithms are explored, and the best feature selection technique is selected for subsequent evaluations. The performance of six ML algorithms is evaluated, and an ensemble model is proposed. Deep Learning models are also explored, and the results are compared with the classical ML approach. The proposed ensemble model, by stacking the five best logistic regression models, generated a promising result with an overall accuracy of 81.13% and an F1 score of 81.12% in the classification of four inner speech words using surface EEG signals.","PeriodicalId":13079,"journal":{"name":"IEEE Access","volume":"14 ","pages":"10811-10827"},"PeriodicalIF":3.6,"publicationDate":"2026-01-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11333930","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146026289","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
IEEE Access
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1