首页 > 最新文献

IEEE Transactions on Emerging Topics in Computational Intelligence最新文献

英文 中文
Swarm Imitation Learning From Observations 从观察中进行群体模仿学习
IF 5.3 3区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-03-27 DOI: 10.1109/TETCI.2025.3569762
Aya Hussein;Eleni Petraki;Hussein A. Abbass
Learning from observation (LfO) is a process where an agent learns a task by passively observing a more competent agent perform it. LfO differs from classical Learning from demonstration (LfD) in that the former requires access to the demonstrator' s states only, whereas the latter requires both the demonstrator' s states and the corresponding actions. On the one hand, LfO avoids the sometimes costly or impractical burden of collecting the demonstrator' s actions, and instead only requires the demonstrator' s states which are more easily captured through cameras or sensors. On the other hand, LfO is more challenging than classical LfD because of the lack of detailed guidance from action labels. Despite the success of LfO in single-agent tasks, the literature falls short of assessing its feasibility in swarm systems, where multiple agents act simultaneously to enact a system-level state change. We tackle this research gap by proposing Swarm-LfO that extends single-agent LfO by leveraging the centralised training with decentralised execution framework to learn a useful agent-centric inverse dynamic model (AIDM). AIDM enables the imitator swarm to predict agent-level actions that would lead to swarm state transitions similar to those exhibited by the demonstrator swarm. Pairs of states and the corresponding estimated actions are then used for learning to imitate the demonstrated behaviour in a supervised learning manner. Evaluation experiments are conducted using four tasks that require different levels of coordination between swarm members: flocking, sheltering, dispersion, and herding. The results show that the performance of Swarm-LfO is comparable to classical LfD methods that require access to action information. Swarm-LfO is extensively evaluated and has demonstrated continued success under various experimental conditions including noise and different sizes of the demonstrator and imitator swarms. Our contribution will pave the way for imitation learning in swarms with diverse platforms, where the demonstrator and imitator swarms operate on different action spaces.
从观察中学习(LfO)是一个智能体通过被动地观察更有能力的智能体执行任务来学习任务的过程。LfO与经典的从演示中学习(LfD)的不同之处在于,前者只需要获得演示者的状态,而后者既需要演示者的状态,也需要相应的动作。一方面,LfO避免了有时昂贵或不切实际的收集演示者动作的负担,而是只需要演示者的状态,这些状态更容易通过相机或传感器捕获。另一方面,由于缺乏行动标签的详细指导,LfO比传统的LfD更具挑战性。尽管LfO在单智能体任务中取得了成功,但文献缺乏评估其在群系统中的可行性,在群系统中,多个智能体同时行动以制定系统级状态变化。我们通过提出Swarm-LfO来解决这一研究缺口,该算法通过利用集中式训练和分散执行框架来学习一个有用的以智能体为中心的逆动态模型(AIDM),扩展了单智能体LfO。AIDM使模仿者群体能够预测代理级行为,这些行为将导致群体状态转换,类似于演示者群体所展示的状态转换。然后将状态对和相应的估计动作用于学习,以监督学习的方式模仿演示的行为。评估实验采用四种任务进行,这些任务需要群体成员之间不同程度的协调:群集、庇护、分散和放牧。结果表明,该算法的性能与经典的LfD方法相当,后者需要访问动作信息。对Swarm-LfO进行了广泛的评估,并在各种实验条件下证明了持续的成功,包括噪音和不同大小的演示器和模仿者群体。我们的贡献将为具有不同平台的群体模仿学习铺平道路,在这些平台上,演示者和模仿者群体在不同的行动空间中操作。
{"title":"Swarm Imitation Learning From Observations","authors":"Aya Hussein;Eleni Petraki;Hussein A. Abbass","doi":"10.1109/TETCI.2025.3569762","DOIUrl":"https://doi.org/10.1109/TETCI.2025.3569762","url":null,"abstract":"Learning from observation (LfO) is a process where an agent learns a task by passively observing a more competent agent perform it. LfO differs from classical Learning from demonstration (LfD) in that the former requires access to the demonstrator' s states only, whereas the latter requires both the demonstrator' s states and the corresponding actions. On the one hand, LfO avoids the sometimes costly or impractical burden of collecting the demonstrator' s actions, and instead only requires the demonstrator' s states which are more easily captured through cameras or sensors. On the other hand, LfO is more challenging than classical LfD because of the lack of detailed guidance from action labels. Despite the success of LfO in single-agent tasks, the literature falls short of assessing its feasibility in swarm systems, where multiple agents act simultaneously to enact a system-level state change. We tackle this research gap by proposing <italic>Swarm-LfO</i> that extends single-agent LfO by leveraging the <italic>centralised training with decentralised execution</i> framework to learn a useful agent-centric inverse dynamic model (AIDM). AIDM enables the imitator swarm to predict agent-level actions that would lead to swarm state transitions similar to those exhibited by the demonstrator swarm. Pairs of states and the corresponding estimated actions are then used for learning to imitate the demonstrated behaviour in a supervised learning manner. Evaluation experiments are conducted using four tasks that require different levels of coordination between swarm members: flocking, sheltering, dispersion, and herding. The results show that the performance of <italic>Swarm-LfO</i> is comparable to classical LfD methods that require access to action information. <italic>Swarm-LfO</i> is extensively evaluated and has demonstrated continued success under various experimental conditions including noise and different sizes of the demonstrator and imitator swarms. Our contribution will pave the way for imitation learning in swarms with diverse platforms, where the demonstrator and imitator swarms operate on different action spaces.","PeriodicalId":13135,"journal":{"name":"IEEE Transactions on Emerging Topics in Computational Intelligence","volume":"9 6","pages":"4092-4105"},"PeriodicalIF":5.3,"publicationDate":"2025-03-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145584646","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
IEEE Transactions on Emerging Topics in Computational Intelligence Information for Authors IEEE计算智能信息新主题汇刊
IF 5.3 3区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-03-26 DOI: 10.1109/TETCI.2025.3548334
{"title":"IEEE Transactions on Emerging Topics in Computational Intelligence Information for Authors","authors":"","doi":"10.1109/TETCI.2025.3548334","DOIUrl":"https://doi.org/10.1109/TETCI.2025.3548334","url":null,"abstract":"","PeriodicalId":13135,"journal":{"name":"IEEE Transactions on Emerging Topics in Computational Intelligence","volume":"9 2","pages":"C3-C3"},"PeriodicalIF":5.3,"publicationDate":"2025-03-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10939045","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143706603","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
IEEE Transactions on Emerging Topics in Computational Intelligence Publication Information IEEE计算智能新兴主题汇刊
IF 5.3 3区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-03-26 DOI: 10.1109/TETCI.2025.3548330
{"title":"IEEE Transactions on Emerging Topics in Computational Intelligence Publication Information","authors":"","doi":"10.1109/TETCI.2025.3548330","DOIUrl":"https://doi.org/10.1109/TETCI.2025.3548330","url":null,"abstract":"","PeriodicalId":13135,"journal":{"name":"IEEE Transactions on Emerging Topics in Computational Intelligence","volume":"9 2","pages":"C2-C2"},"PeriodicalIF":5.3,"publicationDate":"2025-03-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10939046","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143716532","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
IEEE Computational Intelligence Society Information IEEE计算智能学会信息
IF 5.3 3区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-03-26 DOI: 10.1109/TETCI.2025.3548332
{"title":"IEEE Computational Intelligence Society Information","authors":"","doi":"10.1109/TETCI.2025.3548332","DOIUrl":"https://doi.org/10.1109/TETCI.2025.3548332","url":null,"abstract":"","PeriodicalId":13135,"journal":{"name":"IEEE Transactions on Emerging Topics in Computational Intelligence","volume":"9 2","pages":"C4-C4"},"PeriodicalIF":5.3,"publicationDate":"2025-03-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10939049","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143706630","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
FLAME: Federated Learning With Masked Autoencoders and Mean-Prototypes Embedding for Sparsely Labeled Medical Images 基于蒙面自编码器和均值原型嵌入的稀疏标记医学图像的联邦学习
IF 5.3 3区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-03-26 DOI: 10.1109/TETCI.2025.3569759
Mayank Kumar Kundalwal;Deepak Mishra
Federated Learning (FL) has emerged as a promising paradigm for collaborative and privacy-preserving model training in medical imaging. However, FL faces major challenges such as data heterogeneity among hospitals or institutions and scarcity of labeled data, particularly in healthcare applications. To address these challenges, we propose FLAME (Federated Learning with masked Autoencoders and Mean-prototypes Embedding) for sparsely labeled medical images. FLAME implements an integrated learning framework where a masked autoencoder (MAE) learns robust feature representations through reconstruction-based self-supervision, while a Prototypical Network head guides these representations to enhance class separation through mean-prototype embeddings. This learning mechanism enables the encoder to simultaneously capture rich contextual features from unlabeled data while learning discriminative boundaries among classes using limited labeled samples. Our experiments on diverse medical imaging tasks, including PathMNIST, Dermnet, COVID-19 chest X-ray dataset, and Skin-FL, demonstrate FLAME's superior performance over existing FL techniques. The framework shows significant improvements in both classification accuracy and convergence speed, while maintaining privacy and reducing dependence on labeled data. Most importantly, the proposed integration of MAE and Prototypical Network opens new possibilities for the domains that suffer from label scarcity and data heterogeneity, making it particularly valuable for applications like medical diagnostics.
联邦学习(FL)已成为医学成像中协作和隐私保护模型训练的一个有前途的范例。然而,FL面临着主要挑战,如医院或机构之间的数据异构性和标记数据的稀缺性,特别是在医疗保健应用中。为了解决这些挑战,我们提出了用于稀疏标记医学图像的FLAME(带掩码自编码器和均值原型嵌入的联邦学习)。FLAME实现了一个集成的学习框架,其中掩码自编码器(MAE)通过基于重建的自我监督学习鲁棒特征表示,而原型网络头部通过均值-原型嵌入来指导这些表示以增强类分离。这种学习机制使编码器能够同时从未标记的数据中捕获丰富的上下文特征,同时使用有限的标记样本学习类之间的判别边界。我们在各种医学成像任务上的实验,包括PathMNIST、Dermnet、COVID-19胸部x射线数据集和Skin-FL,证明了FLAME优于现有FL技术的性能。该框架在分类精度和收敛速度方面都有显著提高,同时保持了隐私性并减少了对标记数据的依赖。最重要的是,提议的MAE和原型网络的集成为遭受标签稀缺和数据异构的领域开辟了新的可能性,使其对医疗诊断等应用特别有价值。
{"title":"FLAME: Federated Learning With Masked Autoencoders and Mean-Prototypes Embedding for Sparsely Labeled Medical Images","authors":"Mayank Kumar Kundalwal;Deepak Mishra","doi":"10.1109/TETCI.2025.3569759","DOIUrl":"https://doi.org/10.1109/TETCI.2025.3569759","url":null,"abstract":"Federated Learning (FL) has emerged as a promising paradigm for collaborative and privacy-preserving model training in medical imaging. However, FL faces major challenges such as data heterogeneity among hospitals or institutions and scarcity of labeled data, particularly in healthcare applications. To address these challenges, we propose FLAME (Federated Learning with masked Autoencoders and Mean-prototypes Embedding) for sparsely labeled medical images. FLAME implements an integrated learning framework where a masked autoencoder (MAE) learns robust feature representations through reconstruction-based self-supervision, while a Prototypical Network head guides these representations to enhance class separation through mean-prototype embeddings. This learning mechanism enables the encoder to simultaneously capture rich contextual features from unlabeled data while learning discriminative boundaries among classes using limited labeled samples. Our experiments on diverse medical imaging tasks, including PathMNIST, Dermnet, COVID-19 chest X-ray dataset, and Skin-FL, demonstrate FLAME's superior performance over existing FL techniques. The framework shows significant improvements in both classification accuracy and convergence speed, while maintaining privacy and reducing dependence on labeled data. Most importantly, the proposed integration of MAE and Prototypical Network opens new possibilities for the domains that suffer from label scarcity and data heterogeneity, making it particularly valuable for applications like medical diagnostics.","PeriodicalId":13135,"journal":{"name":"IEEE Transactions on Emerging Topics in Computational Intelligence","volume":"9 6","pages":"4080-4091"},"PeriodicalIF":5.3,"publicationDate":"2025-03-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145584642","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Spatio-Temporal Enhancement-Based Spiking Neural Network for Morphological Neuron Classification 基于时空增强的脉冲神经网络形态学神经元分类
IF 5.3 3区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-03-26 DOI: 10.1109/TETCI.2025.3549763
Chunli Sun;Qinghai Guo;Luziwei Leng;Feng Wu;Feng Zhao
The morphology of neurons plays a crucial role in identifying their types and investigating the structure and function of the brain. While existing methods recognize neuron types through efficient morphology representations based on their tree-like structure, they can be further enhanced when analyzing neurons with complex and varied morphologies. In this paper, we introduce a shallow yet efficient multi-branch spatio-temporal enhancement-based Spiking Neural Network (SNN), consisting of three spiking VGG5 models, to fully delineate neuronal morphologies and precisely identify neuron types. Our method captures neuronal morphologies from the spatio-temporal domain and explores the relationships among different neuronal branches, thereby providing a comprehensive description of neurons with complex structures and significantly improving the classification performance. Specifically, we first decompose the neuron tree with complex and varied morphologies into multiple subtrees to represent neuronal morphology fully and then explicitly project these subtrees onto the temporal dimension. Then, we introduce the spiking VGG5 model to characterize neuronal morphology through spiking sequences and learn the relation of these subtrees from the spatio-temporal dimensions. Furthermore, we design a plug-and-play Spatio-Temporal Enhancement Module (STEM) for the spiking VGG5, enabling maximal activation of the spiking activity and facilitating information transfer and representation learning. In this way, our SNN architecture can comprehensively learn neuronal morphology representations based on the tree-like structure and depict the relationships of subtrees, accurately describing the morphological features of neurons with complex arbors. Experimental results demonstrate that our method precisely depicts the neuronal morphologies and achieves accuracies of 87.40% and 82.96% on two NeuroMorpho datasets, respectively, outperforming other approaches. Besides, our method displays significant generalizability and performs remarkably on the JML and BIL datasets.
神经元的形态在识别神经元类型和研究大脑的结构和功能方面起着至关重要的作用。虽然现有的方法是通过基于树形结构的有效形态学表征来识别神经元类型,但在分析具有复杂和多样形态的神经元时,它们可以进一步增强。在本文中,我们引入了一种浅而高效的基于多分支时空增强的峰值神经网络(SNN),该网络由三个峰值VGG5模型组成,以全面描绘神经元形态并精确识别神经元类型。我们的方法从时空域捕获神经元形态,探索不同神经元分支之间的关系,从而提供复杂结构神经元的全面描述,显著提高分类性能。具体来说,我们首先将具有复杂多变形态的神经元树分解成多个子树来完整地表示神经元形态,然后将这些子树明确地投射到时间维度上。然后,引入峰值VGG5模型,通过峰值序列表征神经元形态,并从时空维度学习这些子树之间的关系。此外,我们还设计了一个即插即用的时空增强模块(STEM),用于峰值VGG5,以最大限度地激活峰值活动,促进信息传递和表征学习。这样,我们的SNN架构可以全面学习基于树状结构的神经元形态表征,并描绘子树之间的关系,准确描述具有复杂树杈的神经元的形态特征。实验结果表明,该方法能够准确地描述神经元形态,在两个NeuroMorpho数据集上的准确率分别达到87.40%和82.96%,优于其他方法。此外,该方法在JML和BIL数据集上具有显著的泛化性能。
{"title":"Spatio-Temporal Enhancement-Based Spiking Neural Network for Morphological Neuron Classification","authors":"Chunli Sun;Qinghai Guo;Luziwei Leng;Feng Wu;Feng Zhao","doi":"10.1109/TETCI.2025.3549763","DOIUrl":"https://doi.org/10.1109/TETCI.2025.3549763","url":null,"abstract":"The morphology of neurons plays a crucial role in identifying their types and investigating the structure and function of the brain. While existing methods recognize neuron types through efficient morphology representations based on their tree-like structure, they can be further enhanced when analyzing neurons with complex and varied morphologies. In this paper, we introduce a shallow yet efficient multi-branch spatio-temporal enhancement-based Spiking Neural Network (SNN), consisting of three spiking VGG5 models, to fully delineate neuronal morphologies and precisely identify neuron types. Our method captures neuronal morphologies from the spatio-temporal domain and explores the relationships among different neuronal branches, thereby providing a comprehensive description of neurons with complex structures and significantly improving the classification performance. Specifically, we first decompose the neuron tree with complex and varied morphologies into multiple subtrees to represent neuronal morphology fully and then explicitly project these subtrees onto the temporal dimension. Then, we introduce the spiking VGG5 model to characterize neuronal morphology through spiking sequences and learn the relation of these subtrees from the spatio-temporal dimensions. Furthermore, we design a plug-and-play Spatio-Temporal Enhancement Module (STEM) for the spiking VGG5, enabling maximal activation of the spiking activity and facilitating information transfer and representation learning. In this way, our SNN architecture can comprehensively learn neuronal morphology representations based on the tree-like structure and depict the relationships of subtrees, accurately describing the morphological features of neurons with complex arbors. Experimental results demonstrate that our method precisely depicts the neuronal morphologies and achieves accuracies of 87.40% and 82.96% on two NeuroMorpho datasets, respectively, outperforming other approaches. Besides, our method displays significant generalizability and performs remarkably on the JML and BIL datasets.","PeriodicalId":13135,"journal":{"name":"IEEE Transactions on Emerging Topics in Computational Intelligence","volume":"9 6","pages":"4273-4287"},"PeriodicalIF":5.3,"publicationDate":"2025-03-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145584678","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Generalization-Enhanced Feature-Wise and Correlation Attentions for Cross-Database Facial Expression Recognition 基于泛化增强特征和相关性的跨数据库面部表情识别
IF 5.3 3区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-03-26 DOI: 10.1109/TETCI.2025.3548727
Weicheng Xie;Tao Zhong;Fan Yang;Siyang Song;Zitong Yu;Linlin Shen
Cross-database facial expression recognition (CDFER) has attracted increasing attention when evaluating the systems' generalization performance. Although the attention mechanism can capture the feature-wise importance or feature-correlation of expression sensitive regions, the attention-based network suffers from the overfitting to the source database, due to possible over-dependence on most salient features, without exploring feature characteristics during removal of feature redundancy. To address this issue, this paper introduces a multi-kernel competitive convolution in feature-wise attention to obtain more salient regions and let each kernel compete with others to enhance the expressive ability of features, by reducing attention overfitting to the source domain. For feature-correlation attention, we resort to a Monte Carlo-based dropout to not only reduce the over-learning of the feature relationship, but also model the dropout probability distribution more specifically by taking the characteristics of feature maps into account. Experimental results show that our algorithm achieves much better generalization performances than the state of the arts (SOTAs) on six publicly available datasets, in the scenarios of single source domain, multiple source domains and domain adaption.
跨数据库面部表情识别(CDFER)在评价系统的泛化性能方面受到越来越多的关注。尽管注意机制可以捕获表达敏感区域的特征重要性或特征相关性,但由于可能过度依赖最显著的特征,基于注意的网络在去除特征冗余时没有探索特征特征,因此存在对源数据库的过拟合问题。为了解决这个问题,本文在特征关注方面引入了一种多核竞争卷积,以获得更多的显著区域,并让每个核相互竞争,通过减少对源域的关注过拟合来增强特征的表达能力。对于特征相关关注,我们采用基于蒙特卡罗的dropout方法,不仅减少了特征关系的过度学习,而且通过考虑特征映射的特征,更具体地建模了dropout概率分布。实验结果表明,在单源域、多源域和域自适应场景下,该算法在6个公开可用的数据集上取得了比SOTAs更好的泛化性能。
{"title":"Generalization-Enhanced Feature-Wise and Correlation Attentions for Cross-Database Facial Expression Recognition","authors":"Weicheng Xie;Tao Zhong;Fan Yang;Siyang Song;Zitong Yu;Linlin Shen","doi":"10.1109/TETCI.2025.3548727","DOIUrl":"https://doi.org/10.1109/TETCI.2025.3548727","url":null,"abstract":"Cross-database facial expression recognition (CDFER) has attracted increasing attention when evaluating the systems' generalization performance. Although the attention mechanism can capture the feature-wise importance or feature-correlation of expression sensitive regions, the attention-based network suffers from the overfitting to the source database, due to possible over-dependence on most salient features, without exploring feature characteristics during removal of feature redundancy. To address this issue, this paper introduces a multi-kernel competitive convolution in feature-wise attention to obtain more salient regions and let each kernel compete with others to enhance the expressive ability of features, by reducing attention overfitting to the source domain. For feature-correlation attention, we resort to a Monte Carlo-based dropout to not only reduce the over-learning of the feature relationship, but also model the dropout probability distribution more specifically by taking the characteristics of feature maps into account. Experimental results show that our algorithm achieves much better generalization performances than the state of the arts (SOTAs) on six publicly available datasets, in the scenarios of single source domain, multiple source domains and domain adaption.","PeriodicalId":13135,"journal":{"name":"IEEE Transactions on Emerging Topics in Computational Intelligence","volume":"9 6","pages":"4258-4272"},"PeriodicalIF":5.3,"publicationDate":"2025-03-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145584671","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
In Vivo Computational Strategy for Tumor Targeting in Co-Associated Biological Landscapes 协同相关生物景观中肿瘤靶向的体内计算策略
IF 5.3 3区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-03-26 DOI: 10.1109/TETCI.2025.3569765
Shaolong Shi;Zhaoyang Jiang;Qiang Liu;Qingfu Zhang;Yifan Chen
Recently, a novel framework of in vivo computation has been proposed by modeling the tumor targetingproblem as a natural computation problem. The tumor-triggered biological gradient field (BGF) which provides targeting information for the nanorobots is viewed as the objective function to be optimized. The previous work focuses on the scenario of single BGF, which is interpreted as a uni-objective optimization problem. However, in real-life scenarios, various BGFs will be induced by the arising of a tumor lesion because of the variations of different kinds of biological information around the lesion (e.g., blood velocity, pH, oxygen, glucose, lactate, and $rm H^{+}$ ions). It is plausible to utilize BGF information as much as possible to target the tumor efficiently and robustly. Thus, we propose a BGF selector, which consists of a neural network “VisionaryNet”, a swarm intelligence algorithm, and a weak priority evolution strategy (WP-ES) in this article. Various artificial BGF landscapes are used to train the proposed VisionaryNet, which is employed to choose the alternative BGFs combined with several on-line estimated features during each iterative step. To demonstrate the effectiveness of the proposed BGF selector, a random selection approach is used as the benchmark. Comprehensive in silico experiments are carried out by taking into consideration the in vivo constraints of the nanobiosensing process. Furthermore, the correlation between the number of employed BGFs and the targeting result is investigated as the increasing of the number of BGFs will lead to excessive computation, which is adverse to the computational accuracy.
近年来,人们提出了一种新的体内计算框架,将肿瘤靶向问题建模为自然计算问题。肿瘤触发的生物梯度场(BGF)为纳米机器人提供了靶向信息,被视为需要优化的目标函数。以往的工作主要集中在单个BGF的场景,将其解释为一个单目标优化问题。然而,在现实生活中,由于肿瘤病变周围不同种类的生物信息(如血流速度、pH值、氧、葡萄糖、乳酸和H离子)的变化,肿瘤病变的产生会诱发各种BGFs。尽可能多地利用BGF信息来高效、稳健地靶向肿瘤是可行的。因此,本文提出了一种由神经网络“VisionaryNet”、群体智能算法和弱优先进化策略(WP-ES)组成的BGF选择器。使用各种人工BGF景观来训练所提出的VisionaryNet,该网络用于在每个迭代步骤中结合多个在线估计特征选择备选BGF。为了证明所提出的BGF选择器的有效性,采用随机选择方法作为基准。考虑到纳米生物传感过程的体内限制,进行了全面的硅实验。此外,由于BGFs数量的增加会导致计算量过大,不利于计算精度,因此研究了BGFs数量与目标效果的相关性。
{"title":"In Vivo Computational Strategy for Tumor Targeting in Co-Associated Biological Landscapes","authors":"Shaolong Shi;Zhaoyang Jiang;Qiang Liu;Qingfu Zhang;Yifan Chen","doi":"10.1109/TETCI.2025.3569765","DOIUrl":"https://doi.org/10.1109/TETCI.2025.3569765","url":null,"abstract":"Recently, a novel framework of in vivo computation has been proposed by modeling the tumor targetingproblem as a natural computation problem. The tumor-triggered biological gradient field (BGF) which provides targeting information for the nanorobots is viewed as the objective function to be optimized. The previous work focuses on the scenario of single BGF, which is interpreted as a uni-objective optimization problem. However, in real-life scenarios, various BGFs will be induced by the arising of a tumor lesion because of the variations of different kinds of biological information around the lesion (e.g., blood velocity, pH, oxygen, glucose, lactate, and <inline-formula><tex-math>$rm H^{+}$</tex-math></inline-formula> ions). It is plausible to utilize BGF information as much as possible to target the tumor efficiently and robustly. Thus, we propose a BGF selector, which consists of a neural network “VisionaryNet”, a swarm intelligence algorithm, and a weak priority evolution strategy (WP-ES) in this article. Various artificial BGF landscapes are used to train the proposed VisionaryNet, which is employed to choose the alternative BGFs combined with several on-line estimated features during each iterative step. To demonstrate the effectiveness of the proposed BGF selector, a random selection approach is used as the benchmark. Comprehensive <italic>in silico</i> experiments are carried out by taking into consideration the in vivo constraints of the nanobiosensing process. Furthermore, the correlation between the number of employed BGFs and the targeting result is investigated as the increasing of the number of BGFs will lead to excessive computation, which is adverse to the computational accuracy.","PeriodicalId":13135,"journal":{"name":"IEEE Transactions on Emerging Topics in Computational Intelligence","volume":"9 6","pages":"4133-4144"},"PeriodicalIF":5.3,"publicationDate":"2025-03-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145584601","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Textual Graph Representation With Syntactic Weighting for Implicit Sentiment Analysis 基于句法加权的文本图表示隐式情感分析
IF 5.3 3区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-03-25 DOI: 10.1109/TETCI.2025.3550515
Shunxiang Zhang;Jiawei Li;Shuyu Li;Wenjie Duan;Zhongliang Wei;Kuan-Ching Li
Implicit sentiment analysis seeks to identify and interpret the underlying sentiment within texts that lack explicit sentiment words, significantly enhancing the capabilities of opinion analysis. Current methods often overlook the impact of context-dependent sequential text with graph neural networks, leading to an inadequate semantic representation of the text. In this paper, we propose a textual graph representation method with syntactic weighting for implicit sentiment analysis. This method improves textual semantic association by modeling the graph structure of the word position relationship in the text. It integrates syntactic weighting with an attention mechanism and guides node interactions in the graph attention network to generate textual graph representation with enhanced semantic depth and richness. The word semantics are enriched by introducing external knowledge. The proposed model is compared with existing models on the public implicit sentiment dataset SMP2019-ECISA, the explicit sentiment dataset NLPCC2014-SC, and the self-built dataset containing both explicit and implicit sentiment. The experimental results show that the proposed method can not only efficiently identify implicit sentiment, but also achieve some generalization and robustness in explicit sentiment recognition.
内隐情感分析旨在识别和解释缺乏明确情感词的文本中的潜在情感,显著提高了意见分析的能力。目前的方法往往忽略了上下文相关的顺序文本与图神经网络的影响,导致文本的语义表示不足。本文提出了一种基于句法加权的文本图表示方法,用于隐式情感分析。该方法通过对文本中单词位置关系的图结构建模,提高了文本的语义关联。它将句法加权与注意机制相结合,引导图注意网络中的节点交互,生成具有增强语义深度和丰富度的文本图表示。通过引入外部知识,丰富了词的语义。将该模型与公共隐式情感数据集SMP2019-ECISA、显式情感数据集NLPCC2014-SC以及包含显式和隐式情感的自建数据集上的现有模型进行比较。实验结果表明,该方法不仅可以有效地识别内隐情感,而且在显式情感识别中具有一定的泛化和鲁棒性。
{"title":"Textual Graph Representation With Syntactic Weighting for Implicit Sentiment Analysis","authors":"Shunxiang Zhang;Jiawei Li;Shuyu Li;Wenjie Duan;Zhongliang Wei;Kuan-Ching Li","doi":"10.1109/TETCI.2025.3550515","DOIUrl":"https://doi.org/10.1109/TETCI.2025.3550515","url":null,"abstract":"Implicit sentiment analysis seeks to identify and interpret the underlying sentiment within texts that lack explicit sentiment words, significantly enhancing the capabilities of opinion analysis. Current methods often overlook the impact of context-dependent sequential text with graph neural networks, leading to an inadequate semantic representation of the text. In this paper, we propose a textual graph representation method with syntactic weighting for implicit sentiment analysis. This method improves textual semantic association by modeling the graph structure of the word position relationship in the text. It integrates syntactic weighting with an attention mechanism and guides node interactions in the graph attention network to generate textual graph representation with enhanced semantic depth and richness. The word semantics are enriched by introducing external knowledge. The proposed model is compared with existing models on the public implicit sentiment dataset SMP2019-ECISA, the explicit sentiment dataset NLPCC2014-SC, and the self-built dataset containing both explicit and implicit sentiment. The experimental results show that the proposed method can not only efficiently identify implicit sentiment, but also achieve some generalization and robustness in explicit sentiment recognition.","PeriodicalId":13135,"journal":{"name":"IEEE Transactions on Emerging Topics in Computational Intelligence","volume":"9 6","pages":"4288-4299"},"PeriodicalIF":5.3,"publicationDate":"2025-03-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145584643","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Learning Error Refinement in Stochastic Gradient Descent-Based Latent Factor Analysis via Diversified PID Controllers 基于多元PID控制器的随机梯度下降潜因子分析的学习误差细化
IF 5.3 3区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-03-24 DOI: 10.1109/TETCI.2025.3547854
Jinli Li;Ye Yuan;Xin Luo
In Big Data-based applications, high-dimensional and incomplete (HDI) data are frequently used to represent the complicated interactions among numerous nodes. A stochastic gradient descent (SGD)-based latent factor analysis (LFA) model can process such data efficiently. Unfortunately, a standard SGD algorithm trains a single latent factor relying on the stochastic gradient related to the current learning error only, leading to a slow convergence rate. To break through this bottleneck, this study establishes an SGD-based LFA model as the backbone, and proposes six proportional-integral-derivative (PID)-incorporated LFA models with diversified PID-controllers with the following two-fold ideas: a) refining the instant learning error in stochastic gradient by the principle of six PID-variants, i.e., a standard PID, an integral separated PID, a gearshift integral PID, a dead zone PID, an anti-windup PID, and an incomplete differential PID, to assimilate historical update information into the learning scheme in an efficient way; b) making the hyper-parameters adaptation by utilizing the mechanism of particle swarm optimization for acquiring high practicality. In addition, considering the diversified PID-variants, an effective ensemble is implemented for the six PID-incorporated LFA models. Experimental results on industrial HDI datasets illustrate that in comparison with state-of-the-art models, the proposed models obtain superior computational efficiency while maintaining competitive accuracy in predicting missing data within an HDI matrix. Moreover, their ensemble further improves performance in terms of prediction accuracy.
在基于大数据的应用中,经常使用高维不完整数据(high-dimensional and incomplete, HDI)来表示众多节点之间复杂的交互。基于随机梯度下降(SGD)的潜在因素分析(LFA)模型可以有效地处理这些数据。不幸的是,标准的SGD算法只依靠与当前学习误差相关的随机梯度来训练单个潜在因子,导致收敛速度缓慢。为了突破这一瓶颈,本研究建立了一个基于sgd的LFA模型作为主干,并提出了6个包含比例-积分-导数(PID)的LFA模型,采用多种PID控制器,其思路如下:a)利用标准PID、积分分离PID、换挡积分PID、死区PID、反上卷PID、不完全微分PID等6个PID变量的原理,改进随机梯度中的瞬时学习误差,有效地将历史更新信息吸收到学习方案中;B)利用粒子群优化机制进行超参数自适应,以获得较高的实用性。此外,考虑到pid变量的多样性,对6个包含pid的LFA模型进行了有效的集成。工业HDI数据集的实验结果表明,与最先进的模型相比,所提出的模型在预测HDI矩阵中缺失数据时获得了更高的计算效率,同时保持了竞争的准确性。此外,它们的集成进一步提高了预测精度。
{"title":"Learning Error Refinement in Stochastic Gradient Descent-Based Latent Factor Analysis via Diversified PID Controllers","authors":"Jinli Li;Ye Yuan;Xin Luo","doi":"10.1109/TETCI.2025.3547854","DOIUrl":"https://doi.org/10.1109/TETCI.2025.3547854","url":null,"abstract":"In Big Data-based applications, high-dimensional and incomplete (HDI) data are frequently used to represent the complicated interactions among numerous nodes. A stochastic gradient descent (SGD)-based latent factor analysis (LFA) model can process such data efficiently. Unfortunately, a standard SGD algorithm trains a single latent factor relying on the stochastic gradient related to the current learning error only, leading to a slow convergence rate. To break through this bottleneck, this study establishes an SGD-based LFA model as the backbone, and proposes six proportional-integral-derivative (PID)-incorporated LFA models with diversified PID-controllers with the following two-fold ideas: a) refining the instant learning error in stochastic gradient by the principle of six PID-variants, i.e., a standard PID, an integral separated PID, a gearshift integral PID, a dead zone PID, an anti-windup PID, and an incomplete differential PID, to assimilate historical update information into the learning scheme in an efficient way; b) making the hyper-parameters adaptation by utilizing the mechanism of particle swarm optimization for acquiring high practicality. In addition, considering the diversified PID-variants, an effective ensemble is implemented for the six PID-incorporated LFA models. Experimental results on industrial HDI datasets illustrate that in comparison with state-of-the-art models, the proposed models obtain superior computational efficiency while maintaining competitive accuracy in predicting missing data within an HDI matrix. Moreover, their ensemble further improves performance in terms of prediction accuracy.","PeriodicalId":13135,"journal":{"name":"IEEE Transactions on Emerging Topics in Computational Intelligence","volume":"9 5","pages":"3582-3597"},"PeriodicalIF":5.3,"publicationDate":"2025-03-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145128456","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
IEEE Transactions on Emerging Topics in Computational Intelligence
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1