首页 > 最新文献

Computational Intelligence最新文献

英文 中文
HyperED: A hierarchy-aware network based on hyperbolic geometry for event detection HyperED:基于双曲几何的事件检测分层感知网络
IF 2.8 4区 计算机科学 Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-01-04 DOI: 10.1111/coin.12627
Meng Zhang, Zhiwen Xie, Jin Liu, Xiao Liu, Xiao Yu, Bo Huang

Event detection plays an essential role in the task of event extraction. It aims at identifying event trigger words in a sentence and classifying event types. Generally, multiple event types are usually well-organized with a hierarchical structure in real-world scenarios, and hierarchical correlations between event types can be used to enhance event detection performance. However, such kind of hierarchical information has received insufficient attention which can lead to misclassification between multiple event types. In addition, the most existing methods perform event detection in Euclidean space, which cannot adequately represent hierarchical relationships. To address these issues, we propose a novel event detection network HyperED which embeds the event context and types in Poincaré ball of hyperbolic geometry to help learn hierarchical features between events. Specifically, for the event detection context, we first leverage the pre-trained BERT or BiLSTM in Euclidean space to learn the semantic features of ED sentences. Meanwhile, to make full use of the dependency knowledge, a GNN-based model is applied when encoding event types to learn the correlations between events. Then we use a simple neural-based transformation to project the embeddings into the Poincaré ball to capture hierarchical features, and a distance score in hyperbolic space is computed for prediction. The experiments on MAVEN and ACE 2005 datasets indicate the effectiveness of the HyperED model and prove the natural advantages of hyperbolic spaces in expressing hierarchies in an intuitive way.

事件检测在事件提取任务中发挥着至关重要的作用。它旨在识别句子中的事件触发词,并对事件类型进行分类。一般来说,在现实世界的场景中,多个事件类型通常具有良好的组织层次结构,事件类型之间的层次相关性可用于提高事件检测性能。然而,这类分层信息没有得到足够重视,可能导致多种事件类型之间的错误分类。此外,现有的大多数方法都是在欧几里得空间中进行事件检测,无法充分体现层次关系。为了解决这些问题,我们提出了一种新颖的事件检测网络 HyperED,它将事件上下文和类型嵌入双曲几何的波恩卡莱球中,以帮助学习事件之间的层次特征。具体来说,对于事件检测上下文,我们首先利用欧几里得空间中预先训练好的 BERT 或 BiLSTM 来学习 ED 句子的语义特征。同时,为了充分利用依赖性知识,我们在对事件类型进行编码时采用了基于 GNN 的模型来学习事件之间的相关性。然后,我们使用基于神经的简单变换将嵌入投影到波恩卡莱球中,以捕捉分层特征,并计算双曲空间中的距离得分,从而进行预测。在 MAVEN 和 ACE 2005 数据集上的实验表明了 HyperED 模型的有效性,并证明了双曲空间在直观表达层次结构方面的天然优势。
{"title":"HyperED: A hierarchy-aware network based on hyperbolic geometry for event detection","authors":"Meng Zhang,&nbsp;Zhiwen Xie,&nbsp;Jin Liu,&nbsp;Xiao Liu,&nbsp;Xiao Yu,&nbsp;Bo Huang","doi":"10.1111/coin.12627","DOIUrl":"10.1111/coin.12627","url":null,"abstract":"<p>Event detection plays an essential role in the task of event extraction. It aims at identifying event trigger words in a sentence and classifying event types. Generally, multiple event types are usually well-organized with a hierarchical structure in real-world scenarios, and hierarchical correlations between event types can be used to enhance event detection performance. However, such kind of hierarchical information has received insufficient attention which can lead to misclassification between multiple event types. In addition, the most existing methods perform event detection in Euclidean space, which cannot adequately represent hierarchical relationships. To address these issues, we propose a novel event detection network HyperED which embeds the event context and types in Poincaré ball of hyperbolic geometry to help learn hierarchical features between events. Specifically, for the event detection context, we first leverage the pre-trained BERT or BiLSTM in Euclidean space to learn the semantic features of ED sentences. Meanwhile, to make full use of the dependency knowledge, a GNN-based model is applied when encoding event types to learn the correlations between events. Then we use a simple neural-based transformation to project the embeddings into the Poincaré ball to capture hierarchical features, and a distance score in hyperbolic space is computed for prediction. The experiments on MAVEN and ACE 2005 datasets indicate the effectiveness of the HyperED model and prove the natural advantages of hyperbolic spaces in expressing hierarchies in an intuitive way.</p>","PeriodicalId":55228,"journal":{"name":"Computational Intelligence","volume":"40 1","pages":""},"PeriodicalIF":2.8,"publicationDate":"2024-01-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139094283","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A mechanism for network resource allocation and task offloading in mobile edge computing and network engineering 移动边缘计算和网络工程中的网络资源分配和任务卸载机制
IF 2.8 4区 计算机科学 Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-01-03 DOI: 10.1111/coin.12628
Zhixu Shu, Kewang Zhang

At present, most of the resource allocation methods in mobile edge computing allocate computing resources according to the time order in which task requests are calculated and unloaded, without considering the priority of tasks in practical applications. According to the computing requirements in such cases, a priority task-oriented resource allocation method is proposed. According to the average processing time of the task execution, the corresponding priority for task is given. The tasks with different priorities are weighted to allocate computing resources, which not only ensures that the high-priority tasks obtain sufficient computing resources, but also reduces the total time and energy consumption to complete the calculation of all tasks, thus improving the quality of service. The experimental results show that the proposed method can achieve better performance.

目前,移动边缘计算中的大多数资源分配方法都是根据任务请求计算和卸载的时间顺序来分配计算资源,没有考虑实际应用中任务的优先级。根据这种情况下的计算需求,提出了一种面向任务优先级的资源分配方法。根据任务执行的平均处理时间,给出相应的任务优先级。通过对不同优先级的任务进行加权来分配计算资源,既能保证高优先级任务获得充足的计算资源,又能减少完成所有任务计算的总时间和能耗,从而提高服务质量。实验结果表明,所提出的方法可以实现更好的性能。
{"title":"A mechanism for network resource allocation and task offloading in mobile edge computing and network engineering","authors":"Zhixu Shu,&nbsp;Kewang Zhang","doi":"10.1111/coin.12628","DOIUrl":"10.1111/coin.12628","url":null,"abstract":"<p>At present, most of the resource allocation methods in mobile edge computing allocate computing resources according to the time order in which task requests are calculated and unloaded, without considering the priority of tasks in practical applications. According to the computing requirements in such cases, a priority task-oriented resource allocation method is proposed. According to the average processing time of the task execution, the corresponding priority for task is given. The tasks with different priorities are weighted to allocate computing resources, which not only ensures that the high-priority tasks obtain sufficient computing resources, but also reduces the total time and energy consumption to complete the calculation of all tasks, thus improving the quality of service. The experimental results show that the proposed method can achieve better performance.</p>","PeriodicalId":55228,"journal":{"name":"Computational Intelligence","volume":"40 1","pages":""},"PeriodicalIF":2.8,"publicationDate":"2024-01-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139094211","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Joint optimization of UAV position and user grouping for UAV-assisted hybrid NOMA systems 为无人机辅助混合 NOMA 系统联合优化无人机位置和用户分组
IF 2.8 4区 计算机科学 Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-01-02 DOI: 10.1111/coin.12625
Yuan Sun, Zhicheng Dong, Liuqing Yang, Donghong Cai, Weixi Zhou, Yanxia Zhou

This article investigates the use of unmanned aerial vehicles (UAVs) in assisting hybrid non-orthogonal multiple access (NOMA) systems to enhance spectrum efficiency and communication connectivity. A joint optimization problem is formulated for UAV positioning and user grouping to maximize the sum rate. The formulated problem exhibits non-convexity, calling for an effective solution. To address this issue, a two-stage approach is proposed. In the first stage, a particle swarm optimization algorithm is employed to optimize the UAV positions without considering user grouping. With the UAV positions optimized, a game theory-based approach is utilized in the second stage to optimize user grouping and improve the sum rate of the hybrid NOMA system. Simulation results demonstrate that the proposed two-stage method achieves solutions close to the global optimum of the original problem. By optimizing the positions of UAVs and user groups, the sum rate can be effectively improved. Additionally, optimizing the deployment of UAVs ensures better fairness in providing communication services to multiple users.

本文研究了无人机在辅助混合非正交多址(NOMA)系统中的应用,以提高频谱效率和通信连接性。针对无人机定位和用户分组提出了一个联合优化问题,以最大化总和速率。该问题具有非凸性,需要有效的解决方案。为解决这一问题,提出了一种两阶段方法。在第一阶段,采用粒子群优化算法优化无人机位置,而不考虑用户分组。无人机位置优化后,第二阶段采用基于博弈论的方法优化用户分组,提高混合 NOMA 系统的总和率。仿真结果表明,所提出的两阶段方法获得了接近原始问题全局最优的解决方案。通过优化无人机和用户组的位置,可以有效提高总和率。此外,优化无人机的部署可确保为多个用户提供更公平的通信服务。
{"title":"Joint optimization of UAV position and user grouping for UAV-assisted hybrid NOMA systems","authors":"Yuan Sun,&nbsp;Zhicheng Dong,&nbsp;Liuqing Yang,&nbsp;Donghong Cai,&nbsp;Weixi Zhou,&nbsp;Yanxia Zhou","doi":"10.1111/coin.12625","DOIUrl":"10.1111/coin.12625","url":null,"abstract":"<p>This article investigates the use of unmanned aerial vehicles (UAVs) in assisting hybrid non-orthogonal multiple access (NOMA) systems to enhance spectrum efficiency and communication connectivity. A joint optimization problem is formulated for UAV positioning and user grouping to maximize the sum rate. The formulated problem exhibits non-convexity, calling for an effective solution. To address this issue, a two-stage approach is proposed. In the first stage, a particle swarm optimization algorithm is employed to optimize the UAV positions without considering user grouping. With the UAV positions optimized, a game theory-based approach is utilized in the second stage to optimize user grouping and improve the sum rate of the hybrid NOMA system. Simulation results demonstrate that the proposed two-stage method achieves solutions close to the global optimum of the original problem. By optimizing the positions of UAVs and user groups, the sum rate can be effectively improved. Additionally, optimizing the deployment of UAVs ensures better fairness in providing communication services to multiple users.</p>","PeriodicalId":55228,"journal":{"name":"Computational Intelligence","volume":"40 1","pages":""},"PeriodicalIF":2.8,"publicationDate":"2024-01-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139077260","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Integrated indoor positioning methods to optimize computations and prediction accuracy enhancement 优化计算和提高预测精度的综合室内定位方法
IF 2.8 4区 计算机科学 Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-01-02 DOI: 10.1111/coin.12620
Yongho Kim, Jiha Kim, Cheolwoo You, Hyunhee Park

Indoor GPS location estimation encounters accuracy challenges from intricate building structures and diverse signal interferences. Trilateration methods utilising APs are typically employed to estimate indoor locations. Nevertheless, estimation errors from multipath effects and high power consumption of sensors employed in location estimation curtail battery life. To address this issue, research into location estimation methods utilising machine learning has been conducted. However, challenges involving the selection of the optimal access point locations and obtaining dense RSSI data have been noted. In this article presents a solution based on sparse radio maps for decreasing the expenses of collecting RSSI data while simultaneously enhancing indoor location accuracy through the integration of image data. The proposed approach integrates matrix-based RSSI indoor positioning (M-RIP) for initial location estimation and feature-based image indoor positioning (F-IIP) for position determination via image feature matching. Furthermore, extended area-based post-processing (EA-PP) is employed to augment M-RIP's precision and minimize image matching computation in F-IIP, improving overall performance. This article utilizes actual building data to validate the precision of the position estimation and efficiency of computation reduction using the proposed method.

室内 GPS 定位估算面临着复杂的建筑结构和各种信号干扰带来的精度挑战。通常采用利用接入点的三摄法来估算室内位置。然而,多径效应造成的估算误差和位置估算所用传感器的高能耗会缩短电池寿命。为解决这一问题,人们对利用机器学习的位置估算方法进行了研究。然而,在选择最佳接入点位置和获取高密度 RSSI 数据方面存在挑战。本文提出了一种基于稀疏无线电地图的解决方案,以减少收集 RSSI 数据的费用,同时通过整合图像数据提高室内定位精度。所提出的方法整合了基于矩阵的 RSSI 室内定位(M-RIP)和基于特征的图像室内定位(F-IIP),前者用于初始位置估计,后者通过图像特征匹配确定位置。此外,还采用了基于区域的扩展后处理(EA-PP)来提高 M-RIP 的精度,并尽量减少 F-IIP 中的图像匹配计算,从而提高整体性能。本文利用实际建筑数据验证了所提方法的位置估计精度和计算量减少的效率。
{"title":"Integrated indoor positioning methods to optimize computations and prediction accuracy enhancement","authors":"Yongho Kim,&nbsp;Jiha Kim,&nbsp;Cheolwoo You,&nbsp;Hyunhee Park","doi":"10.1111/coin.12620","DOIUrl":"10.1111/coin.12620","url":null,"abstract":"<p>Indoor GPS location estimation encounters accuracy challenges from intricate building structures and diverse signal interferences. Trilateration methods utilising APs are typically employed to estimate indoor locations. Nevertheless, estimation errors from multipath effects and high power consumption of sensors employed in location estimation curtail battery life. To address this issue, research into location estimation methods utilising machine learning has been conducted. However, challenges involving the selection of the optimal access point locations and obtaining dense RSSI data have been noted. In this article presents a solution based on sparse radio maps for decreasing the expenses of collecting RSSI data while simultaneously enhancing indoor location accuracy through the integration of image data. The proposed approach integrates matrix-based RSSI indoor positioning (M-RIP) for initial location estimation and feature-based image indoor positioning (F-IIP) for position determination via image feature matching. Furthermore, extended area-based post-processing (EA-PP) is employed to augment M-RIP's precision and minimize image matching computation in F-IIP, improving overall performance. This article utilizes actual building data to validate the precision of the position estimation and efficiency of computation reduction using the proposed method.</p>","PeriodicalId":55228,"journal":{"name":"Computational Intelligence","volume":"40 1","pages":""},"PeriodicalIF":2.8,"publicationDate":"2024-01-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139094493","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Artificial intelligence control for trust-based detection of attackers in 5G social networks 基于信任的人工智能控制,用于检测 5G 社交网络中的攻击者
IF 2.8 4区 计算机科学 Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2023-12-25 DOI: 10.1111/coin.12618
Davinder Kaur, Suleyman Uslu, Mimoza Durresi, Arjan Durresi

This study introduces a comprehensive framework designed for detecting and mitigating fake and potentially threatening user communities within 5G social networks. Leveraging geo-location data, community trust dynamics, and AI-driven community detection algorithms, this framework aims to pinpoint users posing potential harm. Including an artificial control model facilitates the selection of suitable community detection algorithms, coupled with a trust-based strategy to effectively identify and filter potential attackers. A distinctive feature of this framework lies in its ability to consider attributes that prove challenging for malicious users to emulate, such as the established trust within the community, geographical location, and adaptability to diverse attack scenarios. To validate its efficacy, we illustrate the framework using synthetic social network data, demonstrating its ability to distinguish potential malicious users from trustworthy ones.

本研究介绍了一个综合框架,旨在检测和减轻 5G 社交网络中的虚假和潜在威胁用户社区。该框架利用地理位置数据、社区信任动态和人工智能驱动的社区检测算法,旨在找出具有潜在危害的用户。人工控制模型有助于选择合适的社区检测算法,并结合基于信任的策略来有效识别和过滤潜在的攻击者。该框架的一个显著特点在于它能够考虑恶意用户难以模仿的属性,如社区内已建立的信任、地理位置和对不同攻击场景的适应性。为了验证该框架的有效性,我们使用合成社交网络数据对其进行了说明,证明了该框架区分潜在恶意用户和可信用户的能力。
{"title":"Artificial intelligence control for trust-based detection of attackers in 5G social networks","authors":"Davinder Kaur,&nbsp;Suleyman Uslu,&nbsp;Mimoza Durresi,&nbsp;Arjan Durresi","doi":"10.1111/coin.12618","DOIUrl":"10.1111/coin.12618","url":null,"abstract":"<p>This study introduces a comprehensive framework designed for detecting and mitigating fake and potentially threatening user communities within 5G social networks. Leveraging geo-location data, community trust dynamics, and AI-driven community detection algorithms, this framework aims to pinpoint users posing potential harm. Including an artificial control model facilitates the selection of suitable community detection algorithms, coupled with a trust-based strategy to effectively identify and filter potential attackers. A distinctive feature of this framework lies in its ability to consider attributes that prove challenging for malicious users to emulate, such as the established trust within the community, geographical location, and adaptability to diverse attack scenarios. To validate its efficacy, we illustrate the framework using synthetic social network data, demonstrating its ability to distinguish potential malicious users from trustworthy ones.</p>","PeriodicalId":55228,"journal":{"name":"Computational Intelligence","volume":"40 1","pages":""},"PeriodicalIF":2.8,"publicationDate":"2023-12-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139056688","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Tripartite-structure transformer for hyperspectral image classification 用于高光谱图像分类的三方结构变换器
IF 2.8 4区 计算机科学 Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2023-12-21 DOI: 10.1111/coin.12611
Liuwei Wan, Meili Zhou, Shengqin Jiang, Zongwen Bai, Haokui Zhang

Hyperspectral images contain rich spatial and spectral information, which provides a strong basis for distinguishing different land-cover objects. Therefore, hyperspectral image (HSI) classification has been a hot research topic. With the advent of deep learning, convolutional neural networks (CNNs) have become a popular method for hyperspectral image classification. However, convolutional neural network (CNN) has strong local feature extraction ability but cannot deal with long-distance dependence well. Vision Transformer (ViT) is a recent development that can address this limitation, but it is not effective in extracting local features and has low computational efficiency. To overcome these drawbacks, we propose a hybrid classification network that combines the strengths of both CNN and ViT, names Spatial-Spectral Former(SSF). The shallow layer employs 3D convolution to extract local features and reduce data dimensions. The deep layer employs a spectral-spatial transformer module for global feature extraction and information enhancement in spectral and spatial dimensions. Our proposed model achieves promising results on widely used public HSI datasets compared to other deep learning methods, including CNN, ViT, and hybrid models.

高光谱图像包含丰富的空间和光谱信息,为区分不同的陆地覆盖物提供了坚实的基础。因此,高光谱图像(HSI)分类一直是研究热点。随着深度学习技术的发展,卷积神经网络(CNN)已成为高光谱图像分类的一种流行方法。然而,卷积神经网络(CNN)具有很强的局部特征提取能力,却不能很好地处理长距离依赖关系。视觉变换器(ViT)是最近开发的一种可以解决这一局限性的方法,但它在提取局部特征方面效果不佳,而且计算效率较低。为了克服这些缺点,我们提出了一种混合分类网络,它结合了 CNN 和 ViT 的优点,名为空间-频谱前置(SSF)。浅层采用三维卷积来提取局部特征并降低数据维度。深层采用频谱-空间变换器模块来提取全局特征,并在频谱和空间维度上增强信息。与其他深度学习方法(包括 CNN、ViT 和混合模型)相比,我们提出的模型在广泛使用的公共 HSI 数据集上取得了可喜的成果。
{"title":"Tripartite-structure transformer for hyperspectral image classification","authors":"Liuwei Wan,&nbsp;Meili Zhou,&nbsp;Shengqin Jiang,&nbsp;Zongwen Bai,&nbsp;Haokui Zhang","doi":"10.1111/coin.12611","DOIUrl":"10.1111/coin.12611","url":null,"abstract":"<p>Hyperspectral images contain rich spatial and spectral information, which provides a strong basis for distinguishing different land-cover objects. Therefore, hyperspectral image (HSI) classification has been a hot research topic. With the advent of deep learning, convolutional neural networks (CNNs) have become a popular method for hyperspectral image classification. However, convolutional neural network (CNN) has strong local feature extraction ability but cannot deal with long-distance dependence well. Vision Transformer (ViT) is a recent development that can address this limitation, but it is not effective in extracting local features and has low computational efficiency. To overcome these drawbacks, we propose a hybrid classification network that combines the strengths of both CNN and ViT, names Spatial-Spectral Former(SSF). The shallow layer employs 3D convolution to extract local features and reduce data dimensions. The deep layer employs a spectral-spatial transformer module for global feature extraction and information enhancement in spectral and spatial dimensions. Our proposed model achieves promising results on widely used public HSI datasets compared to other deep learning methods, including CNN, ViT, and hybrid models.</p>","PeriodicalId":55228,"journal":{"name":"Computational Intelligence","volume":"40 1","pages":""},"PeriodicalIF":2.8,"publicationDate":"2023-12-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138826105","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Enhancing visual question answering with a two-way co-attention mechanism and integrated multimodal features 利用双向共同注意机制和综合多模态特征增强视觉问题解答能力
IF 2.8 4区 计算机科学 Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2023-12-21 DOI: 10.1111/coin.12624
Mayank Agrawal, Anand Singh Jalal, Himanshu Sharma

In Visual question answering (VQA), a natural language answer is generated for a given image and a question related to that image. There is a significant growth in the VQA task by applying an efficient attention mechanism. However, current VQA models use region features or object features that are not adequate to improve the accuracy of generated answers. To deal with this issue, we have used a Two-way Co-Attention Mechanism (TCAM), which is capable enough to fuse different visual features (region, object, and concept) from diverse perspectives. These diverse features lead to different sets of answers, and also, there is an inherent relationship between these visual features. We have developed a powerful attention mechanism that uses these two critical aspects by using both bottom-up and top-down TCAM to extract discriminative feature information. We have proposed a Collective Feature Integration Module (CFIM) to combine multimodal attention features and thus capture the valuable information from these visual features by employing a TCAM. Further, we have formulated a Vertical CFIM for fusing the features belonging to the same class and a Horizontal CFIM for combining the features belonging to different types, thus balancing the influence of top-down and bottom-up co-attention. The experiments are conducted on two significant datasets, VQA 1.0 and VQA 2.0. On VQA 1.0, the overall accuracy of our proposed method is 71.23 on the test-dev set and 71.94 on the test-std set. On VQA 2.0, the overall accuracy of our proposed method is 75.89 on the test-dev set and 76.32 on the test-std set. The above overall accuracy clearly reflecting the superiority of our proposed TCAM based approach over the existing methods.

在视觉问题解答(VQA)中,针对给定图像和与该图像相关的问题生成自然语言答案。通过应用高效的注意力机制,VQA 任务有了显著增长。然而,当前的 VQA 模型使用的区域特征或对象特征不足以提高生成答案的准确性。为了解决这个问题,我们采用了双向协同注意机制(TCAM),它能够从不同角度融合不同的视觉特征(区域、物体和概念)。这些不同的特征会产生不同的答案,而且这些视觉特征之间存在着内在联系。我们开发了一种强大的注意力机制,通过使用自下而上和自上而下的 TCAM 来提取辨别特征信息,从而利用这两个关键方面。我们提出了集体特征整合模块(CFIM),通过使用 TCAM 来组合多模态注意特征,从而从这些视觉特征中获取有价值的信息。此外,我们还提出了一个用于融合同类特征的垂直特征集成模块(Vertical CFIM)和一个用于融合不同类型特征的水平特征集成模块(Horizontal CFIM),从而平衡了自上而下和自下而上共同注意的影响。实验在两个重要的数据集 VQA 1.0 和 VQA 2.0 上进行。在 VQA 1.0 上,我们提出的方法在 test-dev 集上的总体准确率为 71.23,在 test-std 集上的准确率为 71.94。在 VQA 2.0 上,我们提出的方法在 test-dev 集上的总体准确率为 75.89,在 test-std 集上的总体准确率为 76.32。上述总体准确率清楚地反映了我们提出的基于 TCAM 的方法优于现有方法。
{"title":"Enhancing visual question answering with a two-way co-attention mechanism and integrated multimodal features","authors":"Mayank Agrawal,&nbsp;Anand Singh Jalal,&nbsp;Himanshu Sharma","doi":"10.1111/coin.12624","DOIUrl":"10.1111/coin.12624","url":null,"abstract":"<p>In Visual question answering (VQA), a natural language answer is generated for a given image and a question related to that image. There is a significant growth in the VQA task by applying an efficient attention mechanism. However, current VQA models use region features or object features that are not adequate to improve the accuracy of generated answers. To deal with this issue, we have used a Two-way Co-Attention Mechanism (TCAM), which is capable enough to fuse different visual features (region, object, and concept) from diverse perspectives. These diverse features lead to different sets of answers, and also, there is an inherent relationship between these visual features. We have developed a powerful attention mechanism that uses these two critical aspects by using both bottom-up and top-down TCAM to extract discriminative feature information. We have proposed a Collective Feature Integration Module (CFIM) to combine multimodal attention features and thus capture the valuable information from these visual features by employing a TCAM. Further, we have formulated a Vertical CFIM for fusing the features belonging to the same class and a Horizontal CFIM for combining the features belonging to different types, thus balancing the influence of top-down and bottom-up co-attention. The experiments are conducted on two significant datasets, VQA 1.0 and VQA 2.0. On VQA 1.0, the overall accuracy of our proposed method is 71.23 on the test-dev set and 71.94 on the test-std set. On VQA 2.0, the overall accuracy of our proposed method is 75.89 on the test-dev set and 76.32 on the test-std set. The above overall accuracy clearly reflecting the superiority of our proposed TCAM based approach over the existing methods.</p>","PeriodicalId":55228,"journal":{"name":"Computational Intelligence","volume":"40 1","pages":""},"PeriodicalIF":2.8,"publicationDate":"2023-12-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138951447","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Feature redundancy removal for text classification using correlated feature subsets 利用相关特征子集去除文本分类中的特征冗余
IF 2.8 4区 计算机科学 Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2023-12-21 DOI: 10.1111/coin.12621
Lazhar Farek, Amira Benaidja

The curse of high dimensionality in text classification is a worrisome problem that requires efficient and optimal feature selection (FS) methods to improve classification accuracy and reduce learning time. Existing filter-based FS methods evaluate features independently of other related ones, which can then lead to selecting a large number of redundant features, especially in high-dimensional datasets, resulting in more learning time and less classification performance, whereas information theory-based methods aim to maximize feature dependency with the class variable and minimize its redundancy for all selected features, which gradually becomes impractical when increasing the feature space. To overcome the time complexity issue of information theory-based methods while taking into account the redundancy issue, in this article, we propose a new feature selection method for text classification termed correlation-based redundancy removal, which aims to minimize the redundancy using subsets of features having close mutual information scores without sequentially seeking already selected features. The idea is that it is not important to assess the redundancy of a dominant feature having high classification information with another irrelevant feature having low classification information and vice-versa since they are implicitly weakly correlated. Our method, tested on seven datasets using both traditional classifiers (Naive Bayes and support vector machines) and deep learning models (long short-term memory and convolutional neural networks), demonstrated strong performance by reducing redundancy and improving classification compared to ten competitive metrics.

文本分类中的高维诅咒是一个令人担忧的问题,需要高效、优化的特征选择(FS)方法来提高分类准确率并缩短学习时间。现有的基于滤波器的特征选择方法在评估特征时不考虑其他相关特征,这可能会导致选择大量冗余特征,尤其是在高维数据集中,从而导致学习时间增加,分类性能降低;而基于信息论的方法旨在最大化特征与类变量的依赖性,并最小化所有被选特征的冗余度,当特征空间增大时,这种方法逐渐变得不切实际。为了克服基于信息论方法的时间复杂性问题,同时考虑到冗余问题,我们在本文中提出了一种新的文本分类特征选择方法,即基于相关性的冗余去除方法,其目的是使用互信息得分接近的特征子集来最小化冗余,而不需要依次寻找已选定的特征。我们的想法是,评估一个具有高分类信息的主要特征与另一个具有低分类信息的不相关特征之间的冗余度并不重要,反之亦然,因为它们之间隐含着弱相关性。我们的方法使用传统分类器(奈夫贝叶斯和支持向量机)和深度学习模型(长短期记忆和卷积神经网络)在七个数据集上进行了测试,与十个竞争指标相比,我们的方法减少了冗余,提高了分类效果,表现出了强劲的性能。
{"title":"Feature redundancy removal for text classification using correlated feature subsets","authors":"Lazhar Farek,&nbsp;Amira Benaidja","doi":"10.1111/coin.12621","DOIUrl":"10.1111/coin.12621","url":null,"abstract":"<p>The curse of high dimensionality in text classification is a worrisome problem that requires efficient and optimal feature selection (FS) methods to improve classification accuracy and reduce learning time. Existing filter-based FS methods evaluate features independently of other related ones, which can then lead to selecting a large number of redundant features, especially in high-dimensional datasets, resulting in more learning time and less classification performance, whereas information theory-based methods aim to maximize feature dependency with the class variable and minimize its redundancy for all selected features, which gradually becomes impractical when increasing the feature space. To overcome the time complexity issue of information theory-based methods while taking into account the redundancy issue, in this article, we propose a new feature selection method for text classification termed correlation-based redundancy removal, which aims to minimize the redundancy using subsets of features having close mutual information scores without sequentially seeking already selected features. The idea is that it is not important to assess the redundancy of a dominant feature having high classification information with another irrelevant feature having low classification information and vice-versa since they are implicitly weakly correlated. Our method, tested on seven datasets using both traditional classifiers (Naive Bayes and support vector machines) and deep learning models (long short-term memory and convolutional neural networks), demonstrated strong performance by reducing redundancy and improving classification compared to ten competitive metrics.</p>","PeriodicalId":55228,"journal":{"name":"Computational Intelligence","volume":"40 1","pages":""},"PeriodicalIF":2.8,"publicationDate":"2023-12-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138952032","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Sentiment analysis on Hindi tweets during COVID-19 pandemic COVID-19 大流行期间印地语推文的情感分析
IF 2.8 4区 计算机科学 Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2023-12-20 DOI: 10.1111/coin.12622
Anita Saroj, Akash Thakur, Sukomal Pal

A gap among the people has been created due to a lack of social interactions. The physical void has led to an increase in online interaction among users on social media platforms. Sentiment analysis of such interactions can help us analyze the general public psychology during the pandemic. However, the lack of data in non-English and low-resource languages like ‘Hindi’ makes it difficult to study it among native and non-English speaking masses. Here, we create a small collection of ‘Hindi’ tweets on COVID-19 during the pandemic containing 10,011 tweets for sentiment analysis, which is named as sentiment analysis for Hindi (SAFH). In this article, we describe the process of collecting, creating, annotating the corpus, and sentiment classification. The claims have been verified using different word embedding with a deep learning classifier through the proposed model. The achieved accuracy of the proposed model yields up to a permissible rate of 90.9%.

由于缺乏社交互动,人们之间产生了隔阂。身体上的空白导致用户在社交媒体平台上的在线互动增加。对此类互动的情感分析有助于我们分析大流行病期间的大众心理。然而,由于缺乏非英语和低资源语言(如 "印地语")的数据,因此很难对母语和非英语大众进行研究。在此,我们在 COVID-19 上创建了一个包含 10,011 条大流行期间 "印地语 "推文的小型情感分析集合,并将其命名为印地语情感分析(SAFH)。在本文中,我们将介绍收集、创建、注释语料库和情感分类的过程。通过所提出的模型,使用深度学习分类器对不同的词嵌入进行了验证。该模型的准确率高达 90.9%。
{"title":"Sentiment analysis on Hindi tweets during COVID-19 pandemic","authors":"Anita Saroj,&nbsp;Akash Thakur,&nbsp;Sukomal Pal","doi":"10.1111/coin.12622","DOIUrl":"10.1111/coin.12622","url":null,"abstract":"<p>A gap among the people has been created due to a lack of social interactions. The physical void has led to an increase in online interaction among users on social media platforms. Sentiment analysis of such interactions can help us analyze the general public psychology during the pandemic. However, the lack of data in non-English and low-resource languages like ‘Hindi’ makes it difficult to study it among native and non-English speaking masses. Here, we create a small collection of ‘Hindi’ tweets on COVID-19 during the pandemic containing 10,011 tweets for sentiment analysis, which is named as sentiment analysis for Hindi (SAFH). In this article, we describe the process of collecting, creating, annotating the corpus, and sentiment classification. The claims have been verified using different word embedding with a deep learning classifier through the proposed model. The achieved accuracy of the proposed model yields up to a permissible rate of 90.9%.</p>","PeriodicalId":55228,"journal":{"name":"Computational Intelligence","volume":"40 1","pages":""},"PeriodicalIF":2.8,"publicationDate":"2023-12-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138826070","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Intelligent IoT framework with GAN-synthesized images for enhanced defect detection in manufacturing 利用 GAN 合成图像的智能物联网框架增强制造业缺陷检测能力
IF 2.8 4区 计算机科学 Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2023-12-18 DOI: 10.1111/coin.12619
Somrawee Aramkul, Prompong Sugunnasil

The manufacturing industry is always exploring techniques to optimize processes, increase product quality, and more accurately identify defects. The technique of deep learning is the strategy that will be used to handle the issues presented. However, the challenge of using AI in this domain is the small and imbalanced dataset for training affected by the severe shortage of defective data. Moreover, data acquisition requires significant labor, time, and resources. In response to these demands, this research presents an intelligent internet of things (IoT) framework enriched by generative adversarial network (GAN). This framework was developed in response to the needs outlined above. The framework applies the IoT for real-time data collection and communication, while GAN are utilized to synthesize high-fidelity images of manufacturing defects. The quality of the GAN-synthesized image is quantified by the average FID score of 8.312 for non-defective images and 7.459 for defective images. As evidenced by the similarity between the distributions of synthetic and real images, the proposed generative model can generate visually authentic and high-fidelity images. As demonstrated by the results of the defect detection experiments, the accuracy can be enhanced to the maximum of 96.5% by integrating the GAN-synthesized images with the real images. Concurrently, this integration reduces the occurrence of false alarms.

制造业一直在探索优化流程、提高产品质量和更准确地识别缺陷的技术。深度学习技术就是用于处理上述问题的策略。然而,在这一领域使用人工智能所面临的挑战是,受缺陷数据严重不足的影响,用于训练的数据集较小且不平衡。此外,数据采集需要大量的人力、时间和资源。针对这些需求,本研究提出了一个由生成式对抗网络(GAN)丰富的智能物联网(IoT)框架。该框架就是针对上述需求开发的。该框架将物联网用于实时数据收集和通信,同时利用生成式对抗网络合成制造缺陷的高保真图像。非缺陷图像的平均 FID 分数为 8.312,缺陷图像的平均 FID 分数为 7.459,以此来量化 GAN 合成图像的质量。从合成图像和真实图像分布的相似性可以看出,所提出的生成模型可以生成视觉上真实的高保真图像。缺陷检测实验结果表明,通过将 GAN 合成图像与真实图像整合,准确率最高可提高到 96.5%。同时,这种整合还能减少误报的发生。
{"title":"Intelligent IoT framework with GAN-synthesized images for enhanced defect detection in manufacturing","authors":"Somrawee Aramkul,&nbsp;Prompong Sugunnasil","doi":"10.1111/coin.12619","DOIUrl":"10.1111/coin.12619","url":null,"abstract":"<p>The manufacturing industry is always exploring techniques to optimize processes, increase product quality, and more accurately identify defects. The technique of deep learning is the strategy that will be used to handle the issues presented. However, the challenge of using AI in this domain is the small and imbalanced dataset for training affected by the severe shortage of defective data. Moreover, data acquisition requires significant labor, time, and resources. In response to these demands, this research presents an intelligent internet of things (IoT) framework enriched by generative adversarial network (GAN). This framework was developed in response to the needs outlined above. The framework applies the IoT for real-time data collection and communication, while GAN are utilized to synthesize high-fidelity images of manufacturing defects. The quality of the GAN-synthesized image is quantified by the average FID score of 8.312 for non-defective images and 7.459 for defective images. As evidenced by the similarity between the distributions of synthetic and real images, the proposed generative model can generate visually authentic and high-fidelity images. As demonstrated by the results of the defect detection experiments, the accuracy can be enhanced to the maximum of 96.5% by integrating the GAN-synthesized images with the real images. Concurrently, this integration reduces the occurrence of false alarms.</p>","PeriodicalId":55228,"journal":{"name":"Computational Intelligence","volume":"40 2","pages":""},"PeriodicalIF":2.8,"publicationDate":"2023-12-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138827033","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Computational Intelligence
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1