Icon最新文献

英文中文

Multi-Objective Virtual Machine Placement Algorithm Based on Improved Discrete Differential Evolution 基于改进离散差分进化的多目标虚拟机布局算法

Q3 Arts and Humanities

Icon

Pub Date : 2023-03-01 DOI: 10.1109/ICNLP58431.2023.00086

Li Liu, Wujun Yang, Zhixian Chang

Aiming at the problem of high energy consumption and resource fragmentation caused by unbalanced multidimensional resource usage of servers in current cloud data centers, a virtual machine placement algorithm based on improved discrete differential evolution(IDDE) algorithm was proposed. According to the multi-dimensional resource requirements of virtual machines, the population initialization was used to improve the convergence speed of the algorithm, and the discrete differential mutation and crossover operations were used to ensure the diversity of the population. A multi-group elite selection strategy based on $varepsilon$ relaxation was proposed to select the optimal virtual machine placement scheme and enhance the global search ability of the algorithm. The simulation results show that compared with the other three algorithms such as the DE algorithm, the IDDE algorithm has a certain improvement effect in reducing energy consumption, improving resource utilization and reducing resource fragmentation.

针对当前云数据中心服务器多维资源使用不平衡导致的高能耗和资源碎片化问题，提出了一种基于改进离散差分进化(IDDE)算法的虚拟机布局算法。根据虚拟机的多维资源需求，采用种群初始化来提高算法的收敛速度，采用离散微分变异和交叉操作来保证种群的多样性。为了选择最优的虚拟机布局方案，提高算法的全局搜索能力，提出了一种基于$varepsilon$松弛的多群体精英选择策略。仿真结果表明，与DE算法等其他三种算法相比，IDDE算法在降低能耗、提高资源利用率、减少资源碎片化等方面具有一定的改进效果。

引用次数: 0

Unsupervised Contradiction Detection using Sentence Transformations 基于句子变换的无监督矛盾检测

Q3 Arts and Humanities

Icon

Pub Date : 2023-03-01 DOI: 10.1109/ICNLP58431.2023.00065

Gerrit Schumann, Jorge Marx Gómez

Contradiction detection (CD) is a subfield of Natural Language Inference (NLI) that is relevant to many domains where contradictory statements in texts should be avoided (e.g., in financial or regulatory documents). With the advent of large annotated NLI datasets, there has been an increased focus on supervised deep-learning approaches in this research area. However, since this training data does not necessarily reflect the characteristic properties of the application data, unsupervised CD approaches are still relevant for certain domains or languages. In this paper, we therefore take up a recently published unsupervised NLI approach, reproduce parts of the proposed sentence transformations, extend it with various modifications, and evaluate it for the sole task of contradiction detection. The results show that under the exclusion of certain transformations types, an accuracy of 71.42 can be achieved on the SNLI test dataset.

矛盾检测(CD)是自然语言推理(NLI)的一个子领域，与文本中应该避免矛盾陈述的许多领域相关(例如，在金融或监管文件中)。随着大型注释NLI数据集的出现，人们越来越关注该研究领域的监督深度学习方法。然而，由于这种训练数据不一定反映应用程序数据的特征属性，因此无监督CD方法仍然与某些领域或语言相关。因此，在本文中，我们采用了最近发表的一种无监督NLI方法，再现了所提出的句子转换的部分内容，用各种修改对其进行扩展，并对其进行评估，以完成矛盾检测的唯一任务。结果表明，在排除某些转换类型的情况下，在SNLI测试数据集上可以达到71.42的精度。

引用次数: 0

Multi-level Feature Extraction and Edge Reconstruction Fused Generative Adversarial Networks for Image Super Resolution 面向图像超分辨率的多层次特征提取与边缘重建融合生成对抗网络

Q3 Arts and Humanities

Icon

Pub Date : 2023-03-01 DOI: 10.1109/ICNLP58431.2023.00027

Yinghua Li, Yue Liu, Y. Liu, Yangge Qiao, Jinglu He

At present, the image super-resolution method based on convolutional neural network has achieved a very high PSNR, but the high-frequency information obtained by using the mean square error as the loss function is not sufficient, and when the scale factor is large, the detail texture of the restored image is blurred, and it is not completely consistent with the human visual perception. Therefore, this paper proposes an image super-resolution algorithm based on GAN. We modify the residual block of the original SRGAN generator network into three modules: Edge-Reconstruction network, Low-Frequency feature (LF-feature) extraction module and Residual network. The Edge-Reconstruction network reconstructs the edge of SR image, and the LF-feature extraction module extracts the low-frequency information of the image. After that, the two parts of information are fused and transmitted to Residual network to extract the high-frequency information of the image, and then the SR image is reconstructed and enlarged. And use skip connection in the network to increase the network depth. The training results show that our network has better performance in both objective evaluation indicators and subjective vision. Even with a large-scale factor, our network can recover fine texture information.

目前，基于卷积神经网络的图像超分辨率方法虽然取得了很高的PSNR，但利用均方误差作为损失函数获得的高频信息并不充分，而且当比例因子较大时，恢复图像的细节纹理模糊，与人的视觉感知不完全一致。为此，本文提出了一种基于GAN的图像超分辨率算法。我们将原有SRGAN发生器网络的残差块修改为三个模块:边缘重建网络、低频特征提取模块和残差网络。边缘重建网络重建SR图像的边缘，lf特征提取模块提取图像的低频信息。然后将两部分信息融合并传输到残差网络中提取图像的高频信息，再对SR图像进行重构和放大。并在网络中采用跳接方式，增加网络深度。训练结果表明，我们的网络在客观评价指标和主观视觉上都有较好的表现。即使是大规模的因素，我们的网络也能恢复出精细的纹理信息。

{"title":"Multi-level Feature Extraction and Edge Reconstruction Fused Generative Adversarial Networks for Image Super Resolution","authors":"Yinghua Li, Yue Liu, Y. Liu, Yangge Qiao, Jinglu He","doi":"10.1109/ICNLP58431.2023.00027","DOIUrl":"https://doi.org/10.1109/ICNLP58431.2023.00027","url":null,"abstract":"At present, the image super-resolution method based on convolutional neural network has achieved a very high PSNR, but the high-frequency information obtained by using the mean square error as the loss function is not sufficient, and when the scale factor is large, the detail texture of the restored image is blurred, and it is not completely consistent with the human visual perception. Therefore, this paper proposes an image super-resolution algorithm based on GAN. We modify the residual block of the original SRGAN generator network into three modules: Edge-Reconstruction network, Low-Frequency feature (LF-feature) extraction module and Residual network. The Edge-Reconstruction network reconstructs the edge of SR image, and the LF-feature extraction module extracts the low-frequency information of the image. After that, the two parts of information are fused and transmitted to Residual network to extract the high-frequency information of the image, and then the SR image is reconstructed and enlarged. And use skip connection in the network to increase the network depth. The training results show that our network has better performance in both objective evaluation indicators and subjective vision. Even with a large-scale factor, our network can recover fine texture information.","PeriodicalId":53637,"journal":{"name":"Icon","volume":"32 1","pages":"113-120"},"PeriodicalIF":0.0,"publicationDate":"2023-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"84894210","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Perceptual assimilation of Mandarin consonants in second language acquisition 普通话辅音在二语习得中的知觉同化

Q3 Arts and Humanities

Icon

Pub Date : 2023-03-01 DOI: 10.1109/ICNLP58431.2023.00054

Dan Du, Minghu Jiang

Research on adult cross-language speech perception suggests that adults face challenges perceiving segments distinctions that are not employed contrastively in their own language. Cross-language speech perceptual similarity has played a significant role in predicting and explaining L2 speech perception. It’s necessary to carry out a perceptual assimilation study on Mandarin which is an influencing language and has an increasing number of L2 learners around the world. The present study investigates the perception of Mandarin consonants by native Urdu speakers. To this purpose, 15 Urdu speakers were tested in an assimilation task in which they were asked to assimilate Mandarin consonants to their native language categories. The results show that native language interfere with the perception of non-native consonants in a certain extend that is in conformity to what PAM proposed. Furthermore, the results expand the application of PAM in Mandarin and provide a baseline for future relevant studies.

对成人跨语言语音感知的研究表明，成年人在感知自己语言中没有对比的词段差异时面临挑战。跨语言语音感知相似性在预测和解释二语语音感知方面发挥了重要作用。汉语是一种具有影响力的语言，在世界范围内有越来越多的第二语言学习者，因此有必要对汉语进行感知同化研究。本研究调查了以乌尔都语为母语的人对普通话辅音的感知。为此目的，15名说乌尔都语的人在同化任务中接受了测试，要求他们将普通话辅音同化到他们的母语类别中。结果表明，母语在一定程度上干扰了非母语辅音的感知，这与PAM的理论一致。此外，该研究结果拓展了PAM在普通话中的应用，为今后的相关研究提供了基础。

引用次数: 0

Intelligent Fault Diagnosis of Rolling Bearing based on VMD and Improved Self-training Semi-supervised Ensemble Learning 基于VMD和改进自训练半监督集成学习的滚动轴承智能故障诊断

Q3 Arts and Humanities

Icon

Pub Date : 2023-03-01 DOI: 10.1109/ICNLP58431.2023.00080

Xiangyu Li, Yao Liu, Gaige Chen, Jiantao Chang

Intelligent fault diagnosis of rolling bearing is of great importance to improve the predictive maintenance ability of key assets in the context of industrial big data and smart manufacturing. Due to the usually high cost or infeasibility of obtaining data labels, large amount of data is unlabeled in practical industrial scenarios, which poses a challenge for conducting data-driven bearing fault diagnosis. In view of the characteristics of non-stationary and low signal-to-noise ratio of bearing vibration signals and the fact of lacking labeled samples but there exist lots of unlabeled samples, this paper proposes an intelligent diagnosis method for bearing faults based on variational mode decomposition (VMD) and improved self-training semi-supervised ensemble learning. Firstly, the original vibration signal is decomposed into several intrinsic mode functions using VMD, then correlation coefficient criterion is used to select the bearing fault feature bands to improve the signal-to-noise ratio, then time domain features are extracted, the labeled samples are expanded by the improved self-training semisupervised learning model, and finally the bearing fault diagnosis model is established based on ensemble learning by stacking method. Through the validation on two different experimental data sets, the proposed method was able to effectively extract the bearing fault feature information and improve the model accuracy by using unlabeled data compared with typical supervised learning models and other comparative models, which can meet the demand for intelligent diagnosis of bearing fault under the scenario of lacking labeled samples in real industries.

在工业大数据和智能制造背景下，滚动轴承智能故障诊断对提高关键资产的预测性维护能力具有重要意义。由于获取数据标签的成本通常较高或不可行，在实际工业场景中大量数据未被标记，这对进行数据驱动的轴承故障诊断提出了挑战。针对轴承振动信号的非平稳和低信噪比的特点，以及缺乏标记样本而存在大量未标记样本的事实，提出了一种基于变分模态分解(VMD)和改进的自训练半监督集成学习的轴承故障智能诊断方法。首先利用VMD将原始振动信号分解为多个固有模态函数，然后利用相关系数准则选择轴承故障特征带以提高信噪比，然后提取时域特征，利用改进的自训练半监督学习模型对标记样本进行扩展，最后利用叠加法建立基于集成学习的轴承故障诊断模型。通过在两个不同的实验数据集上的验证，与典型的监督学习模型和其他比较模型相比，所提出的方法能够有效地提取轴承故障特征信息，并提高了使用无标记数据的模型精度，能够满足实际工业中缺乏标记样本情况下轴承故障智能诊断的需求。

{"title":"Intelligent Fault Diagnosis of Rolling Bearing based on VMD and Improved Self-training Semi-supervised Ensemble Learning","authors":"Xiangyu Li, Yao Liu, Gaige Chen, Jiantao Chang","doi":"10.1109/ICNLP58431.2023.00080","DOIUrl":"https://doi.org/10.1109/ICNLP58431.2023.00080","url":null,"abstract":"Intelligent fault diagnosis of rolling bearing is of great importance to improve the predictive maintenance ability of key assets in the context of industrial big data and smart manufacturing. Due to the usually high cost or infeasibility of obtaining data labels, large amount of data is unlabeled in practical industrial scenarios, which poses a challenge for conducting data-driven bearing fault diagnosis. In view of the characteristics of non-stationary and low signal-to-noise ratio of bearing vibration signals and the fact of lacking labeled samples but there exist lots of unlabeled samples, this paper proposes an intelligent diagnosis method for bearing faults based on variational mode decomposition (VMD) and improved self-training semi-supervised ensemble learning. Firstly, the original vibration signal is decomposed into several intrinsic mode functions using VMD, then correlation coefficient criterion is used to select the bearing fault feature bands to improve the signal-to-noise ratio, then time domain features are extracted, the labeled samples are expanded by the improved self-training semisupervised learning model, and finally the bearing fault diagnosis model is established based on ensemble learning by stacking method. Through the validation on two different experimental data sets, the proposed method was able to effectively extract the bearing fault feature information and improve the model accuracy by using unlabeled data compared with typical supervised learning models and other comparative models, which can meet the demand for intelligent diagnosis of bearing fault under the scenario of lacking labeled samples in real industries.","PeriodicalId":53637,"journal":{"name":"Icon","volume":"71 1","pages":"405-413"},"PeriodicalIF":0.0,"publicationDate":"2023-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"84010040","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Multi-user Computing Offloading Based on Deep Reinforcement Learning 基于深度强化学习的多用户计算卸载

Q3 Arts and Humanities

Icon

Pub Date : 2023-03-01 DOI: 10.1109/icnlp58431.2023.00091

Liyuan Feng, Wujun Yang

With the rise of mobile edge computing, how to deal with the problem of edge computing task offloading has become one of the research hotspots. In order to solve the problem of serious congestion on wireless communication link caused by multi-users unloading to MEC server at the same time and competition for server computing resources among multi-user tasks after unloading, a joint optimization method for offloading decision and resource allocation was proposed. In this paper, a system task offloading model based on OFDMA technology is proposed, which takes into account the intensive and indivisible task resources generated by each user device. On this basis, a dynamic task offloading and resource allocation algorithm based on Nature DQN is proposed to solve the multi-client optimal offloading decision and multi-client computing resource allocation scheme. Finally, the simulation results show that the proposed task offloading model and the computational offloading algorithm based on Nature DQN are effective in optimizing the total delay of the long-term system.

随着移动边缘计算的兴起，如何处理边缘计算任务卸载问题成为研究热点之一。为了解决多用户同时卸载到MEC服务器造成的无线通信链路严重拥塞以及卸载后多用户任务之间对服务器计算资源的竞争问题，提出了一种卸载决策与资源分配的联合优化方法。本文提出了一种基于OFDMA技术的系统任务分流模型，该模型考虑到每个用户设备产生的任务资源密集且不可分割。在此基础上，提出了一种基于自然DQN的动态任务卸载和资源分配算法，解决了多客户端最优卸载决策和多客户端计算资源分配方案。最后，仿真结果表明，所提出的任务卸载模型和基于自然DQN的计算式卸载算法在优化长期系统总延迟方面是有效的。

引用次数: 0

Named Entity Recognition Method Based on BERT-whitening and Dynamic Fusion Model 基于bert -白化和动态融合模型的命名实体识别方法

Q3 Arts and Humanities

Icon

Pub Date : 2023-03-01 DOI: 10.1109/ICNLP58431.2023.00041

Meng Liang, Yao Shi

In the context of Natural Language Processing (NLP), Named Entity Recognition (NER) plays a crucial role in tasks like entity relationship extraction and knowledge graph construction. The accuracy of Chinese NER heavily relies on the representation of word embeddings. However, traditional word representation methods like word2vec suffer from word ambiguity and singular word vectors. Similarly, BERT-based word embeddings also exhibit anisotropy. To tackle these challenges, we propose a novel NER method that leverages BERT-whitening and dynamic fusion of BERT’s output from different layers. The dynamic fusion module calculates a weighted sum of BERT’s output across multiple layers, while the whitening module applies a whitening operation to eliminate the anisotropy of word embeddings. By integrating these modules, our model effectively captures the characteristics of input words, providing robust support for subsequent decoding. We evaluate our approach on the CLUENER2020 Chinese fine-grained named entity recognition dataset. Experimental results demonstrate that our method outperforms the traditional BERT-BiLSTM-CRF model without external resources and data expansion, leading to significant improvements in performance.

在自然语言处理(NLP)的背景下，命名实体识别(NER)在实体关系提取和知识图谱构建等任务中起着至关重要的作用。中文NER识别的准确率很大程度上依赖于词嵌入的表示。然而，传统的词表示方法如word2vec存在词歧义和词向量奇异的问题。同样，基于bert的词嵌入也表现出各向异性。为了解决这些挑战，我们提出了一种新的NER方法，该方法利用BERT美白和不同层的BERT输出的动态融合。动态融合模块计算BERT跨多层输出的加权和，而美白模块应用美白操作来消除词嵌入的各向异性。通过集成这些模块，我们的模型有效地捕获了输入词的特征，为后续解码提供了强大的支持。我们在CLUENER2020中文细粒度命名实体识别数据集上评估了我们的方法。实验结果表明，该方法优于传统的BERT-BiLSTM-CRF模型，无需外部资源和数据扩展，性能显著提高。

{"title":"Named Entity Recognition Method Based on BERT-whitening and Dynamic Fusion Model","authors":"Meng Liang, Yao Shi","doi":"10.1109/ICNLP58431.2023.00041","DOIUrl":"https://doi.org/10.1109/ICNLP58431.2023.00041","url":null,"abstract":"In the context of Natural Language Processing (NLP), Named Entity Recognition (NER) plays a crucial role in tasks like entity relationship extraction and knowledge graph construction. The accuracy of Chinese NER heavily relies on the representation of word embeddings. However, traditional word representation methods like word2vec suffer from word ambiguity and singular word vectors. Similarly, BERT-based word embeddings also exhibit anisotropy. To tackle these challenges, we propose a novel NER method that leverages BERT-whitening and dynamic fusion of BERT’s output from different layers. The dynamic fusion module calculates a weighted sum of BERT’s output across multiple layers, while the whitening module applies a whitening operation to eliminate the anisotropy of word embeddings. By integrating these modules, our model effectively captures the characteristics of input words, providing robust support for subsequent decoding. We evaluate our approach on the CLUENER2020 Chinese fine-grained named entity recognition dataset. Experimental results demonstrate that our method outperforms the traditional BERT-BiLSTM-CRF model without external resources and data expansion, leading to significant improvements in performance.","PeriodicalId":53637,"journal":{"name":"Icon","volume":"105 1","pages":"191-197"},"PeriodicalIF":0.0,"publicationDate":"2023-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"88991284","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Research on Switching Strategy with Reinforcement Learning and Game Theory in Satellite-Terrestrial Integrated Networks 基于强化学习和博弈论的星地集成网络切换策略研究

Q3 Arts and Humanities

Icon

Pub Date : 2023-03-01 DOI: 10.1109/icnlp58431.2023.00088

Shihao Fei, Junxuan Wang, Fan Jiang, Yuan Ren, Senhu Zhou

In Satellite-Terrestrial Integrated Networks (STIN), from the perspective of increasing the capacity of the networks, the user experience, and the adaptability to high-speed motion occasions, a non-cooperative multi-service network selection scheme based on Q-learning and game theory (QRSG) is proposed. QRSG first obtains the multi-service network utility through the fuzzy process and uses it as the reward of Q-learning. The state of Q-learning includes the quality of service (QoS) and price attributes of the network currently connected by the user, as well as the situation of the user speed. The corresponding network selection strategy is the action of Q-learning. Then, the user predicts the payoff of the network selection strategy through a game algorithm to avoid access to an overloaded network. In addition, Binary Exponential Backoff Algorithm is introduced in QRSG to solve the problem of inaccurate throughput prediction in the scenario where multiple users concurrently switch to the same service node (SN). Simulations reveal that: 1) With QRSG, users with different speeds and QoS requirements can adaptively switch to the most suitable network. 2) Compared with the existing algorithms, QRSG can increase network throughput by more than 8% and reduce the total number of switching by about 60% in the case of a maximum loss of 1 to 2% of the system fairness.

在星地融合网络(STIN)中，从提高网络容量、用户体验和适应高速运动场合的角度出发，提出了一种基于q -学习和博弈论(QRSG)的非合作多业务网络选择方案。QRSG首先通过模糊过程得到多业务网络效用，并将其作为q学习的奖励。Q-learning的状态包括用户当前连接的网络的QoS (quality of service)和价格属性，以及用户的网速情况。相应的网络选择策略是Q-learning的动作。然后，用户通过博弈算法预测网络选择策略的收益，以避免进入过载的网络。此外，在QRSG中引入了二进制指数回退算法(Binary Exponential Backoff Algorithm)，解决了多个用户同时切换到同一业务节点(SN)的场景下吞吐量预测不准确的问题。仿真结果表明:1)使用QRSG，不同速率和QoS要求的用户可以自适应切换到最适合的网络。2)与现有算法相比，在系统公平性最大损失1 ~ 2%的情况下，QRSG可使网络吞吐量提高8%以上，总交换次数减少约60%。

{"title":"Research on Switching Strategy with Reinforcement Learning and Game Theory in Satellite-Terrestrial Integrated Networks","authors":"Shihao Fei, Junxuan Wang, Fan Jiang, Yuan Ren, Senhu Zhou","doi":"10.1109/icnlp58431.2023.00088","DOIUrl":"https://doi.org/10.1109/icnlp58431.2023.00088","url":null,"abstract":"In Satellite-Terrestrial Integrated Networks (STIN), from the perspective of increasing the capacity of the networks, the user experience, and the adaptability to high-speed motion occasions, a non-cooperative multi-service network selection scheme based on Q-learning and game theory (QRSG) is proposed. QRSG first obtains the multi-service network utility through the fuzzy process and uses it as the reward of Q-learning. The state of Q-learning includes the quality of service (QoS) and price attributes of the network currently connected by the user, as well as the situation of the user speed. The corresponding network selection strategy is the action of Q-learning. Then, the user predicts the payoff of the network selection strategy through a game algorithm to avoid access to an overloaded network. In addition, Binary Exponential Backoff Algorithm is introduced in QRSG to solve the problem of inaccurate throughput prediction in the scenario where multiple users concurrently switch to the same service node (SN). Simulations reveal that: 1) With QRSG, users with different speeds and QoS requirements can adaptively switch to the most suitable network. 2) Compared with the existing algorithms, QRSG can increase network throughput by more than 8% and reduce the total number of switching by about 60% in the case of a maximum loss of 1 to 2% of the system fairness.","PeriodicalId":53637,"journal":{"name":"Icon","volume":"6 1","pages":"458-463"},"PeriodicalIF":0.0,"publicationDate":"2023-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"88838960","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Target Tracking Algorithm Based on Mixed Attention and Siamese Network 基于混合注意力和暹罗网络的目标跟踪算法

Q3 Arts and Humanities

Icon

Pub Date : 2023-03-01 DOI: 10.1109/ICNLP58431.2023.00013

Haibo Ge, Yu An, Wenhao He, Haodong Feng, Chaofeng Huang, Shuxian Wang

Siamese convolutional neural network, which is a classic framework for object tracking, has received extensive attention from the research community. The method uses a convolutional neural network to obtain target features and matches them with the search area features to achieve target tracking. Aiming at the problems that multi-layer features are difficult to extract effectively and network model parameters are complex, a target tracking algorithm (MA-SiamRPN++) with mixed attention mechanism is proposed based on SiamRPN++. Firstly, the channel attention mechanism is inserted into the backbone network, and then the output features of the channel attention network are fed into the spatial attention network, so as to improve the efficiency of feature extraction in different convolution layers by using mixed attention. At the same time, the deep cross -correlation network is used to better retain the feature information that is conducive to tracking and reduce the parameter complexity of the network to maintain the tracking speed. Finally, experiments on OTB100, VOT2016, and the long-term tracking dataset LaSOT show that the tracker proposed in this paper achieves higher accuracy and success rate than other state-of-the-art trackers.

连体卷积神经网络作为一种经典的目标跟踪框架，受到了研究界的广泛关注。该方法利用卷积神经网络获取目标特征，并与搜索区域特征进行匹配，实现目标跟踪。针对多层特征难以有效提取和网络模型参数复杂的问题，提出了一种基于siamrpn++的混合关注机制的目标跟踪算法(ma - siamrpn++)。首先将通道注意机制插入到骨干网络中，然后将通道注意网络的输出特征输入到空间注意网络中，利用混合注意提高不同卷积层的特征提取效率。同时，利用深度互相关网络更好地保留有利于跟踪的特征信息，降低网络参数复杂度，保持跟踪速度。最后，在OTB100、VOT2016和长期跟踪数据集LaSOT上的实验表明，本文提出的跟踪器比其他最先进的跟踪器具有更高的精度和成功率。

{"title":"Target Tracking Algorithm Based on Mixed Attention and Siamese Network","authors":"Haibo Ge, Yu An, Wenhao He, Haodong Feng, Chaofeng Huang, Shuxian Wang","doi":"10.1109/ICNLP58431.2023.00013","DOIUrl":"https://doi.org/10.1109/ICNLP58431.2023.00013","url":null,"abstract":"Siamese convolutional neural network, which is a classic framework for object tracking, has received extensive attention from the research community. The method uses a convolutional neural network to obtain target features and matches them with the search area features to achieve target tracking. Aiming at the problems that multi-layer features are difficult to extract effectively and network model parameters are complex, a target tracking algorithm (MA-SiamRPN++) with mixed attention mechanism is proposed based on SiamRPN++. Firstly, the channel attention mechanism is inserted into the backbone network, and then the output features of the channel attention network are fed into the spatial attention network, so as to improve the efficiency of feature extraction in different convolution layers by using mixed attention. At the same time, the deep cross -correlation network is used to better retain the feature information that is conducive to tracking and reduce the parameter complexity of the network to maintain the tracking speed. Finally, experiments on OTB100, VOT2016, and the long-term tracking dataset LaSOT show that the tracker proposed in this paper achieves higher accuracy and success rate than other state-of-the-art trackers.","PeriodicalId":53637,"journal":{"name":"Icon","volume":"67 1","pages":"25-30"},"PeriodicalIF":0.0,"publicationDate":"2023-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"77061572","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Design and Optimization of Loss Functions in Fine-grained Sentiment and Emotion Analysis 细粒度情感与情绪分析中损失函数的设计与优化

Q3 Arts and Humanities

Icon

Pub Date : 2023-03-01 DOI: 10.1109/ICNLP58431.2023.00037

Shiren Ye, Ding Li, Ali Md Rinku

The unbalanced distribution of category labels and the correlation between these labels tend to cause over-learning issues in deep learning models. In fine-grained sentiment analysis datasets, the correlation between category labels and the heterogeneity of tag distribution are prominent. In the deep learning model, we use the adjusted circle-loss to introduce margin and gradient attenuation in the loss function to handle the challenges caused by unbalanced label distribution and non-independence between labels. This method can be well combined with pre-trained models and adapt to various learning models and algorithms. Compared with the state-of-the-art typical models, our loss function mechanism achieves significant improvement using SemEval18 and GoeEmotions by measure of Jaccard coefficient, micro-F1, and macro-F1. It implies that our solution could work efficiently for sentiment analysis and sentiment analysis tasks.

类别标签的不平衡分布以及这些标签之间的相关性往往会导致深度学习模型中的过度学习问题。在细粒度情感分析数据集中，类别标签之间的相关性和标签分布的异质性是突出的。在深度学习模型中，我们使用调整后的圆损失在损失函数中引入边际和梯度衰减，以处理标签分布不平衡和标签之间不独立所带来的挑战。该方法可以很好地与预训练模型相结合，适应各种学习模型和算法。与最先进的典型模型相比，我们使用SemEval18和GoeEmotions通过测量Jaccard系数、微观f1和宏观f1，我们的损失函数机制得到了显著改进。这意味着我们的解决方案可以有效地用于情感分析和情感分析任务。

引用次数: 0

首页上一页

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

Icon

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀