首页 > 最新文献

2022 6th Asian Conference on Artificial Intelligence Technology (ACAIT)最新文献

英文 中文
Performance Analysis of Low Temperature Solid Oxide Fuel Cell Based on Artificial Intelligence Technology 基于人工智能技术的低温固体氧化物燃料电池性能分析
Pub Date : 2022-12-09 DOI: 10.1109/ACAIT56212.2022.10137934
Y. Liu
In order to solve the problems of poor output performance and large output oscillation of traditional low-temperature solid oxide fuel cells, artificial intelligence technology was introduced in this paper to analyze the performance of low-temperature solid oxide fuel cells. Firstly, the steady-state control system of the battery was constructed, and the three-dimensional structure design and electrical performance optimization of the battery were realized. Then, the electrode potential induction analysis model was constructed to analyze the carbon / metal oxide electrode materials with stable mechanical and electrochemical properties. Thirdly, combined with the three-phase regulation of the battery electrode, the microstructure area in the fuel cell electric field is controlled. Finally, according to the fuel cell output voltage, the fuel cell ion mass conservation model is constructed. Artificial intelligence is used to obtain the optimal solution of fuel cell voltage output, so as to complete the analysis of fuel cell steady-state performance. The simulation results show that this method can control the output of the low-temperature solid oxide fuel cell well and reduce the output oscillation of the cell, which has a certain theoretical reference significance for the performance of the cell.
为了解决传统低温固体氧化物燃料电池输出性能差、输出振荡大的问题,本文引入人工智能技术对低温固体氧化物燃料电池的性能进行分析。首先,构建了电池稳态控制系统,实现了电池的三维结构设计和电性能优化。然后,建立电极电位感应分析模型,对力学性能和电化学性能稳定的碳/金属氧化物电极材料进行分析。第三,结合电池电极的三相调节,对燃料电池电场中的微结构区域进行控制。最后,根据燃料电池输出电压,建立了燃料电池离子质量守恒模型。利用人工智能获得燃料电池电压输出的最优解,从而完成燃料电池稳态性能的分析。仿真结果表明,该方法可以很好地控制低温固体氧化物燃料电池的输出,减小电池的输出振荡,对电池的性能具有一定的理论参考意义。
{"title":"Performance Analysis of Low Temperature Solid Oxide Fuel Cell Based on Artificial Intelligence Technology","authors":"Y. Liu","doi":"10.1109/ACAIT56212.2022.10137934","DOIUrl":"https://doi.org/10.1109/ACAIT56212.2022.10137934","url":null,"abstract":"In order to solve the problems of poor output performance and large output oscillation of traditional low-temperature solid oxide fuel cells, artificial intelligence technology was introduced in this paper to analyze the performance of low-temperature solid oxide fuel cells. Firstly, the steady-state control system of the battery was constructed, and the three-dimensional structure design and electrical performance optimization of the battery were realized. Then, the electrode potential induction analysis model was constructed to analyze the carbon / metal oxide electrode materials with stable mechanical and electrochemical properties. Thirdly, combined with the three-phase regulation of the battery electrode, the microstructure area in the fuel cell electric field is controlled. Finally, according to the fuel cell output voltage, the fuel cell ion mass conservation model is constructed. Artificial intelligence is used to obtain the optimal solution of fuel cell voltage output, so as to complete the analysis of fuel cell steady-state performance. The simulation results show that this method can control the output of the low-temperature solid oxide fuel cell well and reduce the output oscillation of the cell, which has a certain theoretical reference significance for the performance of the cell.","PeriodicalId":398228,"journal":{"name":"2022 6th Asian Conference on Artificial Intelligence Technology (ACAIT)","volume":"17 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-12-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114969972","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Novel Speech Emotion Model Based on CNN and LSTM Networks 一种基于CNN和LSTM网络的语音情感模型
Pub Date : 2022-12-09 DOI: 10.1109/ACAIT56212.2022.10137926
Benguo Ye, Xiaofeng Yuan, Gang Peng, Weizhen Zeng
LSTM is a sequential model containing the long short-term memory cells gated recurrent units. Compared to the traditional RNN, LSTM introduces three gates which solve the exploding and vanishing gradient problems of RNN. In this paper, we propose a new speech emotion model by combining CNN and LSTM. The model is implemented based on the CASIA data sets, the Python librosa library and the opensmile tool to get the speech emotion features by extracting the multi-feature of the fusion acoustics which would then be compared to the features based on different configurations to evaluate the recognition accuracy. The experimental results show that the features extracted from the emobase2010 configuration can achieve 84% recognition accuracy based on the CASIA dataset. Compared with other models, the recognition accuracy of the model introduced in1 this paper is 3.3% higher than that of the SVM model, but 6.3% lower than that of the ConvLSTM model.
LSTM是一个包含长短期记忆单元的序列模型。与传统RNN相比,LSTM引入了三个门,解决了RNN的梯度爆炸和梯度消失问题。本文将CNN与LSTM相结合,提出了一种新的语音情感模型。该模型基于CASIA数据集、Python librosa库和opensmile工具实现,通过提取融合声学的多特征得到语音情感特征,并与基于不同配置的特征进行比较,评估识别精度。实验结果表明,基于CASIA数据集,emobase2010配置提取的特征可以达到84%的识别准确率。与其他模型相比,本文1引入的模型的识别精度比SVM模型高3.3%,但比ConvLSTM模型低6.3%。
{"title":"A Novel Speech Emotion Model Based on CNN and LSTM Networks","authors":"Benguo Ye, Xiaofeng Yuan, Gang Peng, Weizhen Zeng","doi":"10.1109/ACAIT56212.2022.10137926","DOIUrl":"https://doi.org/10.1109/ACAIT56212.2022.10137926","url":null,"abstract":"LSTM is a sequential model containing the long short-term memory cells gated recurrent units. Compared to the traditional RNN, LSTM introduces three gates which solve the exploding and vanishing gradient problems of RNN. In this paper, we propose a new speech emotion model by combining CNN and LSTM. The model is implemented based on the CASIA data sets, the Python librosa library and the opensmile tool to get the speech emotion features by extracting the multi-feature of the fusion acoustics which would then be compared to the features based on different configurations to evaluate the recognition accuracy. The experimental results show that the features extracted from the emobase2010 configuration can achieve 84% recognition accuracy based on the CASIA dataset. Compared with other models, the recognition accuracy of the model introduced in1 this paper is 3.3% higher than that of the SVM model, but 6.3% lower than that of the ConvLSTM model.","PeriodicalId":398228,"journal":{"name":"2022 6th Asian Conference on Artificial Intelligence Technology (ACAIT)","volume":"2013 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-12-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127435211","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Pool-UNet: Ischemic Stroke Segmentation from CT Perfusion Scans Using Poolformer UNet 池-UNet:利用Poolformer UNet从CT灌注扫描中分割缺血性脑卒中
Pub Date : 2022-12-09 DOI: 10.1109/ACAIT56212.2022.10137834
R. Liu, Wei Pu, Yangyang Zou, Linfeng Jiang, Zhiyong Ye
Ischemic strokes are the most common acute brain disorder, and seriously threaten patients’ lives. In order to help physicians determine the location of ischemic stroke lesions and other information as early as possible, many scholars have used convolutional neural networks and Transformer segmentation networks to segment lesions on CT perfusion images. However, convolutional neural networks are not capable of extracting spatial information sufficiently, which leads to loss of effective lesion information. In addition, the global attention mechanism module of Transformer is computationally intensive at runtime, which is not suitable for use in high-resolution input and intensive prediction tasks. We designed a DSE-ResNet module to solve these problems to establish spatial channel information correlation. Then we innovatively propose the Pool-UNet model, which combines the Poolformer structure with a convolutional neural network. It can efficiently model the global context and learn multi-scale features while maintaining a grasp of the lowlevel details. The segmentation results on the ISLES-2018 dataset show that PoolUNet achieves 67.82% precision, 56.54% recall, 56.04% Dice coefficient, and 21.14 mm Haushofer distance. Compared with the classical UNet, R2UNet, and TransUNet 3 segmentation models, Pool-UNet improved at least 0.26%, 1.52%, 1.07%, and 0. 17mm in accuracy, recall, Dice coefficient, and Hausdorff distance, respectively. Pool-UNet has a competitive advantage over other classical and advanced medical segmentation algorithms.
缺血性脑卒中是最常见的急性脑疾病,严重威胁患者的生命安全。为了帮助医生尽早确定缺血性脑卒中病变位置等信息,许多学者利用卷积神经网络和Transformer分割网络对CT灌注图像上的病变进行分割。然而,卷积神经网络不能充分提取空间信息,导致有效的病变信息丢失。此外,Transformer的全局注意机制模块在运行时计算量大,不适合用于高分辨率输入和密集预测任务。为了解决这些问题,我们设计了一个DSE-ResNet模块来建立空间信道信息的相关性。然后,我们创新地提出了将Poolformer结构与卷积神经网络相结合的Pool-UNet模型。它可以有效地建模全局上下文,学习多尺度特征,同时保持对低层细节的掌握。在ISLES-2018数据集上的分割结果表明,PoolUNet的分割精度为67.82%,召回率为56.54%,Dice系数为56.04%,Haushofer距离为21.14 mm。与经典的UNet、R2UNet和TransUNet 3分割模型相比,Pool-UNet分别提高了0.26%、1.52%、1.07%和0。准确率,召回率,骰子系数和豪斯多夫距离分别为17mm。与其他经典和先进的医学分割算法相比,Pool-UNet具有竞争优势。
{"title":"Pool-UNet: Ischemic Stroke Segmentation from CT Perfusion Scans Using Poolformer UNet","authors":"R. Liu, Wei Pu, Yangyang Zou, Linfeng Jiang, Zhiyong Ye","doi":"10.1109/ACAIT56212.2022.10137834","DOIUrl":"https://doi.org/10.1109/ACAIT56212.2022.10137834","url":null,"abstract":"Ischemic strokes are the most common acute brain disorder, and seriously threaten patients’ lives. In order to help physicians determine the location of ischemic stroke lesions and other information as early as possible, many scholars have used convolutional neural networks and Transformer segmentation networks to segment lesions on CT perfusion images. However, convolutional neural networks are not capable of extracting spatial information sufficiently, which leads to loss of effective lesion information. In addition, the global attention mechanism module of Transformer is computationally intensive at runtime, which is not suitable for use in high-resolution input and intensive prediction tasks. We designed a DSE-ResNet module to solve these problems to establish spatial channel information correlation. Then we innovatively propose the Pool-UNet model, which combines the Poolformer structure with a convolutional neural network. It can efficiently model the global context and learn multi-scale features while maintaining a grasp of the lowlevel details. The segmentation results on the ISLES-2018 dataset show that PoolUNet achieves 67.82% precision, 56.54% recall, 56.04% Dice coefficient, and 21.14 mm Haushofer distance. Compared with the classical UNet, R2UNet, and TransUNet 3 segmentation models, Pool-UNet improved at least 0.26%, 1.52%, 1.07%, and 0. 17mm in accuracy, recall, Dice coefficient, and Hausdorff distance, respectively. Pool-UNet has a competitive advantage over other classical and advanced medical segmentation algorithms.","PeriodicalId":398228,"journal":{"name":"2022 6th Asian Conference on Artificial Intelligence Technology (ACAIT)","volume":"37 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-12-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124935013","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
IAFormer: A Transformer Network for Image Aesthetic Evaluation and Cropping IAFormer:一个用于图像审美评价和裁剪的变压器网络
Pub Date : 2022-12-09 DOI: 10.1109/ACAIT56212.2022.10137804
Lei Wang, Yue Jin
Aesthetic quality evaluation of images has an important role in the field of visual analysis, and the widespread use of high-quality image editing has gradually increased the importance of image aesthetic evaluation in automatic image processing tasks. Previous researchers have mostly explored the mapping relationship between images and labeled scores using convolutional neural networks, but the aesthetic features of different regions on images have not been explored sufficiently, when an image is rich in background information and it is necessary to correlate the aesthetic features of different regions to evaluate the image, convolutional neural networks often cannot extract the aesthetic features of the image adequately due to the lack of the advantage of global feature modeling. We introduce a novel Transformer architecture for image aesthetic quality assessment(IAFormer), IAFormer can model the global aesthetic features of an image, and it is a framework that unifies the aesthetic quality assessment of images and the aesthetic cropping of images, while the aesthetic quality of the image is evaluated, the aesthetic weights on different patches within the image can be calculated to give valid reference information for the aesthetic cropping task.
图像的审美质量评价在视觉分析领域有着重要的作用,而高质量图像编辑的广泛使用也逐渐增加了图像审美评价在自动图像处理任务中的重要性。以往的研究大多是利用卷积神经网络来探索图像与标记分数之间的映射关系,但当图像背景信息丰富,需要将不同区域的审美特征联系起来评价图像时,对图像上不同区域的审美特征的探索还不够充分。由于缺乏全局特征建模的优势,卷积神经网络往往不能充分提取图像的美学特征。本文介绍了一种新的用于图像审美质量评价的Transformer架构(IAFormer), IAFormer可以对图像的全局审美特征进行建模,是一个将图像的审美质量评价与图像的审美裁剪统一起来的框架,在对图像的审美质量进行评价的同时,可以计算出图像内不同斑块的审美权重,为图像的审美裁剪任务提供有效的参考信息。
{"title":"IAFormer: A Transformer Network for Image Aesthetic Evaluation and Cropping","authors":"Lei Wang, Yue Jin","doi":"10.1109/ACAIT56212.2022.10137804","DOIUrl":"https://doi.org/10.1109/ACAIT56212.2022.10137804","url":null,"abstract":"Aesthetic quality evaluation of images has an important role in the field of visual analysis, and the widespread use of high-quality image editing has gradually increased the importance of image aesthetic evaluation in automatic image processing tasks. Previous researchers have mostly explored the mapping relationship between images and labeled scores using convolutional neural networks, but the aesthetic features of different regions on images have not been explored sufficiently, when an image is rich in background information and it is necessary to correlate the aesthetic features of different regions to evaluate the image, convolutional neural networks often cannot extract the aesthetic features of the image adequately due to the lack of the advantage of global feature modeling. We introduce a novel Transformer architecture for image aesthetic quality assessment(IAFormer), IAFormer can model the global aesthetic features of an image, and it is a framework that unifies the aesthetic quality assessment of images and the aesthetic cropping of images, while the aesthetic quality of the image is evaluated, the aesthetic weights on different patches within the image can be calculated to give valid reference information for the aesthetic cropping task.","PeriodicalId":398228,"journal":{"name":"2022 6th Asian Conference on Artificial Intelligence Technology (ACAIT)","volume":"9 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-12-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125276357","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Context Encoder Network with Channel-Wise Attention Mechanism for Nerve Fibers Detection in Corneal Confocal Microscopy Images 角膜共聚焦显微镜图像中神经纤维检测的基于通道注意机制的上下文编码器网络
Pub Date : 2022-12-09 DOI: 10.1109/ACAIT56212.2022.10137928
Wenyuan Li, Zheng Tang, Lulu Zhao, Wanyong Tian, Taotao Qi
Diabetic peripheral neuropathy (DPN), one of the common long-term complications of diabetes, may affect the physical condition and quality of life of patients. Corneal confocal microscopy (CCM) is a rapid non-invasive ophthalmic imaging technique that can be used to observe the form of nerve fibers in sub-basal corneal nerve plexus directly. Analysis of the nerve fibers in CCM images quantifies features of nerve fibers, can apply to clinical diagnosis of DPN. This paper presents an attention deep learning model for detecting nerve fibers from CCM images, which combine context encoder network and squeeze-and-excitation networks. The algorithm with attention mechanism can solve the problem of the segmentation result is easily influenced by high level noise in CCM images and imbalance of nerve fiber pixels and background pixels to a certain degree. The proposed algorithm shows the best performance among common image segmentation deep learning model.
糖尿病周围神经病变(DPN)是糖尿病常见的长期并发症之一,影响患者的身体状况和生活质量。角膜共聚焦显微镜(CCM)是一种快速无创的眼科成像技术,可以直接观察角膜基底下神经丛神经纤维的形态。CCM图像中神经纤维的分析量化了神经纤维的特征,可用于DPN的临床诊断。本文提出了一种结合上下文编码器网络和挤压激励网络的CCM图像神经纤维检测注意深度学习模型。该算法具有注意机制,可以解决CCM图像中容易受到高噪声影响,以及神经纤维像素与背景像素在一定程度上不平衡的问题。该算法在常用的图像分割深度学习模型中表现出最好的性能。
{"title":"Context Encoder Network with Channel-Wise Attention Mechanism for Nerve Fibers Detection in Corneal Confocal Microscopy Images","authors":"Wenyuan Li, Zheng Tang, Lulu Zhao, Wanyong Tian, Taotao Qi","doi":"10.1109/ACAIT56212.2022.10137928","DOIUrl":"https://doi.org/10.1109/ACAIT56212.2022.10137928","url":null,"abstract":"Diabetic peripheral neuropathy (DPN), one of the common long-term complications of diabetes, may affect the physical condition and quality of life of patients. Corneal confocal microscopy (CCM) is a rapid non-invasive ophthalmic imaging technique that can be used to observe the form of nerve fibers in sub-basal corneal nerve plexus directly. Analysis of the nerve fibers in CCM images quantifies features of nerve fibers, can apply to clinical diagnosis of DPN. This paper presents an attention deep learning model for detecting nerve fibers from CCM images, which combine context encoder network and squeeze-and-excitation networks. The algorithm with attention mechanism can solve the problem of the segmentation result is easily influenced by high level noise in CCM images and imbalance of nerve fiber pixels and background pixels to a certain degree. The proposed algorithm shows the best performance among common image segmentation deep learning model.","PeriodicalId":398228,"journal":{"name":"2022 6th Asian Conference on Artificial Intelligence Technology (ACAIT)","volume":"22 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-12-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127018361","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Speech Augmentation Using Conditional Generative Adversarial Nets in Mongolian Speech Recognition 基于条件生成对抗网络的语音增强蒙古语语音识别
Pub Date : 2022-12-09 DOI: 10.1109/ACAIT56212.2022.10137828
Zhiqiang Ma, Jinyi Li, Junpeng Zhang
Aiming at the problem of uneven regional distribution of speech caused by the lack of labeled data in the Mongolian speech data set, this paper proposes a Mongolian speech data augmentation model based on a conditional generation confrontation network. The model uses conditional speech generators and multiple fusion discriminators for adversarial learning, and uses Mongolian text and specified regional features to generate Mongolian speech with specified regional features. The original data set was augmented by using the methods of speech rate perturbation and spectrogram enhancement, and compared with the end-to-end Mongolian speech recognition model trained on different augment data sets and the original data sets, it was found that the word error rate in the end-to-end Mongolian speech recognition model trained on the augment data set of the specified regional characteristics is 3.1%; Compared with the end-to-end Mongolian speech recognition model trained on the original data set, the speech rate disturbance data set, and the spectrogram enhancement data set, the word error rate dropped by 2%, 0.5%, and 0.8%.
针对蒙古语语音数据集中缺乏标注数据导致语音区域分布不均匀的问题,提出了一种基于条件生成对抗网络的蒙古语语音数据增强模型。该模型使用条件语音生成器和多个融合判别器进行对抗学习,使用蒙古语文本和指定区域特征生成具有指定区域特征的蒙古语语音。采用语音率扰动和谱图增强的方法对原始数据集进行增强,并将基于不同增强数据集训练的端到端蒙古语语音识别模型与原始数据集进行对比,发现基于特定区域特征增强数据集训练的端到端蒙古语语音识别模型的词错误率为3.1%;与在原始数据集、语音率干扰数据集和谱图增强数据集上训练的端到端蒙古语语音识别模型相比,单词错误率分别下降了2%、0.5%和0.8%。
{"title":"Speech Augmentation Using Conditional Generative Adversarial Nets in Mongolian Speech Recognition","authors":"Zhiqiang Ma, Jinyi Li, Junpeng Zhang","doi":"10.1109/ACAIT56212.2022.10137828","DOIUrl":"https://doi.org/10.1109/ACAIT56212.2022.10137828","url":null,"abstract":"Aiming at the problem of uneven regional distribution of speech caused by the lack of labeled data in the Mongolian speech data set, this paper proposes a Mongolian speech data augmentation model based on a conditional generation confrontation network. The model uses conditional speech generators and multiple fusion discriminators for adversarial learning, and uses Mongolian text and specified regional features to generate Mongolian speech with specified regional features. The original data set was augmented by using the methods of speech rate perturbation and spectrogram enhancement, and compared with the end-to-end Mongolian speech recognition model trained on different augment data sets and the original data sets, it was found that the word error rate in the end-to-end Mongolian speech recognition model trained on the augment data set of the specified regional characteristics is 3.1%; Compared with the end-to-end Mongolian speech recognition model trained on the original data set, the speech rate disturbance data set, and the spectrogram enhancement data set, the word error rate dropped by 2%, 0.5%, and 0.8%.","PeriodicalId":398228,"journal":{"name":"2022 6th Asian Conference on Artificial Intelligence Technology (ACAIT)","volume":"77 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-12-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115541039","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Color Offset Compensation Method of Art Image Based on Augmented Reality Technology 基于增强现实技术的艺术图像色彩偏移补偿方法
Pub Date : 2022-12-09 DOI: 10.1109/ACAIT56212.2022.10137889
Yiwei Zhang
In the production of multi-scale block-fused art images, the image quality is poor due to the sudden change and occlusion of color shift. This paper puts forward a compensation method for color shift of multi-scale block-fused art images based on augmented reality technology. Based on the attenuation of optical parameters and the control of color balance, a multi-dimensional color fusion model of background light correlation of multi-scale block fused art images is established, and the augmented reality model of multi-scale block fused art images is constructed by combining the analysis method of surface features and illumination features distribution model of art images. Through the parameter analysis of each level feature map model of the similarity degree of the previous frame under color deviation, The gray texture and color texture features of multi-scale block-fused art images with complementary advantages and disadvantages are extracted, and augmented reality technology is adopted to realize gray scale enhancement and color enhancement in the process of color compensation of art images. Combined parameter identification method is adopted to realize color adjustment and feedback compensation control of output stability of art images. According to the characteristics of high-order moment output stability of color features, color offset compensation and optimal imaging processing of multi-scale block-fused art images are realized by calculating and counting boundary corner information and texture parameter analysis. The test shows that this method performs the color offset processing of multi-scale block fusion art image sensor, improves the color offset compensation ability of art images and the true color imaging quality of images, and increases the peak signal-to-noise ratio of output images.
在制作多尺度块融合艺术图像时,由于色移的突然变化和遮挡,导致图像质量较差。提出了一种基于增强现实技术的多尺度块融合艺术图像色移补偿方法。基于光学参数衰减和色彩平衡控制,建立了多尺度块融合艺术图像背景光相关的多维色彩融合模型,并结合艺术图像表面特征分析方法和光照特征分布模型构建了多尺度块融合艺术图像增强现实模型。通过对颜色偏差下前一帧相似度的各级特征映射模型进行参数分析,提取出优势互补、劣势互补的多尺度块融合艺术图像的灰度纹理和颜色纹理特征,并采用增强现实技术实现艺术图像色彩补偿过程中的灰度增强和色彩增强。采用组合参数辨识方法实现艺术图像输出稳定性的色彩调节和反馈补偿控制。根据彩色特征高阶矩输出稳定性的特点,通过计算和计数边角信息和纹理参数分析,实现了多尺度块融合艺术图像的彩色偏移补偿和优化成像处理。测试表明,该方法完成了多尺度块融合艺术图像传感器的色彩偏移处理,提高了艺术图像的色彩偏移补偿能力和图像的真彩色成像质量,提高了输出图像的峰值信噪比。
{"title":"Color Offset Compensation Method of Art Image Based on Augmented Reality Technology","authors":"Yiwei Zhang","doi":"10.1109/ACAIT56212.2022.10137889","DOIUrl":"https://doi.org/10.1109/ACAIT56212.2022.10137889","url":null,"abstract":"In the production of multi-scale block-fused art images, the image quality is poor due to the sudden change and occlusion of color shift. This paper puts forward a compensation method for color shift of multi-scale block-fused art images based on augmented reality technology. Based on the attenuation of optical parameters and the control of color balance, a multi-dimensional color fusion model of background light correlation of multi-scale block fused art images is established, and the augmented reality model of multi-scale block fused art images is constructed by combining the analysis method of surface features and illumination features distribution model of art images. Through the parameter analysis of each level feature map model of the similarity degree of the previous frame under color deviation, The gray texture and color texture features of multi-scale block-fused art images with complementary advantages and disadvantages are extracted, and augmented reality technology is adopted to realize gray scale enhancement and color enhancement in the process of color compensation of art images. Combined parameter identification method is adopted to realize color adjustment and feedback compensation control of output stability of art images. According to the characteristics of high-order moment output stability of color features, color offset compensation and optimal imaging processing of multi-scale block-fused art images are realized by calculating and counting boundary corner information and texture parameter analysis. The test shows that this method performs the color offset processing of multi-scale block fusion art image sensor, improves the color offset compensation ability of art images and the true color imaging quality of images, and increases the peak signal-to-noise ratio of output images.","PeriodicalId":398228,"journal":{"name":"2022 6th Asian Conference on Artificial Intelligence Technology (ACAIT)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-12-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129418306","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Model of Spoken Language Understanding Combining with Multi-Head Self-Attention 结合多头自我关注的口语理解模型
Pub Date : 2022-12-09 DOI: 10.1109/ACAIT56212.2022.10137905
Dafei Lin, Jiangfeng Zhou, Xinlai Xing, Xiaochuan Zhang
Spoken Language Understanding (SLU) is a very important module in intelligent dialogue systems. It is usually constructed based on a bi-directional long and short-term memory network (BiLSTM). It has some shortcomings, such as relative single representation of feature space and fuzzy semantic features. For this reason, this study constructs a SLU model which combines the temporal characteristics of context and the characteristics of multi-layer representation space. The model combines a bi-directional long and short-term memory network and a multi-head self-attention to extract different feature information of contextual temporal features and multisemantic representation space of the text, respectively; then, the two features are fused using a residual linking method to enhance the features of word dependence at different locations within the text; meanwhile, the gate mechanism is then used to enable the intent detection task to establish an influence relationship on the slot filling task. Finally, the SNIPS dataset, the ATIS dataset, and the slot-gated model are selected for comparison experiments. The slot filling F1 value is increased by 4.14% and 1.1% respectively, and the accuracy of semantic framework is increased by 4.25% and 2.50% respectively. The results show the effectiveness of the model of SLU task.
口语理解(SLU)是智能对话系统中一个非常重要的模块。它通常基于双向长短期记忆网络(BiLSTM)构建。它存在一些缺陷,如特征空间表示相对单一、语义特征模糊等。因此,本研究结合上下文的时间特性和多层表示空间的特点,构建了一个 SLU 模型。该模型结合双向长短期记忆网络和多头自注意力,分别提取文本上下文时态特征和多层语义表征空间的不同特征信息;然后,利用残差链接方法将两种特征融合,增强文本中不同位置的词依赖特征;同时,再利用门机制使意图检测任务与槽填充任务建立影响关系。最后,选取 SNIPS 数据集、ATIS 数据集和插槽门控模型进行对比实验。结果表明,槽填充 F1 值分别提高了 4.14% 和 1.1%,语义框架准确率分别提高了 4.25% 和 2.50%。这些结果表明了该模型在 SLU 任务中的有效性。
{"title":"A Model of Spoken Language Understanding Combining with Multi-Head Self-Attention","authors":"Dafei Lin, Jiangfeng Zhou, Xinlai Xing, Xiaochuan Zhang","doi":"10.1109/ACAIT56212.2022.10137905","DOIUrl":"https://doi.org/10.1109/ACAIT56212.2022.10137905","url":null,"abstract":"Spoken Language Understanding (SLU) is a very important module in intelligent dialogue systems. It is usually constructed based on a bi-directional long and short-term memory network (BiLSTM). It has some shortcomings, such as relative single representation of feature space and fuzzy semantic features. For this reason, this study constructs a SLU model which combines the temporal characteristics of context and the characteristics of multi-layer representation space. The model combines a bi-directional long and short-term memory network and a multi-head self-attention to extract different feature information of contextual temporal features and multisemantic representation space of the text, respectively; then, the two features are fused using a residual linking method to enhance the features of word dependence at different locations within the text; meanwhile, the gate mechanism is then used to enable the intent detection task to establish an influence relationship on the slot filling task. Finally, the SNIPS dataset, the ATIS dataset, and the slot-gated model are selected for comparison experiments. The slot filling F1 value is increased by 4.14% and 1.1% respectively, and the accuracy of semantic framework is increased by 4.25% and 2.50% respectively. The results show the effectiveness of the model of SLU task.","PeriodicalId":398228,"journal":{"name":"2022 6th Asian Conference on Artificial Intelligence Technology (ACAIT)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-12-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130389127","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Vehicle Re-Identification Method Based on Fine-Grained Features and Metric Learning 基于细粒度特征和度量学习的车辆再识别方法
Pub Date : 2022-12-09 DOI: 10.1109/ACAIT56212.2022.10137947
He Yan, Yao Li, Kuilin Huang, Xiaotang Wang
To solve the problem that the global features extracted by the ResNet-50 network has insufficient recognition capability in similar vehicle re-identification task, a new Re-ID method combining metric learning is proposed. Firstly, the fine-grained features of vehicles are extracted by using triplet constraints, and then combined with the global features extracted by the backbone network as vehicle features. Secondly, the similarity of different vehicle features is judged and ranked by Euclidean distance, so as to obtain more accurate results. Finally, a comparative experiment is conducted on the VeRi-776 dataset for different network models. The results show that our method has high recognition accuracy in Re-ID tasks. Compared with ResNet-50, the mean average accuracy (mAP) is improved by 2.30 %, rank-l increased by 2.31 %, and the rank-5 increased by 2.05 %. It is verified that this model can effectively improve the recognition accuracy in vehicle Re-ID.
针对ResNet-50网络提取的全局特征在类似车辆再识别任务中识别能力不足的问题,提出了一种结合度量学习的Re-ID方法。首先利用三元组约束提取车辆的细粒度特征,然后与骨干网络提取的全局特征相结合作为车辆特征;其次,通过欧几里得距离对不同车辆特征的相似度进行判断和排序,从而获得更准确的结果。最后,在不同网络模型的VeRi-776数据集上进行了对比实验。结果表明,该方法在Re-ID任务中具有较高的识别准确率。与ResNet-50相比,mAP的平均准确率提高了2.30%,rank- 1提高了2.31%,rank-5提高了2.05%。验证了该模型能有效提高车辆Re-ID的识别精度。
{"title":"A Vehicle Re-Identification Method Based on Fine-Grained Features and Metric Learning","authors":"He Yan, Yao Li, Kuilin Huang, Xiaotang Wang","doi":"10.1109/ACAIT56212.2022.10137947","DOIUrl":"https://doi.org/10.1109/ACAIT56212.2022.10137947","url":null,"abstract":"To solve the problem that the global features extracted by the ResNet-50 network has insufficient recognition capability in similar vehicle re-identification task, a new Re-ID method combining metric learning is proposed. Firstly, the fine-grained features of vehicles are extracted by using triplet constraints, and then combined with the global features extracted by the backbone network as vehicle features. Secondly, the similarity of different vehicle features is judged and ranked by Euclidean distance, so as to obtain more accurate results. Finally, a comparative experiment is conducted on the VeRi-776 dataset for different network models. The results show that our method has high recognition accuracy in Re-ID tasks. Compared with ResNet-50, the mean average accuracy (mAP) is improved by 2.30 %, rank-l increased by 2.31 %, and the rank-5 increased by 2.05 %. It is verified that this model can effectively improve the recognition accuracy in vehicle Re-ID.","PeriodicalId":398228,"journal":{"name":"2022 6th Asian Conference on Artificial Intelligence Technology (ACAIT)","volume":"32 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-12-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132493624","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
RF Signal Source Target Detection Method Based on Monte Carlo Algorithm in Distributed Electromagnetic Spectrum Monitoring System 分布式电磁频谱监测系统中基于蒙特卡罗算法的射频信号源目标检测方法
Pub Date : 2022-12-09 DOI: 10.1109/ACAIT56212.2022.10137812
Zhenjia Chen, Lihui Wang
In order to improve the blind localization accuracy of radio signal sources, this paper proposes a radio frequency (RF) signal source target localization method based on Monte Carlo algorithm. Combined with the free-space propagation loss characteristics of electromagnetic waves, real-time electromagnetic spectrum detection data based on distributed detection nodes are analyzed. Based on the spatial distribution of electromagnetic spectrum, the target location method of radio frequency signal source is studied. Distributed detection nodes detect electromagnetic spectrum data of the same radio frequency signal source target at different spatial locations. The Monte Carlo algorithm is used for the cooperative detection target localization method of the RF signal source. With the increase of random times, the estimation result with the smallest cumulative distance error is used as the estimation result of the target position of the radio frequency signal source. The measured data show that the method can improve the blind detection and positioning accuracy of radio signal sources in the target area.
为了提高射频信号源的盲定位精度,提出了一种基于蒙特卡罗算法的射频信号源目标定位方法。结合电磁波的自由空间传播损耗特性,对基于分布式检测节点的实时电磁频谱检测数据进行了分析。基于电磁波谱的空间分布,研究了射频信号源的目标定位方法。分布式检测节点对同一射频信号源目标在不同空间位置的电磁频谱数据进行检测。将蒙特卡罗算法用于射频信号源的协同检测目标定位方法。随着随机次数的增加,使用累积距离误差最小的估计结果作为射频信号源目标位置的估计结果。实测数据表明,该方法可以提高目标区域内无线电信号源的盲检测和定位精度。
{"title":"RF Signal Source Target Detection Method Based on Monte Carlo Algorithm in Distributed Electromagnetic Spectrum Monitoring System","authors":"Zhenjia Chen, Lihui Wang","doi":"10.1109/ACAIT56212.2022.10137812","DOIUrl":"https://doi.org/10.1109/ACAIT56212.2022.10137812","url":null,"abstract":"In order to improve the blind localization accuracy of radio signal sources, this paper proposes a radio frequency (RF) signal source target localization method based on Monte Carlo algorithm. Combined with the free-space propagation loss characteristics of electromagnetic waves, real-time electromagnetic spectrum detection data based on distributed detection nodes are analyzed. Based on the spatial distribution of electromagnetic spectrum, the target location method of radio frequency signal source is studied. Distributed detection nodes detect electromagnetic spectrum data of the same radio frequency signal source target at different spatial locations. The Monte Carlo algorithm is used for the cooperative detection target localization method of the RF signal source. With the increase of random times, the estimation result with the smallest cumulative distance error is used as the estimation result of the target position of the radio frequency signal source. The measured data show that the method can improve the blind detection and positioning accuracy of radio signal sources in the target area.","PeriodicalId":398228,"journal":{"name":"2022 6th Asian Conference on Artificial Intelligence Technology (ACAIT)","volume":"36 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-12-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131081437","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
2022 6th Asian Conference on Artificial Intelligence Technology (ACAIT)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1