首页 > 最新文献

2022 7th International Conference on Communication, Image and Signal Processing (CCISP)最新文献

英文 中文
Optimal Stopping Theory-Enabled VVC Intra Prediction with Texture 最优停止理论支持的纹理VVC内预测
Pub Date : 2022-11-01 DOI: 10.1109/CCISP55629.2022.9974416
Yucheng Li, Xiantao Jiang, Wei Li, Jiayuan Jin, Dezhi Han, Tian Song, Fei Yu
Versatile Video Coding (VVC) introduces the new quad-tree with a nested multi-type tree (QTMT) block division structure, which increases the flexibility of block division, the more complex block division structure increases the coding complexity of VVC by nearly 26 times compared with High-Efficiency Video Coding (HEVC). Therefore, it is urgent to reduce the coding complexity of VVC. In this paper, we propose a fast CU division method based on optimal stopping theory and block texture decision. Firstly, by analyzing the division depth of the Coding Tree Unit (CTU) at the same position as neighboring frames, we use the optimal stopping theory to determine the optimal division layer of the current CTU, to terminate the division process in advance. Then, by judging the texture direction of the current Coding Unit (CU), the calculation of several CU division methods is selected to be skipped, thus reducing the computational effort of coding. The experimental results show that the coding time of this scheme is reduced by 45.65% on average, while the BDBR only increases by 1.64%.
通用视频编码(VVC)引入了一种新的四叉树嵌套多类型树(QTMT)块划分结构,增加了块划分的灵活性,更复杂的块划分结构使VVC的编码复杂度比高效视频编码(HEVC)提高了近26倍。因此,降低VVC的编码复杂度是当务之急。本文提出了一种基于最优停止理论和块纹理判定的快速CU分割方法。首先,通过分析相邻帧在同一位置的编码树单元(CTU)的分割深度,利用最优停止理论确定当前CTU的最优分割层,提前终止分割过程;然后,通过判断当前编码单元(Coding Unit, CU)的纹理方向,选择几种CU划分方法的计算跳过,从而减少编码的计算量。实验结果表明,该方案的编码时间平均缩短了45.65%,而BDBR仅增加了1.64%。
{"title":"Optimal Stopping Theory-Enabled VVC Intra Prediction with Texture","authors":"Yucheng Li, Xiantao Jiang, Wei Li, Jiayuan Jin, Dezhi Han, Tian Song, Fei Yu","doi":"10.1109/CCISP55629.2022.9974416","DOIUrl":"https://doi.org/10.1109/CCISP55629.2022.9974416","url":null,"abstract":"Versatile Video Coding (VVC) introduces the new quad-tree with a nested multi-type tree (QTMT) block division structure, which increases the flexibility of block division, the more complex block division structure increases the coding complexity of VVC by nearly 26 times compared with High-Efficiency Video Coding (HEVC). Therefore, it is urgent to reduce the coding complexity of VVC. In this paper, we propose a fast CU division method based on optimal stopping theory and block texture decision. Firstly, by analyzing the division depth of the Coding Tree Unit (CTU) at the same position as neighboring frames, we use the optimal stopping theory to determine the optimal division layer of the current CTU, to terminate the division process in advance. Then, by judging the texture direction of the current Coding Unit (CU), the calculation of several CU division methods is selected to be skipped, thus reducing the computational effort of coding. The experimental results show that the coding time of this scheme is reduced by 45.65% on average, while the BDBR only increases by 1.64%.","PeriodicalId":431851,"journal":{"name":"2022 7th International Conference on Communication, Image and Signal Processing (CCISP)","volume":"3 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124406639","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Construction Technology of Software Defined Modular Swarm Network Information Architecture 软件定义模块化群网络信息体系结构构建技术
Pub Date : 2022-11-01 DOI: 10.1109/CCISP55629.2022.9974179
Yuyong Cui, Dawei Liu, Shengzhe Wang, Xinvi Gao, Mutian Guo
The swarm information network requires outstanding features such as flexible networking, fast networking, security, and controllability. According to the information interaction requirements of the swarm information network, a Software Defined Network (SDN) modularized swarm network information architecture is proposed, the business requirements and characteristics of the swarm network are analyzed, and the swarm network information architecture mode is designed to improve the openness and reliability of the swarm network information system, which provides support for swarms to carry out tasks autonomously in complex scenes.
群信息网络需要组网灵活、组网快速、安全、可控性等突出特点。根据蜂群信息网络的信息交互需求,提出了一种软件定义网络(Software Defined network, SDN)模块化的蜂群网络信息架构,分析了蜂群网络的业务需求和特点,设计了蜂群网络信息架构模式,提高了蜂群网络信息系统的开放性和可靠性,为蜂群在复杂场景下自主执行任务提供了支持。
{"title":"Construction Technology of Software Defined Modular Swarm Network Information Architecture","authors":"Yuyong Cui, Dawei Liu, Shengzhe Wang, Xinvi Gao, Mutian Guo","doi":"10.1109/CCISP55629.2022.9974179","DOIUrl":"https://doi.org/10.1109/CCISP55629.2022.9974179","url":null,"abstract":"The swarm information network requires outstanding features such as flexible networking, fast networking, security, and controllability. According to the information interaction requirements of the swarm information network, a Software Defined Network (SDN) modularized swarm network information architecture is proposed, the business requirements and characteristics of the swarm network are analyzed, and the swarm network information architecture mode is designed to improve the openness and reliability of the swarm network information system, which provides support for swarms to carry out tasks autonomously in complex scenes.","PeriodicalId":431851,"journal":{"name":"2022 7th International Conference on Communication, Image and Signal Processing (CCISP)","volume":"11 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116939563","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Arbitrary Direction Inkjet Character Recognition Based on Spatial Transformation 基于空间变换的任意方向喷墨字符识别
Pub Date : 2022-11-01 DOI: 10.1109/CCISP55629.2022.9974507
Wentao Cai, Hao Zhao, Heng Wang, Xue Deng
Aiming at the problem of the low recognition accuracy caused by the arbitrary characters. In this paper, we propose an arbitrary direction character recognition network. Firstly, a lightweight spatial transformation network (STNet) is designed based on the MobileNetV2, which is used to extract the spatial features of the arbitrary characters and perform spatial transformation. Simultaneously, we introduced the SE attention block into the feature extraction backbone network, which makes the network focuses on the key regions of characters. Then, we build a text recognizer based on recurrent neural network and introduce the Connectionist Temporal Classification (CTC) loss to achieve the flexible alignment between the visual features and the prediction outputs. Extensive experiments are carried out on the IIIT5K and a self-made inkjet characters dataset. The recognition accuracy of our proposed method reaches 95.7% and 86.3% respectively. Compared with the benchmarks, the maximum accuracy of the proposed method is improved by 17.5%. Experimental results show the effectiveness of our proposed method.
针对任意字符导致识别精度低的问题。本文提出了一种任意方向字符识别网络。首先,基于MobileNetV2设计了一个轻量级空间变换网络(STNet),用于提取任意字符的空间特征并进行空间变换;同时,我们将SE注意块引入特征提取骨干网络,使网络集中在字符的关键区域。然后,我们建立了一个基于递归神经网络的文本识别器,并引入了连接时间分类(CTC)损失来实现视觉特征与预测输出之间的灵活对齐。在IIIT5K和自制的喷墨字符数据集上进行了大量的实验。该方法的识别准确率分别达到95.7%和86.3%。与基准算法相比,该方法的最大准确率提高了17.5%。实验结果表明了该方法的有效性。
{"title":"Arbitrary Direction Inkjet Character Recognition Based on Spatial Transformation","authors":"Wentao Cai, Hao Zhao, Heng Wang, Xue Deng","doi":"10.1109/CCISP55629.2022.9974507","DOIUrl":"https://doi.org/10.1109/CCISP55629.2022.9974507","url":null,"abstract":"Aiming at the problem of the low recognition accuracy caused by the arbitrary characters. In this paper, we propose an arbitrary direction character recognition network. Firstly, a lightweight spatial transformation network (STNet) is designed based on the MobileNetV2, which is used to extract the spatial features of the arbitrary characters and perform spatial transformation. Simultaneously, we introduced the SE attention block into the feature extraction backbone network, which makes the network focuses on the key regions of characters. Then, we build a text recognizer based on recurrent neural network and introduce the Connectionist Temporal Classification (CTC) loss to achieve the flexible alignment between the visual features and the prediction outputs. Extensive experiments are carried out on the IIIT5K and a self-made inkjet characters dataset. The recognition accuracy of our proposed method reaches 95.7% and 86.3% respectively. Compared with the benchmarks, the maximum accuracy of the proposed method is improved by 17.5%. Experimental results show the effectiveness of our proposed method.","PeriodicalId":431851,"journal":{"name":"2022 7th International Conference on Communication, Image and Signal Processing (CCISP)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122906302","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Channel estimation of 5G OFDM system based on ConvLSTM network 基于ConvLSTM网络的5G OFDM系统信道估计
Pub Date : 2022-11-01 DOI: 10.1109/CCISP55629.2022.9974588
Yiyuan Wang, Jun Chang, Zhongkui Lu, Fuhui Yu, Jiaqi Wei, Yan Xu
In view of the requirement of high speed and low delay in 5G system, traditional channel estimation algorithms are difficult to meet the requirements. This paper regards the channel estimation problem in communication systems as an image processing problem in deep learning, and proposes a channel estimation network based on ConvLSTM network. Convolutional neural network is used in channel estimation, and LSTM structure is introduced to capture the correlation of the channel. The parameters are set to generate the channel data information set of the physical downlink shared channel (PDSCH) based on the 5G new radio (NR) standard, which is used to evaluate the performance of the proposed and existing algorithms. Experimental simulations show that the proposed algorithm has obvious performance improvement and strong robustness compared with least squares algorithm, practical channel estimation and T-CNN network based on image processing.
鉴于5G系统对高速、低时延的要求,传统的信道估计算法难以满足要求。本文将通信系统中的信道估计问题视为深度学习中的图像处理问题,提出了一种基于ConvLSTM网络的信道估计网络。采用卷积神经网络进行信道估计,并引入LSTM结构捕捉信道的相关性。设置这些参数是为了生成基于5G新无线电(NR)标准的物理下行共享信道(PDSCH)的信道数据信息集,用于评估所提算法和现有算法的性能。实验仿真表明,与最小二乘算法、实用信道估计和基于图像处理的T-CNN网络相比,该算法具有明显的性能提升和较强的鲁棒性。
{"title":"Channel estimation of 5G OFDM system based on ConvLSTM network","authors":"Yiyuan Wang, Jun Chang, Zhongkui Lu, Fuhui Yu, Jiaqi Wei, Yan Xu","doi":"10.1109/CCISP55629.2022.9974588","DOIUrl":"https://doi.org/10.1109/CCISP55629.2022.9974588","url":null,"abstract":"In view of the requirement of high speed and low delay in 5G system, traditional channel estimation algorithms are difficult to meet the requirements. This paper regards the channel estimation problem in communication systems as an image processing problem in deep learning, and proposes a channel estimation network based on ConvLSTM network. Convolutional neural network is used in channel estimation, and LSTM structure is introduced to capture the correlation of the channel. The parameters are set to generate the channel data information set of the physical downlink shared channel (PDSCH) based on the 5G new radio (NR) standard, which is used to evaluate the performance of the proposed and existing algorithms. Experimental simulations show that the proposed algorithm has obvious performance improvement and strong robustness compared with least squares algorithm, practical channel estimation and T-CNN network based on image processing.","PeriodicalId":431851,"journal":{"name":"2022 7th International Conference on Communication, Image and Signal Processing (CCISP)","volume":"39 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123033187","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Lossless Compression for Hyperspectral Images Using Back Pixel Search with Adaptive Threshold 基于自适应阈值的后像素搜索高光谱图像无损压缩
Pub Date : 2022-11-01 DOI: 10.1109/CCISP55629.2022.9974344
Fuquan Zhu, Liping Yang
The special statistical property produced by radiative correction has a serious impact on prediction accuracy. Back pixel search (BPS) algorithm is currently the most effective way to solve this problem. However, the effectiveness of BPS algorithm depends on optimal threshold and the prediction accuracy of the first prediction. In this paper, an effective lossless compression method for hyperspectral image based on conventional recursive least squares (CRLS) algorithm and BPS algorithm with adaptive threshold is proposed. Firstly, the CRLS predictor is adopted in the first prediction to improve the accuracy of predicted reference values. Afterwards, a recursive error mean estimation with scaling factor is used to estimate the optimal search threshold in the BPS predictor. Finally, the arithmetic encoder is used to entropy-encode the residuals generated by prediction. The experimental results on Airborne Visible/Infrared Imaging Spectrometer (AVIRIS) images set show that this method significantly improves the compression effect and reduces the computational complexity compared with the typical methods already reported.
辐射校正产生的特殊统计性质严重影响预测精度。后置像素搜索(BPS)算法是目前解决这一问题最有效的方法。然而,BPS算法的有效性取决于最优阈值和第一次预测的预测精度。提出了一种基于传统递归最小二乘(CRLS)算法和自适应阈值BPS算法的高光谱图像无损压缩方法。首先,在第一次预测中采用CRLS预测器,提高预测参考值的精度。然后,使用带比例因子的递归误差均值估计来估计BPS预测器中的最优搜索阈值。最后,利用算法编码器对预测产生的残差进行熵编码。在机载可见/红外成像光谱仪(AVIRIS)图像集上的实验结果表明,与已有的典型方法相比,该方法显著提高了压缩效果,降低了计算复杂度。
{"title":"Lossless Compression for Hyperspectral Images Using Back Pixel Search with Adaptive Threshold","authors":"Fuquan Zhu, Liping Yang","doi":"10.1109/CCISP55629.2022.9974344","DOIUrl":"https://doi.org/10.1109/CCISP55629.2022.9974344","url":null,"abstract":"The special statistical property produced by radiative correction has a serious impact on prediction accuracy. Back pixel search (BPS) algorithm is currently the most effective way to solve this problem. However, the effectiveness of BPS algorithm depends on optimal threshold and the prediction accuracy of the first prediction. In this paper, an effective lossless compression method for hyperspectral image based on conventional recursive least squares (CRLS) algorithm and BPS algorithm with adaptive threshold is proposed. Firstly, the CRLS predictor is adopted in the first prediction to improve the accuracy of predicted reference values. Afterwards, a recursive error mean estimation with scaling factor is used to estimate the optimal search threshold in the BPS predictor. Finally, the arithmetic encoder is used to entropy-encode the residuals generated by prediction. The experimental results on Airborne Visible/Infrared Imaging Spectrometer (AVIRIS) images set show that this method significantly improves the compression effect and reduces the computational complexity compared with the typical methods already reported.","PeriodicalId":431851,"journal":{"name":"2022 7th International Conference on Communication, Image and Signal Processing (CCISP)","volume":"274 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128988857","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Research on preprocessing method based on cylindrical image restoration of steel pipe 基于钢管圆柱图像复原的预处理方法研究
Pub Date : 2022-11-01 DOI: 10.1109/CCISP55629.2022.9974159
Cai Xiang, Kang Yihua, Ma Hongbao, Hong Ning, Qiu Gongzhe
Machine vision is more and more widely used in industrial inspection. In order to improve the industrial production of steel pipes, machine vision is used to replace manual inspection of steel pipe surface defects. Because the surface of the steel pipe is cylindrical, there are different degrees of deformation and information loss in the edge parts on both sides of the image when capturing the steel pipe surface image. Aiming at the deformation and distortion problems of the edge of the steel pipe image, a mathematical model is established and calculated. According to the principle of cylindrical projection, this paper deduces the expression of cylindrical orthographic projection, and puts forward a restoration algorithm of cylindrical back projection, which realizes the cylindrical expansion of steel pipe image, and solves the problems of defect shape distortion caused by the deformation of the edges on both sides of the image and the difficulty of defect detection. The results show that this method is effective and lays a foundation for steel pipe detection.
机器视觉在工业检测中的应用越来越广泛。为了提高钢管的工业化生产,采用机器视觉代替人工对钢管表面缺陷进行检测。由于钢管表面为圆柱形,在采集钢管表面图像时,图像两侧边缘部分存在不同程度的变形和信息损失。针对钢管图像边缘的变形和畸变问题,建立了数学模型并进行了计算。根据圆柱投影的原理,推导出圆柱正投影的表达式,并提出了圆柱背投影的恢复算法,实现了钢管图像的圆柱扩展,解决了由于图像两侧边缘变形造成的缺陷形状畸变和缺陷检测困难的问题。结果表明,该方法是有效的,为钢管检测奠定了基础。
{"title":"Research on preprocessing method based on cylindrical image restoration of steel pipe","authors":"Cai Xiang, Kang Yihua, Ma Hongbao, Hong Ning, Qiu Gongzhe","doi":"10.1109/CCISP55629.2022.9974159","DOIUrl":"https://doi.org/10.1109/CCISP55629.2022.9974159","url":null,"abstract":"Machine vision is more and more widely used in industrial inspection. In order to improve the industrial production of steel pipes, machine vision is used to replace manual inspection of steel pipe surface defects. Because the surface of the steel pipe is cylindrical, there are different degrees of deformation and information loss in the edge parts on both sides of the image when capturing the steel pipe surface image. Aiming at the deformation and distortion problems of the edge of the steel pipe image, a mathematical model is established and calculated. According to the principle of cylindrical projection, this paper deduces the expression of cylindrical orthographic projection, and puts forward a restoration algorithm of cylindrical back projection, which realizes the cylindrical expansion of steel pipe image, and solves the problems of defect shape distortion caused by the deformation of the edges on both sides of the image and the difficulty of defect detection. The results show that this method is effective and lays a foundation for steel pipe detection.","PeriodicalId":431851,"journal":{"name":"2022 7th International Conference on Communication, Image and Signal Processing (CCISP)","volume":"111 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129277801","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Analysis of the effect of carbon emissions on meteorological factors in Yunnan province 云南省碳排放对气象因子的影响分析
Pub Date : 2022-11-01 DOI: 10.1109/CCISP55629.2022.9974575
Guilan Luo, Xin Ma, Xuan Liu, Anshun Hu, Caikui Wang, Lianbiao Fang
In recent years, global warming and climate extremes have occurred frequently, seriously affecting human life, production and sustainable development.To study the correlation between carbon emissions and climate change, complex networks and big data statistical analysis methods were used to construct a multi-factor climate network of Yunnan carbon emissions and multi-factor climate network by determining the connectivity of edges through Pearson correlation coefficients, and sliding series correlation was used to analyse the effect of carbon emissions on meteorological factors.The results show that carbon emissions show a positive correlation with temperature, wind speed and sunshine, and a negative correlation with air pressure, precipitation and humidity.On long time scales carbon emissions have an impact on changes in meteorological factors, with an immediate effect on wind speed and a trend from lagging to immediate on precipitation.The study provides some theoretical reference for the control of CO2 emissions in Yunnan Province.
近年来,全球气候变暖和极端气候频繁发生,严重影响了人类的生活、生产和可持续发展。为研究碳排放与气候变化的相关性,采用复杂网络和大数据统计分析方法,通过Pearson相关系数确定边的连通性,构建了云南碳排放多因子气候网络和多因子气候网络,并采用滑动序列相关分析了碳排放对气象因子的影响。结果表明:碳排放与气温、风速、日照呈显著正相关,与气压、降水、湿度呈显著负相关;在长时间尺度上,碳排放对气象因子的变化有影响,对风速有直接影响,对降水有从滞后到直接的趋势。研究结果为云南省二氧化碳排放的控制提供了一定的理论参考。
{"title":"Analysis of the effect of carbon emissions on meteorological factors in Yunnan province","authors":"Guilan Luo, Xin Ma, Xuan Liu, Anshun Hu, Caikui Wang, Lianbiao Fang","doi":"10.1109/CCISP55629.2022.9974575","DOIUrl":"https://doi.org/10.1109/CCISP55629.2022.9974575","url":null,"abstract":"In recent years, global warming and climate extremes have occurred frequently, seriously affecting human life, production and sustainable development.To study the correlation between carbon emissions and climate change, complex networks and big data statistical analysis methods were used to construct a multi-factor climate network of Yunnan carbon emissions and multi-factor climate network by determining the connectivity of edges through Pearson correlation coefficients, and sliding series correlation was used to analyse the effect of carbon emissions on meteorological factors.The results show that carbon emissions show a positive correlation with temperature, wind speed and sunshine, and a negative correlation with air pressure, precipitation and humidity.On long time scales carbon emissions have an impact on changes in meteorological factors, with an immediate effect on wind speed and a trend from lagging to immediate on precipitation.The study provides some theoretical reference for the control of CO2 emissions in Yunnan Province.","PeriodicalId":431851,"journal":{"name":"2022 7th International Conference on Communication, Image and Signal Processing (CCISP)","volume":"5 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114614867","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
AFMTD: Anchor-free Frame for Multi-scale Target Detection AFMTD:多尺度目标检测的无锚框架
Pub Date : 2022-11-01 DOI: 10.1109/CCISP55629.2022.9974392
Xueting Liu, Jingrou Xu, Ruoxi Lin, Jinyang Pan, Junying Mao, Guangqiang Yin
Target detection task plays the most fundamental and important role in computer vision. The appearance of deep learning method has produced a positive effect on target detection, but multi-scale target detection is poor. The reasons could be attributed to two aspects; the first one is that the small target tends to contain less semantic information, which leads algorithm be hard to detect it; the other is that the sample distribution in the practical application scenarios is random, and the different-scaled target features will interfere with each other, which poses negative effect on multi-scale target detection. Based on existing technical issues, we propose an anchor-free frame for the multi-scale target detection (AFMTD) algorithm as solution. First, from the direction of feature fusion, we propose a spatial attention fusion module (SAFM), which designs same scale transformation (SST) based on Bi-FPN, strengthens the valuable information between adjacent feature layers, and suppresses interference features, improving the detection accuracy and resolution ability of targets of different scales. Then, from the direction of anchor-free frame detection, the heatmap-based multi-scale detection module (HMDM) is proposed; by introducing a scale distribution mechanism (SDM) and Heatmap-IOU (HIOU) loss function, the module allocates different targets to different corresponding feature maps, which makes the model converge faster and more accurately. Through experiments on the MS COCO dataset, our approach achieved 40.5% average precision (AP), and the AP of large, medium, and small-scale targets is 24.5%, 44.1%, and 53.9%, respectively.
目标检测任务是计算机视觉中最基础、最重要的任务。深度学习方法的出现对目标检测产生了积极的影响,但多尺度目标检测效果不佳。原因可以归结为两个方面;一是小目标往往包含较少的语义信息,导致算法难以检测到小目标;二是实际应用场景中的样本分布是随机的,不同尺度的目标特征会相互干扰,对多尺度目标检测产生不利影响。针对目前存在的技术问题,提出了一种无锚框架的多尺度目标检测算法作为解决方案。首先,从特征融合的方向,提出了空间注意融合模块(SAFM),该模块设计了基于Bi-FPN的同尺度变换(SST),增强了相邻特征层之间的有价值信息,抑制了干扰特征,提高了不同尺度目标的检测精度和分辨能力。然后,从无锚帧检测的方向,提出了基于热图的多尺度检测模块(HMDM);通过引入SDM (scale distribution mechanism)和HIOU (Heatmap-IOU)损失函数,将不同的目标分配到不同的对应特征映射中,使模型收敛更快、更准确。通过在MS COCO数据集上的实验,我们的方法达到了40.5%的平均精度(AP),大、中、小目标的平均精度分别为24.5%、44.1%和53.9%。
{"title":"AFMTD: Anchor-free Frame for Multi-scale Target Detection","authors":"Xueting Liu, Jingrou Xu, Ruoxi Lin, Jinyang Pan, Junying Mao, Guangqiang Yin","doi":"10.1109/CCISP55629.2022.9974392","DOIUrl":"https://doi.org/10.1109/CCISP55629.2022.9974392","url":null,"abstract":"Target detection task plays the most fundamental and important role in computer vision. The appearance of deep learning method has produced a positive effect on target detection, but multi-scale target detection is poor. The reasons could be attributed to two aspects; the first one is that the small target tends to contain less semantic information, which leads algorithm be hard to detect it; the other is that the sample distribution in the practical application scenarios is random, and the different-scaled target features will interfere with each other, which poses negative effect on multi-scale target detection. Based on existing technical issues, we propose an anchor-free frame for the multi-scale target detection (AFMTD) algorithm as solution. First, from the direction of feature fusion, we propose a spatial attention fusion module (SAFM), which designs same scale transformation (SST) based on Bi-FPN, strengthens the valuable information between adjacent feature layers, and suppresses interference features, improving the detection accuracy and resolution ability of targets of different scales. Then, from the direction of anchor-free frame detection, the heatmap-based multi-scale detection module (HMDM) is proposed; by introducing a scale distribution mechanism (SDM) and Heatmap-IOU (HIOU) loss function, the module allocates different targets to different corresponding feature maps, which makes the model converge faster and more accurately. Through experiments on the MS COCO dataset, our approach achieved 40.5% average precision (AP), and the AP of large, medium, and small-scale targets is 24.5%, 44.1%, and 53.9%, respectively.","PeriodicalId":431851,"journal":{"name":"2022 7th International Conference on Communication, Image and Signal Processing (CCISP)","volume":"214 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114846197","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Root cause analysis of network fault based on random forest 基于随机森林的网络故障根本原因分析
Pub Date : 2022-11-01 DOI: 10.1109/CCISP55629.2022.9974518
Li Liu, Ke Zhang, Linjun Liu, Le Zhang, Jun Zhang
Artificial intelligence (AI) has become an important means of network anomaly detection and fault root cause analysis (RCA), but most applications are only for a certain segment of the network. In the process of our research on the end-to-end experience analysis of mobile Internet services, we have summarized a set of network end-to-end root cause analysis methods, mainly using non-orthogonal random forest modeling method, AI-based dynamic threshold adjustment and indicator feature extraction, classification modeling and cross-validation of the poor quality of the entire network and the poor quality of the cells. This method has been verified in practice in the production network. The results of root cause analysis are consistent with the actual situation of the production network up to 96%. Practice has proved that this method has played a positive role in supporting the network operation, and greatly improved the production efficiency.
人工智能(AI)已成为网络异常检测和故障根本原因分析(RCA)的重要手段,但大多数应用仅针对某一网络段。在对移动互联网服务端到端体验分析的研究过程中,我们总结了一套网络端到端根本原因分析方法,主要采用非正交随机森林建模方法、基于ai的动态阈值调整和指标特征提取、对整个网络质量差和小区质量差进行分类建模和交叉验证。该方法在实际生产网络中得到了验证。根本原因分析结果与生产网络实际情况的符合率高达96%。实践证明,该方法对网络运营起到了积极的支持作用,大大提高了生产效率。
{"title":"Root cause analysis of network fault based on random forest","authors":"Li Liu, Ke Zhang, Linjun Liu, Le Zhang, Jun Zhang","doi":"10.1109/CCISP55629.2022.9974518","DOIUrl":"https://doi.org/10.1109/CCISP55629.2022.9974518","url":null,"abstract":"Artificial intelligence (AI) has become an important means of network anomaly detection and fault root cause analysis (RCA), but most applications are only for a certain segment of the network. In the process of our research on the end-to-end experience analysis of mobile Internet services, we have summarized a set of network end-to-end root cause analysis methods, mainly using non-orthogonal random forest modeling method, AI-based dynamic threshold adjustment and indicator feature extraction, classification modeling and cross-validation of the poor quality of the entire network and the poor quality of the cells. This method has been verified in practice in the production network. The results of root cause analysis are consistent with the actual situation of the production network up to 96%. Practice has proved that this method has played a positive role in supporting the network operation, and greatly improved the production efficiency.","PeriodicalId":431851,"journal":{"name":"2022 7th International Conference on Communication, Image and Signal Processing (CCISP)","volume":"52 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122276289","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Conceptual Framework For Optimized Pipeline Selection For Brain Tractography Using Multi-Criteria Decision Analysis 基于多准则决策分析的脑导管造影管道选择优化概念框架
Pub Date : 2022-11-01 DOI: 10.1109/CCISP55629.2022.9974377
Idrees Fazili, A. Achuthan, M. Mustapha, B. Belaton
Diffusion Tensor Imaging (DTI) allows us to reconstruct the brain white matter (WM) pathways in-vivo. Generating a diffusion tractograph from raw MRI data involves multiple layers of processes. Each set of processes that produces a particular analysis is called a pipeline. An extensive collection of software tools have been developed over the years for each layer of tractograph generation, giving researchers the freedom to choose the tools of their preference for different processes. However, this has resulted in the establishment of various pipelines aimed towards the same task, and depending upon an analysis, one pipeline may be more suitable than the other. This creates a hurdle for the clinical application of the DTI tools, as the clinicians and neuroscience researchers are not usually conversant with the technical aspects of the DTI tools. This study proposes an automated decision model for selection of tractography pipelines that will allow researchers and clinicians to select best of the possible DTI pipelines for a particular Analysis.
弥散张量成像(DTI)使我们能够在体内重建脑白质(WM)通路。从原始MRI数据生成弥散束图涉及多层过程。产生特定分析的每一组过程称为管道。多年来,为每一层牵引仪生成开发了大量的软件工具,使研究人员可以自由地选择他们喜欢的工具来进行不同的过程。然而,这导致建立了针对同一任务的各种管道,并且根据分析,一个管道可能比另一个更合适。这给DTI工具的临床应用带来了障碍,因为临床医生和神经科学研究人员通常不熟悉DTI工具的技术方面。本研究提出了一个自动决策模型,用于选择导管造影管道,这将允许研究人员和临床医生为特定的分析选择最好的可能的DTI管道。
{"title":"Conceptual Framework For Optimized Pipeline Selection For Brain Tractography Using Multi-Criteria Decision Analysis","authors":"Idrees Fazili, A. Achuthan, M. Mustapha, B. Belaton","doi":"10.1109/CCISP55629.2022.9974377","DOIUrl":"https://doi.org/10.1109/CCISP55629.2022.9974377","url":null,"abstract":"Diffusion Tensor Imaging (DTI) allows us to reconstruct the brain white matter (WM) pathways in-vivo. Generating a diffusion tractograph from raw MRI data involves multiple layers of processes. Each set of processes that produces a particular analysis is called a pipeline. An extensive collection of software tools have been developed over the years for each layer of tractograph generation, giving researchers the freedom to choose the tools of their preference for different processes. However, this has resulted in the establishment of various pipelines aimed towards the same task, and depending upon an analysis, one pipeline may be more suitable than the other. This creates a hurdle for the clinical application of the DTI tools, as the clinicians and neuroscience researchers are not usually conversant with the technical aspects of the DTI tools. This study proposes an automated decision model for selection of tractography pipelines that will allow researchers and clinicians to select best of the possible DTI pipelines for a particular Analysis.","PeriodicalId":431851,"journal":{"name":"2022 7th International Conference on Communication, Image and Signal Processing (CCISP)","volume":"321 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131985372","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
2022 7th International Conference on Communication, Image and Signal Processing (CCISP)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1