首页 > 最新文献

光电工程最新文献

英文 中文
Research progress of electromagnetic properties of tunable chiral metasurfaces 可调手性超表面电磁特性的研究进展
Q3 Engineering Pub Date : 2021-02-26 DOI: 10.12086/OEE.2021.200218
Wang Jinjin, Zhu Qiuhao, Dong Jian-feng
Chiral metasurfaces are ultra-thin metamaterials composed of planar chiral cell structures with specific electromagnetic responses. They have attracted great attention due to their singular ability to control electromagnetic waves at will. With tunable materials incorporated into the metasurfaces design, one can realize tunable/reconfigurable metadevices with functionalities controlled by external stimuli, opening a new platform to dynamically manipulate electromagnetic waves. In this paper, we introduce some theoretical foundations of the electromagnetic properties of tunable/reconfigurable chiral metasurfaces. When a linearly polarized light enters a tunable chiral metasurface, it can be decomposed into left-handed circularly polarized (LCP) wave and right-handed circularly polarized (RCP) wave. By changing the dielectric constant and magnetic permeability of the medium through the external environment, the metadevices can dynamically control the response characteristics to various polarized lights, especially circularly polarized lights such as refractive index, dichroism, optical rotation, asymmetric transmission, etc. According to the properties of negative refractive index, circular dichroism, optical rotation, and asymmetric transmission controlled by the tunable chiral metasurfaces, we review the latest research progress. Finally, we put forward our own opinions on the possible future development directions and existing challenges of the rapidly developing field of the tunable chiral metasurface.
手性超表面是由具有特定电磁响应的平面手性细胞结构组成的超薄超材料。它们因具有随意控制电磁波的独特能力而引起了极大的关注。将可调谐材料纳入元表面设计,可以实现具有外部刺激控制功能的可调谐/可重构元器件,为动态操纵电磁波开辟了新的平台。本文介绍了可调/可重构手性超表面电磁特性的一些理论基础。当线偏振光进入可调谐的手性超表面时,可将其分解为左手圆偏振光(LCP)波和右手圆偏振光(RCP)波。元器件通过外界环境改变介质的介电常数和磁导率,可以动态控制对各种偏振光,特别是圆偏振光的响应特性,如折射率、二色性、旋光性、不对称透射等。根据可调手性超表面控制的负折射率、圆二色性、旋光性和不对称透射等特性,综述了其最新研究进展。最后,对快速发展的可调手性超表面领域未来可能的发展方向和存在的挑战提出了自己的看法。
{"title":"Research progress of electromagnetic properties of tunable chiral metasurfaces","authors":"Wang Jinjin, Zhu Qiuhao, Dong Jian-feng","doi":"10.12086/OEE.2021.200218","DOIUrl":"https://doi.org/10.12086/OEE.2021.200218","url":null,"abstract":"Chiral metasurfaces are ultra-thin metamaterials composed of planar chiral cell structures with specific electromagnetic responses. They have attracted great attention due to their singular ability to control electromagnetic waves at will. With tunable materials incorporated into the metasurfaces design, one can realize tunable/reconfigurable metadevices with functionalities controlled by external stimuli, opening a new platform to dynamically manipulate electromagnetic waves. In this paper, we introduce some theoretical foundations of the electromagnetic properties of tunable/reconfigurable chiral metasurfaces. When a linearly polarized light enters a tunable chiral metasurface, it can be decomposed into left-handed circularly polarized (LCP) wave and right-handed circularly polarized (RCP) wave. By changing the dielectric constant and magnetic permeability of the medium through the external environment, the metadevices can dynamically control the response characteristics to various polarized lights, especially circularly polarized lights such as refractive index, dichroism, optical rotation, asymmetric transmission, etc. According to the properties of negative refractive index, circular dichroism, optical rotation, and asymmetric transmission controlled by the tunable chiral metasurfaces, we review the latest research progress. Finally, we put forward our own opinions on the possible future development directions and existing challenges of the rapidly developing field of the tunable chiral metasurface.","PeriodicalId":39552,"journal":{"name":"Guangdian Gongcheng/Opto-Electronic Engineering","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2021-02-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"84509123","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
RGB-D object recognition algorithm based on improved double stream convolution recursive neural network 基于改进双流卷积递归神经网络的RGB-D目标识别算法
Q3 Engineering Pub Date : 2021-02-26 DOI: 10.12086/OEE.2021.200069
Li Xun, Li Linpeng, A. Lazovik, Wang Wenjie, W. Xiaohua
An algorithm (Re-CRNN) of image processing is proposed using RGB-D object recognition, which is improved based on a double stream convolutional recursive neural network, in order to improve the accuracy of object recognition. Re-CRNN combines RGB image with depth optical information, the double stream convolutional neural network (CNN) is improved based on the idea of residual learning as follows: top-level feature fusion unit is added into the network, the representation of federation feature is learning in RGB images and depth images and the high-level features are integrated in across channels of the extracted RGB images and depth images information, after that, the probability distribution was generated by Softmax. Finally, the experiment was carried out on the standard RGB-D data set. The experimental results show that the accuracy was 94.1% using Re-CRNN algorithm for the RGB-D object recognition, which was significantly improved compared with the existing image-based object recognition methods.
为了提高目标识别的精度,提出了一种基于RGB-D目标识别的图像处理算法(Re-CRNN),该算法在双流卷积递归神经网络的基础上进行了改进。Re-CRNN将RGB图像与深度光学信息相结合,基于残差学习的思想对双流卷积神经网络(CNN)进行了如下改进:在网络中加入顶层特征融合单元,在RGB图像和深度图像中学习联邦特征的表示,在提取的RGB图像和深度图像信息的跨通道中集成高层特征,然后通过Softmax生成概率分布。最后,在标准RGB-D数据集上进行实验。实验结果表明,Re-CRNN算法用于RGB-D目标识别的准确率为94.1%,与现有的基于图像的目标识别方法相比有显著提高。
{"title":"RGB-D object recognition algorithm based on improved double stream convolution recursive neural network","authors":"Li Xun, Li Linpeng, A. Lazovik, Wang Wenjie, W. Xiaohua","doi":"10.12086/OEE.2021.200069","DOIUrl":"https://doi.org/10.12086/OEE.2021.200069","url":null,"abstract":"An algorithm (Re-CRNN) of image processing is proposed using RGB-D object recognition, which is improved based on a double stream convolutional recursive neural network, in order to improve the accuracy of object recognition. Re-CRNN combines RGB image with depth optical information, the double stream convolutional neural network (CNN) is improved based on the idea of residual learning as follows: top-level feature fusion unit is added into the network, the representation of federation feature is learning in RGB images and depth images and the high-level features are integrated in across channels of the extracted RGB images and depth images information, after that, the probability distribution was generated by Softmax. Finally, the experiment was carried out on the standard RGB-D data set. The experimental results show that the accuracy was 94.1% using Re-CRNN algorithm for the RGB-D object recognition, which was significantly improved compared with the existing image-based object recognition methods.","PeriodicalId":39552,"journal":{"name":"Guangdian Gongcheng/Opto-Electronic Engineering","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2021-02-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"83868023","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
EEG emotion recognition based on linear kernel PCA and XGBoost 基于线性核PCA和XGBoost的脑电情感识别
Q3 Engineering Pub Date : 2021-02-26 DOI: 10.12086/OEE.2021.200013
Dong Yindong, R. Fuji, Liu Chunbin
The principal component analysis of linear kernel and XGBoost models are introduced to design electro-encephalogram (EEG) classification algorithm of four emotional states under continuous audio-visual stimulation. In order to reflect universality, the traditional power spectral density (PSD) is used as the feature of EEG signal, and the feature importance measure under the weight index is obtained with XGBoost learning. Then linear kernel principal component analysis is used to process the threshold selected features and send them to XGBoost model for recognition. According to the experimental analysis, gamma-band plays a more important role than other bands in XGBoost model recognition; in addition, for distribution on channels, the central, parietal, and right occipital regions play a more important role than other brain regions. The recognition accuracy of this algorithm is 78.4% and 92.6% respectively under the two recognition schemes of subjects all participation (SAP) and subject single dependent (SSD). Compared with other literature, this algorithm has made a great improvement. The scheme proposed is helpful to improve the recognition performance of brain-computer emotion system under audio-visual stimulation.
引入线性核函数的主成分分析和XGBoost模型,设计了连续视听刺激下四种情绪状态的脑电分类算法。为了体现普适性,采用传统的功率谱密度(PSD)作为脑电信号的特征,利用XGBoost学习得到权指标下的特征重要性测度。然后利用线性核主成分分析对阈值选择的特征进行处理,并将其发送给XGBoost模型进行识别。实验分析表明,在XGBoost模型识别中,伽马波段比其他波段发挥更重要的作用;此外,对于通道的分布,中央、顶叶和右枕区比其他脑区起更重要的作用。在受试者全参与(SAP)和受试者单依赖(SSD)两种识别方案下,该算法的识别准确率分别为78.4%和92.6%。与其他文献相比,该算法有了很大的改进。该方案有助于提高脑机情感系统在视听刺激下的识别性能。
{"title":"EEG emotion recognition based on linear kernel PCA and XGBoost","authors":"Dong Yindong, R. Fuji, Liu Chunbin","doi":"10.12086/OEE.2021.200013","DOIUrl":"https://doi.org/10.12086/OEE.2021.200013","url":null,"abstract":"The principal component analysis of linear kernel and XGBoost models are introduced to design electro-encephalogram (EEG) classification algorithm of four emotional states under continuous audio-visual stimulation. In order to reflect universality, the traditional power spectral density (PSD) is used as the feature of EEG signal, and the feature importance measure under the weight index is obtained with XGBoost learning. Then linear kernel principal component analysis is used to process the threshold selected features and send them to XGBoost model for recognition. According to the experimental analysis, gamma-band plays a more important role than other bands in XGBoost model recognition; in addition, for distribution on channels, the central, parietal, and right occipital regions play a more important role than other brain regions. The recognition accuracy of this algorithm is 78.4% and 92.6% respectively under the two recognition schemes of subjects all participation (SAP) and subject single dependent (SSD). Compared with other literature, this algorithm has made a great improvement. The scheme proposed is helpful to improve the recognition performance of brain-computer emotion system under audio-visual stimulation.","PeriodicalId":39552,"journal":{"name":"Guangdian Gongcheng/Opto-Electronic Engineering","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2021-02-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"82211394","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Insulator nondestructive testing based on VGGNet algorithm 基于VGGNet算法的绝缘子无损检测
Q3 Engineering Pub Date : 2021-01-15 DOI: 10.12086/OEE.2021.200072
Ma Lixin, Dou Chenfei, Song Chencan, Yan Tianxiao
In the power system, it is difficult to detect the insulator's deterioration in operation. Aiming at this problem, this thesis applies the convolution neural network algorithm to evaluate the insulator's deterioration degree based on the deep analysis of the principle and structure of the convolution neural network model. Firstly, the power frequency flashover test was conducted on the insulator to produce three states as follows: no discharge, weak discharge, and strong discharge. Moreover, the Ultraviolet imager was applied to collect the insulator's ultraviolet images in different discharge state to establish the ultraviolet images sample library. Subsequently, the VGGNet framework neural network algorithm was applied to perform the classification training and the state-prediction evaluation on the samples so as to eventually achieve the purpose of judging whether the insulator is degraded. From the experimental results, it can be seen that the accuracy rate of the algorithm is as high as 98.4%, which has broad application prospects in the insulator's degradation detection. Furthermore, it provides a mentality for the reliability detection of other power equipments.
在电力系统中,绝缘子在运行过程中的劣化是难以检测的。针对这一问题,本文在深入分析卷积神经网络模型原理和结构的基础上,应用卷积神经网络算法对绝缘子劣化程度进行评估。首先对绝缘子进行工频闪络试验,产生无放电、弱放电和强放电三种状态。利用紫外成像仪采集绝缘子在不同放电状态下的紫外图像,建立了绝缘子紫外图像样本库。随后,利用VGGNet框架神经网络算法对样本进行分类训练和状态预测评估,最终达到判断绝缘子是否退化的目的。从实验结果可以看出,该算法的准确率高达98.4%,在绝缘子退化检测中具有广阔的应用前景。同时也为其他电力设备的可靠性检测提供了思路。
{"title":"Insulator nondestructive testing based on VGGNet algorithm","authors":"Ma Lixin, Dou Chenfei, Song Chencan, Yan Tianxiao","doi":"10.12086/OEE.2021.200072","DOIUrl":"https://doi.org/10.12086/OEE.2021.200072","url":null,"abstract":"In the power system, it is difficult to detect the insulator's deterioration in operation. Aiming at this problem, this thesis applies the convolution neural network algorithm to evaluate the insulator's deterioration degree based on the deep analysis of the principle and structure of the convolution neural network model. Firstly, the power frequency flashover test was conducted on the insulator to produce three states as follows: no discharge, weak discharge, and strong discharge. Moreover, the Ultraviolet imager was applied to collect the insulator's ultraviolet images in different discharge state to establish the ultraviolet images sample library. Subsequently, the VGGNet framework neural network algorithm was applied to perform the classification training and the state-prediction evaluation on the samples so as to eventually achieve the purpose of judging whether the insulator is degraded. From the experimental results, it can be seen that the accuracy rate of the algorithm is as high as 98.4%, which has broad application prospects in the insulator's degradation detection. Furthermore, it provides a mentality for the reliability detection of other power equipments.","PeriodicalId":39552,"journal":{"name":"Guangdian Gongcheng/Opto-Electronic Engineering","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2021-01-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"87649171","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Vehicle identification number recognition based on neural network 基于神经网络的车辆识别号码识别
Q3 Engineering Pub Date : 2021-01-15 DOI: 10.12086/OEE.2021.200094
Meng Fanjun, Yin Dong
It is far essential to properly recognize the vehicle identification number (VIN) engraved on the car frame for vehicle surveillance and identification. In this paper, we propose an algorithm for recognizing rotational VIN im-ages based on neural network which incorporates two components: VIN detection and VIN recognition. Firstly, with lightweight neural network and text segmentation based on EAST, we attain efficient and excellent VIN detection performance. Secondly, the VIN recognition is regarded as a sequence classification problem. By means of connecting sequential classifiers, we predict VIN characters directly and precisely. For validating our algorithm, we collect a VIN dataset, which contains raw rotational VIN images and horizontal VIN images. Experimental results show that the algorithm we proposed achieves good performance on VIN detection and VIN recognition in real time.
正确识别镌刻在车架上的车辆识别号码对于车辆监控和识别至关重要。本文提出了一种基于神经网络的旋转VIN图像识别算法,该算法包含两个部分:VIN检测和VIN识别。首先,利用轻量级神经网络和基于EAST的文本分割,实现了高效、优异的识别码检测性能;其次,将VIN识别视为一个序列分类问题。通过连接顺序分类器,可以直接准确地预测VIN字符。为了验证我们的算法,我们收集了一个VIN数据集,其中包含原始的旋转VIN图像和水平VIN图像。实验结果表明,该算法在VIN检测和实时识别方面取得了较好的效果。
{"title":"Vehicle identification number recognition based on neural network","authors":"Meng Fanjun, Yin Dong","doi":"10.12086/OEE.2021.200094","DOIUrl":"https://doi.org/10.12086/OEE.2021.200094","url":null,"abstract":"It is far essential to properly recognize the vehicle identification number (VIN) engraved on the car frame for vehicle surveillance and identification. In this paper, we propose an algorithm for recognizing rotational VIN im-ages based on neural network which incorporates two components: VIN detection and VIN recognition. Firstly, with lightweight neural network and text segmentation based on EAST, we attain efficient and excellent VIN detection performance. Secondly, the VIN recognition is regarded as a sequence classification problem. By means of connecting sequential classifiers, we predict VIN characters directly and precisely. For validating our algorithm, we collect a VIN dataset, which contains raw rotational VIN images and horizontal VIN images. Experimental results show that the algorithm we proposed achieves good performance on VIN detection and VIN recognition in real time.","PeriodicalId":39552,"journal":{"name":"Guangdian Gongcheng/Opto-Electronic Engineering","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2021-01-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"88994332","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The detection method for grab of portal crane based on deep learning 基于深度学习的门机抓斗检测方法
Q3 Engineering Pub Date : 2021-01-15 DOI: 10.12086/OEE.2021.200062
Zhang Wenming, Li Xiangyang, Li Hai-bin, Liu Ya-qian
In order to solve the problems of low work efficiency and safety caused by the inability of human eyes to accurately determine the position of the grab during the loading and unloading of dry bulk cargo by portal crane, a method of grab detection based on deep learning is proposed for the first time. The improved deep convolution neural network (YOLOv3-tiny) is used to train and test on the data set of grab, and then to learn its internal feature representation. The experimental results show that the detection method based on deep learning can achieve a detection speed of 45 frames per second and a recall rate of 95.78%. It can meet the real-time and accuracy of detection, and improve the safety and efficiency of work in the industrial field.
为了解决门式起重机装卸干散货过程中人眼无法准确判断抓斗位置造成的工作效率低、安全性低等问题,首次提出了一种基于深度学习的抓斗检测方法。利用改进的深度卷积神经网络(YOLOv3-tiny)对抓取数据集进行训练和测试,进而学习其内部特征表示。实验结果表明,基于深度学习的检测方法可以实现45帧/秒的检测速度和95.78%的召回率。满足了检测的实时性和准确性,提高了工业现场工作的安全性和效率。
{"title":"The detection method for grab of portal crane based on deep learning","authors":"Zhang Wenming, Li Xiangyang, Li Hai-bin, Liu Ya-qian","doi":"10.12086/OEE.2021.200062","DOIUrl":"https://doi.org/10.12086/OEE.2021.200062","url":null,"abstract":"In order to solve the problems of low work efficiency and safety caused by the inability of human eyes to accurately determine the position of the grab during the loading and unloading of dry bulk cargo by portal crane, a method of grab detection based on deep learning is proposed for the first time. The improved deep convolution neural network (YOLOv3-tiny) is used to train and test on the data set of grab, and then to learn its internal feature representation. The experimental results show that the detection method based on deep learning can achieve a detection speed of 45 frames per second and a recall rate of 95.78%. It can meet the real-time and accuracy of detection, and improve the safety and efficiency of work in the industrial field.","PeriodicalId":39552,"journal":{"name":"Guangdian Gongcheng/Opto-Electronic Engineering","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2021-01-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"90914990","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Learning adaptive spatial regularization and aberrance repression correlation filters for visual tracking 学习用于视觉跟踪的自适应空间正则化和畸变抑制相关滤波器
Q3 Engineering Pub Date : 2021-01-15 DOI: 10.12086/OEE.2021.200068
Wang Ye, Liu Qiang, Qin Linbo, Qizhi Teng, X. He
This paper proposes a correlation filter tracking algorithm based on adaptive spatial regularization and aberrance repression aiming at the problem that the spatial regularization weight of the background-aware correlation filter is fixed and does not adapt to the change of the target, and the problem that enlarging search area may introduce background noise, decreasing the discrimination ability of filters. First, FHOG features, CN features, and gray features are extracted to enhance the algorithm's ability to express the target. Second, aberrance repression terms are added to the target function to constrain the response map of the current frame, and to enhance the filter's discrimination ability to alleviate the filter model degradation. Finally, adaptive spatial regularization terms are added to the objective function to make the spatial regularization weights being updated as the objective changes, so that the filter can make full use of the target's diversity information. This paper involves experiments on the public data sets OTB-2013, OTB-2015 and VOT2016 to evaluate the proposed algorithm. The experimental results show that the speed of the algorithm used in this paper is 20 frames/s, evaluation indicators such as distance accuracy and success rate are superior to comparison algorithms, and it has good robustness in a variety of complex scenarios such as occlusion, background interference, and rotation changes.
针对背景感知相关滤波器空间正则化权值固定且不适应目标变化的问题,以及扩大搜索区域可能引入背景噪声,降低滤波器识别能力的问题,提出了一种基于自适应空间正则化和畸变抑制的相关滤波器跟踪算法。首先,提取FHOG特征、CN特征和灰度特征,增强算法对目标的表达能力;其次,在目标函数中加入异常抑制项,约束当前帧的响应映射,增强滤波器的识别能力,减轻滤波器模型的退化;最后,在目标函数中加入自适应空间正则化项,使空间正则化权值随着目标的变化而更新,使滤波器能够充分利用目标的分集信息。本文在OTB-2013、OTB-2015和VOT2016三个公共数据集上进行了实验,对所提出的算法进行了评估。实验结果表明,本文采用的算法速度为20帧/秒,距离精度、成功率等评价指标优于比较算法,并且在遮挡、背景干扰、旋转变化等多种复杂场景下具有良好的鲁棒性。
{"title":"Learning adaptive spatial regularization and aberrance repression correlation filters for visual tracking","authors":"Wang Ye, Liu Qiang, Qin Linbo, Qizhi Teng, X. He","doi":"10.12086/OEE.2021.200068","DOIUrl":"https://doi.org/10.12086/OEE.2021.200068","url":null,"abstract":"This paper proposes a correlation filter tracking algorithm based on adaptive spatial regularization and aberrance repression aiming at the problem that the spatial regularization weight of the background-aware correlation filter is fixed and does not adapt to the change of the target, and the problem that enlarging search area may introduce background noise, decreasing the discrimination ability of filters. First, FHOG features, CN features, and gray features are extracted to enhance the algorithm's ability to express the target. Second, aberrance repression terms are added to the target function to constrain the response map of the current frame, and to enhance the filter's discrimination ability to alleviate the filter model degradation. Finally, adaptive spatial regularization terms are added to the objective function to make the spatial regularization weights being updated as the objective changes, so that the filter can make full use of the target's diversity information. This paper involves experiments on the public data sets OTB-2013, OTB-2015 and VOT2016 to evaluate the proposed algorithm. The experimental results show that the speed of the algorithm used in this paper is 20 frames/s, evaluation indicators such as distance accuracy and success rate are superior to comparison algorithms, and it has good robustness in a variety of complex scenarios such as occlusion, background interference, and rotation changes.","PeriodicalId":39552,"journal":{"name":"Guangdian Gongcheng/Opto-Electronic Engineering","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2021-01-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"75223799","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A comparative study of time of flight extraction methods in non-line-of-sight location 非视距定位下飞行时间提取方法的比较研究
Q3 Engineering Pub Date : 2021-01-15 DOI: 10.12086/OEE.2021.200124
Ren Yu, Luo Yihan, Xu Shaoxiong, Ma Hao-Tong, Tan Yi
Non-line-of-sight location is an active detection technology which is used to detect the position of objects out of sight by extracting the time of flight. It is a research hotspot in recent years. In order to study the performance differences of mean filter, median filter and Gaussian filter in extracting time of flight, firstly, the energy changing model of photon flight model is optimized by photometry, and then the parameters of the three filtering methods are optimized and analyzed. After that, the adaptability of these three extraction methods to the maximum value judgment method and probability threshold weighted judgment method is analyzed. Finally, the accuracy and stability of these three time extraction algorithms are compared by using the positions of devices and invisible object as variables. The simulation results show that the median filter is suitable for a narrow environment and it has the high accuracy in positioning; the locations with Gaussian filter have good positioning stability and there is a wider selection range of filtering parameters when the signal is processed with Gaussian filter.
非视距定位是一种主动检测技术,通过提取飞行时间来检测视距外物体的位置。这是近年来的研究热点。为了研究均值滤波、中值滤波和高斯滤波在提取飞行时间方面的性能差异,首先利用光度法对光子飞行模型的能量变化模型进行优化,然后对三种滤波方法的参数进行优化分析。然后,分析了这三种提取方法对最大值判断法和概率阈值加权判断法的适应性。最后,以设备位置和不可见物体位置为变量,比较了三种时间提取算法的精度和稳定性。仿真结果表明,中值滤波器适用于较窄的环境,具有较高的定位精度;采用高斯滤波器处理信号时,位置具有良好的定位稳定性,滤波参数的选择范围更广。
{"title":"A comparative study of time of flight extraction methods in non-line-of-sight location","authors":"Ren Yu, Luo Yihan, Xu Shaoxiong, Ma Hao-Tong, Tan Yi","doi":"10.12086/OEE.2021.200124","DOIUrl":"https://doi.org/10.12086/OEE.2021.200124","url":null,"abstract":"Non-line-of-sight location is an active detection technology which is used to detect the position of objects out of sight by extracting the time of flight. It is a research hotspot in recent years. In order to study the performance differences of mean filter, median filter and Gaussian filter in extracting time of flight, firstly, the energy changing model of photon flight model is optimized by photometry, and then the parameters of the three filtering methods are optimized and analyzed. After that, the adaptability of these three extraction methods to the maximum value judgment method and probability threshold weighted judgment method is analyzed. Finally, the accuracy and stability of these three time extraction algorithms are compared by using the positions of devices and invisible object as variables. The simulation results show that the median filter is suitable for a narrow environment and it has the high accuracy in positioning; the locations with Gaussian filter have good positioning stability and there is a wider selection range of filtering parameters when the signal is processed with Gaussian filter.","PeriodicalId":39552,"journal":{"name":"Guangdian Gongcheng/Opto-Electronic Engineering","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2021-01-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"84058411","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Super-resolution reconstruction of infrared image based on channel attention and transfer learning 基于通道关注和迁移学习的红外图像超分辨率重建
Q3 Engineering Pub Date : 2021-01-15 DOI: 10.12086/OEE.2021.200045
Sun Rui, Zhang Han, Zhi Cheng, Xudong Zhang
In recent years, infrared imaging technology has developed rapidly and has been increasingly used in military reconnaissance, security surveillance, and medical imaging. However, in the process of infrared image imaging or transmission, it is affected by many factors such as environment and equipment. The infrared image often has a low resolution, which greatly reduces the amount of information contained in the infrared image and restricts the application value of the infrared image. Therefore, how to obtain high-resolution and high-information infrared images has become an issue that people urgently need to solve. In recent years, the development of deep learning technology has made rapid progress, and super-resolution methods based on deep learning have begun to appear. However, if A super-resolution reconstruction method of infrared images based on channel attention and transfer learning was proposed to solve the problems of low resolution and low quality of infrared images. In this method, a deep convolutional neural network is designed to enhance the learning ability of the network by introducing the channel attention mechanism, and the residual learning method is used to mitigate the problem of gradient explosion or disappearance and to accelerate the convergence of the network. Because high-quality infrared images are difficult to collect and insufficient in number, so this method is divided into two steps: the first step is to use natural images to pre-train the neural network model, and the second step is to use transfer learning knowledge to fine-tune the pre-trained model’s parameters with a small number of high-quality infrared images to make the model better in reconstructing the infrared image. Finally, a multi-scale detail boosting filter is added to improve the visual effect of the reconstructed infrared image. Experiments on Set5 and Set14 datasets as well as infrared images show that the deepening network depth and introducing channel attention mechanism can improve the effect of super-resolution reconstruction, transfer learning can well solve the problem of insufficient number of infrared image samples, and multi-scale detail boosting filter can improve the details and increase the amount of information of the reconstruction image.these convolutional neural networks are directly applied to the infrared image field, there are some problems: SRCNN, FSRCNN, and ESPCN have fewer network convolutional layers and insufficient network depth, and the learning features will be relatively single, ignoring the differences between image features. The mutual relationship makes it difficult to extract the deep-level information of the infrared image, and SRGAN may generate super-resolution images that are not close to the original image in certain details, which is not conducive to the application of infrared images in military, medical and surveillance. Another problem that needs to be overcome is that it is difficult to collect a sufficie
近年来,红外成像技术发展迅速,在军事侦察、安防监控、医学成像等领域的应用越来越广泛。然而,红外图像在成像或传输过程中,受到环境、设备等诸多因素的影响。红外图像往往具有较低的分辨率,这大大降低了红外图像所包含的信息量,制约了红外图像的应用价值。因此,如何获取高分辨率、高信息的红外图像成为人们迫切需要解决的问题。近年来,深度学习技术的发展突飞猛进,基于深度学习的超分辨率方法开始出现。针对红外图像分辨率低、质量不高的问题,提出了一种基于通道关注和迁移学习的红外图像超分辨率重建方法。该方法设计了一个深度卷积神经网络,通过引入通道注意机制来增强网络的学习能力,并采用残差学习方法来缓解梯度爆炸或消失的问题,加速网络的收敛。由于高质量红外图像难以采集且数量不足,因此该方法分为两步:第一步是利用自然图像对神经网络模型进行预训练,第二步是利用迁移学习知识对预训练模型的参数进行微调,使用少量的高质量红外图像,使模型更好地重建红外图像。最后,加入多尺度细节增强滤波器,提高重建红外图像的视觉效果。在Set5和Set14数据集以及红外图像上的实验表明,加深网络深度和引入通道关注机制可以提高超分辨率重建效果,迁移学习可以很好地解决红外图像样本数量不足的问题,多尺度细节增强滤波器可以改善重建图像的细节,增加图像的信息量。这些卷积神经网络直接应用于红外图像领域,存在一些问题:SRCNN、FSRCNN和ESPCN的网络卷积层数较少,网络深度不足,学习特征会比较单一,忽略了图像特征之间的差异。这种相互关系使得红外图像的深层信息难以提取,并且SRGAN可能会生成在某些细节上与原始图像不接近的超分辨率图像,这不利于红外图像在军事、医疗和监视中的应用。另一个需要克服的问题是,在现实生活中很难收集到足够数量的高质量红外图像,而普通的深度学习方法需要大量不同场景和目标的图像作为训练样本。利用红外图像作为训练数据集来实现深度学习方法,往往达不到预期的效果。为了解决这些问题,本文提出了一种基于通道关注和迁移学习的红外图像超分辨率重建方法。该方法首先设计一个深度卷积神经网络,集成通道注意机制学习特征空间中通道之间的相关性,增强网络的学习能力,并利用残差学习减少梯度爆炸或消失问题,加快网络收敛速度。然后,考虑到高质量红外图像难以采集且数量不足,将网络训练分为两步:第一步使用自然图像预训练自然图像的超分辨率模型,第二步使用迁移学习知识。利用少量高质量红外图像,快速传递和微调预训练好的模型参数,提高模型对红外图像的重建效果,从而获得红外图像的超分辨率模型。最后,加入多尺度细节增强(MSDB)模块,增强红外重建图像的细节和视觉效果,增加信息量。
{"title":"Super-resolution reconstruction of infrared image based on channel attention and transfer learning","authors":"Sun Rui, Zhang Han, Zhi Cheng, Xudong Zhang","doi":"10.12086/OEE.2021.200045","DOIUrl":"https://doi.org/10.12086/OEE.2021.200045","url":null,"abstract":"In recent years, infrared imaging technology has developed rapidly and has been increasingly used in military reconnaissance, security surveillance, and medical imaging. However, in the process of infrared image imaging or transmission, it is affected by many factors such as environment and equipment. The infrared image often has a low resolution, which greatly reduces the amount of information contained in the infrared image and restricts the application value of the infrared image. Therefore, how to obtain high-resolution and high-information infrared images has become an issue that people urgently need to solve. In recent years, the development of deep learning technology has made rapid progress, and super-resolution methods based on deep learning have begun to appear. However, if A super-resolution reconstruction method of infrared images based on channel attention and transfer learning was proposed to solve the problems of low resolution and low quality of infrared images. In this method, a deep convolutional neural network is designed to enhance the learning ability of the network by introducing the channel attention mechanism, and the residual learning method is used to mitigate the problem of gradient explosion or disappearance and to accelerate the convergence of the network. Because high-quality infrared images are difficult to collect and insufficient in number, so this method is divided into two steps: the first step is to use natural images to pre-train the neural network model, and the second step is to use transfer learning knowledge to fine-tune the pre-trained model’s parameters with a small number of high-quality infrared images to make the model better in reconstructing the infrared image. Finally, a multi-scale detail boosting filter is added to improve the visual effect of the reconstructed infrared image. Experiments on Set5 and Set14 datasets as well as infrared images show that the deepening network depth and introducing channel attention mechanism can improve the effect of super-resolution reconstruction, transfer learning can well solve the problem of insufficient number of infrared image samples, and multi-scale detail boosting filter can improve the details and increase the amount of information of the reconstruction image.these convolutional neural networks are directly applied to the infrared image field, there are some problems: SRCNN, FSRCNN, and ESPCN have fewer network convolutional layers and insufficient network depth, and the learning features will be relatively single, ignoring the differences between image features. The mutual relationship makes it difficult to extract the deep-level information of the infrared image, and SRGAN may generate super-resolution images that are not close to the original image in certain details, which is not conducive to the application of infrared images in military, medical and surveillance. Another problem that needs to be overcome is that it is difficult to collect a sufficie","PeriodicalId":39552,"journal":{"name":"Guangdian Gongcheng/Opto-Electronic Engineering","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2021-01-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"76583935","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Crack detection based on multi-scale Faster RCNN with attention 基于多尺度快速RCNN的裂纹检测
Q3 Engineering Pub Date : 2021-01-15 DOI: 10.12086/OEE.2021.200112
Haiyong Chen, Zhao Peng, Haowei Yan
The background of the EL image of a photovoltaic cell under electroluminescence (EL) presents complex non-uniform texture features, and there are grain pseudo-defects similar to cracks. At the same time, the cracks appear as multi-scale features with various shapes. The above mentioned difficulties have presented great chal-lenges for the detection task. Therefore, this paper proposes a multi-scale Faster-RCNN model that integrates at-tention. On the one hand, an improved feature pyramid network is used to obtain multi-scale advanced semantic feature maps to improve the network's feature expression ability of multi-scale crack defects. On the other hand, an improved attention region proposal network A-RPN is adopted to increase the model's attention to crack defects and suppress the characteristics of complex background and grain pseudo-defects. At the same time, in the RPN network training process, a loss function Focal loss is used to reduce the proportion of simple samples in the training process, so that the model pays more attention to the samples that are difficult to distinguish. Experimental results show that this algorithm improves the accuracy of crack defect detection in EL images, reaching nearly 95%.
光电电池在电致发光(EL)下的EL图像背景呈现出复杂的非均匀纹理特征,存在类似裂纹的颗粒伪缺陷。同时,裂缝呈现出形状各异的多尺度特征。上述困难对检测任务提出了很大的挑战。因此,本文提出了一种多尺度的Faster-RCNN模型。一方面,利用改进的特征金字塔网络获得多尺度高级语义特征映射,提高网络对多尺度裂纹缺陷的特征表达能力;另一方面,采用改进的关注区域建议网络A-RPN,提高模型对裂纹缺陷的关注程度,抑制复杂背景和晶粒伪缺陷的特征。同时,在RPN网络的训练过程中,采用损失函数Focal loss来降低训练过程中简单样本的比例,使模型更加关注难以区分的样本。实验结果表明,该算法提高了EL图像中裂纹缺陷检测的准确率,达到近95%。
{"title":"Crack detection based on multi-scale Faster RCNN with attention","authors":"Haiyong Chen, Zhao Peng, Haowei Yan","doi":"10.12086/OEE.2021.200112","DOIUrl":"https://doi.org/10.12086/OEE.2021.200112","url":null,"abstract":"The background of the EL image of a photovoltaic cell under electroluminescence (EL) presents complex non-uniform texture features, and there are grain pseudo-defects similar to cracks. At the same time, the cracks appear as multi-scale features with various shapes. The above mentioned difficulties have presented great chal-lenges for the detection task. Therefore, this paper proposes a multi-scale Faster-RCNN model that integrates at-tention. On the one hand, an improved feature pyramid network is used to obtain multi-scale advanced semantic feature maps to improve the network's feature expression ability of multi-scale crack defects. On the other hand, an improved attention region proposal network A-RPN is adopted to increase the model's attention to crack defects and suppress the characteristics of complex background and grain pseudo-defects. At the same time, in the RPN network training process, a loss function Focal loss is used to reduce the proportion of simple samples in the training process, so that the model pays more attention to the samples that are difficult to distinguish. Experimental results show that this algorithm improves the accuracy of crack defect detection in EL images, reaching nearly 95%.","PeriodicalId":39552,"journal":{"name":"Guangdian Gongcheng/Opto-Electronic Engineering","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2021-01-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"89582475","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
期刊
光电工程
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1