International Conference on Image Processing and Intelligent Control最新文献

英文中文

Adversarial training-based robust diagnosis method for lumbar disc herniation 基于对抗训练的腰椎间盘突出症鲁棒诊断方法

International Conference on Image Processing and Intelligent Control

Pub Date : 2023-08-09 DOI: 10.1117/12.3001430

Ying Li, Jian Chen, Zhihai Su, Jinjin Hai, Ruoxi Qin, Kai Qiao, Hai Lu, Binghai Yan

Currently, lumbar spine diseases are becoming increasingly young, and with the aging of the population, clinical doctors are facing increasing pressure in detecting lumbar spine diseases. Therefore, an AI-based diagnosis system for lumbar spine diseases using nuclear magnetic images (MRI) has become a sustainable solution for early diagnosis. However, a large amount of work has shown the fragility of neural networks in unseen data distributions. Therefore, this paper proposes an adversarial training-based robust diagnosis method for lumbar disc herniation to address the fragility issue of deep models under specific small perturbations. By enhancing the robustness of the model to specific perturbations through adversarial training, the deep network can correctly classify lumbar spine MRI data with perturbations. The deep network model uses ResNet50, with adversarial examples containing adversarial perturbations added during training, followed by joint training of normal and adversarial examples, and Mixup augmentation from the perspective of data augmentation to further enhance the model's robustness. Through 5-fold cross-validation training, this method was verified to significantly improve the robustness of the model under adversarial perturbations (average recognition accuracy increased from 50.14% to 71.07%), while maintaining high recognition accuracy for normal samples (our method/baseline: 89.14%/89.05%).

目前，腰椎疾病越来越年轻化，随着人口老龄化，临床医生在腰椎疾病的检测上面临越来越大的压力。因此，基于人工智能的腰椎疾病核磁成像(MRI)诊断系统已成为早期诊断的可持续解决方案。然而，大量的研究表明，神经网络在不可见的数据分布中是脆弱的。因此，本文提出了一种基于对抗性训练的腰椎间盘突出症鲁棒诊断方法，以解决深度模型在特定小扰动下的脆弱性问题。通过对抗性训练增强模型对特定扰动的鲁棒性，深度网络可以正确分类具有扰动的腰椎MRI数据。深度网络模型采用ResNet50，在训练过程中加入包含对抗性扰动的对抗样例，然后对正常样例和对抗样例进行联合训练，并从数据增强的角度进行Mixup增强，进一步增强模型的鲁棒性。通过5倍交叉验证训练，验证了该方法在对抗性扰动下显著提高了模型的鲁棒性(平均识别准确率从50.14%提高到71.07%)，同时对正常样本保持了较高的识别准确率(我们的方法/基线:89.14%/89.05%)。

{"title":"Adversarial training-based robust diagnosis method for lumbar disc herniation","authors":"Ying Li, Jian Chen, Zhihai Su, Jinjin Hai, Ruoxi Qin, Kai Qiao, Hai Lu, Binghai Yan","doi":"10.1117/12.3001430","DOIUrl":"https://doi.org/10.1117/12.3001430","url":null,"abstract":"Currently, lumbar spine diseases are becoming increasingly young, and with the aging of the population, clinical doctors are facing increasing pressure in detecting lumbar spine diseases. Therefore, an AI-based diagnosis system for lumbar spine diseases using nuclear magnetic images (MRI) has become a sustainable solution for early diagnosis. However, a large amount of work has shown the fragility of neural networks in unseen data distributions. Therefore, this paper proposes an adversarial training-based robust diagnosis method for lumbar disc herniation to address the fragility issue of deep models under specific small perturbations. By enhancing the robustness of the model to specific perturbations through adversarial training, the deep network can correctly classify lumbar spine MRI data with perturbations. The deep network model uses ResNet50, with adversarial examples containing adversarial perturbations added during training, followed by joint training of normal and adversarial examples, and Mixup augmentation from the perspective of data augmentation to further enhance the model's robustness. Through 5-fold cross-validation training, this method was verified to significantly improve the robustness of the model under adversarial perturbations (average recognition accuracy increased from 50.14% to 71.07%), while maintaining high recognition accuracy for normal samples (our method/baseline: 89.14%/89.05%).","PeriodicalId":210802,"journal":{"name":"International Conference on Image Processing and Intelligent Control","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-08-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128781851","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Review of infrared object detection algorithms for low-light background 低光背景下红外目标检测算法综述

International Conference on Image Processing and Intelligent Control

Pub Date : 2023-08-09 DOI: 10.1117/12.3001327

Jianguo Wei, Y. Qu, Yanbin Ma

At present, object detection algorithm using artificial intelligence technology plays an increasingly important role in the field of computer vision, and plays an extremely important role in such practical application scenarios as automatic driving, urban monitoring, national defense, military and medical assistance. Different from visible light imaging, infrared imaging technology uses detectors to measure the infrared radiation difference between the object itself and the background, overcoming the difficulty of low light intensity and realizing infrared object detection in the low-light scene. In this paper, the traditional infrared object detection algorithm for low light background and infrared object detection algorithm based on deep learning are reviewed, and the current representative classical algorithms are compared, and the characteristics of the model combined with the actual application scenarios are analyzed. Finally, the difficulties and challenges that the current infrared object detection task facing are described, and the research direction of infrared object detection is prospected.

目前，采用人工智能技术的目标检测算法在计算机视觉领域发挥着越来越重要的作用，在自动驾驶、城市监控、国防、军事、医疗救助等实际应用场景中发挥着极其重要的作用。与可见光成像不同，红外成像技术利用探测器测量物体本身与背景之间的红外辐射差，克服了低光强的困难，实现了低光场景下的红外物体检测。本文对传统的低光背景红外目标检测算法和基于深度学习的红外目标检测算法进行了综述，并对目前具有代表性的经典算法进行了比较，并结合实际应用场景分析了模型的特点。最后，阐述了当前红外目标检测任务面临的困难和挑战，并对红外目标检测的研究方向进行了展望。

引用次数: 0

HAU-Net: hybrid attention U-NET for retinal blood vessels image segmentation HAU-Net:用于视网膜血管图像分割的混合注意力U-NET

International Conference on Image Processing and Intelligent Control

Pub Date : 2023-08-09 DOI: 10.1117/12.3000792

Jialin Chen, Chunmei Ma, Y. Li, Shuaikun Fan, Rui Shi, Xi-ping Yan

Accurate semantic segmentation of retinal images is very important for intelligent diagnosis of eye diseases. However, the large number of tiny blood vessels and the uneven distribution of blood vessels in the retina pose many challenges to the segmentation algorithm. In this paper, we propose a Hybrid Attention Fusion U-Net model (HAU-Net) for segmentation of retinal blood vessel images. Specifically, we use the U-NET network as the backbone network, and bridge attention is introduced into the network to improve the efficiency of vessel feature extraction. In addition, we introduce channel attention and spatial attention modules at the bottom of the network, to obtain coarse-to-fine feature representation of retinal vessel images, so as to improve the accuracy of vascular image segmentation. In order to verify the model's performance, we conducted extensive experiments on DRIVE and CHASE_DB1 datasets, and the accuracy reach 97.03% and 97.72%, respectively, which are better than CAR-UNet and MC-UNet.

视网膜图像的准确语义分割对于眼部疾病的智能诊断至关重要。然而，视网膜中细小血管数量多，血管分布不均匀，给分割算法带来了诸多挑战。本文提出了一种用于视网膜血管图像分割的混合注意力融合U-Net模型(HAU-Net)。具体而言，我们采用U-NET网络作为骨干网，并在网络中引入桥式关注，以提高船舶特征提取的效率。此外，我们在网络底部引入通道注意和空间注意模块，获得视网膜血管图像从粗到细的特征表示，从而提高血管图像分割的精度。为了验证模型的性能，我们在DRIVE和CHASE_DB1数据集上进行了大量的实验，准确率分别达到97.03%和97.72%，优于CAR-UNet和MC-UNet。

引用次数: 0

Semantic segmentation of remote sensing image based on U-NET 基于U-NET的遥感图像语义分割

International Conference on Image Processing and Intelligent Control

Pub Date : 2023-08-09 DOI: 10.1117/12.3001440

Li Yao, Simeng Jia, Ziqing Dai

At present, the image processing of remote sensing technology mainly depends on the transcendental ability of human beings, and it needs to spend a lot of artificial resources to mark. Therefore, this paper proposes a research and application of semantic segmentation method for remote sensing images based on convolutional neural network. Normalize the data, subtract the mean value and divide it by the standard deviation to standardize, divide the data, introduce data enhancement to further enhance the training data, and create a convolutional neural network and a training network. Each layer of U-NET is composed of three layers of convolution, and features are extracted and integrated by pooling or up-sampling. At the last layer, all the previously extracted features are classified into two categories to realize the semantic segmentation of the image. The experimental results show that the F1 score, Recall score and Precision score of this method are 84.31%, 89.59% and 79.62%, respectively. By introducing U-NET, the semantic segmentation accuracy of remote sensing images is improved. Compared with the traditional full convolution neural network, U-NET has been improved. Through the stronger connection between layers, plus up-sampling and down-convolution, features can be fully extracted and accurate segmentation can be achieved with fewer training samples.

目前，遥感技术的图像处理主要依靠人类的超越能力，需要花费大量的人工资源进行标记。为此，本文提出了一种基于卷积神经网络的遥感图像语义分割方法的研究与应用。对数据进行归一化，减去均值并除以标准差进行标准化，对数据进行除法，引入数据增强对训练数据进行进一步增强，创建卷积神经网络和训练网络。U-NET的每一层由三层卷积组成，通过池化或上采样的方式提取和整合特征。在最后一层，将之前提取的所有特征分为两类，实现对图像的语义分割。实验结果表明，该方法的F1分、Recall分和Precision分分别为84.31%、89.59%和79.62%。通过引入U-NET，提高了遥感图像的语义分割精度。与传统的全卷积神经网络相比，U-NET进行了改进。通过层间更强的连接，加上上采样和下卷积，可以充分提取特征，用更少的训练样本实现准确的分割。

{"title":"Semantic segmentation of remote sensing image based on U-NET","authors":"Li Yao, Simeng Jia, Ziqing Dai","doi":"10.1117/12.3001440","DOIUrl":"https://doi.org/10.1117/12.3001440","url":null,"abstract":"At present, the image processing of remote sensing technology mainly depends on the transcendental ability of human beings, and it needs to spend a lot of artificial resources to mark. Therefore, this paper proposes a research and application of semantic segmentation method for remote sensing images based on convolutional neural network. Normalize the data, subtract the mean value and divide it by the standard deviation to standardize, divide the data, introduce data enhancement to further enhance the training data, and create a convolutional neural network and a training network. Each layer of U-NET is composed of three layers of convolution, and features are extracted and integrated by pooling or up-sampling. At the last layer, all the previously extracted features are classified into two categories to realize the semantic segmentation of the image. The experimental results show that the F1 score, Recall score and Precision score of this method are 84.31%, 89.59% and 79.62%, respectively. By introducing U-NET, the semantic segmentation accuracy of remote sensing images is improved. Compared with the traditional full convolution neural network, U-NET has been improved. Through the stronger connection between layers, plus up-sampling and down-convolution, features can be fully extracted and accurate segmentation can be achieved with fewer training samples.","PeriodicalId":210802,"journal":{"name":"International Conference on Image Processing and Intelligent Control","volume":"284 ","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-08-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"120870535","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Research on video vibration measurement based on fast two-dimensional empirical mode decomposition and Hilbert transform 基于快速二维经验模态分解和希尔伯特变换的视频振动测量研究

International Conference on Image Processing and Intelligent Control

Pub Date : 2023-08-09 DOI: 10.1117/12.3001022

Honglei Du, Z. Zhong

Against the poor noise immunity of the vibration measurement algorithm based on video phase, this paper proposes an algorithm based on enhanced fast empirical mode decomposition and Hilbert phase-based motion estimation (EFEMD-HPME). EFEMD decomposes the multicomponent video image into a single-component image, and then the local phase information of single component image is extracted through Hilbert transform, which has superior noise immunity compared with the spectral decomposition technique of the band-pass filter. Experiments show that the algorithm proposed in this article has a signal-to-noise ratio improvement of about 30% and a relative error of less than 0.5% compared with the HPME, which is of great significance for improving the robustness of video vibration measurement in general measurement environments.

针对基于视频相位的振动测量算法抗噪性差的问题，提出了一种基于增强快速经验模态分解和Hilbert相位运动估计(EFEMD-HPME)的算法。EFEMD将多分量视频图像分解为单分量图像，然后通过Hilbert变换提取单分量图像的局部相位信息，与带通滤波器的频谱分解技术相比，该方法具有更好的抗噪性。实验表明，与HPME相比，本文算法的信噪比提高了30%左右，相对误差小于0.5%，这对于提高一般测量环境下视频振动测量的鲁棒性具有重要意义。

引用次数: 0

Chinese image description evaluation method based on target domain semantic constraints 基于目标领域语义约束的中文图像描述评价方法

International Conference on Image Processing and Intelligent Control

Pub Date : 2023-08-09 DOI: 10.1117/12.3000808

Zhenhao Wang, Wenyi Sun, Zhengsong Wang, Le Yang

To address the problems of insufficient accuracy and difficulty of application in the current Chinese image description field, this paper proposes an evaluation method based on semantic constraints in the target domain. Unlike previous research, this method acts on the output stage of the model, and based on the extraction of key semantics in the target application domain, it is constrained by the macroscopic semantic space of that domain or by introducing external semantic information from other visual tasks. The experiments show that the proposed method effectively improves the semantic coherence between the model output description sentences and the input images in the target domain, and is helpful for the practical application of image description in specific domains.

针对当前中文图像描述领域准确度不足、应用难度大的问题，提出了一种基于目标域语义约束的评价方法。与以往的研究不同，该方法作用于模型的输出阶段，基于目标应用领域关键语义的提取，不受该领域宏观语义空间或引入其他视觉任务外部语义信息的约束。实验表明，该方法有效地提高了模型输出描述句子与目标域输入图像之间的语义一致性，有助于特定领域图像描述的实际应用。

引用次数: 0

CNN-LSTM-VAE based time series trend prediction 基于CNN-LSTM-VAE的时间序列趋势预测

International Conference on Image Processing and Intelligent Control

Pub Date : 2023-08-09 DOI: 10.1117/12.3000935

Wei Li, Hui Gao, Zeqi Qin

In the context of mobile Internet, time series analysis has become an important way to capture the characteristics of data such as periodicity and correlation. Establishing a temporal sequence analysis model as an effective means to capture data features, for the problems of irregularity, nonlinearity, and inconspicuous feature relationships that commonly occur in sequences. In this paper, we use convolutional neural network to extract the potential features in the sequence, and combine the long and short term memory network to analyze the temporal features in the data; meanwhile, due to the "gate" structure of the long and short term memory network, some noise in the data is introduced into the model for training, resulting in the overfitting problem. -The decode-reconstruction network structure is used to remove this noise and improve the accuracy of the model. In this paper, we use the stock data of CBS as an example and compare it with the existing algorithm model, based on which we demonstrate the higher accuracy of this algorithm with different domain data sets.

在移动互联网环境下，时间序列分析已成为捕捉数据周期性、相关性等特征的重要手段。针对序列中普遍存在的不规则性、非线性、特征关系不明显等问题，建立时序分析模型作为捕获数据特征的有效手段。本文采用卷积神经网络提取序列中的潜在特征，并结合长短期记忆网络分析数据中的时间特征;同时，由于长短期记忆网络的“门”结构，将数据中的一些噪声引入模型进行训练，导致过拟合问题。-采用译码重构网络结构去除该噪声，提高模型的精度。本文以CBS股票数据为例，将其与现有算法模型进行比较，在此基础上证明了该算法在不同领域数据集上具有较高的准确率。

引用次数: 0

A multi-scale branch convolutional neural network for denoising 一种多尺度分支卷积神经网络去噪方法

International Conference on Image Processing and Intelligent Control

Pub Date : 2023-08-09 DOI: 10.1117/12.3000863

Chunyu Wang, Xuesong Su

Images, being significant carriers of memories and information, are valued by people. To restore images, it is necessary to perform noise reduction processing to eliminate noise generated by camera equipment and other factors. Traditional denoising technology such as wavelet transform is used to help engineer restore a image. And in recent years, the introduction of convolutional neural networks has accelerated the progress of noise reduction research. Many classic models have been developed by researchers using U-shaped networks and other techniques. Researchers often use multi-scale approaches to obtain multiple feature maps and enhance their network with these features. Our work enhanced denoising network by introducing large convolutions, small convolutions, and Fast Fourier convolutions to capture feature information at different scales. Additionally, we used an SE block to introduce attention mechanisms into the network. As evidenced by experimental results, our network achieved outstanding performance.

图像作为记忆和信息的重要载体，受到人们的重视。为了恢复图像，需要进行降噪处理，消除相机设备等因素产生的噪声。利用传统的去噪技术，如小波变换，帮助工程师恢复图像。近年来，卷积神经网络的引入加速了降噪研究的进展。研究人员利用u型网络和其他技术开发了许多经典模型。研究人员经常使用多尺度方法来获取多个特征图，并用这些特征来增强网络。我们的工作通过引入大卷积、小卷积和快速傅立叶卷积来增强去噪网络，以捕获不同尺度的特征信息。此外，我们使用SE块将注意力机制引入网络。实验结果表明，我们的网络取得了优异的性能。

引用次数: 0

Design of digital watermarking algorithm based on compression sensing 基于压缩感知的数字水印算法设计

International Conference on Image Processing and Intelligent Control

Pub Date : 2023-08-09 DOI: 10.1117/12.3002802

Yichen Wang, Hu Sheng, Jing Li, Jiating Li, Jinxing Liu

With the rapid development of multimedia technology and the Internet, the security of multimedia data is facing more and more threats. Digital watermarking technology can effectively solve this problem, and compressed sensing theory is widely used in the field of digital watermarking. As the theoretical basis of the new generation of information processing, this theory can effectively solve the problem of multimedia data being attacked. Based on compressed sensing theory, combined with discrete wavelet transform and SVD decomposition, this research realized the embedding and extraction of watermark in the observation domain after image thinning, and tested different forms of watermark attacks such as salt pepper noise, Gaussian noise and filtering on the watermarked image. The experimental data shows that the PSNR value and NC of the algorithm are in line with expectations, and it has good robustness, transparency, and extraction resistance.

随着多媒体技术和互联网的飞速发展，多媒体数据的安全面临越来越多的威胁。数字水印技术可以有效地解决这一问题，压缩感知理论在数字水印领域得到了广泛的应用。该理论作为新一代信息处理的理论基础，可以有效地解决多媒体数据被攻击的问题。基于压缩感知理论，结合离散小波变换和奇异值分解，在图像细化后的观测域中实现水印的嵌入和提取，并对水印图像进行了椒盐噪声、高斯噪声和滤波等不同形式的水印攻击测试。实验数据表明，该算法的PSNR值和NC值符合预期，具有良好的鲁棒性、透明性和抗提取性。

引用次数: 0

Research on ticket recognition platform based on deep learning algorithm 基于深度学习算法的票证识别平台研究

International Conference on Image Processing and Intelligent Control

Pub Date : 2023-08-09 DOI: 10.1117/12.3003840

Gaode Cheng

With the advent of digital age, bill recognition has become a necessary work for many enterprises, institutions and individuals. This paper proposes a ticket recognition platform based on deep learning algorithm, which can automatically recognize various types of tickets and realize fast and accurate processing. The platform uses deep learning algorithms such as convolutional neural network, cyclic neural network and long and short time memory network and combines image processing technology and natural language processing technology to effectively solve the difficult problems in ticket recognition. The experimental results show that the platform has high performance in ticket recognition accuracy and speed and can meet the needs of practical applications.

随着数字时代的到来，票据识别已成为许多企事业单位和个人的一项必要工作。本文提出了一种基于深度学习算法的门票识别平台，可以自动识别各种类型的门票，实现快速准确的处理。该平台采用卷积神经网络、循环神经网络、长短时记忆网络等深度学习算法，结合图像处理技术和自然语言处理技术，有效解决票证识别中的疑难问题。实验结果表明，该平台在票据识别精度和速度方面具有较高的性能，能够满足实际应用的需要。

引用次数: 0

首页上一页

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

International Conference on Image Processing and Intelligent Control

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀