Proceedings of the 2023 2nd Asia Conference on Algorithms, Computing and Machine Learning最新文献

英文中文

Human Activity Recognition based on Transformer in Smart Home 智能家居中基于变压器的人体活动识别

Proceedings of the 2023 2nd Asia Conference on Algorithms, Computing and Machine Learning

Pub Date : 2023-03-17 DOI: 10.1145/3590003.3590100

Xinmei Huang, Shenmin Zhang

With the advancement of artificial intelligence, smart home has attracted much attention from scholars. Human Activity Recognition (HAR) is a crucial foundation for various applications in smart home. In this paper, to improve the accuracy of HAR and promote the development of applications and services in smart home, we propose a Transformer-based approach that integrates multiple sensor sequence inputs for HAR. We integrate sequence features, collect contextual information, and employ Transformer to recognize various activities for the CASAS Aruba dataset that uses environmental sensors. The validation results on real-world dataset demonstrate its effectiveness compared to traditional machine learning and deep learning methods.

随着人工智能的发展，智能家居受到了学者们的广泛关注。人类活动识别(HAR)是智能家居中各种应用的重要基础。在本文中，为了提高HAR的准确性，促进智能家居应用和服务的发展，我们提出了一种基于变压器的方法，该方法集成了多个传感器序列输入。我们集成序列特征，收集上下文信息，并使用Transformer来识别使用环境传感器的CASAS Aruba数据集的各种活动。在实际数据集上的验证结果表明，该方法与传统的机器学习和深度学习方法相比是有效的。

引用次数: 0

Multi-Modal Fusion Object Tracking Based on Fully Convolutional Siamese Network 基于全卷积Siamese网络的多模态融合目标跟踪

Proceedings of the 2023 2nd Asia Conference on Algorithms, Computing and Machine Learning

Pub Date : 2023-03-17 DOI: 10.1145/3590003.3590084

Ke Qi, Liji Chen, Yicong Zhou, Yutao Qi

RGBT tracking incorporates thermal infrared data to achieve more accurate visual tracking. However, the efficiency of RGBT tracking may be diminished by some bottlenecks, such as thermal crossover, illumination variation and occlusion. To address the aforementioned problems, we propose a fully-convolutional Siamese-based Multi-modal Feature Fusion Network (SiamMFF) that integrates RGB and thermal features. In our work, visible and infrared images are initially processed by the Multi-Modal Feature Fusion framework (MFF) at the search and template sides, respectively. Then, the attribute-aware fusion module is introduced to conduct feature extraction and fusion for the major challenge attributes. In particular, we design a skip connections guidance module to prevent the propagation of noise and to enrich the feature information so that we can improve the tracker’s discriminative ability for modality-specific challenges. The proposed SiamMFF method has been evaluated in a great number of trials on two benchmark datasets GTOT and RGBT234, and the precision rate and success rate can reach 90.5%/73.6% and 81.2%/57.3%, respectively, demonstrating the superiority of our method over existing state-of-the-art methods.

RGBT跟踪结合热红外数据，实现更准确的视觉跟踪。然而，由于热交叉、光照变化和遮挡等瓶颈，RGBT跟踪的效率会受到影响。为了解决上述问题，我们提出了一个基于全卷积暹罗的多模态特征融合网络(SiamMFF)，该网络集成了RGB和热特征。在我们的工作中，可见光和红外图像分别在搜索端和模板端由多模态特征融合框架(MFF)进行初始处理。然后，引入属性感知融合模块，对主要挑战属性进行特征提取和融合;特别地，我们设计了一个跳跃连接引导模块，以防止噪声的传播，丰富特征信息，从而提高跟踪器对特定模态挑战的判别能力。本文方法在GTOT和RGBT234两个基准数据集上进行了大量试验，准确率和成功率分别达到90.5%/73.6%和81.2%/57.3%，证明了本文方法相对于现有先进方法的优越性。

{"title":"Multi-Modal Fusion Object Tracking Based on Fully Convolutional Siamese Network","authors":"Ke Qi, Liji Chen, Yicong Zhou, Yutao Qi","doi":"10.1145/3590003.3590084","DOIUrl":"https://doi.org/10.1145/3590003.3590084","url":null,"abstract":"RGBT tracking incorporates thermal infrared data to achieve more accurate visual tracking. However, the efficiency of RGBT tracking may be diminished by some bottlenecks, such as thermal crossover, illumination variation and occlusion. To address the aforementioned problems, we propose a fully-convolutional Siamese-based Multi-modal Feature Fusion Network (SiamMFF) that integrates RGB and thermal features. In our work, visible and infrared images are initially processed by the Multi-Modal Feature Fusion framework (MFF) at the search and template sides, respectively. Then, the attribute-aware fusion module is introduced to conduct feature extraction and fusion for the major challenge attributes. In particular, we design a skip connections guidance module to prevent the propagation of noise and to enrich the feature information so that we can improve the tracker’s discriminative ability for modality-specific challenges. The proposed SiamMFF method has been evaluated in a great number of trials on two benchmark datasets GTOT and RGBT234, and the precision rate and success rate can reach 90.5%/73.6% and 81.2%/57.3%, respectively, demonstrating the superiority of our method over existing state-of-the-art methods.","PeriodicalId":340225,"journal":{"name":"Proceedings of the 2023 2nd Asia Conference on Algorithms, Computing and Machine Learning","volume":"74 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-03-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121945837","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Genetic algorithm in hopfield neural network with probabilistic 2 satisfiability 概率可满足hopfield神经网络的遗传算法

Proceedings of the 2023 2nd Asia Conference on Algorithms, Computing and Machine Learning

Pub Date : 2023-03-17 DOI: 10.1145/3590003.3590024

Ju Chen, Chengfeng Zheng, Yuan Gao, Yueling Guo

Genetic Algorithm (GA) is to convert the problem-solving process into a process similar to the chromosomal changes in biological evolution using the mathematical method and computer simulation operation. This meta-heuristic algorithm has been successfully applied to system logic and non-system logic programming. In this study, we will explore the role of the Bipolar Genetic Algorithm (GA) in enhancing the learning process of the Hopfield neural network based on the previous study of PRO2SAT, and generate global solutions of the Probabilistic 2 Satisfiability model. The main purpose of the learning phase of the PRO2SAT model is to obtain consistent interpretations and calculate the optimal prominence weights, and the GA algorithm is introduced to improve the ability of PRO2SAT to obtain consistent interpretation using its selection, crossover, and mutation operators, and thus to improve the ability of the logic programming model to get a global solution. In the experimental phase, simulation data are used for result testing, and three performance metrics are used to test the consistency interpretation and global solution acquisition ability of the proposed model, including mean absolute error, logic formula satisfaction ratio, and global minimum ratio. Experimental results show that GA, as a meta-heuristic algorithm, has better searching ability for optimal solution and can effectively assist logic programming.

遗传算法(Genetic Algorithm, GA)是利用数学方法和计算机模拟运算，将问题解决过程转化为类似生物进化中染色体变化的过程。该元启发式算法已成功应用于系统逻辑和非系统逻辑编程。在本研究中，我们将在前人对PRO2SAT研究的基础上，探索双极遗传算法(Bipolar Genetic Algorithm, GA)在增强Hopfield神经网络学习过程中的作用，并生成Probabilistic 2 Satisfiability模型的全局解。PRO2SAT模型学习阶段的主要目的是获得一致解释和计算最优突出权值，并引入GA算法，利用其选择、交叉和变异算子提高PRO2SAT获得一致解释的能力，从而提高逻辑规划模型获得全局解的能力。在实验阶段，利用仿真数据对结果进行检验，并利用平均绝对误差、逻辑公式满意度和全局最小比三个性能指标对所提模型的一致性解释和全局解获取能力进行检验。实验结果表明，遗传算法作为一种元启发式算法，具有较好的最优解搜索能力，能够有效地辅助逻辑规划。

{"title":"Genetic algorithm in hopfield neural network with probabilistic 2 satisfiability","authors":"Ju Chen, Chengfeng Zheng, Yuan Gao, Yueling Guo","doi":"10.1145/3590003.3590024","DOIUrl":"https://doi.org/10.1145/3590003.3590024","url":null,"abstract":"Genetic Algorithm (GA) is to convert the problem-solving process into a process similar to the chromosomal changes in biological evolution using the mathematical method and computer simulation operation. This meta-heuristic algorithm has been successfully applied to system logic and non-system logic programming. In this study, we will explore the role of the Bipolar Genetic Algorithm (GA) in enhancing the learning process of the Hopfield neural network based on the previous study of PRO2SAT, and generate global solutions of the Probabilistic 2 Satisfiability model. The main purpose of the learning phase of the PRO2SAT model is to obtain consistent interpretations and calculate the optimal prominence weights, and the GA algorithm is introduced to improve the ability of PRO2SAT to obtain consistent interpretation using its selection, crossover, and mutation operators, and thus to improve the ability of the logic programming model to get a global solution. In the experimental phase, simulation data are used for result testing, and three performance metrics are used to test the consistency interpretation and global solution acquisition ability of the proposed model, including mean absolute error, logic formula satisfaction ratio, and global minimum ratio. Experimental results show that GA, as a meta-heuristic algorithm, has better searching ability for optimal solution and can effectively assist logic programming.","PeriodicalId":340225,"journal":{"name":"Proceedings of the 2023 2nd Asia Conference on Algorithms, Computing and Machine Learning","volume":"3 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-03-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127231838","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Elastic Detection Mechanism Aimed at Hybrid DDoS Attack 针对混合DDoS攻击的弹性检测机制

Proceedings of the 2023 2nd Asia Conference on Algorithms, Computing and Machine Learning

Pub Date : 2023-03-17 DOI: 10.1145/3590003.3590031

Yubo Wang, Jinyu Wang

In Distributed Denial of Service(DDoS) attack, the attacker uses a remotely controlled botnet to attack the target server at the same time to prevent legitimate users from obtaining information services. Previous studies focused on the detection of DDoS attacks on offline datasets, but ignored the detection of specific DDoS types, and some reports showed that the number of DDoS hybrid attacks was increasing significantly. In this paper, we propose an elastic detection mechanism(EDM), which can economize the server’s idle computing power. The framework integrates a variety of pre-trained lightweight CNN detect models, which are suitable for online rapid detection of DDoS hybrid attacks. We focus on evaluating the response accuracy and the detection speed of the EDM. The experimental results show that the model can achieve excellent hybrid attack detection performance, and meet the actual requirements of real-time detection.

DDoS (Distributed Denial of Service)攻击是指攻击者利用远程控制的僵尸网络，在攻击目标服务器的同时，阻止合法用户获取信息服务。以往的研究主要关注对离线数据集的DDoS攻击检测，而忽略了对具体DDoS类型的检测，一些报告显示，DDoS混合攻击的数量正在显著增加。在本文中，我们提出了一种弹性检测机制(EDM)，可以节省服务器的空闲计算能力。该框架集成了多种预训练的轻量级CNN检测模型，适用于在线快速检测DDoS混合攻击。重点对电火花加工的响应精度和检测速度进行了评价。实验结果表明，该模型能够取得优异的混合攻击检测性能，满足实时检测的实际要求。

引用次数: 0

Research on Colorization of Qinghai Farmer Painting Image Based on Generative Adversarial Networks 基于生成对抗网络的青海农民画图像着色研究

Proceedings of the 2023 2nd Asia Conference on Algorithms, Computing and Machine Learning

Pub Date : 2023-03-17 DOI: 10.1145/3590003.3590094

Chunyan Peng, Xueya Zhao, Guangyou Xia

At present, deep learning method is widely used in the field of gray image colorization. Qinghai farmer painting has distinct national characteristics. The farmer painting has bright colors, high saturation, chaotic color distribution and low color contrast, so it is difficult to restore the image color with high fidelity by using the general deep learning image colorization method. The Pix2Pix generation adversarial network of grayscale image colorization method uses the Leaky ReLU function as the activation function. The proposal algorithm replaces the maximum pooling layer with the convolution layer to retain more image feature information and further to improve the color simulation effect. Meanwhile, in view of the lack of relevant Qinghai farmer painting data set, the data set of Qinghai farmer paintings is constructed to meet the needs of network training. The experimental results show that the improved method further improves the color effect and can generate high quality color images of Qinghai farmer paintings with more real color distribution.

目前，深度学习方法被广泛应用于灰度图像着色领域。青海农民画具有鲜明的民族特色。农民画色彩鲜艳，饱和度高，色彩分布混乱，色彩对比度低，因此使用一般的深度学习图像着色方法很难还原出高保真的图像颜色。Pix2Pix生成对抗网络的灰度图像着色方法使用Leaky ReLU函数作为激活函数。该算法将最大池化层替换为卷积层，保留了更多的图像特征信息，进一步提高了色彩模拟效果。同时，针对青海农民画相关数据集的缺乏，构建了青海农民画数据集，以满足网络培训的需要。实验结果表明，改进后的方法进一步提高了色彩效果，能够生成色彩分布更真实的青海农民画高质量彩色图像。

引用次数: 0

Performance Evaluation of an Extradosed Cable-Stayed Bridge with Corrugated Web based on Machine Learning Algorithms 基于机器学习算法的波纹腹板斜拉桥性能评价

Proceedings of the 2023 2nd Asia Conference on Algorithms, Computing and Machine Learning

Pub Date : 2023-03-17 DOI: 10.1145/3590003.3590050

Zeyu Du, Zhenhua Pan, Z. Xiong, Lei He, Haipeng Wang, Houda Zhu, Jiangbo Wang

Corrugated steel web is suitable for large-span extradosed cable-stayed bridge's design scheme. Live Load Structural Index (LLSI) is applied to evaluate the performance of the bridge with corrugated steel web. Parametric numeric models were built and investigated to explore the web height and weight's effect on the structural performance of an extradosed cable-stayed bridge. Machine learning model involving Particle Swarm Optimization BP neural network has been constructed to predict the correlation and validate the relationship between the structural variable and live load structural index.

波纹钢腹板适用于大跨度斜拉桥的设计方案。采用活载结构指标(LLSI)对波形钢腹板桥梁的受力性能进行了评价。为探讨腹板高度和重量对斜拉桥结构性能的影响，建立了参数化数值模型。构建了基于粒子群优化BP神经网络的机器学习模型，对结构变量与活载结构指标之间的相关性进行预测和验证。

引用次数: 1

An Unmanned Lane Detection Algorithm Using Deep Learning and Ordered Test Sets Strategy 基于深度学习和有序测试集策略的无人驾驶车道检测算法

Proceedings of the 2023 2nd Asia Conference on Algorithms, Computing and Machine Learning

Pub Date : 2023-03-17 DOI: 10.1145/3590003.3590087

Shenwei Zhang, Xiaoyan Lin, Mingwei Zhang, Zhen Zhang, Yun Hou, Honglong Ning, Tian Qiu

The traditional method of automatic lane detection is mostly based on Hough detection. However, this category of methods has low robustness and is vulnerable to interference. In order to improve the accuracy of lane detection, the presented paper compares and analyzes the end-to-end lane line detection network based on deep learning, including Unet-base and Deeplabv3+, in view of gradient explosion and slow running speed during model training, solutions are also given. Ordered test sets are used to speed up the training processing and validate the deep learning algorithm, in the case of different image resolutions, uses Unet-base and Deeplabv3+ to perform experiments respectively. Experiments show that under the same resolution, the Unet-base model with FCN network structure incorporating a better training strategy outperforms the Deeplabv3+ algorithm model that uses a classical ASSP module to solve the downsampling layer problem in terms of model generalization capability. And the MIOU of improved Unet-base is higher than Deeplabv3+. Therefore, compared to DeepLabv3+, the improved Unet-base model is more generalized.

传统的车道自动检测方法多基于霍夫检测。然而，这类方法鲁棒性较低，容易受到干扰。为了提高车道检测的准确性，针对模型训练过程中出现的梯度爆炸和运行速度慢等问题，本文对基于深度学习的端到端车道线检测网络Unet-base和Deeplabv3+进行了比较分析，并给出了解决方案。使用有序测试集加快训练处理，验证深度学习算法，在不同图像分辨率的情况下，分别使用Unet-base和Deeplabv3+进行实验。实验表明，在相同分辨率下，采用更好训练策略的基于unet的FCN网络结构模型在模型泛化能力上优于使用经典ASSP模块解决下采样层问题的Deeplabv3+算法模型。改进后的Unet-base的MIOU高于Deeplabv3+。因此，与DeepLabv3+相比，改进的unet基模型更具有泛化性。

{"title":"An Unmanned Lane Detection Algorithm Using Deep Learning and Ordered Test Sets Strategy","authors":"Shenwei Zhang, Xiaoyan Lin, Mingwei Zhang, Zhen Zhang, Yun Hou, Honglong Ning, Tian Qiu","doi":"10.1145/3590003.3590087","DOIUrl":"https://doi.org/10.1145/3590003.3590087","url":null,"abstract":"The traditional method of automatic lane detection is mostly based on Hough detection. However, this category of methods has low robustness and is vulnerable to interference. In order to improve the accuracy of lane detection, the presented paper compares and analyzes the end-to-end lane line detection network based on deep learning, including Unet-base and Deeplabv3+, in view of gradient explosion and slow running speed during model training, solutions are also given. Ordered test sets are used to speed up the training processing and validate the deep learning algorithm, in the case of different image resolutions, uses Unet-base and Deeplabv3+ to perform experiments respectively. Experiments show that under the same resolution, the Unet-base model with FCN network structure incorporating a better training strategy outperforms the Deeplabv3+ algorithm model that uses a classical ASSP module to solve the downsampling layer problem in terms of model generalization capability. And the MIOU of improved Unet-base is higher than Deeplabv3+. Therefore, compared to DeepLabv3+, the improved Unet-base model is more generalized.","PeriodicalId":340225,"journal":{"name":"Proceedings of the 2023 2nd Asia Conference on Algorithms, Computing and Machine Learning","volume":"14 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-03-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115123263","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

An Interpretable Brain Network Atlas-Based Hybrid Model for Mild Cognitive Impairment Progression Prediction 基于可解释脑网络图谱的轻度认知障碍进展预测混合模型

Proceedings of the 2023 2nd Asia Conference on Algorithms, Computing and Machine Learning

Pub Date : 2023-03-17 DOI: 10.1145/3590003.3590081

Xianglong Guan, Li Ma, Yunyou Huang, Suqin Tang, Tinghui Li

The process of Alzheimer’s disease (AD) is irreversible, but reasonable medical intervention for preclinical AD can delay AD’s onset. Progressive mild cognitive impairment (pMCI) is the most critical stage for AD preclinical intervention. Therefore, accurate identification of pMCI will significantly improve patient benefits. Functional MRI is a neuroimaging modality that has been widely utilized to study brain activity related to AD. However, it is challenging to obtain functional MRI data, and a small amount of data will easily lead to the overfitting of the identification model. In addition, the current pMCI identification model lack interpretability leads to difficulty in acceptance by clinicians. In this work, we propose an interpretable hybrid model based on a brain network atlas to identify pMCI subjects. First, the hybrid model utilizes multi-layer perceptron to obtain categorical global features to help graph neural networks reduce overfitting. Second, the attention mechanism is introduced into the model to explain the recognition behavior of the model. The results show that our model outperforms the comparison models on multiple metrics.

阿尔茨海默病(AD)的发病过程是不可逆的，但对临床前AD进行合理的医学干预可以延缓AD的发病。进行性轻度认知障碍(pMCI)是AD临床前干预的最关键阶段。因此，准确识别pMCI将显著提高患者获益。功能MRI是一种神经成像技术，已广泛用于研究与AD相关的脑活动。然而，功能性MRI数据的获取具有挑战性，数据量少容易导致识别模型的过拟合。此外，目前的pMCI识别模型缺乏可解释性，导致临床医生难以接受。在这项工作中，我们提出了一个基于脑网络图谱的可解释混合模型来识别pMCI受试者。首先，混合模型利用多层感知器获取分类全局特征，帮助图神经网络减少过拟合。其次，在模型中引入注意机制来解释模型的识别行为。结果表明，我们的模型在多个指标上优于比较模型。

{"title":"An Interpretable Brain Network Atlas-Based Hybrid Model for Mild Cognitive Impairment Progression Prediction","authors":"Xianglong Guan, Li Ma, Yunyou Huang, Suqin Tang, Tinghui Li","doi":"10.1145/3590003.3590081","DOIUrl":"https://doi.org/10.1145/3590003.3590081","url":null,"abstract":"The process of Alzheimer’s disease (AD) is irreversible, but reasonable medical intervention for preclinical AD can delay AD’s onset. Progressive mild cognitive impairment (pMCI) is the most critical stage for AD preclinical intervention. Therefore, accurate identification of pMCI will significantly improve patient benefits. Functional MRI is a neuroimaging modality that has been widely utilized to study brain activity related to AD. However, it is challenging to obtain functional MRI data, and a small amount of data will easily lead to the overfitting of the identification model. In addition, the current pMCI identification model lack interpretability leads to difficulty in acceptance by clinicians. In this work, we propose an interpretable hybrid model based on a brain network atlas to identify pMCI subjects. First, the hybrid model utilizes multi-layer perceptron to obtain categorical global features to help graph neural networks reduce overfitting. Second, the attention mechanism is introduced into the model to explain the recognition behavior of the model. The results show that our model outperforms the comparison models on multiple metrics.","PeriodicalId":340225,"journal":{"name":"Proceedings of the 2023 2nd Asia Conference on Algorithms, Computing and Machine Learning","volume":"4577 1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-03-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114071189","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Haze video image Clarity Processing Based on Optical Flow Threshold 基于光流阈值的雾霾视频图像清晰度处理

Proceedings of the 2023 2nd Asia Conference on Algorithms, Computing and Machine Learning

Pub Date : 2023-03-17 DOI: 10.1145/3590003.3590075

Ru Chen, Xijuan Wang

In view of the problem of haze weather on the visual effect of video image, which causes the picture distortion, image quality degradation and definition blur of video image, a defogging processing method of haze video image based on optical flow threshold is proposed so as to restore the real and natural color image. Firstly, extract the image of the t frame at time t, track the characteristics of the image at time t + 1 to time t + n, extract the image of the t+n frame, then calculate the optical flow values of the t frame and the t + n frame, make a difference between the obtained optical flow values to obtain the optical flow threshold, compare the obtained optical flow threshold with the given threshold, if the value is greater than or equal to the given threshold, take the optical flow threshold intermediate frame image, and the middle frame and t+n frame images are processed by Retinex algorithm, and this operation is performed iteratively. Finally, the processed single frame video sequence is merged into a whole and output. The experiment shows that the processing speed of the algorithm is 0.07, much lower than other processing methods, which verifies the effectiveness and innovativeness of the proposed algorithm.

针对雾霾天气对视频图像视觉效果造成的图像失真、图像质量下降、视频图像清晰度模糊等问题，提出了一种基于光流阈值的雾霾视频图像去雾处理方法，以恢复真实自然的彩色图像。首先,提取图像的帧在时间t, t跟踪图像的特征在时间t + 1 t + n,提取图像的t + n帧,然后计算出的光流值t和t + n帧,区别对待获得的光流值获取光学流阈值,比较了光流阈值与给定的阈值,如果该值大于或等于给定的阈值,取光流阈值中间帧图像，采用Retinex算法对中间帧和t+n帧图像进行处理，该操作迭代进行。最后将处理后的单帧视频序列合并成一个整体输出。实验表明，该算法的处理速度为0.07，远低于其他处理方法，验证了所提算法的有效性和创新性。

{"title":"Haze video image Clarity Processing Based on Optical Flow Threshold","authors":"Ru Chen, Xijuan Wang","doi":"10.1145/3590003.3590075","DOIUrl":"https://doi.org/10.1145/3590003.3590075","url":null,"abstract":"In view of the problem of haze weather on the visual effect of video image, which causes the picture distortion, image quality degradation and definition blur of video image, a defogging processing method of haze video image based on optical flow threshold is proposed so as to restore the real and natural color image. Firstly, extract the image of the t frame at time t, track the characteristics of the image at time t + 1 to time t + n, extract the image of the t+n frame, then calculate the optical flow values of the t frame and the t + n frame, make a difference between the obtained optical flow values to obtain the optical flow threshold, compare the obtained optical flow threshold with the given threshold, if the value is greater than or equal to the given threshold, take the optical flow threshold intermediate frame image, and the middle frame and t+n frame images are processed by Retinex algorithm, and this operation is performed iteratively. Finally, the processed single frame video sequence is merged into a whole and output. The experiment shows that the processing speed of the algorithm is 0.07, much lower than other processing methods, which verifies the effectiveness and innovativeness of the proposed algorithm.","PeriodicalId":340225,"journal":{"name":"Proceedings of the 2023 2nd Asia Conference on Algorithms, Computing and Machine Learning","volume":"208 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-03-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121196829","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

CBAM-based Method in YOLOv7 for Detecting Defective Vacuum Glass Tubes 基于cbam的YOLOv7真空玻璃管缺陷检测方法

Proceedings of the 2023 2nd Asia Conference on Algorithms, Computing and Machine Learning

Pub Date : 2023-03-17 DOI: 10.1145/3590003.3590079

Zeyu Sheng, Haiguang Chen, Zifeng Qi

The vacuum glass tube is one of the most important materials in the physical industry, and the inspection rate of its production is crucial to the production of subsequent products. We propose a CBAM-based target detection method for YOLOv7 to detect defects in transparent glass tubes, which are not easily detectable due to their transparent walls. We replace all pooling layers in YOLOv7 with CBAM to enable it to better grasp target features. The experimental results show that the recall rate for defective product detection reaches 98.34% and the accuracy rate reaches 96.33% in the simulated industrial inspection environment. It can meet the accuracy requirements of detecting defects of transparent glass tubes in industrial sites.

真空玻璃管是物理工业中最重要的材料之一，其生产的检验率对后续产品的生产至关重要。我们提出了一种基于cbam的YOLOv7靶检测方法，用于检测透明玻璃管中由于壁透明而不易检测的缺陷。我们将YOLOv7中的所有池化层替换为CBAM，使其能够更好地把握目标特征。实验结果表明，在模拟工业检测环境下，缺陷产品检测的召回率达到98.34%，准确率达到96.33%。可满足工业现场对透明玻璃管缺陷检测的精度要求。

引用次数: 1

首页上一页

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

Proceedings of the 2023 2nd Asia Conference on Algorithms, Computing and Machine Learning

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀