首页 > 最新文献

International Conference on Algorithm, Imaging Processing and Machine Vision (AIPMV 2023)最新文献

英文 中文
Research on automatic identification algorithm of invoice information 发票信息自动识别算法研究
Liangyu Jiao, Hui Li
The invoice reimbursement process is very cumbersome and requires manual entry of key information in the invoice, which wastes a lot of manpower and time. Therefore, it is particularly important to design an algorithm for intelligent identification of invoice information. Traditional algorithms can identify information from scanned invoice images. However, since in our country, most of the invoice information is Chinese characters, the current recognition algorithm has a certain degree of difficulty in identifying Chinese characters, and garbled characters will appear. Therefore, this article combines the CTPN text detection algorithm with the DesNets text recognition algorithm, and uses this algorithm to detect and recognize text on the information extracted from the invoice area image. Experiments show that the model outperforms the comparison model, with a recognition accuracy of up to 99.79%.
发票报销流程非常繁琐,需要人工录入发票中的关键信息,浪费了大量的人力和时间。因此,设计一种智能识别发票信息的算法尤为重要。传统算法可以从扫描的发票图像中识别信息。但由于在我国,大部分发票信息都是汉字,目前的识别算法在识别汉字时存在一定难度,会出现乱码。因此,本文将 CTPN 文本检测算法与 DesNets 文本识别算法相结合,利用该算法对从发票区域图像中提取的信息进行文本检测和识别。实验表明,该模型优于对比模型,识别准确率高达 99.79%。
{"title":"Research on automatic identification algorithm of invoice information","authors":"Liangyu Jiao, Hui Li","doi":"10.1117/12.3014488","DOIUrl":"https://doi.org/10.1117/12.3014488","url":null,"abstract":"The invoice reimbursement process is very cumbersome and requires manual entry of key information in the invoice, which wastes a lot of manpower and time. Therefore, it is particularly important to design an algorithm for intelligent identification of invoice information. Traditional algorithms can identify information from scanned invoice images. However, since in our country, most of the invoice information is Chinese characters, the current recognition algorithm has a certain degree of difficulty in identifying Chinese characters, and garbled characters will appear. Therefore, this article combines the CTPN text detection algorithm with the DesNets text recognition algorithm, and uses this algorithm to detect and recognize text on the information extracted from the invoice area image. Experiments show that the model outperforms the comparison model, with a recognition accuracy of up to 99.79%.","PeriodicalId":516634,"journal":{"name":"International Conference on Algorithm, Imaging Processing and Machine Vision (AIPMV 2023)","volume":"25 3","pages":"129690U - 129690U-6"},"PeriodicalIF":0.0,"publicationDate":"2024-01-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140511993","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A music generation model based on Bi-LSTM 基于 Bi-LSTM 的音乐生成模型
yong bai
The unidirectional LSTM based music generation model does not take into account the influence of future information when generating music. It solely focuses on learning the dependencies of the current moment on past information, resulting in music with poor stability and subpar quality. To address this issue, we have developed a music generation model based on bidirectional LSTM. During the training phase, this model effectively captures musical information from both past and future time steps, resulting in a probability distribution of musical elements that closely approximates real-world music. This, in turn, leads to enhanced structural stability and improved music quality in the generated compositions. Finally, we conducted validation experiments on our proposed approach, and the results unequivocally demonstrate its effectiveness.
基于单向 LSTM 的音乐生成模型在生成音乐时没有考虑未来信息的影响。它只专注于学习当前时刻对过去信息的依赖性,导致音乐稳定性差、质量不高。针对这一问题,我们开发了一种基于双向 LSTM 的音乐生成模型。在训练阶段,该模型能有效捕捉来自过去和未来时间步骤的音乐信息,从而产生与真实世界音乐非常接近的音乐元素概率分布。这反过来又增强了生成作品的结构稳定性,提高了音乐质量。最后,我们对所提出的方法进行了验证实验,结果明确证明了该方法的有效性。
{"title":"A music generation model based on Bi-LSTM","authors":"yong bai","doi":"10.1117/12.3014368","DOIUrl":"https://doi.org/10.1117/12.3014368","url":null,"abstract":"The unidirectional LSTM based music generation model does not take into account the influence of future information when generating music. It solely focuses on learning the dependencies of the current moment on past information, resulting in music with poor stability and subpar quality. To address this issue, we have developed a music generation model based on bidirectional LSTM. During the training phase, this model effectively captures musical information from both past and future time steps, resulting in a probability distribution of musical elements that closely approximates real-world music. This, in turn, leads to enhanced structural stability and improved music quality in the generated compositions. Finally, we conducted validation experiments on our proposed approach, and the results unequivocally demonstrate its effectiveness.","PeriodicalId":516634,"journal":{"name":"International Conference on Algorithm, Imaging Processing and Machine Vision (AIPMV 2023)","volume":"100 3","pages":"129691T - 129691T-6"},"PeriodicalIF":0.0,"publicationDate":"2024-01-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140511732","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Application analysis of three-dimensional laser scanning technology in the protection of dong drum tower in Sanjiang county 三维激光扫描技术在三江县东鼓楼保护中的应用分析
Bin Huang
The three-dimensional laser scanning technology is a non-contact measurement of objects by using a three-dimensional scanner. It is a new surveying and mapping technology, also known as real-life replication technology. This technology can completely reconstruct the surface of the scanned object through the 360-degree rotation of the laser emitter. The accuracy of point cloud data is very high. Each three-dimensional data in the laser point cloud is the real three-dimensional data X, Y, and Z coordinates of the building target, which makes the post-processing data true and reliable, and its accuracy reaches 3mm ~ 6mm. The working principle of this technology is through reverse three-dimensional data acquisition and model reconstruction, on-site photography, texture mapping for the later period, can truly achieve 1 : 1 physical restoration, and then quickly reconstruct the three-dimensional model of the building, so as to obtain the building point, line, surface, body and other mapping data. Three-dimensional laser scanning technology has played a huge advantage in the reconstruction and protection of wooden structure drum tower.
三维激光扫描技术是利用三维扫描仪对物体进行非接触式测量。它是一种新的测绘技术,也被称为实景复制技术。该技术通过激光发射器的 360 度旋转,可以完全重建被扫描物体的表面。点云数据的精度非常高。激光点云中的每个三维数据都是建筑目标的真实三维数据 X、Y、Z 坐标,使得后处理数据真实可靠,精度达到 3mm ~ 6mm。该技术的工作原理是通过逆向三维数据采集和模型重建,对后期进行现场摄影、纹理测绘,可真正实现 1 : 1 实物还原,进而快速重建建筑物的三维模型,从而获得建筑物的点、线、面、体等测绘数据。三维激光扫描技术在木结构鼓楼的重建和保护中发挥了巨大的优势。
{"title":"Application analysis of three-dimensional laser scanning technology in the protection of dong drum tower in Sanjiang county","authors":"Bin Huang","doi":"10.1117/12.3014534","DOIUrl":"https://doi.org/10.1117/12.3014534","url":null,"abstract":"The three-dimensional laser scanning technology is a non-contact measurement of objects by using a three-dimensional scanner. It is a new surveying and mapping technology, also known as real-life replication technology. This technology can completely reconstruct the surface of the scanned object through the 360-degree rotation of the laser emitter. The accuracy of point cloud data is very high. Each three-dimensional data in the laser point cloud is the real three-dimensional data X, Y, and Z coordinates of the building target, which makes the post-processing data true and reliable, and its accuracy reaches 3mm ~ 6mm. The working principle of this technology is through reverse three-dimensional data acquisition and model reconstruction, on-site photography, texture mapping for the later period, can truly achieve 1 : 1 physical restoration, and then quickly reconstruct the three-dimensional model of the building, so as to obtain the building point, line, surface, body and other mapping data. Three-dimensional laser scanning technology has played a huge advantage in the reconstruction and protection of wooden structure drum tower.","PeriodicalId":516634,"journal":{"name":"International Conference on Algorithm, Imaging Processing and Machine Vision (AIPMV 2023)","volume":" 32","pages":"129691S - 129691S-8"},"PeriodicalIF":0.0,"publicationDate":"2024-01-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139640393","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Coal gangue sorting based on deep learning 基于深度学习的煤矸石分拣
Panliang Yang, Bin Zhu, Lianquan Ji, Peng Nie
Coal gangue sorting is an important link in the process of coal mining and processing, which can effectively reduce the difficulty and cost of coal post-processing. Aiming at the problems of complicated sorting process and low sorting efficiency of coal gangue, a coal gangue sorting method based on deep learning was proposed. The method is based on the YOLO v7 deep learning algorithm, and it achieves real-time detection of coal gangue by creating a coal gangue dataset and training the detection model. By constructing a coal gangue sorting platform, the capture of target gangue has been achieved. The experimental results show that the mAP of YOLO v7 model is 96.70%, and the detection speed is 69fps, which has significant advantages compared to YOLO v5, SSD and Faster RCNN algorithms.
煤矸石分选是煤炭开采和加工过程中的重要环节,可有效降低煤炭后处理的难度和成本。针对煤矸石分选过程复杂、分选效率低等问题,提出了一种基于深度学习的煤矸石分选方法。该方法基于 YOLO v7 深度学习算法,通过创建煤矸石数据集和训练检测模型,实现了煤矸石的实时检测。通过构建煤矸石分拣平台,实现了对目标煤矸石的捕捉。实验结果表明,YOLO v7 模型的 mAP 为 96.70%,检测速度为 69fps,与 YOLO v5、SSD 和 Faster RCNN 算法相比具有显著优势。
{"title":"Coal gangue sorting based on deep learning","authors":"Panliang Yang, Bin Zhu, Lianquan Ji, Peng Nie","doi":"10.1117/12.3014357","DOIUrl":"https://doi.org/10.1117/12.3014357","url":null,"abstract":"Coal gangue sorting is an important link in the process of coal mining and processing, which can effectively reduce the difficulty and cost of coal post-processing. Aiming at the problems of complicated sorting process and low sorting efficiency of coal gangue, a coal gangue sorting method based on deep learning was proposed. The method is based on the YOLO v7 deep learning algorithm, and it achieves real-time detection of coal gangue by creating a coal gangue dataset and training the detection model. By constructing a coal gangue sorting platform, the capture of target gangue has been achieved. The experimental results show that the mAP of YOLO v7 model is 96.70%, and the detection speed is 69fps, which has significant advantages compared to YOLO v5, SSD and Faster RCNN algorithms.","PeriodicalId":516634,"journal":{"name":"International Conference on Algorithm, Imaging Processing and Machine Vision (AIPMV 2023)","volume":" 37","pages":"1296913 - 1296913-7"},"PeriodicalIF":0.0,"publicationDate":"2024-01-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139640512","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Anti aliasing algorithm based on sub aperture stitching 基于子光圈拼接的抗混叠算法
Dekun Li
High resolution synthetic aperture radar system will produce aliasing problem in the sliding spotlight mode, we propose a solution based on sub-apertures splice to solving the program, the simulation and analysis of airborne mode are given. The method use the sub-aperture division, respectively, each aperture frequency shift, scaling, compression distance, azimuth compression performed prior to stitching, the data will be spliced after azimuth compression and windowing. Simulation results show that the algorithm can effectively solve the problem.
高分辨率合成孔径雷达系统在滑动聚光模式下会产生混叠问题,我们提出了一种基于子孔径拼接的求解方案,并给出了机载模式下的仿真分析。该方法利用子孔径划分,分别对每个孔径进行移频、缩放、压缩距离、方位角压缩后再进行拼接,将拼接后的数据进行方位角压缩和开窗。仿真结果表明,该算法能有效解决问题。
{"title":"Anti aliasing algorithm based on sub aperture stitching","authors":"Dekun Li","doi":"10.1117/12.3014409","DOIUrl":"https://doi.org/10.1117/12.3014409","url":null,"abstract":"High resolution synthetic aperture radar system will produce aliasing problem in the sliding spotlight mode, we propose a solution based on sub-apertures splice to solving the program, the simulation and analysis of airborne mode are given. The method use the sub-aperture division, respectively, each aperture frequency shift, scaling, compression distance, azimuth compression performed prior to stitching, the data will be spliced after azimuth compression and windowing. Simulation results show that the algorithm can effectively solve the problem.","PeriodicalId":516634,"journal":{"name":"International Conference on Algorithm, Imaging Processing and Machine Vision (AIPMV 2023)","volume":"55 1","pages":"1296906 - 1296906-6"},"PeriodicalIF":0.0,"publicationDate":"2024-01-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140511588","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Improved YOLOv5s recognition of cotton top buds with fusion of attention and feature weighting 融合注意力和特征权重,提高 YOLOv5s 对棉花顶芽的识别能力
Lei Yin, Jian Wu, Qikong Liu, Wenxiong Wu
In order to improve the accuracy and real-time performance of cotton top bud recognition, an improved YOLOv5s target real-time detection model is proposed. First, the SE module and the CBAM module in the attention mechanism are added to optimize the weight ratio of channel attention and spatial attention to improve accuracy; then the BiFPN structure of bidirectional weighted features is introduced to strengthen the fusion between high-level features and low-level features; finally, a new bounding box regression loss function EIoU is used for ablation experiments, and more position information of cotton buds can be obtained by reducing the bounding box loss. The experimental results show that, by applying the improved algorithm in the identification of cotton top buds, compared with the original YOLOv5s model, the accuracy of the C3SE-4l+BiFPN+EIoU model has increased by 7.9%, the recall rate has increased by 2.8%, and the average precision an increase of 5.7%. These improvements and optimizations provide a new idea and method, which can provide a more efficient solution for the identification of cotton top buds.
为了提高棉花顶芽识别的准确性和实时性,提出了一种改进的 YOLOv5s 目标实时检测模型。首先,在注意机制中增加了SE模块和CBAM模块,优化通道注意和空间注意的权重比例,提高准确率;然后引入双向加权特征的BiFPN结构,加强高层特征与低层特征的融合;最后,采用新的边界框回归损失函数EIoU进行消融实验,通过降低边界框损失可以获得更多的棉花顶芽位置信息。实验结果表明,将改进算法应用于棉花顶芽识别,与原始 YOLOv5s 模型相比,C3SE-4l+BiFPN+EIoU 模型的准确率提高了 7.9%,召回率提高了 2.8%,平均精度提高了 5.7%。这些改进和优化提供了一种新的思路和方法,可以为棉花顶芽的识别提供更有效的解决方案。
{"title":"Improved YOLOv5s recognition of cotton top buds with fusion of attention and feature weighting","authors":"Lei Yin, Jian Wu, Qikong Liu, Wenxiong Wu","doi":"10.1117/12.3014605","DOIUrl":"https://doi.org/10.1117/12.3014605","url":null,"abstract":"In order to improve the accuracy and real-time performance of cotton top bud recognition, an improved YOLOv5s target real-time detection model is proposed. First, the SE module and the CBAM module in the attention mechanism are added to optimize the weight ratio of channel attention and spatial attention to improve accuracy; then the BiFPN structure of bidirectional weighted features is introduced to strengthen the fusion between high-level features and low-level features; finally, a new bounding box regression loss function EIoU is used for ablation experiments, and more position information of cotton buds can be obtained by reducing the bounding box loss. The experimental results show that, by applying the improved algorithm in the identification of cotton top buds, compared with the original YOLOv5s model, the accuracy of the C3SE-4l+BiFPN+EIoU model has increased by 7.9%, the recall rate has increased by 2.8%, and the average precision an increase of 5.7%. These improvements and optimizations provide a new idea and method, which can provide a more efficient solution for the identification of cotton top buds.","PeriodicalId":516634,"journal":{"name":"International Conference on Algorithm, Imaging Processing and Machine Vision (AIPMV 2023)","volume":"48 3","pages":"1296928 - 1296928-6"},"PeriodicalIF":0.0,"publicationDate":"2024-01-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140511442","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Moisture detection of oil-immersed bushing insulation based on color level analysis 基于色度分析的油浸套管绝缘层湿度检测
Hongmin Ren, Denan Wang, Danni Duan
The operational level and service life of oil-immersed bushings are related to the moisture content in their cellulosic insulation layers. The authors proposed a novel method based on Taylor series modeling and grey relational analysis to assess the moisture content in the cellulosic insulation of oil-immersed bushings. Unlike conventional methods relying solely on frequency domain spectroscopy (FDS) data from certain frequency ranges, this method uses Taylor series to model the full spectrum and extracts a set of moisture-related feature parameters, thus avoiding dependence on specific measurement points or ranges. To establish the relationship between feature parameters and moisture levels, a database was constructed in this study. Grey relational analysis was used to propose an alternative method for assessing the moisture in bushing cellulosic insulation. Results show the average relative error between the estimated and measured moisture was less than 15.7%. Overall, the proposed method in this study provides an effective approach for assessing the moisture in the cellulosic insulation of oil-immersed bushings.
油浸式衬套的运行水平和使用寿命与其纤维素绝缘层中的含水量有关。作者提出了一种基于泰勒级数建模和灰色关系分析的新方法,用于评估油浸式衬套纤维素绝缘层中的含水量。与仅依赖特定频率范围的频域光谱 (FDS) 数据的传统方法不同,该方法使用泰勒级数对整个光谱进行建模,并提取一组与湿度相关的特征参数,从而避免了对特定测量点或测量范围的依赖。为了确定特征参数与湿度之间的关系,本研究建立了一个数据库。通过灰色关系分析,提出了一种评估衬套纤维素绝缘材料湿度的替代方法。结果表明,估计湿度和测量湿度之间的平均相对误差小于 15.7%。总之,本研究中提出的方法为评估油浸式衬套纤维素绝缘层中的水分提供了一种有效的方法。
{"title":"Moisture detection of oil-immersed bushing insulation based on color level analysis","authors":"Hongmin Ren, Denan Wang, Danni Duan","doi":"10.1117/12.3014683","DOIUrl":"https://doi.org/10.1117/12.3014683","url":null,"abstract":"The operational level and service life of oil-immersed bushings are related to the moisture content in their cellulosic insulation layers. The authors proposed a novel method based on Taylor series modeling and grey relational analysis to assess the moisture content in the cellulosic insulation of oil-immersed bushings. Unlike conventional methods relying solely on frequency domain spectroscopy (FDS) data from certain frequency ranges, this method uses Taylor series to model the full spectrum and extracts a set of moisture-related feature parameters, thus avoiding dependence on specific measurement points or ranges. To establish the relationship between feature parameters and moisture levels, a database was constructed in this study. Grey relational analysis was used to propose an alternative method for assessing the moisture in bushing cellulosic insulation. Results show the average relative error between the estimated and measured moisture was less than 15.7%. Overall, the proposed method in this study provides an effective approach for assessing the moisture in the cellulosic insulation of oil-immersed bushings.","PeriodicalId":516634,"journal":{"name":"International Conference on Algorithm, Imaging Processing and Machine Vision (AIPMV 2023)","volume":"62 1","pages":"129692Q - 129692Q-6"},"PeriodicalIF":0.0,"publicationDate":"2024-01-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140511314","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Concrete crack identification and detection method based on improved CNN and DoG operator 基于改进型 CNN 和 DoG 算子的混凝土裂缝识别和检测方法
Haohua Luo, Yulun Wu, Yaoyang Liang, Jinshuai Ren, Zhiming Wang, Yilu Huang
Concrete will produce cracks under the long-term action of loads and affect the building safety, due to the unsatisfactory accuracy and efficiency of manual detection of concrete cracks, a concrete crack recognition and detection method based on improved CNN and DoG operator is proposed. Firstly, the recognition ability is trained by CNN to locate the valuable images from the image dataset, and then the locating crack images are greyscaled, denoised using bilateral filtering method, considering that the filtering will make the image edges blurred, the DoG operator is used to detect the completeness of the edges, and then the image is binary transformed by selecting thresholds through the one-dimensional Otus segmentation method, and the binary map is opened by the open operation, to fill in the broken parts within the cracks and protect the crack edges, and finally the length, width, and rotation angle of the crack are calculated by mapping the complete straight lines present in the crack through Hough space. The experimental results show that the method can accurately identify and detect crack features of different shapes with superior detection accuracy.
混凝土在荷载的长期作用下会产生裂缝,影响建筑物的安全,由于人工检测混凝土裂缝的精度和效率不理想,因此提出了一种基于改进型 CNN 和 DoG 算子的混凝土裂缝识别检测方法。首先,通过 CNN 训练识别能力,从图像数据集中定位出有价值的图像,然后对定位出的裂缝图像进行灰度化处理,使用双边滤波方法去噪,考虑到滤波会使图像边缘模糊,使用 DoG 算子检测边缘的完整性、然后通过一维 Otus 分割方法选择阈值对图像进行二值化转换,并通过打开操作打开二值图,以填充裂缝内的破碎部分并保护裂缝边缘,最后通过 Hough 空间映射裂缝中存在的完整直线,计算出裂缝的长度、宽度和旋转角度。实验结果表明,该方法能准确识别和检测不同形状的裂纹特征,检测精度高。
{"title":"Concrete crack identification and detection method based on improved CNN and DoG operator","authors":"Haohua Luo, Yulun Wu, Yaoyang Liang, Jinshuai Ren, Zhiming Wang, Yilu Huang","doi":"10.1117/12.3014461","DOIUrl":"https://doi.org/10.1117/12.3014461","url":null,"abstract":"Concrete will produce cracks under the long-term action of loads and affect the building safety, due to the unsatisfactory accuracy and efficiency of manual detection of concrete cracks, a concrete crack recognition and detection method based on improved CNN and DoG operator is proposed. Firstly, the recognition ability is trained by CNN to locate the valuable images from the image dataset, and then the locating crack images are greyscaled, denoised using bilateral filtering method, considering that the filtering will make the image edges blurred, the DoG operator is used to detect the completeness of the edges, and then the image is binary transformed by selecting thresholds through the one-dimensional Otus segmentation method, and the binary map is opened by the open operation, to fill in the broken parts within the cracks and protect the crack edges, and finally the length, width, and rotation angle of the crack are calculated by mapping the complete straight lines present in the crack through Hough space. The experimental results show that the method can accurately identify and detect crack features of different shapes with superior detection accuracy.","PeriodicalId":516634,"journal":{"name":"International Conference on Algorithm, Imaging Processing and Machine Vision (AIPMV 2023)","volume":"44 5","pages":"1296925 - 1296925-5"},"PeriodicalIF":0.0,"publicationDate":"2024-01-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140511333","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Energy consumption and carbon emission measurement online detection method based on hybrid particle swarm optimization algorithm 基于混合粒子群优化算法的能耗和碳排放测量在线检测方法
Tianyi Zhang, Huaiying Shang, Angang Zheng
The detection data of energy consumption and carbon emissions may be affected by equipment failure, sensor error, incomplete data collection and other factors, resulting in low detection accuracy. Based on this, an online detection method of energy consumption and carbon emissions measurement based on hybrid particle swarm optimization algorithm is proposed. Analyze the measurement principles of energy consumption and carbon emissions. On this basis, collect the measurement data of energy consumption and carbon emissions in real time, use semi-supervised learning to extract the measurement operation data, calculate the Angle between the newly generated optimization solution and the reference direction vector, and use it as the attribute space of particle update, and assign all optimization target values, and use the hybrid particle swarm optimization algorithm. The on-line measurement process based on hybrid particle swarm optimization algorithm is completed. The experimental results show that the proposed method has advantages in all aspects of performance index, AUC value is higher than 0.9, detection time is lower than 8s.
能耗和碳排放的检测数据可能会受到设备故障、传感器误差、数据采集不完整等因素的影响,导致检测精度较低。基于此,提出了一种基于混合粒子群优化算法的能耗和碳排放测量在线检测方法。分析能耗和碳排放的测量原理。在此基础上,实时采集能耗和碳排放的测量数据,利用半监督学习提取测量运行数据,计算新生成的优化解与参考方向向量之间的夹角,并将其作为粒子更新的属性空间,分配所有优化目标值,采用混合粒子群优化算法。完成了基于混合粒子群优化算法的在线测量过程。实验结果表明,所提出的方法在各方面性能指标上都具有优势,AUC 值高于 0.9,检测时间小于 8s。
{"title":"Energy consumption and carbon emission measurement online detection method based on hybrid particle swarm optimization algorithm","authors":"Tianyi Zhang, Huaiying Shang, Angang Zheng","doi":"10.1117/12.3014377","DOIUrl":"https://doi.org/10.1117/12.3014377","url":null,"abstract":"The detection data of energy consumption and carbon emissions may be affected by equipment failure, sensor error, incomplete data collection and other factors, resulting in low detection accuracy. Based on this, an online detection method of energy consumption and carbon emissions measurement based on hybrid particle swarm optimization algorithm is proposed. Analyze the measurement principles of energy consumption and carbon emissions. On this basis, collect the measurement data of energy consumption and carbon emissions in real time, use semi-supervised learning to extract the measurement operation data, calculate the Angle between the newly generated optimization solution and the reference direction vector, and use it as the attribute space of particle update, and assign all optimization target values, and use the hybrid particle swarm optimization algorithm. The on-line measurement process based on hybrid particle swarm optimization algorithm is completed. The experimental results show that the proposed method has advantages in all aspects of performance index, AUC value is higher than 0.9, detection time is lower than 8s.","PeriodicalId":516634,"journal":{"name":"International Conference on Algorithm, Imaging Processing and Machine Vision (AIPMV 2023)","volume":"164 2","pages":"129690N - 129690N-6"},"PeriodicalIF":0.0,"publicationDate":"2024-01-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140511916","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Research on obstacle distance measurement method of UAV based on image processing technology 基于图像处理技术的无人机障碍物距离测量方法研究
mingxia Lin
In order to understand the obstacle distance measurement method of UAV, a research on obstacle distance measurement method of UAV based on image processing technology is proposed. This paper proposes an obstacle detection algorithm based on edge extraction. The outline of the obstacle can be obtained by extracting the edge features in the image, and the size, shape and position of the obstacle can be obtained by using a rectangle to frame the outline. Then the distance of the obstacle can be extracted from the contour of the obstacle by using the depth map generated by the binocular camera. The millimeter wave radar in front is used to fuse ranging with the central area of binocular camera image to improve the update frequency of obstacle distance. Finally, the effectiveness of the obstacle avoidance control strategy is verified by the obstacle avoidance flight test, and the UAV can choose the optimal obstacle avoidance direction and successfully bypass when encountering obstacles. The comparison of obstacle avoidance results shows that the obstacle avoidance method in this paper is advanced and has certain engineering application value.
为了了解无人机的障碍物测距方法,提出了基于图像处理技术的无人机障碍物测距方法研究。本文提出了一种基于边缘提取的障碍物检测算法。通过提取图像中的边缘特征可以得到障碍物的轮廓,用矩形框框出轮廓可以得到障碍物的大小、形状和位置。然后利用双目摄像头生成的深度图,从障碍物轮廓中提取障碍物的距离。利用前方的毫米波雷达与双目摄像头图像的中心区域进行融合测距,以提高障碍物距离的更新频率。最后,通过避障飞行试验验证了避障控制策略的有效性,无人机在遇到障碍物时能够选择最优避障方向并成功绕过。避障结果对比表明,本文的避障方法是先进的,具有一定的工程应用价值。
{"title":"Research on obstacle distance measurement method of UAV based on image processing technology","authors":"mingxia Lin","doi":"10.1117/12.3014383","DOIUrl":"https://doi.org/10.1117/12.3014383","url":null,"abstract":"In order to understand the obstacle distance measurement method of UAV, a research on obstacle distance measurement method of UAV based on image processing technology is proposed. This paper proposes an obstacle detection algorithm based on edge extraction. The outline of the obstacle can be obtained by extracting the edge features in the image, and the size, shape and position of the obstacle can be obtained by using a rectangle to frame the outline. Then the distance of the obstacle can be extracted from the contour of the obstacle by using the depth map generated by the binocular camera. The millimeter wave radar in front is used to fuse ranging with the central area of binocular camera image to improve the update frequency of obstacle distance. Finally, the effectiveness of the obstacle avoidance control strategy is verified by the obstacle avoidance flight test, and the UAV can choose the optimal obstacle avoidance direction and successfully bypass when encountering obstacles. The comparison of obstacle avoidance results shows that the obstacle avoidance method in this paper is advanced and has certain engineering application value.","PeriodicalId":516634,"journal":{"name":"International Conference on Algorithm, Imaging Processing and Machine Vision (AIPMV 2023)","volume":"18 3","pages":"1296912 - 1296912-6"},"PeriodicalIF":0.0,"publicationDate":"2024-01-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140512105","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
International Conference on Algorithm, Imaging Processing and Machine Vision (AIPMV 2023)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1