首页 > 最新文献

Proceedings of the 2023 5th International Conference on Pattern Recognition and Intelligent Systems最新文献

英文 中文
Reconstructing 3D Shapes as an Union of Boxes from Multi-View Images 从多视图图像中重建三维形状作为盒子的联合
Zihan Yang, Minglun Gong
The task of reconstructing object shapes from input images has become increasingly important in various fields, such as computer vision, robotics, augmented reality, video games, and autonomous vehicles. While approaches for reconstructing shapes with varying levels of detail have been proposed, balancing representation accuracy and model complexity remains a challenge. To address this challenge, we propose an end-to-end approach for reconstructing object shapes from multiple images using a union of box primitives. Our approach offers a simpler and more efficient 3D representation of objects without the need for intermediate products such as voxels, resulting in faster inference times. Additionally, we introduce an auxiliary task to aid in learning how to extract and transform spatial features from images without requiring camera calibrations. Extensive experiments demonstrate that our method can produce comparable results to approaches that require 3D voxelized input while utilizing only 2D RGB images as input. Furthermore, our method significantly outperforms the aforementioned approaches in terms of inference time.
从输入图像中重建物体形状的任务在计算机视觉、机器人、增强现实、视频游戏和自动驾驶汽车等各个领域变得越来越重要。虽然已经提出了重建具有不同细节级别的形状的方法,但平衡表示精度和模型复杂性仍然是一个挑战。为了解决这一挑战,我们提出了一种端到端方法,用于使用盒原语的联合从多个图像中重建物体形状。我们的方法提供了一种更简单、更有效的物体3D表示,而不需要体素等中间产品,从而加快了推理时间。此外,我们还引入了一个辅助任务来帮助学习如何在不需要相机校准的情况下从图像中提取和转换空间特征。大量的实验表明,我们的方法可以产生与只使用2D RGB图像作为输入而需要3D体素化输入的方法相当的结果。此外,我们的方法在推理时间方面明显优于上述方法。
{"title":"Reconstructing 3D Shapes as an Union of Boxes from Multi-View Images","authors":"Zihan Yang, Minglun Gong","doi":"10.1145/3609703.3609705","DOIUrl":"https://doi.org/10.1145/3609703.3609705","url":null,"abstract":"The task of reconstructing object shapes from input images has become increasingly important in various fields, such as computer vision, robotics, augmented reality, video games, and autonomous vehicles. While approaches for reconstructing shapes with varying levels of detail have been proposed, balancing representation accuracy and model complexity remains a challenge. To address this challenge, we propose an end-to-end approach for reconstructing object shapes from multiple images using a union of box primitives. Our approach offers a simpler and more efficient 3D representation of objects without the need for intermediate products such as voxels, resulting in faster inference times. Additionally, we introduce an auxiliary task to aid in learning how to extract and transform spatial features from images without requiring camera calibrations. Extensive experiments demonstrate that our method can produce comparable results to approaches that require 3D voxelized input while utilizing only 2D RGB images as input. Furthermore, our method significantly outperforms the aforementioned approaches in terms of inference time.","PeriodicalId":101485,"journal":{"name":"Proceedings of the 2023 5th International Conference on Pattern Recognition and Intelligent Systems","volume":"96 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-07-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115668506","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Identification-Dissemination-Warning: Algorithm and Prediction of Early Warning of Network Public Opinion 识别-传播-预警:网络舆情预警算法与预测
Lin Sun
In order to better monitor public opinion, this study reviews how the existing thesis work theoretically and practically from the interdisciplinary perspective of communication and computer science and then proposes a new vision under the framework of “identification-dissemination-warning”. Real-life applications include news reports and social media in data collection, emphasizing attitude analysis, building an evaluation system that consisted of four parameters, i.e., event, dissemination, status, and response, determine how serious an emergency is based on how fast public opinion will deteriorate and provide response guidance accordingly. On the theoretical front, this study takes into account the inter-influence between different parameters and optimize semantic analysis, emergency grading and nonlinear processing with the help of Bayesian network, hierarchical network models, grey relational analysis, latent semantic analysis and BP neural network.
为了更好地监控舆情,本研究从传播学与计算机科学交叉的视角,对现有论文的理论与实践进行了回顾,并在“识别-传播-预警”的框架下提出了新的视角。现实应用包括新闻报道和社交媒体在数据收集方面,强调态度分析,构建由事件、传播、状态、响应四个参数组成的评价体系,根据舆情恶化的速度判断突发事件的严重程度,并提供相应的应对指导。在理论方面,考虑到不同参数之间的相互影响,利用贝叶斯网络、层次网络模型、灰色关联分析、潜在语义分析和BP神经网络对语义分析、应急分级和非线性处理进行优化。
{"title":"Identification-Dissemination-Warning: Algorithm and Prediction of Early Warning of Network Public Opinion","authors":"Lin Sun","doi":"10.1145/3609703.3609723","DOIUrl":"https://doi.org/10.1145/3609703.3609723","url":null,"abstract":"In order to better monitor public opinion, this study reviews how the existing thesis work theoretically and practically from the interdisciplinary perspective of communication and computer science and then proposes a new vision under the framework of “identification-dissemination-warning”. Real-life applications include news reports and social media in data collection, emphasizing attitude analysis, building an evaluation system that consisted of four parameters, i.e., event, dissemination, status, and response, determine how serious an emergency is based on how fast public opinion will deteriorate and provide response guidance accordingly. On the theoretical front, this study takes into account the inter-influence between different parameters and optimize semantic analysis, emergency grading and nonlinear processing with the help of Bayesian network, hierarchical network models, grey relational analysis, latent semantic analysis and BP neural network.","PeriodicalId":101485,"journal":{"name":"Proceedings of the 2023 5th International Conference on Pattern Recognition and Intelligent Systems","volume":"11 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-07-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114616440","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Multi-population Runge Kutta Optimizer Based on Gaussian Disturbance 基于高斯扰动的多种群Runge Kutta优化器
Jinhan Chen, Yitong Song, Jixiang Zhu, Sheng-Kai Wang
To address the lack of development capacity of Runge Kutta Optimizer, we propose the Multi-population Runge Kutta algorithm Based on Gaussian disturbance(MPRUN). In the algorithm, the population is divided into subgroups. The individuals in the subgroups are randomly selected for a global search with decreasing search radius with the number of iterations, which is used to improve the global search ability of the subgroups. In addition, the algorithm introduces a Gaussian disturbance mechanism to generate more uniformly distributed populations, performing random perturbation to the global best individual. Finally, the performance of the optimized algorithm is verified by 30 test set functions.
针对龙格库塔优化器开发能力不足的问题,提出了基于高斯扰动的多种群龙格库塔算法(MPRUN)。在该算法中,总体被划分为子组。随机选取子组中的个体进行全局搜索,搜索半径随着迭代次数的增加而减小,从而提高子组的全局搜索能力。此外,该算法引入高斯扰动机制生成更均匀分布的总体,对全局最优个体进行随机扰动。最后,通过30个测试集函数验证了优化算法的性能。
{"title":"Multi-population Runge Kutta Optimizer Based on Gaussian Disturbance","authors":"Jinhan Chen, Yitong Song, Jixiang Zhu, Sheng-Kai Wang","doi":"10.1145/3609703.3609713","DOIUrl":"https://doi.org/10.1145/3609703.3609713","url":null,"abstract":"To address the lack of development capacity of Runge Kutta Optimizer, we propose the Multi-population Runge Kutta algorithm Based on Gaussian disturbance(MPRUN). In the algorithm, the population is divided into subgroups. The individuals in the subgroups are randomly selected for a global search with decreasing search radius with the number of iterations, which is used to improve the global search ability of the subgroups. In addition, the algorithm introduces a Gaussian disturbance mechanism to generate more uniformly distributed populations, performing random perturbation to the global best individual. Finally, the performance of the optimized algorithm is verified by 30 test set functions.","PeriodicalId":101485,"journal":{"name":"Proceedings of the 2023 5th International Conference on Pattern Recognition and Intelligent Systems","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-07-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122809226","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
MSYOLOF: Multi-input-single-output encoder network with tripartite feature enhancement for object detection MSYOLOF:多输入-单输出编码器网络与三方特征增强的目标检测
Gong Cheng, Xi Yong, Xin Lyu, Tao Zeng, Xinyu Wang, Jiale Chen, Xin Li
Object detection under one-level feature is a challenging task, which requires that object representations at different scales can be extracted on a single feature map. However, existing object detectors using a one-level feature suffer from inadequate of different-scale object representations resulting in low accuracy for multi-scale object detection, especially for smaller objects. To address the problem above-mentioned, a novel object detector named MSYOLOF, is proposed to construct an effective single feature map for detecting objects of different scales. In the proposed network, three modules are proposed to bring considerable improvements, namely Feature Pyramid Pooling (FPP), Feature Perception Enhancement (FPE), and Dual Branch Receptive Field (DBRF). Firstly, the FPP module aggregates contextual information from various regions to improve the network's ability to achieve global information, which strengthens the model's understanding of the overall scene. Then, the FPE module utilizes coordinate attention to construct a residual block to obtain orientation-aware and position-sensitive information, making the network efficient in accurately locating and identifying objects of interest. Third, by rethinking the Dilated Encoder of YOLOF, the DBRF module reduces information loss and mitigates the problem of being sensitive only to large objects when dilated convolution utilizes large expansion rates. Extensive experiments are conducted on COCO benchmark to validate the effectiveness of our network, which exhibits superior performance compared to other state-of-the-art networks.
单级特征下的目标检测是一项具有挑战性的任务,它要求在单个特征图上提取不同尺度的目标表示。然而,现有的单级特征目标检测器由于缺乏不同尺度的目标表示,导致多尺度目标检测精度低,尤其是对较小的目标检测精度低。为了解决上述问题,提出了一种新的目标检测器MSYOLOF,用于构建有效的单特征映射来检测不同尺度的目标。在该网络中,提出了三个模块,即特征金字塔池(FPP)、特征感知增强(FPE)和双分支接受野(DBRF),带来了相当大的改进。首先,FPP模块聚合来自各个区域的上下文信息,提高网络获取全局信息的能力,增强模型对整体场景的理解。然后,FPE模块利用坐标注意构造残差块,获得方向感知和位置敏感信息,使网络能够高效准确地定位和识别感兴趣的目标。第三,通过重新考虑YOLOF的扩展编码器,DBRF模块减少了信息丢失,并缓解了当扩展卷积使用大扩展速率时仅对大对象敏感的问题。在COCO基准上进行了大量的实验,以验证我们的网络的有效性,与其他最先进的网络相比,它表现出优越的性能。
{"title":"MSYOLOF: Multi-input-single-output encoder network with tripartite feature enhancement for object detection","authors":"Gong Cheng, Xi Yong, Xin Lyu, Tao Zeng, Xinyu Wang, Jiale Chen, Xin Li","doi":"10.1145/3609703.3609710","DOIUrl":"https://doi.org/10.1145/3609703.3609710","url":null,"abstract":"Object detection under one-level feature is a challenging task, which requires that object representations at different scales can be extracted on a single feature map. However, existing object detectors using a one-level feature suffer from inadequate of different-scale object representations resulting in low accuracy for multi-scale object detection, especially for smaller objects. To address the problem above-mentioned, a novel object detector named MSYOLOF, is proposed to construct an effective single feature map for detecting objects of different scales. In the proposed network, three modules are proposed to bring considerable improvements, namely Feature Pyramid Pooling (FPP), Feature Perception Enhancement (FPE), and Dual Branch Receptive Field (DBRF). Firstly, the FPP module aggregates contextual information from various regions to improve the network's ability to achieve global information, which strengthens the model's understanding of the overall scene. Then, the FPE module utilizes coordinate attention to construct a residual block to obtain orientation-aware and position-sensitive information, making the network efficient in accurately locating and identifying objects of interest. Third, by rethinking the Dilated Encoder of YOLOF, the DBRF module reduces information loss and mitigates the problem of being sensitive only to large objects when dilated convolution utilizes large expansion rates. Extensive experiments are conducted on COCO benchmark to validate the effectiveness of our network, which exhibits superior performance compared to other state-of-the-art networks.","PeriodicalId":101485,"journal":{"name":"Proceedings of the 2023 5th International Conference on Pattern Recognition and Intelligent Systems","volume":"30 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-07-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117075465","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Fiber Recognition Algorithm Based on Improved Mask RCNN 基于改进掩模RCNN的光纤识别算法
Zheng-hao Huo, Ziyin Li, Ruide Qu, Xiaodong Wang, Fei Ye, Jun Jin, Xiaojuan Yao
In response to the application requirements of identifying and classifying multiple types of fibers, this paper proposes a fiber recognition algorithm based on improved Mask RCNN to achieve recognition and classification of multiple types of fibers, reduce the labor cost of fiber inspection, and improve inspection efficiency and quality. Firstly, a data augmentation strategy is adopted, which combines three data augmentation methods: RandomFlip, RandomCrop, and Cutout to achieve the best increase in network performance; Subsequently, a multi-scale training strategy is introduced to improve the model's training efficiency while also enhancing its robustness to scale; Finally, the attention mechanism module of convolutional blocks is added to solve the problem of low recognition and classification accuracy caused by small differences in fine-grained granularity between certain fiber classes. The experimental results show that the algorithm achieves a recognition and classification accuracy of 97.87% on the test set by introducing techniques such as data augmentation, multi-scale training, and CBAM, significantly improving the recognition and classification accuracy of various fiber targets.
针对多类型纤维的识别和分类的应用需求,本文提出了一种基于改进Mask RCNN的纤维识别算法,实现对多类型纤维的识别和分类,降低纤维检测的人工成本,提高检测效率和质量。首先,采用数据增强策略,结合RandomFlip、RandomCrop和Cutout三种数据增强方法,实现网络性能的最佳提升;随后,引入多尺度训练策略,在提高模型训练效率的同时增强模型的尺度鲁棒性;最后,增加了卷积块的注意机制模块,解决了某些纤维类之间细粒度差异小导致识别分类精度低的问题。实验结果表明,该算法通过引入数据增强、多尺度训练、CBAM等技术,在测试集上实现了97.87%的识别分类准确率,显著提高了对各种光纤目标的识别分类准确率。
{"title":"Fiber Recognition Algorithm Based on Improved Mask RCNN","authors":"Zheng-hao Huo, Ziyin Li, Ruide Qu, Xiaodong Wang, Fei Ye, Jun Jin, Xiaojuan Yao","doi":"10.1145/3609703.3609719","DOIUrl":"https://doi.org/10.1145/3609703.3609719","url":null,"abstract":"In response to the application requirements of identifying and classifying multiple types of fibers, this paper proposes a fiber recognition algorithm based on improved Mask RCNN to achieve recognition and classification of multiple types of fibers, reduce the labor cost of fiber inspection, and improve inspection efficiency and quality. Firstly, a data augmentation strategy is adopted, which combines three data augmentation methods: RandomFlip, RandomCrop, and Cutout to achieve the best increase in network performance; Subsequently, a multi-scale training strategy is introduced to improve the model's training efficiency while also enhancing its robustness to scale; Finally, the attention mechanism module of convolutional blocks is added to solve the problem of low recognition and classification accuracy caused by small differences in fine-grained granularity between certain fiber classes. The experimental results show that the algorithm achieves a recognition and classification accuracy of 97.87% on the test set by introducing techniques such as data augmentation, multi-scale training, and CBAM, significantly improving the recognition and classification accuracy of various fiber targets.","PeriodicalId":101485,"journal":{"name":"Proceedings of the 2023 5th International Conference on Pattern Recognition and Intelligent Systems","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-07-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134142358","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Research on vehicle spare parts demand forecast based on XGBoost-LightGBM 基于XGBoost-LightGBM的汽车备件需求预测研究
Qianqian Zhu, Liu Yang, Yingnan Liu
Vehicle spare parts demand forecasting is crucial for optimizing inventory and improving maintenance efficiency. This study aims to explore a vehicle spare parts demand forecasting method based on the fusion of XGBoost and LightGBM models to enhance prediction accuracy and precision. In this paper, we first collected a large amount of historical spare parts demand data and associated feature data, followed by data preprocessing and feature engineering. Then, we constructed individual machine learning models as well as the XGBoost-LightGBM fusion model, and performed parameter tuning and optimization using the Optuna framework. Experimental results demonstrate that both XGBoost and LightGBM models achieve favorable performance in spare parts demand forecasting, but the fusion of these two models further enhances prediction accuracy. The fusion model exhibits lower MAPE values compared to individual models on the test set, confirming its superiority and effectiveness. This method leverages the strengths of both models and improves prediction accuracy through weight fusion, offering practical significance in achieving accurate spare parts demand forecasting, optimizing inventory, and improving maintenance efficiency. Future research can explore alternative machine learning algorithms and feature engineering methods to further enhance the accuracy and reliability of vehicle spare parts forecasting.
汽车零配件需求预测是优化库存、提高维修效率的关键。本研究旨在探索一种基于XGBoost和LightGBM模型融合的汽车备件需求预测方法,以提高预测的准确度和精度。本文首先收集了大量的历史备件需求数据和相关特征数据,然后进行数据预处理和特征工程。然后,我们构建了单独的机器学习模型以及XGBoost-LightGBM融合模型,并使用Optuna框架进行了参数调优。实验结果表明,XGBoost和LightGBM模型在备件需求预测中都取得了较好的效果,但两种模型的融合进一步提高了预测精度。该融合模型在测试集中表现出较低的MAPE值,证实了其优越性和有效性。该方法利用两种模型的优点,通过权值融合提高预测精度,对实现准确的备件需求预测、优化库存、提高维修效率具有现实意义。未来的研究可以探索替代的机器学习算法和特征工程方法,进一步提高汽车零部件预测的准确性和可靠性。
{"title":"Research on vehicle spare parts demand forecast based on XGBoost-LightGBM","authors":"Qianqian Zhu, Liu Yang, Yingnan Liu","doi":"10.1145/3609703.3609721","DOIUrl":"https://doi.org/10.1145/3609703.3609721","url":null,"abstract":"Vehicle spare parts demand forecasting is crucial for optimizing inventory and improving maintenance efficiency. This study aims to explore a vehicle spare parts demand forecasting method based on the fusion of XGBoost and LightGBM models to enhance prediction accuracy and precision. In this paper, we first collected a large amount of historical spare parts demand data and associated feature data, followed by data preprocessing and feature engineering. Then, we constructed individual machine learning models as well as the XGBoost-LightGBM fusion model, and performed parameter tuning and optimization using the Optuna framework. Experimental results demonstrate that both XGBoost and LightGBM models achieve favorable performance in spare parts demand forecasting, but the fusion of these two models further enhances prediction accuracy. The fusion model exhibits lower MAPE values compared to individual models on the test set, confirming its superiority and effectiveness. This method leverages the strengths of both models and improves prediction accuracy through weight fusion, offering practical significance in achieving accurate spare parts demand forecasting, optimizing inventory, and improving maintenance efficiency. Future research can explore alternative machine learning algorithms and feature engineering methods to further enhance the accuracy and reliability of vehicle spare parts forecasting.","PeriodicalId":101485,"journal":{"name":"Proceedings of the 2023 5th International Conference on Pattern Recognition and Intelligent Systems","volume":"29 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-07-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123324728","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Building Segmentation from Remote Sensing Image via DWT Attention Networks 基于DWT关注网络的遥感影像建筑物分割
Yin-hua Wu, Mingquan Zhou, Shenglin Geng, Dan Zhang
The attention mechanism has been widely used and achieved good results in many visual tasks. But the calculations of attention mechanism in vision tasks consume huge spaces and times, which is the obvious disadvantage of this method. In order to alleviate this problem, we use the DWT(Discrete Wavelet Transform) method to reduce the complexity of attention calculation. DWT can transform an N-dimensional vector into two vectors, one is the low-frequency component of N/2 dimension and the other is high-frequency component of N/2 dimension too. We only use the low-frequency to calculate the attention matrixes, which can reduce the complexity of matrix multiplication, then the time and space consumption of the network is reduced significantly. We also find that the building segmentation in the remote sensing image is different from the other scene segmentation, that the sizes and numbers of different classes of the targets in the general scene images are obvious. Despite all this, our method is still applicable for the targets with large numbers and sizes in general scene images, but not for the targets with small sizes and numbers, and this view is also verified by the subsequent experiments on different datasets. We apply our method on three typical networks (Danet, Swin and Segmenter), and carry out comprehensive experiments on the Cityscape dataset and three building segmentation datasets (Inria Aerial Dataset, Massachusetts Buildings Dataset and Chinese Style Architecture Dataset). The experiments show that, our method is more suitable for building segmentation and can reduce the complexity of the model calculation in building segmentation, and the Mean IoU of segmentation results is not reduced clearly, some even improved.
注意机制在许多视觉任务中得到了广泛的应用,并取得了良好的效果。但视觉任务中注意机制的计算耗费巨大的空间和时间,这是该方法的明显缺点。为了缓解这一问题,我们采用离散小波变换(DWT)方法来降低注意力计算的复杂度。DWT可以将一个N维向量变换为两个向量,一个是N/2维的低频分量,另一个也是N/2维的高频分量。我们只使用低频来计算注意矩阵,这样可以降低矩阵乘法的复杂度,从而显著降低网络的时间和空间消耗。我们还发现,遥感图像中的建筑物分割与其他场景分割不同,在一般场景图像中,不同类别目标的大小和数量是明显的。尽管如此,我们的方法仍然适用于一般场景图像中数量和尺寸较大的目标,而不适用于尺寸和尺寸较小的目标,并且这一观点也通过后续在不同数据集上的实验得到验证。我们在三个典型网络(Danet、Swin和Segmenter)上应用了我们的方法,并在Cityscape数据集和三个建筑分割数据集(Inria Aerial dataset、Massachusetts Buildings dataset和Chinese Style Architecture dataset)上进行了综合实验。实验表明,我们的方法更适合于建筑物分割,可以降低建筑物分割中模型计算的复杂度,分割结果的Mean IoU没有明显降低,有的甚至有所提高。
{"title":"Building Segmentation from Remote Sensing Image via DWT Attention Networks","authors":"Yin-hua Wu, Mingquan Zhou, Shenglin Geng, Dan Zhang","doi":"10.1145/3609703.3609704","DOIUrl":"https://doi.org/10.1145/3609703.3609704","url":null,"abstract":"The attention mechanism has been widely used and achieved good results in many visual tasks. But the calculations of attention mechanism in vision tasks consume huge spaces and times, which is the obvious disadvantage of this method. In order to alleviate this problem, we use the DWT(Discrete Wavelet Transform) method to reduce the complexity of attention calculation. DWT can transform an N-dimensional vector into two vectors, one is the low-frequency component of N/2 dimension and the other is high-frequency component of N/2 dimension too. We only use the low-frequency to calculate the attention matrixes, which can reduce the complexity of matrix multiplication, then the time and space consumption of the network is reduced significantly. We also find that the building segmentation in the remote sensing image is different from the other scene segmentation, that the sizes and numbers of different classes of the targets in the general scene images are obvious. Despite all this, our method is still applicable for the targets with large numbers and sizes in general scene images, but not for the targets with small sizes and numbers, and this view is also verified by the subsequent experiments on different datasets. We apply our method on three typical networks (Danet, Swin and Segmenter), and carry out comprehensive experiments on the Cityscape dataset and three building segmentation datasets (Inria Aerial Dataset, Massachusetts Buildings Dataset and Chinese Style Architecture Dataset). The experiments show that, our method is more suitable for building segmentation and can reduce the complexity of the model calculation in building segmentation, and the Mean IoU of segmentation results is not reduced clearly, some even improved.","PeriodicalId":101485,"journal":{"name":"Proceedings of the 2023 5th International Conference on Pattern Recognition and Intelligent Systems","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-07-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129285724","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
CapsNet-based drift detection in data stream mining 数据流挖掘中基于capsnet的漂移检测
Borong Lin, Nanlin Jin
For data streams, drift detection methods warn and detect the changes in patterns over time. For example, in smart manufacturing, many data streams are generated from sensors that monitor the real-time operation of manufacturing. Drift detection can be used to discover if and how the operation status changes. At present, there have been three main approaches in drift detection: error rate-based, distribution-based, and hypothesis-based. However, these approaches bear an impractical limitation: delays due to the demand for computational time. In a large-scale and high-speed data stream, a time-efficient detector is vital. To address this, this paper proposes a CapsNet-based drift detection algorithm (CapsNet-DDM). Our experimental results and comparative studies have found that CapsNet-DDM demonstrates a distinguishing advantage on computational time, with no compromise on accuracy, F1 score, and effective drift detection rates.
对于数据流,漂移检测方法警告并检测模式随时间的变化。例如,在智能制造中,许多数据流是由监控制造实时运行的传感器产生的。漂移检测可以用来发现是否以及如何操作状态的变化。目前,漂移检测主要有三种方法:基于错误率的方法、基于分布的方法和基于假设的方法。然而,这些方法有一个不切实际的限制:由于对计算时间的需求而导致的延迟。在大规模和高速数据流中,时间效率高的检测器是至关重要的。为了解决这个问题,本文提出了一种基于capsnet的漂移检测算法(CapsNet-DDM)。我们的实验结果和比较研究发现,CapsNet-DDM在计算时间上具有显著优势,在准确性、F1分数和有效漂移检测率方面没有妥协。
{"title":"CapsNet-based drift detection in data stream mining","authors":"Borong Lin, Nanlin Jin","doi":"10.1145/3609703.3609724","DOIUrl":"https://doi.org/10.1145/3609703.3609724","url":null,"abstract":"For data streams, drift detection methods warn and detect the changes in patterns over time. For example, in smart manufacturing, many data streams are generated from sensors that monitor the real-time operation of manufacturing. Drift detection can be used to discover if and how the operation status changes. At present, there have been three main approaches in drift detection: error rate-based, distribution-based, and hypothesis-based. However, these approaches bear an impractical limitation: delays due to the demand for computational time. In a large-scale and high-speed data stream, a time-efficient detector is vital. To address this, this paper proposes a CapsNet-based drift detection algorithm (CapsNet-DDM). Our experimental results and comparative studies have found that CapsNet-DDM demonstrates a distinguishing advantage on computational time, with no compromise on accuracy, F1 score, and effective drift detection rates.","PeriodicalId":101485,"journal":{"name":"Proceedings of the 2023 5th International Conference on Pattern Recognition and Intelligent Systems","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-07-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"120992664","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
A study on the line loss index of a substation area based on cooperative games with multiple influencing factors 基于多影响因素合作博弈的变电站区域线损指标研究
Linfeng Wu, Xiaowei Yang, Hao Yang, Zhenhui Zhu, Shunli Chen
The line loss rate varies significantly among different substation areas due to diverse influencing factors. Consequently, a study is conducted to investigate the line loss index of a substation area by employing a cooperative game approach that considers multiple influencing factors. Firstly, utilizing the available fundamental data of the substation area, construct a substation area factor suitable for the calculation of "one substation area, one index". Subsequently, an initial low-voltage substation area line loss prediction model was constructed using Bi-LSTM. Finally, the weights of each influencing factor are calculated using a cooperative game strategy, and the attention mechanism is applied to Bi-LSTM. After the model is trained and optimized, the predicted value for the line loss index for each substation area is output. Experiments indicate that the algorithm can effectively enhances the accuracy of predicting the line loss index value in the substation area, and assist in customized and refined management of loss reduction in the low-voltage distribution substation area.
不同变电所区域间线损率差异较大,影响因素不同。因此,本文采用考虑多种影响因素的合作博弈方法对变电站区域线损指标进行了研究。首先,利用现有的变电站面积基础数据,构建适合于“一个变电站面积,一个指标”计算的变电站面积因子。随后,利用Bi-LSTM构建了低压变电站区域线损预测的初始模型。最后,利用合作博弈策略计算各影响因素的权重,并将注意机制应用到Bi-LSTM中。模型经过训练和优化后,输出各变电所区域的线损指标预测值。实验表明,该算法能有效提高变电所区域线损指标值预测的准确性,有助于低压配电变电所区域的降损定制化精细化管理。
{"title":"A study on the line loss index of a substation area based on cooperative games with multiple influencing factors","authors":"Linfeng Wu, Xiaowei Yang, Hao Yang, Zhenhui Zhu, Shunli Chen","doi":"10.1145/3609703.3609715","DOIUrl":"https://doi.org/10.1145/3609703.3609715","url":null,"abstract":"The line loss rate varies significantly among different substation areas due to diverse influencing factors. Consequently, a study is conducted to investigate the line loss index of a substation area by employing a cooperative game approach that considers multiple influencing factors. Firstly, utilizing the available fundamental data of the substation area, construct a substation area factor suitable for the calculation of \"one substation area, one index\". Subsequently, an initial low-voltage substation area line loss prediction model was constructed using Bi-LSTM. Finally, the weights of each influencing factor are calculated using a cooperative game strategy, and the attention mechanism is applied to Bi-LSTM. After the model is trained and optimized, the predicted value for the line loss index for each substation area is output. Experiments indicate that the algorithm can effectively enhances the accuracy of predicting the line loss index value in the substation area, and assist in customized and refined management of loss reduction in the low-voltage distribution substation area.","PeriodicalId":101485,"journal":{"name":"Proceedings of the 2023 5th International Conference on Pattern Recognition and Intelligent Systems","volume":"29 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-07-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127563414","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Vehicle Appearance Damage Detection Based on Mask R-CNN 基于掩模R-CNN的车辆外观损伤检测
Fei Meng, Qianqian Zhu, Xuening Wu
Fei Meng Automotive Data of China Co.,Ltd., China Automotive Technology and Research Center Co.,Ltd., Tianjin, China mengfei@catarc.ac.cn Qianqian Zhu* Automotive Data of China Co.,Ltd., China Automotive Technology and Research Center Co.,Ltd., Tianjin, China zhuqianqian@catarc.ac.cn* Xuening Wu Automotive Data of China Co.,Ltd., China Automotive Technology and Research Center Co.,Ltd., Tianjin, China wuxuening@catarc.ac.cn
飞盟汽车数据中国有限公司,中国汽车技术研究中心有限公司中国天津mengfei@catarc.ac.cn朱倩倩*中国汽车数据有限公司,中国汽车技术研究中心有限公司中国天津zhuqianqian@catarc.ac.cn*武学宁中国汽车数据有限公司,中国汽车技术研究中心有限公司,中国天津wuxuening@catarc.ac.cn
{"title":"Vehicle Appearance Damage Detection Based on Mask R-CNN","authors":"Fei Meng, Qianqian Zhu, Xuening Wu","doi":"10.1145/3609703.3609709","DOIUrl":"https://doi.org/10.1145/3609703.3609709","url":null,"abstract":"Fei Meng Automotive Data of China Co.,Ltd., China Automotive Technology and Research Center Co.,Ltd., Tianjin, China mengfei@catarc.ac.cn Qianqian Zhu* Automotive Data of China Co.,Ltd., China Automotive Technology and Research Center Co.,Ltd., Tianjin, China zhuqianqian@catarc.ac.cn* Xuening Wu Automotive Data of China Co.,Ltd., China Automotive Technology and Research Center Co.,Ltd., Tianjin, China wuxuening@catarc.ac.cn","PeriodicalId":101485,"journal":{"name":"Proceedings of the 2023 5th International Conference on Pattern Recognition and Intelligent Systems","volume":"365 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-07-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"120896007","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Proceedings of the 2023 5th International Conference on Pattern Recognition and Intelligent Systems
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1