首页 > 最新文献

Proceedings of the 4th International Conference on Control and Computer Vision最新文献

英文 中文
A Multi-Scale Self-Attention Network for Diabetic Retinopathy Retrieval 糖尿病视网膜病变检索的多尺度自注意网络
Pub Date : 2021-08-13 DOI: 10.1145/3484274.3484290
Ming Zeng, Jiansheng Fang, Hanpei Miao, Tianyang Zhang, Jiang Liu
Diabetic retinopathy (DR), a complication due to diabetes, is a common cause of progressive damage to the retina. The mass screening of populations for DR is time-consuming. Therefore, computerized diagnosis is of great significance in the clinical practice, which providing evidence to assist clinicians in decision making. Specifically, hemorrhages, microaneurysms, hard exudates, soft exudates, and other lesions are verified to be closely associated with DR. These lesions, however, are scattered in different positions and sizes in fundus images, the internal relation of which are hard to be reserved in the ultimate features due to a large number of convolution layers that reduce the detail characteristics. In this paper, we present a deep-learning network with a multi-scale self-attention module to aggregate the global context to learned features for DR image retrieval. The multi-scale fusion enhances, in terms of scale, the efficacious latent relation of different positions in features explored by the self-attention. For the experiment, the proposed network is validated on the Kaggle DR dataset, and the result shows that it achieves state-of-the-art performance.
糖尿病视网膜病变(DR)是由糖尿病引起的并发症,是视网膜进行性损伤的常见原因。对人群进行大规模的DR筛查非常耗时。因此,计算机诊断在临床实践中具有重要意义,为临床医生的决策提供依据。具体而言,出血、微动脉瘤、硬渗出物、软渗出物等病变被证实与dr密切相关,但这些病变在眼底图像中分散在不同位置和大小,由于大量的卷积层减少了细节特征,难以在最终特征中保留其内部关系。在本文中,我们提出了一个具有多尺度自关注模块的深度学习网络,将全局上下文聚合为学习特征,用于DR图像检索。多尺度融合在尺度上增强了自关注所探索的特征中不同位置的有效潜在关系。在实验中,在Kaggle DR数据集上对所提出的网络进行了验证,结果表明该网络达到了最先进的性能。
{"title":"A Multi-Scale Self-Attention Network for Diabetic Retinopathy Retrieval","authors":"Ming Zeng, Jiansheng Fang, Hanpei Miao, Tianyang Zhang, Jiang Liu","doi":"10.1145/3484274.3484290","DOIUrl":"https://doi.org/10.1145/3484274.3484290","url":null,"abstract":"Diabetic retinopathy (DR), a complication due to diabetes, is a common cause of progressive damage to the retina. The mass screening of populations for DR is time-consuming. Therefore, computerized diagnosis is of great significance in the clinical practice, which providing evidence to assist clinicians in decision making. Specifically, hemorrhages, microaneurysms, hard exudates, soft exudates, and other lesions are verified to be closely associated with DR. These lesions, however, are scattered in different positions and sizes in fundus images, the internal relation of which are hard to be reserved in the ultimate features due to a large number of convolution layers that reduce the detail characteristics. In this paper, we present a deep-learning network with a multi-scale self-attention module to aggregate the global context to learned features for DR image retrieval. The multi-scale fusion enhances, in terms of scale, the efficacious latent relation of different positions in features explored by the self-attention. For the experiment, the proposed network is validated on the Kaggle DR dataset, and the result shows that it achieves state-of-the-art performance.","PeriodicalId":143540,"journal":{"name":"Proceedings of the 4th International Conference on Control and Computer Vision","volume":"8 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-08-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114995233","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An Object Detection Algorithm Combining FPN Structure With DETR 一种结合FPN结构和DETR的目标检测算法
Pub Date : 2021-08-13 DOI: 10.1145/3484274.3484284
Nan Xiang, Chuanzhong Pan, Xiaozhao Li
In order to solve the problem of low detection accuracy of the DETR model for small and medium objects, an object detection algorithm with improved feature extraction combined with FPN structure combined with DETR is proposed. This method first extracts features from the original image through the improved Darknet53 network. In this process, the 104*104 size feature map after the first residual error in the second stage is additionally output as a fourth-scale feature map. Combine this feature map with the feature maps output from the original 3 stages to form 4 feature map outputs of different scales. Secondly, it uses FPN to down-sample and up-sample the feature maps of 4 scales, and to merge them to output 52*52 scales. Then, the feature map and the positional encoding are combined and input into the Transformer to obtain the data, and the category and position information of the predicted object are output through FFNs. On the COCO2017 data set, the accuracy has been improved compared with other models.
为了解决DETR模型对中小目标检测精度低的问题,提出了一种改进特征提取与FPN结构结合DETR的目标检测算法。该方法首先通过改进的Darknet53网络从原始图像中提取特征。在这个过程中,在第二阶段的第一次残差后的104*104大小的特征图作为四尺度特征图额外输出。将该特征图与原3个阶段输出的特征图结合,形成4个不同尺度的特征图输出。其次,利用FPN对4个尺度的特征图进行下采样和上采样,并合并为52*52尺度的输出;然后将特征映射与位置编码相结合,输入到Transformer中获得数据,并通过ffn输出预测对象的类别和位置信息。在COCO2017数据集上,与其他模型相比,精度得到了提高。
{"title":"An Object Detection Algorithm Combining FPN Structure With DETR","authors":"Nan Xiang, Chuanzhong Pan, Xiaozhao Li","doi":"10.1145/3484274.3484284","DOIUrl":"https://doi.org/10.1145/3484274.3484284","url":null,"abstract":"In order to solve the problem of low detection accuracy of the DETR model for small and medium objects, an object detection algorithm with improved feature extraction combined with FPN structure combined with DETR is proposed. This method first extracts features from the original image through the improved Darknet53 network. In this process, the 104*104 size feature map after the first residual error in the second stage is additionally output as a fourth-scale feature map. Combine this feature map with the feature maps output from the original 3 stages to form 4 feature map outputs of different scales. Secondly, it uses FPN to down-sample and up-sample the feature maps of 4 scales, and to merge them to output 52*52 scales. Then, the feature map and the positional encoding are combined and input into the Transformer to obtain the data, and the category and position information of the predicted object are output through FFNs. On the COCO2017 data set, the accuracy has been improved compared with other models.","PeriodicalId":143540,"journal":{"name":"Proceedings of the 4th International Conference on Control and Computer Vision","volume":"25 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-08-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114349831","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Ensemble Multilayer Perceptron Model for Day-ahead Photovoltaic Forecasting 光伏日前预测的集成多层感知器模型
Pub Date : 2021-08-13 DOI: 10.1145/3484274.3484304
Minli Wang, Peihong Wang
World-wide deployment of photovoltaic system requires accurate forecasting concerning of the uncertainty and imprecision of solar radiation. Multilayer perceptron (MLP) is commonly used in day-ahead photovoltaic forecasting, which has excellent performances in convergence speed and a disadvantage of easily causing overfitting. An ensemble model of MLP is proposed in this paper to counteract the overfitting and reduce the variance of a single MLP model. The input of the ensemble model for day-ahead photovoltaic forecasting comprises feature vectors and the 24-hour power generation of the nearest day. The connection coefficients between MLP are defined by the discounting of feature distance, which measures the dissimilarity between input feature vectors. The forecasting results of a PV system in Macau verifies the effectiveness of the proposed ensemble model of MLP for solving day-ahead photovoltaic forecasting problems.
光伏发电系统在世界范围内的部署需要对太阳辐射的不确定性和不精确性进行准确的预测。多层感知器(Multilayer perceptron, MLP)是光伏日前预测中常用的一种方法,它具有较快的收敛速度和易引起过拟合的缺点。为了克服单个MLP模型的过拟合和减小方差,本文提出了一种MLP集成模型。日前光伏预测集成模型的输入包括特征向量和最近一天的24小时发电量。MLP之间的连接系数通过特征距离的折现来定义,特征距离衡量输入特征向量之间的不相似度。澳门光伏系统的预测结果验证了所提出的MLP集成模型在解决光伏日前预测问题上的有效性。
{"title":"Ensemble Multilayer Perceptron Model for Day-ahead Photovoltaic Forecasting","authors":"Minli Wang, Peihong Wang","doi":"10.1145/3484274.3484304","DOIUrl":"https://doi.org/10.1145/3484274.3484304","url":null,"abstract":"World-wide deployment of photovoltaic system requires accurate forecasting concerning of the uncertainty and imprecision of solar radiation. Multilayer perceptron (MLP) is commonly used in day-ahead photovoltaic forecasting, which has excellent performances in convergence speed and a disadvantage of easily causing overfitting. An ensemble model of MLP is proposed in this paper to counteract the overfitting and reduce the variance of a single MLP model. The input of the ensemble model for day-ahead photovoltaic forecasting comprises feature vectors and the 24-hour power generation of the nearest day. The connection coefficients between MLP are defined by the discounting of feature distance, which measures the dissimilarity between input feature vectors. The forecasting results of a PV system in Macau verifies the effectiveness of the proposed ensemble model of MLP for solving day-ahead photovoltaic forecasting problems.","PeriodicalId":143540,"journal":{"name":"Proceedings of the 4th International Conference on Control and Computer Vision","volume":"8 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-08-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116347795","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An Improved Image Segmentation Method of BiSeNetV2 Network 一种改进的BiSeNetV2网络图像分割方法
Pub Date : 2021-08-13 DOI: 10.1145/3484274.3484277
Peng Liu, Huan Zhang, Gaochao Yang, Qing Wang
The task of image semantic segmentation is to annotate and segment the semantic information of different types of objects in the image, and predict the category and location information of objects. The difficulty lies in obtaining enough semantic information while retaining enough space information. In order to solve this problem, this paper proposes an improved BiSeNetV2 network. The main idea is to add DenseASPP module to detail branch to obtain larger receptive field, and add efficient channel attention (ECA) module to detail and semantic branch to optimize the feature graph extracted in each stage. so as to further improve the network acquisition. Experimental results show that the proposed algorithm improves the MIoU index by 1.62% on cityscapes dataset, and achieves better performance than BiSeNetV2 network.
图像语义分割的任务是对图像中不同类型物体的语义信息进行标注和分割,并预测物体的类别和位置信息。难点在于获取足够的语义信息的同时保留足够的空间信息。为了解决这一问题,本文提出了一种改进的BiSeNetV2网络。其主要思想是在细节分支中加入DenseASPP模块,获得更大的接受野;在细节分支和语义分支中加入高效通道注意(ECA)模块,对各阶段提取的特征图进行优化。从而进一步完善网络采集。实验结果表明,该算法在城市景观数据集上的MIoU指数提高了1.62%,取得了比BiSeNetV2网络更好的性能。
{"title":"An Improved Image Segmentation Method of BiSeNetV2 Network","authors":"Peng Liu, Huan Zhang, Gaochao Yang, Qing Wang","doi":"10.1145/3484274.3484277","DOIUrl":"https://doi.org/10.1145/3484274.3484277","url":null,"abstract":"The task of image semantic segmentation is to annotate and segment the semantic information of different types of objects in the image, and predict the category and location information of objects. The difficulty lies in obtaining enough semantic information while retaining enough space information. In order to solve this problem, this paper proposes an improved BiSeNetV2 network. The main idea is to add DenseASPP module to detail branch to obtain larger receptive field, and add efficient channel attention (ECA) module to detail and semantic branch to optimize the feature graph extracted in each stage. so as to further improve the network acquisition. Experimental results show that the proposed algorithm improves the MIoU index by 1.62% on cityscapes dataset, and achieves better performance than BiSeNetV2 network.","PeriodicalId":143540,"journal":{"name":"Proceedings of the 4th International Conference on Control and Computer Vision","volume":"43 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-08-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134291250","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
An Improved Monocular PL-SlAM Method with Point-Line Feature Fusion under Low-Texture Environment 低纹理环境下改进的点-线特征融合单目pls - slam方法
Pub Date : 2021-08-13 DOI: 10.1145/3484274.3484293
Gaochao Yang, Qing Wang, Peng Liu, Huan Zhang
Traditional visual SLAM only relies on the point features in the scene to complete positioning and mapping. When the texture information in the scene is missing, it affects the accuracy of pose estimation and mapping. In the artificial structured environment, there are a lot of structured lines that can be utilized. Compared with point features, line features contain richer information. For example, structure lines can be used to construct surface features. To improve the robustness and stability of visual SLAM positioning in a low-texture environment, we propose a new point-line feature Visual inertial navigation system based on traditional SLAM method, which makes full use of the structural line features in the scene. Compared to the traditional SLAM system which use point-line features, we adopt a new point-line feature error reprojection model-cross-product of between projection line feature and detected line feature and nonlinear optimization strategy under long line, aiming to increase the robustness in a low-texture environment. The proposed algorithm has been verified by EuRoc dataset and real-world scenarios, and the results show that our algorithm has a greater improvement in accuracy.
传统的视觉SLAM仅依靠场景中的点特征来完成定位和映射。当场景中的纹理信息缺失时,会影响姿态估计和映射的准确性。在人工的结构化环境中,有很多可以利用的结构化线条。与点特征相比,线特征包含更丰富的信息。例如,结构线可以用来构造表面特征。为了提高低纹理环境下视觉SLAM定位的鲁棒性和稳定性,在传统SLAM方法的基础上,充分利用场景中的结构线特征,提出了一种新的点-线特征视觉惯导系统。与传统的点线特征SLAM系统相比,我们采用了一种新的点线特征误差重投影模型——投影线特征与检测线特征的叉积,并采用了长线下的非线性优化策略,以提高在低纹理环境下的鲁棒性。通过EuRoc数据集和真实场景对算法进行了验证,结果表明算法在精度上有较大的提高。
{"title":"An Improved Monocular PL-SlAM Method with Point-Line Feature Fusion under Low-Texture Environment","authors":"Gaochao Yang, Qing Wang, Peng Liu, Huan Zhang","doi":"10.1145/3484274.3484293","DOIUrl":"https://doi.org/10.1145/3484274.3484293","url":null,"abstract":"Traditional visual SLAM only relies on the point features in the scene to complete positioning and mapping. When the texture information in the scene is missing, it affects the accuracy of pose estimation and mapping. In the artificial structured environment, there are a lot of structured lines that can be utilized. Compared with point features, line features contain richer information. For example, structure lines can be used to construct surface features. To improve the robustness and stability of visual SLAM positioning in a low-texture environment, we propose a new point-line feature Visual inertial navigation system based on traditional SLAM method, which makes full use of the structural line features in the scene. Compared to the traditional SLAM system which use point-line features, we adopt a new point-line feature error reprojection model-cross-product of between projection line feature and detected line feature and nonlinear optimization strategy under long line, aiming to increase the robustness in a low-texture environment. The proposed algorithm has been verified by EuRoc dataset and real-world scenarios, and the results show that our algorithm has a greater improvement in accuracy.","PeriodicalId":143540,"journal":{"name":"Proceedings of the 4th International Conference on Control and Computer Vision","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-08-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130837594","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
FlexiNet: Fast and Accurate Vehicle Detection for Autonomous Vehicles FlexiNet:自动驾驶汽车快速准确的车辆检测
Pub Date : 2021-08-13 DOI: 10.1145/3484274.3484282
Sabeeha Mehtab, Farah Sarwar, Weiqi Yan
Autonomous vehicle has come to reach on the road; however accurate road perception in real-time is one of the crucial factors towards its success. The greatest challenge in this direction includes occlusion, truncation, lighting conditions, and complex backgrounds. In order to improve the accuracy and detection speed of vehicle detection, a dynamic scaling network is proposed that assists in constructing a balanced shape neural network to achieve optimum accuracy with minimal hardware. The net architecture is influenced by YOLOv5 and is composed of Cross-Stage Partial Network (CSPNet) as its backbone. In order to go even further, we have proposed an auto-anchor generating method that makes the network suitable for any datasets. Our neural network is fine-tuned by using activation, loss, and optimization functions so as to get the optimum results. Our experimental results demonstrate that the proposed net provides comparable performance of YOLOv4 and Faster R-CNN based on KITTI dataset as the benchmark.
自动驾驶汽车已经开始在道路上行驶;然而,准确的实时道路感知是其成功的关键因素之一。在这个方向上最大的挑战包括遮挡、截断、照明条件和复杂的背景。为了提高车辆检测的精度和检测速度,提出了一种动态缩放网络,帮助构建平衡形状的神经网络,以最小的硬件实现最优的精度。网络架构受YOLOv5的影响,以跨阶段局部网络(Cross-Stage Partial Network, CSPNet)为骨干组成。为了更进一步,我们提出了一种自动锚生成方法,使网络适用于任何数据集。我们的神经网络通过激活函数、损失函数和优化函数进行微调,从而得到最优结果。实验结果表明,以KITTI数据集为基准,本文提出的网络具有与YOLOv4和Faster R-CNN相当的性能。
{"title":"FlexiNet: Fast and Accurate Vehicle Detection for Autonomous Vehicles","authors":"Sabeeha Mehtab, Farah Sarwar, Weiqi Yan","doi":"10.1145/3484274.3484282","DOIUrl":"https://doi.org/10.1145/3484274.3484282","url":null,"abstract":"Autonomous vehicle has come to reach on the road; however accurate road perception in real-time is one of the crucial factors towards its success. The greatest challenge in this direction includes occlusion, truncation, lighting conditions, and complex backgrounds. In order to improve the accuracy and detection speed of vehicle detection, a dynamic scaling network is proposed that assists in constructing a balanced shape neural network to achieve optimum accuracy with minimal hardware. The net architecture is influenced by YOLOv5 and is composed of Cross-Stage Partial Network (CSPNet) as its backbone. In order to go even further, we have proposed an auto-anchor generating method that makes the network suitable for any datasets. Our neural network is fine-tuned by using activation, loss, and optimization functions so as to get the optimum results. Our experimental results demonstrate that the proposed net provides comparable performance of YOLOv4 and Faster R-CNN based on KITTI dataset as the benchmark.","PeriodicalId":143540,"journal":{"name":"Proceedings of the 4th International Conference on Control and Computer Vision","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-08-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125536528","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
An original 3D reconstruction method using a conical light and a camera in underwater caves 一种利用锥形光源和摄像机在水下洞穴中进行三维重建的新颖方法
Pub Date : 2021-08-13 DOI: 10.1145/3484274.3484294
Quentin Massone, S. Druon, J. Triboulet
This paper presents a new structured light approach dedicated to 3D mapping of underwater galleries in karst aquifers. This kind of method is based on the projection of a light pattern onto the scene captured by a camera. Our originality comes from the projected pattern used, since unlike literature methods, our light projector is a simple conical diving light. With a light cone, the recovered pattern in the image is a closed 2D curve we extract with a light contour detection method we have developed. A specific calibration method has also been created to estimate the cone geometry with respect to the camera. Therefore, we get a calibrated projector-camera pair, we can use to find matches between projected and recovered patterns. Our last contribution is the recovery of 3D data by triangulation using the camera cone relationship and the extracted closed 2D curve. The experimental results that will be presented in this article show the feasibility of the method.
本文提出了一种新的结构光方法,用于岩溶含水层水下通道的三维制图。这种方法是基于一个光模式的投影到场景捕捉由相机。我们的创意来自于使用的投影模式,因为与文献方法不同,我们的灯光投影仪是一个简单的锥形潜水灯。在光锥的作用下,图像中恢复的图案是用我们开发的光轮廓检测方法提取的一个封闭的二维曲线。还创建了一种特定的校准方法来估计相对于相机的锥体几何形状。因此,我们得到一个校准的投影仪-相机对,我们可以用它来找到投影模式和恢复模式之间的匹配。我们的最后一个贡献是通过使用相机锥关系和提取的封闭2D曲线进行三角剖分来恢复3D数据。本文将给出的实验结果表明了该方法的可行性。
{"title":"An original 3D reconstruction method using a conical light and a camera in underwater caves","authors":"Quentin Massone, S. Druon, J. Triboulet","doi":"10.1145/3484274.3484294","DOIUrl":"https://doi.org/10.1145/3484274.3484294","url":null,"abstract":"This paper presents a new structured light approach dedicated to 3D mapping of underwater galleries in karst aquifers. This kind of method is based on the projection of a light pattern onto the scene captured by a camera. Our originality comes from the projected pattern used, since unlike literature methods, our light projector is a simple conical diving light. With a light cone, the recovered pattern in the image is a closed 2D curve we extract with a light contour detection method we have developed. A specific calibration method has also been created to estimate the cone geometry with respect to the camera. Therefore, we get a calibrated projector-camera pair, we can use to find matches between projected and recovered patterns. Our last contribution is the recovery of 3D data by triangulation using the camera cone relationship and the extracted closed 2D curve. The experimental results that will be presented in this article show the feasibility of the method.","PeriodicalId":143540,"journal":{"name":"Proceedings of the 4th International Conference on Control and Computer Vision","volume":"34 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-08-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124934192","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Car Image Matting 汽车图像抠图
Pub Date : 2021-08-13 DOI: 10.1145/3484274.3484279
Jiajian Huang
At present, matting technology has achieved excellent performance under laboratory conditions, but they still fail to meet the needs of some practical businesses. In this paper, according to the practical problem that the contour of the matting object is not smooth when applying matting technology to car data set, we add the smoothing loss, location information, and detailed information based on the original algorithm. The experimental results show that our improvement has achieved nice results.
目前,抠图技术在实验室条件下已经取得了优异的性能,但仍不能满足一些实际业务的需求。本文针对将抠图技术应用于汽车数据集时抠图对象轮廓不光滑的实际问题,在原有算法的基础上增加了平滑损失、位置信息和详细信息。实验结果表明,我们的改进取得了良好的效果。
{"title":"Car Image Matting","authors":"Jiajian Huang","doi":"10.1145/3484274.3484279","DOIUrl":"https://doi.org/10.1145/3484274.3484279","url":null,"abstract":"At present, matting technology has achieved excellent performance under laboratory conditions, but they still fail to meet the needs of some practical businesses. In this paper, according to the practical problem that the contour of the matting object is not smooth when applying matting technology to car data set, we add the smoothing loss, location information, and detailed information based on the original algorithm. The experimental results show that our improvement has achieved nice results.","PeriodicalId":143540,"journal":{"name":"Proceedings of the 4th International Conference on Control and Computer Vision","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-08-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132301446","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Face Template Protection through Residual Learning Based Error-Correcting Codes 基于残差学习的人脸模板纠错码保护
Pub Date : 2021-08-13 DOI: 10.1145/3484274.3484292
Junwei Zhou, D. Shang, Huile Lang, G. Ye, Zhe Xia
The leakage of the face template leads to severe security problems since the facial image is unique and irreplaceable to each individual. Many researchers have been devoted to protecting the face template. Nevertheless, to achieve high security for the face template, partial matching accuracy is usually sacrificed. The main challenge of this problem is the low inter-user variations and high intra-user variations of facial images. In this work, we propose a method integrating residual learning and error-correcting codes for face template protection. In particular, the proposed method consists of two major components: (a) a deep residual network component mapping facial images to polar codewords assigned to users, and (b) a polar decoder reducing noise brought by high intra-user variations in the predicted codewords. The proposed method is evaluated on extended Yale B, CMU-PIE, and FEI databases. It provides high security of face template and achieves a high (100%) genuine accept rate at a low false accept rate (0%) simultaneously, which outperforms most state-of-the-arts.
人脸模板的泄露会导致严重的安全问题,因为人脸图像对每个人来说都是独一无二的、不可替代的。许多研究者一直致力于保护人脸模板。然而,为了保证人脸模板的高安全性,往往会牺牲部分匹配精度。该问题的主要挑战是面部图像的低用户间变化和高用户内部变化。在这项工作中,我们提出了一种残差学习和纠错码相结合的人脸模板保护方法。特别地,所提出的方法由两个主要部分组成:(a)将面部图像映射到分配给用户的极性码字的深度残差网络组件,以及(b)极化解码器,用于减少预测码字中用户内部高度变化带来的噪声。在扩展的Yale B、CMU-PIE和FEI数据库上对该方法进行了评估。它提供了人脸模板的高安全性,同时实现了高(100%)的真实接受率和低(0%)的假接受率,优于目前最先进的技术。
{"title":"Face Template Protection through Residual Learning Based Error-Correcting Codes","authors":"Junwei Zhou, D. Shang, Huile Lang, G. Ye, Zhe Xia","doi":"10.1145/3484274.3484292","DOIUrl":"https://doi.org/10.1145/3484274.3484292","url":null,"abstract":"The leakage of the face template leads to severe security problems since the facial image is unique and irreplaceable to each individual. Many researchers have been devoted to protecting the face template. Nevertheless, to achieve high security for the face template, partial matching accuracy is usually sacrificed. The main challenge of this problem is the low inter-user variations and high intra-user variations of facial images. In this work, we propose a method integrating residual learning and error-correcting codes for face template protection. In particular, the proposed method consists of two major components: (a) a deep residual network component mapping facial images to polar codewords assigned to users, and (b) a polar decoder reducing noise brought by high intra-user variations in the predicted codewords. The proposed method is evaluated on extended Yale B, CMU-PIE, and FEI databases. It provides high security of face template and achieves a high (100%) genuine accept rate at a low false accept rate (0%) simultaneously, which outperforms most state-of-the-arts.","PeriodicalId":143540,"journal":{"name":"Proceedings of the 4th International Conference on Control and Computer Vision","volume":"27 4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-08-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133761760","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Research on Road Network Congestion Propagation Based on Complex Network and SIR Model 基于复杂网络和SIR模型的道路网络拥塞传播研究
Pub Date : 2021-08-13 DOI: 10.1145/3484274.3484300
Xiaofan Song, Lidong Zhang, Wei Zhang
This paper analyzed the dynamic evolution process of road traffic network congestion from a holistic and macroscopic point of view, and establishes a road network congestion propagation model based on the similarity between traffic congestion propagation and infectious disease propagation and the topological structure information of roads. The model takes the road section as the research object, and analyzes the effect of different parameters on congestion propagation through simulation. The simulation results show that the road network congestion propagation model constructed in this paper is consistent with the actual situation, which can provide a new idea for road network congestion management and congestion prediction, and has certain practical application value.
本文从整体和宏观的角度分析了道路交通网络拥堵的动态演化过程,基于交通拥堵传播与传染病传播的相似性和道路的拓扑结构信息,建立了道路网络拥堵传播模型。该模型以路段为研究对象,通过仿真分析了不同参数对拥堵传播的影响。仿真结果表明,本文构建的路网拥塞传播模型与实际情况相符,可为路网拥塞管理和拥塞预测提供新的思路,具有一定的实际应用价值。
{"title":"Research on Road Network Congestion Propagation Based on Complex Network and SIR Model","authors":"Xiaofan Song, Lidong Zhang, Wei Zhang","doi":"10.1145/3484274.3484300","DOIUrl":"https://doi.org/10.1145/3484274.3484300","url":null,"abstract":"This paper analyzed the dynamic evolution process of road traffic network congestion from a holistic and macroscopic point of view, and establishes a road network congestion propagation model based on the similarity between traffic congestion propagation and infectious disease propagation and the topological structure information of roads. The model takes the road section as the research object, and analyzes the effect of different parameters on congestion propagation through simulation. The simulation results show that the road network congestion propagation model constructed in this paper is consistent with the actual situation, which can provide a new idea for road network congestion management and congestion prediction, and has certain practical application value.","PeriodicalId":143540,"journal":{"name":"Proceedings of the 4th International Conference on Control and Computer Vision","volume":"55 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-08-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126638928","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Proceedings of the 4th International Conference on Control and Computer Vision
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1