Optimized instance segmentation by super-resolution and maximal clique generation

IF 5.8 2区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Integrated Computer-Aided Engineering Pub Date : 2023-02-10 DOI:10.3233/ica-230700

Iván García Aguilar, Jorge García-González, Rafael Marcos Luque Baena, Ezequiel López-Rubio, E. Domínguez

{"title":"Optimized instance segmentation by super-resolution and maximal clique generation","authors":"Iván García Aguilar, Jorge García-González, Rafael Marcos Luque Baena, Ezequiel López-Rubio, E. Domínguez","doi":"10.3233/ica-230700","DOIUrl":null,"url":null,"abstract":"The rise of surveillance systems has led to exponential growth in collected data, enabling several advances in Deep Learning to exploit them and automate tasks for autonomous systems. Vehicle detection is a crucial task in the fields of Intelligent Vehicle Systems and Intelligent Transport systems, making it possible to control traffic density or detect accidents and potential risks. This paper presents an optimal meta-method that can be applied to any instant segmentation model, such as Mask R-CNN or YOLACT++. Using the initial detections obtained by these models and super-resolution, an optimized re-inference is performed, allowing the detection of elements not identified a priori and improving the quality of the rest of the detections. The direct application of super-resolution is limited because instance segmentation models process images according to a fixed dimension. Therefore, in cases where the super-resolved images exceed this fixed size, the model will rescale them again, thus losing the desired effect. The advantages of this meta-method lie mainly in the fact that it is not required to modify the model architecture or re-train it. Regardless of the size of the images given as input, super-resolved areas that fit the defined dimension of the object segmentation model will be generated. After applying our proposal, experiments show an improvement of up to 8.1% for the YOLACT++ model used in the Jena sequence of the CityScapes dataset.","PeriodicalId":50358,"journal":{"name":"Integrated Computer-Aided Engineering","volume":"23 1","pages":"243-256"},"PeriodicalIF":5.8000,"publicationDate":"2023-02-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Integrated Computer-Aided Engineering","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.3233/ica-230700","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 2

Abstract

The rise of surveillance systems has led to exponential growth in collected data, enabling several advances in Deep Learning to exploit them and automate tasks for autonomous systems. Vehicle detection is a crucial task in the fields of Intelligent Vehicle Systems and Intelligent Transport systems, making it possible to control traffic density or detect accidents and potential risks. This paper presents an optimal meta-method that can be applied to any instant segmentation model, such as Mask R-CNN or YOLACT++. Using the initial detections obtained by these models and super-resolution, an optimized re-inference is performed, allowing the detection of elements not identified a priori and improving the quality of the rest of the detections. The direct application of super-resolution is limited because instance segmentation models process images according to a fixed dimension. Therefore, in cases where the super-resolved images exceed this fixed size, the model will rescale them again, thus losing the desired effect. The advantages of this meta-method lie mainly in the fact that it is not required to modify the model architecture or re-train it. Regardless of the size of the images given as input, super-resolved areas that fit the defined dimension of the object segmentation model will be generated. After applying our proposal, experiments show an improvement of up to 8.1% for the YOLACT++ model used in the Jena sequence of the CityScapes dataset.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

通过超分辨率和最大团生成优化实例分割

监控系统的兴起导致了收集数据的指数级增长，使深度学习的一些进展能够利用它们并自动执行自主系统的任务。车辆检测是智能车辆系统和智能交通系统领域的一项关键任务，它可以控制交通密度或检测事故和潜在风险。本文提出了一种最优元方法，可以应用于任何即时分割模型，如Mask R-CNN或yolact++。利用这些模型获得的初始检测和超分辨率，进行优化的重新推理，允许检测未先验识别的元素，并提高其余检测的质量。由于实例分割模型按照固定的维数对图像进行处理，限制了超分辨率的直接应用。因此，当超分辨率图像超过这个固定尺寸时，模型会重新缩放，从而失去预期的效果。这种元方法的优点主要在于它不需要修改模型体系结构或重新训练它。无论输入图像的大小如何，都会生成符合目标分割模型定义尺寸的超分辨率区域。应用我们的建议后，实验表明，在城市景观数据集的耶拿序列中使用的yolact++模型的改进幅度高达8.1%。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Integrated Computer-Aided Engineering 工程技术-工程：综合

CiteScore

9.90

自引率

21.50%

发文量

审稿时长

>12 weeks

期刊介绍： Integrated Computer-Aided Engineering (ICAE) was founded in 1993. "Based on the premise that interdisciplinary thinking and synergistic collaboration of disciplines can solve complex problems, open new frontiers, and lead to true innovations and breakthroughs, the cornerstone of industrial competitiveness and advancement of the society" as noted in the inaugural issue of the journal. The focus of ICAE is the integration of leading edge and emerging computer and information technologies for innovative solution of engineering problems. The journal fosters interdisciplinary research and presents a unique forum for innovative computer-aided engineering. It also publishes novel industrial applications of CAE, thus helping to bring new computational paradigms from research labs and classrooms to reality. Areas covered by the journal include (but are not limited to) artificial intelligence, advanced signal processing, biologically inspired computing, cognitive modeling, concurrent engineering, database management, distributed computing, evolutionary computing, fuzzy logic, genetic algorithms, geometric modeling, intelligent and adaptive systems, internet-based technologies, knowledge discovery and engineering, machine learning, mechatronics, mobile computing, multimedia technologies, networking, neural network computing, object-oriented systems, optimization and search, parallel processing, robotics virtual reality, and visualization techniques.