卷积神经网络在语义图像分割中的性能评价

Computer Science Research Notes Pub Date : 1900-01-01 DOI:10.24132/csrn.2019.2901.1.4

Alexander Leipnitz, T. Strutz, O. Jokisch

{"title":"卷积神经网络在语义图像分割中的性能评价","authors":"Alexander Leipnitz, T. Strutz, O. Jokisch","doi":"10.24132/csrn.2019.2901.1.4","DOIUrl":null,"url":null,"abstract":"Convolutional neural networks are applied successfully for image classification and object detection. Recently, they have been adopted to semantic segmentation tasks and several new network architectures have been proposed. With respect to automotive applications, the Cityscapes dataset is often used as a benchmark. It is one of the biggest datasets in this field and consists of a training, a validation, and a test set. While training and validation allow the optimisation of these nets, the test dataset can be used to evaluate their performance.\nOur investigations have shown that while these networks perform well for images of the Cityscapes dataset, their segmentation quality significantly drops when applied to new data. It seems that they have limited generalisation abilities. In order to find out whether the image content itself or other image properties cause this effect, we have carried out systematic investigations with modified Cityscapes data. We have found that camera-dependent image properties like brightness, contrast, or saturation can significantly influence the segmentation quality. This papers presents the results of these tests including eight state-of-the-art CNNs. It can be concluded that the out-of-the-boxusage of CNNs in real-world environments is not recommended.","PeriodicalId":322214,"journal":{"name":"Computer Science Research Notes","volume":"171 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Performance Assessment of Convolutional Neural Networks for Semantic Image Segmentation\",\"authors\":\"Alexander Leipnitz, T. Strutz, O. Jokisch\",\"doi\":\"10.24132/csrn.2019.2901.1.4\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Convolutional neural networks are applied successfully for image classification and object detection. Recently, they have been adopted to semantic segmentation tasks and several new network architectures have been proposed. With respect to automotive applications, the Cityscapes dataset is often used as a benchmark. It is one of the biggest datasets in this field and consists of a training, a validation, and a test set. While training and validation allow the optimisation of these nets, the test dataset can be used to evaluate their performance.\\nOur investigations have shown that while these networks perform well for images of the Cityscapes dataset, their segmentation quality significantly drops when applied to new data. It seems that they have limited generalisation abilities. In order to find out whether the image content itself or other image properties cause this effect, we have carried out systematic investigations with modified Cityscapes data. We have found that camera-dependent image properties like brightness, contrast, or saturation can significantly influence the segmentation quality. This papers presents the results of these tests including eight state-of-the-art CNNs. It can be concluded that the out-of-the-boxusage of CNNs in real-world environments is not recommended.\",\"PeriodicalId\":322214,\"journal\":{\"name\":\"Computer Science Research Notes\",\"volume\":\"171 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"1900-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Computer Science Research Notes\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.24132/csrn.2019.2901.1.4\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computer Science Research Notes","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.24132/csrn.2019.2901.1.4","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

摘要

卷积神经网络成功地应用于图像分类和目标检测。近年来，它们被用于语义分割任务，并提出了几种新的网络结构。对于汽车应用程序，cityscape数据集经常被用作基准。它是该领域最大的数据集之一，由训练集、验证集和测试集组成。虽然训练和验证允许优化这些网络，但测试数据集可以用来评估它们的性能。我们的研究表明，虽然这些网络对cityscape数据集的图像表现良好，但当应用于新数据时，它们的分割质量显著下降。似乎他们的泛化能力有限。为了找出是图像内容本身还是其他图像属性造成了这种影响，我们对修改后的cityscape数据进行了系统的调查。我们发现，相机相关的图像属性，如亮度，对比度，或饱和度可以显著影响分割质量。本文介绍了这些测试的结果，包括八个最先进的cnn。可以得出结论，不建议在现实环境中使用cnn的开箱即用。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Performance Assessment of Convolutional Neural Networks for Semantic Image Segmentation

Convolutional neural networks are applied successfully for image classification and object detection. Recently, they have been adopted to semantic segmentation tasks and several new network architectures have been proposed. With respect to automotive applications, the Cityscapes dataset is often used as a benchmark. It is one of the biggest datasets in this field and consists of a training, a validation, and a test set. While training and validation allow the optimisation of these nets, the test dataset can be used to evaluate their performance. Our investigations have shown that while these networks perform well for images of the Cityscapes dataset, their segmentation quality significantly drops when applied to new data. It seems that they have limited generalisation abilities. In order to find out whether the image content itself or other image properties cause this effect, we have carried out systematic investigations with modified Cityscapes data. We have found that camera-dependent image properties like brightness, contrast, or saturation can significantly influence the segmentation quality. This papers presents the results of these tests including eight state-of-the-art CNNs. It can be concluded that the out-of-the-boxusage of CNNs in real-world environments is not recommended.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Computer Science Research Notes

自引率

0.00%

发文量

期刊最新文献

The Usage of the BP-Layers Stereo Matching Algorithm with the EBCA Camera Set Evolutionary-Edge Bundling with Concatenation Process of Control Points Fast Incremental Image Reconstruction with CNN-enhanced Poisson Interpolation Temporal Segmentation of Actions in Fencing Footwork Training Low-Rank Rational Approximation of Natural Trochoid Parameterizations