Tristan Wirth, Aria Jamili, Max von Bülow, V. Knauthe, S. Guthe
{"title":"透明结构通用单目深度估计体系的适应度","authors":"Tristan Wirth, Aria Jamili, Max von Bülow, V. Knauthe, S. Guthe","doi":"10.2312/egs.20221020","DOIUrl":null,"url":null,"abstract":"Due to material properties, monocular depth estimation of transparent structures is inherently challenging. Recent advances leverage additional knowledge that is not available in all contexts, i.e., known shape or depth information from a sensor. General-purpose machine learning models, that do not utilize such additional knowledge, have not yet been explicitly evaluated regarding their performance on transparent structures. In this work, we show that these models show poor performance on the depth estimation of transparent structures. However, fine-tuning on suitable data sets, such as ClearGrasp, increases their estimation performance on the task at hand. Our evaluations show that high performance on general-purpose benchmarks translates well into performance on transparent objects after fine-tuning. Furthermore, our analysis suggests, that state-of-theart high-performing models are not able to capture a high grade of detail from both the image foreground and background at the same time. This finding shows the demand for a combination of existing models to further enhance depth estimation quality. CCS Concepts • Computing methodologies → Computer vision; Shape inference;","PeriodicalId":72958,"journal":{"name":"Eurographics ... Workshop on 3D Object Retrieval : EG 3DOR. Eurographics Workshop on 3D Object Retrieval","volume":"17 1","pages":"9-12"},"PeriodicalIF":0.0000,"publicationDate":"2022-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Fitness of General-Purpose Monocular Depth Estimation Architectures for Transparent Structures\",\"authors\":\"Tristan Wirth, Aria Jamili, Max von Bülow, V. Knauthe, S. Guthe\",\"doi\":\"10.2312/egs.20221020\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Due to material properties, monocular depth estimation of transparent structures is inherently challenging. Recent advances leverage additional knowledge that is not available in all contexts, i.e., known shape or depth information from a sensor. General-purpose machine learning models, that do not utilize such additional knowledge, have not yet been explicitly evaluated regarding their performance on transparent structures. In this work, we show that these models show poor performance on the depth estimation of transparent structures. However, fine-tuning on suitable data sets, such as ClearGrasp, increases their estimation performance on the task at hand. Our evaluations show that high performance on general-purpose benchmarks translates well into performance on transparent objects after fine-tuning. Furthermore, our analysis suggests, that state-of-theart high-performing models are not able to capture a high grade of detail from both the image foreground and background at the same time. This finding shows the demand for a combination of existing models to further enhance depth estimation quality. CCS Concepts • Computing methodologies → Computer vision; Shape inference;\",\"PeriodicalId\":72958,\"journal\":{\"name\":\"Eurographics ... Workshop on 3D Object Retrieval : EG 3DOR. Eurographics Workshop on 3D Object Retrieval\",\"volume\":\"17 1\",\"pages\":\"9-12\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Eurographics ... Workshop on 3D Object Retrieval : EG 3DOR. Eurographics Workshop on 3D Object Retrieval\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.2312/egs.20221020\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Eurographics ... Workshop on 3D Object Retrieval : EG 3DOR. Eurographics Workshop on 3D Object Retrieval","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.2312/egs.20221020","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Fitness of General-Purpose Monocular Depth Estimation Architectures for Transparent Structures
Due to material properties, monocular depth estimation of transparent structures is inherently challenging. Recent advances leverage additional knowledge that is not available in all contexts, i.e., known shape or depth information from a sensor. General-purpose machine learning models, that do not utilize such additional knowledge, have not yet been explicitly evaluated regarding their performance on transparent structures. In this work, we show that these models show poor performance on the depth estimation of transparent structures. However, fine-tuning on suitable data sets, such as ClearGrasp, increases their estimation performance on the task at hand. Our evaluations show that high performance on general-purpose benchmarks translates well into performance on transparent objects after fine-tuning. Furthermore, our analysis suggests, that state-of-theart high-performing models are not able to capture a high grade of detail from both the image foreground and background at the same time. This finding shows the demand for a combination of existing models to further enhance depth estimation quality. CCS Concepts • Computing methodologies → Computer vision; Shape inference;