Liyun Zhang, P. Ratsamee, Yuuki Uranishi, Manabu Higashida, H. Takemura
{"title":"增强热视觉视觉里程计的热色图像转换","authors":"Liyun Zhang, P. Ratsamee, Yuuki Uranishi, Manabu Higashida, H. Takemura","doi":"10.1109/SSRR56537.2022.10018810","DOIUrl":null,"url":null,"abstract":"A panoptic perception-based generative adversarial network for thermal-to-color image translation is proposed to demonstrate its potential as an image sequence enhancement for monocular visual odometry in blurry and low-resolution thermal vision. The pre-trained panoptic segmentation model is utilized to obtain the panoptic perception (i.e., bounding boxes, categories, and masks) of the image scene to guide the alignment between the object content codes of the original thermal domain and panoptic-level style codes sampled from the target color style space. A feature masking module further refines the style-aligned object representations for sharpening object boundaries to synthesize higher fidelity translated color image sequences. The extensive experimental evaluation shows that our method outperforms other thermal-to-color image translation methods in the image quality of translated color images. We demonstrate that the enhanced image sequences significantly improve the performance of monocular visual odometry compared with dif-ferent competing methods including thermal image sequences.","PeriodicalId":272862,"journal":{"name":"2022 IEEE International Symposium on Safety, Security, and Rescue Robotics (SSRR)","volume":"3 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-11-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Thermal-to-Color Image Translation for Enhancing Visual Odometry of Thermal Vision\",\"authors\":\"Liyun Zhang, P. Ratsamee, Yuuki Uranishi, Manabu Higashida, H. Takemura\",\"doi\":\"10.1109/SSRR56537.2022.10018810\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"A panoptic perception-based generative adversarial network for thermal-to-color image translation is proposed to demonstrate its potential as an image sequence enhancement for monocular visual odometry in blurry and low-resolution thermal vision. The pre-trained panoptic segmentation model is utilized to obtain the panoptic perception (i.e., bounding boxes, categories, and masks) of the image scene to guide the alignment between the object content codes of the original thermal domain and panoptic-level style codes sampled from the target color style space. A feature masking module further refines the style-aligned object representations for sharpening object boundaries to synthesize higher fidelity translated color image sequences. The extensive experimental evaluation shows that our method outperforms other thermal-to-color image translation methods in the image quality of translated color images. We demonstrate that the enhanced image sequences significantly improve the performance of monocular visual odometry compared with dif-ferent competing methods including thermal image sequences.\",\"PeriodicalId\":272862,\"journal\":{\"name\":\"2022 IEEE International Symposium on Safety, Security, and Rescue Robotics (SSRR)\",\"volume\":\"3 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-11-08\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 IEEE International Symposium on Safety, Security, and Rescue Robotics (SSRR)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/SSRR56537.2022.10018810\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE International Symposium on Safety, Security, and Rescue Robotics (SSRR)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SSRR56537.2022.10018810","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Thermal-to-Color Image Translation for Enhancing Visual Odometry of Thermal Vision
A panoptic perception-based generative adversarial network for thermal-to-color image translation is proposed to demonstrate its potential as an image sequence enhancement for monocular visual odometry in blurry and low-resolution thermal vision. The pre-trained panoptic segmentation model is utilized to obtain the panoptic perception (i.e., bounding boxes, categories, and masks) of the image scene to guide the alignment between the object content codes of the original thermal domain and panoptic-level style codes sampled from the target color style space. A feature masking module further refines the style-aligned object representations for sharpening object boundaries to synthesize higher fidelity translated color image sequences. The extensive experimental evaluation shows that our method outperforms other thermal-to-color image translation methods in the image quality of translated color images. We demonstrate that the enhanced image sequences significantly improve the performance of monocular visual odometry compared with dif-ferent competing methods including thermal image sequences.