{"title":"OHMA: 用于探索丰富特征表达的基于边缘的轻量级闭合目标再识别框架","authors":"Xiaoyu Zhang;Yichao Wang;Xiting Peng;Mianxiong Dong;Kaoru Ota;Lexi Xu","doi":"10.1109/TCE.2024.3443336","DOIUrl":null,"url":null,"abstract":"The rise of the Internet of Things (IoT) and the Internet of Vehicles (IoV) has accelerated the realization of smart cities, where cameras as interconnected consumer electronics (CE) are deployed across cities to capture target images. The widespread deployment of monitoring equipment has prompted us to focus on the target re-identification (Re-ID) issue. One major challenge about this issue is that the identified targets are often obscured by different obstacles, which leads to bad performance. In practical applications, the occluded Re-ID task is very significant to complete. Previous approaches have focused on improving the occluded Re-ID performance but have neglected the lightweight problem, which makes the model difficult to deploy in the real world. Therefore, this paper proposes a lightweight framework that ensures occluded Re-ID performance and deploys at the edge to solve the problem of long transmission time and high latency caused by wireless and cloud technology in CE. This framework tackles occluded target Re-ID issues by integrating omni-scale features with human keypoint estimation and multi-head attention mechanism (OHMA). To solve the vehicle Re-ID problem, we use the cutout method to simulate an occlusion scene due to the lack of occluded vehicle data. Then, The multi-head attention mechanism combines with the omni-scale network (OSNet) to learn vehicles subtle features. To deal with occluded pedestrians, human keypoint estimation focuses on non-occluded areas of pedestrian images by paying attention to visible information about the human body. The generated heatmaps fuse omni-scale feature maps to explore better feature representations. In addition, the HUAWEI Atlas 200I DK A2 is used to simulate real edge devices and evaluate the experiments on both public and real-world private datasets. The results demonstrate that our framework improves the occluded Re-ID performance while ensuring lightweight. Compared with the previous methods, OHMA displays advantages in occlusion scenes.","PeriodicalId":13208,"journal":{"name":"IEEE Transactions on Consumer Electronics","volume":"70 4","pages":"7424-7435"},"PeriodicalIF":4.3000,"publicationDate":"2024-08-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"OHMA: An Edge-Based Lightweight Occluded Target Re-Identification Framework for Exploring Abundant Feature Expression\",\"authors\":\"Xiaoyu Zhang;Yichao Wang;Xiting Peng;Mianxiong Dong;Kaoru Ota;Lexi Xu\",\"doi\":\"10.1109/TCE.2024.3443336\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The rise of the Internet of Things (IoT) and the Internet of Vehicles (IoV) has accelerated the realization of smart cities, where cameras as interconnected consumer electronics (CE) are deployed across cities to capture target images. The widespread deployment of monitoring equipment has prompted us to focus on the target re-identification (Re-ID) issue. One major challenge about this issue is that the identified targets are often obscured by different obstacles, which leads to bad performance. In practical applications, the occluded Re-ID task is very significant to complete. Previous approaches have focused on improving the occluded Re-ID performance but have neglected the lightweight problem, which makes the model difficult to deploy in the real world. Therefore, this paper proposes a lightweight framework that ensures occluded Re-ID performance and deploys at the edge to solve the problem of long transmission time and high latency caused by wireless and cloud technology in CE. This framework tackles occluded target Re-ID issues by integrating omni-scale features with human keypoint estimation and multi-head attention mechanism (OHMA). To solve the vehicle Re-ID problem, we use the cutout method to simulate an occlusion scene due to the lack of occluded vehicle data. Then, The multi-head attention mechanism combines with the omni-scale network (OSNet) to learn vehicles subtle features. To deal with occluded pedestrians, human keypoint estimation focuses on non-occluded areas of pedestrian images by paying attention to visible information about the human body. The generated heatmaps fuse omni-scale feature maps to explore better feature representations. In addition, the HUAWEI Atlas 200I DK A2 is used to simulate real edge devices and evaluate the experiments on both public and real-world private datasets. The results demonstrate that our framework improves the occluded Re-ID performance while ensuring lightweight. Compared with the previous methods, OHMA displays advantages in occlusion scenes.\",\"PeriodicalId\":13208,\"journal\":{\"name\":\"IEEE Transactions on Consumer Electronics\",\"volume\":\"70 4\",\"pages\":\"7424-7435\"},\"PeriodicalIF\":4.3000,\"publicationDate\":\"2024-08-26\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Transactions on Consumer Electronics\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10648979/\",\"RegionNum\":2,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, ELECTRICAL & ELECTRONIC\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Consumer Electronics","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10648979/","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
OHMA: An Edge-Based Lightweight Occluded Target Re-Identification Framework for Exploring Abundant Feature Expression
The rise of the Internet of Things (IoT) and the Internet of Vehicles (IoV) has accelerated the realization of smart cities, where cameras as interconnected consumer electronics (CE) are deployed across cities to capture target images. The widespread deployment of monitoring equipment has prompted us to focus on the target re-identification (Re-ID) issue. One major challenge about this issue is that the identified targets are often obscured by different obstacles, which leads to bad performance. In practical applications, the occluded Re-ID task is very significant to complete. Previous approaches have focused on improving the occluded Re-ID performance but have neglected the lightweight problem, which makes the model difficult to deploy in the real world. Therefore, this paper proposes a lightweight framework that ensures occluded Re-ID performance and deploys at the edge to solve the problem of long transmission time and high latency caused by wireless and cloud technology in CE. This framework tackles occluded target Re-ID issues by integrating omni-scale features with human keypoint estimation and multi-head attention mechanism (OHMA). To solve the vehicle Re-ID problem, we use the cutout method to simulate an occlusion scene due to the lack of occluded vehicle data. Then, The multi-head attention mechanism combines with the omni-scale network (OSNet) to learn vehicles subtle features. To deal with occluded pedestrians, human keypoint estimation focuses on non-occluded areas of pedestrian images by paying attention to visible information about the human body. The generated heatmaps fuse omni-scale feature maps to explore better feature representations. In addition, the HUAWEI Atlas 200I DK A2 is used to simulate real edge devices and evaluate the experiments on both public and real-world private datasets. The results demonstrate that our framework improves the occluded Re-ID performance while ensuring lightweight. Compared with the previous methods, OHMA displays advantages in occlusion scenes.
期刊介绍:
The main focus for the IEEE Transactions on Consumer Electronics is the engineering and research aspects of the theory, design, construction, manufacture or end use of mass market electronics, systems, software and services for consumers.