{"title":"用于时间动作分割的动态图的延迟嵌入","authors":"Jun-Bin Zhang, Pei-Hsuan Tsai, Meng-Hsun Tsai","doi":"10.1109/IS3C57901.2023.00049","DOIUrl":null,"url":null,"abstract":"In real-world interactive applications, where videos are generated in real-time and require immediate feedback, online segmentation has practical advantages over offline inference. Many excellent previous models have been developed for offline scenarios, while real-time prediction for temporal action segmentation (TAS) is a difficult task. Some interactive applications can tolerate a certain amount of delay. In this paper, we propose a node delay embedding of a dynamic graph for real-time TAS. We transform the video stream into a dynamic graph stream that evolves over time. We define past, current, and future nodes to construct sub-graphs at each step. Specifically, future nodes are sampled using our proposed node delay method. A graph model is utilized to aggregate past, current, and future node information to update the representation of current nodes and predict their labels. To the best of our knowledge, it is the first real-time TAS graph model with delay embedding. Experiments show that delay embedding enhances node representation and improves performance. Overall, our proposed approach provides a promising solution for real-time TAS.","PeriodicalId":142483,"journal":{"name":"2023 Sixth International Symposium on Computer, Consumer and Control (IS3C)","volume":"66 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"DEDGraph: Delay Embedding of Dynamic Graph for Temporal Action Segmentation\",\"authors\":\"Jun-Bin Zhang, Pei-Hsuan Tsai, Meng-Hsun Tsai\",\"doi\":\"10.1109/IS3C57901.2023.00049\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In real-world interactive applications, where videos are generated in real-time and require immediate feedback, online segmentation has practical advantages over offline inference. Many excellent previous models have been developed for offline scenarios, while real-time prediction for temporal action segmentation (TAS) is a difficult task. Some interactive applications can tolerate a certain amount of delay. In this paper, we propose a node delay embedding of a dynamic graph for real-time TAS. We transform the video stream into a dynamic graph stream that evolves over time. We define past, current, and future nodes to construct sub-graphs at each step. Specifically, future nodes are sampled using our proposed node delay method. A graph model is utilized to aggregate past, current, and future node information to update the representation of current nodes and predict their labels. To the best of our knowledge, it is the first real-time TAS graph model with delay embedding. Experiments show that delay embedding enhances node representation and improves performance. Overall, our proposed approach provides a promising solution for real-time TAS.\",\"PeriodicalId\":142483,\"journal\":{\"name\":\"2023 Sixth International Symposium on Computer, Consumer and Control (IS3C)\",\"volume\":\"66 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-06-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2023 Sixth International Symposium on Computer, Consumer and Control (IS3C)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/IS3C57901.2023.00049\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 Sixth International Symposium on Computer, Consumer and Control (IS3C)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IS3C57901.2023.00049","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
DEDGraph: Delay Embedding of Dynamic Graph for Temporal Action Segmentation
In real-world interactive applications, where videos are generated in real-time and require immediate feedback, online segmentation has practical advantages over offline inference. Many excellent previous models have been developed for offline scenarios, while real-time prediction for temporal action segmentation (TAS) is a difficult task. Some interactive applications can tolerate a certain amount of delay. In this paper, we propose a node delay embedding of a dynamic graph for real-time TAS. We transform the video stream into a dynamic graph stream that evolves over time. We define past, current, and future nodes to construct sub-graphs at each step. Specifically, future nodes are sampled using our proposed node delay method. A graph model is utilized to aggregate past, current, and future node information to update the representation of current nodes and predict their labels. To the best of our knowledge, it is the first real-time TAS graph model with delay embedding. Experiments show that delay embedding enhances node representation and improves performance. Overall, our proposed approach provides a promising solution for real-time TAS.