Afsana Airin, Rezab Ud Dawla, Ahmed Shabab Noor, Muhib Al Hasan, Ahmed Rafi Hasan, Akib Zaman, D. Farid
{"title":"基于注意力的场景图生成:综述","authors":"Afsana Airin, Rezab Ud Dawla, Ahmed Shabab Noor, Muhib Al Hasan, Ahmed Rafi Hasan, Akib Zaman, D. Farid","doi":"10.1109/SKIMA57145.2022.10029570","DOIUrl":null,"url":null,"abstract":"The automated creation of a semantic structural scene graph from an image or video is known as scene graph generation (SGG), which includes accurate labeling of all objects that are identified and the interconnections between them. Several SGG methods have been proposed employing deep learning techniques nowadays to achieve good results but most of the approaches failed to integrate the contextual information of pair of objects. Apart from the existing state of the arts of SGG, the attention mechanism is creating a new horizon in this field. This paper offers a thorough analysis of the most recent Attention-Based Scene Graph Generation techniques. In this paper, we have compared and tested five existing Attention-Based Scene Graph Generation methods. We have summarised the results of existing methods to understand progress in this field of Scene Graph Generation. Moreover, we have discussed the strengths of existing techniques and future directions of attention-based models in Scene Graph Generation.","PeriodicalId":277436,"journal":{"name":"2022 14th International Conference on Software, Knowledge, Information Management and Applications (SKIMA)","volume":"33 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-12-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Attention-Based Scene Graph Generation: A Review\",\"authors\":\"Afsana Airin, Rezab Ud Dawla, Ahmed Shabab Noor, Muhib Al Hasan, Ahmed Rafi Hasan, Akib Zaman, D. Farid\",\"doi\":\"10.1109/SKIMA57145.2022.10029570\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The automated creation of a semantic structural scene graph from an image or video is known as scene graph generation (SGG), which includes accurate labeling of all objects that are identified and the interconnections between them. Several SGG methods have been proposed employing deep learning techniques nowadays to achieve good results but most of the approaches failed to integrate the contextual information of pair of objects. Apart from the existing state of the arts of SGG, the attention mechanism is creating a new horizon in this field. This paper offers a thorough analysis of the most recent Attention-Based Scene Graph Generation techniques. In this paper, we have compared and tested five existing Attention-Based Scene Graph Generation methods. We have summarised the results of existing methods to understand progress in this field of Scene Graph Generation. Moreover, we have discussed the strengths of existing techniques and future directions of attention-based models in Scene Graph Generation.\",\"PeriodicalId\":277436,\"journal\":{\"name\":\"2022 14th International Conference on Software, Knowledge, Information Management and Applications (SKIMA)\",\"volume\":\"33 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-12-02\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 14th International Conference on Software, Knowledge, Information Management and Applications (SKIMA)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/SKIMA57145.2022.10029570\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 14th International Conference on Software, Knowledge, Information Management and Applications (SKIMA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SKIMA57145.2022.10029570","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
The automated creation of a semantic structural scene graph from an image or video is known as scene graph generation (SGG), which includes accurate labeling of all objects that are identified and the interconnections between them. Several SGG methods have been proposed employing deep learning techniques nowadays to achieve good results but most of the approaches failed to integrate the contextual information of pair of objects. Apart from the existing state of the arts of SGG, the attention mechanism is creating a new horizon in this field. This paper offers a thorough analysis of the most recent Attention-Based Scene Graph Generation techniques. In this paper, we have compared and tested five existing Attention-Based Scene Graph Generation methods. We have summarised the results of existing methods to understand progress in this field of Scene Graph Generation. Moreover, we have discussed the strengths of existing techniques and future directions of attention-based models in Scene Graph Generation.