Human-object interaction detection based on graph model
Qing Ye, Xiujuan Xu
International Conference on Artificial Intelligence and Computer Engineering (ICAICE 2022), published 2023-04-28
DOI: 10.1117/12.2671248 (https://doi.org/10.1117/12.2671248)
Abstract
Human-Object Interaction (HOI) detection is a fundamental task in understanding real-world scenes. This paper proposes a graph-model-based HOI detection algorithm that makes full use of the visual-spatial features and semantic information of the human and object instances in an image in order to improve interaction detection accuracy. The visual features of the human and object instance boxes serve as nodes, and the spatial features of the corresponding interaction relations serve as edges; together they form an initial dense graph, which is then adaptively updated using the spatial and semantic information of the instances. Evaluated on the V-COCO dataset, the algorithm shows a significant improvement in accuracy, which supports its effectiveness.
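The graph construction described in the abstract can be illustrated with a minimal sketch. The abstract does not specify the edge encoding or the update rule, so everything below is an assumption made for illustration: the `Instance` container, the five-dimensional edge feature (normalized center offsets, log size ratios, IoU), and the choice to connect every person node to every other node are hypothetical and are not taken from the paper. The adaptive graph update is not sketched because the abstract gives no detail on it.

```python
import math
from dataclasses import dataclass


@dataclass
class Instance:
    """Hypothetical container for one detected instance."""
    box: tuple      # (x1, y1, x2, y2) in pixels
    feature: list   # visual feature vector (placeholder values here)
    label: str      # detector class name, e.g. "person"


def iou(a, b):
    """Intersection over union of two (x1, y1, x2, y2) boxes."""
    iw = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    ih = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = iw * ih
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def spatial_edge_feature(h, o):
    """Assumed edge encoding: relative geometry of object box o w.r.t. human box h."""
    hw, hh = h[2] - h[0], h[3] - h[1]
    ow, oh = o[2] - o[0], o[3] - o[1]
    hcx, hcy = (h[0] + h[2]) / 2, (h[1] + h[3]) / 2
    ocx, ocy = (o[0] + o[2]) / 2, (o[1] + o[3]) / 2
    return [
        (ocx - hcx) / hw,    # horizontal center offset, normalized by human width
        (ocy - hcy) / hh,    # vertical center offset, normalized by human height
        math.log(ow / hw),   # log width ratio
        math.log(oh / hh),   # log height ratio
        iou(h, o),           # overlap between the two boxes
    ]


def build_dense_graph(instances):
    """Nodes are visual features; edges run from each person to every other instance."""
    nodes = [inst.feature for inst in instances]
    edges = {}
    for i, human in enumerate(instances):
        if human.label != "person":
            continue
        for j, other in enumerate(instances):
            if i != j:
                edges[(i, j)] = spatial_edge_feature(human.box, other.box)
    return nodes, edges


# Toy example with made-up boxes and two-dimensional placeholder features.
instances = [
    Instance(box=(0, 0, 10, 20), feature=[0.1, 0.9], label="person"),
    Instance(box=(5, 10, 15, 20), feature=[0.7, 0.2], label="bicycle"),
    Instance(box=(30, 30, 40, 40), feature=[0.4, 0.4], label="cup"),
]
nodes, edges = build_dense_graph(instances)
```

In this sketch, `edges` maps a `(human_index, object_index)` pair to its spatial feature vector, so a later update step could reweight or prune edges without touching the node features.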