{"title":"A Study of Data Processing for Object Recognition in Scene Image using FRCNN: A Smart Grid Technology","authors":"K. Das, A. Baruah","doi":"10.1109/IEMRE52042.2021.9386732","DOIUrl":null,"url":null,"abstract":"This paper proposes a new learning about the significance of part elements of a scene image with the effort of data processing. A top down tree structure with every node representing an annotation or bounding box having labeled visual features of an object existing in the image is studied in the paper. The images and its object annotations are from a trained dataset and are parsed to obtain the proposed representation. The images from the datasets and their parsed tree representations will be trained using a network called LSTM (Long Short Term Memory) network. The object detection may not be agnostic to the entire content of the image due to being influenced by the image composition and the discovered parts. The attempt has been made to show the object detection as a representation of the objects and their locations, parts of these objects, and the accuracy of the object detection method has been noted to have an efficient record with the implementation of the baseline Fast Region Based Convolutional Neural Network(FRCNN) method. The tested google open images datasets are used and found to have increased object detection record in notably respect to the use of high cost sensors in digital devices.","PeriodicalId":202287,"journal":{"name":"2021 Innovations in Energy Management and Renewable Resources(52042)","volume":"13 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-02-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 Innovations in Energy Management and Renewable Resources(52042)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IEMRE52042.2021.9386732","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
This paper proposes a new learning about the significance of part elements of a scene image with the effort of data processing. A top down tree structure with every node representing an annotation or bounding box having labeled visual features of an object existing in the image is studied in the paper. The images and its object annotations are from a trained dataset and are parsed to obtain the proposed representation. The images from the datasets and their parsed tree representations will be trained using a network called LSTM (Long Short Term Memory) network. The object detection may not be agnostic to the entire content of the image due to being influenced by the image composition and the discovered parts. The attempt has been made to show the object detection as a representation of the objects and their locations, parts of these objects, and the accuracy of the object detection method has been noted to have an efficient record with the implementation of the baseline Fast Region Based Convolutional Neural Network(FRCNN) method. The tested google open images datasets are used and found to have increased object detection record in notably respect to the use of high cost sensors in digital devices.