A Text Detection and Recognition System Based on an End-to-End Trainable Framework from UAV Imagery
Authors: Qingtian Wu, Yimin Zhou, Guoyuan Liang
Published in: 2018 IEEE International Conference on Robotics and Biomimetics (ROBIO), 2018-12-01
DOI: 10.1109/ROBIO.2018.8665259
Citations: 4
Abstract
In this paper, we present a UAV-based system for detecting and recognizing text, mainly English and Chinese. By combining an unmanned aerial vehicle (UAV) with scene text recognition, the system detects and recognizes text in long-range aerial images, providing a basis for unmanned navigation and fast understanding of text information. Robust detection and accurate recognition rest on two contributions. First, a scalable engine is proposed that synthesizes text images by overlaying English or Chinese text onto existing images in a natural way. Second, an end-to-end trainable framework that combines a Convolutional Neural Network and a Recurrent Neural Network is adapted to recognize variable-length text with high accuracy. Field experiments on videos shot outdoors against various backgrounds show that the proposed system detects and recognizes text in UAV imagery robustly and effectively.
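The abstract does not say how the CNN and RNN output is turned into variable-length text. One common choice for such recognizers is Connectionist Temporal Classification (CTC), and the sketch below assumes it purely for illustration; it is not taken from the paper. It shows greedy CTC decoding: take the best class per frame, merge consecutive repeats, and drop the blank symbol, so a fixed number of frames can yield a string of any shorter length. The names `ctc_greedy_decode`, `one_hot`, `BLANK`, and `ALPHABET` are hypothetical.

```python
from typing import List, Sequence

BLANK = 0  # assumed index of the CTC blank symbol (a convention chosen here, not from the paper)


def ctc_greedy_decode(frame_scores: Sequence[Sequence[float]], alphabet: str) -> str:
    """Greedy CTC decoding: argmax per frame, merge repeats, drop blanks.

    frame_scores[t][k] is the score of class k at frame t; class 0 is the
    blank and class k > 0 maps to alphabet[k - 1].
    """
    # Best class index for every frame (ties resolve to the lowest index).
    best_path: List[int] = [
        max(range(len(scores)), key=scores.__getitem__) for scores in frame_scores
    ]

    chars: List[str] = []
    prev = BLANK
    for idx in best_path:
        # Emit a character only when it is not blank and differs from the
        # previous frame; a blank between two equal labels keeps both.
        if idx != BLANK and idx != prev:
            chars.append(alphabet[idx - 1])
        prev = idx
    return "".join(chars)


def one_hot(index: int, size: int) -> List[float]:
    """Toy stand-in for one frame of recognizer output (hypothetical helper)."""
    return [1.0 if i == index else 0.0 for i in range(size)]


ALPHABET = "UAV"  # illustrative character set; a real system would cover English and Chinese
path = [1, 1, 0, 2, 3, 3, 0]  # seven frames: U U blank A V V blank
frames = [one_hot(i, len(ALPHABET) + 1) for i in path]
print(ctc_greedy_decode(frames, ALPHABET))  # prints "UAV"
```

Seven input frames collapse to a three-character string here, which is the sense in which a frame-wise recognizer can handle variable-length text without fixing the number of characters in advance.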