Tyler W. Nivin, G. Scott, J. A. Hurt, Raymond L. Chastain, C. Davis
{"title":"Exploring the Effects of Class-Specific Augmentation and Class Coalescence on Deep Neural Network Performance Using a Novel Road Feature Dataset","authors":"Tyler W. Nivin, G. Scott, J. A. Hurt, Raymond L. Chastain, C. Davis","doi":"10.1109/AIPR.2018.8707406","DOIUrl":null,"url":null,"abstract":"The identification of nodal road network features in remote sensing imagery is an important object detection task due to its versatility of application. A successful capability enables urban sprawl tracking, automatic or semi-automated map accuracy validation and updating, and macro-scale infrastructure damage evaluation and tracking just to name a few. We have curated a custom, novel dataset that includes nodal road network features such as bridges, cul-de-sacs, freeway exchanges and exits, freeway overpasses, intersections, and traffic circles. From this curated data we have evaluated the use of deep machine learning for object recognition across two variations in this image dataset. These variations are expanded versus semantically coalesced classes. We have evaluated the performance of two deep convolutional neural networks, ResNet50 and Xception, to detect these features across these variations of the image datasets. We have also explored the use of class-specific data augmentation to improve the performance of the models trained for nodal road network feature detection. Cross-validation performance of the models evaluated on four variations of this nodal road network feature dataset range from 0.81 to 0.96 (F1 scores). Coalescing highly specific, semantically challenging classes into more semantically generalized classes has a significant impact on the accuracy of the models. Our analysis provides insight into if and how these techniques can improve the performance of machine learning models, facilitating application to broad area imagery analysis in numerous application domains.","PeriodicalId":230582,"journal":{"name":"2018 IEEE Applied Imagery Pattern Recognition Workshop (AIPR)","volume":"32 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 IEEE Applied Imagery Pattern Recognition Workshop (AIPR)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/AIPR.2018.8707406","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4
Abstract
The identification of nodal road network features in remote sensing imagery is an important object detection task due to its versatility of application. A successful capability enables urban sprawl tracking, automatic or semi-automated map accuracy validation and updating, and macro-scale infrastructure damage evaluation and tracking just to name a few. We have curated a custom, novel dataset that includes nodal road network features such as bridges, cul-de-sacs, freeway exchanges and exits, freeway overpasses, intersections, and traffic circles. From this curated data we have evaluated the use of deep machine learning for object recognition across two variations in this image dataset. These variations are expanded versus semantically coalesced classes. We have evaluated the performance of two deep convolutional neural networks, ResNet50 and Xception, to detect these features across these variations of the image datasets. We have also explored the use of class-specific data augmentation to improve the performance of the models trained for nodal road network feature detection. Cross-validation performance of the models evaluated on four variations of this nodal road network feature dataset range from 0.81 to 0.96 (F1 scores). Coalescing highly specific, semantically challenging classes into more semantically generalized classes has a significant impact on the accuracy of the models. Our analysis provides insight into if and how these techniques can improve the performance of machine learning models, facilitating application to broad area imagery analysis in numerous application domains.