M. Talha, Sheikh Faisal Rashid, Zain Iftikhar, Muhammad Touseef Afzal, Liu Ying
{"title":"可扩展视觉质量检测的可转移学习架构","authors":"M. Talha, Sheikh Faisal Rashid, Zain Iftikhar, Muhammad Touseef Afzal, Liu Ying","doi":"10.1109/ICAI55435.2022.9773637","DOIUrl":null,"url":null,"abstract":"In recent years, convolutional neural networks (CNNs) have become a de facto standard in computer vision for object detection and recognition. At present, CNNs have been used in many application areas including the automation of industrial manufacturing processes. But using CNN in a real-time environment to track defects on products has many shortcomings like long training time, large data requirements, slow inference time, dynamic environment, and hardware dependency. This paper evaluates the state-of-the-art CNN architectures for object detection to address the mentioned challenges and provide the best possible solution. A set of pre-trained models has been trained on just 781 annotated images by applying transfer learning. Experimental results showed that Faster RCNN with VGG-16 backbone outperforms the other models in case of accuracy and mAP. But RetinaNet with an FPN backbone has the fastest inference time on multi-scaled defects. Paper also presents the deployment pipeline for inference on mobile devices to use in a real-time environment without any special hardware. In addition, an improved dataset of submersible pump impellers, based on the existing Kaggle dataset is introduced.","PeriodicalId":146842,"journal":{"name":"2022 2nd International Conference on Artificial Intelligence (ICAI)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-03-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Transferable Learning Architecture for Scalable Visual Quality Inspection\",\"authors\":\"M. Talha, Sheikh Faisal Rashid, Zain Iftikhar, Muhammad Touseef Afzal, Liu Ying\",\"doi\":\"10.1109/ICAI55435.2022.9773637\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In recent years, convolutional neural networks (CNNs) have become a de facto standard in computer vision for object detection and recognition. At present, CNNs have been used in many application areas including the automation of industrial manufacturing processes. But using CNN in a real-time environment to track defects on products has many shortcomings like long training time, large data requirements, slow inference time, dynamic environment, and hardware dependency. This paper evaluates the state-of-the-art CNN architectures for object detection to address the mentioned challenges and provide the best possible solution. A set of pre-trained models has been trained on just 781 annotated images by applying transfer learning. Experimental results showed that Faster RCNN with VGG-16 backbone outperforms the other models in case of accuracy and mAP. But RetinaNet with an FPN backbone has the fastest inference time on multi-scaled defects. Paper also presents the deployment pipeline for inference on mobile devices to use in a real-time environment without any special hardware. In addition, an improved dataset of submersible pump impellers, based on the existing Kaggle dataset is introduced.\",\"PeriodicalId\":146842,\"journal\":{\"name\":\"2022 2nd International Conference on Artificial Intelligence (ICAI)\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-03-30\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 2nd International Conference on Artificial Intelligence (ICAI)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICAI55435.2022.9773637\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 2nd International Conference on Artificial Intelligence (ICAI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICAI55435.2022.9773637","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Transferable Learning Architecture for Scalable Visual Quality Inspection
In recent years, convolutional neural networks (CNNs) have become a de facto standard in computer vision for object detection and recognition. At present, CNNs have been used in many application areas including the automation of industrial manufacturing processes. But using CNN in a real-time environment to track defects on products has many shortcomings like long training time, large data requirements, slow inference time, dynamic environment, and hardware dependency. This paper evaluates the state-of-the-art CNN architectures for object detection to address the mentioned challenges and provide the best possible solution. A set of pre-trained models has been trained on just 781 annotated images by applying transfer learning. Experimental results showed that Faster RCNN with VGG-16 backbone outperforms the other models in case of accuracy and mAP. But RetinaNet with an FPN backbone has the fastest inference time on multi-scaled defects. Paper also presents the deployment pipeline for inference on mobile devices to use in a real-time environment without any special hardware. In addition, an improved dataset of submersible pump impellers, based on the existing Kaggle dataset is introduced.