Irfan Al-Hussaini, Zubayer Islam, Antik Mallick, M. E. Hoque
{"title":"Object recognition and real-time spoken word recognition using two-fold dynamic time warping for autonomous arm manipulator","authors":"Irfan Al-Hussaini, Zubayer Islam, Antik Mallick, M. E. Hoque","doi":"10.1109/ICSIPA.2017.8120613","DOIUrl":null,"url":null,"abstract":"In this paper, we propound a command processing mechanism for an autonomous arm manipulator using real-time speech and images. We propose a novel two-stage speech recognition algorithm using two-fold Dynamic Time Warping in each stage. Real-time wake-up word recognition is followed by offline command recognition using k-means. Since high precision is paramount in any control system activation mechanism, a restrictive threshold is set to gain a precision of 1. This alleviates the problem of accidental triggering of the control system. Object recognition and classification is performed by matching features resulting from a local feature detector and descriptor. The algorithm controls an arm manipulator with 5 degrees of freedom.","PeriodicalId":268112,"journal":{"name":"2017 IEEE International Conference on Signal and Image Processing Applications (ICSIPA)","volume":"51 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 IEEE International Conference on Signal and Image Processing Applications (ICSIPA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICSIPA.2017.8120613","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
In this paper, we propound a command processing mechanism for an autonomous arm manipulator using real-time speech and images. We propose a novel two-stage speech recognition algorithm using two-fold Dynamic Time Warping in each stage. Real-time wake-up word recognition is followed by offline command recognition using k-means. Since high precision is paramount in any control system activation mechanism, a restrictive threshold is set to gain a precision of 1. This alleviates the problem of accidental triggering of the control system. Object recognition and classification is performed by matching features resulting from a local feature detector and descriptor. The algorithm controls an arm manipulator with 5 degrees of freedom.