A Novel Recurrent and Convolutional Neural Network Technique for Generating Handwriting from Voice
Aarushi Dua, A. Bhatia, B. Kalra, Srishti Vashishtha
2021 Third International Conference on Inventive Research in Computing Applications (ICIRCA), published 2021-09-02
DOI: 10.1109/ICIRCA51532.2021.9544925
Citations: 1
Abstract
This paper presents a method for generating online handwriting from voice. The tool is built in two broad steps: voice recognition using the Google Speech-to-Text API, and handwriting recognition using a combined recurrent and convolutional neural network (RCNN). The model is evaluated on the IAM and Electronic Fonts datasets, which contain images of handwritten text. The paper reports training results obtained with the Connectionist Temporal Classification (CTC) loss. CTC also provides a decoder that converts the output vectors generated by the RCNN into readable text.
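The abstract does not say which CTC decoding strategy the authors use. As a rough illustration of how a CTC decoder turns per-time-step network outputs into text, the sketch below assumes greedy (best-path) decoding: take the highest-scoring class at each step, merge consecutive repeats, then drop the blank symbol. The function name `ctc_greedy_decode`, the blank index 0, and the toy alphabet are hypothetical choices made for this example and do not come from the paper.

```python
from typing import List, Sequence

BLANK = 0  # assumption: class index 0 is the CTC blank symbol


def ctc_greedy_decode(
    scores: Sequence[Sequence[float]], alphabet: str, blank: int = BLANK
) -> str:
    """Best-path CTC decoding: argmax per step, merge repeats, drop blanks.

    scores[t][c] is the network score for class c at time step t.
    Class c (for c >= 1) maps to alphabet[c - 1] (assumed layout).
    """
    # Pick the highest-scoring class index at each time step.
    best_path = [max(range(len(step)), key=step.__getitem__) for step in scores]

    decoded: List[str] = []
    previous = blank
    for index in best_path:
        # Emit a character only when the label changes and is not the blank.
        if index != previous and index != blank:
            decoded.append(alphabet[index - 1])
        previous = index
    return "".join(decoded)


def one_hot(index: int, size: int) -> List[float]:
    """Toy stand-in for a network output row that is certain of one class."""
    return [1.0 if j == index else 0.0 for j in range(size)]


# Toy alphabet: class 1 = 'h', 2 = 'e', 3 = 'l', 4 = 'o'; class 0 = blank.
path = [1, 1, 2, 0, 3, 3, 0, 3, 4]
scores = [one_hot(i, 5) for i in path]
print(ctc_greedy_decode(scores, "helo"))  # prints "hello"
```

The blank between the two runs of class 3 is what lets the decoder keep the double "l": without it, the repeated labels would be merged into a single character. Beam-search decoding is a common alternative that can give better results at higher cost.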