OCR BASED SPEECH SYNTHESIS SYSTEM USING LABVIEW : Text to Speech Conversion System using OCR

2018 Second International Conference on Computing Methodologies and Communication (ICCMC) Pub Date : 2018-02-01 DOI:10.1109/ICCMC.2018.8487731

J. J. Mullani, M. Sankar, P. Khade, Snehal H Sonalkar, Nikita L Patil

{"title":"OCR BASED SPEECH SYNTHESIS SYSTEM USING LABVIEW : Text to Speech Conversion System using OCR","authors":"J. J. Mullani, M. Sankar, P. Khade, Snehal H Sonalkar, Nikita L Patil","doi":"10.1109/ICCMC.2018.8487731","DOIUrl":null,"url":null,"abstract":"Machine replication of human capacities, such as perusing, is an antiquated dream. Be that as it may, in the course of the most recent five decades, machine perusing has developed from a fantasy to reality. Discourse is likely the most proficient medium for correspondence between people. Optical character acknowledgment has turned out to be a standout amongst the best utilizations of innovation in the field of example acknowledgment and manmade brainpower. In current society, there is an awesome request to rapidly include expansive measure of printed and manually written data into the PC, along these lines everybody depend vigorously on PCs to process tremendous volumes of information. The essential goal is to enable vocally debilitated individuals to utilize the PC or to peruse archives in a simpler way. The framework is separated into two sections initially is Optical Character Recognition (OCR) and second part is content to discourse. In the initial segment, Virtual Instrument is produced in which a hued picture that contains the characters is changed over into grayscale picture and characters are prepared and in the second part; transformation from content to discourse is created. The mean of normal review time, standard deviation, least examination time and most extreme assessment time in ms is estimated. There are a few varieties in time parameters due to factors like number of characters perceived, line profile, histogram, shine, difference and gamma revision esteems.","PeriodicalId":6604,"journal":{"name":"2018 Second International Conference on Computing Methodologies and Communication (ICCMC)","volume":"264 1","pages":"7-14"},"PeriodicalIF":0.0000,"publicationDate":"2018-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 Second International Conference on Computing Methodologies and Communication (ICCMC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCMC.2018.8487731","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 3

Abstract

Machine replication of human capacities, such as perusing, is an antiquated dream. Be that as it may, in the course of the most recent five decades, machine perusing has developed from a fantasy to reality. Discourse is likely the most proficient medium for correspondence between people. Optical character acknowledgment has turned out to be a standout amongst the best utilizations of innovation in the field of example acknowledgment and manmade brainpower. In current society, there is an awesome request to rapidly include expansive measure of printed and manually written data into the PC, along these lines everybody depend vigorously on PCs to process tremendous volumes of information. The essential goal is to enable vocally debilitated individuals to utilize the PC or to peruse archives in a simpler way. The framework is separated into two sections initially is Optical Character Recognition (OCR) and second part is content to discourse. In the initial segment, Virtual Instrument is produced in which a hued picture that contains the characters is changed over into grayscale picture and characters are prepared and in the second part; transformation from content to discourse is created. The mean of normal review time, standard deviation, least examination time and most extreme assessment time in ms is estimated. There are a few varieties in time parameters due to factors like number of characters perceived, line profile, histogram, shine, difference and gamma revision esteems.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

基于OCR的LABVIEW语音合成系统:基于OCR的文本到语音转换系统

机器复制人类的能力，比如阅读，是一个过时的梦想。尽管如此，在最近的50年里，机器阅读已经从幻想变成了现实。话语可能是人与人之间沟通最熟练的媒介。光学字符识别已成为实例识别和人工智能领域创新的最佳应用之一。在当今社会，有一个惊人的要求，迅速包括大量的印刷和手工写入数据到个人电脑，沿着这些路线，每个人都大力依赖个人电脑来处理大量的信息。其基本目标是使声音衰弱的人能够以更简单的方式使用PC或阅读档案。该框架首先分为光学字符识别(OCR)和内容语篇两部分。在初始部分，制作虚拟仪器，其中将包含字符的彩色图像转换为灰度图像并准备字符，第二部分;创造了从内容到话语的转换。估计了正常评审时间、标准偏差、最小评审时间和最极端评审时间的平均值(ms)。由于感知到的字符数量、线条轮廓、直方图、亮度、差异和伽玛修正值等因素，时间参数有一些变化。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

2018 Second International Conference on Computing Methodologies and Communication (ICCMC)

自引率

0.00%

发文量