OCR BASED SPEECH SYNTHESIS SYSTEM USING LABVIEW : Text to Speech Conversion System using OCR

J. J. Mullani, M. Sankar, P. Khade, Snehal H Sonalkar, Nikita L Patil
{"title":"OCR BASED SPEECH SYNTHESIS SYSTEM USING LABVIEW : Text to Speech Conversion System using OCR","authors":"J. J. Mullani, M. Sankar, P. Khade, Snehal H Sonalkar, Nikita L Patil","doi":"10.1109/ICCMC.2018.8487731","DOIUrl":null,"url":null,"abstract":"Machine replication of human capacities, such as perusing, is an antiquated dream. Be that as it may, in the course of the most recent five decades, machine perusing has developed from a fantasy to reality. Discourse is likely the most proficient medium for correspondence between people. Optical character acknowledgment has turned out to be a standout amongst the best utilizations of innovation in the field of example acknowledgment and manmade brainpower. In current society, there is an awesome request to rapidly include expansive measure of printed and manually written data into the PC, along these lines everybody depend vigorously on PCs to process tremendous volumes of information. The essential goal is to enable vocally debilitated individuals to utilize the PC or to peruse archives in a simpler way. The framework is separated into two sections initially is Optical Character Recognition (OCR) and second part is content to discourse. In the initial segment, Virtual Instrument is produced in which a hued picture that contains the characters is changed over into grayscale picture and characters are prepared and in the second part; transformation from content to discourse is created. The mean of normal review time, standard deviation, least examination time and most extreme assessment time in ms is estimated. There are a few varieties in time parameters due to factors like number of characters perceived, line profile, histogram, shine, difference and gamma revision esteems.","PeriodicalId":6604,"journal":{"name":"2018 Second International Conference on Computing Methodologies and Communication (ICCMC)","volume":"264 1","pages":"7-14"},"PeriodicalIF":0.0000,"publicationDate":"2018-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 Second International Conference on Computing Methodologies and Communication (ICCMC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCMC.2018.8487731","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3

Abstract

Machine replication of human capacities, such as perusing, is an antiquated dream. Be that as it may, in the course of the most recent five decades, machine perusing has developed from a fantasy to reality. Discourse is likely the most proficient medium for correspondence between people. Optical character acknowledgment has turned out to be a standout amongst the best utilizations of innovation in the field of example acknowledgment and manmade brainpower. In current society, there is an awesome request to rapidly include expansive measure of printed and manually written data into the PC, along these lines everybody depend vigorously on PCs to process tremendous volumes of information. The essential goal is to enable vocally debilitated individuals to utilize the PC or to peruse archives in a simpler way. The framework is separated into two sections initially is Optical Character Recognition (OCR) and second part is content to discourse. In the initial segment, Virtual Instrument is produced in which a hued picture that contains the characters is changed over into grayscale picture and characters are prepared and in the second part; transformation from content to discourse is created. The mean of normal review time, standard deviation, least examination time and most extreme assessment time in ms is estimated. There are a few varieties in time parameters due to factors like number of characters perceived, line profile, histogram, shine, difference and gamma revision esteems.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于OCR的LABVIEW语音合成系统:基于OCR的文本到语音转换系统
机器复制人类的能力,比如阅读,是一个过时的梦想。尽管如此,在最近的50年里,机器阅读已经从幻想变成了现实。话语可能是人与人之间沟通最熟练的媒介。光学字符识别已成为实例识别和人工智能领域创新的最佳应用之一。在当今社会,有一个惊人的要求,迅速包括大量的印刷和手工写入数据到个人电脑,沿着这些路线,每个人都大力依赖个人电脑来处理大量的信息。其基本目标是使声音衰弱的人能够以更简单的方式使用PC或阅读档案。该框架首先分为光学字符识别(OCR)和内容语篇两部分。在初始部分,制作虚拟仪器,其中将包含字符的彩色图像转换为灰度图像并准备字符,第二部分;创造了从内容到话语的转换。估计了正常评审时间、标准偏差、最小评审时间和最极端评审时间的平均值(ms)。由于感知到的字符数量、线条轮廓、直方图、亮度、差异和伽玛修正值等因素,时间参数有一些变化。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Modelling of Audio Effects for Vocal and Music Synthesis in Real Time Deep Learning Framework for Diabetic Retinopathy Diagnosis A Comprehensive Survey on Internet of Things Based Healthcare Services and its Applications Exploring Pain Insensitivity Inducing Gene ZFHX2 by using Deep Convolutional Neural Network Atmospheric Weather Prediction Using various machine learning Techniques: A Survey
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1