{"title":"一种基于Adam优化算法的改进文本检测方法","authors":"Himani Kohli , Jyoti Agarwal , Manoj Kumar","doi":"10.1016/j.gltp.2022.03.028","DOIUrl":null,"url":null,"abstract":"<div><p>Optical Character Recognition (OCR) is an automatic identification technique which is applied in different application areas to translate documents or images into analysable and editable data. Printed or typed characters are easy to recognize as they have well defined shape and size, but this is not true in case of handwritten text. Handwriting of every individual is different so OCR face difficulty to recognize the characters. In past, researchers have been used different Machine Learning and Artificial Intelligence tools and techniques to analyse handwritten and printed documents and also worked to create an electronic format file from them. It is difficult to reuse this information as it is very difficult to search the content from these documents by lines or words. To solve this problem, OpenCV technique is used in this research work which focuses on training and testing of neural network model to conduct Document Image Analysis. The proposed model is named as J&M model for Text Detection from Hand written images. Implementation of research work is done in Python on MNIST database of handwritten digits. From this research work, 99.5% of training accuracy and 99% of testing accuracy was achieved along with training loss of 1.5%.</p></div>","PeriodicalId":100588,"journal":{"name":"Global Transitions Proceedings","volume":"3 1","pages":"Pages 230-234"},"PeriodicalIF":0.0000,"publicationDate":"2022-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2666285X22000346/pdfft?md5=18ea2e6ab1218ae399944d82e7bd551d&pid=1-s2.0-S2666285X22000346-main.pdf","citationCount":"7","resultStr":"{\"title\":\"An improved method for text detection using Adam optimization algorithm\",\"authors\":\"Himani Kohli , Jyoti Agarwal , Manoj Kumar\",\"doi\":\"10.1016/j.gltp.2022.03.028\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>Optical Character Recognition (OCR) is an automatic identification technique which is applied in different application areas to translate documents or images into analysable and editable data. Printed or typed characters are easy to recognize as they have well defined shape and size, but this is not true in case of handwritten text. Handwriting of every individual is different so OCR face difficulty to recognize the characters. In past, researchers have been used different Machine Learning and Artificial Intelligence tools and techniques to analyse handwritten and printed documents and also worked to create an electronic format file from them. It is difficult to reuse this information as it is very difficult to search the content from these documents by lines or words. To solve this problem, OpenCV technique is used in this research work which focuses on training and testing of neural network model to conduct Document Image Analysis. The proposed model is named as J&M model for Text Detection from Hand written images. Implementation of research work is done in Python on MNIST database of handwritten digits. From this research work, 99.5% of training accuracy and 99% of testing accuracy was achieved along with training loss of 1.5%.</p></div>\",\"PeriodicalId\":100588,\"journal\":{\"name\":\"Global Transitions Proceedings\",\"volume\":\"3 1\",\"pages\":\"Pages 230-234\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-06-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.sciencedirect.com/science/article/pii/S2666285X22000346/pdfft?md5=18ea2e6ab1218ae399944d82e7bd551d&pid=1-s2.0-S2666285X22000346-main.pdf\",\"citationCount\":\"7\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Global Transitions Proceedings\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2666285X22000346\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Global Transitions Proceedings","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2666285X22000346","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
An improved method for text detection using Adam optimization algorithm
Optical Character Recognition (OCR) is an automatic identification technique which is applied in different application areas to translate documents or images into analysable and editable data. Printed or typed characters are easy to recognize as they have well defined shape and size, but this is not true in case of handwritten text. Handwriting of every individual is different so OCR face difficulty to recognize the characters. In past, researchers have been used different Machine Learning and Artificial Intelligence tools and techniques to analyse handwritten and printed documents and also worked to create an electronic format file from them. It is difficult to reuse this information as it is very difficult to search the content from these documents by lines or words. To solve this problem, OpenCV technique is used in this research work which focuses on training and testing of neural network model to conduct Document Image Analysis. The proposed model is named as J&M model for Text Detection from Hand written images. Implementation of research work is done in Python on MNIST database of handwritten digits. From this research work, 99.5% of training accuracy and 99% of testing accuracy was achieved along with training loss of 1.5%.