{"title":"相机捕获的手写文档中的方程检测","authors":"Koushik K S, Ankita Mahale, Shobha Rani N","doi":"10.1109/ICAAIC56838.2023.10141166","DOIUrl":null,"url":null,"abstract":"One of the most important tasks in the realm of document analysis and recognition is the detection of equations in documents that were acquired using a camera. The procedure includes several steps, including pre-processing of the images, segmentation, feature extraction, and classification. The suggested method comprises taking a user-provided input expression image and classifying it into one of three types of equations: simple, complex, and highly complex. By choosing a decision boundary set off from the initial hyperplane, the SVR algorithm encodes the image, producing a model that fits the data better. The result is then obtained by character-wise segmenting the image and comparing it with trained models. Two recurrent neural networks make up the RNN encoder-decoder that is used. One RNN creates a fixed-length vector representation from a sequence of symbols, and a different RNN decodes that representation into a different sequence of symbols. 1900 images containing various equations made up the dataset utilized for training, validating, and testing the SVR and RNN. The accuracy of the system was about 93.64%.","PeriodicalId":267906,"journal":{"name":"2023 2nd International Conference on Applied Artificial Intelligence and Computing (ICAAIC)","volume":"405 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-05-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Equation Detection in the Camera Captured Handwritten Document\",\"authors\":\"Koushik K S, Ankita Mahale, Shobha Rani N\",\"doi\":\"10.1109/ICAAIC56838.2023.10141166\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"One of the most important tasks in the realm of document analysis and recognition is the detection of equations in documents that were acquired using a camera. The procedure includes several steps, including pre-processing of the images, segmentation, feature extraction, and classification. The suggested method comprises taking a user-provided input expression image and classifying it into one of three types of equations: simple, complex, and highly complex. By choosing a decision boundary set off from the initial hyperplane, the SVR algorithm encodes the image, producing a model that fits the data better. The result is then obtained by character-wise segmenting the image and comparing it with trained models. Two recurrent neural networks make up the RNN encoder-decoder that is used. One RNN creates a fixed-length vector representation from a sequence of symbols, and a different RNN decodes that representation into a different sequence of symbols. 1900 images containing various equations made up the dataset utilized for training, validating, and testing the SVR and RNN. The accuracy of the system was about 93.64%.\",\"PeriodicalId\":267906,\"journal\":{\"name\":\"2023 2nd International Conference on Applied Artificial Intelligence and Computing (ICAAIC)\",\"volume\":\"405 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-05-04\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2023 2nd International Conference on Applied Artificial Intelligence and Computing (ICAAIC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICAAIC56838.2023.10141166\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 2nd International Conference on Applied Artificial Intelligence and Computing (ICAAIC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICAAIC56838.2023.10141166","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Equation Detection in the Camera Captured Handwritten Document
One of the most important tasks in the realm of document analysis and recognition is the detection of equations in documents that were acquired using a camera. The procedure includes several steps, including pre-processing of the images, segmentation, feature extraction, and classification. The suggested method comprises taking a user-provided input expression image and classifying it into one of three types of equations: simple, complex, and highly complex. By choosing a decision boundary set off from the initial hyperplane, the SVR algorithm encodes the image, producing a model that fits the data better. The result is then obtained by character-wise segmenting the image and comparing it with trained models. Two recurrent neural networks make up the RNN encoder-decoder that is used. One RNN creates a fixed-length vector representation from a sequence of symbols, and a different RNN decodes that representation into a different sequence of symbols. 1900 images containing various equations made up the dataset utilized for training, validating, and testing the SVR and RNN. The accuracy of the system was about 93.64%.