数学公式提取

Jianming Jin, Xionghu Han, Qingren Wang
{"title":"数学公式提取","authors":"Jianming Jin, Xionghu Han, Qingren Wang","doi":"10.1109/ICDAR.2003.1227834","DOIUrl":null,"url":null,"abstract":"As a universal technical language, mathematics hasbeen widely applied in many fields, and it is more accuratethan any other languages in describing information.Therefore, numerous mathematical formulas exist in allkinds of documents. There is no doubt that automaticmathematical formulas processing is very important andnecessary, of which extract formulas from documentimages is the first step. In this paper, formulas extractionmethods which are not based on recognition results arepresented: isolated formulas are extracted based onParzen window and embedded expressions are extractedbased on 2-D structures detection. Experiments show thatour methods are very effective in formulas extraction.","PeriodicalId":249193,"journal":{"name":"Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings.","volume":"213 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2003-08-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"33","resultStr":"{\"title\":\"Mathematical formulas extraction\",\"authors\":\"Jianming Jin, Xionghu Han, Qingren Wang\",\"doi\":\"10.1109/ICDAR.2003.1227834\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"As a universal technical language, mathematics hasbeen widely applied in many fields, and it is more accuratethan any other languages in describing information.Therefore, numerous mathematical formulas exist in allkinds of documents. There is no doubt that automaticmathematical formulas processing is very important andnecessary, of which extract formulas from documentimages is the first step. In this paper, formulas extractionmethods which are not based on recognition results arepresented: isolated formulas are extracted based onParzen window and embedded expressions are extractedbased on 2-D structures detection. Experiments show thatour methods are very effective in formulas extraction.\",\"PeriodicalId\":249193,\"journal\":{\"name\":\"Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings.\",\"volume\":\"213 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2003-08-03\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"33\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings.\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICDAR.2003.1227834\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICDAR.2003.1227834","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 33

摘要

作为一种通用的技术语言,数学在许多领域得到了广泛的应用,它在描述信息方面比任何其他语言都更准确。因此,各种文献中存在着大量的数学公式。毫无疑问,数学公式的自动处理是非常重要和必要的,其中从文档图像中提取公式是第一步。本文提出了不基于识别结果的公式提取方法:基于parzen窗口的孤立公式提取和基于二维结构检测的嵌入表达式提取。实验表明,该方法在公式提取中是非常有效的。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Mathematical formulas extraction
As a universal technical language, mathematics hasbeen widely applied in many fields, and it is more accuratethan any other languages in describing information.Therefore, numerous mathematical formulas exist in allkinds of documents. There is no doubt that automaticmathematical formulas processing is very important andnecessary, of which extract formulas from documentimages is the first step. In this paper, formulas extractionmethods which are not based on recognition results arepresented: isolated formulas are extracted based onParzen window and embedded expressions are extractedbased on 2-D structures detection. Experiments show thatour methods are very effective in formulas extraction.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Impact of imperfect OCR on part-of-speech tagging Writer identification using innovative binarised features of handwritten numerals Word searching in CCITT group 4 compressed document images Exploiting reliability for dynamic selection of classi .ers by means of genetic algorithms Investigation of off-line Japanese signature verification using a pattern matching
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1