{"title":"数学公式提取","authors":"Jianming Jin, Xionghu Han, Qingren Wang","doi":"10.1109/ICDAR.2003.1227834","DOIUrl":null,"url":null,"abstract":"As a universal technical language, mathematics hasbeen widely applied in many fields, and it is more accuratethan any other languages in describing information.Therefore, numerous mathematical formulas exist in allkinds of documents. There is no doubt that automaticmathematical formulas processing is very important andnecessary, of which extract formulas from documentimages is the first step. In this paper, formulas extractionmethods which are not based on recognition results arepresented: isolated formulas are extracted based onParzen window and embedded expressions are extractedbased on 2-D structures detection. Experiments show thatour methods are very effective in formulas extraction.","PeriodicalId":249193,"journal":{"name":"Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings.","volume":"213 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2003-08-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"33","resultStr":"{\"title\":\"Mathematical formulas extraction\",\"authors\":\"Jianming Jin, Xionghu Han, Qingren Wang\",\"doi\":\"10.1109/ICDAR.2003.1227834\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"As a universal technical language, mathematics hasbeen widely applied in many fields, and it is more accuratethan any other languages in describing information.Therefore, numerous mathematical formulas exist in allkinds of documents. There is no doubt that automaticmathematical formulas processing is very important andnecessary, of which extract formulas from documentimages is the first step. In this paper, formulas extractionmethods which are not based on recognition results arepresented: isolated formulas are extracted based onParzen window and embedded expressions are extractedbased on 2-D structures detection. Experiments show thatour methods are very effective in formulas extraction.\",\"PeriodicalId\":249193,\"journal\":{\"name\":\"Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings.\",\"volume\":\"213 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2003-08-03\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"33\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings.\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICDAR.2003.1227834\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICDAR.2003.1227834","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
As a universal technical language, mathematics hasbeen widely applied in many fields, and it is more accuratethan any other languages in describing information.Therefore, numerous mathematical formulas exist in allkinds of documents. There is no doubt that automaticmathematical formulas processing is very important andnecessary, of which extract formulas from documentimages is the first step. In this paper, formulas extractionmethods which are not based on recognition results arepresented: isolated formulas are extracted based onParzen window and embedded expressions are extractedbased on 2-D structures detection. Experiments show thatour methods are very effective in formulas extraction.