{"title":"医学词汇表征的结构度量在双语文本识别中的应用","authors":"Nouf M. Alzahrani, Adil F. Alharthi","doi":"10.1007/s41133-019-0028-z","DOIUrl":null,"url":null,"abstract":"<div><p>The objective of this work is to contribute to the analysis and understanding of medical documents taken from health institutions in Saudi Arabia. The project aimed to use intelligent technologies and image processing tools to the automation of processing the medical documents. This consists particularly to assist medical staff to the treatment of the different medical forms in order to facilitate the storage of the important information and their centralization. As we worked on bilingual context, we proposed a system for identifying Arabic and Latin texts whether taped or manuscripted. In this way, we can identify the extracted blocks from different regions of interest and distribute them to different OCR systems to recognize them. We used SGLD as a texture measure of the image writing shapes. Then, we calculated Haralick descriptors that characterize them. The resulting recognition ratios were very efficient and promising.</p></div>","PeriodicalId":100147,"journal":{"name":"Augmented Human Research","volume":"5 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2019-12-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1007/s41133-019-0028-z","citationCount":"0","resultStr":"{\"title\":\"Textural Measure for Medical Words Characterization Applied to Script Identification in Bilingual Context\",\"authors\":\"Nouf M. Alzahrani, Adil F. Alharthi\",\"doi\":\"10.1007/s41133-019-0028-z\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>The objective of this work is to contribute to the analysis and understanding of medical documents taken from health institutions in Saudi Arabia. The project aimed to use intelligent technologies and image processing tools to the automation of processing the medical documents. This consists particularly to assist medical staff to the treatment of the different medical forms in order to facilitate the storage of the important information and their centralization. As we worked on bilingual context, we proposed a system for identifying Arabic and Latin texts whether taped or manuscripted. In this way, we can identify the extracted blocks from different regions of interest and distribute them to different OCR systems to recognize them. We used SGLD as a texture measure of the image writing shapes. Then, we calculated Haralick descriptors that characterize them. The resulting recognition ratios were very efficient and promising.</p></div>\",\"PeriodicalId\":100147,\"journal\":{\"name\":\"Augmented Human Research\",\"volume\":\"5 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-12-24\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://sci-hub-pdf.com/10.1007/s41133-019-0028-z\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Augmented Human Research\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://link.springer.com/article/10.1007/s41133-019-0028-z\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Augmented Human Research","FirstCategoryId":"1085","ListUrlMain":"https://link.springer.com/article/10.1007/s41133-019-0028-z","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Textural Measure for Medical Words Characterization Applied to Script Identification in Bilingual Context
The objective of this work is to contribute to the analysis and understanding of medical documents taken from health institutions in Saudi Arabia. The project aimed to use intelligent technologies and image processing tools to the automation of processing the medical documents. This consists particularly to assist medical staff to the treatment of the different medical forms in order to facilitate the storage of the important information and their centralization. As we worked on bilingual context, we proposed a system for identifying Arabic and Latin texts whether taped or manuscripted. In this way, we can identify the extracted blocks from different regions of interest and distribute them to different OCR systems to recognize them. We used SGLD as a texture measure of the image writing shapes. Then, we calculated Haralick descriptors that characterize them. The resulting recognition ratios were very efficient and promising.