基于随机游走算法的中文文献信息处理模型

Q3 Medicine Koomesh Pub Date : 2018-08-01 DOI:10.1109/I-SMAC.2018.8653683
Xiao Xian, Zhenhui Yue
{"title":"基于随机游走算法的中文文献信息处理模型","authors":"Xiao Xian, Zhenhui Yue","doi":"10.1109/I-SMAC.2018.8653683","DOIUrl":null,"url":null,"abstract":"In this paper, we conduct research on Chinese document information processing model based on random walk algorithm. Because of the complexity and also the particularity of processing Chinese information, Chinese search engine technology needs to be improved. The Chinese search engine cannot directly copy foreign technology. To study and analyze the expertise of the Chinese, we can accurately find the need in vast information base as the Chinese information. In this paper, the dictionary learning and sparse representation with random walk model are introduced into the character recognition to solve the problem of pen character and noise of the fax characters. The novel analytic framework is presented to assist the processing of the methodologies. The recognition method does not require preprocessing operations such as character binarization and thinning, only one feature and one classifier is needed, compared with the current multi-feature multi-cascade classifier fusion recognition method, proposed recognition method has characteristics of low complexity. The test on the experiment also reflects the robustness of the proposed model.","PeriodicalId":53631,"journal":{"name":"Koomesh","volume":"78 1","pages":"779-783"},"PeriodicalIF":0.0000,"publicationDate":"2018-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Chinese Document Information Processing Model Based on Random Walk Algorithm\",\"authors\":\"Xiao Xian, Zhenhui Yue\",\"doi\":\"10.1109/I-SMAC.2018.8653683\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this paper, we conduct research on Chinese document information processing model based on random walk algorithm. Because of the complexity and also the particularity of processing Chinese information, Chinese search engine technology needs to be improved. The Chinese search engine cannot directly copy foreign technology. To study and analyze the expertise of the Chinese, we can accurately find the need in vast information base as the Chinese information. In this paper, the dictionary learning and sparse representation with random walk model are introduced into the character recognition to solve the problem of pen character and noise of the fax characters. The novel analytic framework is presented to assist the processing of the methodologies. The recognition method does not require preprocessing operations such as character binarization and thinning, only one feature and one classifier is needed, compared with the current multi-feature multi-cascade classifier fusion recognition method, proposed recognition method has characteristics of low complexity. The test on the experiment also reflects the robustness of the proposed model.\",\"PeriodicalId\":53631,\"journal\":{\"name\":\"Koomesh\",\"volume\":\"78 1\",\"pages\":\"779-783\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2018-08-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Koomesh\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/I-SMAC.2018.8653683\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"Medicine\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Koomesh","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/I-SMAC.2018.8653683","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"Medicine","Score":null,"Total":0}
引用次数: 1

摘要

本文对基于随机漫步算法的中文文档信息处理模型进行了研究。由于中文信息处理的复杂性和特殊性,中文搜索引擎技术有待改进。中国的搜索引擎不能直接抄袭外国技术。通过对汉语专业知识的研究和分析,我们可以在庞大的信息库中准确地找到所需的汉语信息。本文将字典学习和随机游走模型的稀疏表示引入到字符识别中,解决了传真字符的笔头字符和噪声问题。提出了一种新的分析框架来辅助方法的处理。该识别方法不需要字符二值化和细化等预处理操作,只需要一个特征和一个分类器,与目前多特征多级联分类器融合识别方法相比,该识别方法具有低复杂度的特点。对实验的检验也反映了所提模型的鲁棒性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Chinese Document Information Processing Model Based on Random Walk Algorithm
In this paper, we conduct research on Chinese document information processing model based on random walk algorithm. Because of the complexity and also the particularity of processing Chinese information, Chinese search engine technology needs to be improved. The Chinese search engine cannot directly copy foreign technology. To study and analyze the expertise of the Chinese, we can accurately find the need in vast information base as the Chinese information. In this paper, the dictionary learning and sparse representation with random walk model are introduced into the character recognition to solve the problem of pen character and noise of the fax characters. The novel analytic framework is presented to assist the processing of the methodologies. The recognition method does not require preprocessing operations such as character binarization and thinning, only one feature and one classifier is needed, compared with the current multi-feature multi-cascade classifier fusion recognition method, proposed recognition method has characteristics of low complexity. The test on the experiment also reflects the robustness of the proposed model.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Koomesh
Koomesh Medicine-Medicine (all)
CiteScore
0.80
自引率
0.00%
发文量
0
审稿时长
24 weeks
期刊最新文献
New Evidence for the Civic Center from the Roman Colony to the Late Byzantine Period: Excavation of the Parking Lot at the Archaeological Museum of Philippi Reconstructing the Religious Landscape of the Roman Colony of Philippi Paul and Philippi: The Early Cult of the Apostle and the Topography of the Late Antique City Thracian, Greek, or Roman? Ethnic and Social Identities of Worshippers (and Gods) in Roman Philippi Reassessing Urban Continuity in Early Medieval Philippi
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1