基于句子排序条件概率的web文档概念规则挖掘

V. Navaneethakumar
{"title":"基于句子排序条件概率的web文档概念规则挖掘","authors":"V. Navaneethakumar","doi":"10.1109/ICPRIME.2013.6496467","DOIUrl":null,"url":null,"abstract":"Text classification and information mining are two significant objectives of natural language processing. Whereas handcrafting rules for both tasks has an extensive convention, learning strategies increased much attention in the past. Existing work presented concept based mining model for text, sentence mining and does not support text classification. To enhance the text clustering approach, we first presented Conceptual Rule Mining On Text clusters to evaluate the more related and influential sentences contributing the document topic. But this model might discriminate terms with semantic variation and negligible authority on the sentence meaning. In addition, we plan to extend conceptual text clustering to web documents, by assigning sentence weights based on conditional probability. Probability ratio is identified for the sentence similarity from which unique sentence meaning contributing to the document topic are listed. In this work, our plan is to implement ranking of the sentences which are calculated using the weights assigned to each and every sentences. With sentence rank conceptual rules are defined for the text cluster documents. The conceptual rule depicts finer tuned document topic and sentence meaning utilized to evaluate the global document contribution. Experiments are conducted with the web documents extracted from the research repositories to evaluate the efficiency of the proposed efficient conceptual rule mining on web document clusters using sentence ranking conditional probability [CRMSRCP] and compared with an existing Model for Concept Based Clustering and Classification and our previous works in terms of Sentence Term Relation, Cluster Object weights, and cluster quality.","PeriodicalId":123210,"journal":{"name":"2013 International Conference on Pattern Recognition, Informatics and Mobile Engineering","volume":"128 12 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2013-04-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Mining conceptual rules for web document using sentence ranking conditional probability\",\"authors\":\"V. Navaneethakumar\",\"doi\":\"10.1109/ICPRIME.2013.6496467\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Text classification and information mining are two significant objectives of natural language processing. Whereas handcrafting rules for both tasks has an extensive convention, learning strategies increased much attention in the past. Existing work presented concept based mining model for text, sentence mining and does not support text classification. To enhance the text clustering approach, we first presented Conceptual Rule Mining On Text clusters to evaluate the more related and influential sentences contributing the document topic. But this model might discriminate terms with semantic variation and negligible authority on the sentence meaning. In addition, we plan to extend conceptual text clustering to web documents, by assigning sentence weights based on conditional probability. Probability ratio is identified for the sentence similarity from which unique sentence meaning contributing to the document topic are listed. In this work, our plan is to implement ranking of the sentences which are calculated using the weights assigned to each and every sentences. With sentence rank conceptual rules are defined for the text cluster documents. The conceptual rule depicts finer tuned document topic and sentence meaning utilized to evaluate the global document contribution. Experiments are conducted with the web documents extracted from the research repositories to evaluate the efficiency of the proposed efficient conceptual rule mining on web document clusters using sentence ranking conditional probability [CRMSRCP] and compared with an existing Model for Concept Based Clustering and Classification and our previous works in terms of Sentence Term Relation, Cluster Object weights, and cluster quality.\",\"PeriodicalId\":123210,\"journal\":{\"name\":\"2013 International Conference on Pattern Recognition, Informatics and Mobile Engineering\",\"volume\":\"128 12 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2013-04-15\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2013 International Conference on Pattern Recognition, Informatics and Mobile Engineering\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICPRIME.2013.6496467\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2013 International Conference on Pattern Recognition, Informatics and Mobile Engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICPRIME.2013.6496467","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

文本分类和信息挖掘是自然语言处理的两个重要目标。手工制作这两项任务的规则有着广泛的惯例,而学习策略在过去得到了更多的关注。现有工作提出了基于概念的文本、句子挖掘模型,不支持文本分类。为了增强文本聚类方法,我们首先提出了基于文本聚类的概念规则挖掘,以评估与文档主题相关且更有影响力的句子。但是,该模型可能会对语义变化较大的术语进行区分,而对句子意义的权威可以忽略不计。此外,我们计划通过基于条件概率分配句子权重,将概念文本聚类扩展到web文档。识别句子相似度的概率比,从中列出对文档主题有贡献的唯一句子意义。在这项工作中,我们的计划是使用分配给每个句子的权重来计算句子的排名。通过句子排序,定义了文本聚类文档的概念规则。概念规则描述了用于评估全局文档贡献的更精细的文档主题和句子含义。利用从研究库中提取的web文档进行实验,评估基于句子排序条件概率(CRMSRCP)的web文档聚类高效概念规则挖掘的效率,并在句子术语关系、聚类对象权重和聚类质量方面与现有的基于概念的聚类和分类模型以及我们之前的工作进行比较。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Mining conceptual rules for web document using sentence ranking conditional probability
Text classification and information mining are two significant objectives of natural language processing. Whereas handcrafting rules for both tasks has an extensive convention, learning strategies increased much attention in the past. Existing work presented concept based mining model for text, sentence mining and does not support text classification. To enhance the text clustering approach, we first presented Conceptual Rule Mining On Text clusters to evaluate the more related and influential sentences contributing the document topic. But this model might discriminate terms with semantic variation and negligible authority on the sentence meaning. In addition, we plan to extend conceptual text clustering to web documents, by assigning sentence weights based on conditional probability. Probability ratio is identified for the sentence similarity from which unique sentence meaning contributing to the document topic are listed. In this work, our plan is to implement ranking of the sentences which are calculated using the weights assigned to each and every sentences. With sentence rank conceptual rules are defined for the text cluster documents. The conceptual rule depicts finer tuned document topic and sentence meaning utilized to evaluate the global document contribution. Experiments are conducted with the web documents extracted from the research repositories to evaluate the efficiency of the proposed efficient conceptual rule mining on web document clusters using sentence ranking conditional probability [CRMSRCP] and compared with an existing Model for Concept Based Clustering and Classification and our previous works in terms of Sentence Term Relation, Cluster Object weights, and cluster quality.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Separable reversible data hiding using Rc4 algorithm Personal approach for mobile search: A review Bijective soft set based classification of medical data Deployment and power assignment problem in Wireless Sensor Networks for intruder detection application using MEA Protein sequence motif patterns using adaptive Fuzzy C-Means granular computing model
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1