文本挖掘应用中模式提取的一种新的统计和语义方法

D. G. Vasques, P. Martins, S. O. Rezende
{"title":"文本挖掘应用中模式提取的一种新的统计和语义方法","authors":"D. G. Vasques, P. Martins, S. O. Rezende","doi":"10.19153/cleiej.22.3.5","DOIUrl":null,"url":null,"abstract":"The discovery of knowledge in textual databases is an approach that basically seeks for implicit relationships between different concepts in different documents written in natural language, in order to identify new useful knowledge. To assist in this process, this approach can count on the help of Text Mining techniques. Despite all the progress made, researchers in this area must still deal with a large volume of information and with the challenge of identifying the causal relationships between concepts in a certain field. A statistical and verbal semantic approach that supports the understanding of the semantic logic between concepts may help the extraction of relevant information and knowledge. The objective of this work is to support the user with the identification of implicit relationships between concepts present in different texts, considering their causal relationships. We propose a hybrid approach for the discovery of implicit knowledge present in a text corpus, using analysis based on association rules together with metrics from complex networks to identify relevant associations, verbal semantics to determine the causal relationships, and causal concept maps for their visualization. Through a case study, a set of texts from alternative medicine was selected and the different extractions showed that the proposed approach facilitates the identification of implicit knowledge by the user.","PeriodicalId":418941,"journal":{"name":"CLEI Electron. J.","volume":"59 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A New Statistical and Verbal-Semantic Approach to Pattern Extraction in Text Mining Applications\",\"authors\":\"D. G. Vasques, P. Martins, S. O. Rezende\",\"doi\":\"10.19153/cleiej.22.3.5\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The discovery of knowledge in textual databases is an approach that basically seeks for implicit relationships between different concepts in different documents written in natural language, in order to identify new useful knowledge. To assist in this process, this approach can count on the help of Text Mining techniques. Despite all the progress made, researchers in this area must still deal with a large volume of information and with the challenge of identifying the causal relationships between concepts in a certain field. A statistical and verbal semantic approach that supports the understanding of the semantic logic between concepts may help the extraction of relevant information and knowledge. The objective of this work is to support the user with the identification of implicit relationships between concepts present in different texts, considering their causal relationships. We propose a hybrid approach for the discovery of implicit knowledge present in a text corpus, using analysis based on association rules together with metrics from complex networks to identify relevant associations, verbal semantics to determine the causal relationships, and causal concept maps for their visualization. Through a case study, a set of texts from alternative medicine was selected and the different extractions showed that the proposed approach facilitates the identification of implicit knowledge by the user.\",\"PeriodicalId\":418941,\"journal\":{\"name\":\"CLEI Electron. J.\",\"volume\":\"59 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-12-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"CLEI Electron. J.\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.19153/cleiej.22.3.5\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"CLEI Electron. J.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.19153/cleiej.22.3.5","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

文本数据库中的知识发现基本上是一种寻找以自然语言书写的不同文档中不同概念之间的隐含关系,以识别新的有用知识的方法。为了在这个过程中提供帮助,这种方法可以依靠文本挖掘技术的帮助。尽管取得了所有这些进展,但这一领域的研究人员仍然必须处理大量的信息,并面临确定某一领域中概念之间因果关系的挑战。支持理解概念之间语义逻辑的统计和语言语义方法可能有助于提取相关信息和知识。这项工作的目的是支持用户识别不同文本中存在的概念之间的隐含关系,考虑到它们的因果关系。我们提出了一种用于发现文本语料库中存在的隐性知识的混合方法,使用基于关联规则的分析以及来自复杂网络的度量来识别相关关联,使用口头语义来确定因果关系,以及使用因果概念图来实现其可视化。通过一个案例研究,选择了一组替代医学文本,不同的提取表明,所提出的方法有助于用户识别隐性知识。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
A New Statistical and Verbal-Semantic Approach to Pattern Extraction in Text Mining Applications
The discovery of knowledge in textual databases is an approach that basically seeks for implicit relationships between different concepts in different documents written in natural language, in order to identify new useful knowledge. To assist in this process, this approach can count on the help of Text Mining techniques. Despite all the progress made, researchers in this area must still deal with a large volume of information and with the challenge of identifying the causal relationships between concepts in a certain field. A statistical and verbal semantic approach that supports the understanding of the semantic logic between concepts may help the extraction of relevant information and knowledge. The objective of this work is to support the user with the identification of implicit relationships between concepts present in different texts, considering their causal relationships. We propose a hybrid approach for the discovery of implicit knowledge present in a text corpus, using analysis based on association rules together with metrics from complex networks to identify relevant associations, verbal semantics to determine the causal relationships, and causal concept maps for their visualization. Through a case study, a set of texts from alternative medicine was selected and the different extractions showed that the proposed approach facilitates the identification of implicit knowledge by the user.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Cluster-based LSTM models to improve Dengue cases forecast Medium Access Control Techniques for Massive Machine-Type Communications in Cellular IoT Networks 2D Simplified Wildfire Spreading Model in Python: From NumPy to CuPy Preface to the CLTM-CLTD 2022 Special Issue On the specification and verification of the PCR parallel programming pattern in TLA+
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1