利用开放信息抽取抽取关系:一个扩展的系统映射

2021 XLVII Latin American Computing Conference (CLEI) Pub Date : 2021-10-25 DOI:10.1109/CLEI53233.2021.9639968

Vinicius dos Santos, P. R. Silva, Erica Ferreira, K. Felizardo, W. Watanabe, Arnaldo Cândido Júnior, G. V. Meinerz, S. Aluísio, N. Vijaykumar

{"title":"利用开放信息抽取抽取关系:一个扩展的系统映射","authors":"Vinicius dos Santos, P. R. Silva, Erica Ferreira, K. Felizardo, W. Watanabe, Arnaldo Cândido Júnior, G. V. Meinerz, S. Aluísio, N. Vijaykumar","doi":"10.1109/CLEI53233.2021.9639968","DOIUrl":null,"url":null,"abstract":"Context: For thousands of years humans have been using natural language to register their knowledge on important information to enable its access to future generations. With internet, a large amount of textual data is produced and shared on a daily basis. So, scientists started to research techniques for efficiently process knowledge stored in textual format. In this context, Natural Language Processing (NLP) became a popular area studying linguistic phenomena and using computational methods to process texts in natural language. In particular, Open Information Extraction (Open IE) was proposed to gather information from plain text. Despite the advances in this area, it is still necessary to map details about how these approaches were proposed to support the community while creating more efficient Open IE systems. Objective: In this paper, we identify, in the literature, the main characteristics of proposed Open IE approaches. Method: First, we extended the search performed in a systematic mapping previously published by using backward snowballing and a manual search. Next, we updated the electronic database search including ACL Anthology. Finally, 159 studies proposing Open IE approaches were considered for data extraction. Results: Data analysis showed a significant increase in the number of studies published about Open IE in the last years. In addition, we provide important details about how these techniques were proposed (e.g., data sets used and output evaluation techniques). Results indicate that researchers started to adopt neural networks to perform Open IE instead of using conventional supervised learning techniques. Conclusion: Recent advances in Artificial Intelligence and neural networks techniques allowed scientists to have a new perspective on how to perform efficient textual data management. Therefore, Open IE approaches gained much attention as they can help in many contexts, especially in knowledge management tasks.","PeriodicalId":6803,"journal":{"name":"2021 XLVII Latin American Computing Conference (CLEI)","volume":"128 1","pages":"1-10"},"PeriodicalIF":0.0000,"publicationDate":"2021-10-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Using Open Information Extraction to Extract Relations: An Extended Systematic Mapping\",\"authors\":\"Vinicius dos Santos, P. R. Silva, Erica Ferreira, K. Felizardo, W. Watanabe, Arnaldo Cândido Júnior, G. V. Meinerz, S. Aluísio, N. Vijaykumar\",\"doi\":\"10.1109/CLEI53233.2021.9639968\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Context: For thousands of years humans have been using natural language to register their knowledge on important information to enable its access to future generations. With internet, a large amount of textual data is produced and shared on a daily basis. So, scientists started to research techniques for efficiently process knowledge stored in textual format. In this context, Natural Language Processing (NLP) became a popular area studying linguistic phenomena and using computational methods to process texts in natural language. In particular, Open Information Extraction (Open IE) was proposed to gather information from plain text. Despite the advances in this area, it is still necessary to map details about how these approaches were proposed to support the community while creating more efficient Open IE systems. Objective: In this paper, we identify, in the literature, the main characteristics of proposed Open IE approaches. Method: First, we extended the search performed in a systematic mapping previously published by using backward snowballing and a manual search. Next, we updated the electronic database search including ACL Anthology. Finally, 159 studies proposing Open IE approaches were considered for data extraction. Results: Data analysis showed a significant increase in the number of studies published about Open IE in the last years. In addition, we provide important details about how these techniques were proposed (e.g., data sets used and output evaluation techniques). Results indicate that researchers started to adopt neural networks to perform Open IE instead of using conventional supervised learning techniques. Conclusion: Recent advances in Artificial Intelligence and neural networks techniques allowed scientists to have a new perspective on how to perform efficient textual data management. Therefore, Open IE approaches gained much attention as they can help in many contexts, especially in knowledge management tasks.\",\"PeriodicalId\":6803,\"journal\":{\"name\":\"2021 XLVII Latin American Computing Conference (CLEI)\",\"volume\":\"128 1\",\"pages\":\"1-10\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-10-25\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 XLVII Latin American Computing Conference (CLEI)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CLEI53233.2021.9639968\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 XLVII Latin American Computing Conference (CLEI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CLEI53233.2021.9639968","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

背景:几千年来，人类一直在使用自然语言来记录他们对重要信息的了解，以便后代能够获得这些信息。有了互联网，每天都会产生和共享大量的文本数据。因此，科学家们开始研究有效处理以文本形式存储的知识的技术。在这种背景下，自然语言处理(NLP)成为研究语言现象和使用计算方法处理自然语言文本的热门领域。特别地，Open Information Extraction (Open IE)被提出从纯文本中收集信息。尽管在这一领域取得了进展，但仍然有必要详细描述这些方法是如何被提出来支持社区的，同时创建更高效的开放IE系统。目的:在本文中，我们在文献中确定了提出的开放IE方法的主要特征。方法:首先，我们通过向后滚雪球和手动搜索扩展了以前发布的系统映射中执行的搜索。接下来，我们更新了包括ACL Anthology在内的电子数据库搜索。最后，159项提出Open IE方法的研究被考虑用于数据提取。结果:数据分析显示，在过去几年中，关于Open IE的研究发表的数量显著增加。此外，我们还提供了关于这些技术是如何提出的重要细节(例如，使用的数据集和输出评估技术)。结果表明，研究人员开始采用神经网络来执行Open IE，而不是使用传统的监督学习技术。结论:人工智能和神经网络技术的最新进展使科学家们对如何执行有效的文本数据管理有了新的视角。因此，开放IE方法获得了很多关注，因为它们可以在很多情况下提供帮助，特别是在知识管理任务中。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Using Open Information Extraction to Extract Relations: An Extended Systematic Mapping

Context: For thousands of years humans have been using natural language to register their knowledge on important information to enable its access to future generations. With internet, a large amount of textual data is produced and shared on a daily basis. So, scientists started to research techniques for efficiently process knowledge stored in textual format. In this context, Natural Language Processing (NLP) became a popular area studying linguistic phenomena and using computational methods to process texts in natural language. In particular, Open Information Extraction (Open IE) was proposed to gather information from plain text. Despite the advances in this area, it is still necessary to map details about how these approaches were proposed to support the community while creating more efficient Open IE systems. Objective: In this paper, we identify, in the literature, the main characteristics of proposed Open IE approaches. Method: First, we extended the search performed in a systematic mapping previously published by using backward snowballing and a manual search. Next, we updated the electronic database search including ACL Anthology. Finally, 159 studies proposing Open IE approaches were considered for data extraction. Results: Data analysis showed a significant increase in the number of studies published about Open IE in the last years. In addition, we provide important details about how these techniques were proposed (e.g., data sets used and output evaluation techniques). Results indicate that researchers started to adopt neural networks to perform Open IE instead of using conventional supervised learning techniques. Conclusion: Recent advances in Artificial Intelligence and neural networks techniques allowed scientists to have a new perspective on how to perform efficient textual data management. Therefore, Open IE approaches gained much attention as they can help in many contexts, especially in knowledge management tasks.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2021 XLVII Latin American Computing Conference (CLEI)

自引率

0.00%

发文量

期刊最新文献

Structured Text Generation for Spanish Freestyle Battles using Neural Networks Learning factory for the Software Engineering area: First didactic transformation An Early Alert System for Software Vulnerabilities based on Vulnerability Repositories and Social Networks Data Quality Management oriented to the Electronic Medical Record Program Committees