首页 > 最新文献

Proceedings of the International Workshop on Semantic Big Data最新文献

英文 中文
Safety check: a semantic web application for emergency management 安全检查:用于应急管理的语义web应用程序
Pub Date : 2017-05-19 DOI: 10.1145/3066911.3066917
Yogesh Pandey, S. Bansal
Semantic Computing is being used in a wide variety of domains to develop intelligent applications. With the increasing availability of different kinds of data on the web, providing better emergency management in case of natural disasters and humanitarian crises is much needed. This paper presents the use of semantic technologies to build a web application called Safety Check. The aim of this work is to implement a knowledge intensive application that identifies those people that may have been affected due to natural disasters or man-made disasters at any geographical location and notify them with safety instructions. This involves extraction of data from various sources for emergency alerts, weather alerts, and contacts data. The extracted data is integrated using a semantic data model and transformed into semantic data. Semantic reasoning was done through rules and queries.
语义计算被广泛应用于各种领域,以开发智能应用程序。随着网络上各种数据的可用性不断增加,在发生自然灾害和人道主义危机时提供更好的应急管理是非常必要的。本文介绍了使用语义技术构建一个名为“安全检查”的web应用程序。这项工作的目的是实现一个知识密集型应用程序,该应用程序可以识别在任何地理位置可能受到自然灾害或人为灾害影响的人,并向他们发出安全指示。这涉及从各种来源提取用于紧急警报、天气警报和联系人数据的数据。提取的数据使用语义数据模型集成并转换为语义数据。语义推理是通过规则和查询完成的。
{"title":"Safety check: a semantic web application for emergency management","authors":"Yogesh Pandey, S. Bansal","doi":"10.1145/3066911.3066917","DOIUrl":"https://doi.org/10.1145/3066911.3066917","url":null,"abstract":"Semantic Computing is being used in a wide variety of domains to develop intelligent applications. With the increasing availability of different kinds of data on the web, providing better emergency management in case of natural disasters and humanitarian crises is much needed. This paper presents the use of semantic technologies to build a web application called Safety Check. The aim of this work is to implement a knowledge intensive application that identifies those people that may have been affected due to natural disasters or man-made disasters at any geographical location and notify them with safety instructions. This involves extraction of data from various sources for emergency alerts, weather alerts, and contacts data. The extracted data is integrated using a semantic data model and transformed into semantic data. Semantic reasoning was done through rules and queries.","PeriodicalId":210506,"journal":{"name":"Proceedings of the International Workshop on Semantic Big Data","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2017-05-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133225250","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
Ontology-based approach for unsupervised and adaptive focused crawling 基于本体的无监督自适应聚焦爬行方法
Pub Date : 2017-05-19 DOI: 10.1145/3066911.3066912
Thomas Hassan, C. Cruz, Aurélie Bertaux
Information from the web is a key resource exploited in the domain of competitive intelligence. These sources represent important volumes of information to process everyday. As the amount of information available grows rapidly, this process becomes overwhelming for experts. To leverage this challenge, this paper presents a novel approach to process such sources and extract only the most valuable pieces of information. The approach is based on an unsupervised and adaptive ontology-learning process. The resulting ontology is used to enhance the performance of a focused crawler. The combination of Big Data and Semantic Web technologies allows to classify information precisely according to domain knowledge, while maintaining optimal performances. The approach and its implementation are described, and an presents the feasibility and performance of the approach.
网络信息是竞争情报领域的重要资源。这些来源代表了每天需要处理的大量重要信息。随着可用信息量的迅速增长,这一过程对专家来说变得势不可挡。为了利用这一挑战,本文提出了一种新的方法来处理这些来源并仅提取最有价值的信息。该方法基于无监督和自适应本体学习过程。生成的本体用于增强聚焦爬虫的性能。大数据和语义网技术的结合可以根据领域知识对信息进行精确分类,同时保持最佳性能。介绍了该方法及其实现,并给出了该方法的可行性和性能。
{"title":"Ontology-based approach for unsupervised and adaptive focused crawling","authors":"Thomas Hassan, C. Cruz, Aurélie Bertaux","doi":"10.1145/3066911.3066912","DOIUrl":"https://doi.org/10.1145/3066911.3066912","url":null,"abstract":"Information from the web is a key resource exploited in the domain of competitive intelligence. These sources represent important volumes of information to process everyday. As the amount of information available grows rapidly, this process becomes overwhelming for experts. To leverage this challenge, this paper presents a novel approach to process such sources and extract only the most valuable pieces of information. The approach is based on an unsupervised and adaptive ontology-learning process. The resulting ontology is used to enhance the performance of a focused crawler. The combination of Big Data and Semantic Web technologies allows to classify information precisely according to domain knowledge, while maintaining optimal performances. The approach and its implementation are described, and an presents the feasibility and performance of the approach.","PeriodicalId":210506,"journal":{"name":"Proceedings of the International Workshop on Semantic Big Data","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2017-05-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123489206","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 14
Supervised typing of big graphs using semantic embeddings 使用语义嵌入的大图的监督类型
Pub Date : 2017-03-22 DOI: 10.1145/3066911.3066918
M. Kejriwal, Pedro A. Szekely
We propose a supervised algorithm for generating type embeddings in the same semantic vector space as a given set of entity embeddings. The algorithm is agnostic to the derivation of the underlying entity embeddings. It does not require any manual feature engineering, generalizes well to hundreds of types and achieves near-linear scaling on Big Graphs containing many millions of triples and instances by virtue of an incremental execution. We demonstrate the utility of the embeddings on a type recommendation task, outperforming a non-parametric feature-agnostic baseline while achieving 15× speedup and near-constant memory usage on a full partition of DBpedia. Using state-of-the-art visualization, we illustrate the agreement of our extensionally derived DBpedia type embeddings with the manually curated domain ontology. Finally, we use the embeddings to probabilistically cluster about 4 million DBpedia instances into 415 types in the DBpedia ontology.
我们提出了一种监督算法,用于在与给定实体嵌入集相同的语义向量空间中生成类型嵌入。该算法对底层实体嵌入的推导是不可知的。它不需要任何手动特征工程,可以很好地泛化到数百种类型,并通过增量执行在包含数百万个三元组和实例的大图上实现近线性缩放。我们演示了嵌入在类型推荐任务上的效用,在DBpedia的整个分区上实现了15倍的加速和近乎恒定的内存使用,同时优于非参数特征不确定基线。使用最先进的可视化技术,我们说明了扩展派生的DBpedia类型嵌入与手动策划的领域本体的一致性。最后,我们使用嵌入将大约400万个DBpedia实例概率地聚类到DBpedia本体中的415种类型中。
{"title":"Supervised typing of big graphs using semantic embeddings","authors":"M. Kejriwal, Pedro A. Szekely","doi":"10.1145/3066911.3066918","DOIUrl":"https://doi.org/10.1145/3066911.3066918","url":null,"abstract":"We propose a supervised algorithm for generating type embeddings in the same semantic vector space as a given set of entity embeddings. The algorithm is agnostic to the derivation of the underlying entity embeddings. It does not require any manual feature engineering, generalizes well to hundreds of types and achieves near-linear scaling on Big Graphs containing many millions of triples and instances by virtue of an incremental execution. We demonstrate the utility of the embeddings on a type recommendation task, outperforming a non-parametric feature-agnostic baseline while achieving 15× speedup and near-constant memory usage on a full partition of DBpedia. Using state-of-the-art visualization, we illustrate the agreement of our extensionally derived DBpedia type embeddings with the manually curated domain ontology. Finally, we use the embeddings to probabilistically cluster about 4 million DBpedia instances into 415 types in the DBpedia ontology.","PeriodicalId":210506,"journal":{"name":"Proceedings of the International Workshop on Semantic Big Data","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2017-03-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129392171","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 13
Proceedings of The International Workshop on Semantic Big Data 语义大数据国际研讨会论文集
Pub Date : 2016-06-26 DOI: 10.1145/3391274
Sven Groppe, L. Gruenwald
The current World-Wide Web enables an easy, instant access to a vast amount of online information. However, the content in the Web is typically for human consumption, and is not tailored for machine processing. The Semantic Web is hence intended to establish a machine-understandable Web, and is currently also used in many other domains and not only in the Web. The World Wide Web Consortium (W3C) has developed a number of standards around this vision. Among them is the Resource Description Framework (RDF), which is used as the data model of the Semantic Web. The W3C has also defined SPARQL as the RDF query language, RIF as the rule language, and the ontology languages RDFS and OWL to describe schemas of RDF. The usage of common ontologies increases interoperability between heterogeneous data sets, and the proprietary ontologies with the additional abstraction layer facilitate the integration of these data sets. Therefore, we can argue that the Semantic Web is ideally designed to work in heterogeneous Big Data environments. We define Semantic Big Data as the intersection of Semantic Web data and Big Data. There are masses of Semantic Web data freely available to the public - thanks to the efforts of the linked data initiative. According to http://stats.lod2.eu/ the current freely available Semantic Web data is approximately 90 billion triples in over 3,300 datasets, many of which are accessible via SPARQL query servers called SPARQL endpoints. Everyone can submit SPARQL queries to SPARQL endpoints via a standardized protocol, where the queries are processed on the datasets of the SPARQL endpoints and the query results are sent back in a standardized format. Hence, not only Semantic Big Data is freely available, but also distributed execution environments for Semantic Big Data are freely accessible. This makes the Semantic Web an ideal playground for Big Data research. The goal of this workshop is to bring together academic researchers and industry practitioners to address the challenges and report and exchange the research findings in Semantic Big Data, including new approaches, techniques and applications, make substantial theoretical and empirical contributions to, and significantly advance the state of the art of Semantic Big Data.
当前的万维网使人们能够轻松、即时地访问大量的在线信息。然而,Web中的内容通常是供人使用的,而不是为机器处理而定制的。因此,语义网旨在建立一个机器可理解的Web,目前也用于许多其他领域,而不仅仅是在Web中。万维网联盟(W3C)围绕这一愿景开发了许多标准。其中包括资源描述框架(RDF),它被用作语义Web的数据模型。W3C还将SPARQL定义为RDF查询语言,将RIF定义为规则语言,并将本体语言RDFS和OWL定义为描述RDF模式。公共本体的使用增加了异构数据集之间的互操作性,而带有附加抽象层的专有本体促进了这些数据集的集成。因此,我们可以认为语义网是在异构大数据环境中工作的理想设计。我们将语义大数据定义为语义网数据和大数据的交集。由于关联数据倡议的努力,公众可以免费获得大量的语义Web数据。根据http://stats.lod2.eu/,目前可以免费获得的语义Web数据在3300多个数据集中大约有900亿个三元组,其中许多可以通过SPARQL查询服务器(称为SPARQL端点)访问。每个人都可以通过标准化协议向SPARQL端点提交SPARQL查询,在SPARQL端点的数据集上处理查询,并以标准化格式发送查询结果。因此,不仅语义大数据是免费的,而且语义大数据的分布式执行环境也是免费的。这使得语义网成为大数据研究的理想场所。本次研讨会的目标是将学术研究人员和行业从业者聚集在一起,共同应对语义大数据的挑战,报告和交流语义大数据的研究成果,包括新的方法、技术和应用,为语义大数据的发展做出实质性的理论和实证贡献,并显著推进语义大数据的发展。
{"title":"Proceedings of The International Workshop on Semantic Big Data","authors":"Sven Groppe, L. Gruenwald","doi":"10.1145/3391274","DOIUrl":"https://doi.org/10.1145/3391274","url":null,"abstract":"The current World-Wide Web enables an easy, instant access to a vast amount of online information. However, the content in the Web is typically for human consumption, and is not tailored for machine processing. The Semantic Web is hence intended to establish a machine-understandable Web, and is currently also used in many other domains and not only in the Web. The World Wide Web Consortium (W3C) has developed a number of standards around this vision. Among them is the Resource Description Framework (RDF), which is used as the data model of the Semantic Web. The W3C has also defined SPARQL as the RDF query language, RIF as the rule language, and the ontology languages RDFS and OWL to describe schemas of RDF. The usage of common ontologies increases interoperability between heterogeneous data sets, and the proprietary ontologies with the additional abstraction layer facilitate the integration of these data sets. Therefore, we can argue that the Semantic Web is ideally designed to work in heterogeneous Big Data environments. \u0000 \u0000We define Semantic Big Data as the intersection of Semantic Web data and Big Data. There are masses of Semantic Web data freely available to the public - thanks to the efforts of the linked data initiative. According to http://stats.lod2.eu/ the current freely available Semantic Web data is approximately 90 billion triples in over 3,300 datasets, many of which are accessible via SPARQL query servers called SPARQL endpoints. Everyone can submit SPARQL queries to SPARQL endpoints via a standardized protocol, where the queries are processed on the datasets of the SPARQL endpoints and the query results are sent back in a standardized format. Hence, not only Semantic Big Data is freely available, but also distributed execution environments for Semantic Big Data are freely accessible. This makes the Semantic Web an ideal playground for Big Data research. \u0000 \u0000The goal of this workshop is to bring together academic researchers and industry practitioners to address the challenges and report and exchange the research findings in Semantic Big Data, including new approaches, techniques and applications, make substantial theoretical and empirical contributions to, and significantly advance the state of the art of Semantic Big Data.","PeriodicalId":210506,"journal":{"name":"Proceedings of the International Workshop on Semantic Big Data","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2016-06-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132463125","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Proceedings of The International Workshop on Semantic Big Data 语义大数据国际研讨会论文集
Pub Date : 1900-01-01 DOI: 10.1145/3066911
{"title":"Proceedings of The International Workshop on Semantic Big Data","authors":"","doi":"10.1145/3066911","DOIUrl":"https://doi.org/10.1145/3066911","url":null,"abstract":"","PeriodicalId":210506,"journal":{"name":"Proceedings of the International Workshop on Semantic Big Data","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131355025","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Proceedings of the International Workshop on Semantic Big Data
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1