2019 IEEE International Congress on Big Data (BigDataCongress)最新文献

英文中文

Big Data Integration of Heterogeneous Data Sources: The Re-Search Alps Case Study 异构数据源的大数据集成:research Alps案例研究

2019 IEEE International Congress on Big Data (BigDataCongress)

Pub Date : 2019-07-01 DOI: 10.1109/BigDataCongress.2019.00027

F. Guerra, Paolo Sottovia, Matteo Paganelli, M. Vincini

The application of big data integration techniques in real scenarios needs to address practical issues related to the scalability of the process and the heterogeneity of data sources. In this paper, we describe the pipeline that has been developed in the context of the Re-search Alps project, a project funded by the EU Commission through the INEA Agency in the CEF Telecom framework, that aims at creating an open dataset describing research centers located in the Alpine area.

大数据集成技术在实际场景中的应用，需要解决过程的可扩展性和数据源的异构性等实际问题。在本文中，我们描述了在阿尔卑斯研究项目背景下开发的管道，该项目由欧盟委员会通过CEF电信框架中的INEA机构资助，旨在创建一个描述位于阿尔卑斯地区的研究中心的开放数据集。

引用次数: 4

Big Data and Analytics in the Age of the GDPR GDPR时代的大数据与分析

2019 IEEE International Congress on Big Data (BigDataCongress)

Pub Date : 2019-07-01 DOI: 10.1109/BigDataCongress.2019.00015

P. Bonatti, S. Kirrane

The new European General Data Protection Regulation places stringent restrictions on the processing of personally identifiable data. The GDPR does not only affect European companies, as the regulation applies to all the organizations that track or provide services to European citizens. Free exploratory data analysis is permitted only on anonymous data, at the cost of some legal risks. We argue that for the other kinds of personal data processing, the most flexible and safe legal basis is explicit consent. We illustrate the approach to consent management and compliance with the GDPR being developed by the European H2020 project SPECIAL, and highlight some related big data aspects.

新的欧洲通用数据保护条例对个人身份数据的处理施加了严格的限制。GDPR不仅影响欧洲公司，因为该法规适用于所有跟踪或为欧洲公民提供服务的组织。免费的探索性数据分析只允许对匿名数据进行分析，这要付出一些法律风险的代价。我们认为，对于其他类型的个人数据处理，最灵活、最安全的法律依据是明确同意。我们说明了欧洲H2020项目SPECIAL正在开发的同意管理和遵守GDPR的方法，并强调了一些相关的大数据方面。

引用次数: 21

Sequences of Recommendations for Dynamic Groups: What Is the Role of Context? 动态群体的建议序列:语境的作用是什么?

2019 IEEE International Congress on Big Data (BigDataCongress)

Pub Date : 2019-07-01 DOI: 10.1109/BigDataCongress.2019.00029

S. Migliorini, E. Quintarelli, D. Carra, A. Belussi

Recommendation algorithms have been investigated and employed by many important companies in the past years: some scenarios, such as the one where a system suggests the points of interest to tourists, well adapt to sequence of recommendations to (groups of) users. We envision that sequence recommendations can be useful whenever the group of users has a limited time interval to spend together, since they reduce the time wasted in selecting the best next activity. In this paper, we investigate the role played by the context, i.e. the situation the group is currently experiencing, in the design of a system that recommends sequences of activities. We model the problem as a multi-objective optimization, where the satisfaction of the group and the available time interval are two of the functions to be optimized. In particular, the dynamic evolution of the group can be considered as the key contextual feature to produce better suggestions.

在过去的几年里，许多重要的公司已经研究并采用了推荐算法:在某些情况下，比如系统向游客推荐感兴趣的景点，可以很好地适应向(群体)用户推荐的顺序。我们设想，每当用户组在一起的时间间隔有限时，序列推荐就会很有用，因为它们减少了选择最佳下一个活动所浪费的时间。在本文中，我们研究了背景所起的作用，即群体目前正在经历的情况，在推荐活动序列的系统设计中。我们将该问题建模为一个多目标优化问题，其中群体满意度和可用时间间隔是两个需要优化的函数。特别是，群体的动态演变可以被认为是产生更好建议的关键上下文特征。

引用次数: 5

Message from the IEEE Big Data Congress 2019 Chairs 2019年IEEE大数据大会主席致辞

2019 IEEE International Congress on Big Data (BigDataCongress)

Pub Date : 2019-07-01 DOI: 10.1109/bigdatacongress.2019.00011

引用次数: 0

Scalable Block Reporting for HopsFS HopsFS的可伸缩块报告

2019 IEEE International Congress on Big Data (BigDataCongress)

Pub Date : 2019-07-01 DOI: 10.1109/BigDataCongress.2019.00035

Mahmoud Ismail, August Bonds, Salman Niazi, Seif Haridi, J. Dowling

Distributed hierarchical file systems typically decouple the storage of the file system's metadata from the data (file system blocks) to enable the scalability of the file system. This decoupling, however, requires the introduction of a periodic synchronization protocol to ensure the consistency of the file system's metadata and its blocks. Apache HDFS and HopsFS implement a protocol, called block reporting, where each data server periodically sends ground truth information about all its file system blocks to the metadata servers, allowing the metadata to be synchronized with the actual state of the data blocks in the file system. The network and processing overhead of the existing block reporting protocol, however, increases with cluster size, ultimately limiting cluster scalability. In this paper, we introduce a new block reporting protocol for HopsFS that reduces the protocol bandwidth and processing overhead by up to three orders of magnitude, compared to HDFS/HopsFS' existing protocol. Our new protocol removes a major bottleneck that prevented HopsFS clusters scaling to tens of thousands of servers.

分布式分层文件系统通常将文件系统元数据的存储与数据(文件系统块)解耦，以支持文件系统的可伸缩性。但是，这种解耦需要引入定期同步协议，以确保文件系统的元数据及其块的一致性。Apache HDFS和HopsFS实现了一个称为块报告的协议，其中每个数据服务器定期向元数据服务器发送有关其所有文件系统块的基本真实信息，从而允许元数据与文件系统中数据块的实际状态同步。然而，现有块报告协议的网络和处理开销随着集群规模的增加而增加，最终限制了集群的可伸缩性。在本文中，我们为HopsFS引入了一个新的块报告协议，与HDFS/HopsFS现有协议相比，该协议将协议带宽和处理开销减少了多达三个数量级。我们的新协议消除了阻碍HopsFS集群扩展到数万台服务器的主要瓶颈。

引用次数: 1

Title Page i 第1页

2019 IEEE International Congress on Big Data (BigDataCongress)

Pub Date : 2019-07-01 DOI: 10.1109/bigdatacongress.2019.00001

引用次数: 0

Message from the IEEE SERVICES 2019 Symposia Chairs 2019年IEEE服务研讨会主席致辞

2019 IEEE International Congress on Big Data (BigDataCongress)

Pub Date : 2019-07-01 DOI: 10.1109/edge.2019.00009

M. Goul, Rong N. Chang, L. Brunie

引用次数: 0

Message from the IEEE SERVICES 2019 Program Chair-in-Chief and Vice Program Chair-in-Chief IEEE服务2019项目主席和副项目主席致辞

2019 IEEE International Congress on Big Data (BigDataCongress)

Pub Date : 2019-07-01 DOI: 10.1109/edge.2019.00008

引用次数: 0

A Service Clustering Method Based on Wisdom of Crowds 基于群体智慧的服务聚类方法

2019 IEEE International Congress on Big Data (BigDataCongress)

Pub Date : 2019-04-16 DOI: 10.1109/BigDataCongress.2019.00026

Hui Gao, Karolina K. Dluzniak, Hong Xia, W. Jie, Yanping Chen, Wei Xing, Xin Wang, Zhongmin Wang

As the number and variety of services increase, it is becoming difficult and time-consuming to locate services that satisfy users' need. Service clustering is efficacious method to prune the query space, to narrow the searching space, and improve the accuracy of locating services that satisfied users' needs. At present, clustering method of web services adopted single or traditional clustering algorithms. However, accuracy and stability of single or traditional clustering algorithms is poor. In the paper, we proposed SWOC a service clustering method based on wisdom of crowd. Firstly, by using SWOC we calculated document similarity. Secondly, we implemented a mapping algorithm that reduces the correlation of web services and improve accuracy of method. And then, we applyed different number of clusters using different individual clustering methods that increase the number of partitions so as to enhance the robustness of SWOC. Lastly, the diversity algorithm evaluates and selects the partitions to extract interesting information for the final aggregation with the weight of each individual result. Experiments were performed on the real web service dataset crawled from ProgrammableWeb which prove the accuracy, recall, F-value and stability of proposed method.

随着服务数量和种类的增加，定位满足用户需求的服务变得越来越困难和耗时。服务聚类是一种有效的方法，可以减少查询空间，缩小搜索空间，提高定位满足用户需求的服务的准确性。目前，web服务的聚类方法采用单一或传统的聚类算法。然而，单一或传统聚类算法的准确率和稳定性较差。本文提出了一种基于群体智慧的服务聚类方法——SWOC。首先，利用SWOC计算文档相似度。其次，我们实现了一种映射算法，减少了web服务之间的相关性，提高了方法的准确性。然后，我们使用不同的单个聚类方法，增加分区的数量，从而提高SWOC的鲁棒性。最后，多样性算法评估和选择分区，以每个单独结果的权重提取最终聚合的感兴趣信息。在programableweb上抓取的真实web服务数据集上进行了实验，验证了该方法的准确率、召回率、f值和稳定性。

{"title":"A Service Clustering Method Based on Wisdom of Crowds","authors":"Hui Gao, Karolina K. Dluzniak, Hong Xia, W. Jie, Yanping Chen, Wei Xing, Xin Wang, Zhongmin Wang","doi":"10.1109/BigDataCongress.2019.00026","DOIUrl":"https://doi.org/10.1109/BigDataCongress.2019.00026","url":null,"abstract":"As the number and variety of services increase, it is becoming difficult and time-consuming to locate services that satisfy users' need. Service clustering is efficacious method to prune the query space, to narrow the searching space, and improve the accuracy of locating services that satisfied users' needs. At present, clustering method of web services adopted single or traditional clustering algorithms. However, accuracy and stability of single or traditional clustering algorithms is poor. In the paper, we proposed SWOC a service clustering method based on wisdom of crowd. Firstly, by using SWOC we calculated document similarity. Secondly, we implemented a mapping algorithm that reduces the correlation of web services and improve accuracy of method. And then, we applyed different number of clusters using different individual clustering methods that increase the number of partitions so as to enhance the robustness of SWOC. Lastly, the diversity algorithm evaluates and selects the partitions to extract interesting information for the final aggregation with the weight of each individual result. Experiments were performed on the real web service dataset crawled from ProgrammableWeb which prove the accuracy, recall, F-value and stability of proposed method.","PeriodicalId":335850,"journal":{"name":"2019 IEEE International Congress on Big Data (BigDataCongress)","volume":"30 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-04-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122321280","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 2

Message from the IEEE SERVICES 2019 Steering Committee Chair IEEE服务2019指导委员会主席的讲话

2019 IEEE International Congress on Big Data (BigDataCongress)

Pub Date : 2018-07-01 DOI: 10.1109/services.2018.00006

引用次数: 0

首页上一页

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

2019 IEEE International Congress on Big Data (BigDataCongress)

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀