Proceedings of the 22nd International Database Engineering & Applications Symposium最新文献

英文中文

A Multiple Instance Learning Algorithm for Color Images Classification 彩色图像分类的多实例学习算法

Proceedings of the 22nd International Database Engineering & Applications Symposium

Pub Date : 2018-06-18 DOI: 10.1145/3216122.3216144

A. Astorino, A. Fuduli, M. Gaudioso, E. Vocaturo

After a brief survey on well established methods for image classification, we focus on a recently proposed Multiple Istance Learning (MIL) method which is suitable for applications in image processing. In particular the method is based on a mixed integer nonlinear formulation of the optimization problem to be solved for MIL purposes. The algorithm is applied to a set of color images (Red, Green, Blue, RGB) with the objective of classifying the images containing some specific pattern. The results of our experimentation are reported.

在简要介绍了现有的图像分类方法之后，我们重点介绍了最近提出的一种适合于图像处理应用的多瞬间学习(MIL)方法。特别地，该方法是基于一个混合整数非线性公式的优化问题，以解决MIL的目的。该算法应用于一组彩色图像(红、绿、蓝、RGB)，目的是对含有特定模式的图像进行分类。报告了我们的实验结果。

引用次数: 24

Data Mining Ancient Script Image Data Using Convolutional Neural Networks 基于卷积神经网络的古文字图像数据挖掘

Proceedings of the 22nd International Database Engineering & Applications Symposium

Pub Date : 2018-06-18 DOI: 10.1145/3216122.3216163

Shruti Daggumati, P. Revesz

The recent surge in ancient scripts has resulted in huge image libraries of ancient texts. Data mining of the collected images enables the study of the evolution of these ancient scripts. In particular, the origin of the Indus Valley script is highly debated. We use convolutional neural networks to test which Phoenician alphabet letters and Brahmi symbols are closest to the Indus Valley script symbols. Surprisingly, our analysis shows that overall the Phoenician alphabet is much closer than the Brahmi script to the Indus Valley script symbols.

近年来，古代文字的激增催生了庞大的古代文本图像库。对收集到的图像进行数据挖掘，可以研究这些古代文字的演变。特别是，印度河流域文字的起源备受争议。我们使用卷积神经网络来测试哪些腓尼基字母和婆罗门符号最接近印度河流域的文字符号。令人惊讶的是，我们的分析表明，总的来说，腓尼基字母比婆罗门文字更接近印度河流域的文字符号。

引用次数: 20

The Inverse Tree-OLAP Problem: Definitions, Models, Complexity Analysis, and a Possible Solution 逆树- olap问题:定义、模型、复杂性分析和可能的解决方案

Proceedings of the 22nd International Database Engineering & Applications Symposium

Pub Date : 2018-06-18 DOI: 10.1145/3216122.3216129

D. Saccá, Edoardo Serra, A. Cuzzocrea

Count constraint is a data dependency that requires the results of given count operations on a relation to be within a certain range. By means of count constraints a new decisional problem, called the Inverse OLAP, has been recently introduced: given a flat fact table, does there exist an instance satisfying a set of given count constraints? This paper focuses on a special case of Inverse OLAP, called Inverse Tree-OLAP, for which the flat fact table key is modeled by a Dimensional Fact Model (DFM) with a tree structure. The count constraints define aggregation patterns to be respected by both the many-to-many relationship among the basic dimensions and the one-to-many relationships within dimension hierarchies. A count constraint is required to have a particular structure so that the problem of handling fact table projections with duplicates is avoided. The simplified structure enables the invention of an effective method for its solution that consists of three main steps: (1) using some of the count constraints to extract a subproblem that is formulated as a known data mining problem (inverse frequent itemset mining), (2) solving the subproblem using a recent method that has been shown to be effective in practical situations also for large size instances and (3) enforcing the remaining count constraints on the solution returned by step 2 using a system of linear equations. The overall proposed approach can be effectively used to generate OLAP cubes for benchmarking that reflect patterns of real datasets.

计数约束是一种数据依赖，它要求对关系进行给定计数操作的结果必须在一定范围内。通过计数约束，最近引入了一个新的决策问题，称为逆OLAP:给定一个平面事实表，是否存在一个实例满足一组给定的计数约束?本文关注的是逆OLAP的一种特殊情况，称为逆树OLAP，其平面事实表键由具有树结构的维度事实模型(DFM)建模。计数约束定义了基本维度之间的多对多关系和维度层次结构中的一对多关系都要遵循的聚合模式。计数约束需要具有特定的结构，以避免处理具有重复项的事实表投影的问题。简化的结构使发明一种有效的解决方法成为可能，该方法由三个主要步骤组成:(1)使用一些计数约束来提取一个子问题，该子问题被表述为已知的数据挖掘问题(逆频繁项集挖掘)，(2)使用最近的方法解决子问题，该方法已被证明在实际情况下也适用于大型实例，(3)使用线性方程组对步骤2返回的解强制执行剩余的计数约束。所提出的总体方法可以有效地用于生成反映真实数据集模式的基准测试的OLAP多维数据集。

{"title":"The Inverse Tree-OLAP Problem: Definitions, Models, Complexity Analysis, and a Possible Solution","authors":"D. Saccá, Edoardo Serra, A. Cuzzocrea","doi":"10.1145/3216122.3216129","DOIUrl":"https://doi.org/10.1145/3216122.3216129","url":null,"abstract":"Count constraint is a data dependency that requires the results of given count operations on a relation to be within a certain range. By means of count constraints a new decisional problem, called the Inverse OLAP, has been recently introduced: given a flat fact table, does there exist an instance satisfying a set of given count constraints? This paper focuses on a special case of Inverse OLAP, called Inverse Tree-OLAP, for which the flat fact table key is modeled by a Dimensional Fact Model (DFM) with a tree structure. The count constraints define aggregation patterns to be respected by both the many-to-many relationship among the basic dimensions and the one-to-many relationships within dimension hierarchies. A count constraint is required to have a particular structure so that the problem of handling fact table projections with duplicates is avoided. The simplified structure enables the invention of an effective method for its solution that consists of three main steps: (1) using some of the count constraints to extract a subproblem that is formulated as a known data mining problem (inverse frequent itemset mining), (2) solving the subproblem using a recent method that has been shown to be effective in practical situations also for large size instances and (3) enforcing the remaining count constraints on the solution returned by step 2 using a system of linear equations. The overall proposed approach can be effectively used to generate OLAP cubes for benchmarking that reflect patterns of real datasets.","PeriodicalId":422509,"journal":{"name":"Proceedings of the 22nd International Database Engineering & Applications Symposium","volume":"21 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-06-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123830775","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Using CalcuList To MapReduce Jsons 使用CalcuList MapReduce json

Proceedings of the 22nd International Database Engineering & Applications Symposium

Pub Date : 2018-06-18 DOI: 10.1145/3216122.3216164

D. Saccá, A. Furfaro

CalcuList (Calculator with List manipulation), is an educational language for teaching functional programming extended with some imperative and side-effect features, which are enabled under explicit request by the programmer. In addition to strings and lists, the language natively supports json objects and may be effectively used to implement generic MapReduce recursive procedures to manipulate json lists. MapReduce is a popular model in distributed computing that underpins many NoSQL systems and a json list can be thought of as a dataset of a document NoSQL datastore. It turns out that CalcuList can be used as a tool for teaching advanced query algorithms for document datastores such as MongoDB and CouchDB.

CalcuList(带有列表操作的计算器)是一种用于教授函数式编程的教育语言，扩展了一些命令式和副作用特性，这些特性在程序员的明确要求下启用。除了字符串和列表，该语言本身也支持json对象，并且可以有效地用于实现通用MapReduce递归过程来操作json列表。MapReduce是分布式计算中的一个流行模型，它支持许多NoSQL系统，json列表可以被认为是文档NoSQL数据存储的数据集。事实证明，CalcuList可以用作教授文档数据存储(如MongoDB和CouchDB)的高级查询算法的工具。

引用次数: 1

WalDis: Mining Discriminative Patterns within Dynamic Graphs WalDis:挖掘动态图中的判别模式

Proceedings of the 22nd International Database Engineering & Applications Symposium

Pub Date : 2018-06-18 DOI: 10.1145/3216122.3216172

Karel Vaculík, L. Popelínský

Real-world networks typically evolve through time, which means there are various events occurring, such as edge additions or attribute changes. In order to understand the events, one must be able to discriminate between different events. Existing approaches typically discriminate whole graphs, which are, in addition, mostly static. We propose a new algorithm WalDis for mining discriminate patterns of events in dynamic graphs. This algorithm uses sampling by random walks and greedy approaches in order to keep the performance high. Furthermore, it does not require the time to be discretized as other algorithms commonly do. We have evaluated the algorithm on three real-world graph datasets.

现实世界的网络通常会随着时间的推移而进化，这意味着会发生各种各样的事件，比如边缘的增加或属性的改变。为了理解事件，人们必须能够区分不同的事件。现有的方法通常区分整个图，另外，这些图大多是静态的。提出了一种新的算法WalDis，用于挖掘动态图中事件的区别模式。该算法采用随机漫步抽样和贪心方法来保持较高的性能。此外，它不像其他算法那样需要时间进行离散化。我们已经在三个真实世界的图形数据集上评估了该算法。

引用次数: 2

CART: A Real-Time Concurrency Control Protocol 实时并发控制协议

Proceedings of the 22nd International Database Engineering & Applications Symposium

Pub Date : 2018-06-18 DOI: 10.1145/3216122.3216161

Sarvesh Pandey, Udai Shanker

The Two Phase Locking with High Priority (2PL-HP) concurrency control protocol addresses the transaction scheduling issue in a distributed real-time database system (DRTDBS). Although the 2PL-HP protocol is free from priority inversion, it may suffer from the problems such as deadlock, cyclic restart, and starvation of lengthy transactions. In this paper, a Controlled Avoidance of deadlock and starvation causing Resourceful Conflict resolution between Transactions (CART) concurrency control protocol has been proposed to minimize the transactions miss percentage by reducing the wastage of system resources through avoiding the deadlock due to controlled locking and starvation to some extent by ensuring a fairness in the allocation of resources for their completion. DRTDBS is simulated and CART outperforms as compared with previous other protocols.

高优先级两阶段锁定(2PL-HP)并发控制协议解决了分布式实时数据库系统(DRTDBS)中的事务调度问题。尽管2PL-HP协议没有优先级反转，但它可能会遇到死锁、循环重启和长时间事务耗尽等问题。本文提出了一种可控避免死锁和饥饿导致的事务间资源冲突解决(resource - conflictresolution between Transactions, CART)并发控制协议，通过保证资源分配的公平性，在一定程度上避免可控锁定和饥饿导致的死锁，从而减少系统资源的浪费，从而最大限度地降低事务错过率。对DRTDBS进行了仿真，CART的性能优于以往的其他协议。

引用次数: 15

Quality awareness for a Successful Big Data Exploitation 大数据开发的质量意识

Proceedings of the 22nd International Database Engineering & Applications Symposium

Pub Date : 2018-06-18 DOI: 10.1145/3216122.3216124

C. Cappiello, Walter Samá, Monica Vitali

The combination of data and technology is having a high impact on the way we live. The world is getting smarter thanks to the quantity of collected and analyzed data. However, it is necessary to consider that such amount of data is continuously increasing and it is necessary to deal with novel requirements related to variety, volume, velocity, and veracity issues. In this paper we focus on veracity that is related to the presence of uncertain or imprecise data: errors, missing or invalid data can compromise the usefulness of the collected values. In such a scenario, new methods and techniques able to evaluate the quality of the available data are needed. In fact, the literature provides many data quality assessment and improvement techniques, especially for structured data, but in the Big Data era new algorithms have to be designed. We aim to provide an overview of the issues and challenges related to Data Quality assessment in the Big Data scenario. We also propose a possible solution developed by considering a smart city case study and we describe the lessons learned in the design and implementation phases.

数据和技术的结合对我们的生活方式产生了很大的影响。由于收集和分析数据的数量，世界变得越来越智能。然而，有必要考虑到这样的数据量是不断增加的，并且有必要处理与种类、数量、速度和准确性问题相关的新需求。在本文中，我们关注与不确定或不精确数据存在相关的准确性:错误，缺失或无效数据可能会损害收集值的有用性。在这种情况下，需要能够评估现有数据质量的新方法和技术。事实上，文献提供了许多数据质量评估和改进技术，特别是对于结构化数据，但在大数据时代，必须设计新的算法。我们的目标是概述与大数据场景中数据质量评估相关的问题和挑战。我们还通过考虑一个智慧城市案例研究提出了一个可能的解决方案，并描述了在设计和实施阶段的经验教训。

引用次数: 23

Secure Reverse k-Nearest Neighbours Search over Encrypted Multi-dimensional Databases 加密多维数据库的安全反向k近邻搜索

Proceedings of the 22nd International Database Engineering & Applications Symposium

Pub Date : 2018-06-18 DOI: 10.1145/3216122.3216170

T. Tzouramanis, Y. Manolopoulos

The reverse k-nearest neighbours search is a fundamental primitive in multi-dimensional (i.e. multi-attribute) databases with applications in location-based services, online recommendations, statistical classification, pat-tern recognition, graph algorithms, computer games development, and so on. Despite the relevance and popularity of the query, no solution has yet been put forward that supports it in encrypted databases while protecting at the same time the privacy of both the data and the queries. With the outsourcing of massive datasets in the cloud, it has become urgent to find ways of ensuring the fast and secure processing of this query in untrustworthy cloud computing. This paper presents searchable encryption schemes which can efficiently and securely enable the processing of the reverse k-nearest neighbours query over encrypted multi-dimensional data, including index-based search schemes which can carry out fast query response that preserves data confidentiality and query privacy. The proposed schemes resist practical attacks operating on the basis of powerful background knowledge and their efficiency is confirmed by a theoretical analysis and extensive simulation experiments.

反向k近邻搜索是多维(即多属性)数据库的基本元素，在基于位置的服务、在线推荐、统计分类、模式识别、图形算法、计算机游戏开发等领域都有应用。尽管该查询的相关性和流行度很高，但目前还没有提出在加密数据库中支持该查询的解决方案，同时保护数据和查询的隐私。随着海量数据集在云上的外包，如何在不可信的云计算中保证查询的快速、安全处理已成为当务之急。本文提出了一种可搜索的加密方案，能够高效、安全地处理加密多维数据的逆k近邻查询，其中包括基于索引的搜索方案，它可以实现快速的查询响应，同时保持数据的机密性和查询隐私性。基于强大的背景知识，所提出的方案能够抵御实际攻击，并通过理论分析和大量的仿真实验验证了其有效性。

{"title":"Secure Reverse k-Nearest Neighbours Search over Encrypted Multi-dimensional Databases","authors":"T. Tzouramanis, Y. Manolopoulos","doi":"10.1145/3216122.3216170","DOIUrl":"https://doi.org/10.1145/3216122.3216170","url":null,"abstract":"The reverse k-nearest neighbours search is a fundamental primitive in multi-dimensional (i.e. multi-attribute) databases with applications in location-based services, online recommendations, statistical classification, pat-tern recognition, graph algorithms, computer games development, and so on. Despite the relevance and popularity of the query, no solution has yet been put forward that supports it in encrypted databases while protecting at the same time the privacy of both the data and the queries. With the outsourcing of massive datasets in the cloud, it has become urgent to find ways of ensuring the fast and secure processing of this query in untrustworthy cloud computing. This paper presents searchable encryption schemes which can efficiently and securely enable the processing of the reverse k-nearest neighbours query over encrypted multi-dimensional data, including index-based search schemes which can carry out fast query response that preserves data confidentiality and query privacy. The proposed schemes resist practical attacks operating on the basis of powerful background knowledge and their efficiency is confirmed by a theoretical analysis and extensive simulation experiments.","PeriodicalId":422509,"journal":{"name":"Proceedings of the 22nd International Database Engineering & Applications Symposium","volume":"24 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-06-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133918862","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 9

MamaBot: a System based on ML and NLP for supporting Women and Families during Pregnancy MamaBot:一个基于ML和NLP的系统，用于支持怀孕期间的妇女和家庭

Proceedings of the 22nd International Database Engineering & Applications Symposium

Pub Date : 2018-06-18 DOI: 10.1145/3216122.3216173

L. Vaira, Mario Alessandro Bochicchio, Matteo Conte, Francesco Margiotta Casaluci, A. Melpignano

Artificial intelligence is transforming healthcare with a profound paradigm shift impacting diagnostic techniques, drug discovery, health analytics, interventions and much more. In this paper we focus on exploiting AI-based chatbot systems, mainly based on machine learning algorithms and Natural Language Processing, to understand and respond to needs of patients and their families. In particular, we describe an application scenario for an AI-chatbot delivering support to pregnant women, mothers, and families with young children, by giving them help and instructions in relevant situations.

人工智能正在改变医疗保健，对诊断技术、药物发现、健康分析、干预等产生深远的影响。在本文中，我们重点开发基于人工智能的聊天机器人系统，主要基于机器学习算法和自然语言处理，以了解和响应患者及其家属的需求。特别地，我们描述了一个ai聊天机器人的应用场景，通过在相关情况下为孕妇、母亲和有小孩的家庭提供帮助和指导，为他们提供支持。

引用次数: 37

A Context-driven Querying System for Urban Graph Analysis 城市图分析的上下文驱动查询系统

Proceedings of the 22nd International Database Engineering & Applications Symposium

Pub Date : 2018-06-18 DOI: 10.1145/3216122.3216148

Jacques Chabin, L. Gomes, Mirian Halfeld-Ferrari

This paper presents a context-driven query system for urban computing where users are responsible for defining their own restrictions over which datalog-like queries are built. Instead of imposing constraints on databases, our goal is to filter consistent data during the query process. Our query language is able to express aggregates in recursive rules, allowing it to capture network properties typical of graph analysis. This paper presents our query system and analyzes its capabilities using use cases in Urban Computing.

本文提出了一个用于城市计算的上下文驱动查询系统，其中用户负责定义他们自己的限制，在这些限制上构建类似数据的查询。我们的目标是在查询过程中过滤一致的数据，而不是对数据库施加约束。我们的查询语言能够在递归规则中表达聚合，允许它捕获图分析的典型网络属性。本文介绍了我们的查询系统，并通过城市计算中的用例分析了它的功能。

引用次数: 2

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

Proceedings of the 22nd International Database Engineering & Applications Symposium

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀