首页 > 最新文献

Proceedings of the 22nd International Database Engineering & Applications Symposium最新文献

英文 中文
Modeling Customers and Products with Word Embeddings from Receipt Data 利用收据数据中的词嵌入对客户和产品进行建模
Lucas Woltmann, Maik Thiele, Wolfgang Lehner
For many tasks in market research it is important to model customers and products as comparable instances. Usually, the integration of customers and products into one model is not trivial. In this paper, we will detail an approach for a combined vector space of customers and products based on word embeddings learned from receipt data. To highlight the strengths of this approach we propose four different applications: recommender systems, customer and product segmentation and purchase prediction. Experimental results on a real-world dataset with 200M order receipts for 2M customers show that our word embedding approach is promising and helps to improve the quality in these applications scenarios.
对于市场研究中的许多任务来说,将客户和产品作为可比较的实例进行建模是很重要的。通常,将客户和产品集成到一个模型中并不容易。在本文中,我们将详细介绍一种基于从收据数据中学习到的词嵌入的客户和产品组合向量空间的方法。为了突出这种方法的优势,我们提出了四种不同的应用:推荐系统、客户和产品细分以及购买预测。在一个包含200万客户的200万订单收据的真实数据集上的实验结果表明,我们的词嵌入方法很有前途,有助于提高这些应用场景中的质量。
{"title":"Modeling Customers and Products with Word Embeddings from Receipt Data","authors":"Lucas Woltmann, Maik Thiele, Wolfgang Lehner","doi":"10.1145/3216122.3229860","DOIUrl":"https://doi.org/10.1145/3216122.3229860","url":null,"abstract":"For many tasks in market research it is important to model customers and products as comparable instances. Usually, the integration of customers and products into one model is not trivial. In this paper, we will detail an approach for a combined vector space of customers and products based on word embeddings learned from receipt data. To highlight the strengths of this approach we propose four different applications: recommender systems, customer and product segmentation and purchase prediction. Experimental results on a real-world dataset with 200M order receipts for 2M customers show that our word embedding approach is promising and helps to improve the quality in these applications scenarios.","PeriodicalId":422509,"journal":{"name":"Proceedings of the 22nd International Database Engineering & Applications Symposium","volume":"34 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-06-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125101362","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Computing Extensions of Dynamic Abstract Argumentation Frameworks with Second-Order Attacks 二阶攻击下动态抽象论证框架的计算扩展
Gianvincenzo Alfano, S. Greco, F. Parisi
Extended argumentation frameworks (EAFs) extend Dung's argumentation frameworks (AFs) to represent a kind of defeasible attack (by relying on the concept of second-order attack), in addition to the Dung's classical notion of attack between arguments. EAFs can be profitably used to model disputes between agents, with the aim of deciding the sets of arguments (called extensions) that should be accepted to support a point of view in a discussion. However, since new arguments and attacks are often introduced to take into account new available knowledge, EAFs as well as their extensions change over the time. In this paper we tackle the problem of efficiently recomputing extensions of dynamic EAFs under two well-known semantics (i.e., preferred and stable semantics). We introduce an incremental approach that, given an initial EAF, an initial extension for it, and an update, computes an extension of the updated EAF. This is achieved by introducing a meta-argumentation transformation according to which an initial EAF, as well as a given initial extension and an update, is transformed into a plain argumentation framework with a corresponding extension and update. The proposed approach is able to incorporate existing AF-solvers to compute an extension of the updated EAF. The experimental analysis showed that our technique is significantly faster than computing extensions of updated EAFs from scratch.
扩展论证框架(Extended argument frameworks, EAFs)对Dung的论证框架(argument frameworks, AFs)进行了扩展,在Dung经典的论证间攻击概念之外,表示了一种可失败的攻击(依靠二阶攻击的概念)。eaf可以很好地用于对代理之间的争议进行建模,其目的是决定应该接受的一组参数(称为扩展),以支持讨论中的某个观点。但是,由于经常引入新的参数和攻击来考虑新的可用知识,因此eaf及其扩展会随着时间的推移而变化。本文研究了两种已知语义(即首选语义和稳定语义)下动态eaf扩展的有效重计算问题。我们引入一种增量方法,给定初始EAF、初始扩展和更新,计算更新后的EAF的扩展。这是通过引入元论证转换来实现的,根据元论证转换,初始EAF以及给定的初始扩展和更新被转换为具有相应扩展和更新的普通论证框架。所提出的方法能够结合现有的af求解器来计算更新后的EAF的扩展。实验分析表明,我们的技术比从头开始计算更新的EAFs扩展要快得多。
{"title":"Computing Extensions of Dynamic Abstract Argumentation Frameworks with Second-Order Attacks","authors":"Gianvincenzo Alfano, S. Greco, F. Parisi","doi":"10.1145/3216122.3216162","DOIUrl":"https://doi.org/10.1145/3216122.3216162","url":null,"abstract":"Extended argumentation frameworks (EAFs) extend Dung's argumentation frameworks (AFs) to represent a kind of defeasible attack (by relying on the concept of second-order attack), in addition to the Dung's classical notion of attack between arguments. EAFs can be profitably used to model disputes between agents, with the aim of deciding the sets of arguments (called extensions) that should be accepted to support a point of view in a discussion. However, since new arguments and attacks are often introduced to take into account new available knowledge, EAFs as well as their extensions change over the time. In this paper we tackle the problem of efficiently recomputing extensions of dynamic EAFs under two well-known semantics (i.e., preferred and stable semantics). We introduce an incremental approach that, given an initial EAF, an initial extension for it, and an update, computes an extension of the updated EAF. This is achieved by introducing a meta-argumentation transformation according to which an initial EAF, as well as a given initial extension and an update, is transformed into a plain argumentation framework with a corresponding extension and update. The proposed approach is able to incorporate existing AF-solvers to compute an extension of the updated EAF. The experimental analysis showed that our technique is significantly faster than computing extensions of updated EAFs from scratch.","PeriodicalId":422509,"journal":{"name":"Proceedings of the 22nd International Database Engineering & Applications Symposium","volume":"59 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-06-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126144615","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 13
The Science of Science and a Multilayer Network Approach to Scientists' Ranking 科学科学与科学家排名的多层网络方法
G. Sideris, Dimitrios Katsaros, Antonis Sidiropoulos, Y. Manolopoulos
The deluge of data on scholarly output created unique opportunities for identifying the drivers of modern science, for studying career paths of scientists, and for measuring the research performance. These massive data and processing methodologies have given rise to an exciting new field, namely Science of Science (SoS) as the successor of what is called scientometrics or informetrics for many decades. Science of Science is the offspring of the fertile cooperation of many disciplines, such as network science, statistics, machine learning, mathematical analysis, sociology of science and so on. In this article, we provide a comprehensive coverage of recent advances in SoS related to network analysis, prediction and ranking, and investigate the issue of scientist ranking from a multilayer network perspective. Towards this goal, we contrast by experiments the well-known h-index and the recently proposed indicator C3-index to a generalization of PageRank for multilayer networks, namely BiPlex PageRank, which is based on solid tensor analysis. Both the obtained results and the brief survey of SoS will deepen our faith to SoS and stimulate further efforts in this transdisciplinary field.
大量的学术产出数据为确定现代科学的驱动力、研究科学家的职业道路和衡量研究绩效创造了独特的机会。这些大量的数据和处理方法已经产生了一个令人兴奋的新领域,即科学科学(so),作为几十年来所谓的科学计量学或信息计量学的继承者。科学学是网络科学、统计学、机器学习、数学分析、科学社会学等多学科肥沃合作的产物。在本文中,我们全面介绍了网络分析、预测和排名方面的最新进展,并从多层网络的角度探讨了科学家排名问题。为了实现这一目标,我们通过实验将众所周知的h-index和最近提出的指标C3-index与基于实体张量分析的多层网络的PageRank泛化,即BiPlex PageRank进行了对比。无论是获得的结果还是对SoS的简要调查,都将加深我们对SoS的信心,并激发我们在这一跨学科领域的进一步努力。
{"title":"The Science of Science and a Multilayer Network Approach to Scientists' Ranking","authors":"G. Sideris, Dimitrios Katsaros, Antonis Sidiropoulos, Y. Manolopoulos","doi":"10.1145/3216122.3229862","DOIUrl":"https://doi.org/10.1145/3216122.3229862","url":null,"abstract":"The deluge of data on scholarly output created unique opportunities for identifying the drivers of modern science, for studying career paths of scientists, and for measuring the research performance. These massive data and processing methodologies have given rise to an exciting new field, namely Science of Science (SoS) as the successor of what is called scientometrics or informetrics for many decades. Science of Science is the offspring of the fertile cooperation of many disciplines, such as network science, statistics, machine learning, mathematical analysis, sociology of science and so on. In this article, we provide a comprehensive coverage of recent advances in SoS related to network analysis, prediction and ranking, and investigate the issue of scientist ranking from a multilayer network perspective. Towards this goal, we contrast by experiments the well-known h-index and the recently proposed indicator C3-index to a generalization of PageRank for multilayer networks, namely BiPlex PageRank, which is based on solid tensor analysis. Both the obtained results and the brief survey of SoS will deepen our faith to SoS and stimulate further efforts in this transdisciplinary field.","PeriodicalId":422509,"journal":{"name":"Proceedings of the 22nd International Database Engineering & Applications Symposium","volume":"6 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-06-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121297121","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Personalized Keyword Search on Large RDF Graphs based on Pattern Graph Similarity 基于模式图相似度的大型RDF图个性化关键字搜索
S. Sinha, Xinge Lu, D. Theodoratos
The structure of the ever increasing large RDF repositories is too complex to allow non-expert users extract useful information from them. Keyword search is an interesting alternative but in the context of RDF graph data, where query answers are RDF graph fragments, itfaces two major problems: the query quality answer problem and the result computation algorithm scalability problem. In this paper we focus on empowering keyword search on RDF data by exploiting personalized information. We proposean original approach which exploits the structural summary of the RDF graph to generate pattern graphs for the input keyword query. Pattern graphs are structured conjunctive queries and are seen as possible interpretations of the unstructured keyword query. Personalized information is represented as collections of profile graphs, a concept similar to pattern graphs. The ran king of the results is achieved by measuring graph similarity between the user profile graph and the generated pattern graphs. Novel similarity metrics have been introduced which consider intrinsic and extrinsic similarity and take into account both structural and semantic characteristics of the pattern and profile graphs. Effectiveness and efficiency experimental results show that our approach can tackle the two major problems that hinder the widespread use of keyword search on RDF data.
不断增加的大型RDF存储库的结构太复杂,不允许非专业用户从中提取有用的信息。关键字搜索是一种有趣的替代方法,但在RDF图数据的上下文中,查询答案是RDF图片段,它面临两个主要问题:查询质量回答问题和结果计算算法的可伸缩性问题。在本文中,我们着重于通过利用个性化信息来增强RDF数据的关键字搜索能力。我们提出了一种新颖的方法,利用RDF图的结构摘要为输入关键字查询生成模式图。模式图是结构化的连接查询,可以看作是对非结构化关键字查询的可能解释。个性化信息表示为概要图的集合,这是一个类似于模式图的概念。通过测量用户概要图与生成的模式图之间的图相似度来实现结果的排序。引入了新的相似度度量,它考虑了内在和外在相似度,并考虑了模式和轮廓图的结构和语义特征。实验结果表明,我们的方法可以解决阻碍RDF数据上关键字搜索广泛使用的两个主要问题。
{"title":"Personalized Keyword Search on Large RDF Graphs based on Pattern Graph Similarity","authors":"S. Sinha, Xinge Lu, D. Theodoratos","doi":"10.1145/3216122.3216167","DOIUrl":"https://doi.org/10.1145/3216122.3216167","url":null,"abstract":"The structure of the ever increasing large RDF repositories is too complex to allow non-expert users extract useful information from them. Keyword search is an interesting alternative but in the context of RDF graph data, where query answers are RDF graph fragments, itfaces two major problems: the query quality answer problem and the result computation algorithm scalability problem. In this paper we focus on empowering keyword search on RDF data by exploiting personalized information. We proposean original approach which exploits the structural summary of the RDF graph to generate pattern graphs for the input keyword query. Pattern graphs are structured conjunctive queries and are seen as possible interpretations of the unstructured keyword query. Personalized information is represented as collections of profile graphs, a concept similar to pattern graphs. The ran king of the results is achieved by measuring graph similarity between the user profile graph and the generated pattern graphs. Novel similarity metrics have been introduced which consider intrinsic and extrinsic similarity and take into account both structural and semantic characteristics of the pattern and profile graphs. Effectiveness and efficiency experimental results show that our approach can tackle the two major problems that hinder the widespread use of keyword search on RDF data.","PeriodicalId":422509,"journal":{"name":"Proceedings of the 22nd International Database Engineering & Applications Symposium","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-06-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131764845","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
A Cluster-based Approach of Smartphone Camera Fingerprint for User Profiles Resolution within Social Network 基于聚类的智能手机相机指纹在社交网络中的用户档案分辨率
R. Rouhi, Flavio Bertini, D. Montesi
In the last decades, Social Networks (SNs) have deeply changed interactions and habits of the users that are also prone to create more than one profile on the same SN. On the flip side, fake profiles (i.e., impersonating profiles), have become a considerable problem in digital investigations. In this paper, we propose a method for user profiles resolution through a cluster-based approach of the smartphone fingerprints extracted from the images being posted on SNs. The proposed method is thus able to detect fake profiles. To evaluate our approach, we use a real dataset of 1,500 images from 10 different smartphone devices and Facebook and WhatsApp platforms. The results show that the average of sensitivity and specificity for user profiles resolution is about 98%.
在过去的几十年里,社交网络(SNs)已经深刻地改变了用户的互动和习惯,也倾向于在同一个SN上创建多个个人资料。另一方面,虚假资料(即冒充资料)已成为数字调查中的一个相当大的问题。在本文中,我们提出了一种通过基于聚类的方法从发布在SNs上的图像中提取智能手机指纹来解决用户配置文件的方法。因此,所提出的方法能够检测出虚假轮廓。为了评估我们的方法,我们使用了来自10个不同智能手机设备和Facebook和WhatsApp平台的1500张图像的真实数据集。结果表明,该方法对用户档案分辨率的平均灵敏度和特异度约为98%。
{"title":"A Cluster-based Approach of Smartphone Camera Fingerprint for User Profiles Resolution within Social Network","authors":"R. Rouhi, Flavio Bertini, D. Montesi","doi":"10.1145/3216122.3216123","DOIUrl":"https://doi.org/10.1145/3216122.3216123","url":null,"abstract":"In the last decades, Social Networks (SNs) have deeply changed interactions and habits of the users that are also prone to create more than one profile on the same SN. On the flip side, fake profiles (i.e., impersonating profiles), have become a considerable problem in digital investigations. In this paper, we propose a method for user profiles resolution through a cluster-based approach of the smartphone fingerprints extracted from the images being posted on SNs. The proposed method is thus able to detect fake profiles. To evaluate our approach, we use a real dataset of 1,500 images from 10 different smartphone devices and Facebook and WhatsApp platforms. The results show that the average of sensitivity and specificity for user profiles resolution is about 98%.","PeriodicalId":422509,"journal":{"name":"Proceedings of the 22nd International Database Engineering & Applications Symposium","volume":"156 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-06-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134484253","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
The Web of Betrayals 背叛之网
B. Desai
The web was ushered in with great expectations, formally in May 1994, in a conference called World Wide Web I, This event, in hindsight, is sometimes referred to as the Woodstock of the web. The web and Mosaic, the graphical browser, which was announced soon after has revolutionized the internet. For most people, the internet is the web, while one of the monopolist tech-corporations wants the world to view their platforms to be not only the web but the Internet! The web has given rise to a number of rich powerful corporations which did not exist before its advent. The easy to use graphical interface and the cell phone with its tiny screen have become the de-facto interface to all kinds of applications and have provided new methods of communication and connections. The control of all this by a small number of monopolistic corporations, who have amassed last quantities of data on people, has created a situation which has become a web of betrayal of the promise of sharing and providing information, freely. We also consider the remote possibility of a new freer web without monopolies
1994年5月,在一次名为“万维网ⅰ”(World Wide web I)的会议上,人们满怀期待地迎来了网络。事后看来,这次会议有时被称为网络界的伍德斯托克音乐节。网页和马赛克,图形浏览器,不久之后宣布,已经彻底改变了互联网。对大多数人来说,互联网就是网络,而一家垄断的科技公司却希望全世界认为他们的平台不仅是网络,而且是互联网!网络催生了许多在它出现之前并不存在的富有而强大的公司。易于使用的图形界面和小屏幕的手机已经成为各种应用程序的实际接口,并提供了新的通信和连接方式。这一切都被少数垄断公司所控制,他们积累了最后一批关于人们的数据,造成了一种局面,这种局面已经成为一个背叛自由分享和提供信息的承诺的网络。我们还考虑了一个没有垄断的更自由的新网络的遥远可能性
{"title":"The Web of Betrayals","authors":"B. Desai","doi":"10.1145/3216122.3216140","DOIUrl":"https://doi.org/10.1145/3216122.3216140","url":null,"abstract":"The web was ushered in with great expectations, formally in May 1994, in a conference called World Wide Web I, This event, in hindsight, is sometimes referred to as the Woodstock of the web. The web and Mosaic, the graphical browser, which was announced soon after has revolutionized the internet. For most people, the internet is the web, while one of the monopolist tech-corporations wants the world to view their platforms to be not only the web but the Internet! The web has given rise to a number of rich powerful corporations which did not exist before its advent. The easy to use graphical interface and the cell phone with its tiny screen have become the de-facto interface to all kinds of applications and have provided new methods of communication and connections. The control of all this by a small number of monopolistic corporations, who have amassed last quantities of data on people, has created a situation which has become a web of betrayal of the promise of sharing and providing information, freely. We also consider the remote possibility of a new freer web without monopolies","PeriodicalId":422509,"journal":{"name":"Proceedings of the 22nd International Database Engineering & Applications Symposium","volume":"8 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-06-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125207381","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Using declarative programming for network data management in smart grids 在智能电网中使用声明式编程进行网络数据管理
Thi-Thanh-Quynh Nguyen, Christophe Bobineau, V. Debusschere, Quang-Huy Giap, N. Hadjsaid
In the control and management of smart grids, from steady state to real-time, the objective is to handle and to treat any change in the system as fast as possible, with as less resources as possible. In this context, this paper proposes a new language, called Smartlog, designed as declarative programming. Smartlog is developed for distributed computing in real-time and distributed database management. Compared to imperative programming, based on anticipation rather than reaction, the interest is to not ignore the meaning of some data, or to collect and analyze data without interest, and thus loose bandwidth and computational time. Smartlog is designed for operating smart grids, which are defined as abstract structures of large and scalable distributed databases. After its definition, the main features of the Smartlog language are its compactness, its simplicity and its scalability are shown. The language is tested on the application of a frequency and voltage secondary control of an islanded micro-grid in an experimental test-case, using a realtime simulator connected to Raspberry Pis. The characteristics of Smartlog are illustrated thanks to a comparison with an imperative programming implementation of the same regulation.
在智能电网的控制和管理中,从稳态到实时,目标是用尽可能少的资源尽可能快地处理和处理系统中的任何变化。在这种背景下,本文提出了一种称为Smartlog的新语言,它被设计为声明式编程。Smartlog是为实时分布式计算和分布式数据库管理而开发的。与命令式编程相比,命令式编程基于预期而不是反应,兴趣在于不忽略某些数据的含义,或者不感兴趣地收集和分析数据,从而导致带宽和计算时间的减少。智能电网被定义为大型、可扩展的分布式数据库的抽象结构。定义后,Smartlog语言的主要特点是紧凑、简单和可扩展性。在一个实验测试用例中,使用连接到Raspberry Pis的实时模拟器,对该语言在孤岛微电网的频率和电压二次控制中的应用进行了测试。通过与同一规则的命令式编程实现的比较,说明了Smartlog的特点。
{"title":"Using declarative programming for network data management in smart grids","authors":"Thi-Thanh-Quynh Nguyen, Christophe Bobineau, V. Debusschere, Quang-Huy Giap, N. Hadjsaid","doi":"10.1145/3216122.3216160","DOIUrl":"https://doi.org/10.1145/3216122.3216160","url":null,"abstract":"In the control and management of smart grids, from steady state to real-time, the objective is to handle and to treat any change in the system as fast as possible, with as less resources as possible. In this context, this paper proposes a new language, called Smartlog, designed as declarative programming. Smartlog is developed for distributed computing in real-time and distributed database management. Compared to imperative programming, based on anticipation rather than reaction, the interest is to not ignore the meaning of some data, or to collect and analyze data without interest, and thus loose bandwidth and computational time. Smartlog is designed for operating smart grids, which are defined as abstract structures of large and scalable distributed databases. After its definition, the main features of the Smartlog language are its compactness, its simplicity and its scalability are shown. The language is tested on the application of a frequency and voltage secondary control of an islanded micro-grid in an experimental test-case, using a realtime simulator connected to Raspberry Pis. The characteristics of Smartlog are illustrated thanks to a comparison with an imperative programming implementation of the same regulation.","PeriodicalId":422509,"journal":{"name":"Proceedings of the 22nd International Database Engineering & Applications Symposium","volume":"15 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-06-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129364262","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
A framework for the decomposition and features extraction from lung DICOM images 肺DICOM图像的分解与特征提取框架
Pietro Cinaglia, G. Tradigo, G. Cascini, E. Zumpano, P. Veltri
Extracting morphological features from DICOM images is useful to obtain numerical anatomic values for population-wide studies. Currently software tools on medical devices are able to extract some parameters that can indicate the presence of diseases. Nevertheless, there still is a lot of not exploited information contained in images which can be useful for research as well as to characterize human behavior. For instance, measures for lung volume compared with reference data sets can be studied starting from clinical images. In this paper we report preliminary results on a framework for the acquisition and decomposition of DICOM images applied on a dataset containing lung exams from which we extracted information and parameters useful for disease research studies. The here proposed algorithms for images segmentation and anatomical features extraction have been tested on a clinical dataset obtained from University Hospital of Catanzaro, providing the framework validity.
从DICOM图像中提取形态学特征对于获得种群范围研究的数值解剖学值是有用的。目前,医疗设备上的软件工具能够提取一些可以指示疾病存在的参数。然而,仍然有很多未开发的信息包含在图像中,可以用于研究以及表征人类行为。例如,可以从临床图像开始研究肺体积与参考数据集的比较。在本文中,我们报告了DICOM图像采集和分解框架的初步结果,该框架应用于包含肺部检查的数据集,我们从中提取了对疾病研究有用的信息和参数。本文提出的图像分割和解剖特征提取算法已经在Catanzaro大学医院的临床数据集上进行了测试,验证了框架的有效性。
{"title":"A framework for the decomposition and features extraction from lung DICOM images","authors":"Pietro Cinaglia, G. Tradigo, G. Cascini, E. Zumpano, P. Veltri","doi":"10.1145/3216122.3216127","DOIUrl":"https://doi.org/10.1145/3216122.3216127","url":null,"abstract":"Extracting morphological features from DICOM images is useful to obtain numerical anatomic values for population-wide studies. Currently software tools on medical devices are able to extract some parameters that can indicate the presence of diseases. Nevertheless, there still is a lot of not exploited information contained in images which can be useful for research as well as to characterize human behavior. For instance, measures for lung volume compared with reference data sets can be studied starting from clinical images. In this paper we report preliminary results on a framework for the acquisition and decomposition of DICOM images applied on a dataset containing lung exams from which we extracted information and parameters useful for disease research studies. The here proposed algorithms for images segmentation and anatomical features extraction have been tested on a clinical dataset obtained from University Hospital of Catanzaro, providing the framework validity.","PeriodicalId":422509,"journal":{"name":"Proceedings of the 22nd International Database Engineering & Applications Symposium","volume":"474 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-06-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123340100","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
A Step forward for Spatial Skyline Queries for a Group of Users: Semantic in the Evidence Theory Setting 面向一组用户的空间天际线查询:证据理论设置中的语义
Sayda Elmi, Jun-Ki Min
Cities are the main poles of human and economic activity. Analyzing cities data is very important to improve the city economy as well as the life quality of the citizens. Since location based services and GPS devices can easily connect users located in different positions, it is worthwhile to optimize the efficiency of their shifting to a common location according to their preferences. For this reason, the support of advanced analysis queries such as the skyline operator has become important. This later finds the interesting objects according to a user preferences. However, data in such application can be uncertain, imprecise and incomplete. In this paper, we propose an imperfect spatial skyline query for users located in different positions. Detailed experimental analysis are reported. In addition, the theoretical properties developed in this paper help to devise efficient techniques to compute the spatial skyline over uncertain data fora set of users. Our extensive experiments show that the proposed algorithms provide quick initial response time.
城市是人类和经济活动的主要极点。城市数据分析对提高城市经济发展水平和提高市民生活质量具有重要意义。由于基于位置的服务和GPS设备可以很容易地连接位于不同位置的用户,因此根据他们的偏好优化他们转移到公共位置的效率是值得的。由于这个原因,高级分析查询(如skyline操作符)的支持变得非常重要。然后根据用户偏好找到感兴趣的对象。然而,这种应用中的数据可能是不确定的、不精确的和不完整的。本文针对不同位置的用户,提出了一种不完全空间天际线查询方法。详细的实验分析报告。此外,本文发展的理论性质有助于设计有效的技术来计算一组用户在不确定数据上的空间天际线。我们的大量实验表明,提出的算法提供了快速的初始响应时间。
{"title":"A Step forward for Spatial Skyline Queries for a Group of Users: Semantic in the Evidence Theory Setting","authors":"Sayda Elmi, Jun-Ki Min","doi":"10.1145/3216122.3216142","DOIUrl":"https://doi.org/10.1145/3216122.3216142","url":null,"abstract":"Cities are the main poles of human and economic activity. Analyzing cities data is very important to improve the city economy as well as the life quality of the citizens. Since location based services and GPS devices can easily connect users located in different positions, it is worthwhile to optimize the efficiency of their shifting to a common location according to their preferences. For this reason, the support of advanced analysis queries such as the skyline operator has become important. This later finds the interesting objects according to a user preferences. However, data in such application can be uncertain, imprecise and incomplete. In this paper, we propose an imperfect spatial skyline query for users located in different positions. Detailed experimental analysis are reported. In addition, the theoretical properties developed in this paper help to devise efficient techniques to compute the spatial skyline over uncertain data fora set of users. Our extensive experiments show that the proposed algorithms provide quick initial response time.","PeriodicalId":422509,"journal":{"name":"Proceedings of the 22nd International Database Engineering & Applications Symposium","volume":"46 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-06-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124330317","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Efficient Big Data Clustering 高效的大数据集群
M. Ianni, E. Masciari, G. Mazzeo, C. Zaniolo
The need to support advanced analytics on Big Data is driving data scientist' interest toward massively parallel distributed systems and software platforms, such as Map-Reduce and Spark, that make possible their scalable utilization. However, when complex data mining algorithms are required, their fully scalable deployment on such platforms faces a number of technical challenges that grow with the complexity of the algorithms involved. Thus algorithms, that were originally designed for a sequential nature, must often be redesigned in order to effectively use the distributed computational resources. In this paper, we explore these problems, and then propose a solution which has proven to be very effective on the complex hierarchical clustering algorithm CLUBS+. By using four stages of successive refinements, CLUBS+ delivers high-quality clusters of data grouped around their centroids, working in a totally unsupervised fashion. Experimental results confirm the accuracy and scalability of CLUBS+.
支持大数据高级分析的需求正推动数据科学家对大规模并行分布式系统和软件平台的兴趣,如Map-Reduce和Spark,这使得它们的可扩展利用成为可能。然而,当需要复杂的数据挖掘算法时,它们在这样的平台上的完全可伸缩部署面临着许多技术挑战,这些挑战随着所涉及算法的复杂性而增长。因此,为了有效地使用分布式计算资源,必须经常重新设计最初设计用于顺序性质的算法。在本文中,我们对这些问题进行了探讨,并提出了一种解决方案,该方案在复杂的分层聚类算法CLUBS+上被证明是非常有效的。通过四个阶段的连续细化,CLUBS+提供了围绕其质心分组的高质量数据簇,以完全无监督的方式工作。实验结果证实了CLUBS+的准确性和可扩展性。
{"title":"Efficient Big Data Clustering","authors":"M. Ianni, E. Masciari, G. Mazzeo, C. Zaniolo","doi":"10.1145/3216122.3216154","DOIUrl":"https://doi.org/10.1145/3216122.3216154","url":null,"abstract":"The need to support advanced analytics on Big Data is driving data scientist' interest toward massively parallel distributed systems and software platforms, such as Map-Reduce and Spark, that make possible their scalable utilization. However, when complex data mining algorithms are required, their fully scalable deployment on such platforms faces a number of technical challenges that grow with the complexity of the algorithms involved. Thus algorithms, that were originally designed for a sequential nature, must often be redesigned in order to effectively use the distributed computational resources. In this paper, we explore these problems, and then propose a solution which has proven to be very effective on the complex hierarchical clustering algorithm CLUBS+. By using four stages of successive refinements, CLUBS+ delivers high-quality clusters of data grouped around their centroids, working in a totally unsupervised fashion. Experimental results confirm the accuracy and scalability of CLUBS+.","PeriodicalId":422509,"journal":{"name":"Proceedings of the 22nd International Database Engineering & Applications Symposium","volume":"11 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-06-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121110651","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
期刊
Proceedings of the 22nd International Database Engineering & Applications Symposium
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1