首页 > 最新文献

2010 Second International Conference on Advances in Databases, Knowledge, and Data Applications最新文献

英文 中文
Adaptation of Apriori to MapReduce to Build a Warehouse of Relations between Named Entities across the Web 将Apriori应用于MapReduce以构建跨Web命名实体之间的关系仓库
Jean-Daniel Cryans, S. Ratté, R. Champagne
The Semantic Web has made possible the use of the Internet to extract useful content, a task that could necessitate an infrastructure across the Web. With Hadoop, a free implementation of the MapReduce programming paradigm created by Google, we can treat these data reliably over hundreds of servers. This article describes how the Apriori algorithm was adapted to MapReduce in the search for relations between entities to deal with thousands of Web pages coming from RSS feeds daily. First, every feed is looked up five times per day and each entry is registered in a database with MapReduce. Second, the entries are read and their content sent to the Web service OpenCalais for the detection of named entities. For each Web page, the set of all itemsets found is generated and stored in the database. Third, all generated sets, from first to last, are counted and their support is registered. Finally, various analytical tasks are executed to present the relationships found. Our tests show that the third step, executed over 3,000,000 sets, was 4.5 times faster using five servers than using a single machine. This approach allows us to easily and automatically distribute treatments on as many machines as are available, and be able to process datasets that one server, even a very powerful one, would not be able to manage alone. We believe that this work is a step forward in processing semantic Web data efficiently and effectively.
语义网使得利用因特网提取有用的内容成为可能,这项任务可能需要一个跨Web的基础设施。有了Hadoop (Google创建的MapReduce编程范例的免费实现),我们可以在数百台服务器上可靠地处理这些数据。本文描述了如何将Apriori算法应用于MapReduce来搜索实体之间的关系,以处理每天来自RSS提要的数千个Web页面。首先,每个提要每天被查找五次,每个条目都用MapReduce在数据库中注册。其次,读取条目并将其内容发送到Web服务OpenCalais以检测命名实体。对于每个Web页面,生成所有找到的项目集并将其存储在数据库中。第三,对所有生成的集合从头到尾进行计数,并对其支持度进行登记。最后,执行各种分析任务来表示所发现的关系。我们的测试表明,使用五台服务器执行超过3,000,000组的第三步比使用一台机器快4.5倍。这种方法使我们能够轻松、自动地在尽可能多的机器上分发处理方法,并且能够处理一台服务器(即使是非常强大的服务器)无法单独管理的数据集。我们相信这项工作是在高效和有效地处理语义Web数据方面向前迈出的一步。
{"title":"Adaptation of Apriori to MapReduce to Build a Warehouse of Relations between Named Entities across the Web","authors":"Jean-Daniel Cryans, S. Ratté, R. Champagne","doi":"10.1109/DBKDA.2010.34","DOIUrl":"https://doi.org/10.1109/DBKDA.2010.34","url":null,"abstract":"The Semantic Web has made possible the use of the Internet to extract useful content, a task that could necessitate an infrastructure across the Web. With Hadoop, a free implementation of the MapReduce programming paradigm created by Google, we can treat these data reliably over hundreds of servers. This article describes how the Apriori algorithm was adapted to MapReduce in the search for relations between entities to deal with thousands of Web pages coming from RSS feeds daily. First, every feed is looked up five times per day and each entry is registered in a database with MapReduce. Second, the entries are read and their content sent to the Web service OpenCalais for the detection of named entities. For each Web page, the set of all itemsets found is generated and stored in the database. Third, all generated sets, from first to last, are counted and their support is registered. Finally, various analytical tasks are executed to present the relationships found. Our tests show that the third step, executed over 3,000,000 sets, was 4.5 times faster using five servers than using a single machine. This approach allows us to easily and automatically distribute treatments on as many machines as are available, and be able to process datasets that one server, even a very powerful one, would not be able to manage alone. We believe that this work is a step forward in processing semantic Web data efficiently and effectively.","PeriodicalId":273177,"journal":{"name":"2010 Second International Conference on Advances in Databases, Knowledge, and Data Applications","volume":"9 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-04-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122411687","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 24
Performance Evaluation of an Optimistic Concurrency Control Algorithm for Temporal Databases 一种面向时态数据库的乐观并发控制算法的性能评价
Achraf Makni, R. Bouaziz
We propose in this paper a performance study of an access concurrency control algorithm for temporal databases. This algorithm is based on the optimistic approach, which is, in our opinion, more suitable for temporal databases than the pessimistic methods. Indeed, our optimistic algorithm, in the contrary to the pessimistic ones, can exploit the temporal specifications to reduce the granule size and then to minimize the conflict degree. Moreover, it can detect, as soon as possible, all the conflict cases. By using the end of transaction marker technique, it has the merit to reduce to the maximum the period during which resources are locked in the validation phase. By carrying out a formal verification, based first on the serialization theory and next on the SPIN model checker, we have ensured that our algorithm operate correctly. Now, we proceed to its experimental evaluation vis-à-vis of other well-known concurrency control mechanisms based on an optimistic and pessimistic approach.
本文提出了一种时态数据库访问并发控制算法的性能研究。该算法基于乐观方法,我们认为乐观方法比悲观方法更适合于时态数据库。事实上,与悲观算法相反,我们的乐观算法可以利用时间规范来减小颗粒大小,从而最小化冲突程度。此外,它还可以尽快发现所有的冲突情况。通过使用事务结束标记技术,它的优点是最大限度地减少了资源在验证阶段被锁定的时间。通过首先基于序列化理论,然后基于SPIN模型检查器进行形式化验证,我们确保了算法的正确运行。现在,我们根据乐观和悲观的方法对其他知名的并发控制机制进行实验评估-à-vis。
{"title":"Performance Evaluation of an Optimistic Concurrency Control Algorithm for Temporal Databases","authors":"Achraf Makni, R. Bouaziz","doi":"10.1109/DBKDA.2010.41","DOIUrl":"https://doi.org/10.1109/DBKDA.2010.41","url":null,"abstract":"We propose in this paper a performance study of an access concurrency control algorithm for temporal databases. This algorithm is based on the optimistic approach, which is, in our opinion, more suitable for temporal databases than the pessimistic methods. Indeed, our optimistic algorithm, in the contrary to the pessimistic ones, can exploit the temporal specifications to reduce the granule size and then to minimize the conflict degree. Moreover, it can detect, as soon as possible, all the conflict cases. By using the end of transaction marker technique, it has the merit to reduce to the maximum the period during which resources are locked in the validation phase. By carrying out a formal verification, based first on the serialization theory and next on the SPIN model checker, we have ensured that our algorithm operate correctly. Now, we proceed to its experimental evaluation vis-à-vis of other well-known concurrency control mechanisms based on an optimistic and pessimistic approach.","PeriodicalId":273177,"journal":{"name":"2010 Second International Conference on Advances in Databases, Knowledge, and Data Applications","volume":"52 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-04-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133431795","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Modeling Topological Relations between Uncertain Spatial Regions in Geo-spatial Databases: Uncertain Intersection and Difference Topological Model 地理空间数据库中不确定空间区域间拓扑关系建模:不确定交集与差分拓扑模型
A. Alboody, F. Sèdes, J. Inglada
Topological relations have played important roles in spatial query, analysis and reasoning in Geographic Information Systems (GIS) and geospatial databases. The topological relations between crisp, uncertain and fuzzy spatial regions based upon the 9-intersections model have been identified. The research issue of topological relations, particularly, between spatial regions with uncertainties, has gained a lot of attention during the past two decades. However, the formal representation and calculation of the topological relations between uncertain regions is still an open issue and needs to be further developed. The paper provides a theoretical framework for modeling topological relations between uncertain spatial regions based upon a new uncertain topological model called the Uncertain Intersection and Difference (UID) Model. In order to derive all topological relations between two spatial regions with uncertainties, the spatial object of type Region (A) is decomposed in four components: the Interior, the Interior’s Boundary, the Object’s Boundary, and the Exterior’s Boundary of A. By use of this definition of spatial region with uncertainties, new 4*4-Intersection and Uncertain Intersection and Difference (UID) models are proposed as a qualitative model for the identification of all topological relations between two spatial regions with uncertainties. These two new models are compared with other models studied in the literature. 152 binary topological relations can be identified by these two models. Then, the topological complexity and distance of the 152 relations will be study in details by using the UID model. Based upon this study of topological complexity and distance, a conceptual neighborhood graph for the 152 relations can be obtained. Examples are provided to illustrate the utility of these two models presented in this paper with results which can be applied for modeling GIS, geospatial databases and satellite image processing.
拓扑关系在地理信息系统(GIS)和地理空间数据库的空间查询、分析和推理中发挥着重要作用。基于9交模型,确定了清晰、不确定和模糊空间区域之间的拓扑关系。在过去的二十年里,拓扑关系的研究,特别是不确定空间区域之间的拓扑关系的研究得到了广泛的关注。然而,不确定区域之间拓扑关系的形式化表示和计算仍然是一个开放的问题,需要进一步发展。本文提出了一种新的不确定拓扑模型——不确定交差模型(UID),为不确定空间区域间拓扑关系建模提供了理论框架。为了导出两个不确定空间区域之间的所有拓扑关系,将类型为Region (A)的空间对象分解为四个分量:a的内部、内部边界、对象边界和外部边界。利用这一不确定空间区域的定义,提出了新的4*4-交集和不确定交集与差分(UID)模型,作为识别两个不确定空间区域之间所有拓扑关系的定性模型。将这两个新模型与文献中研究的其他模型进行了比较。这两个模型可以识别出152个二元拓扑关系。然后,利用UID模型详细研究152个关系的拓扑复杂度和距离。基于拓扑复杂度和距离的研究,得到了152个关系的概念邻域图。通过实例说明了本文所提出的两种模型的实用性,其结果可用于GIS建模、地理空间数据库和卫星图像处理。
{"title":"Modeling Topological Relations between Uncertain Spatial Regions in Geo-spatial Databases: Uncertain Intersection and Difference Topological Model","authors":"A. Alboody, F. Sèdes, J. Inglada","doi":"10.1109/DBKDA.2010.28","DOIUrl":"https://doi.org/10.1109/DBKDA.2010.28","url":null,"abstract":"Topological relations have played important roles in spatial query, analysis and reasoning in Geographic Information Systems (GIS) and geospatial databases. The topological relations between crisp, uncertain and fuzzy spatial regions based upon the 9-intersections model have been identified. The research issue of topological relations, particularly, between spatial regions with uncertainties, has gained a lot of attention during the past two decades. However, the formal representation and calculation of the topological relations between uncertain regions is still an open issue and needs to be further developed. The paper provides a theoretical framework for modeling topological relations between uncertain spatial regions based upon a new uncertain topological model called the Uncertain Intersection and Difference (UID) Model. In order to derive all topological relations between two spatial regions with uncertainties, the spatial object of type Region (A) is decomposed in four components: the Interior, the Interior’s Boundary, the Object’s Boundary, and the Exterior’s Boundary of A. By use of this definition of spatial region with uncertainties, new 4*4-Intersection and Uncertain Intersection and Difference (UID) models are proposed as a qualitative model for the identification of all topological relations between two spatial regions with uncertainties. These two new models are compared with other models studied in the literature. 152 binary topological relations can be identified by these two models. Then, the topological complexity and distance of the 152 relations will be study in details by using the UID model. Based upon this study of topological complexity and distance, a conceptual neighborhood graph for the 152 relations can be obtained. Examples are provided to illustrate the utility of these two models presented in this paper with results which can be applied for modeling GIS, geospatial databases and satellite image processing.","PeriodicalId":273177,"journal":{"name":"2010 Second International Conference on Advances in Databases, Knowledge, and Data Applications","volume":"92 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-04-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128598909","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 10
Analysis of the Quality of Life after an Endoscopic Thoracic Sympathectomy: A Business Intelligence Approach 内窥镜胸椎交感神经切除术后的生活质量分析:商业智能方法
D. Goncalves, M. Y. Santos, Jorge Cruz
Primary hyperhidrosis, a disorder characterized by an excessive sweating, has been treated by endoscopic thoracic sympathectomy. As a consequence of the surgery, patients improved their overall quality of life. Their day-by-day activities are not affected, or are less affected, by this disorder, and their emotional state verifies a significant improvement, from a situation of shame and self-punishing to what we could say a normal life. This paper presents the analysis of the quality of life of 227 patients that were treated by an endoscopic thoracic sympathectomy. The study was based on the use of business intelligence technologies, which allowed the storage, the analysis and the reporting of all the relevant findings. In technological terms, this paper illustrates the database and data analysis developments needed in a specific healthcare application domain. For data storage, a data mart was designed addressing the relevant attributes. For data analysis, on-line analytical processing and data mining technologies were used to show the evolution of the patients’, health condition and the incidence of complications or side effects as consequence of the surgery.
原发性多汗症是一种以过度出汗为特征的疾病,经内窥镜胸椎交感神经切除术治疗。手术的结果是,患者的整体生活质量得到了改善。他们的日常活动不受这种疾病的影响,或者受到的影响较小,他们的情绪状态也有了显著的改善,从羞耻和自我惩罚的状态变成了我们所说的正常生活。本文分析了227例经内窥镜胸椎交感神经切除术治疗的患者的生活质量。这项研究基于商业智能技术的使用,它允许存储、分析和报告所有相关的发现。在技术方面,本文说明了特定医疗保健应用领域所需的数据库和数据分析开发。对于数据存储,设计了处理相关属性的数据集市。在数据分析方面,使用在线分析处理和数据挖掘技术来显示患者的演变,健康状况以及手术并发症或副作用的发生率。
{"title":"Analysis of the Quality of Life after an Endoscopic Thoracic Sympathectomy: A Business Intelligence Approach","authors":"D. Goncalves, M. Y. Santos, Jorge Cruz","doi":"10.1109/DBKDA.2010.12","DOIUrl":"https://doi.org/10.1109/DBKDA.2010.12","url":null,"abstract":"Primary hyperhidrosis, a disorder characterized by an excessive sweating, has been treated by endoscopic thoracic sympathectomy. As a consequence of the surgery, patients improved their overall quality of life. Their day-by-day activities are not affected, or are less affected, by this disorder, and their emotional state verifies a significant improvement, from a situation of shame and self-punishing to what we could say a normal life. This paper presents the analysis of the quality of life of 227 patients that were treated by an endoscopic thoracic sympathectomy. The study was based on the use of business intelligence technologies, which allowed the storage, the analysis and the reporting of all the relevant findings. In technological terms, this paper illustrates the database and data analysis developments needed in a specific healthcare application domain. For data storage, a data mart was designed addressing the relevant attributes. For data analysis, on-line analytical processing and data mining technologies were used to show the evolution of the patients&#8217, health condition and the incidence of complications or side effects as consequence of the surgery.","PeriodicalId":273177,"journal":{"name":"2010 Second International Conference on Advances in Databases, Knowledge, and Data Applications","volume":"7 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-04-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130807428","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Towards a Discovering Knowledge Comprehensible and Exploitable by the End-User 面向最终用户可理解和可利用的发现知识
A. Touzi
The main goal to extract knowledge in database is to help the user to give semantics of data and to optimize the information research. Unfortunately, this fundamental constraint is not taken into account by almost all the approaches for knowledge discovery. Indeed, these approaches generate a big number of rules that are not easily assimilated by the human brain. In this paper, we propose a new approach for Knowledge Discovery in Databases through the fusion of conceptual clustering, fuzzy logic, and formal concept analysis. While basing on the hierarchical structure offered by the lattices, we proceed to discover the Knowledge in a hierarchical way. Thus, according to the degree of detail required by the user, this approach proposes a level of knowledge and different views of this knowledge, so the user can easily exploit all knowledge generated. Moreover, this solution is extensible, the user is able to choose the fuzzy method of classification according to the domain of his data and his needs.
从数据库中提取知识的主要目的是帮助用户给出数据的语义和优化信息研究。不幸的是,几乎所有的知识发现方法都没有考虑到这一基本约束。事实上,这些方法产生了大量不容易被人脑吸收的规则。本文提出了一种融合概念聚类、模糊逻辑和形式化概念分析的数据库知识发现新方法。在网格提供层次结构的基础上,对知识进行分层发现。因此,根据用户所需的细节程度,该方法提出了一个知识层次和对该知识的不同看法,因此用户可以很容易地利用所生成的所有知识。此外,该方案具有可扩展性,用户可以根据自己的数据领域和需要选择模糊分类方法。
{"title":"Towards a Discovering Knowledge Comprehensible and Exploitable by the End-User","authors":"A. Touzi","doi":"10.1109/DBKDA.2010.36","DOIUrl":"https://doi.org/10.1109/DBKDA.2010.36","url":null,"abstract":"The main goal to extract knowledge in database is to help the user to give semantics of data and to optimize the information research. Unfortunately, this fundamental constraint is not taken into account by almost all the approaches for knowledge discovery. Indeed, these approaches generate a big number of rules that are not easily assimilated by the human brain. In this paper, we propose a new approach for Knowledge Discovery in Databases through the fusion of conceptual clustering, fuzzy logic, and formal concept analysis. While basing on the hierarchical structure offered by the lattices, we proceed to discover the Knowledge in a hierarchical way. Thus, according to the degree of detail required by the user, this approach proposes a level of knowledge and different views of this knowledge, so the user can easily exploit all knowledge generated. Moreover, this solution is extensible, the user is able to choose the fuzzy method of classification according to the domain of his data and his needs.","PeriodicalId":273177,"journal":{"name":"2010 Second International Conference on Advances in Databases, Knowledge, and Data Applications","volume":"189 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-04-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121204995","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Failure-Tolerant Transaction Routing at Large Scale 大规模容错事务路由
Idrissa Sarr, Hubert Naacke, Stéphane Gançarski
Emerging Web2.0 applications such as virtual worlds or social networking websites strongly differ from usual OLTP applications. First, the transactions are encapsulated in an API such that it is possible to know which data a transaction will access, before processing it. Second, the simultaneous transactions are very often commutative since they access distinct data. Anticipating that the workload of such applications will quickly reach thousands of transactions per seconds, we envision a novel solution that would allow these applications to scale-up without the need to buy expensive resources at a data center. To this end, databases are replicated over a P2P infrastructure for achieving high availability and fast transaction processing thanks to parallelism. However, achieving both fast and consistent data access on such architectures is challenging at many points. In particular, centralized control is prohibited because of its vulnerability and lack of efficiency at large scale. Moreover dynamic behavior of nodes, which can join and leave the system at anytime and frequently, can compromise mutual consistency. In this article, we propose a failure-tolerant solution for the distributed control of transaction routing in a large scale network. We leverage a fully distributed approach relying on a DHT to handle routing metadata, with a suitable failure management mechanism that handles nodes dynamicity and nodes failures. Moreover, we demonstrate the feasibility of our transaction routing implementation through experimentation and the effectiveness of our failure management approach through simulation.
新兴的Web2.0应用程序(如虚拟世界或社交网络网站)与通常的OLTP应用程序有很大不同。首先,将事务封装在API中,这样就可以在处理事务之前知道事务将访问哪些数据。其次,并发事务通常是可交换的,因为它们访问不同的数据。预计此类应用程序的工作负载将迅速达到每秒数千个事务,我们设想了一种新颖的解决方案,该解决方案将允许这些应用程序扩展,而无需在数据中心购买昂贵的资源。为此,数据库在P2P基础设施上进行复制,通过并行性实现高可用性和快速事务处理。然而,在这样的架构上实现快速和一致的数据访问在很多方面都是具有挑战性的。特别是集中控制,由于其脆弱性和大规模缺乏效率而被禁止。此外,节点的动态行为可以在任何时间和频繁地加入和离开系统,这可能会损害相互的一致性。在本文中,我们提出了一个大规模网络中事务路由分布式控制的容错解决方案。我们利用完全分布式的方法,依靠DHT来处理路由元数据,并使用合适的故障管理机制来处理节点动态和节点故障。此外,我们通过实验验证了事务路由实现的可行性,并通过仿真验证了故障管理方法的有效性。
{"title":"Failure-Tolerant Transaction Routing at Large Scale","authors":"Idrissa Sarr, Hubert Naacke, Stéphane Gançarski","doi":"10.1109/DBKDA.2010.9","DOIUrl":"https://doi.org/10.1109/DBKDA.2010.9","url":null,"abstract":"Emerging Web2.0 applications such as virtual worlds or social networking websites strongly differ from usual OLTP applications. First, the transactions are encapsulated in an API such that it is possible to know which data a transaction will access, before processing it. Second, the simultaneous transactions are very often commutative since they access distinct data. Anticipating that the workload of such applications will quickly reach thousands of transactions per seconds, we envision a novel solution that would allow these applications to scale-up without the need to buy expensive resources at a data center. To this end, databases are replicated over a P2P infrastructure for achieving high availability and fast transaction processing thanks to parallelism. However, achieving both fast and consistent data access on such architectures is challenging at many points. In particular, centralized control is prohibited because of its vulnerability and lack of efficiency at large scale. Moreover dynamic behavior of nodes, which can join and leave the system at anytime and frequently, can compromise mutual consistency. In this article, we propose a failure-tolerant solution for the distributed control of transaction routing in a large scale network. We leverage a fully distributed approach relying on a DHT to handle routing metadata, with a suitable failure management mechanism that handles nodes dynamicity and nodes failures. Moreover, we demonstrate the feasibility of our transaction routing implementation through experimentation and the effectiveness of our failure management approach through simulation.","PeriodicalId":273177,"journal":{"name":"2010 Second International Conference on Advances in Databases, Knowledge, and Data Applications","volume":"36 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-04-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122726644","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
A Situational Resource Rating System 情境资源评级系统
Raphaël Thollot, Marie-Aude Aufaure
Recommendation technologies are considered a major technological trend in both industrial and academic environments. This growing interest was highlighted by, e.g., the Netflix prize which generated an intense competition. Recommender systems are crucial to support users and help them by suggesting resources relevant at a given instant. On the other hand, these systems are a core piece of e-commerce web sites, since they aim at generating more sales by encouraging users to buy more items. However, recommender systems are often designed to work with very specific types of resources, and they hardly take into account the current user’s situation. In this paper, we present our approach to augment an existing recommender system with a situation model. On top of this model, we define a situational interest measure to estimate a user’s interest for a resource, which we demonstrate with a prototypical implementation.
推荐技术被认为是工业和学术环境中的一个主要技术趋势。这种日益增长的兴趣在Netflix奖等引起激烈竞争的奖项中得到了突出体现。推荐系统对于支持用户并通过在给定时刻推荐相关资源来帮助他们至关重要。另一方面,这些系统是电子商务网站的核心部分,因为它们的目标是通过鼓励用户购买更多的商品来产生更多的销售额。然而,推荐系统通常被设计为处理非常特定类型的资源,它们几乎没有考虑到当前用户的情况。在本文中,我们提出了用情境模型增强现有推荐系统的方法。在这个模型之上,我们定义了一个情景兴趣度量来估计用户对资源的兴趣,我们用一个原型实现来演示。
{"title":"A Situational Resource Rating System","authors":"Raphaël Thollot, Marie-Aude Aufaure","doi":"10.1109/DBKDA.2010.31","DOIUrl":"https://doi.org/10.1109/DBKDA.2010.31","url":null,"abstract":"Recommendation technologies are considered a major technological trend in both industrial and academic environments. This growing interest was highlighted by, e.g., the Netflix prize which generated an intense competition. Recommender systems are crucial to support users and help them by suggesting resources relevant at a given instant. On the other hand, these systems are a core piece of e-commerce web sites, since they aim at generating more sales by encouraging users to buy more items. However, recommender systems are often designed to work with very specific types of resources, and they hardly take into account the current user’s situation. In this paper, we present our approach to augment an existing recommender system with a situation model. On top of this model, we define a situational interest measure to estimate a user’s interest for a resource, which we demonstrate with a prototypical implementation.","PeriodicalId":273177,"journal":{"name":"2010 Second International Conference on Advances in Databases, Knowledge, and Data Applications","volume":"51 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-04-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131417335","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Efficient Maintenance of k-Dominant Skyline for Frequently Updated Database 频繁更新数据库中k-Dominant Skyline的高效维护
M. A. Siddique, Y. Morimoto
Skyline queries retrieve a set of skyline objects so that the user can choose promising objects from them and make further inquiries. However, a skyline query often retrieves too many objects to analyze intensively. To solve the problem, k-dominant skyline queries have been introduced, which can reduce the number of retrieved objects by relaxing the definition of the dominance. Though it can reduce the number of retrieved objects, the k-dominant skyline objects are difficult to maintain if the database is updated. This paper addresses the problem of maintenance of k-dominant skyline objects of frequently updated database. We propose an algorithm for maintaining k-dominant skyline objects. Intensive experiments using real and synthetic datasets demonstrated that our method is efficient and scalable.
Skyline查询检索一组Skyline对象,以便用户可以从中选择有希望的对象并进行进一步查询。然而,skyline查询通常会检索到太多的对象,无法进行深入分析。为了解决这个问题,引入了k主导的天际线查询,它可以通过放松主导的定义来减少检索对象的数量。虽然它可以减少检索对象的数量,但如果数据库更新,则k主导的skyline对象难以维护。本文研究了频繁更新数据库中k-显性天际线对象的维护问题。我们提出了一种维持k-显性天际线物体的算法。使用真实数据集和合成数据集进行的大量实验表明,我们的方法是有效的和可扩展的。
{"title":"Efficient Maintenance of k-Dominant Skyline for Frequently Updated Database","authors":"M. A. Siddique, Y. Morimoto","doi":"10.1109/DBKDA.2010.16","DOIUrl":"https://doi.org/10.1109/DBKDA.2010.16","url":null,"abstract":"Skyline queries retrieve a set of skyline objects so that the user can choose promising objects from them and make further inquiries. However, a skyline query often retrieves too many objects to analyze intensively. To solve the problem, k-dominant skyline queries have been introduced, which can reduce the number of retrieved objects by relaxing the definition of the dominance. Though it can reduce the number of retrieved objects, the k-dominant skyline objects are difficult to maintain if the database is updated. This paper addresses the problem of maintenance of k-dominant skyline objects of frequently updated database. We propose an algorithm for maintaining k-dominant skyline objects. Intensive experiments using real and synthetic datasets demonstrated that our method is efficient and scalable.","PeriodicalId":273177,"journal":{"name":"2010 Second International Conference on Advances in Databases, Knowledge, and Data Applications","volume":"61 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-04-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114750171","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 10
Intelligent Network Communications for Distributed Database Systems 分布式数据库系统的智能网络通信
I. Hababeh
Customizing network sites have become an increasingly important issue in distributed database systems. This will improve the network system performance by reducing the number of communications required for query processing in terms of retrieval and update transactions. This paper presents an intelligent clustering method for distributed database system that provides a structure for organizing large number of network sites into a set of useful clusters to minimize transactions processing communications. It has been designed to divide the database network sites into a set of disjoint clusters based on a high performance clustering technique. This can reduce the amount of redundant data to be accessed and transferred among different sites, definitely increase the transaction performance, significantly improve database system response time, and result in better distributed network decision support. Experimental validations on real database applications at different networks connectivity are performed and the results demonstrate that the proposed method leads to precise solutions for the problems of data communication, allocation, and redundancy.
在分布式数据库系统中,定制网络站点已成为一个日益重要的问题。这将通过减少查询处理在检索和更新事务方面所需的通信数量来提高网络系统性能。本文提出了一种用于分布式数据库系统的智能集群方法,该方法提供了一种结构,可以将大量的网络站点组织成一组有用的集群,以减少事务处理通信。基于高性能的集群技术,将数据库网络站点划分为一组不相交的集群。这可以减少需要在不同站点之间访问和传输的冗余数据量,明显提高事务性能,显著改善数据库系统响应时间,并提供更好的分布式网络决策支持。在不同网络连接的实际数据库应用中进行了实验验证,结果表明该方法可以精确地解决数据通信、分配和冗余问题。
{"title":"Intelligent Network Communications for Distributed Database Systems","authors":"I. Hababeh","doi":"10.1109/DBKDA.2010.11","DOIUrl":"https://doi.org/10.1109/DBKDA.2010.11","url":null,"abstract":"Customizing network sites have become an increasingly important issue in distributed database systems. This will improve the network system performance by reducing the number of communications required for query processing in terms of retrieval and update transactions. This paper presents an intelligent clustering method for distributed database system that provides a structure for organizing large number of network sites into a set of useful clusters to minimize transactions processing communications. It has been designed to divide the database network sites into a set of disjoint clusters based on a high performance clustering technique. This can reduce the amount of redundant data to be accessed and transferred among different sites, definitely increase the transaction performance, significantly improve database system response time, and result in better distributed network decision support. Experimental validations on real database applications at different networks connectivity are performed and the results demonstrate that the proposed method leads to precise solutions for the problems of data communication, allocation, and redundancy.","PeriodicalId":273177,"journal":{"name":"2010 Second International Conference on Advances in Databases, Knowledge, and Data Applications","volume":"52 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-04-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133886197","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Towards Social Network Extraction Using a Graph Database 基于图数据库的社会网络提取研究
Rania Soussi, Marie-Aude Aufaure, Hajer Baazaoui Zghal
In the enterprise context, an important amount of information is stored in relational databases. Therefore, relational database can be a rich source to extract social network. Moreover, it is not very suitable to present and store a social network. On the other hand, a graph database canmodel data in natural way and facilitates the query of data using graph operations. In this way, we propose a social network extraction approach from relational, and present mechanisms for transforming relational database into graph databases.
在企业上下文中,大量重要信息存储在关系数据库中。因此,关系数据库可以作为提取社会网络的丰富资源。此外,它不太适合呈现和存储社交网络。另一方面,图数据库可以以自然的方式对数据进行建模,并便于使用图操作对数据进行查询。通过这种方式,我们提出了一种从关系数据库中提取社会网络的方法,并提出了将关系数据库转换为图数据库的机制。
{"title":"Towards Social Network Extraction Using a Graph Database","authors":"Rania Soussi, Marie-Aude Aufaure, Hajer Baazaoui Zghal","doi":"10.1109/DBKDA.2010.19","DOIUrl":"https://doi.org/10.1109/DBKDA.2010.19","url":null,"abstract":"In the enterprise context, an important amount of information is stored in relational databases. Therefore, relational database can be a rich source to extract social network. Moreover, it is not very suitable to present and store a social network. On the other hand, a graph database canmodel data in natural way and facilitates the query of data using graph operations. In this way, we propose a social network extraction approach from relational, and present mechanisms for transforming relational database into graph databases.","PeriodicalId":273177,"journal":{"name":"2010 Second International Conference on Advances in Databases, Knowledge, and Data Applications","volume":"45 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-04-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132345580","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 20
期刊
2010 Second International Conference on Advances in Databases, Knowledge, and Data Applications
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1