首页 > 最新文献

2016 Eleventh International Conference on Digital Information Management (ICDIM)最新文献

英文 中文
SJSON: A succinct representation for JavaScript object notation documents json: JavaScript对象表示法文档的简洁表示
Pub Date : 2016-09-01 DOI: 10.1109/ICDIM.2016.7829787
Edman Anjos, Junhee Lee, S. Rao
The massive amounts of data processed in modern computational systems is becoming a problem of increasing importance. This data is commonly stored directly or indirectly through the use of data exchange languages, such as JavaScript Object Notation (JSON), for human-readable platform agnostic access. This paper focuses on describing and analyzing SJSON, a library that explores succinct representations of JSON documents as a means to achieve reduced memory usage of files in main memory, and to permit the compression of JSON files stored in disk. In SJSON we represent the document structure with succinct trees, as opposed to the usual pointer-based implementation. Furthermore, the remaining raw data are organized in arrays of attributes and values. Attributes are stripped of redundancies and stored in a simple contiguous array, while values are represented through a bit string indexed array. The scheme here proposed is then evaluated with respect to a number of metrics comparing its performance with popular libraries, anssd possible improvements to the representation are then presented.
在现代计算系统中处理的大量数据正成为一个日益重要的问题。这些数据通常通过使用数据交换语言(如JavaScript Object Notation (JSON))直接或间接地存储,以实现与人类可读的平台无关的访问。本文的重点是描述和分析JSON,这是一个探索JSON文档的简洁表示的库,它可以减少主存中文件的内存使用,并允许压缩存储在磁盘中的JSON文件。在json中,我们用简洁的树表示文档结构,而不是通常的基于指针的实现。此外,剩余的原始数据被组织在属性和值数组中。属性被剥离冗余并存储在一个简单的连续数组中,而值则通过位字符串索引数组表示。这里提出的方案,然后就一些指标进行评估,将其性能与流行的库进行比较,然后提出可能的改进表示。
{"title":"SJSON: A succinct representation for JavaScript object notation documents","authors":"Edman Anjos, Junhee Lee, S. Rao","doi":"10.1109/ICDIM.2016.7829787","DOIUrl":"https://doi.org/10.1109/ICDIM.2016.7829787","url":null,"abstract":"The massive amounts of data processed in modern computational systems is becoming a problem of increasing importance. This data is commonly stored directly or indirectly through the use of data exchange languages, such as JavaScript Object Notation (JSON), for human-readable platform agnostic access. This paper focuses on describing and analyzing SJSON, a library that explores succinct representations of JSON documents as a means to achieve reduced memory usage of files in main memory, and to permit the compression of JSON files stored in disk. In SJSON we represent the document structure with succinct trees, as opposed to the usual pointer-based implementation. Furthermore, the remaining raw data are organized in arrays of attributes and values. Attributes are stripped of redundancies and stored in a simple contiguous array, while values are represented through a bit string indexed array. The scheme here proposed is then evaluated with respect to a number of metrics comparing its performance with popular libraries, anssd possible improvements to the representation are then presented.","PeriodicalId":146662,"journal":{"name":"2016 Eleventh International Conference on Digital Information Management (ICDIM)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131252815","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Information security and digital forensics in the world of cyber physical systems 网络物理系统世界中的信息安全和数字取证
Pub Date : 2016-09-01 DOI: 10.1109/ICDIM.2016.7829795
Andrew Jones, S. Vidalis, N. Abouzakhar
The security of Cyber Physical Systems and any digital forensic investigations into them will be highly dependent on data that is stored and processed in the Cloud. This paper looks at a number of the issues that will need to be addressed if this environment is to be trusted to securely hold both system critical and personal information and to enable investigations into incidents to be undertaken.
网络物理系统的安全性以及对其进行的任何数字取证调查将高度依赖于存储和处理在云中的数据。如果要信任该环境以安全地保存系统关键信息和个人信息,并能够对事件进行调查,本文将研究需要解决的一些问题。
{"title":"Information security and digital forensics in the world of cyber physical systems","authors":"Andrew Jones, S. Vidalis, N. Abouzakhar","doi":"10.1109/ICDIM.2016.7829795","DOIUrl":"https://doi.org/10.1109/ICDIM.2016.7829795","url":null,"abstract":"The security of Cyber Physical Systems and any digital forensic investigations into them will be highly dependent on data that is stored and processed in the Cloud. This paper looks at a number of the issues that will need to be addressed if this environment is to be trusted to securely hold both system critical and personal information and to enable investigations into incidents to be undertaken.","PeriodicalId":146662,"journal":{"name":"2016 Eleventh International Conference on Digital Information Management (ICDIM)","volume":"23 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133522269","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
Exploiting response patterns for identifying topical experts in StackOverflow 利用响应模式来识别StackOverflow中的专题专家
Pub Date : 2016-09-01 DOI: 10.1109/ICDIM.2016.7829790
M. Bhanu, Joydeep Chandra
The popularity of community question answer (CQA) forums like Stack Overflow, Yahoo Answers and Quora is increasing tremendously with thousands of questions being posted each day and about thrice the number of responses being provided. With such query explosion, users participating in these forums receive a huge number of postings that adversely affects their responsiveness and also the quality of the responses. Hence, identifying topical experts is necessary to improve the efficacy of these systems in terms of both response time and quality. Although expert detection in CQA forums has traditionally been a topic of wide interest, however, many of the proposed techniques use features set that reflect the popularity of the responses of the responder rather than the difficulty level of the questions being responded. In this paper we provide measures of labeling difficult questions and use the number of difficult questions responded by a user combined with other user interaction parameters in identifying potential topical experts. Using a random forest classifier with the proposed feature set on Stack Overflow data, we obtain an improvement in accuracy of 5–16% over existing techniques, in detecting topical experts.
像Stack Overflow, Yahoo Answers和Quora这样的社区问答(CQA)论坛的受欢迎程度正在急剧增加,每天有数千个问题被发布,并且提供了大约三倍的回答。随着查询的爆炸式增长,参与这些论坛的用户会收到大量的帖子,这对他们的响应能力和回复质量产生了不利影响。因此,确定专题专家对于提高这些系统在响应时间和质量方面的有效性是必要的。尽管CQA论坛中的专家检测传统上是一个广受关注的话题,但是,许多提议的技术使用的特征集反映了应答者回答的受欢迎程度,而不是被应答问题的难度级别。在本文中,我们提供了标记难题的措施,并使用用户回答的难题数量与其他用户交互参数相结合来识别潜在的专题专家。在Stack Overflow数据上使用具有所提出特征集的随机森林分类器,在检测主题专家方面,我们比现有技术的准确率提高了5-16%。
{"title":"Exploiting response patterns for identifying topical experts in StackOverflow","authors":"M. Bhanu, Joydeep Chandra","doi":"10.1109/ICDIM.2016.7829790","DOIUrl":"https://doi.org/10.1109/ICDIM.2016.7829790","url":null,"abstract":"The popularity of community question answer (CQA) forums like Stack Overflow, Yahoo Answers and Quora is increasing tremendously with thousands of questions being posted each day and about thrice the number of responses being provided. With such query explosion, users participating in these forums receive a huge number of postings that adversely affects their responsiveness and also the quality of the responses. Hence, identifying topical experts is necessary to improve the efficacy of these systems in terms of both response time and quality. Although expert detection in CQA forums has traditionally been a topic of wide interest, however, many of the proposed techniques use features set that reflect the popularity of the responses of the responder rather than the difficulty level of the questions being responded. In this paper we provide measures of labeling difficult questions and use the number of difficult questions responded by a user combined with other user interaction parameters in identifying potential topical experts. Using a random forest classifier with the proposed feature set on Stack Overflow data, we obtain an improvement in accuracy of 5–16% over existing techniques, in detecting topical experts.","PeriodicalId":146662,"journal":{"name":"2016 Eleventh International Conference on Digital Information Management (ICDIM)","volume":"10 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131421547","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Processing of multimedia aplications and their use in foreign language teaching 多媒体应用程序的处理及其在外语教学中的应用
Pub Date : 2016-09-01 DOI: 10.1109/ICDIM.2016.7829793
S. Hubackova
The paper gives some basic view on use of multimedia when teaching foreign language. The concept of multimedia in connection with education process involves a few technical didactical instruments, which during the school supplies development appeared as a part of educational process. In classical education the most used sense in absorbing information is hearing. With the use of multimedia is the most used sense vision. Visual perception enables faster achieving not only partial learning success, but as well as reducing the overall time of learning process.
本文对多媒体在外语教学中的应用提出了一些基本看法。与教育过程相关的多媒体概念涉及到一些技术教学工具,这些工具在学校用品发展过程中作为教育过程的一部分而出现。在古典教育中,吸收信息最常用的感官是听觉。随着多媒体的使用,使用最多的是感官视觉。视觉感知不仅能够更快地实现部分学习成功,而且还可以缩短学习过程的总体时间。
{"title":"Processing of multimedia aplications and their use in foreign language teaching","authors":"S. Hubackova","doi":"10.1109/ICDIM.2016.7829793","DOIUrl":"https://doi.org/10.1109/ICDIM.2016.7829793","url":null,"abstract":"The paper gives some basic view on use of multimedia when teaching foreign language. The concept of multimedia in connection with education process involves a few technical didactical instruments, which during the school supplies development appeared as a part of educational process. In classical education the most used sense in absorbing information is hearing. With the use of multimedia is the most used sense vision. Visual perception enables faster achieving not only partial learning success, but as well as reducing the overall time of learning process.","PeriodicalId":146662,"journal":{"name":"2016 Eleventh International Conference on Digital Information Management (ICDIM)","volume":"25 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116749020","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Towards automatic generation of relevance judgments for a test collection 为测试集合自动生成相关判断
Pub Date : 2016-09-01 DOI: 10.1109/ICDIM.2016.7829763
Mireille Makary, M. Oakes, Fadi Yamout
This paper represents a new technique for building a relevance judgment list for information retrieval test collections without any human intervention. It is based on the number of occurrences of the documents in runs retrieved from several information retrieval systems and a distance based measure between the documents. The effectiveness of the technique is evaluated by computing the correlation between the ranking of the TREC systems using the original relevance judgment list (qrels) built by human assessors and the ranking obtained by using the newly generated qrels.
提出了一种无需人工干预的信息检索测试集相关性判断表的构建方法。它基于从几个信息检索系统检索的运行中文档的出现次数和基于文档之间距离的度量。通过计算由人工评估者建立的原始关联判断表与新生成的关联判断表对TREC系统的排序之间的相关性来评估该技术的有效性。
{"title":"Towards automatic generation of relevance judgments for a test collection","authors":"Mireille Makary, M. Oakes, Fadi Yamout","doi":"10.1109/ICDIM.2016.7829763","DOIUrl":"https://doi.org/10.1109/ICDIM.2016.7829763","url":null,"abstract":"This paper represents a new technique for building a relevance judgment list for information retrieval test collections without any human intervention. It is based on the number of occurrences of the documents in runs retrieved from several information retrieval systems and a distance based measure between the documents. The effectiveness of the technique is evaluated by computing the correlation between the ranking of the TREC systems using the original relevance judgment list (qrels) built by human assessors and the ranking obtained by using the newly generated qrels.","PeriodicalId":146662,"journal":{"name":"2016 Eleventh International Conference on Digital Information Management (ICDIM)","volume":"190 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117344338","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The evolution of C2C social commerce models C2C社交商务模式的演变
Pub Date : 2016-09-01 DOI: 10.1109/ICDIM.2016.7829761
Supattana Sukrat, P. Mahatanankoon, B. Papasratorn
While large retailers utilize social commerce to increase consumer trust and online sales, a variety of online transactions take place among social networking users. These new forms of e-business models often conceptualized as C2C s-commerce have become one of the most popular methods for consumer online trading. The article proposes four phases of C2C s-commerce (i.e., ad-hoc, empowered, organized, integrated) and examines how consumers can leverage the features and functionalities of social networking websites to their benefit. These four phases reveal the evolution of C2C s-commerce business models with future implications.
当大型零售商利用社交商务来增加消费者信任和在线销售时,各种各样的在线交易发生在社交网络用户之间。这些新形式的电子商务模式通常被概念化为C2C s-commerce,已经成为消费者在线交易最流行的方法之一。本文提出了C2C s-commerce的四个阶段(即,特设,授权,组织,集成),并研究了消费者如何利用社交网站的特性和功能来造福他们。这四个阶段揭示了C2C s-commerce商业模式的演变及其对未来的影响。
{"title":"The evolution of C2C social commerce models","authors":"Supattana Sukrat, P. Mahatanankoon, B. Papasratorn","doi":"10.1109/ICDIM.2016.7829761","DOIUrl":"https://doi.org/10.1109/ICDIM.2016.7829761","url":null,"abstract":"While large retailers utilize social commerce to increase consumer trust and online sales, a variety of online transactions take place among social networking users. These new forms of e-business models often conceptualized as C2C s-commerce have become one of the most popular methods for consumer online trading. The article proposes four phases of C2C s-commerce (i.e., ad-hoc, empowered, organized, integrated) and examines how consumers can leverage the features and functionalities of social networking websites to their benefit. These four phases reveal the evolution of C2C s-commerce business models with future implications.","PeriodicalId":146662,"journal":{"name":"2016 Eleventh International Conference on Digital Information Management (ICDIM)","volume":"9 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127032692","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
A data mining application on moving object data 移动对象数据的数据挖掘应用
Pub Date : 2016-09-01 DOI: 10.1109/ICDIM.2016.7829757
Yasen Yakufu, C. Atay
With the wide availability of GPS devices in our lives, massive amounts of object movement data have been collected from various moving object targets, such as mobile devices, animals, and vehicles. In the last decade, Moving Object Databases (MOD) have attracted many researchers. Analyzing such data has deep implications in many areas, such as ecological study and traffic control. In this study, we focus on moving object data (moving points) analysis and retrieve valuable information for knowledge discovery. In this research, a moving object data model is implemented in the object-relational database system, additionally some special queries and data mining techniques are performed. Retrieving information directly from unorganized spatial-temporal data is almost impossible. However, not only a vast amount of spatial-temporal data sets organized into MOD data model but also the discovery of valuable knowledge from spatial-temporal data to help decision support processes is possible now owing to this research implementation.
随着GPS设备在我们生活中的广泛应用,大量的物体运动数据已经从各种移动物体目标(如移动设备、动物和车辆)中收集到。在过去的十年中,移动对象数据库(MOD)吸引了许多研究者。分析这些数据在生态研究和交通控制等许多领域具有深远的意义。在本研究中,我们专注于对运动物体数据(运动点)的分析,并检索有价值的信息进行知识发现。本研究在对象关系数据库系统中实现了一个移动对象数据模型,并实现了一些特殊的查询和数据挖掘技术。直接从无组织的时空数据中检索信息几乎是不可能的。然而,由于这项研究的实施,现在不仅可以将大量的时空数据集组织到MOD数据模型中,而且可以从时空数据中发现有价值的知识来帮助决策支持过程。
{"title":"A data mining application on moving object data","authors":"Yasen Yakufu, C. Atay","doi":"10.1109/ICDIM.2016.7829757","DOIUrl":"https://doi.org/10.1109/ICDIM.2016.7829757","url":null,"abstract":"With the wide availability of GPS devices in our lives, massive amounts of object movement data have been collected from various moving object targets, such as mobile devices, animals, and vehicles. In the last decade, Moving Object Databases (MOD) have attracted many researchers. Analyzing such data has deep implications in many areas, such as ecological study and traffic control. In this study, we focus on moving object data (moving points) analysis and retrieve valuable information for knowledge discovery. In this research, a moving object data model is implemented in the object-relational database system, additionally some special queries and data mining techniques are performed. Retrieving information directly from unorganized spatial-temporal data is almost impossible. However, not only a vast amount of spatial-temporal data sets organized into MOD data model but also the discovery of valuable knowledge from spatial-temporal data to help decision support processes is possible now owing to this research implementation.","PeriodicalId":146662,"journal":{"name":"2016 Eleventh International Conference on Digital Information Management (ICDIM)","volume":"36 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123758365","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
A federation model to support semantic SPARQL queries for enterprise data governance 支持用于企业数据治理的语义SPARQL查询的联合模型
Pub Date : 2016-09-01 DOI: 10.1109/ICDIM.2016.7829778
Antonio Corradi, L. Foschini, Alessandro Zanni, Mirco Casoni, S. Monti, Francesco Sprotetto
Data Governance and Federation in large complex organizations poses non-trivial challenges due to the integration of heterogeneous, distributed data sources. Semantic Web, and its de-facto standard query language — SPARQL — have proven to be key in defining and searching semantic over any sort of content on the Web, thus easily letting content clients discover hidden relationships among disparate data. However, current SPARQL support of Data Federation is fairly limited, making it impractical for real-world scenarios. Our work proposes an open and autonomous platform for Data Federation that overcome traditional SPARQL limitations and opens up unprecedented opportunities for Data Governance in large organizations.
由于异构、分布式数据源的集成,大型复杂组织中的数据治理和联合带来了不小的挑战。语义Web及其事实上的标准查询语言SPARQL已被证明是定义和搜索Web上任何类型内容的语义的关键,因此可以轻松地让内容客户端发现不同数据之间的隐藏关系。然而,当前SPARQL对Data Federation的支持相当有限,这使得它在实际场景中不切实际。我们的工作为数据联合提出了一个开放和自治的平台,它克服了传统SPARQL的限制,并为大型组织中的数据治理开辟了前所未有的机会。
{"title":"A federation model to support semantic SPARQL queries for enterprise data governance","authors":"Antonio Corradi, L. Foschini, Alessandro Zanni, Mirco Casoni, S. Monti, Francesco Sprotetto","doi":"10.1109/ICDIM.2016.7829778","DOIUrl":"https://doi.org/10.1109/ICDIM.2016.7829778","url":null,"abstract":"Data Governance and Federation in large complex organizations poses non-trivial challenges due to the integration of heterogeneous, distributed data sources. Semantic Web, and its de-facto standard query language — SPARQL — have proven to be key in defining and searching semantic over any sort of content on the Web, thus easily letting content clients discover hidden relationships among disparate data. However, current SPARQL support of Data Federation is fairly limited, making it impractical for real-world scenarios. Our work proposes an open and autonomous platform for Data Federation that overcome traditional SPARQL limitations and opens up unprecedented opportunities for Data Governance in large organizations.","PeriodicalId":146662,"journal":{"name":"2016 Eleventh International Conference on Digital Information Management (ICDIM)","volume":"77 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131074535","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Grafeno: Semantic graph extraction and operation Grafeno:语义图的提取和操作
Pub Date : 2016-09-01 DOI: 10.1109/ICDIM.2016.7829770
Antonio F. G. Sevilla, Alberto Fernández-Isabel, Alberto Díaz
Grafeno is a Natural Language Processing library for doing semantics. It represents semantic information with a graph structure, and is able to automatically extract this representation from the dependency analysis of a text. It aims to encompass the different possible approaches to doing graph semantics by being as modular and flexible as possible. It also provides functionality for operating on the graph and performing different experiments. In this article, we explain its design and use, and show its potential with two use cases.
Grafeno是一个用于语义处理的自然语言处理库。它用图形结构表示语义信息,并能够从文本的依赖分析中自动提取这种表示。它的目标是通过尽可能模块化和灵活的方式来包含实现图形语义的不同可能方法。它还提供了对图形进行操作和执行不同实验的功能。在本文中,我们将解释其设计和使用,并通过两个用例展示其潜力。
{"title":"Grafeno: Semantic graph extraction and operation","authors":"Antonio F. G. Sevilla, Alberto Fernández-Isabel, Alberto Díaz","doi":"10.1109/ICDIM.2016.7829770","DOIUrl":"https://doi.org/10.1109/ICDIM.2016.7829770","url":null,"abstract":"Grafeno is a Natural Language Processing library for doing semantics. It represents semantic information with a graph structure, and is able to automatically extract this representation from the dependency analysis of a text. It aims to encompass the different possible approaches to doing graph semantics by being as modular and flexible as possible. It also provides functionality for operating on the graph and performing different experiments. In this article, we explain its design and use, and show its potential with two use cases.","PeriodicalId":146662,"journal":{"name":"2016 Eleventh International Conference on Digital Information Management (ICDIM)","volume":"13 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127935721","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
A “Fast Data” architecture: Dashboard for anomalous traffic analysis in data networks “快速数据”架构:数据网络中异常流量分析的仪表板
Pub Date : 2016-09-01 DOI: 10.1109/ICDIM.2016.7829756
Miguel Angel López Peña, C. Rua, Sergio Segovia Lozoya
Fast Data is a new Big Data computing paradigm that ensures requirements such as Real-Time processing of continuous data stream, storage at high rates and low latency with no data losses. In this work we propose a “Fast Data” architecture for a specific kind of software application in which input data arrive very fast and the results for each processed data have to match such input rates. We applied this architecture to build a Dashboard for Anomalous Traffic Analysis in Data Networks. In order to fulfill the requirements of Real-Time processing and no data losses, we carry out a design that consists of a pattern of dynamic tree of process pipelines, where the number of branches increases proportionally to the input data rate. Two different approaches have been followed to implement this design pattern: one based in a well-known set of products from the Big Data ecosystem; and the other built with Kafka, Zookeeper and a set of components designed and implemented by us. These two implementations have been compared in terms of velocity and scalability performance. As a result, the implementation built with our own components is significantly faster and scalable than the traditional one. The good results obtained by using both the design pattern of dynamic tree of process pipelines and our implementation make them very suitable for its use in other scenarios and applications such as smart cities, environment monitoring, industry 4.0, distributed control systems, etc.
Fast Data是一种新的大数据计算范式,可确保对连续数据流的实时处理、高速率存储和低延迟、无数据丢失等要求。在这项工作中,我们为一种特定类型的软件应用程序提出了一种“快速数据”架构,其中输入数据到达非常快,并且每个处理数据的结果必须匹配这样的输入速率。我们应用这个架构来构建一个仪表板,用于数据网络中的异常流量分析。为了满足实时处理和无数据丢失的要求,我们进行了一种由流程管道动态树模式组成的设计,其中分支数量与输入数据率成比例增加。实现这种设计模式有两种不同的方法:一种是基于大数据生态系统中一组众所周知的产品;另一个是用Kafka、Zookeeper和我们设计和实现的一组组件构建的。这两种实现在速度和可伸缩性性能方面进行了比较。因此,使用我们自己的组件构建的实现比传统的实现要快得多,而且可扩展。通过使用过程管道动态树的设计模式和我们的实现所获得的良好结果使它们非常适合在其他场景和应用中使用,例如智慧城市,环境监测,工业4.0,分布式控制系统等。
{"title":"A “Fast Data” architecture: Dashboard for anomalous traffic analysis in data networks","authors":"Miguel Angel López Peña, C. Rua, Sergio Segovia Lozoya","doi":"10.1109/ICDIM.2016.7829756","DOIUrl":"https://doi.org/10.1109/ICDIM.2016.7829756","url":null,"abstract":"Fast Data is a new Big Data computing paradigm that ensures requirements such as Real-Time processing of continuous data stream, storage at high rates and low latency with no data losses. In this work we propose a “Fast Data” architecture for a specific kind of software application in which input data arrive very fast and the results for each processed data have to match such input rates. We applied this architecture to build a Dashboard for Anomalous Traffic Analysis in Data Networks. In order to fulfill the requirements of Real-Time processing and no data losses, we carry out a design that consists of a pattern of dynamic tree of process pipelines, where the number of branches increases proportionally to the input data rate. Two different approaches have been followed to implement this design pattern: one based in a well-known set of products from the Big Data ecosystem; and the other built with Kafka, Zookeeper and a set of components designed and implemented by us. These two implementations have been compared in terms of velocity and scalability performance. As a result, the implementation built with our own components is significantly faster and scalable than the traditional one. The good results obtained by using both the design pattern of dynamic tree of process pipelines and our implementation make them very suitable for its use in other scenarios and applications such as smart cities, environment monitoring, industry 4.0, distributed control systems, etc.","PeriodicalId":146662,"journal":{"name":"2016 Eleventh International Conference on Digital Information Management (ICDIM)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130849725","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
期刊
2016 Eleventh International Conference on Digital Information Management (ICDIM)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1