首页 > 最新文献

2016 Eleventh International Conference on Digital Information Management (ICDIM)最新文献

英文 中文
A critical review of density-based data stream clustering techniques 基于密度的数据流聚类技术综述
Pub Date : 2016-09-01 DOI: 10.1109/ICDIM.2016.7829786
Affan Ahmad Toor, M. Usman, W. Ahmed
Data stream is relatively new and emerging domain in the current era of Internet advancement. Clustering data streams is equally important and difficult because of the numerous hurdles attached to it. A number of algorithms have been proposed to offer solutions for efficient clustering. Grid-based clustering approach was adopted few years ago to overcome the limitations of conventional partition-based algorithms for data stream clustering. Data points are mapped to the grid-cells to form micro-clusters which later are used for clustering. Using density in the clustering process is proved to be a remarkable success and in recent years many researchers have used density to find arbitrary shaped & density clusters and identify outliers. Concept of density-based clustering is to use grid-based clustering at core and create a distinction between dense and sparse grids using density threshold values and use dense grids to yield clustering results; which provide more cluster purity and accuracy. In this paper, we reviewed grid-based data stream clustering algorithms which utilize density. We evaluated their functionalities and identified their limitations. In the end, we critically evaluated different aspects of algorithms and suggested one of these algorithms which is better in terms of performance and accuracy.
数据流是当今互联网发展的新领域。聚类数据流同样重要和困难,因为它附带了许多障碍。已经提出了许多算法来提供有效聚类的解决方案。为了克服传统的基于分区的数据流聚类算法的局限性,几年前采用了基于网格的聚类方法。数据点被映射到网格单元,形成微集群,然后用于聚类。在聚类过程中使用密度被证明是一个显著的成功,近年来许多研究人员使用密度来发现任意形状和密度的聚类和识别异常值。基于密度的聚类概念是以基于网格的聚类为核心,利用密度阈值区分密集网格和稀疏网格,并利用密集网格产生聚类结果;这提供了更高的聚类纯度和准确性。本文综述了基于网格的数据流聚类算法。我们评估了它们的功能并确定了它们的局限性。最后,我们批判性地评估了算法的不同方面,并提出了其中一种在性能和准确性方面更好的算法。
{"title":"A critical review of density-based data stream clustering techniques","authors":"Affan Ahmad Toor, M. Usman, W. Ahmed","doi":"10.1109/ICDIM.2016.7829786","DOIUrl":"https://doi.org/10.1109/ICDIM.2016.7829786","url":null,"abstract":"Data stream is relatively new and emerging domain in the current era of Internet advancement. Clustering data streams is equally important and difficult because of the numerous hurdles attached to it. A number of algorithms have been proposed to offer solutions for efficient clustering. Grid-based clustering approach was adopted few years ago to overcome the limitations of conventional partition-based algorithms for data stream clustering. Data points are mapped to the grid-cells to form micro-clusters which later are used for clustering. Using density in the clustering process is proved to be a remarkable success and in recent years many researchers have used density to find arbitrary shaped & density clusters and identify outliers. Concept of density-based clustering is to use grid-based clustering at core and create a distinction between dense and sparse grids using density threshold values and use dense grids to yield clustering results; which provide more cluster purity and accuracy. In this paper, we reviewed grid-based data stream clustering algorithms which utilize density. We evaluated their functionalities and identified their limitations. In the end, we critically evaluated different aspects of algorithms and suggested one of these algorithms which is better in terms of performance and accuracy.","PeriodicalId":146662,"journal":{"name":"2016 Eleventh International Conference on Digital Information Management (ICDIM)","volume":"70 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123973250","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Prominent voices and prevalent discourses: A corporate social responsibility application 突出的声音和流行的话语:企业社会责任的应用
Pub Date : 2016-09-01 DOI: 10.1109/ICDIM.2016.7829780
Carlos M. Parra, M. Tremblay, A. Castellanos
In this study we develop a simplified technique for identifying prominent voices (and characterizing prevalent discourses) using Text Data Mining around Corporate Social Responsibility (CSR) issues or topics. We do this by analyzing a corpus of CSR reports produced by 7 US firms (Citi, Coca-Cola, Exxon-Mobil, General Motors, Intel, McDonald's and Microsoft) in 2004, 2008 and 2012, and focusing on a reduced set of vectors — or Singular Vector Decompositions (SVDs)-derived from these CSR reports while exploring term associations (Text Topics or Term Clusters). Specifically, we use centroid clustering on these SVDs to identify centroid-guiding-CSR-report-components (or firms with prominent voices and prevalent discourses around a CSR topic). The analysis is performed by year in order to discern the way in which prominent voices and prevalent discourses (around CSR topics) have evolved through time. Results indicate that it is difficult for firms to maintain a prominent voice around CSR issues through time, and that when they manage to do so it is because the prevalent discourse has direct business implications.
在这项研究中,我们开发了一种简化的技术,用于围绕企业社会责任(CSR)问题或主题使用文本数据挖掘来识别突出的声音(并描述流行的话语)。我们通过分析7家美国公司(花旗、可口可乐、埃克森美孚、通用汽车、英特尔、麦当劳和微软)在2004年、2008年和2012年制作的社会责任报告语料库来做到这一点,并在探索术语关联(文本主题或术语聚类)的同时,专注于从这些社会责任报告中提取的简化向量集-或奇异向量分解(SVDs)。具体来说,我们在这些svd上使用质心聚类来识别质心导向的CSR-报告成分(或围绕CSR主题有突出声音和流行话语的公司)。该分析是按年进行的,目的是辨别(围绕企业社会责任主题)的突出声音和流行话语是如何随着时间的推移而演变的。结果表明,随着时间的推移,企业很难在企业社会责任问题上保持突出的声音,而当他们设法做到这一点时,这是因为流行的话语具有直接的商业含义。
{"title":"Prominent voices and prevalent discourses: A corporate social responsibility application","authors":"Carlos M. Parra, M. Tremblay, A. Castellanos","doi":"10.1109/ICDIM.2016.7829780","DOIUrl":"https://doi.org/10.1109/ICDIM.2016.7829780","url":null,"abstract":"In this study we develop a simplified technique for identifying prominent voices (and characterizing prevalent discourses) using Text Data Mining around Corporate Social Responsibility (CSR) issues or topics. We do this by analyzing a corpus of CSR reports produced by 7 US firms (Citi, Coca-Cola, Exxon-Mobil, General Motors, Intel, McDonald's and Microsoft) in 2004, 2008 and 2012, and focusing on a reduced set of vectors — or Singular Vector Decompositions (SVDs)-derived from these CSR reports while exploring term associations (Text Topics or Term Clusters). Specifically, we use centroid clustering on these SVDs to identify centroid-guiding-CSR-report-components (or firms with prominent voices and prevalent discourses around a CSR topic). The analysis is performed by year in order to discern the way in which prominent voices and prevalent discourses (around CSR topics) have evolved through time. Results indicate that it is difficult for firms to maintain a prominent voice around CSR issues through time, and that when they manage to do so it is because the prevalent discourse has direct business implications.","PeriodicalId":146662,"journal":{"name":"2016 Eleventh International Conference on Digital Information Management (ICDIM)","volume":"20 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122032634","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Systematic mapping for big data stream processing frameworks 大数据流处理框架的系统映射
Pub Date : 2016-09-01 DOI: 10.1109/ICDIM.2016.7829760
Mohammed Alayyoub, A. Yazici, Z. Karakaya
There has been lots of discussions about the choice of a stream processing framework (SPF) for Big Data. Each of the SPFs has different cutting edge technologies in their steps of processing the data in motion that gives them a better advantage over the others. Even though, the cutting edge technologies used in each stream processing framework might better them, it is still hard to say which framework bests the rest under different scenarios and conditions. In this study, we aim to show trends and differences about several SPFs for Big Data by using the Systematic Mapping (SM) approach. To achieve our objectives, we raise 6 research questions (RQs), in which 91 studies that conducted between 2010 and 2015 were evaluated. We present the trends by classifying the research on SPFs with respect to the proposed RQs which can help researchers to obtain an overview of the field.
关于为大数据选择一个流处理框架(SPF)已经有了很多讨论。每个spf在处理动态数据的步骤中都有不同的尖端技术,这使它们比其他spf具有更好的优势。尽管每个流处理框架中使用的尖端技术可能比它们更好,但在不同的场景和条件下,仍然很难说哪个框架比其他框架更好。在本研究中,我们的目标是通过使用系统映射(SM)方法来显示大数据的几个spf的趋势和差异。为了实现我们的目标,我们提出了6个研究问题(RQs),其中评估了2010年至2015年间进行的91项研究。我们通过对SPFs研究的分类来呈现趋势,这可以帮助研究人员获得该领域的概述。
{"title":"Systematic mapping for big data stream processing frameworks","authors":"Mohammed Alayyoub, A. Yazici, Z. Karakaya","doi":"10.1109/ICDIM.2016.7829760","DOIUrl":"https://doi.org/10.1109/ICDIM.2016.7829760","url":null,"abstract":"There has been lots of discussions about the choice of a stream processing framework (SPF) for Big Data. Each of the SPFs has different cutting edge technologies in their steps of processing the data in motion that gives them a better advantage over the others. Even though, the cutting edge technologies used in each stream processing framework might better them, it is still hard to say which framework bests the rest under different scenarios and conditions. In this study, we aim to show trends and differences about several SPFs for Big Data by using the Systematic Mapping (SM) approach. To achieve our objectives, we raise 6 research questions (RQs), in which 91 studies that conducted between 2010 and 2015 were evaluated. We present the trends by classifying the research on SPFs with respect to the proposed RQs which can help researchers to obtain an overview of the field.","PeriodicalId":146662,"journal":{"name":"2016 Eleventh International Conference on Digital Information Management (ICDIM)","volume":"4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122615643","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Defining requirements for color-coding text software in teaching of Arabic 确定阿拉伯语教学中彩色编码文本软件的要求
Pub Date : 2016-09-01 DOI: 10.1109/ICDIM.2016.7829759
Hend Suliman Al-Khalifa, Muna A. Muhaureq
Founding proper reading and comprehension abilities of the Arabic written text is of great significance for learners of the language since this is a means for extracting the linguistic and cultural knowledge. This process is complex in Arabic since the script is interwoven and multiple segments can be fused to create a single word which in return complicates identifying word units for new learners and accordingly delays proper acquisition. Proper acquisition is defined here as the ability to fluently read the text as well as manage to decode word parts formed by the agglutination of affixes. This paper introduces the requirements for software that simplifies instruction on word decoding and comprehension through utilizing color-coding on Arabic text.
建立正确的阿拉伯语书面文本阅读和理解能力对于阿拉伯语学习者来说意义重大,因为这是提取语言和文化知识的手段。这个过程在阿拉伯语中是复杂的,因为脚本是交织在一起的,多个片段可以融合成一个单词,这反过来又使新学习者识别单词单位变得复杂,从而延迟了正确的学习。适当的习得在这里被定义为能够流利地阅读文本,并设法解码由词缀凝集形成的单词部分。本文介绍了对阿拉伯文本进行颜色编码,简化单词解码和理解教学的软件要求。
{"title":"Defining requirements for color-coding text software in teaching of Arabic","authors":"Hend Suliman Al-Khalifa, Muna A. Muhaureq","doi":"10.1109/ICDIM.2016.7829759","DOIUrl":"https://doi.org/10.1109/ICDIM.2016.7829759","url":null,"abstract":"Founding proper reading and comprehension abilities of the Arabic written text is of great significance for learners of the language since this is a means for extracting the linguistic and cultural knowledge. This process is complex in Arabic since the script is interwoven and multiple segments can be fused to create a single word which in return complicates identifying word units for new learners and accordingly delays proper acquisition. Proper acquisition is defined here as the ability to fluently read the text as well as manage to decode word parts formed by the agglutination of affixes. This paper introduces the requirements for software that simplifies instruction on word decoding and comprehension through utilizing color-coding on Arabic text.","PeriodicalId":146662,"journal":{"name":"2016 Eleventh International Conference on Digital Information Management (ICDIM)","volume":"403 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127593860","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
On the structural properties of eBay's network 论eBay网络的结构特性
Pub Date : 2016-09-01 DOI: 10.1109/ICDIM.2016.7829771
C. M. França, Antonio A. Rocha, P. B. Velloso
The OSN's (On-line Social Networks) have reached an incredible popularity in modern Internet. Those systems have been present in the daily lives of countless people helping them to share personal experiences, expectations and opinions. So high popularity has made of such networks complex systems. To understand the operation and phenomena that occur in such networks, there are metrics and models that capture aspects of their structures. The purpose of this work is to understand the complex reality of eBay e-commerce network, their connections and the dynamics of its users. Data were collected using a Web crawler developed in this work, and it resulted in a database of approximately 87 million transactions and 15 million different dealer users. From these data, the characterization was made estimating network metrics, like dealer users' degree distribution, that gave us key insights about the eBay negotiation network. We found that there are users who bought/sold for more than 100.000 different persons. We also found that a user A interacted over 4.000 times with another user B in just 3 months. Those and other interesting results, such as average distance and feedbacks ratings, were obtained, analyzed and discussed in this work.
OSN(在线社交网络)在现代互联网中已经达到了令人难以置信的普及程度。这些系统已经存在于无数人的日常生活中,帮助他们分享个人经历、期望和意见。如此高的普及率使得这种网络成为复杂的系统。为了理解在这种网络中发生的操作和现象,有一些度量和模型可以捕捉其结构的各个方面。这项工作的目的是了解eBay电子商务网络的复杂现实,他们的连接和动态的用户。数据是使用在这项工作中开发的Web爬虫收集的,它产生了一个包含大约8700万笔交易和1500万不同经销商用户的数据库。从这些数据中,我们对网络指标进行了表征,比如经销商用户的程度分布,这给了我们关于eBay谈判网络的关键见解。我们发现,有些用户为超过10万个不同的人买卖股票。我们还发现,用户a在3个月内与另一个用户B进行了超过4000次的互动。这些结果以及其他有趣的结果,如平均距离和反馈评级,在本工作中进行了分析和讨论。
{"title":"On the structural properties of eBay's network","authors":"C. M. França, Antonio A. Rocha, P. B. Velloso","doi":"10.1109/ICDIM.2016.7829771","DOIUrl":"https://doi.org/10.1109/ICDIM.2016.7829771","url":null,"abstract":"The OSN's (On-line Social Networks) have reached an incredible popularity in modern Internet. Those systems have been present in the daily lives of countless people helping them to share personal experiences, expectations and opinions. So high popularity has made of such networks complex systems. To understand the operation and phenomena that occur in such networks, there are metrics and models that capture aspects of their structures. The purpose of this work is to understand the complex reality of eBay e-commerce network, their connections and the dynamics of its users. Data were collected using a Web crawler developed in this work, and it resulted in a database of approximately 87 million transactions and 15 million different dealer users. From these data, the characterization was made estimating network metrics, like dealer users' degree distribution, that gave us key insights about the eBay negotiation network. We found that there are users who bought/sold for more than 100.000 different persons. We also found that a user A interacted over 4.000 times with another user B in just 3 months. Those and other interesting results, such as average distance and feedbacks ratings, were obtained, analyzed and discussed in this work.","PeriodicalId":146662,"journal":{"name":"2016 Eleventh International Conference on Digital Information Management (ICDIM)","volume":"21 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127651924","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A survey revealing path towards service life cycle management in COBIT 5 揭示COBIT 5中服务生命周期管理路径的调查
Pub Date : 2016-09-01 DOI: 10.1109/ICDIM.2016.7829754
Umara Noor, A. Ghazanfar
Information technology has become an indispensable unit of an enterprise life in current era. Its inception has changed the ways businesses are done today in a competitive environment. In order to get value from significant investments done on complex IT infrastructure, it should be efficiently governed and managed. IT governance and management is a part of overall corporate governance and management and plays a vital role in aligning IT with business strategies. Among the several frameworks proposed for IT governance, COBIT is the most comprehensive and diverse framework providing support for both governance and management at all levels in multiple business domains. COBIT provides a toolset to bridge the gap between control requirements, technical issues and business risks. In this study we provide a survey of a few implementations of COBIT framework in multiple domains. Based on the survey we state our findings and recommend its adoption to comprehend similar issues. Further we identified certain limitations of COBIT framework and addressed the integration of service life cycle management into the original framework. We added seven new processes to the process structure of COBIT 5 along with their high level objectives. Also we added a few control objectives to the existing processes of COBIT framework. The survey provides a clear understanding of each COBIT implementation and the elements of the framework addressed in each implementation. Our study serves as a guide for all COBIT implementers and helps them teach how to deal with different kinds of governance or management matters. Further how the framework can be enhanced to provide service life cycle management.
在当今时代,信息技术已经成为企业生活中不可或缺的组成部分。它的诞生改变了当今在竞争环境中开展业务的方式。为了从对复杂IT基础设施的重大投资中获得价值,应该对其进行有效的治理和管理。IT治理和管理是整个公司治理和管理的一部分,在将IT与业务策略结合起来方面起着至关重要的作用。在为IT治理提出的几个框架中,COBIT是最全面和最多样化的框架,为多个业务领域的所有级别的治理和管理提供支持。COBIT提供了一个工具集来弥合控制需求、技术问题和业务风险之间的鸿沟。在本研究中,我们对COBIT框架在多个领域的一些实现进行了调查。根据这项调查,我们陈述了我们的发现,并建议采用它来理解类似的问题。此外,我们还确定了COBIT框架的某些局限性,并解决了将服务生命周期管理集成到原始框架中的问题。我们向COBIT 5的过程结构中添加了7个新的过程以及它们的高层目标。我们还在COBIT框架的现有过程中添加了一些控制目标。调查提供了对每个COBIT实现和每个实现中处理的框架元素的清晰理解。我们的研究可以作为所有COBIT实现者的指南,并帮助他们学习如何处理不同类型的治理或管理问题。进一步说明如何增强框架以提供服务生命周期管理。
{"title":"A survey revealing path towards service life cycle management in COBIT 5","authors":"Umara Noor, A. Ghazanfar","doi":"10.1109/ICDIM.2016.7829754","DOIUrl":"https://doi.org/10.1109/ICDIM.2016.7829754","url":null,"abstract":"Information technology has become an indispensable unit of an enterprise life in current era. Its inception has changed the ways businesses are done today in a competitive environment. In order to get value from significant investments done on complex IT infrastructure, it should be efficiently governed and managed. IT governance and management is a part of overall corporate governance and management and plays a vital role in aligning IT with business strategies. Among the several frameworks proposed for IT governance, COBIT is the most comprehensive and diverse framework providing support for both governance and management at all levels in multiple business domains. COBIT provides a toolset to bridge the gap between control requirements, technical issues and business risks. In this study we provide a survey of a few implementations of COBIT framework in multiple domains. Based on the survey we state our findings and recommend its adoption to comprehend similar issues. Further we identified certain limitations of COBIT framework and addressed the integration of service life cycle management into the original framework. We added seven new processes to the process structure of COBIT 5 along with their high level objectives. Also we added a few control objectives to the existing processes of COBIT framework. The survey provides a clear understanding of each COBIT implementation and the elements of the framework addressed in each implementation. Our study serves as a guide for all COBIT implementers and helps them teach how to deal with different kinds of governance or management matters. Further how the framework can be enhanced to provide service life cycle management.","PeriodicalId":146662,"journal":{"name":"2016 Eleventh International Conference on Digital Information Management (ICDIM)","volume":"23 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129974203","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
A quality metric for BPEL process under evolution 演进中的BPEL流程的质量度量
Pub Date : 2016-09-01 DOI: 10.1109/ICDIM.2016.7829777
N. Parimala, R. Kohar
In Service-Oriented Architecture (SOA), behaviour of a business process is specified using Business Process Execution Language (BPEL) which is a XML based language. In today's competitive market, enterprises change their business processes frequently. Changes in BPEL process may affect the quality of BPEL process for the consumer. It is desirable to measure and evaluate the BPEL process quality when changes occur. Metrics are vastly used to provide a quantitative measure for the quality. In this paper, BPEL Process Usefulness Metric under Evolution (BUME) is proposed to measure quality of a BPEL process when it evolves. The applicability of the metric is demonstrated using simulated data for different versions of a BPEL process.
在面向服务的体系结构(SOA)中,业务流程的行为是使用业务流程执行语言(BPEL)指定的,BPEL是一种基于XML的语言。在当今竞争激烈的市场中,企业频繁地改变其业务流程。BPEL流程中的更改可能会影响使用者的BPEL流程质量。在发生更改时度量和评估BPEL流程质量是可取的。度量标准被广泛用于提供质量的定量度量。本文提出了BPEL流程演化下的有用性度量(BUME)来度量BPEL流程演化时的质量。使用不同版本BPEL流程的模拟数据演示了该度量的适用性。
{"title":"A quality metric for BPEL process under evolution","authors":"N. Parimala, R. Kohar","doi":"10.1109/ICDIM.2016.7829777","DOIUrl":"https://doi.org/10.1109/ICDIM.2016.7829777","url":null,"abstract":"In Service-Oriented Architecture (SOA), behaviour of a business process is specified using Business Process Execution Language (BPEL) which is a XML based language. In today's competitive market, enterprises change their business processes frequently. Changes in BPEL process may affect the quality of BPEL process for the consumer. It is desirable to measure and evaluate the BPEL process quality when changes occur. Metrics are vastly used to provide a quantitative measure for the quality. In this paper, BPEL Process Usefulness Metric under Evolution (BUME) is proposed to measure quality of a BPEL process when it evolves. The applicability of the metric is demonstrated using simulated data for different versions of a BPEL process.","PeriodicalId":146662,"journal":{"name":"2016 Eleventh International Conference on Digital Information Management (ICDIM)","volume":"183 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116336868","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Normalizing digital news-stories for preservation 将数字新闻故事规范化以保存
Pub Date : 2016-09-01 DOI: 10.1109/ICDIM.2016.7829785
Muzammil Khan, Arif Ur Rahman, M. D. Awan, Syed Mehtab Alam
Preserving news stories may be important because of various reasons like they provide detailed information about events and they may be used for research purposes in the long term. However, the news stories published online are in danger because of reasons like constant change in the technologies used to publish information and the formats for publication. Certain institutions or individuals may be interested in preserving news stories related to a particular event or topic. The stories should be collected from various online newspapers and preserved for the long term. The major issue in the preservation process is that newspapers use different formats for online publication of the stories. The paper presents a tool which is developed to addresses the issue. The tool facilitates users in the extraction of news stories from various online newspapers and migration to a normalized format.
保存新闻报道可能很重要,因为有各种原因,比如它们提供了关于事件的详细信息,它们可能长期用于研究目的。然而,由于发布信息的技术和发布格式的不断变化等原因,在线发布的新闻故事处于危险之中。某些机构或个人可能对保存与特定事件或主题相关的新闻故事感兴趣。这些故事应该从各种在线报纸上收集并长期保存。保存过程中的主要问题是报纸使用不同的格式来在线发布这些故事。本文提出了一种解决这一问题的工具。该工具方便用户从各种在线报纸中提取新闻故事,并将其迁移到规范化格式。
{"title":"Normalizing digital news-stories for preservation","authors":"Muzammil Khan, Arif Ur Rahman, M. D. Awan, Syed Mehtab Alam","doi":"10.1109/ICDIM.2016.7829785","DOIUrl":"https://doi.org/10.1109/ICDIM.2016.7829785","url":null,"abstract":"Preserving news stories may be important because of various reasons like they provide detailed information about events and they may be used for research purposes in the long term. However, the news stories published online are in danger because of reasons like constant change in the technologies used to publish information and the formats for publication. Certain institutions or individuals may be interested in preserving news stories related to a particular event or topic. The stories should be collected from various online newspapers and preserved for the long term. The major issue in the preservation process is that newspapers use different formats for online publication of the stories. The paper presents a tool which is developed to addresses the issue. The tool facilitates users in the extraction of news stories from various online newspapers and migration to a normalized format.","PeriodicalId":146662,"journal":{"name":"2016 Eleventh International Conference on Digital Information Management (ICDIM)","volume":"23 3 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123502924","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
Extracting keyword and keyphrase from online privacy policies 从在线隐私策略中提取关键字和关键短语
Pub Date : 2016-09-01 DOI: 10.1109/ICDIM.2016.7829792
Dhiren A. Audich, R. Dara, B. Nonnecke
One of the key components of constructing an ontology is a taxonomy. Creating a comprehensive taxonomy involves extracting keywords and keyphrases from the domain corpus. It is a time consuming endeavour that involves domain expertise and syntactic and structural knowledge of the corpus in question. In this paper we explore different keyword and keyphrase extraction algorithms for the domain of online privacy policies. To do this we used a variety of well-known techniques such as TF-IDF, RAKE, TextRank, and AlchemyAPI, benchmarked against manual annotation. We then further evaluated the performances of various algorithms over a large corpus of 631 privacy policies. Due to the inconsistent language of privacy policies algorithms evaluating single documents (RAKE, TextRank, AlchemyAPI) outperformed the one evaluating the entire corpus (TF-IDF).
构建本体的关键组件之一是分类法。创建一个全面的分类法涉及从领域语料库中提取关键字和关键短语。这是一项耗时的工作,涉及领域专业知识和语料库的句法和结构知识。本文探讨了在线隐私策略领域中不同的关键字和关键短语提取算法。为此,我们使用了各种众所周知的技术,如TF-IDF、RAKE、TextRank和AlchemyAPI,并对手动注释进行基准测试。然后,我们在631个隐私策略的大型语料库上进一步评估了各种算法的性能。由于隐私策略语言不一致,评估单个文档的算法(RAKE、TextRank、AlchemyAPI)的性能优于评估整个语料库的算法(TF-IDF)。
{"title":"Extracting keyword and keyphrase from online privacy policies","authors":"Dhiren A. Audich, R. Dara, B. Nonnecke","doi":"10.1109/ICDIM.2016.7829792","DOIUrl":"https://doi.org/10.1109/ICDIM.2016.7829792","url":null,"abstract":"One of the key components of constructing an ontology is a taxonomy. Creating a comprehensive taxonomy involves extracting keywords and keyphrases from the domain corpus. It is a time consuming endeavour that involves domain expertise and syntactic and structural knowledge of the corpus in question. In this paper we explore different keyword and keyphrase extraction algorithms for the domain of online privacy policies. To do this we used a variety of well-known techniques such as TF-IDF, RAKE, TextRank, and AlchemyAPI, benchmarked against manual annotation. We then further evaluated the performances of various algorithms over a large corpus of 631 privacy policies. Due to the inconsistent language of privacy policies algorithms evaluating single documents (RAKE, TextRank, AlchemyAPI) outperformed the one evaluating the entire corpus (TF-IDF).","PeriodicalId":146662,"journal":{"name":"2016 Eleventh International Conference on Digital Information Management (ICDIM)","volume":"37 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123751169","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 9
Methods for supporting the understanding of differences between search intentions and actual browsing situations in collaborative exploration 支持理解协同探索中搜索意图和实际浏览情况之间差异的方法
Pub Date : 2016-09-01 DOI: 10.1109/ICDIM.2016.7829772
H. Nakayama, R. Onuma, Hayato Takagi, H. Kaminaga, Y. Miyadera, Shoichi Nakamura
Collaborative exploration is one of the essential factors in advanced intellectual activities such as group work in project-based learning (PBL) and research work. Skillful sharing of the intentions of search and their results is quite important to enable collaborative exploration to be smoothly conducted. However, such sharing is usually difficult for members since they often face difficulties in expressing search intentions into queries and suffer from the troublesome activity of page selection. Such problems become more serious for novices. In particular, it is important but difficult to sufficiently understand the differences between search intentions and actual browsing situations. Moreover, there is often insufficient mutual understanding of differences in search policies between members of collaborative exploration since they tend to superficially confirm the search results. This research was aimed at developing novel support to cultivate the consideration skill of search strategy focusing on the novices' understanding of search contexts. This paper mainly describes the framework of support methods and provides a system overview. This paper also discusses the basic effectiveness and characteristics of our methods based on the results obtained from an experiment.
协作探索是项目学习(PBL)和研究工作中的小组合作等高级智力活动的重要因素之一。熟练地分享搜索意图及其结果对于协作探索的顺利进行是非常重要的。然而,这样的共享对于成员来说通常是困难的,因为他们经常面临在查询中表达搜索意图的困难,并遭受页面选择的麻烦活动。对于新手来说,这样的问题变得更加严重。特别是,充分理解搜索意图和实际浏览情况之间的差异很重要,但却很困难。此外,协作探索的成员之间往往缺乏对搜索策略差异的相互理解,因为他们倾向于肤浅地确认搜索结果。本研究旨在开发新的支持,以培养新手对搜索语境的理解为重点,培养他们对搜索策略的考虑能力。本文主要描述了支持方法的框架,并给出了系统概述。本文还根据实验结果讨论了该方法的基本有效性和特点。
{"title":"Methods for supporting the understanding of differences between search intentions and actual browsing situations in collaborative exploration","authors":"H. Nakayama, R. Onuma, Hayato Takagi, H. Kaminaga, Y. Miyadera, Shoichi Nakamura","doi":"10.1109/ICDIM.2016.7829772","DOIUrl":"https://doi.org/10.1109/ICDIM.2016.7829772","url":null,"abstract":"Collaborative exploration is one of the essential factors in advanced intellectual activities such as group work in project-based learning (PBL) and research work. Skillful sharing of the intentions of search and their results is quite important to enable collaborative exploration to be smoothly conducted. However, such sharing is usually difficult for members since they often face difficulties in expressing search intentions into queries and suffer from the troublesome activity of page selection. Such problems become more serious for novices. In particular, it is important but difficult to sufficiently understand the differences between search intentions and actual browsing situations. Moreover, there is often insufficient mutual understanding of differences in search policies between members of collaborative exploration since they tend to superficially confirm the search results. This research was aimed at developing novel support to cultivate the consideration skill of search strategy focusing on the novices' understanding of search contexts. This paper mainly describes the framework of support methods and provides a system overview. This paper also discusses the basic effectiveness and characteristics of our methods based on the results obtained from an experiment.","PeriodicalId":146662,"journal":{"name":"2016 Eleventh International Conference on Digital Information Management (ICDIM)","volume":"20 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125149453","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
2016 Eleventh International Conference on Digital Information Management (ICDIM)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1