首页 > 最新文献

2019 XLV Latin American Computing Conference (CLEI)最新文献

英文 中文
CLEI 2019 Sponsors
Pub Date : 2019-09-01 DOI: 10.1109/clei47609.2019.9073974
{"title":"CLEI 2019 Sponsors","authors":"","doi":"10.1109/clei47609.2019.9073974","DOIUrl":"https://doi.org/10.1109/clei47609.2019.9073974","url":null,"abstract":"","PeriodicalId":216193,"journal":{"name":"2019 XLV Latin American Computing Conference (CLEI)","volume":"30 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123356834","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A ranking-based approach for supporting the initial selection of primary studies in a Systematic Literature Review 在系统文献综述中支持初步研究选择的基于排序的方法
Pub Date : 2019-09-01 DOI: 10.1109/CLEI47609.2019.235079
Santiago Gonzalez-Toral, Renán Freire, R. Gualán, Víctor Saquicela
Traditionally most of the steps involved in a Systematic Literature Review (SLR) process are manually executed, causing inconvenience of time and effort, given the massive amount of primary studies available online. This has motivated a lot of research focused on automating the process. Current state-of-the-art methods combine active learning methods and manual selection of primary studies from a smaller set so they can maximize the finding of relevant papers while at the same time minimizing the number of manually reviewed papers. In this work, we propose a novel strategy to further improve these methods whose early success heavily depends on an effective selection of initial papers to be read by researchers using a PCAbased method which combines different document representation and similarity metric approaches to cluster and rank the content within the corpus related to an enriched representation of research questions within the SLR protocol. Validation was carried out over four publicly available data sets corresponding to SLR studies from the Software Engineering domain. The proposed model proved to be more efficient than a BM25 baseline model as a mechanism to select the initial set of relevant primary studies within the top 100 rank, which makes it a promising method to bootstrap an active learning cycle.
传统上,系统文献综述(SLR)过程中涉及的大多数步骤都是手动执行的,由于网上有大量的原始研究,这给时间和精力带来了不便。这激发了许多专注于自动化流程的研究。目前最先进的方法结合了主动学习方法和人工从较小的研究集中选择主要研究,这样他们可以最大限度地找到相关论文,同时最大限度地减少人工审查论文的数量。在这项工作中,我们提出了一种新的策略来进一步改进这些方法,这些方法的早期成功很大程度上取决于研究人员使用基于pcaba的方法有效地选择要阅读的初始论文,该方法结合了不同的文档表示和相似性度量方法,对语料库中与SLR协议中研究问题的丰富表示相关的内容进行聚类和排序。验证是在四个公开可用的数据集上进行的,这些数据集与软件工程领域的单反研究相对应。所提出的模型被证明比BM25基线模型更有效,作为一种机制,在前100名的排名中选择相关的初始研究集,这使得它成为一种有希望的方法来引导主动学习周期。
{"title":"A ranking-based approach for supporting the initial selection of primary studies in a Systematic Literature Review","authors":"Santiago Gonzalez-Toral, Renán Freire, R. Gualán, Víctor Saquicela","doi":"10.1109/CLEI47609.2019.235079","DOIUrl":"https://doi.org/10.1109/CLEI47609.2019.235079","url":null,"abstract":"Traditionally most of the steps involved in a Systematic Literature Review (SLR) process are manually executed, causing inconvenience of time and effort, given the massive amount of primary studies available online. This has motivated a lot of research focused on automating the process. Current state-of-the-art methods combine active learning methods and manual selection of primary studies from a smaller set so they can maximize the finding of relevant papers while at the same time minimizing the number of manually reviewed papers. In this work, we propose a novel strategy to further improve these methods whose early success heavily depends on an effective selection of initial papers to be read by researchers using a PCAbased method which combines different document representation and similarity metric approaches to cluster and rank the content within the corpus related to an enriched representation of research questions within the SLR protocol. Validation was carried out over four publicly available data sets corresponding to SLR studies from the Software Engineering domain. The proposed model proved to be more efficient than a BM25 baseline model as a mechanism to select the initial set of relevant primary studies within the top 100 rank, which makes it a promising method to bootstrap an active learning cycle.","PeriodicalId":216193,"journal":{"name":"2019 XLV Latin American Computing Conference (CLEI)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130645351","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
A Framework for Improving Cold Start Time in Function-as-a-service (FaaS) 功能即服务(FaaS)中改进冷启动时间的框架
Pub Date : 2019-09-01 DOI: 10.1109/CLEI47609.2019.235112
Rogério Dias Moreira, P. S. Barreto
This article describes a framework proposal for the cold start problem in Function-as-a-service (FaaS). The proposed framework has the goal to reduce the execution time and is presented as a prototype implemented that was evaluated with two different experimental scenarios and compared with a commercial proposal, the FaaS AWS Lambda from Amazon. The results show that the proposed framework may be considered a solution for the cold start problem and may improve the performance for applications that require a low response time.
本文描述了功能即服务(FaaS)中冷启动问题的框架建议。提议的框架的目标是减少执行时间,并以原型实现的形式呈现,该原型使用两个不同的实验场景进行了评估,并与商业提案(来自Amazon的FaaS AWS Lambda)进行了比较。结果表明,所提出的框架可以被认为是解决冷启动问题的一种方法,并且可以提高需要低响应时间的应用程序的性能。
{"title":"A Framework for Improving Cold Start Time in Function-as-a-service (FaaS)","authors":"Rogério Dias Moreira, P. S. Barreto","doi":"10.1109/CLEI47609.2019.235112","DOIUrl":"https://doi.org/10.1109/CLEI47609.2019.235112","url":null,"abstract":"This article describes a framework proposal for the cold start problem in Function-as-a-service (FaaS). The proposed framework has the goal to reduce the execution time and is presented as a prototype implemented that was evaluated with two different experimental scenarios and compared with a commercial proposal, the FaaS AWS Lambda from Amazon. The results show that the proposed framework may be considered a solution for the cold start problem and may improve the performance for applications that require a low response time.","PeriodicalId":216193,"journal":{"name":"2019 XLV Latin American Computing Conference (CLEI)","volume":"63 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125842193","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
CLEI 2019 Program Committee CLEI 2019项目委员会
Pub Date : 2019-09-01 DOI: 10.1109/clei47609.2019.9073955
{"title":"CLEI 2019 Program Committee","authors":"","doi":"10.1109/clei47609.2019.9073955","DOIUrl":"https://doi.org/10.1109/clei47609.2019.9073955","url":null,"abstract":"","PeriodicalId":216193,"journal":{"name":"2019 XLV Latin American Computing Conference (CLEI)","volume":"159 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127342567","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
ILP-based Energy Saving Routing for Software Defined Networking 基于ilp的软件定义网络节能路由
Pub Date : 2019-09-01 DOI: 10.1109/CLEI47609.2019.235109
Gerardo Riveros, Pedro Pablo Cespedes Sanchez, D. Pinto, H. Legal-Ayala
Software defined networking (SDN) is an emerging technology based on the separation of the control plane and the data plane. This allows to obtain benefits, in comparison with traditional networks, in terms of network management, global monitoring-control, cost reduction, and in particular the energy saving by the strategic activation of devices. In this paper, we propose an approach that seeks to minimize the global energy consumption of the network by suspending inactive devices, such as chassis and line cards, as well as limiting the use of links in traffic sessions. For this purpose, we developed an Integer Linear Programming (ILP) model for the SDN routing problem in order to obtain the minimum energy consumption, subject to satisfy all traffic demands. The experimental results on two network topologies for a set of static traffic requests indicate that the proposed model is promising, saving up to 42% of the global energy consumption obtaining a better performance to the models proposed in the literature. On the other hand, the experimental results for incremental semi-dynamic traffic indicate that the performance of the optimization with re-routing improves the approach without re-routing when increasing the traffic in the network, but this improvement is not always perceptible. The approach without re-routing in terms of scalability is promising, by increasing the traffic load not generate interruptions to the traffic already attended and affect the quality of the service.
软件定义网络(SDN)是一种基于控制平面和数据平面分离的新兴技术。与传统网络相比,这可以在网络管理、全局监控、成本降低、特别是通过战略性激活设备节省能源方面获得好处。在本文中,我们提出了一种方法,旨在通过暂停非活动设备(如机箱和线路卡)以及限制流量会话中链路的使用来最大限度地减少网络的全球能耗。为此,我们开发了SDN路由问题的整数线性规划(ILP)模型,以获得最小的能量消耗,同时满足所有流量需求。在两种网络拓扑上对一组静态流量请求的实验结果表明,该模型是有前景的,可节省高达42%的全局能耗,性能优于文献中提出的模型。另一方面,增量半动态流量的实验结果表明,当网络中流量增加时,重路由优化的性能优于不重路由的优化方法,但这种改进并不总是明显的。在可伸缩性方面没有重新路由的方法很有前途,通过增加流量负载而不会对已经参加的流量产生中断并影响服务质量。
{"title":"ILP-based Energy Saving Routing for Software Defined Networking","authors":"Gerardo Riveros, Pedro Pablo Cespedes Sanchez, D. Pinto, H. Legal-Ayala","doi":"10.1109/CLEI47609.2019.235109","DOIUrl":"https://doi.org/10.1109/CLEI47609.2019.235109","url":null,"abstract":"Software defined networking (SDN) is an emerging technology based on the separation of the control plane and the data plane. This allows to obtain benefits, in comparison with traditional networks, in terms of network management, global monitoring-control, cost reduction, and in particular the energy saving by the strategic activation of devices. In this paper, we propose an approach that seeks to minimize the global energy consumption of the network by suspending inactive devices, such as chassis and line cards, as well as limiting the use of links in traffic sessions. For this purpose, we developed an Integer Linear Programming (ILP) model for the SDN routing problem in order to obtain the minimum energy consumption, subject to satisfy all traffic demands. The experimental results on two network topologies for a set of static traffic requests indicate that the proposed model is promising, saving up to 42% of the global energy consumption obtaining a better performance to the models proposed in the literature. On the other hand, the experimental results for incremental semi-dynamic traffic indicate that the performance of the optimization with re-routing improves the approach without re-routing when increasing the traffic in the network, but this improvement is not always perceptible. The approach without re-routing in terms of scalability is promising, by increasing the traffic load not generate interruptions to the traffic already attended and affect the quality of the service.","PeriodicalId":216193,"journal":{"name":"2019 XLV Latin American Computing Conference (CLEI)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130204282","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Method for Project Execution Control based on Soft Computing and Machine Learning 基于软计算和机器学习的项目执行控制方法
Pub Date : 2019-09-01 DOI: 10.1109/CLEI47609.2019.235097
Anié Bermudez Peña, G. F. Castro, D. M. L. Alvarez, I. M. Alcivar, Giselle Lorena Núñez Núñez, Danny Saavedra Cevallos, Jorge Luis Zambrano Santa
To support decision-making, organizations employ dissimilar tools during their projects execution control. However, they are still insufficient in environments with uncertain information and changing conditions in management styles. Deficiencies in systems for controlling the projects execution, affects the quality of their classification in aiding decision-making. An alternative solution is the introduction of soft computing techniques, which provide robustness, efficiency and adaptability at tools. This research proposes a method for project execution control based on soft computing and machine learning, which contributes to improve the project management. The proposed method allows the machine learning and adjusting of fuzzy inference systems to the project evaluation. The results are obtained from the execution of seven algorithms, which are based on space partitioning, neural networks, gradient descent and genetic algorithms. Validation of the proposed system, integrated to a project management tool, ratifies an improvement in the quality of project evaluation. The obtained result provides a contribution to the perfection of tools to support the decision-making in project management organization
为了支持决策,组织在项目执行控制期间使用不同的工具。但是,在信息不确定的环境和管理方式变化的条件下,它们仍然是不够的。控制项目执行的系统的缺陷,影响了它们在辅助决策方面的分类质量。另一种解决方案是引入软计算技术,它在工具上提供健壮性、效率和适应性。本研究提出一种基于软计算和机器学习的项目执行控制方法,有助于提高项目管理水平。该方法将模糊推理系统的机器学习和调整应用到项目评价中。结果由基于空间划分、神经网络、梯度下降和遗传算法的7种算法的执行得到。与项目管理工具相结合的拟议系统的验证,确认了项目评估质量的改进。所得结果有助于完善项目管理组织的决策支持工具
{"title":"Method for Project Execution Control based on Soft Computing and Machine Learning","authors":"Anié Bermudez Peña, G. F. Castro, D. M. L. Alvarez, I. M. Alcivar, Giselle Lorena Núñez Núñez, Danny Saavedra Cevallos, Jorge Luis Zambrano Santa","doi":"10.1109/CLEI47609.2019.235097","DOIUrl":"https://doi.org/10.1109/CLEI47609.2019.235097","url":null,"abstract":"To support decision-making, organizations employ dissimilar tools during their projects execution control. However, they are still insufficient in environments with uncertain information and changing conditions in management styles. Deficiencies in systems for controlling the projects execution, affects the quality of their classification in aiding decision-making. An alternative solution is the introduction of soft computing techniques, which provide robustness, efficiency and adaptability at tools. This research proposes a method for project execution control based on soft computing and machine learning, which contributes to improve the project management. The proposed method allows the machine learning and adjusting of fuzzy inference systems to the project evaluation. The results are obtained from the execution of seven algorithms, which are based on space partitioning, neural networks, gradient descent and genetic algorithms. Validation of the proposed system, integrated to a project management tool, ratifies an improvement in the quality of project evaluation. The obtained result provides a contribution to the perfection of tools to support the decision-making in project management organization","PeriodicalId":216193,"journal":{"name":"2019 XLV Latin American Computing Conference (CLEI)","volume":"70 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129124409","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Risk Catalogs in Software Project Management 软件项目管理中的风险目录
Pub Date : 2019-09-01 DOI: 10.1109/CLEI47609.2019.9089044
V. Machado, Paulo Afonso Parreira Júnior, H. Costa
The software industry is continuously growing, and projects need to be planned to have a better chance of success. But, planning errors in a project can cause the project to fail. These errors, when there is damage/loss or gain, are called risks and need to be managed. Inadequate risk management can lead to project failure. Therefore, risk management in software design is crucial to its success. In this paper, through research in the literature, catalogs of risks that may occur during the development of software projects are presented. Besides, there are measures defined/identified in the literature to support decision making by project managers, using the GQM method.
软件行业在不断发展,需要对项目进行规划,以获得更好的成功机会。但是,项目中的计划错误可能导致项目失败。当存在损害/损失或收益时,这些错误被称为风险,需要加以管理。不充分的风险管理可能导致项目失败。因此,软件设计中的风险管理对其成功至关重要。本文通过对文献的研究,给出了软件项目开发过程中可能出现的风险目录。此外,文献中还定义/识别了一些度量,以支持项目经理使用GQM方法进行决策。
{"title":"Risk Catalogs in Software Project Management","authors":"V. Machado, Paulo Afonso Parreira Júnior, H. Costa","doi":"10.1109/CLEI47609.2019.9089044","DOIUrl":"https://doi.org/10.1109/CLEI47609.2019.9089044","url":null,"abstract":"The software industry is continuously growing, and projects need to be planned to have a better chance of success. But, planning errors in a project can cause the project to fail. These errors, when there is damage/loss or gain, are called risks and need to be managed. Inadequate risk management can lead to project failure. Therefore, risk management in software design is crucial to its success. In this paper, through research in the literature, catalogs of risks that may occur during the development of software projects are presented. Besides, there are measures defined/identified in the literature to support decision making by project managers, using the GQM method.","PeriodicalId":216193,"journal":{"name":"2019 XLV Latin American Computing Conference (CLEI)","volume":"53 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116852859","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Effectiveness of preprocessing techniques over social media texts for the improvement of machine learning based classifiers 社交媒体文本预处理技术对基于机器学习的分类器改进的有效性
Pub Date : 2019-09-01 DOI: 10.1109/CLEI47609.2019.235076
L. Esnaola, Juan Pablo Tessore, Hugo Ramón, C. Russo
The language present in the context of social networks is usually more informal than the one used in traditional sources. The researches that take such content as input for machine learning based classifying algorithms, perform, as a first step, a cleaning and standardization process. The goal of the latter is to improve the accuracy of the classification. In this paper, several cleaning tasks are defined and executed over a dataset of comments extracted from the social network Facebook. The goal is to verify if the corrections, made by such tasks, produce a significant improvement in the accuracy reached by the classifying algorithms. The results obtained, indicate that, over this type of dataset, preprocessing tasks with a reasonably good performance in the correction of errors, do not necessarily produce a noteworthy improvement in the classification accuracy reached by the algorithms.
社交网络中使用的语言通常比传统资源中使用的语言更不正式。将这些内容作为基于机器学习的分类算法的输入的研究,作为第一步,执行一个清理和标准化过程。后者的目标是提高分类的准确性。在本文中,定义了几个清理任务,并在从社交网络Facebook提取的评论数据集上执行。目标是验证这些任务所做的修正是否能显著提高分类算法所达到的准确性。得到的结果表明,在这种类型的数据集上,具有相当好的纠错性能的预处理任务并不一定会使算法达到的分类精度有明显的提高。
{"title":"Effectiveness of preprocessing techniques over social media texts for the improvement of machine learning based classifiers","authors":"L. Esnaola, Juan Pablo Tessore, Hugo Ramón, C. Russo","doi":"10.1109/CLEI47609.2019.235076","DOIUrl":"https://doi.org/10.1109/CLEI47609.2019.235076","url":null,"abstract":"The language present in the context of social networks is usually more informal than the one used in traditional sources. The researches that take such content as input for machine learning based classifying algorithms, perform, as a first step, a cleaning and standardization process. The goal of the latter is to improve the accuracy of the classification. In this paper, several cleaning tasks are defined and executed over a dataset of comments extracted from the social network Facebook. The goal is to verify if the corrections, made by such tasks, produce a significant improvement in the accuracy reached by the classifying algorithms. The results obtained, indicate that, over this type of dataset, preprocessing tasks with a reasonably good performance in the correction of errors, do not necessarily produce a noteworthy improvement in the classification accuracy reached by the algorithms.","PeriodicalId":216193,"journal":{"name":"2019 XLV Latin American Computing Conference (CLEI)","volume":"13 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126925051","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An architectural proposal for the interactive publication of the data classification obtained through a Differentially Private Random Decision Forest 一种基于差分私有随机决策林的数据分类交互式发布的体系结构方案
Pub Date : 2019-09-01 DOI: 10.1109/CLEI47609.2019.235070
Rosinei Cristiano Pereira, F. Lopes
Data are generated in several contexts, by various devices, and are collected by organizations whose aims to obtain as much information as possible to add value to their business. There are plenty of ethical and non-ethical purposes involved such as identifying consumers' needs and then recommend products and services, developing new business, conducting health-related research in order to reduce medical errors, assessing risk of people developing diseases, so on. The organizations’ concerns about risks associated to potential privacy leaks and their impacts have increased dramatically. Thus, apply data mining in process optimization without compromising sensitive data and provide a strong privacy standard are challenges imposed to data stewards, who use techniques and privacy models during data release process. This study aims to propose a classification decision tree application, developed under the Differential Privacy model definition, whose architecture was designed according to the interactive data release model that deploys a barrier to forbid users to have access data in their raw format. In addition, a self-tuning feature that controls the forest growth was put in place, resulting in a better classification performance if compared to the adoption of a fixed amount of trees in the forest. However, there was an increase in processing time. It also was observed in most of the datasets used in the experiment that beyond a threshold the classification performance is reduced by increasing the number of trees that compose the forest.
数据由不同的设备在不同的环境中生成,并由旨在获取尽可能多的信息以增加其业务价值的组织收集。有很多道德和非道德的目的,如确定消费者的需求,然后推荐产品和服务,发展新的业务,进行健康相关的研究,以减少医疗差错,评估人们患疾病的风险,等等。这些组织对潜在隐私泄露风险及其影响的担忧急剧增加。因此,如何在不影响敏感数据的情况下将数据挖掘应用于流程优化,并提供一个强大的隐私标准,是数据管理员在数据发布过程中使用技术和隐私模型所面临的挑战。本研究旨在提出一种分类决策树应用程序,该应用程序在差分隐私模型定义下开发,其架构根据交互式数据发布模型设计,该模型部署了一个屏障,禁止用户访问原始格式的数据。此外,还引入了控制森林生长的自调优功能,与在森林中采用固定数量的树木相比,可以获得更好的分类性能。然而,处理时间有所增加。在实验中使用的大多数数据集中还观察到,超过阈值后,增加组成森林的树木数量会降低分类性能。
{"title":"An architectural proposal for the interactive publication of the data classification obtained through a Differentially Private Random Decision Forest","authors":"Rosinei Cristiano Pereira, F. Lopes","doi":"10.1109/CLEI47609.2019.235070","DOIUrl":"https://doi.org/10.1109/CLEI47609.2019.235070","url":null,"abstract":"Data are generated in several contexts, by various devices, and are collected by organizations whose aims to obtain as much information as possible to add value to their business. There are plenty of ethical and non-ethical purposes involved such as identifying consumers' needs and then recommend products and services, developing new business, conducting health-related research in order to reduce medical errors, assessing risk of people developing diseases, so on. The organizations’ concerns about risks associated to potential privacy leaks and their impacts have increased dramatically. Thus, apply data mining in process optimization without compromising sensitive data and provide a strong privacy standard are challenges imposed to data stewards, who use techniques and privacy models during data release process. This study aims to propose a classification decision tree application, developed under the Differential Privacy model definition, whose architecture was designed according to the interactive data release model that deploys a barrier to forbid users to have access data in their raw format. In addition, a self-tuning feature that controls the forest growth was put in place, resulting in a better classification performance if compared to the adoption of a fixed amount of trees in the forest. However, there was an increase in processing time. It also was observed in most of the datasets used in the experiment that beyond a threshold the classification performance is reduced by increasing the number of trees that compose the forest.","PeriodicalId":216193,"journal":{"name":"2019 XLV Latin American Computing Conference (CLEI)","volume":"327 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123509118","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
CLEI 2019 Organizing Committee CLEI 2019组委会
Pub Date : 2019-09-01 DOI: 10.1109/clei47609.2019.9073824
{"title":"CLEI 2019 Organizing Committee","authors":"","doi":"10.1109/clei47609.2019.9073824","DOIUrl":"https://doi.org/10.1109/clei47609.2019.9073824","url":null,"abstract":"","PeriodicalId":216193,"journal":{"name":"2019 XLV Latin American Computing Conference (CLEI)","volume":"74 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133752181","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
2019 XLV Latin American Computing Conference (CLEI)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1