首页 > 最新文献

Proceedings of the XV Brazilian Symposium on Information Systems最新文献

英文 中文
Open Data Extraction, Transformation, and Loading as a Tool for Supporting 2018 Elections' Voters 开放数据提取、转换和加载作为支持2018年选举选民的工具
Pub Date : 2019-05-20 DOI: 10.1145/3330204.3330232
Nélson R. S. Passos, Ariel F. Rodrigues, Hendrik T. Macedo, Bruno O. Prado, G. J. F. D. Silva, L. Matos
Democracy is a political regime based on the majority's choice. However, people can only make conscious decisions if they have access to high quality information. This paper aimed to join data from different sources and to transform them in knowledge to Brazilians voters. It applied ETL (Extract, Transform, Load) methods on open and property data to build a process that covers data gathering and transformation, dataset generation, database modeling and population, public APIs development, and a mobile app as the knowledge's visualization model. As a result, for 2018 Brazil's general elections, it processed almost two million candidacies, half a million deputies' tasks and five thousand court lawsuits. Furthermore, the products released by this research reached good performance indicators: the access logs recorded more than three million hits for the public API and twelve thousand downloads for the mobile app in the last week of the first-round's political campaign.
民主是一种建立在多数人选择基础上的政治制度。然而,人们只有在获得高质量信息的情况下才能做出有意识的决定。本文旨在将来自不同来源的数据结合起来,并将其转化为巴西选民的知识。它在开放数据和属性数据上应用ETL (Extract, Transform, Load)方法来构建一个过程,该过程涵盖数据收集和转换、数据集生成、数据库建模和填充、公共api开发以及作为知识可视化模型的移动应用程序。因此,在2018年巴西大选中,它处理了近200万名候选人、50万名代表的任务和5000起法庭诉讼。此外,该研究发布的产品达到了良好的性能指标:在第一轮政治竞选的最后一周,访问日志记录了超过300万次公共API点击和12000次移动应用程序下载。
{"title":"Open Data Extraction, Transformation, and Loading as a Tool for Supporting 2018 Elections' Voters","authors":"Nélson R. S. Passos, Ariel F. Rodrigues, Hendrik T. Macedo, Bruno O. Prado, G. J. F. D. Silva, L. Matos","doi":"10.1145/3330204.3330232","DOIUrl":"https://doi.org/10.1145/3330204.3330232","url":null,"abstract":"Democracy is a political regime based on the majority's choice. However, people can only make conscious decisions if they have access to high quality information. This paper aimed to join data from different sources and to transform them in knowledge to Brazilians voters. It applied ETL (Extract, Transform, Load) methods on open and property data to build a process that covers data gathering and transformation, dataset generation, database modeling and population, public APIs development, and a mobile app as the knowledge's visualization model. As a result, for 2018 Brazil's general elections, it processed almost two million candidacies, half a million deputies' tasks and five thousand court lawsuits. Furthermore, the products released by this research reached good performance indicators: the access logs recorded more than three million hits for the public API and twelve thousand downloads for the mobile app in the last week of the first-round's political campaign.","PeriodicalId":348938,"journal":{"name":"Proceedings of the XV Brazilian Symposium on Information Systems","volume":"17 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-05-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115314346","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
On the Effects of Developers' Intuition on Measuring Similarity Between UML Models 开发人员的直觉对度量UML模型间相似性的影响
Pub Date : 2019-05-20 DOI: 10.1145/3330204.3330238
L. Gonçales, Kleinner Farias, Vinícius Bischoff
Software design models play a key role in many activities of information systems engineering, such as documenting software artefacts, communicating project decisions, and code generation. In this scenario, the techniques for comparison of software design models are used for several purposes, such as, for detecting clones, and model evolution. In the last decades, academia proposed different techniques for comparing software models. Even using these different techniques for model comparison, this process is still an activity of a subjective nature, because during this process, different developers can interpret the similarity differently. Thus, the problem is that it is still unknown if developers has the same intuition in order to resolve comparison of software design models. For this, the main objective of this work is to explore the effects of their experience level, i.e., experienced and inexperienced developers, relative to their effort and correctness for resolving activities of comparing software design models. Therefore, a controlled experiment was conducted to evaluate the developer's experience level regarding on similarities of UML Models. The results show that the developer's experience does not affect the understanding of similarities activities.
软件设计模型在信息系统工程的许多活动中扮演着关键的角色,例如记录软件工件、交流项目决策和代码生成。在这个场景中,比较软件设计模型的技术用于几个目的,例如,用于检测克隆和模型演化。在过去的几十年里,学术界提出了不同的技术来比较软件模型。即使使用这些不同的技术进行模型比较,这个过程仍然是一种主观的活动,因为在这个过程中,不同的开发人员可以以不同的方式解释相似性。因此,问题在于,为了解决软件设计模型的比较,开发人员是否具有相同的直觉仍然是未知的。为此,这项工作的主要目标是探索他们的经验水平的影响,即,有经验和没有经验的开发人员,相对于他们的努力和解决比较软件设计模型的活动的正确性。因此,进行了一个控制实验来评估开发人员在UML模型相似性方面的经验水平。结果表明,开发人员的经验不影响对相似性活动的理解。
{"title":"On the Effects of Developers' Intuition on Measuring Similarity Between UML Models","authors":"L. Gonçales, Kleinner Farias, Vinícius Bischoff","doi":"10.1145/3330204.3330238","DOIUrl":"https://doi.org/10.1145/3330204.3330238","url":null,"abstract":"Software design models play a key role in many activities of information systems engineering, such as documenting software artefacts, communicating project decisions, and code generation. In this scenario, the techniques for comparison of software design models are used for several purposes, such as, for detecting clones, and model evolution. In the last decades, academia proposed different techniques for comparing software models. Even using these different techniques for model comparison, this process is still an activity of a subjective nature, because during this process, different developers can interpret the similarity differently. Thus, the problem is that it is still unknown if developers has the same intuition in order to resolve comparison of software design models. For this, the main objective of this work is to explore the effects of their experience level, i.e., experienced and inexperienced developers, relative to their effort and correctness for resolving activities of comparing software design models. Therefore, a controlled experiment was conducted to evaluate the developer's experience level regarding on similarities of UML Models. The results show that the developer's experience does not affect the understanding of similarities activities.","PeriodicalId":348938,"journal":{"name":"Proceedings of the XV Brazilian Symposium on Information Systems","volume":"36 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-05-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122684826","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Kairós
Pub Date : 2019-05-20 DOI: 10.1145/3330204.3330205
F. C. Rodrigues, A. Filippetto, J. Barbosa
This paper presents a computational model entitled Kairós for prediction and recommendation in project schedules. The model uses context prediction mechanisms based on task data and projects stored during its execution. The recommendations are made to the manager in a proactive manner, considering best practices in project management and learning with the approval or rejection of each recommendation. A prototype was implemented based on the proposed model, and through it, an evaluation was carried out using simulated use cases with real data from a large company. The results showed that the model was able to predict with precision of 93% if a task would be completed with delay, with 87% accuracy.
{"title":"Kairós","authors":"F. C. Rodrigues, A. Filippetto, J. Barbosa","doi":"10.1145/3330204.3330205","DOIUrl":"https://doi.org/10.1145/3330204.3330205","url":null,"abstract":"This paper presents a computational model entitled Kairós for prediction and recommendation in project schedules. The model uses context prediction mechanisms based on task data and projects stored during its execution. The recommendations are made to the manager in a proactive manner, considering best practices in project management and learning with the approval or rejection of each recommendation. A prototype was implemented based on the proposed model, and through it, an evaluation was carried out using simulated use cases with real data from a large company. The results showed that the model was able to predict with precision of 93% if a task would be completed with delay, with 87% accuracy.","PeriodicalId":348938,"journal":{"name":"Proceedings of the XV Brazilian Symposium on Information Systems","volume":"40 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-05-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114587063","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
How do software technologies impact the daily of people with autism in Brazil: A survey 软件技术如何影响巴西自闭症患者的日常生活:一项调查
Pub Date : 2019-05-20 DOI: 10.1145/3330204.3330274
Tamires A. S. Sousa, V. D. Ferreira, A. B. Marques
Autistic Spectrum Disorder (ASD) is characterized by persistent deficits in communication and social interaction, restrictive and repetitive patterns of behavior. The development of information systems for supporting the treatment of ASD has been intensified in recent years, allowing new ways of treatment. Although new systems are developed, there are still few studies that investigate the impact on the use of these systems on the daily of autistic users. In this sense, we conducted a research to investigate the impact on the use of information systems by autistic users. Our methodology adopted the following steps: 1) immersion in groups of social networks in which discussions about ASD are carried out; 2) identification of the research target audience; 3) creation of the research collection instrument; 4) execution of a survey with parents and professionals that care of autistic children; 5) analysis of the data obtained. During two weeks, we obtained 53 questionnaire responses. We observed that 96.2% of the respondents indicate that their children have access to software technologies and 90.5% agree that the technologies can support the teaching and learning of autistic people. We identified positive and negative characteristics of software technologies in order to provide opportunities for improving the existing systems or developing systems more adequate to needs of autistic children.
自闭症谱系障碍(ASD)的特征是沟通和社会互动的持续缺陷,限制性和重复性的行为模式。近年来,支持ASD治疗的信息系统的发展得到了加强,从而提供了新的治疗方法。虽然开发了新的系统,但仍然很少有研究调查这些系统对自闭症患者日常使用的影响。在这个意义上,我们进行了一项研究,调查自闭症用户对信息系统使用的影响。我们的方法采用了以下步骤:1)沉浸在讨论自闭症谱系障碍的社交网络群体中;2)研究目标受众的识别;3)科研采集工具的创建;4)对照顾自闭症儿童的家长和专业人士进行调查;5)对所得数据进行分析。在两周的时间里,我们获得了53份问卷的回复。我们观察到,96.2%的受访者表示他们的孩子可以使用软件技术,90.5%的受访者认为这些技术可以支持自闭症患者的教学。我们确定了软件技术的积极和消极特征,以便为改进现有系统或开发更适合自闭症儿童需要的系统提供机会。
{"title":"How do software technologies impact the daily of people with autism in Brazil: A survey","authors":"Tamires A. S. Sousa, V. D. Ferreira, A. B. Marques","doi":"10.1145/3330204.3330274","DOIUrl":"https://doi.org/10.1145/3330204.3330274","url":null,"abstract":"Autistic Spectrum Disorder (ASD) is characterized by persistent deficits in communication and social interaction, restrictive and repetitive patterns of behavior. The development of information systems for supporting the treatment of ASD has been intensified in recent years, allowing new ways of treatment. Although new systems are developed, there are still few studies that investigate the impact on the use of these systems on the daily of autistic users. In this sense, we conducted a research to investigate the impact on the use of information systems by autistic users. Our methodology adopted the following steps: 1) immersion in groups of social networks in which discussions about ASD are carried out; 2) identification of the research target audience; 3) creation of the research collection instrument; 4) execution of a survey with parents and professionals that care of autistic children; 5) analysis of the data obtained. During two weeks, we obtained 53 questionnaire responses. We observed that 96.2% of the respondents indicate that their children have access to software technologies and 90.5% agree that the technologies can support the teaching and learning of autistic people. We identified positive and negative characteristics of software technologies in order to provide opportunities for improving the existing systems or developing systems more adequate to needs of autistic children.","PeriodicalId":348938,"journal":{"name":"Proceedings of the XV Brazilian Symposium on Information Systems","volume":"54 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-05-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129477778","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Decision Support System for Precision Livestock: Machine Learning-Based Prediction Module for Stocking Rate Adjustment 精准畜牧业决策支持系统:基于机器学习的放养率调整预测模块
Pub Date : 2019-05-20 DOI: 10.1145/3330204.3330222
L. Schulte, N. Perez, Leonardo Bidese de Pinho, G. Trentin
The increasing worldwide demand for resources such as water and food brings the need for the application of scientific methods in agriculture and livestock to increase their productivity. One way to increase the efficiency of productive systems that make extensive beef cattle breeding is by adjusting the pasture stocking rate to optimize the animal weight gain per hectare. The present work describes a module for Farm Management Information System (FMIS) based on Long Short-Term Memory (LSTM) neural networks to estimate forage mass by means of historical pasture growth data collected through the direct method associated with meteorological data. The proposed method is based on exploratory and experimental interdisciplinary research, with systematic bibliographic research and study case. The results show that LSTM neural networks are able to make a reasonable estimate for the dry mass variation over time. Using this estimate, one can obtain a gain/hectare/year of 121 kg of live weight against 70 kg where there is no adjustment of animal load and 98 kg where this adjustment is made based on the estimate of the previous month.
世界范围内对水和食物等资源的需求不断增加,因此需要在农业和畜牧业中应用科学方法,以提高其生产力。提高生产系统效率的一种方法是调整牧场放养率,以优化动物每公顷增重。本文描述了一个基于长短期记忆(LSTM)神经网络的农场管理信息系统(FMIS)模块,该模块通过与气象数据相关联的直接方法收集历史牧场生长数据来估计饲料质量。本文提出的方法是基于探索性和实验性的跨学科研究,采用系统的文献研究和案例研究。结果表明,LSTM神经网络能够对干质量随时间的变化做出合理的估计。根据这一估计,每公顷/年可获得121公斤活重,而在不调整动物负荷的情况下可获得70公斤活重,在根据上个月的估计进行调整的情况下可获得98公斤活重。
{"title":"Decision Support System for Precision Livestock: Machine Learning-Based Prediction Module for Stocking Rate Adjustment","authors":"L. Schulte, N. Perez, Leonardo Bidese de Pinho, G. Trentin","doi":"10.1145/3330204.3330222","DOIUrl":"https://doi.org/10.1145/3330204.3330222","url":null,"abstract":"The increasing worldwide demand for resources such as water and food brings the need for the application of scientific methods in agriculture and livestock to increase their productivity. One way to increase the efficiency of productive systems that make extensive beef cattle breeding is by adjusting the pasture stocking rate to optimize the animal weight gain per hectare. The present work describes a module for Farm Management Information System (FMIS) based on Long Short-Term Memory (LSTM) neural networks to estimate forage mass by means of historical pasture growth data collected through the direct method associated with meteorological data. The proposed method is based on exploratory and experimental interdisciplinary research, with systematic bibliographic research and study case. The results show that LSTM neural networks are able to make a reasonable estimate for the dry mass variation over time. Using this estimate, one can obtain a gain/hectare/year of 121 kg of live weight against 70 kg where there is no adjustment of animal load and 98 kg where this adjustment is made based on the estimate of the previous month.","PeriodicalId":348938,"journal":{"name":"Proceedings of the XV Brazilian Symposium on Information Systems","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-05-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129099205","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Enrichment of dictionaries to improve the automatic classification of feelings in postings related to the use of systems 丰富词典,提高自动分类的感受,在帖子中使用相关系统
Pub Date : 2019-05-20 DOI: 10.1145/3330204.3330219
Afonso Matheus Sousa Lima, M. Mendes, L. A. Cruz
This work proposes an investigation to improve the efficiency of a lexical-based classifier, the SentiStrength, for automatic sentiment detection in postings related to the use of systems. To achieve this goal, the TF-IDF metric was used to select words that are related to the domain of the posts, which will enrich the dictionary used by the tool to generate the polarity of the posts. The efficiency of a dictionarie enriched with words in their root form and a dictionarie enriched with lematized words will also be investigated. The research was conducted with 2108 sentences extracted from the reviews section of the Play Store on urban mobility applications, such as Waze, Google Maps and GPS Brazil. One of the results obtained was a 7.3 % increase in the accuracy of the classifier when using enriched dictionaries.
这项工作提出了一项调查,以提高基于词汇的分类器SentiStrength的效率,用于与系统使用相关的帖子中的自动情感检测。为了实现这一目标,使用TF-IDF度量来选择与帖子领域相关的单词,这将丰富该工具用于生成帖子极性的字典。本文还将对词根形式词典和词根形式词典的效率进行研究。这项研究从Play商店的城市移动应用评论部分提取了2108个句子,如Waze、谷歌地图和GPS巴西。其中一个结果是,当使用丰富的字典时,分类器的准确性提高了7.3%。
{"title":"Enrichment of dictionaries to improve the automatic classification of feelings in postings related to the use of systems","authors":"Afonso Matheus Sousa Lima, M. Mendes, L. A. Cruz","doi":"10.1145/3330204.3330219","DOIUrl":"https://doi.org/10.1145/3330204.3330219","url":null,"abstract":"This work proposes an investigation to improve the efficiency of a lexical-based classifier, the SentiStrength, for automatic sentiment detection in postings related to the use of systems. To achieve this goal, the TF-IDF metric was used to select words that are related to the domain of the posts, which will enrich the dictionary used by the tool to generate the polarity of the posts. The efficiency of a dictionarie enriched with words in their root form and a dictionarie enriched with lematized words will also be investigated. The research was conducted with 2108 sentences extracted from the reviews section of the Play Store on urban mobility applications, such as Waze, Google Maps and GPS Brazil. One of the results obtained was a 7.3 % increase in the accuracy of the classifier when using enriched dictionaries.","PeriodicalId":348938,"journal":{"name":"Proceedings of the XV Brazilian Symposium on Information Systems","volume":"8 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-05-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116509397","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Software Startups Success Factors Study under the Entrepreneurial Perspective 创业视角下的软件创业成功因素研究
Pub Date : 2019-05-20 DOI: 10.1145/3330204.3330263
Tatiany Xavier de Godoi, A. Menolli, Gustavo Marcelino Dionisio
Entrepreneurship, innovation and startup are widely used terms lately, and much information is currently available on this subject. Software startups are companies that have particular characteristics, such as being scalable, developing innovative products, and living in an environment of uncertainty. Considering the scenario of these types of companies is still incipient, this paper aims to analyze the perception of software startups in relation to the main factors described in the literature that can lead to success or failure. For that, a survey was carried out with software startups incubated in Paraná. The results show that the startups perception is that their success is related only to internal factors of the company, and that many software startups are not prepared as they should in the early stages of development, not applying several concepts described in the literature as fundamental to aid in the business development and validation.
创业、创新和创业是最近被广泛使用的术语,目前有很多关于这一主题的信息。软件初创公司是具有特定特征的公司,例如可扩展、开发创新产品和生活在不确定的环境中。考虑到这些类型的公司的情况仍处于起步阶段,本文旨在分析与文献中描述的可能导致成功或失败的主要因素相关的软件初创公司的看法。为此,我们对在帕拉纳孵化的软件初创公司进行了一项调查。结果表明,创业公司认为他们的成功只与公司的内部因素有关,许多软件创业公司在开发的早期阶段没有做好准备,没有应用文献中描述的几个概念作为帮助业务开发和验证的基础。
{"title":"Software Startups Success Factors Study under the Entrepreneurial Perspective","authors":"Tatiany Xavier de Godoi, A. Menolli, Gustavo Marcelino Dionisio","doi":"10.1145/3330204.3330263","DOIUrl":"https://doi.org/10.1145/3330204.3330263","url":null,"abstract":"Entrepreneurship, innovation and startup are widely used terms lately, and much information is currently available on this subject. Software startups are companies that have particular characteristics, such as being scalable, developing innovative products, and living in an environment of uncertainty. Considering the scenario of these types of companies is still incipient, this paper aims to analyze the perception of software startups in relation to the main factors described in the literature that can lead to success or failure. For that, a survey was carried out with software startups incubated in Paraná. The results show that the startups perception is that their success is related only to internal factors of the company, and that many software startups are not prepared as they should in the early stages of development, not applying several concepts described in the literature as fundamental to aid in the business development and validation.","PeriodicalId":348938,"journal":{"name":"Proceedings of the XV Brazilian Symposium on Information Systems","volume":"14 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-05-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124003840","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Louvre: A Framework for Metadata Curation in Data Ecosystem Louvre:数据生态系统中的元数据管理框架
Pub Date : 2019-05-20 DOI: 10.1145/3330204.3330248
Marcelo Iury S. Oliveira, B. Lóscio
Data Ecosystems are a cultural, technological, and social phenomenon based on the interplay of technology, actors and businesses, which provide an environment for creating, managing and sustaining data sharing initiatives. There is a general consensus as to the crucial role metadata can play on the Data Ecosystem. However, in most cases, the metadata management is underspecified, if not unaddressed at all. The employment of a metadata curation strategy can bring an ecosystem success and further ensure realization of Data Ecosystem actors' purposes. In this work, our contribution is proposing a Metadata Curation Framework, called Louvre, which proposes a wide range of processes for curating metadata in Data Ecosystems. The promise is the employment of a well-conceived, efficient curation strategy for metadata.
数据生态系统是一种基于技术、行动者和企业相互作用的文化、技术和社会现象,为创建、管理和维持数据共享举措提供了环境。对于元数据在数据生态系统中扮演的关键角色,人们达成了普遍共识。然而,在大多数情况下,元数据管理没有被充分指定,甚至根本没有被处理。采用元数据管理策略可以带来生态系统的成功,并进一步确保数据生态系统参与者目的的实现。在这项工作中,我们的贡献是提出一个元数据管理框架,称为Louvre,它提出了在数据生态系统中管理元数据的广泛过程。它的承诺是为元数据提供一个精心设计的、高效的管理策略。
{"title":"Louvre: A Framework for Metadata Curation in Data Ecosystem","authors":"Marcelo Iury S. Oliveira, B. Lóscio","doi":"10.1145/3330204.3330248","DOIUrl":"https://doi.org/10.1145/3330204.3330248","url":null,"abstract":"Data Ecosystems are a cultural, technological, and social phenomenon based on the interplay of technology, actors and businesses, which provide an environment for creating, managing and sustaining data sharing initiatives. There is a general consensus as to the crucial role metadata can play on the Data Ecosystem. However, in most cases, the metadata management is underspecified, if not unaddressed at all. The employment of a metadata curation strategy can bring an ecosystem success and further ensure realization of Data Ecosystem actors' purposes. In this work, our contribution is proposing a Metadata Curation Framework, called Louvre, which proposes a wide range of processes for curating metadata in Data Ecosystems. The promise is the employment of a well-conceived, efficient curation strategy for metadata.","PeriodicalId":348938,"journal":{"name":"Proceedings of the XV Brazilian Symposium on Information Systems","volume":"32 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-05-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116387036","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
A Deep Learning Approach to the Malware Classification Problem using Autoencoders 基于自编码器的恶意软件分类问题的深度学习方法
Pub Date : 2019-05-20 DOI: 10.1145/3330204.3330229
Dhiego Ramos Pinto, J. C. Duarte, R. Sant'Ana
Detecting malicious code or categorizing it among families has become an increasingly difficult task. Malware1 exploits vulnerabilities and employ sophisticated techniques to avoid their detection and further classification, challenging cybersecurity teams, governments, enterprises, and the ordinary user, causing uncountable losses annually. Traditional machine learning algorithms have been used to attack the problem, although, these methods are heavily relying on domain expertise to be successful. Deep Learning methods requires less dependency on feature engineering, discovering the important features straightly from the raw data, recognizing patterns that humans usually can't. This work presents a deep learning approach for malware multi-class classification based on an unsupervised pre-trained classifier, using opcodes and its operands frequencies as raw data, ignoring knowledge that could be acquired from any known features from the malware families. The results confirmed that the approach is well succeeded and our best model achieved a MacroF1 of 93.14% a competitive result comparing to best-known classifier, since it uses less information about the malware.
检测恶意代码或在家庭中对其进行分类已成为越来越困难的任务。恶意软件1利用漏洞并采用复杂的技术来避免其检测和进一步分类,挑战网络安全团队,政府,企业和普通用户,每年造成不可估量的损失。传统的机器学习算法已经被用来解决这个问题,尽管这些方法在很大程度上依赖于领域的专业知识才能取得成功。深度学习方法对特征工程的依赖较少,直接从原始数据中发现重要特征,识别人类通常无法识别的模式。这项工作提出了一种基于无监督预训练分类器的恶意软件多类分类的深度学习方法,使用操作码及其操作数频率作为原始数据,忽略了可以从恶意软件家族的任何已知特征中获得的知识。结果证实该方法非常成功,我们最好的模型实现了93.14%的MacroF1,与最知名的分类器相比,这是一个有竞争力的结果,因为它使用了较少的恶意软件信息。
{"title":"A Deep Learning Approach to the Malware Classification Problem using Autoencoders","authors":"Dhiego Ramos Pinto, J. C. Duarte, R. Sant'Ana","doi":"10.1145/3330204.3330229","DOIUrl":"https://doi.org/10.1145/3330204.3330229","url":null,"abstract":"Detecting malicious code or categorizing it among families has become an increasingly difficult task. Malware1 exploits vulnerabilities and employ sophisticated techniques to avoid their detection and further classification, challenging cybersecurity teams, governments, enterprises, and the ordinary user, causing uncountable losses annually. Traditional machine learning algorithms have been used to attack the problem, although, these methods are heavily relying on domain expertise to be successful. Deep Learning methods requires less dependency on feature engineering, discovering the important features straightly from the raw data, recognizing patterns that humans usually can't. This work presents a deep learning approach for malware multi-class classification based on an unsupervised pre-trained classifier, using opcodes and its operands frequencies as raw data, ignoring knowledge that could be acquired from any known features from the malware families. The results confirmed that the approach is well succeeded and our best model achieved a MacroF1 of 93.14% a competitive result comparing to best-known classifier, since it uses less information about the malware.","PeriodicalId":348938,"journal":{"name":"Proceedings of the XV Brazilian Symposium on Information Systems","volume":"5 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-05-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126953424","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Dynamic Discovery of IoT Services Based on Semantic Processing of Event Flows 基于事件流语义处理的物联网服务动态发现
Pub Date : 2019-05-20 DOI: 10.1145/3330204.3330280
Anderson Soares Costa, Rodolfo Sobreira Alves, F. Silva, M. Endler
The Internet of Things (IoT) is a combination of ubiquitous computing and the Internet, in which IoT (smart objects) devices can collect and exchange data, cooperating with people and the environment in which they find themselves. The Internet of Mobile Thing (IoMT), which is an extension of IoT, proposes scenarios in which smart objects and gateways are mobile. In this context, this work is focused on the discovery of smart objects in IoT/IoMT environments considering the following problems: mobility of both smart objects and gateways; great heterogeneity of smart objects and communication technologies to access them; the need for interoperability in these environments; the need to combine data from smart objects with knowledge bases. Therefore, the objective of this work is to combine Semantic Flow Processing with knowledge representation techniques to enrich the instantaneous and continuous discovery of smart objects and their services in IoT/IoMT environments. To this end, an ontology was developed to describe IoT/IoMT scenarios, a semantic middleware, an API for building information systems and applications, and a cloud infrastructure for querying and semantic streaming of smart objects. The evaluation of this work is done through a use case in the field of intelligent parking lots.
物联网(IoT)是无处不在的计算和互联网的结合,其中IoT(智能对象)设备可以收集和交换数据,与人及其所在的环境合作。移动物联网(Internet of Mobile Thing, IoMT)是物联网的延伸,提出了智能对象和智能网关移动的场景。在此背景下,本工作的重点是在IoT/IoMT环境中发现智能对象,考虑以下问题:智能对象和网关的移动性;智能对象的巨大异质性和访问它们的通信技术;在这些环境中需要互操作性;需要将来自智能对象的数据与知识库相结合。因此,本工作的目标是将语义流处理与知识表示技术相结合,以丰富IoT/IoMT环境中智能对象及其服务的即时和连续发现。为此,开发了描述IoT/IoMT场景的本体、语义中间件、用于构建信息系统和应用程序的API以及用于查询和智能对象语义流的云基础设施。通过智能停车场领域的一个用例对该工作进行了评估。
{"title":"Dynamic Discovery of IoT Services Based on Semantic Processing of Event Flows","authors":"Anderson Soares Costa, Rodolfo Sobreira Alves, F. Silva, M. Endler","doi":"10.1145/3330204.3330280","DOIUrl":"https://doi.org/10.1145/3330204.3330280","url":null,"abstract":"The Internet of Things (IoT) is a combination of ubiquitous computing and the Internet, in which IoT (smart objects) devices can collect and exchange data, cooperating with people and the environment in which they find themselves. The Internet of Mobile Thing (IoMT), which is an extension of IoT, proposes scenarios in which smart objects and gateways are mobile. In this context, this work is focused on the discovery of smart objects in IoT/IoMT environments considering the following problems: mobility of both smart objects and gateways; great heterogeneity of smart objects and communication technologies to access them; the need for interoperability in these environments; the need to combine data from smart objects with knowledge bases. Therefore, the objective of this work is to combine Semantic Flow Processing with knowledge representation techniques to enrich the instantaneous and continuous discovery of smart objects and their services in IoT/IoMT environments. To this end, an ontology was developed to describe IoT/IoMT scenarios, a semantic middleware, an API for building information systems and applications, and a cloud infrastructure for querying and semantic streaming of smart objects. The evaluation of this work is done through a use case in the field of intelligent parking lots.","PeriodicalId":348938,"journal":{"name":"Proceedings of the XV Brazilian Symposium on Information Systems","volume":"16 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-05-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128853786","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
期刊
Proceedings of the XV Brazilian Symposium on Information Systems
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1