Anais do XLVIII Seminário Integrado de Software e Hardware (SEMISH 2021)最新文献

英文中文

Generating E-commerce Product Titles in Portuguese 用葡萄牙语生成电子商务产品标题

Anais do XLVIII Seminário Integrado de Software e Hardware (SEMISH 2021)

Pub Date : 2021-07-18 DOI: 10.5753/SEMISH.2021.15835

Livy Real, Karina M. Johansson, Júlio C. S. Mendes, Bianca M. Lopes, Marcio T. I. Oshiro

This paper explores how Natural Language Processing techniques can be integrated to solve real-world problems in the e-commerce scenario. We address the issue of having high quality information products offered to customers in a marketplace platform, composed by thousands of sellers producing original content in multiple languages, following different SEO and cultural assumptions. We propose an NLP pipeline to generate high quality titles products in Portuguese.

本文探讨了如何将自然语言处理技术集成到电子商务场景中来解决实际问题。我们解决了在市场平台上为客户提供高质量信息产品的问题，该平台由成千上万的卖家组成，以多种语言制作原创内容，遵循不同的SEO和文化假设。我们提出了一个自然语言处理管道，以产生高质量的标题产品在葡萄牙。

引用次数: 1

Uso de Aprendizado de Máquina Automatizado para Seleção de Provedores de Nuvem 使用自动机器学习选择云提供商

Anais do XLVIII Seminário Integrado de Software e Hardware (SEMISH 2021)

Pub Date : 2021-07-18 DOI: 10.5753/SEMISH.2021.15802

Kauã B. Hopfer, Adriano Fiorese

Neste trabalho, uma forma de ranqueamento e seleção de provedores de nuvem é apresentada por meio do uso da função de Aprendizado de Máquina Automatizado (AutoML) da plataforma H2O. Ele exibe um sistema de ranqueamento que produz uma pontuação para cada provedor de nuvem avaliado, a partir da qualificação dos requisitos exigidos pelo usuário. Experimentos realizados com o auxílio da plataforma H2O, levando em consideração o treinamento e análise de modelos de regressão, apresentam resultados precisos e mais rápidos quando comparados a alternativa de resolução determinística exata.

本文提出了一种利用H2O平台的自动机器学习函数(AutoML)对云提供商进行排名和选择的方法。它显示了一个排名系统，根据用户的需求评级，为每个被评估的云提供商生成一个分数。在H2O平台的帮助下进行的实验，考虑了回归模型的训练和分析，与精确确定性分辨率的替代方案相比，给出了准确和更快的结果。

引用次数: 0

Digital Identity Challenge: The Security and Convenience Dilemma 数字身份的挑战:安全性和便利性的困境

Anais do XLVIII Seminário Integrado de Software e Hardware (SEMISH 2021)

Pub Date : 2021-07-18 DOI: 10.5753/SEMISH.2021.15829

André Ferraz, C. Ferraz

This paper argues that the essential pieces of an enduring digital identity should be privacy, security, and convenience. Authentication should be frictionless. In this sense, the core of the digital identity of the future will be created around location sensing techniques. Incognia proposes a solution to secure and frictionless authentication for mobile apps that is composed of five steps. Its proprietary technology called environment fingerprinting can identify location spoofing and precisely determine the devices actual location. Incognia has found that most mobile logins, sensitive transactions, and purchases occur at trusted locations. To date, 90% of mobile logins and 89% of mobile banking sessions happen at a trusted location. Experimental results show false-negative rates below 0.004% and a decrease of over 85% of account takeover attacks.

本文认为，持久的数字身份的基本要素应该是隐私、安全性和便利性。认证应该是无摩擦的。从这个意义上说，未来数字身份的核心将围绕位置传感技术创造。Incognia提出了一种安全无摩擦的移动应用认证解决方案，该解决方案由五个步骤组成。其专有技术环境指纹可以识别位置欺骗，并精确确定设备的实际位置。Incognia发现，大多数手机登录、敏感交易和购买都发生在可信的地点。到目前为止，90%的手机登录和89%的手机银行会话发生在可信任的位置。实验结果表明，假阴性率低于0.004%，账户接管攻击减少85%以上。

引用次数: 0

Segurança em Dispositivos Móveis: Um Estudo Sobre a Adoção de Boas Práticas para Proteção em Celulares 移动设备安全:采用移动设备保护最佳实践的研究

Anais do XLVIII Seminário Integrado de Software e Hardware (SEMISH 2021)

Pub Date : 2021-07-18 DOI: 10.5753/SEMISH.2021.15807

Juliana Pereira Cabral, Herleson Paiva Pontes

Nos últimos anos, é possível observar o considerável aumento do número de ataques à segurança da informação em dispositivos móveis no Brasil, onde estima-se que ocorreram 850 mil tentativas objetivando o acesso indevido aos dados pessoais dos usuários. Este trabalho apresenta um estudo sobre a adoção dos recursos, ferramentas e boas práticas de segurança voltadas para celulares. Após a condução de um estudo de caso envolvendo 222 participantes que avaliou aspectos tecnológicos e psicológicos dos usuários acerca da segurança em seus aparelhos, os resultados sugerem que parcela considerável da população possui escasso conhecimento em relação ao emprego dos recursos e boas práticas de proteção eficientes nos dispositivos móveis, tornando-os alvos de crimes cibernéticos.

近年来，在巴西，针对移动设备信息安全的攻击数量显著增加，估计有85万次针对用户个人数据的不当访问尝试。这项工作提出了一项关于手机资源、工具和良好安全实践的采用的研究。后驾驶的一个案例研究涉及222名参与者评估技术和心理关于安全设备的用户,结果表明,相当一部分的就业人口的宝贵知识和最佳实践的资源有效保护移动设备,成为网络犯罪的目标。

引用次数: 0

Cross-Media Sentiment Analysis on German Blogs 德语博客的跨媒体情感分析

Anais do XLVIII Seminário Integrado de Software e Hardware (SEMISH 2021)

Pub Date : 2021-07-18 DOI: 10.5753/SEMISH.2021.15813

Nina N. Zahn, G. P. D. Molin, S. Musse

Social interactions have changed in recent years. People post their thoughts, opinions and feelings on social media platforms more often. Due to the increase in the amount of data on the internet, it is impracticable to carry out the sentiment analysis manually, requiring automation of the process. In this work, we present the corpus Cross-Media German Blog (CGB) which consists of German blogs with feelings in the domain of images, texts and posts (Ground Truth), classified according to human perceptions. We apply existing Machine Learning technologies and lexicons to the corpus to detect the feelings (negative, neutral or positive) of the images and texts and compare the results with the GT. We examined contradictory posts, when the image and text classified by humans in the same post had diverging feelings. The comparison of this article with the analysis of sentiment among the media of Brazilian blogs finds its justification for performance results in cultural differences, since, throughout this work, Brazil is classified as indulgent and Germany as a restrained country.

近年来，社会互动发生了变化。人们更频繁地在社交媒体平台上发布自己的想法、观点和感受。由于互联网上数据量的增加，人工进行情感分析是不切实际的，需要自动化的过程。在这项工作中，我们展示了语料库跨媒体德语博客(CGB)，它由德语博客组成，这些博客在图像、文本和帖子(Ground Truth)领域有感情，并根据人类的感知进行分类。我们将现有的机器学习技术和词汇应用到语料库中，以检测图像和文本的情感(消极、中性或积极)，并将结果与GT进行比较。当人类在同一篇文章中分类的图像和文本具有不同的情感时，我们检查了矛盾的帖子。将这篇文章与巴西博客媒体的情绪分析相比较，可以发现其表现的理由是文化差异，因为在整个研究中，巴西被归类为放纵的国家，而德国则被归类为克制的国家。

引用次数: 0

Anais do XLVIII Seminário Integrado de Software e Hardware (SEMISH 2021)

Pub Date : 2021-07-18 DOI: 10.5753/SEMISH.2021.15815

B. Dalmoro, S. Musse

With the popularization of social networks, the sharing and consumption of content in video format becomes easier. Understanding what makes a video popular and being able to predict its popularity in number of views is useful for both content creators and advertising. In this work, we explore visual features extracted from 1,820 Facebook videos in order to predict whether they will reach more than a certain number of views on the seven days after publication. For this purpose, we used Support Vector Machine with Gaussian Radial Basis Function classification model. Using only visual features as predictors, the model with Video Characteristics and Rigidity features combined reached Kappa of 0.7324, sensitivity of 0.8930, and positive predictive value of 0.8930.

随着社交网络的普及，视频格式内容的分享和消费变得更加容易。了解是什么让一个视频受欢迎，并能够预测其受欢迎的观看次数，这对内容创作者和广告都很有用。在这项工作中，我们探索了从1820个Facebook视频中提取的视觉特征，以预测它们在发布后七天内是否会达到一定数量的观看量。为此，我们使用支持向量机与高斯径向基函数的分类模型。仅使用视觉特征作为预测因子，结合Video Characteristics和刚度特征的模型Kappa值为0.7324，灵敏度为0.8930，阳性预测值为0.8930。

引用次数: 0

Técnicas de Processamento de Linguagem Natural em Denúncias Criminais: Automatização e Classificação de Texto em Português Coloquial 刑事投诉中的自然语言处理技术:葡萄牙语口语文本的自动化与分类

Anais do XLVIII Seminário Integrado de Software e Hardware (SEMISH 2021)

Pub Date : 2021-07-18 DOI: 10.5753/SEMISH.2021.15820

Camila Gusmão, Karla Figueiredo, Walkir Brito

Este artigo apresenta a investigação de Técnicas de Processamento de Linguagem Natural (PLN) em Denúncias Criminais, provenientes do aplicativo do serviço do Disque Denúncia RJ para smartphone. Nele é apresentado o processo de automatização, avaliando e classificando as denúncias, objetivando reduzir o tempo de análise do conteúdo das mensagens, que possui, como principal desafio, textos escritos em linguagem muito informal, contendo muitos erros morfossintáticos. Para alcançar tais objetivos foi necessária uma investigação de técnicas de pré-processamento visando melhorar a acurácia da classificação, que foi realizada por Support Vector Machine (SVM). Os resultados encontrados são bastante promissores para o tipo de textos de denúncias, atingindo uma precisão de 76,11%.

本文介绍了自然语言处理技术(nlp)在刑事投诉中的研究，从电话服务投诉RJ到智能手机的应用。它提出了自动化的过程，评估和分类投诉，旨在减少分析信息内容的时间，这是一个主要的挑战，文本写在非常非正式的语言，包含许多形态句法错误。来实现这些目标是需要预处理技术的研究来提高分类的精度,通过支持向量机(SVM)进行的。结果对投诉文本类型非常有希望，准确率为76.11%。

引用次数: 2

CoEPinKB: A Framework to Understand the Connectivity of Entity Pairs in Knowledge Bases CoEPinKB:一个理解知识库中实体对连通性的框架

Anais do XLVIII Seminário Integrado de Software e Hardware (SEMISH 2021)

Pub Date : 2021-07-18 DOI: 10.5753/SEMISH.2021.15811

J. G. Jiménez, Luiz André Portes Paes Leme, M. Casanova

A knowledge base, expressed using the Resource Description Framework (RDF), can be viewed as a graph whose nodes represent entities and whose edges denote relationships. The entity relatedness problem refers to the problem of discovering and understanding how two entities are related, directly or indirectly, that is, how they are connected by paths in a knowledge base. Strategies designed to solve the entity relatedness problem typically adopt an entity similarity measure to reduce the path search space and a path ranking measure to order and filter the list of paths returned. This paper presents a framework, called CoEPinKB, that supports the empirical evaluation of such strategies. The proposed framework allows combining entity similarity and path ranking measures to generate different path search strategies. The main goals of this paper are to describe the framework and present a performance evaluation of nine different path search strategies.

使用资源描述框架(RDF)表示的知识库可以看作是一个图，其节点表示实体，其边表示关系。实体关联问题是指发现和理解两个实体是如何直接或间接关联的问题，即它们是如何通过知识库中的路径连接起来的问题。解决实体关联问题的策略通常采用实体相似度度量来减少路径搜索空间，采用路径排序度量来对返回的路径列表进行排序和过滤。本文提出了一个名为CoEPinKB的框架，该框架支持对此类策略进行实证评估。该框架允许结合实体相似度和路径排序度量来生成不同的路径搜索策略。本文的主要目标是描述该框架，并对九种不同的路径搜索策略进行性能评估。

引用次数: 3

Implementação Adaptativa de Variante do Algoritmo de Otimização Extrema Generalizada (GEO) 广义极值优化算法(GEO)的自适应变体实现

Anais do XLVIII Seminário Integrado de Software e Hardware (SEMISH 2021)

Pub Date : 2021-07-18 DOI: 10.5753/SEMISH.2021.15832

L. B. D. Luz, Fabiano Luís de Sousa, Ronan Arraes Jardim Chagas

O GEO é um algoritmo evolutivo que recentemente teve uma versão adaptativa (A-GEO) desenvolvida. No presente trabalho, foi implementada e avaliada uma versão adaptativa para o algoritmo GEOvar, uma variante do GEO. Para tanto, foram testadas duas diferentes implementações para um conjunto de 5 funções. Uma dessas implementações mostrou resultados superiores em relação ao A-GEO.

GEO是一种进化算法，最近开发了一个自适应版本(A-GEO)。在这项工作中，我们实现并评估了GEO算法的一个自适应版本，GEO的一个变体。为此，我们测试了一组5个函数的两种不同实现。其中一个实现显示了优于A-GEO的结果。

引用次数: 0

Scaling up Cast Face Detection in Videos at Globo 在Globo视频中扩大演员面部检测

Anais do XLVIII Seminário Integrado de Software e Hardware (SEMISH 2021)

Pub Date : 2021-07-18 DOI: 10.5753/SEMISH.2021.15816

F. Ferreira, Bruno P. Oliveira, Rodrigo Kassick, V. Furlan, Hélio Lopes

It has been recognized that a significant increase in the production and consumption of video content occurred in the last decade. Many entertainment companies, like Globo, face challenges regarding video metadata generation. The objective of this paper is to present a suitable architecture for the Globo Group to automatically identify actors that appear in each scene of a video stream, generating new metadata annotations that can be used by recommender systems and search engines among different other applications in this industry sector.

人们认识到，在过去十年中，视频内容的制作和消费显著增加。许多娱乐公司，如Globo，都面临着视频元数据生成方面的挑战。本文的目标是为Globo Group提供一个合适的架构，以自动识别视频流中每个场景中出现的角色，生成新的元数据注释，这些注释可以被推荐系统和搜索引擎在该行业领域的不同其他应用程序中使用。

引用次数: 0

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

Anais do XLVIII Seminário Integrado de Software e Hardware (SEMISH 2021)

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀