首页 > 最新文献

Anais do XLVIII Seminário Integrado de Software e Hardware (SEMISH 2021)最新文献

英文 中文
Generating E-commerce Product Titles in Portuguese 用葡萄牙语生成电子商务产品标题
Pub Date : 2021-07-18 DOI: 10.5753/SEMISH.2021.15835
Livy Real, Karina M. Johansson, Júlio C. S. Mendes, Bianca M. Lopes, Marcio T. I. Oshiro
This paper explores how Natural Language Processing techniques can be integrated to solve real-world problems in the e-commerce scenario. We address the issue of having high quality information products offered to customers in a marketplace platform, composed by thousands of sellers producing original content in multiple languages, following different SEO and cultural assumptions. We propose an NLP pipeline to generate high quality titles products in Portuguese.
本文探讨了如何将自然语言处理技术集成到电子商务场景中来解决实际问题。我们解决了在市场平台上为客户提供高质量信息产品的问题,该平台由成千上万的卖家组成,以多种语言制作原创内容,遵循不同的SEO和文化假设。我们提出了一个自然语言处理管道,以产生高质量的标题产品在葡萄牙。
{"title":"Generating E-commerce Product Titles in Portuguese","authors":"Livy Real, Karina M. Johansson, Júlio C. S. Mendes, Bianca M. Lopes, Marcio T. I. Oshiro","doi":"10.5753/SEMISH.2021.15835","DOIUrl":"https://doi.org/10.5753/SEMISH.2021.15835","url":null,"abstract":"This paper explores how Natural Language Processing techniques can be integrated to solve real-world problems in the e-commerce scenario. We address the issue of having high quality information products offered to customers in a marketplace platform, composed by thousands of sellers producing original content in multiple languages, following different SEO and cultural assumptions. We propose an NLP pipeline to generate high quality titles products in Portuguese.","PeriodicalId":206312,"journal":{"name":"Anais do XLVIII Seminário Integrado de Software e Hardware (SEMISH 2021)","volume":"18 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123994011","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Uso de Aprendizado de Máquina Automatizado para Seleção de Provedores de Nuvem 使用自动机器学习选择云提供商
Pub Date : 2021-07-18 DOI: 10.5753/SEMISH.2021.15802
Kauã B. Hopfer, Adriano Fiorese
Neste trabalho, uma forma de ranqueamento e seleção de provedores de nuvem é apresentada por meio do uso da função de Aprendizado de Máquina Automatizado (AutoML) da plataforma H2O. Ele exibe um sistema de ranqueamento que produz uma pontuação para cada provedor de nuvem avaliado, a partir da qualificação dos requisitos exigidos pelo usuário. Experimentos realizados com o auxílio da plataforma H2O, levando em consideração o treinamento e análise de modelos de regressão, apresentam resultados precisos e mais rápidos quando comparados a alternativa de resolução determinística exata.
本文提出了一种利用H2O平台的自动机器学习函数(AutoML)对云提供商进行排名和选择的方法。它显示了一个排名系统,根据用户的需求评级,为每个被评估的云提供商生成一个分数。在H2O平台的帮助下进行的实验,考虑了回归模型的训练和分析,与精确确定性分辨率的替代方案相比,给出了准确和更快的结果。
{"title":"Uso de Aprendizado de Máquina Automatizado para Seleção de Provedores de Nuvem","authors":"Kauã B. Hopfer, Adriano Fiorese","doi":"10.5753/SEMISH.2021.15802","DOIUrl":"https://doi.org/10.5753/SEMISH.2021.15802","url":null,"abstract":"Neste trabalho, uma forma de ranqueamento e seleção de provedores de nuvem é apresentada por meio do uso da função de Aprendizado de Máquina Automatizado (AutoML) da plataforma H2O. Ele exibe um sistema de ranqueamento que produz uma pontuação para cada provedor de nuvem avaliado, a partir da qualificação dos requisitos exigidos pelo usuário. Experimentos realizados com o auxílio da plataforma H2O, levando em consideração o treinamento e análise de modelos de regressão, apresentam resultados precisos e mais rápidos quando comparados a alternativa de resolução determinística exata.","PeriodicalId":206312,"journal":{"name":"Anais do XLVIII Seminário Integrado de Software e Hardware (SEMISH 2021)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124540217","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Digital Identity Challenge: The Security and Convenience Dilemma 数字身份的挑战:安全性和便利性的困境
Pub Date : 2021-07-18 DOI: 10.5753/SEMISH.2021.15829
André Ferraz, C. Ferraz
This paper argues that the essential pieces of an enduring digital identity should be privacy, security, and convenience. Authentication should be frictionless. In this sense, the core of the digital identity of the future will be created around location sensing techniques. Incognia proposes a solution to secure and frictionless authentication for mobile apps that is composed of five steps. Its proprietary technology called environment fingerprinting can identify location spoofing and precisely determine the devices actual location. Incognia has found that most mobile logins, sensitive transactions, and purchases occur at trusted locations. To date, 90% of mobile logins and 89% of mobile banking sessions happen at a trusted location. Experimental results show false-negative rates below 0.004% and a decrease of over 85% of account takeover attacks.
本文认为,持久的数字身份的基本要素应该是隐私、安全性和便利性。认证应该是无摩擦的。从这个意义上说,未来数字身份的核心将围绕位置传感技术创造。Incognia提出了一种安全无摩擦的移动应用认证解决方案,该解决方案由五个步骤组成。其专有技术环境指纹可以识别位置欺骗,并精确确定设备的实际位置。Incognia发现,大多数手机登录、敏感交易和购买都发生在可信的地点。到目前为止,90%的手机登录和89%的手机银行会话发生在可信任的位置。实验结果表明,假阴性率低于0.004%,账户接管攻击减少85%以上。
{"title":"Digital Identity Challenge: The Security and Convenience Dilemma","authors":"André Ferraz, C. Ferraz","doi":"10.5753/SEMISH.2021.15829","DOIUrl":"https://doi.org/10.5753/SEMISH.2021.15829","url":null,"abstract":"This paper argues that the essential pieces of an enduring digital identity should be privacy, security, and convenience. Authentication should be frictionless. In this sense, the core of the digital identity of the future will be created around location sensing techniques. Incognia proposes a solution to secure and frictionless authentication for mobile apps that is composed of five steps. Its proprietary technology called environment fingerprinting can identify location spoofing and precisely determine the devices actual location. Incognia has found that most mobile logins, sensitive transactions, and purchases occur at trusted locations. To date, 90% of mobile logins and 89% of mobile banking sessions happen at a trusted location. Experimental results show false-negative rates below 0.004% and a decrease of over 85% of account takeover attacks.","PeriodicalId":206312,"journal":{"name":"Anais do XLVIII Seminário Integrado de Software e Hardware (SEMISH 2021)","volume":"67 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121452604","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Segurança em Dispositivos Móveis: Um Estudo Sobre a Adoção de Boas Práticas para Proteção em Celulares 移动设备安全:采用移动设备保护最佳实践的研究
Pub Date : 2021-07-18 DOI: 10.5753/SEMISH.2021.15807
Juliana Pereira Cabral, Herleson Paiva Pontes
Nos últimos anos, é possível observar o considerável aumento do número de ataques à segurança da informação em dispositivos móveis no Brasil, onde estima-se que ocorreram 850 mil tentativas objetivando o acesso indevido aos dados pessoais dos usuários. Este trabalho apresenta um estudo sobre a adoção dos recursos, ferramentas e boas práticas de segurança voltadas para celulares. Após a condução de um estudo de caso envolvendo 222 participantes que avaliou aspectos tecnológicos e psicológicos dos usuários acerca da segurança em seus aparelhos, os resultados sugerem que parcela considerável da população possui escasso conhecimento em relação ao emprego dos recursos e boas práticas de proteção eficientes nos dispositivos móveis, tornando-os alvos de crimes cibernéticos.
近年来,在巴西,针对移动设备信息安全的攻击数量显著增加,估计有85万次针对用户个人数据的不当访问尝试。这项工作提出了一项关于手机资源、工具和良好安全实践的采用的研究。后驾驶的一个案例研究涉及222名参与者评估技术和心理关于安全设备的用户,结果表明,相当一部分的就业人口的宝贵知识和最佳实践的资源有效保护移动设备,成为网络犯罪的目标。
{"title":"Segurança em Dispositivos Móveis: Um Estudo Sobre a Adoção de Boas Práticas para Proteção em Celulares","authors":"Juliana Pereira Cabral, Herleson Paiva Pontes","doi":"10.5753/SEMISH.2021.15807","DOIUrl":"https://doi.org/10.5753/SEMISH.2021.15807","url":null,"abstract":"Nos últimos anos, é possível observar o considerável aumento do número de ataques à segurança da informação em dispositivos móveis no Brasil, onde estima-se que ocorreram 850 mil tentativas objetivando o acesso indevido aos dados pessoais dos usuários. Este trabalho apresenta um estudo sobre a adoção dos recursos, ferramentas e boas práticas de segurança voltadas para celulares. Após a condução de um estudo de caso envolvendo 222 participantes que avaliou aspectos tecnológicos e psicológicos dos usuários acerca da segurança em seus aparelhos, os resultados sugerem que parcela considerável da população possui escasso conhecimento em relação ao emprego dos recursos e boas práticas de proteção eficientes nos dispositivos móveis, tornando-os alvos de crimes cibernéticos.","PeriodicalId":206312,"journal":{"name":"Anais do XLVIII Seminário Integrado de Software e Hardware (SEMISH 2021)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131255637","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Cross-Media Sentiment Analysis on German Blogs 德语博客的跨媒体情感分析
Pub Date : 2021-07-18 DOI: 10.5753/SEMISH.2021.15813
Nina N. Zahn, G. P. D. Molin, S. Musse
Social interactions have changed in recent years. People post their thoughts, opinions and feelings on social media platforms more often. Due to the increase in the amount of data on the internet, it is impracticable to carry out the sentiment analysis manually, requiring automation of the process. In this work, we present the corpus Cross-Media German Blog (CGB) which consists of German blogs with feelings in the domain of images, texts and posts (Ground Truth), classified according to human perceptions. We apply existing Machine Learning technologies and lexicons to the corpus to detect the feelings (negative, neutral or positive) of the images and texts and compare the results with the GT. We examined contradictory posts, when the image and text classified by humans in the same post had diverging feelings. The comparison of this article with the analysis of sentiment among the media of Brazilian blogs finds its justification for performance results in cultural differences, since, throughout this work, Brazil is classified as indulgent and Germany as a restrained country.
近年来,社会互动发生了变化。人们更频繁地在社交媒体平台上发布自己的想法、观点和感受。由于互联网上数据量的增加,人工进行情感分析是不切实际的,需要自动化的过程。在这项工作中,我们展示了语料库跨媒体德语博客(CGB),它由德语博客组成,这些博客在图像、文本和帖子(Ground Truth)领域有感情,并根据人类的感知进行分类。我们将现有的机器学习技术和词汇应用到语料库中,以检测图像和文本的情感(消极、中性或积极),并将结果与GT进行比较。当人类在同一篇文章中分类的图像和文本具有不同的情感时,我们检查了矛盾的帖子。将这篇文章与巴西博客媒体的情绪分析相比较,可以发现其表现的理由是文化差异,因为在整个研究中,巴西被归类为放纵的国家,而德国则被归类为克制的国家。
{"title":"Cross-Media Sentiment Analysis on German Blogs","authors":"Nina N. Zahn, G. P. D. Molin, S. Musse","doi":"10.5753/SEMISH.2021.15813","DOIUrl":"https://doi.org/10.5753/SEMISH.2021.15813","url":null,"abstract":"Social interactions have changed in recent years. People post their thoughts, opinions and feelings on social media platforms more often. Due to the increase in the amount of data on the internet, it is impracticable to carry out the sentiment analysis manually, requiring automation of the process. In this work, we present the corpus Cross-Media German Blog (CGB) which consists of German blogs with feelings in the domain of images, texts and posts (Ground Truth), classified according to human perceptions. We apply existing Machine Learning technologies and lexicons to the corpus to detect the feelings (negative, neutral or positive) of the images and texts and compare the results with the GT. We examined contradictory posts, when the image and text classified by humans in the same post had diverging feelings. The comparison of this article with the analysis of sentiment among the media of Brazilian blogs finds its justification for performance results in cultural differences, since, throughout this work, Brazil is classified as indulgent and Germany as a restrained country.","PeriodicalId":206312,"journal":{"name":"Anais do XLVIII Seminário Integrado de Software e Hardware (SEMISH 2021)","volume":"14 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132033856","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Predicting Popularity of Facebook Videos Through Visual Features Using Support Vector Machine Classifier 使用支持向量机分类器通过视觉特征预测Facebook视频的受欢迎程度
Pub Date : 2021-07-18 DOI: 10.5753/SEMISH.2021.15815
B. Dalmoro, S. Musse
With the popularization of social networks, the sharing and consumption of content in video format becomes easier. Understanding what makes a video popular and being able to predict its popularity in number of views is useful for both content creators and advertising. In this work, we explore visual features extracted from 1,820 Facebook videos in order to predict whether they will reach more than a certain number of views on the seven days after publication. For this purpose, we used Support Vector Machine with Gaussian Radial Basis Function classification model. Using only visual features as predictors, the model with Video Characteristics and Rigidity features combined reached Kappa of 0.7324, sensitivity of 0.8930, and positive predictive value of 0.8930.
随着社交网络的普及,视频格式内容的分享和消费变得更加容易。了解是什么让一个视频受欢迎,并能够预测其受欢迎的观看次数,这对内容创作者和广告都很有用。在这项工作中,我们探索了从1820个Facebook视频中提取的视觉特征,以预测它们在发布后七天内是否会达到一定数量的观看量。为此,我们使用支持向量机与高斯径向基函数的分类模型。仅使用视觉特征作为预测因子,结合Video Characteristics和刚度特征的模型Kappa值为0.7324,灵敏度为0.8930,阳性预测值为0.8930。
{"title":"Predicting Popularity of Facebook Videos Through Visual Features Using Support Vector Machine Classifier","authors":"B. Dalmoro, S. Musse","doi":"10.5753/SEMISH.2021.15815","DOIUrl":"https://doi.org/10.5753/SEMISH.2021.15815","url":null,"abstract":"With the popularization of social networks, the sharing and consumption of content in video format becomes easier. Understanding what makes a video popular and being able to predict its popularity in number of views is useful for both content creators and advertising. In this work, we explore visual features extracted from 1,820 Facebook videos in order to predict whether they will reach more than a certain number of views on the seven days after publication. For this purpose, we used Support Vector Machine with Gaussian Radial Basis Function classification model. Using only visual features as predictors, the model with Video Characteristics and Rigidity features combined reached Kappa of 0.7324, sensitivity of 0.8930, and positive predictive value of 0.8930.","PeriodicalId":206312,"journal":{"name":"Anais do XLVIII Seminário Integrado de Software e Hardware (SEMISH 2021)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132314312","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Técnicas de Processamento de Linguagem Natural em Denúncias Criminais: Automatização e Classificação de Texto em Português Coloquial 刑事投诉中的自然语言处理技术:葡萄牙语口语文本的自动化与分类
Pub Date : 2021-07-18 DOI: 10.5753/SEMISH.2021.15820
Camila Gusmão, Karla Figueiredo, Walkir Brito
Este artigo apresenta a investigação de Técnicas de Processamento de Linguagem Natural (PLN) em Denúncias Criminais, provenientes do aplicativo do serviço do Disque Denúncia RJ para smartphone. Nele é apresentado o processo de automatização, avaliando e classificando as denúncias, objetivando reduzir o tempo de análise do conteúdo das mensagens, que possui, como principal desafio, textos escritos em linguagem muito informal, contendo muitos erros morfossintáticos. Para alcançar tais objetivos foi necessária uma investigação de técnicas de pré-processamento visando melhorar a acurácia da classificação, que foi realizada por Support Vector Machine (SVM). Os resultados encontrados são bastante promissores para o tipo de textos de denúncias, atingindo uma precisão de 76,11%.
本文介绍了自然语言处理技术(nlp)在刑事投诉中的研究,从电话服务投诉RJ到智能手机的应用。它提出了自动化的过程,评估和分类投诉,旨在减少分析信息内容的时间,这是一个主要的挑战,文本写在非常非正式的语言,包含许多形态句法错误。来实现这些目标是需要预处理技术的研究来提高分类的精度,通过支持向量机(SVM)进行的。结果对投诉文本类型非常有希望,准确率为76.11%。
{"title":"Técnicas de Processamento de Linguagem Natural em Denúncias Criminais: Automatização e Classificação de Texto em Português Coloquial","authors":"Camila Gusmão, Karla Figueiredo, Walkir Brito","doi":"10.5753/SEMISH.2021.15820","DOIUrl":"https://doi.org/10.5753/SEMISH.2021.15820","url":null,"abstract":"Este artigo apresenta a investigação de Técnicas de Processamento de Linguagem Natural (PLN) em Denúncias Criminais, provenientes do aplicativo do serviço do Disque Denúncia RJ para smartphone. Nele é apresentado o processo de automatização, avaliando e classificando as denúncias, objetivando reduzir o tempo de análise do conteúdo das mensagens, que possui, como principal desafio, textos escritos em linguagem muito informal, contendo muitos erros morfossintáticos. Para alcançar tais objetivos foi necessária uma investigação de técnicas de pré-processamento visando melhorar a acurácia da classificação, que foi realizada por Support Vector Machine (SVM). Os resultados encontrados são bastante promissores para o tipo de textos de denúncias, atingindo uma precisão de 76,11%.","PeriodicalId":206312,"journal":{"name":"Anais do XLVIII Seminário Integrado de Software e Hardware (SEMISH 2021)","volume":"36 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124985463","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Scaling up Cast Face Detection in Videos at Globo 在Globo视频中扩大演员面部检测
Pub Date : 2021-07-18 DOI: 10.5753/SEMISH.2021.15816
F. Ferreira, Bruno P. Oliveira, Rodrigo Kassick, V. Furlan, Hélio Lopes
It has been recognized that a significant increase in the production and consumption of video content occurred in the last decade. Many entertainment companies, like Globo, face challenges regarding video metadata generation. The objective of this paper is to present a suitable architecture for the Globo Group to automatically identify actors that appear in each scene of a video stream, generating new metadata annotations that can be used by recommender systems and search engines among different other applications in this industry sector.
人们认识到,在过去十年中,视频内容的制作和消费显著增加。许多娱乐公司,如Globo,都面临着视频元数据生成方面的挑战。本文的目标是为Globo Group提供一个合适的架构,以自动识别视频流中每个场景中出现的角色,生成新的元数据注释,这些注释可以被推荐系统和搜索引擎在该行业领域的不同其他应用程序中使用。
{"title":"Scaling up Cast Face Detection in Videos at Globo","authors":"F. Ferreira, Bruno P. Oliveira, Rodrigo Kassick, V. Furlan, Hélio Lopes","doi":"10.5753/SEMISH.2021.15816","DOIUrl":"https://doi.org/10.5753/SEMISH.2021.15816","url":null,"abstract":"It has been recognized that a significant increase in the production and consumption of video content occurred in the last decade. Many entertainment companies, like Globo, face challenges regarding video metadata generation. The objective of this paper is to present a suitable architecture for the Globo Group to automatically identify actors that appear in each scene of a video stream, generating new metadata annotations that can be used by recommender systems and search engines among different other applications in this industry sector.","PeriodicalId":206312,"journal":{"name":"Anais do XLVIII Seminário Integrado de Software e Hardware (SEMISH 2021)","volume":"83 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122626933","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Machine Learning based Pricing Methodology for the Logistic Domain: a Preliminary Approach 基于机器学习的物流领域定价方法初探
Pub Date : 2021-07-18 DOI: 10.5753/SEMISH.2021.15819
Antonio L. Amadeu, Fernando Vinturin, Guilherme A. Zimeo Morais, Maickel Hubner, E. M. Pereira, Marcelo Santos
In this work, we introduce a new methodology to discover logistic regions for pricing. We use value-based characteristics from different sources, such as demographic, socioeconomic, risk, transportation, among others, to find homogeneous and valuable pricing regions. The problem was formulated as a traditional cluster solution, where well-know metrics, such as BIC and silhouette score, were used for technical validation, and business premises and constraints, operational and sales, where used to enrich feature engineering and refine cluster formation. The results presented here are from a preliminary work that was validated through several sessions with stakeholders of interest, but it is still missing the market validation. Indeed, this work will be deployed soon and a more detailed validation process, including client adherence, will be performed and monitored until the end of this year.
在这项工作中,我们引入了一种新的方法来发现物流区域的定价。我们使用来自不同来源的基于价值的特征,如人口统计、社会经济、风险、运输等,以找到同质和有价值的定价区域。这个问题被表述为一个传统的集群解决方案,其中众所周知的指标,如BIC和轮廓分数,被用于技术验证,而商业场所和约束,运营和销售,被用于丰富特征工程和改进集群形成。这里展示的结果来自与利益相关者的几次会议验证的初步工作,但它仍然缺少市场验证。事实上,这项工作将很快部署,并将执行更详细的验证过程,包括客户遵守情况,并在今年年底之前进行监测。
{"title":"Machine Learning based Pricing Methodology for the Logistic Domain: a Preliminary Approach","authors":"Antonio L. Amadeu, Fernando Vinturin, Guilherme A. Zimeo Morais, Maickel Hubner, E. M. Pereira, Marcelo Santos","doi":"10.5753/SEMISH.2021.15819","DOIUrl":"https://doi.org/10.5753/SEMISH.2021.15819","url":null,"abstract":"In this work, we introduce a new methodology to discover logistic regions for pricing. We use value-based characteristics from different sources, such as demographic, socioeconomic, risk, transportation, among others, to find homogeneous and valuable pricing regions. The problem was formulated as a traditional cluster solution, where well-know metrics, such as BIC and silhouette score, were used for technical validation, and business premises and constraints, operational and sales, where used to enrich feature engineering and refine cluster formation. The results presented here are from a preliminary work that was validated through several sessions with stakeholders of interest, but it is still missing the market validation. Indeed, this work will be deployed soon and a more detailed validation process, including client adherence, will be performed and monitored until the end of this year.","PeriodicalId":206312,"journal":{"name":"Anais do XLVIII Seminário Integrado de Software e Hardware (SEMISH 2021)","volume":"59 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126610212","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
CoEPinKB: A Framework to Understand the Connectivity of Entity Pairs in Knowledge Bases CoEPinKB:一个理解知识库中实体对连通性的框架
Pub Date : 2021-07-18 DOI: 10.5753/SEMISH.2021.15811
J. G. Jiménez, Luiz André Portes Paes Leme, M. Casanova
A knowledge base, expressed using the Resource Description Framework (RDF), can be viewed as a graph whose nodes represent entities and whose edges denote relationships. The entity relatedness problem refers to the problem of discovering and understanding how two entities are related, directly or indirectly, that is, how they are connected by paths in a knowledge base. Strategies designed to solve the entity relatedness problem typically adopt an entity similarity measure to reduce the path search space and a path ranking measure to order and filter the list of paths returned. This paper presents a framework, called CoEPinKB, that supports the empirical evaluation of such strategies. The proposed framework allows combining entity similarity and path ranking measures to generate different path search strategies. The main goals of this paper are to describe the framework and present a performance evaluation of nine different path search strategies.
使用资源描述框架(RDF)表示的知识库可以看作是一个图,其节点表示实体,其边表示关系。实体关联问题是指发现和理解两个实体是如何直接或间接关联的问题,即它们是如何通过知识库中的路径连接起来的问题。解决实体关联问题的策略通常采用实体相似度度量来减少路径搜索空间,采用路径排序度量来对返回的路径列表进行排序和过滤。本文提出了一个名为CoEPinKB的框架,该框架支持对此类策略进行实证评估。该框架允许结合实体相似度和路径排序度量来生成不同的路径搜索策略。本文的主要目标是描述该框架,并对九种不同的路径搜索策略进行性能评估。
{"title":"CoEPinKB: A Framework to Understand the Connectivity of Entity Pairs in Knowledge Bases","authors":"J. G. Jiménez, Luiz André Portes Paes Leme, M. Casanova","doi":"10.5753/SEMISH.2021.15811","DOIUrl":"https://doi.org/10.5753/SEMISH.2021.15811","url":null,"abstract":"A knowledge base, expressed using the Resource Description Framework (RDF), can be viewed as a graph whose nodes represent entities and whose edges denote relationships. The entity relatedness problem refers to the problem of discovering and understanding how two entities are related, directly or indirectly, that is, how they are connected by paths in a knowledge base. Strategies designed to solve the entity relatedness problem typically adopt an entity similarity measure to reduce the path search space and a path ranking measure to order and filter the list of paths returned. This paper presents a framework, called CoEPinKB, that supports the empirical evaluation of such strategies. The proposed framework allows combining entity similarity and path ranking measures to generate different path search strategies. The main goals of this paper are to describe the framework and present a performance evaluation of nine different path search strategies.","PeriodicalId":206312,"journal":{"name":"Anais do XLVIII Seminário Integrado de Software e Hardware (SEMISH 2021)","volume":"39 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134224932","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
期刊
Anais do XLVIII Seminário Integrado de Software e Hardware (SEMISH 2021)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1