首页 > 最新文献

2018 7th Brazilian Conference on Intelligent Systems (BRACIS)最新文献

英文 中文
A Face Recognition Framework for Illumination Compensation Based on Bio-Inspired Algorithms 基于生物启发算法的光照补偿人脸识别框架
Pub Date : 2018-10-01 DOI: 10.1109/BRACIS.2018.00056
G. Plichoski, Chidambaram Chidambaram, R. S. Parpinelli
It is possible to find in the literature a wide range of techniques employed for face recognition. Hence, to select a technique or set of techniques and tune their respective parameters become an optimization task. In this paper, we present a face recognition framework with the aid of bio-inspired optimization algorithms. This approach implements several preprocessing and feature extraction techniques, and the optimization algorithm is responsible for choosing which strategies to use, as well as tunning their parameters. In this work, we analyzed the performance of two optimization algorithms, namely Particle Swarm Optimization (PSO) and Differential Evolution (DE) aiming to address the illumination compensation problem. The well known Yale Extended B database is used in the classification task. The results obtained show that the proposed approach is competitive with literature achieving the average recognition rate of 99.95% with DE.
有可能在文献中找到广泛的用于人脸识别的技术。因此,选择一种或一组技术并调整它们各自的参数成为一项优化任务。本文提出了一种基于仿生优化算法的人脸识别框架。该方法实现了多种预处理和特征提取技术,优化算法负责选择使用哪种策略以及调整其参数。在这项工作中,我们分析了两种优化算法的性能,即粒子群优化(PSO)和差分进化(DE),旨在解决照明补偿问题。在分类任务中使用了著名的Yale Extended B数据库。结果表明,该方法与文献的平均识别率达到99.95%,具有一定的竞争力。
{"title":"A Face Recognition Framework for Illumination Compensation Based on Bio-Inspired Algorithms","authors":"G. Plichoski, Chidambaram Chidambaram, R. S. Parpinelli","doi":"10.1109/BRACIS.2018.00056","DOIUrl":"https://doi.org/10.1109/BRACIS.2018.00056","url":null,"abstract":"It is possible to find in the literature a wide range of techniques employed for face recognition. Hence, to select a technique or set of techniques and tune their respective parameters become an optimization task. In this paper, we present a face recognition framework with the aid of bio-inspired optimization algorithms. This approach implements several preprocessing and feature extraction techniques, and the optimization algorithm is responsible for choosing which strategies to use, as well as tunning their parameters. In this work, we analyzed the performance of two optimization algorithms, namely Particle Swarm Optimization (PSO) and Differential Evolution (DE) aiming to address the illumination compensation problem. The well known Yale Extended B database is used in the classification task. The results obtained show that the proposed approach is competitive with literature achieving the average recognition rate of 99.95% with DE.","PeriodicalId":405190,"journal":{"name":"2018 7th Brazilian Conference on Intelligent Systems (BRACIS)","volume":"23 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125207647","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Recommending Scientific Collaboration from ResearchGate 从ResearchGate推荐科学合作
Pub Date : 2018-10-01 DOI: 10.1109/BRACIS.2018.00065
M. W. Rodrigues, Wladmir Cardoso Brandão, Luis E. Zárate
Scientific collaboration improves researchers productivity by providing a way to share new ideas, learn new techniques, and find new research applications, increasing the chance to access funding. Beyond ethics and reciprocity, there are other important aspects on achieving scientific collaborations, such as research interests and expected productivity gain, that are paramount to a successful partnership. However, achieve effective collaborations is a hard work and can drain researchers time. In this work, we propose a recommendation approach that uses different strategies to suggest scientific collaboration for researchers based on their research interest. In particular, our approach exploits ResearchGate, a well known research social network from where research interests and researchers production are used to model similarity between them. Experimental results show that the content-based strategy outperforms neighborhood-based collaborative filtering strategies to recommend scientific collaboration with gains of up 16.60% in precision, 37.19% in recall, and 21.16% in F1 for the top-20 recommendation lists.
科学合作提供了一种分享新思想、学习新技术和发现新的研究应用的方式,从而提高了研究人员的生产力,增加了获得资助的机会。除了伦理和互惠之外,实现科学合作还有其他重要方面,例如研究兴趣和预期的生产力提高,这对于成功的伙伴关系至关重要。然而,实现有效的合作是一项艰巨的工作,可能会消耗研究人员的时间。在这项工作中,我们提出了一种推荐方法,根据研究人员的研究兴趣,使用不同的策略来建议他们进行科学合作。特别是,我们的方法利用了ResearchGate,这是一个著名的研究社交网络,研究兴趣和研究人员的成果被用来模拟它们之间的相似性。实验结果表明,基于内容的协同过滤策略在推荐科学协作方面优于基于邻居的协同过滤策略,在推荐前20名列表上,准确率提高了16.60%,召回率提高了37.19%,F1提高了21.16%。
{"title":"Recommending Scientific Collaboration from ResearchGate","authors":"M. W. Rodrigues, Wladmir Cardoso Brandão, Luis E. Zárate","doi":"10.1109/BRACIS.2018.00065","DOIUrl":"https://doi.org/10.1109/BRACIS.2018.00065","url":null,"abstract":"Scientific collaboration improves researchers productivity by providing a way to share new ideas, learn new techniques, and find new research applications, increasing the chance to access funding. Beyond ethics and reciprocity, there are other important aspects on achieving scientific collaborations, such as research interests and expected productivity gain, that are paramount to a successful partnership. However, achieve effective collaborations is a hard work and can drain researchers time. In this work, we propose a recommendation approach that uses different strategies to suggest scientific collaboration for researchers based on their research interest. In particular, our approach exploits ResearchGate, a well known research social network from where research interests and researchers production are used to model similarity between them. Experimental results show that the content-based strategy outperforms neighborhood-based collaborative filtering strategies to recommend scientific collaboration with gains of up 16.60% in precision, 37.19% in recall, and 21.16% in F1 for the top-20 recommendation lists.","PeriodicalId":405190,"journal":{"name":"2018 7th Brazilian Conference on Intelligent Systems (BRACIS)","volume":"36 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127217541","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 10
A Concept-Based ILP Approach for Multi-document Summarization Exploring Centrality and Position 一种基于概念的多文档摘要探索中心性和位置的ILP方法
Pub Date : 2018-10-01 DOI: 10.1109/BRACIS.2018.00015
Hilário Oliveira, R. Lins, Rinaldo Lima, F. Freitas, S. Simske
Multi-document summarization systems aim to generate a brief text containing the most relevant information from a collection of related documents. The fast and continually growing volume of text data has increasingly drawn the attention from users and researchers to such systems. Aspects such as sentence centrality and position have been extensively studied in multi-document summarization as indicators of content relevancy. Very few works have investigated their efficient integration using global-based optimization approaches, however. This paper proposes a concept-based integer linear programming approach for multi-document summarization of news articles that integrates centrality and position features to filter out the less relevant sentences and measure the importance of concepts (textual fragments) in composing the output summary. The presented approach relies on a centrality-based strategy to perform the sentence clustering process and also to support the sentence ordering step. The benchmarks conducted with four datasets of the Document Understanding Conferences from 2001 to 2004 demonstrate that the proposed approach presents competitive performance compared with other state-of-the-art methods.
多文档摘要系统旨在从相关文档的集合中生成包含最相关信息的简短文本。快速增长的文本数据量越来越引起用户和研究人员对此类系统的关注。句子中心性和位置等方面作为内容相关性的指标在多文档摘要中得到了广泛的研究。然而,很少有作品使用基于全局的优化方法来研究它们的有效集成。本文提出了一种基于概念的整数线性规划方法,用于新闻文章的多文档摘要,该方法结合中心性和位置特征来过滤不太相关的句子,并测量概念(文本片段)在组成输出摘要中的重要性。所提出的方法依赖于基于中心的策略来执行句子聚类过程,并支持句子排序步骤。以2001年至2004年文件了解会议的四个数据集进行的基准测试表明,与其他最先进的方法相比,建议的方法具有竞争力。
{"title":"A Concept-Based ILP Approach for Multi-document Summarization Exploring Centrality and Position","authors":"Hilário Oliveira, R. Lins, Rinaldo Lima, F. Freitas, S. Simske","doi":"10.1109/BRACIS.2018.00015","DOIUrl":"https://doi.org/10.1109/BRACIS.2018.00015","url":null,"abstract":"Multi-document summarization systems aim to generate a brief text containing the most relevant information from a collection of related documents. The fast and continually growing volume of text data has increasingly drawn the attention from users and researchers to such systems. Aspects such as sentence centrality and position have been extensively studied in multi-document summarization as indicators of content relevancy. Very few works have investigated their efficient integration using global-based optimization approaches, however. This paper proposes a concept-based integer linear programming approach for multi-document summarization of news articles that integrates centrality and position features to filter out the less relevant sentences and measure the importance of concepts (textual fragments) in composing the output summary. The presented approach relies on a centrality-based strategy to perform the sentence clustering process and also to support the sentence ordering step. The benchmarks conducted with four datasets of the Document Understanding Conferences from 2001 to 2004 demonstrate that the proposed approach presents competitive performance compared with other state-of-the-art methods.","PeriodicalId":405190,"journal":{"name":"2018 7th Brazilian Conference on Intelligent Systems (BRACIS)","volume":"46 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121752138","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
A Novel Evolutionary Algorithm for Automated Machine Learning Focusing on Classifier Ensembles 一种新的基于分类器集成的自动机器学习进化算法
Pub Date : 2018-10-01 DOI: 10.1109/BRACIS.2018.00086
J. C. Xavier, A. Freitas, Antonino Feitosa Neto, Teresa B Ludermir
Automated Machine Learning (Auto-ML) is an emerging area of ML which consists of automatically selecting the best ML algorithm and its best hyper-parameter settings for a given input dataset, by doing a search in a large space of candidate algorithms and settings. In this work we propose a new Evolutionary Algorithm (EA) for the Auto-ML task of automatically selecting the best ensemble of classifiers and their hyper-parameter settings for an input dataset. The proposed EA was compared against a version of the well-known Auto-WEKA method adapted to search in the same space of algorithms and hyper-parameter settings as the EA. In general, the EA obtained significantly smaller classification error rates than that Auto-WEKA version in experiments with 15 classification datasets.
自动机器学习(Auto-ML)是机器学习的一个新兴领域,它包括通过在大量候选算法和设置的空间中进行搜索,为给定的输入数据集自动选择最佳机器学习算法及其最佳超参数设置。在这项工作中,我们提出了一种新的进化算法(EA),用于自动选择输入数据集的最佳分类器集合及其超参数设置的Auto-ML任务。将本文提出的EA与Auto-WEKA方法进行了比较,该方法适用于与EA相同的算法和超参数设置空间的搜索。在15个分类数据集的实验中,EA的分类错误率明显低于Auto-WEKA版本。
{"title":"A Novel Evolutionary Algorithm for Automated Machine Learning Focusing on Classifier Ensembles","authors":"J. C. Xavier, A. Freitas, Antonino Feitosa Neto, Teresa B Ludermir","doi":"10.1109/BRACIS.2018.00086","DOIUrl":"https://doi.org/10.1109/BRACIS.2018.00086","url":null,"abstract":"Automated Machine Learning (Auto-ML) is an emerging area of ML which consists of automatically selecting the best ML algorithm and its best hyper-parameter settings for a given input dataset, by doing a search in a large space of candidate algorithms and settings. In this work we propose a new Evolutionary Algorithm (EA) for the Auto-ML task of automatically selecting the best ensemble of classifiers and their hyper-parameter settings for an input dataset. The proposed EA was compared against a version of the well-known Auto-WEKA method adapted to search in the same space of algorithms and hyper-parameter settings as the EA. In general, the EA obtained significantly smaller classification error rates than that Auto-WEKA version in experiments with 15 classification datasets.","PeriodicalId":405190,"journal":{"name":"2018 7th Brazilian Conference on Intelligent Systems (BRACIS)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130702708","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 12
Gene Essentiality Prediction Using Topological Features From Metabolic Networks 基于代谢网络拓扑特征的基因本质性预测
Pub Date : 2018-10-01 DOI: 10.1109/BRACIS.2018.00024
J. Nagai, H. Sousa, A. Aono, Ana Carolina Lorena, R. Kuroshu
Fundamental questions such as what are the genes that are really necessary for the survival of cells have motivated many studies to investigate the essentiality of genes in different species. Initial efforts have attempted to address this problem through exhaustive knockout experiments in simple bacteria. Recently, results obtained in these studies have also been applied to the emerging field of synthetic biology with possible implications in many other fields such as health and energy. Motivated by the evolution of DNA sequencing technology and high-throughput biological data generation, many recent efforts have also been made for building and understanding biological networks. In particular, metabolic networks represent the set of known biochemical reactions within a cell. Essential genes are expected to play a key role in these networks, as they must be involved in vital metabolic pathways. Even though some studies investigated the correlation between essential genes and biological network information, different types of networks and other biological information were usually combined and the effect of each of them in the obtained results was not stressed. This paper describes an attempt to predict essential genes using solely topological features from metabolic networks. The networks were built from a common repository, the KEGG database, ensuring data uniformity. Experimentally, considering different prediction scenarios and reference organisms, the use of topological features from metabolic networks achieved mean AUC of about 70% in the prediction of gene essentiality. This reveals that more factors affect essentiality and should indeed be considered in order to obtain more accurate predictions.
一些基本的问题,如什么基因对细胞的生存是真正必要的,激发了许多研究来调查不同物种基因的重要性。最初的努力试图通过在简单细菌中进行彻底的基因敲除实验来解决这个问题。最近,在这些研究中获得的结果也被应用于合成生物学的新兴领域,可能对许多其他领域产生影响,如卫生和能源。受DNA测序技术和高通量生物数据生成技术发展的推动,近年来人们在构建和理解生物网络方面也做出了许多努力。特别是,代谢网络代表了细胞内一系列已知的生化反应。必需基因预计在这些网络中发挥关键作用,因为它们必须参与重要的代谢途径。尽管一些研究调查了必需基因与生物网络信息的相关性,但通常将不同类型的网络和其他生物信息组合在一起,而不强调每种网络和其他生物信息在所得结果中的作用。本文描述了一种仅使用代谢网络的拓扑特征来预测必需基因的尝试。这些网络建立在一个共同的存储库——KEGG数据库之上,确保了数据的一致性。在实验中,考虑到不同的预测场景和参考生物,使用代谢网络的拓扑特征在预测基因必要性方面实现了约70%的平均AUC。这表明,更多的因素影响本质,确实应该考虑,以获得更准确的预测。
{"title":"Gene Essentiality Prediction Using Topological Features From Metabolic Networks","authors":"J. Nagai, H. Sousa, A. Aono, Ana Carolina Lorena, R. Kuroshu","doi":"10.1109/BRACIS.2018.00024","DOIUrl":"https://doi.org/10.1109/BRACIS.2018.00024","url":null,"abstract":"Fundamental questions such as what are the genes that are really necessary for the survival of cells have motivated many studies to investigate the essentiality of genes in different species. Initial efforts have attempted to address this problem through exhaustive knockout experiments in simple bacteria. Recently, results obtained in these studies have also been applied to the emerging field of synthetic biology with possible implications in many other fields such as health and energy. Motivated by the evolution of DNA sequencing technology and high-throughput biological data generation, many recent efforts have also been made for building and understanding biological networks. In particular, metabolic networks represent the set of known biochemical reactions within a cell. Essential genes are expected to play a key role in these networks, as they must be involved in vital metabolic pathways. Even though some studies investigated the correlation between essential genes and biological network information, different types of networks and other biological information were usually combined and the effect of each of them in the obtained results was not stressed. This paper describes an attempt to predict essential genes using solely topological features from metabolic networks. The networks were built from a common repository, the KEGG database, ensuring data uniformity. Experimentally, considering different prediction scenarios and reference organisms, the use of topological features from metabolic networks achieved mean AUC of about 70% in the prediction of gene essentiality. This reveals that more factors affect essentiality and should indeed be considered in order to obtain more accurate predictions.","PeriodicalId":405190,"journal":{"name":"2018 7th Brazilian Conference on Intelligent Systems (BRACIS)","volume":"2016 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114494249","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
A Rule-Based Greedy Ant (rGrAnt) Protocol for Networking Environments 网络环境中基于规则的贪心蚂蚁(rGrAnt)协议
Pub Date : 2018-10-01 DOI: 10.1109/BRACIS.2018.00090
Luis Guilherme Bergamini Mendes, A. Vendramin, Anelise Munaretto, M. Delgado
This paper presents a rule-based system for the Greedy Ant Protocol (GrAnt), named rGrAnt. GrAnt uses the Ant Colony Optimization (ACO) meta-heuristic aiming to route traffic in complex and dynamic Delay Tolerant Networks. rGrAnt has been developed to provide the protocol the ability to extract information online from nodes' social connectivity, which can range from disconnected and sparse to highly connected networking environments. With this information, the proposed protocol can guide through its fuzzy/crisp rules the ACO routing module by deciding when to consider data from heuristic functions and/or pheromone concentration, which data can be incorporated in both heuristic and pheromone parameters, and if the message forwarding phase must be less or more restrictive. In nodes with low connectivity, the rules of rGrAnt indicate that the protocol must be less restrictive when forwarding messages, in order to make better use of the few available contacts. In contrast, in nodes with high connectivity, it is necessary to restrict forwarding to avoid overloading the same sets of nodes and links. rGrAnt is compared with GrAnt in three different movement models. Results show that, in the three models, rGrAnt achieves a higher delivery ratio than GrAnt.
本文提出了一个基于规则的贪心蚂蚁协议(GrAnt)系统,命名为rGrAnt。GrAnt使用蚁群优化(ACO)元启发式算法来解决复杂动态容延迟网络中的路由问题。开发rGrAnt的目的是为协议提供从节点的社会连接中在线提取信息的能力,这些连接可以从断开和稀疏的网络环境到高度连接的网络环境。有了这些信息,该协议可以通过其模糊/清晰的规则来指导蚁群算法路由模块,决定何时考虑启发式函数和/或信息素浓度的数据,哪些数据可以同时包含在启发式和信息素参数中,以及消息转发阶段是否必须更少或更严格。在连通性较低的节点中,rGrAnt规则表明协议在转发消息时必须限制较少,以便更好地利用少数可用的联系人。相反,在高连通性的节点中,需要限制转发,以避免同一组节点和链路过载。在三种不同的运动模型中比较了rGrAnt和GrAnt。结果表明,在三个模型中,rGrAnt的交付率高于GrAnt。
{"title":"A Rule-Based Greedy Ant (rGrAnt) Protocol for Networking Environments","authors":"Luis Guilherme Bergamini Mendes, A. Vendramin, Anelise Munaretto, M. Delgado","doi":"10.1109/BRACIS.2018.00090","DOIUrl":"https://doi.org/10.1109/BRACIS.2018.00090","url":null,"abstract":"This paper presents a rule-based system for the Greedy Ant Protocol (GrAnt), named rGrAnt. GrAnt uses the Ant Colony Optimization (ACO) meta-heuristic aiming to route traffic in complex and dynamic Delay Tolerant Networks. rGrAnt has been developed to provide the protocol the ability to extract information online from nodes' social connectivity, which can range from disconnected and sparse to highly connected networking environments. With this information, the proposed protocol can guide through its fuzzy/crisp rules the ACO routing module by deciding when to consider data from heuristic functions and/or pheromone concentration, which data can be incorporated in both heuristic and pheromone parameters, and if the message forwarding phase must be less or more restrictive. In nodes with low connectivity, the rules of rGrAnt indicate that the protocol must be less restrictive when forwarding messages, in order to make better use of the few available contacts. In contrast, in nodes with high connectivity, it is necessary to restrict forwarding to avoid overloading the same sets of nodes and links. rGrAnt is compared with GrAnt in three different movement models. Results show that, in the three models, rGrAnt achieves a higher delivery ratio than GrAnt.","PeriodicalId":405190,"journal":{"name":"2018 7th Brazilian Conference on Intelligent Systems (BRACIS)","volume":"17 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133909630","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Density-Based Core Support Extraction for Non-stationary Environments with Extreme Verification Latency 具有极端验证延迟的非平稳环境下基于密度的核心支持提取
Pub Date : 2018-10-01 DOI: 10.1109/BRACIS.2018.00039
Raul Sena Ferreira, Bruno M. A. da Silva, W. Teixeira, Geraldo Zimbrão, L. Alvim
Machine learning solutions usually consider that the train and test data has the same probabilistic distribution, that is, the data is stationary. However, in streaming scenarios, data distribution generally change through the time, that is, the data is non-stationary. The main challenge in such online environment is the model adaptation for the constant drifts in data distribution. Besides, other important restriction may happen in online scenarios: the extreme latency to verify the labels. Worth to mention that the incremental drift assumption is that class distributions overlap at subsequent time steps. Hence, the core region of data distribution have significant overlap with incoming data. Therefore, selecting samples from these core regions helps to retain the most important instances that represent the new distribution. This selection is denominated core support extraction (CSE). Thus, we present a study about density-based algorithms applied in non-stationary environments. We compared KDE, GMM and two variations of DBSCAN against single semi-supervised approaches. We validated these approaches in seventeen synthetic datasets and a real one, showing the strengths and weaknesses of these CSE methods through many metrics. We show that a semi-supervised classifier is improved up to 68% on a real dataset when it is applied along with a density-based CSE algorithm. The results between KDE and GMM, as CSE methods, were close but the approach using KDE is more practical due to having less parameters.
机器学习解决方案通常认为训练数据和测试数据具有相同的概率分布,即数据是平稳的。但在流场景下,数据的分布一般会随着时间的变化而变化,即数据是非平稳的。这种在线环境下的主要挑战是如何适应数据分布的不断变化。此外,在在线场景中可能会出现其他重要的限制:验证标签的极端延迟。值得一提的是,增量漂移假设是类分布在随后的时间步骤中重叠。因此,数据分布的核心区域与输入数据有明显的重叠。因此,从这些核心区域中选择样本有助于保留代表新分布的最重要的实例。这个选择被命名为核心支持提取(CSE)。因此,我们提出了一项关于应用于非平稳环境的基于密度的算法的研究。我们将KDE、GMM和DBSCAN的两个变体与单一的半监督方法进行了比较。我们在17个合成数据集和一个真实数据集中验证了这些方法,通过许多度量显示了这些CSE方法的优点和缺点。我们表明,当与基于密度的CSE算法一起应用时,半监督分类器在真实数据集上的改进高达68%。作为CSE方法,KDE和GMM之间的结果非常接近,但使用KDE的方法由于参数较少而更加实用。
{"title":"Density-Based Core Support Extraction for Non-stationary Environments with Extreme Verification Latency","authors":"Raul Sena Ferreira, Bruno M. A. da Silva, W. Teixeira, Geraldo Zimbrão, L. Alvim","doi":"10.1109/BRACIS.2018.00039","DOIUrl":"https://doi.org/10.1109/BRACIS.2018.00039","url":null,"abstract":"Machine learning solutions usually consider that the train and test data has the same probabilistic distribution, that is, the data is stationary. However, in streaming scenarios, data distribution generally change through the time, that is, the data is non-stationary. The main challenge in such online environment is the model adaptation for the constant drifts in data distribution. Besides, other important restriction may happen in online scenarios: the extreme latency to verify the labels. Worth to mention that the incremental drift assumption is that class distributions overlap at subsequent time steps. Hence, the core region of data distribution have significant overlap with incoming data. Therefore, selecting samples from these core regions helps to retain the most important instances that represent the new distribution. This selection is denominated core support extraction (CSE). Thus, we present a study about density-based algorithms applied in non-stationary environments. We compared KDE, GMM and two variations of DBSCAN against single semi-supervised approaches. We validated these approaches in seventeen synthetic datasets and a real one, showing the strengths and weaknesses of these CSE methods through many metrics. We show that a semi-supervised classifier is improved up to 68% on a real dataset when it is applied along with a density-based CSE algorithm. The results between KDE and GMM, as CSE methods, were close but the approach using KDE is more practical due to having less parameters.","PeriodicalId":405190,"journal":{"name":"2018 7th Brazilian Conference on Intelligent Systems (BRACIS)","volume":"100 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134410082","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
Towards a Referenceless Visual Quality Assessment Model Using Binarized Statistical Image Features 基于二值化统计图像特征的无参考视觉质量评价模型
Pub Date : 2018-10-01 DOI: 10.1109/BRACIS.2018.00048
P. Freitas, W. Y. L. Akamine, Mylène C. Q. Farias
In many practical multimedia applications, the visual content is modified during transmission, enhancement, modification, and compression stages. These modifications often create visible distortions that may be perceived by humans. Therefore, the development of algorithms that are able to assess the visual quality as perceived by a human viewer can lead to significant progress in multimedia applications. Many researchers have developed algorithms that estimate visual quality. These algorithms can either make use of the full pristine content (full-reference metrics), partial aspects of the pristine content (reduced-reference metrics) or only the assessed content (referenceless or no-reference metrics). These three approaches have advantages and drawbacks. Nevertheless, although the design of a referenceless metric is more challenging, they have greater applicability in different scenarios. This paper introduces a novel referenceless image quality assessment (RIQA) metric. The proposed metric uses statistics of the Binarized Statistical Image Features descriptor (BSIF) to analyze the textures of an image. These statistics are mapped into subjective quality scores using a Random Forest Regression approach. Results show that the proposed metric is robust and accurate, outperforming other state-of-the-art RIQA methods.
在许多实际的多媒体应用中,可视内容在传输、增强、修改和压缩阶段被修改。这些修改通常会造成人类可能感知到的明显扭曲。因此,能够评估人类观众所感知的视觉质量的算法的发展可以导致多媒体应用的重大进展。许多研究人员已经开发出了评估视觉质量的算法。这些算法可以使用完整的原始内容(完整引用度量)、原始内容的部分方面(减少引用度量)或仅使用评估的内容(无引用或无引用度量)。这三种方法各有优缺点。然而,尽管无参考度量的设计更具挑战性,但它们在不同的场景中具有更大的适用性。介绍了一种新的无参考图像质量评价(RIQA)度量。该度量使用二值化统计图像特征描述符(BSIF)的统计量来分析图像的纹理。使用随机森林回归方法将这些统计数据映射为主观质量分数。结果表明,所提出的度量鲁棒性和准确性,优于其他最先进的RIQA方法。
{"title":"Towards a Referenceless Visual Quality Assessment Model Using Binarized Statistical Image Features","authors":"P. Freitas, W. Y. L. Akamine, Mylène C. Q. Farias","doi":"10.1109/BRACIS.2018.00048","DOIUrl":"https://doi.org/10.1109/BRACIS.2018.00048","url":null,"abstract":"In many practical multimedia applications, the visual content is modified during transmission, enhancement, modification, and compression stages. These modifications often create visible distortions that may be perceived by humans. Therefore, the development of algorithms that are able to assess the visual quality as perceived by a human viewer can lead to significant progress in multimedia applications. Many researchers have developed algorithms that estimate visual quality. These algorithms can either make use of the full pristine content (full-reference metrics), partial aspects of the pristine content (reduced-reference metrics) or only the assessed content (referenceless or no-reference metrics). These three approaches have advantages and drawbacks. Nevertheless, although the design of a referenceless metric is more challenging, they have greater applicability in different scenarios. This paper introduces a novel referenceless image quality assessment (RIQA) metric. The proposed metric uses statistics of the Binarized Statistical Image Features descriptor (BSIF) to analyze the textures of an image. These statistics are mapped into subjective quality scores using a Random Forest Regression approach. Results show that the proposed metric is robust and accurate, outperforming other state-of-the-art RIQA methods.","PeriodicalId":405190,"journal":{"name":"2018 7th Brazilian Conference on Intelligent Systems (BRACIS)","volume":"42 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132148778","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Exploring the Data Using Extended Association Rule Network 利用扩展关联规则网络探索数据
Pub Date : 2018-10-01 DOI: 10.1109/bracis.2018.00064
Renan de Padua, Dario Brito Calçada, Verônica Oliveira de Carvalho, Solange Oliveira Rezende
In this paper, we presented the Extended Association Rule Network (ExARN) to structure, prune and analyze a set of association rules, aiming to build hypothesis candidates. The ExARN extends the ARN, proposed by [2], allowing a more complete exploration. We validate the ExARN using two databases: contact lenses and hayes-roth, both available online for download. The results were validated by comparing the ExARN to the conventional ARN and also by comparing the results with a decision tree algorithms. The approach presented promising results, showing its capability to explain a set of objective items, aiding the user on the hypothesis building.
在本文中,我们提出了扩展关联规则网络(ExARN)来构建、修剪和分析一组关联规则,旨在建立假设候选。ExARN扩展了[2]提出的ARN,允许更完整的探索。我们使用两个数据库来验证ExARN:隐形眼镜数据库和hayes-roth数据库,这两个数据库都可以在线下载。通过将ExARN与传统的ARN进行比较,并将结果与决策树算法进行比较,验证了结果。该方法呈现出令人鼓舞的结果,显示出其解释一组客观项目的能力,帮助用户建立假设。
{"title":"Exploring the Data Using Extended Association Rule Network","authors":"Renan de Padua, Dario Brito Calçada, Verônica Oliveira de Carvalho, Solange Oliveira Rezende","doi":"10.1109/bracis.2018.00064","DOIUrl":"https://doi.org/10.1109/bracis.2018.00064","url":null,"abstract":"In this paper, we presented the Extended Association Rule Network (ExARN) to structure, prune and analyze a set of association rules, aiming to build hypothesis candidates. The ExARN extends the ARN, proposed by [2], allowing a more complete exploration. We validate the ExARN using two databases: contact lenses and hayes-roth, both available online for download. The results were validated by comparing the ExARN to the conventional ARN and also by comparing the results with a decision tree algorithms. The approach presented promising results, showing its capability to explain a set of objective items, aiding the user on the hypothesis building.","PeriodicalId":405190,"journal":{"name":"2018 7th Brazilian Conference on Intelligent Systems (BRACIS)","volume":"34 7","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114113057","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Similarity-Based Matrix Factorization for Item Cold-Start in Recommender Systems 基于相似性矩阵分解的项目冷启动推荐系统
Pub Date : 2018-10-01 DOI: 10.1109/bracis.2018.00066
Eduardo Pereira Fressato, Arthur Fortes da Costa, Marcelo Garcia Manzato
In recommender systems (RS) one of the most used approaches is collaborative filtering (CF), which recommends items according to the behavior of similar users. Among CF approaches, those based on matrix factorization are generally more effective because they allow the system to discover the underlying characteristics of interactions between users and items. However, this approach presents the cold-start problem, which occurs because of the system's inability to recommend new items and/or accurately predict new users' preferences. This paper proposes a novel matrix factorization approach, which incorporates similarity of items using their metadata, in order to improve the rating prediction task in an item cold-start scenario. For this purpose, we explore semantic descriptions of items which are gathered from knowledge bases available online. Our approach is evaluated in two different and publicly available datasets and compared against content-based and collaborative algorithms. The experiments show the effectiveness of our approach in the item cold-start scenario.
在推荐系统(RS)中,最常用的方法之一是协同过滤(CF),它根据相似用户的行为来推荐项目。在CF方法中,基于矩阵分解的方法通常更有效,因为它们允许系统发现用户和物品之间交互的潜在特征。然而,这种方法提出了冷启动问题,这是因为系统无法推荐新产品和/或准确预测新用户的偏好。为了改进冷启动场景下的评分预测任务,提出了一种新的矩阵分解方法,利用元数据结合物品的相似度。为此,我们探索从在线知识库中收集的项目的语义描述。我们的方法在两个不同的公开可用的数据集中进行了评估,并与基于内容的和协作的算法进行了比较。实验证明了该方法在项目冷启动场景下的有效性。
{"title":"Similarity-Based Matrix Factorization for Item Cold-Start in Recommender Systems","authors":"Eduardo Pereira Fressato, Arthur Fortes da Costa, Marcelo Garcia Manzato","doi":"10.1109/bracis.2018.00066","DOIUrl":"https://doi.org/10.1109/bracis.2018.00066","url":null,"abstract":"In recommender systems (RS) one of the most used approaches is collaborative filtering (CF), which recommends items according to the behavior of similar users. Among CF approaches, those based on matrix factorization are generally more effective because they allow the system to discover the underlying characteristics of interactions between users and items. However, this approach presents the cold-start problem, which occurs because of the system's inability to recommend new items and/or accurately predict new users' preferences. This paper proposes a novel matrix factorization approach, which incorporates similarity of items using their metadata, in order to improve the rating prediction task in an item cold-start scenario. For this purpose, we explore semantic descriptions of items which are gathered from knowledge bases available online. Our approach is evaluated in two different and publicly available datasets and compared against content-based and collaborative algorithms. The experiments show the effectiveness of our approach in the item cold-start scenario.","PeriodicalId":405190,"journal":{"name":"2018 7th Brazilian Conference on Intelligent Systems (BRACIS)","volume":"52 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123849110","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
期刊
2018 7th Brazilian Conference on Intelligent Systems (BRACIS)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1