首页 > 最新文献

2018 7th Brazilian Conference on Intelligent Systems (BRACIS)最新文献

英文 中文
Control of Gene Regulatory Networks Basin of Attractions with Batch Reinforcement Learning 基于批强化学习的基因调控网络吸引盆地控制
Pub Date : 2018-10-01 DOI: 10.1109/BRACIS.2018.00030
Cyntia Eico Hayama Nishida, Anna Helena Reali Costa, R. Bianchi
Basin of attraction contains biological functions and channels cell behavior, so when a gene network is in an unhealthy basin it may cause diseases. Control techniques can support the design of therapies that promote the transition of a biological system from diseased to healthier basins. Most control methods first infer a gene network and then derive a control strategy to avoid diseased states. However, this approach is limited to few genes and may cause other diseases, as the biological side of the problem is not considered. While changing between basins may change a diseased biological function for a healthier one, state avoidance can change functions in an unexpected way. We propose to extend a batch reinforcement learning method FQI-Sarsa, to change basin of attractions in a partial observable network. Using a batch reinforcement learning technique avoids the most time consuming phases that are the inference and control of the gene network. Results demonstrate that our method, BOAFQI-Sarsa, is more effective than previous studies that do not consider basins in their computations.
吸引力盆地包含生物功能并引导细胞行为,因此当基因网络处于不健康的盆地时,可能会导致疾病。控制技术可以支持设计促进生物系统从患病盆地向健康盆地过渡的疗法。大多数控制方法首先推断出基因网络,然后推导出控制策略以避免疾病状态。然而,这种方法仅限于少数基因,并可能导致其他疾病,因为没有考虑到问题的生物学方面。虽然在流域之间的变化可能会使患病的生物功能变为更健康的生物功能,但状态回避可以以意想不到的方式改变功能。我们提出扩展批强化学习方法FQI-Sarsa,以改变部分可观察网络中的吸引力盆地。使用批强化学习技术避免了基因网络的推理和控制这两个最耗时的阶段。结果表明,我们的方法(BOAFQI-Sarsa)比以往不考虑盆地的计算方法更有效。
{"title":"Control of Gene Regulatory Networks Basin of Attractions with Batch Reinforcement Learning","authors":"Cyntia Eico Hayama Nishida, Anna Helena Reali Costa, R. Bianchi","doi":"10.1109/BRACIS.2018.00030","DOIUrl":"https://doi.org/10.1109/BRACIS.2018.00030","url":null,"abstract":"Basin of attraction contains biological functions and channels cell behavior, so when a gene network is in an unhealthy basin it may cause diseases. Control techniques can support the design of therapies that promote the transition of a biological system from diseased to healthier basins. Most control methods first infer a gene network and then derive a control strategy to avoid diseased states. However, this approach is limited to few genes and may cause other diseases, as the biological side of the problem is not considered. While changing between basins may change a diseased biological function for a healthier one, state avoidance can change functions in an unexpected way. We propose to extend a batch reinforcement learning method FQI-Sarsa, to change basin of attractions in a partial observable network. Using a batch reinforcement learning technique avoids the most time consuming phases that are the inference and control of the gene network. Results demonstrate that our method, BOAFQI-Sarsa, is more effective than previous studies that do not consider basins in their computations.","PeriodicalId":405190,"journal":{"name":"2018 7th Brazilian Conference on Intelligent Systems (BRACIS)","volume":"6 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"120972818","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Fusion of Local Descriptors for Multi-view Facial Expression Recognition 多视角面部表情识别的局部描述符融合
Pub Date : 2018-10-01 DOI: 10.1109/BRACIS.2018.00104
Xuejian Wang, M. Fairhurst, A. Canuto
Facial expressions can be seen as a form of non-verbal communication as well as a primary means of conveying social information among humans.Automatic facial expression recognition (FER) can be applied to a wide range of scenarios in human-computer interaction, facial animation, entertainment, and psychology studies. For feature representation in a FER system, various texture descriptors have been employed to derive an effective solution for this system. However, these individual texture descriptor-based FER systems have often failed to achieve effective performance in the recognition of facial expressions. In this sense, it is necessary to further improve the general performance of a facial expression recognition system, evaluating different feature representations. In this paper, a novel local descriptor for a facial expression recognition system is proposed, designated the level of difference descriptor (LOD). The main goal is to use this descriptor as a supplement to state-of-the-art local descriptors to further improve the performance of a FER system in terms of classification accuracy. Furthermore, the fusion of various texture features for devising a robust feature representation for multi-view facial expression recognition is presented.
面部表情可以被看作是一种非语言交流的形式,也是人类传递社会信息的主要手段。自动面部表情识别(FER)可以广泛应用于人机交互、面部动画、娱乐和心理学研究等领域。对于FER系统中的特征表示,采用了各种纹理描述符来推导出该系统的有效解。然而,这些基于单个纹理描述符的FER系统在面部表情识别中往往不能达到有效的性能。从这个意义上说,有必要进一步提高面部表情识别系统的总体性能,评估不同的特征表示。本文提出了一种新的面部表情识别系统的局部描述符,称为差分描述符(LOD)。主要目标是使用该描述符作为最先进的局部描述符的补充,以进一步提高FER系统在分类精度方面的性能。在此基础上,提出了多种纹理特征的融合,设计了一种鲁棒的多视图面部表情识别特征表示。
{"title":"Fusion of Local Descriptors for Multi-view Facial Expression Recognition","authors":"Xuejian Wang, M. Fairhurst, A. Canuto","doi":"10.1109/BRACIS.2018.00104","DOIUrl":"https://doi.org/10.1109/BRACIS.2018.00104","url":null,"abstract":"Facial expressions can be seen as a form of non-verbal communication as well as a primary means of conveying social information among humans.Automatic facial expression recognition (FER) can be applied to a wide range of scenarios in human-computer interaction, facial animation, entertainment, and psychology studies. For feature representation in a FER system, various texture descriptors have been employed to derive an effective solution for this system. However, these individual texture descriptor-based FER systems have often failed to achieve effective performance in the recognition of facial expressions. In this sense, it is necessary to further improve the general performance of a facial expression recognition system, evaluating different feature representations. In this paper, a novel local descriptor for a facial expression recognition system is proposed, designated the level of difference descriptor (LOD). The main goal is to use this descriptor as a supplement to state-of-the-art local descriptors to further improve the performance of a FER system in terms of classification accuracy. Furthermore, the fusion of various texture features for devising a robust feature representation for multi-view facial expression recognition is presented.","PeriodicalId":405190,"journal":{"name":"2018 7th Brazilian Conference on Intelligent Systems (BRACIS)","volume":"12 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116703874","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Study of Biclustering Coherence Measures for Gene Expression Data 基因表达数据的双聚类一致性测度研究
Pub Date : 2018-10-01 DOI: 10.1109/BRACIS.2018.00100
V. A. Padilha, A. Carvalho
Biclustering algorithms have become one of the main tools for the analysis of gene expression data. They allow the identification of local patterns defined by subsets of genes and subsets of samples, which cannot be detected by traditional clustering algorithms. However, although useful, biclustering is a NP-hard problem. Therefore, the majority of biclustering algorithms look for biclusters optimizing a pre-established coherence measure. In the last 20 years, several heuristics and measures have been published for biclustering. However, most of these publications do not provide an extensive comparison of bicluster coherence measures on practical scenarios. To deal with this problem, this paper analyze the behavior of 15 bicluster coherence measures and external evaluation regarding 9 algorithms from the literature on gene expression datasets. According to the experimental results, there is no clear relation between these measures and assessment using information from gene ontology.
双聚类算法已成为基因表达数据分析的主要工具之一。它们允许识别由基因子集和样本子集定义的局部模式,这是传统聚类算法无法检测到的。然而,尽管有用,双聚类是一个np困难问题。因此,大多数双聚类算法寻找优化预先建立的相干度量的双聚类。在过去的20年里,针对双聚类已经发表了一些启发式和度量方法。然而,这些出版物中的大多数都没有对实际情况下的双聚类相干度量进行广泛的比较。为了解决这一问题,本文分析了基因表达数据集文献中9种算法的15种双聚类一致性度量的行为和外部评价。实验结果表明,这些措施与利用基因本体信息进行评估之间没有明确的关系。
{"title":"A Study of Biclustering Coherence Measures for Gene Expression Data","authors":"V. A. Padilha, A. Carvalho","doi":"10.1109/BRACIS.2018.00100","DOIUrl":"https://doi.org/10.1109/BRACIS.2018.00100","url":null,"abstract":"Biclustering algorithms have become one of the main tools for the analysis of gene expression data. They allow the identification of local patterns defined by subsets of genes and subsets of samples, which cannot be detected by traditional clustering algorithms. However, although useful, biclustering is a NP-hard problem. Therefore, the majority of biclustering algorithms look for biclusters optimizing a pre-established coherence measure. In the last 20 years, several heuristics and measures have been published for biclustering. However, most of these publications do not provide an extensive comparison of bicluster coherence measures on practical scenarios. To deal with this problem, this paper analyze the behavior of 15 bicluster coherence measures and external evaluation regarding 9 algorithms from the literature on gene expression datasets. According to the experimental results, there is no clear relation between these measures and assessment using information from gene ontology.","PeriodicalId":405190,"journal":{"name":"2018 7th Brazilian Conference on Intelligent Systems (BRACIS)","volume":"108 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125195472","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Dimensionality Reduction for the Algorithm Recommendation Problem 算法推荐问题的降维方法
Pub Date : 2018-10-01 DOI: 10.1109/BRACIS.2018.00062
Edesio Alcobaça, R. G. Mantovani, A. L. Rossi, A. Carvalho
Given the increase in data generation, as many algorithms have become available in recent years, the algorithm recommendation problem has attracted increasing attention in Machine Learning. This problem has been addressed in the Machine Learning community as a learning task at the meta-level where the most suitable algorithm has to be recommended for a specific dataset. Since it is not trivial to define which characteristics are the most useful for a specific domain, several meta-features have been proposed and used, increasing the meta-data meta-feature dimension. This study investigates the influence of dimensionality reduction techniques on the quality of the algorithm recommendation process. Experiments were carried out with 15 algorithm recommendation problems from the Aslib library, 4 meta-learners, and 3 dimensionality reduction techniques. The experimental results showed that linear aggregation techniques, such as PCA and LDA, can be used in algorithm recommendation problems to reduce the number of meta-features and computational cost without losing predictive performance.
随着数据生成的增加,近年来出现了许多算法,算法推荐问题在机器学习中越来越受到关注。这个问题已经在机器学习社区中作为元级的学习任务来解决,在元级学习任务中,必须为特定的数据集推荐最合适的算法。由于定义哪些特征对特定领域最有用并非易事,因此已经提出并使用了几个元特征,从而增加了元数据元特征维度。本研究探讨了降维技术对算法推荐过程质量的影响。实验使用了来自Aslib库的15个算法推荐问题、4个元学习器和3种降维技术。实验结果表明,线性聚合技术(如PCA和LDA)可以用于算法推荐问题,在不损失预测性能的情况下减少元特征的数量和计算成本。
{"title":"Dimensionality Reduction for the Algorithm Recommendation Problem","authors":"Edesio Alcobaça, R. G. Mantovani, A. L. Rossi, A. Carvalho","doi":"10.1109/BRACIS.2018.00062","DOIUrl":"https://doi.org/10.1109/BRACIS.2018.00062","url":null,"abstract":"Given the increase in data generation, as many algorithms have become available in recent years, the algorithm recommendation problem has attracted increasing attention in Machine Learning. This problem has been addressed in the Machine Learning community as a learning task at the meta-level where the most suitable algorithm has to be recommended for a specific dataset. Since it is not trivial to define which characteristics are the most useful for a specific domain, several meta-features have been proposed and used, increasing the meta-data meta-feature dimension. This study investigates the influence of dimensionality reduction techniques on the quality of the algorithm recommendation process. Experiments were carried out with 15 algorithm recommendation problems from the Aslib library, 4 meta-learners, and 3 dimensionality reduction techniques. The experimental results showed that linear aggregation techniques, such as PCA and LDA, can be used in algorithm recommendation problems to reduce the number of meta-features and computational cost without losing predictive performance.","PeriodicalId":405190,"journal":{"name":"2018 7th Brazilian Conference on Intelligent Systems (BRACIS)","volume":"51 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121742678","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Estimation of the Energy Production in a Wind Farm Using Regression Methods and Wind Speed Forecast 基于回归方法和风速预报的风电场发电量估算
Pub Date : 2018-10-01 DOI: 10.1109/bracis.2018.00022
P. P. Rebouças Filho, Navar de Medeiros Mendonça e Nascimento, Shara Shami Araújo Alves, Samuel Luz Gomes, Cláudio Marques de Sá Medeiros
Wind energy is an excellent source of alternative energy to complement the Brazilian energy matrix. However, one of the significant challenges lies in managing this resource, due to its intermittent behavior. This study addresses the estimation of the electric power production of the wind park, so its management could be more efficient. A real data from one-year records of wind speed and power from a wind park installed in a wind farm in Ceará State, Brazil, is used. At first, we provide a study of Logistic versus Least Squares regression to model the wind turbine power curve. Then, a novel variant of the Least Square Support Regression is used to forecast wind speed on the site. The Logistic regression demonstrated to be more suitable for the task of regression, and the wind speed forecasting with three steps ahead provided lower error rates. Our approach represents a system based on data from both wind turbine power and speed to serve as a tool for helping energy selling issues and scheduling turbine maintenance on periods of time with low energy production in the wind park.
风能是一种极好的替代能源,可以补充巴西的能源矩阵。然而,由于这种资源的间歇性行为,其中一个重大挑战在于如何管理这种资源。本研究解决了对风电场发电量的估算,从而使其管理更加有效。本文使用了巴西塞埃尔州一个风力发电场一年的风速和功率记录的真实数据。首先,我们研究了Logistic与最小二乘回归对风力发电机功率曲线的建模。然后,使用一种新颖的最小二乘支持回归来预测现场的风速。Logistic回归更适合于回归任务,提前三步预测风速的错误率更低。我们的方法代表了一个基于风力涡轮机功率和速度数据的系统,可以作为一个工具,帮助解决能源销售问题,并在风力发电场能源产量低的时期安排涡轮机维护。
{"title":"Estimation of the Energy Production in a Wind Farm Using Regression Methods and Wind Speed Forecast","authors":"P. P. Rebouças Filho, Navar de Medeiros Mendonça e Nascimento, Shara Shami Araújo Alves, Samuel Luz Gomes, Cláudio Marques de Sá Medeiros","doi":"10.1109/bracis.2018.00022","DOIUrl":"https://doi.org/10.1109/bracis.2018.00022","url":null,"abstract":"Wind energy is an excellent source of alternative energy to complement the Brazilian energy matrix. However, one of the significant challenges lies in managing this resource, due to its intermittent behavior. This study addresses the estimation of the electric power production of the wind park, so its management could be more efficient. A real data from one-year records of wind speed and power from a wind park installed in a wind farm in Ceará State, Brazil, is used. At first, we provide a study of Logistic versus Least Squares regression to model the wind turbine power curve. Then, a novel variant of the Least Square Support Regression is used to forecast wind speed on the site. The Logistic regression demonstrated to be more suitable for the task of regression, and the wind speed forecasting with three steps ahead provided lower error rates. Our approach represents a system based on data from both wind turbine power and speed to serve as a tool for helping energy selling issues and scheduling turbine maintenance on periods of time with low energy production in the wind park.","PeriodicalId":405190,"journal":{"name":"2018 7th Brazilian Conference on Intelligent Systems (BRACIS)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115183348","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
[Publisher's information] (发布者的信息)
Pub Date : 2018-10-01 DOI: 10.1109/bracis.2018.00106
{"title":"[Publisher's information]","authors":"","doi":"10.1109/bracis.2018.00106","DOIUrl":"https://doi.org/10.1109/bracis.2018.00106","url":null,"abstract":"","PeriodicalId":405190,"journal":{"name":"2018 7th Brazilian Conference on Intelligent Systems (BRACIS)","volume":"19 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123842719","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Deep Learning for People Detection on Beach Images 基于深度学习的海滩图像人物检测
Pub Date : 2018-10-01 DOI: 10.1109/BRACIS.2018.00045
S. Chevtchenko, Rafaella F. Vale, F. Cordeiro, V. Macário
Convolutional architectures have in recent years become state-of-the-art for several object detection tasks. However, these detectors have not yet been evaluated for detection and monitoring of beach areas. As some of these areas need to be continually monitored for dangerous situations, such as shark attacks, an automated system would be an effective risk control measure. The most significant and specific challenges for this problem are variable scene illumination, partial occlusion and distant camera position. In this work we present a study on three recent convolutional architectures for the task of people detection in beach scenarios. Our dataset is composed of images taken in the Boa Viagem beach, in Brazil, and is used to evaluate Faster R-CNN, R-FCN and SSD in terms of quality and speed of detection. The detectors are pretrained on a dataset containing 91 classes of objects, including people with different levels of scale and occlusion. The results suggest that the Faster R-CNN meta-architecture with the Resnet 101 feature extractor generates significantly better detections in terms of F-measure, while performing at 5.6 fps on a GTX 1080 Ti GPU.
卷积架构近年来已经成为一些目标检测任务的最先进的技术。然而,这些探测器在探测和监测海滩地区方面尚未得到评价。由于其中一些区域需要持续监测危险情况,例如鲨鱼袭击,自动化系统将是一种有效的风险控制措施。这个问题最重要和最具体的挑战是可变的场景照明,部分遮挡和远距离摄像机位置。在这项工作中,我们对海滩场景中人员检测任务的三种最新卷积架构进行了研究。我们的数据集由在巴西Boa Viagem海滩拍摄的图像组成,并用于评估Faster R-CNN, R-FCN和SSD在检测质量和速度方面的性能。检测器在包含91类对象的数据集上进行预训练,包括具有不同规模和遮挡水平的人。结果表明,更快的R-CNN元架构与Resnet 101特征提取器在F-measure方面产生了显着更好的检测,而在GTX 1080 Ti GPU上以5.6 fps的速度执行。
{"title":"Deep Learning for People Detection on Beach Images","authors":"S. Chevtchenko, Rafaella F. Vale, F. Cordeiro, V. Macário","doi":"10.1109/BRACIS.2018.00045","DOIUrl":"https://doi.org/10.1109/BRACIS.2018.00045","url":null,"abstract":"Convolutional architectures have in recent years become state-of-the-art for several object detection tasks. However, these detectors have not yet been evaluated for detection and monitoring of beach areas. As some of these areas need to be continually monitored for dangerous situations, such as shark attacks, an automated system would be an effective risk control measure. The most significant and specific challenges for this problem are variable scene illumination, partial occlusion and distant camera position. In this work we present a study on three recent convolutional architectures for the task of people detection in beach scenarios. Our dataset is composed of images taken in the Boa Viagem beach, in Brazil, and is used to evaluate Faster R-CNN, R-FCN and SSD in terms of quality and speed of detection. The detectors are pretrained on a dataset containing 91 classes of objects, including people with different levels of scale and occlusion. The results suggest that the Faster R-CNN meta-architecture with the Resnet 101 feature extractor generates significantly better detections in terms of F-measure, while performing at 5.6 fps on a GTX 1080 Ti GPU.","PeriodicalId":405190,"journal":{"name":"2018 7th Brazilian Conference on Intelligent Systems (BRACIS)","volume":"27 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128039211","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
Benchmarking Multi-target Regression Methods 基准多目标回归方法
Pub Date : 2018-10-01 DOI: 10.1109/bracis.2018.00075
Saulo Martiello Mastelini, Everton José Santana, Victor Guilherme Turrisi da Costa, Sylvio Barbon Junior
Machine learning methods for multi-target regression (MTR) rely on the hypothesis that an inter-target correlation can improve predictive performance. In the last years, many MTR methods were developed, but there are still questions about how their performances are influenced by the datasets characteristics such as linearity, number of targets, and inter-correlation complexity. Aiming at contributing to the understanding of the relationship between the dataset properties and MTR methods, we generated 33 synthetic datasets with controlled characteristics and tested their performance with single-target and six MTR methods. The results showed that MTR methods were able to improve performance even in datasets whose targets were not linearly correlated among them, but the predictive improvement differed among the combinations of method/regressor according to the dataset composition.
多目标回归(MTR)的机器学习方法依赖于目标间相关性可以提高预测性能的假设。近年来,人们开发了许多MTR方法,但它们的性能如何受到数据集特征(如线性度、目标数量和相互关联复杂性)的影响仍然存在疑问。为了更好地理解数据集属性与MTR方法之间的关系,我们生成了33个具有受控特征的合成数据集,并使用单目标和6种MTR方法测试了它们的性能。结果表明,即使在目标不线性相关的数据集上,MTR方法也能提高预测性能,但根据数据集组成的不同,方法/回归量组合的预测提高程度有所不同。
{"title":"Benchmarking Multi-target Regression Methods","authors":"Saulo Martiello Mastelini, Everton José Santana, Victor Guilherme Turrisi da Costa, Sylvio Barbon Junior","doi":"10.1109/bracis.2018.00075","DOIUrl":"https://doi.org/10.1109/bracis.2018.00075","url":null,"abstract":"Machine learning methods for multi-target regression (MTR) rely on the hypothesis that an inter-target correlation can improve predictive performance. In the last years, many MTR methods were developed, but there are still questions about how their performances are influenced by the datasets characteristics such as linearity, number of targets, and inter-correlation complexity. Aiming at contributing to the understanding of the relationship between the dataset properties and MTR methods, we generated 33 synthetic datasets with controlled characteristics and tested their performance with single-target and six MTR methods. The results showed that MTR methods were able to improve performance even in datasets whose targets were not linearly correlated among them, but the predictive improvement differed among the combinations of method/regressor according to the dataset composition.","PeriodicalId":405190,"journal":{"name":"2018 7th Brazilian Conference on Intelligent Systems (BRACIS)","volume":"98 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122760121","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Making Data Stream Classification Tree-Based Ensembles Lighter 使基于数据流分类树的集成更轻
Pub Date : 2018-10-01 DOI: 10.1109/BRACIS.2018.00089
V. G. T. D. Costa, S. M. Mastelini, A. Carvalho, Sylvio Barbon Junior
Recently, several classification algorithms capable of dealing with potentially infinite data streams have been proposed. One of the main challenges of this task is to continuously update predictive models to address concept drifts without compromise their predictive performance. Moreover, the classification algorithm used must be able to efficiently deal with processing time and memory limitations. In the data stream mining literature, ensemble-based classification algorithms are a good alternative to satisfy the previous requirements. These algorithms combine multiple weak learner algorithms, e.g., the Very Fast Decision Tree (VFDT), to create a model with higher predictive performance. However, the memory costs of each weak learner are stacked in an ensemble, compromising the limited space requirements. To manage the trade-off between accuracy, memory space, and processing time, this paper proposes to use the Strict VFDT (SVFDT) algorithm as an alternative weak learner for ensemble solutions which is capable of reducing memory consumption without harming the predictive performance. This paper experimentally compares two traditional and three state-of-the-art ensembles using as weak learners the VFDT and SVFDT across thirteen benchmark datasets. According to the experimental results, the proposed algorithm can obtain a similar predictive performance with a significant economy of memory space.
最近,已经提出了几种能够处理潜在无限数据流的分类算法。该任务的主要挑战之一是不断更新预测模型以解决概念漂移而不影响其预测性能。此外,所使用的分类算法必须能够有效地处理处理时间和内存限制。在数据流挖掘文献中,基于集成的分类算法是满足上述要求的一个很好的替代方案。这些算法结合了多种弱学习算法,如快速决策树(VFDT),以创建具有更高预测性能的模型。然而,每个弱学习器的记忆成本是堆叠在一个集合中,损害了有限的空间要求。为了处理精度、内存空间和处理时间之间的权衡,本文提出使用严格VFDT (SVFDT)算法作为集成解决方案的替代弱学习器,它能够在不损害预测性能的情况下减少内存消耗。本文在13个基准数据集上实验比较了两种传统集成和三种最先进集成作为弱学习器的VFDT和SVFDT。实验结果表明,该算法在节省内存空间的前提下,可以获得相似的预测性能。
{"title":"Making Data Stream Classification Tree-Based Ensembles Lighter","authors":"V. G. T. D. Costa, S. M. Mastelini, A. Carvalho, Sylvio Barbon Junior","doi":"10.1109/BRACIS.2018.00089","DOIUrl":"https://doi.org/10.1109/BRACIS.2018.00089","url":null,"abstract":"Recently, several classification algorithms capable of dealing with potentially infinite data streams have been proposed. One of the main challenges of this task is to continuously update predictive models to address concept drifts without compromise their predictive performance. Moreover, the classification algorithm used must be able to efficiently deal with processing time and memory limitations. In the data stream mining literature, ensemble-based classification algorithms are a good alternative to satisfy the previous requirements. These algorithms combine multiple weak learner algorithms, e.g., the Very Fast Decision Tree (VFDT), to create a model with higher predictive performance. However, the memory costs of each weak learner are stacked in an ensemble, compromising the limited space requirements. To manage the trade-off between accuracy, memory space, and processing time, this paper proposes to use the Strict VFDT (SVFDT) algorithm as an alternative weak learner for ensemble solutions which is capable of reducing memory consumption without harming the predictive performance. This paper experimentally compares two traditional and three state-of-the-art ensembles using as weak learners the VFDT and SVFDT across thirteen benchmark datasets. According to the experimental results, the proposed algorithm can obtain a similar predictive performance with a significant economy of memory space.","PeriodicalId":405190,"journal":{"name":"2018 7th Brazilian Conference on Intelligent Systems (BRACIS)","volume":"71 5","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"120968808","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
Data Classification: Dimensionality Reduction Using Combined and Non-combined Multidimensional Projection Techniques 数据分类:使用组合和非组合多维投影技术的降维
Pub Date : 2018-10-01 DOI: 10.1109/BRACIS.2018.00076
G. E. Rodrigues, Wilson Estécio Marcílio Júnior, D. M. Eler
Dimensionality Reduction is a commonly used method to reduce the number of dimensions of data. In this work, we verified its influence in classification process using combinations of projection techniques as dimensionality reduction algorithms. We also used Naïve Bayes and SMO as classifiers.
降维是一种常用的降低数据维数的方法。在这项工作中,我们使用组合投影技术作为降维算法验证了它在分类过程中的影响。我们还使用Naïve贝叶斯和SMO作为分类器。
{"title":"Data Classification: Dimensionality Reduction Using Combined and Non-combined Multidimensional Projection Techniques","authors":"G. E. Rodrigues, Wilson Estécio Marcílio Júnior, D. M. Eler","doi":"10.1109/BRACIS.2018.00076","DOIUrl":"https://doi.org/10.1109/BRACIS.2018.00076","url":null,"abstract":"Dimensionality Reduction is a commonly used method to reduce the number of dimensions of data. In this work, we verified its influence in classification process using combinations of projection techniques as dimensionality reduction algorithms. We also used Naïve Bayes and SMO as classifiers.","PeriodicalId":405190,"journal":{"name":"2018 7th Brazilian Conference on Intelligent Systems (BRACIS)","volume":"108 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121376821","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
期刊
2018 7th Brazilian Conference on Intelligent Systems (BRACIS)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1