首页 > 最新文献

2018 IEEE 30th International Conference on Tools with Artificial Intelligence (ICTAI)最新文献

英文 中文
Sequence Generative Adversarial Network for Long Text Summarization 用于长文本摘要的序列生成对抗网络
Haoji Xu, Yanan Cao, Ruipeng Jia, Yanbing Liu, Jianlong Tan
In this paper, we propose a new adversarial training framework for text summarization task. Although sequence-to-sequence models have achieved state-of-the-art performance in abstractive summarization, the training strategy (MLE) suffers from exposure bias in the inference stage. This discrepancy between training and inference makes generated summaries less coherent and accuracy, which is more prominent in summarizing long articles. To address this issue, we model abstractive summarization using Generative Adversarial Network (GAN), aiming to minimize the gap between generated summaries and the ground-truth ones. This framework consists of two models: a generator that generates summaries, a discriminator that evaluates generated summaries. Reinforcement learning (RL) strategy is used to guarantee the co-training of generator and discriminator. Besides, motivated by the nature of summarization task, we design a novel Triple-RNNs discriminator, and extend the off-the-shelf generator by appending encoder and decoder with attention mechanism. Experimental results showed that our model significantly outperforms the state-of-the-art models, especially on long text corpus.
在本文中,我们提出了一个新的文本摘要任务对抗训练框架。尽管序列到序列模型在抽象摘要方面已经取得了最先进的性能,但训练策略在推理阶段存在暴露偏差。训练和推理之间的这种差异使得生成的摘要缺乏连贯性和准确性,这在总结长文章时更为突出。为了解决这个问题,我们使用生成对抗网络(GAN)对抽象摘要进行建模,旨在最大限度地减少生成摘要与真实摘要之间的差距。该框架由两个模型组成:一个生成摘要的生成器,一个对生成的摘要进行评估的鉴别器。采用强化学习(RL)策略保证生成器和鉴别器的协同训练。此外,根据摘要任务的性质,我们设计了一种新的三重rnn鉴别器,并通过添加注意机制的编码器和解码器来扩展现有的生成器。实验结果表明,我们的模型明显优于目前最先进的模型,特别是在长文本语料库上。
{"title":"Sequence Generative Adversarial Network for Long Text Summarization","authors":"Haoji Xu, Yanan Cao, Ruipeng Jia, Yanbing Liu, Jianlong Tan","doi":"10.1109/ICTAI.2018.00045","DOIUrl":"https://doi.org/10.1109/ICTAI.2018.00045","url":null,"abstract":"In this paper, we propose a new adversarial training framework for text summarization task. Although sequence-to-sequence models have achieved state-of-the-art performance in abstractive summarization, the training strategy (MLE) suffers from exposure bias in the inference stage. This discrepancy between training and inference makes generated summaries less coherent and accuracy, which is more prominent in summarizing long articles. To address this issue, we model abstractive summarization using Generative Adversarial Network (GAN), aiming to minimize the gap between generated summaries and the ground-truth ones. This framework consists of two models: a generator that generates summaries, a discriminator that evaluates generated summaries. Reinforcement learning (RL) strategy is used to guarantee the co-training of generator and discriminator. Besides, motivated by the nature of summarization task, we design a novel Triple-RNNs discriminator, and extend the off-the-shelf generator by appending encoder and decoder with attention mechanism. Experimental results showed that our model significantly outperforms the state-of-the-art models, especially on long text corpus.","PeriodicalId":254686,"journal":{"name":"2018 IEEE 30th International Conference on Tools with Artificial Intelligence (ICTAI)","volume":"29 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116856684","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Improved Spoken Uyghur Segmentation for Neural Machine Translation 基于神经机器翻译的维吾尔语语音切分方法
Chenggang Mi, Yating Yang, Xi Zhou, Lei Wang, Tonghai Jiang
To increase vocabulary overlap in spoken Uyghur neural machine translation (NMT), we propose a novel method to enhance the common used subword units based segmentation method. In particular, we apply a log-linear model as the main framework and integrate several features such as subword, morphological information, bilingual word alignment and monolingual language model into it. Experimental results show that spoken Uyghur segmentation with our proposed method improves the performance of the spoken Uyghur-Chinese NMT significantly (yield up to 1.52 BLEU improvements).
为了增加维吾尔语口语神经机器翻译中的词汇重叠,提出了一种改进常用子词单元分割方法的新方法。特别地,我们以对数线性模型为主要框架,整合了子词、形态信息、双语词对齐和单语语言模型等特征。实验结果表明,本文提出的维吾尔语语音分割方法显著提高了维吾尔语-汉语语音NMT的性能(良率提高了1.52 BLEU)。
{"title":"Improved Spoken Uyghur Segmentation for Neural Machine Translation","authors":"Chenggang Mi, Yating Yang, Xi Zhou, Lei Wang, Tonghai Jiang","doi":"10.1109/ICTAI.2018.00018","DOIUrl":"https://doi.org/10.1109/ICTAI.2018.00018","url":null,"abstract":"To increase vocabulary overlap in spoken Uyghur neural machine translation (NMT), we propose a novel method to enhance the common used subword units based segmentation method. In particular, we apply a log-linear model as the main framework and integrate several features such as subword, morphological information, bilingual word alignment and monolingual language model into it. Experimental results show that spoken Uyghur segmentation with our proposed method improves the performance of the spoken Uyghur-Chinese NMT significantly (yield up to 1.52 BLEU improvements).","PeriodicalId":254686,"journal":{"name":"2018 IEEE 30th International Conference on Tools with Artificial Intelligence (ICTAI)","volume":"18 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124300323","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Novel Automatic Context-Based Similarity Metric for Local Outlier Detection Tasks 局部离群点检测任务中一种新的基于上下文的自动相似度度量
Fan Meng, Yang Gao, Ruili Wang
Local outlier detection is able to capture local behavior to improve detection performance compared to traditional global outlier detection techniques. Most existing local outlier detection methods have the fundamental assumption that attributes and attribute values are independent and identically distributed (IID). However, in many situations, since the attributes usually have an inner structure, they should not be handled equally. To address the issue above, we propose a novel automatic context-based similarity metric for local outlier detection tasks. This paper mainly includes three aspects: (i) to propose a novel approach to automatically detect the contextual attributes by capturing the attribute intra-coupling and inter-coupling; (ii) to introduce a Non-IID similarity metric to derive the kNN set and reachability distance of an object based on the attribute structure and incorporate it into local outlier detection tasks; (iii) to build a data set called EG-Permission, which is a real-world data set from an E-Government Information System for context-based local outlier detection. Results obtained from 10 data sets show the proposed approach can identify the attribute structure effectively and improve the performance in local outlier detection tasks.
与传统的全局离群点检测技术相比,局部离群点检测能够捕获局部行为,从而提高检测性能。现有的局部离群点检测方法大都假定属性和属性值是独立且同分布的(IID)。然而,在许多情况下,由于属性通常具有内部结构,因此不应该平等地处理它们。为了解决上述问题,我们提出了一种用于局部离群值检测任务的基于上下文的自动相似性度量。本文主要包括三个方面的内容:(1)提出了一种通过捕获属性内耦合和间耦合来自动检测上下文属性的新方法;(ii)引入Non-IID相似性度量,根据属性结构推导出对象的kNN集和可达距离,并将其纳入局部离群点检测任务;(iii)建立一个名为EG-Permission的数据集,这是一个来自电子政府信息系统的真实数据集,用于基于上下文的局部离群值检测。10个数据集的结果表明,该方法可以有效地识别属性结构,提高局部离群点检测任务的性能。
{"title":"A Novel Automatic Context-Based Similarity Metric for Local Outlier Detection Tasks","authors":"Fan Meng, Yang Gao, Ruili Wang","doi":"10.1109/ICTAI.2018.00152","DOIUrl":"https://doi.org/10.1109/ICTAI.2018.00152","url":null,"abstract":"Local outlier detection is able to capture local behavior to improve detection performance compared to traditional global outlier detection techniques. Most existing local outlier detection methods have the fundamental assumption that attributes and attribute values are independent and identically distributed (IID). However, in many situations, since the attributes usually have an inner structure, they should not be handled equally. To address the issue above, we propose a novel automatic context-based similarity metric for local outlier detection tasks. This paper mainly includes three aspects: (i) to propose a novel approach to automatically detect the contextual attributes by capturing the attribute intra-coupling and inter-coupling; (ii) to introduce a Non-IID similarity metric to derive the kNN set and reachability distance of an object based on the attribute structure and incorporate it into local outlier detection tasks; (iii) to build a data set called EG-Permission, which is a real-world data set from an E-Government Information System for context-based local outlier detection. Results obtained from 10 data sets show the proposed approach can identify the attribute structure effectively and improve the performance in local outlier detection tasks.","PeriodicalId":254686,"journal":{"name":"2018 IEEE 30th International Conference on Tools with Artificial Intelligence (ICTAI)","volume":"26 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117151791","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Weight Adjusted Naive Bayes 权重调整朴素贝叶斯
Liangjun Yu, Liangxiao Jiang, Lungan Zhang, Dianhong Wang
Naive Bayes (NB) continues to be one of the top 10 data mining algorithms due to its simplicity, efficiency and efficacy, but the assumption of independence for attributes in NB is rarely true in reality. Attribute weighting is effective for overcoming the unrealistic assumption in NB, but it has received less attention than it warrants. Attribute weighting approaches can be broadly divided into two categories: filters and wrappers. In this paper, we mainly focus on wrapper attribute weighting approaches because they have generally higher classification performance than filter attribute weighting approaches. We propose a weight adjusted naive Bayes approach and simply denote it WANB. In WANB, the importance of each attribute in the classification of a training data set is learned and the weight vector reflecting this importance is updated. We use weight adjustment based on objective functions to find the optimal weight vector. We compare WANB with standard NB and its state-of-the-art attribute weighting approaches. Empirical studies on a collection of 36 benchmark datasets show that the classification performance of WANB significantly outperforms NB and all the existing filter approaches used to compare. Yet at the same time, compared to the existing wrapper approach called DEWANB, WANB is much more efficient and comprehensible.
朴素贝叶斯(Naive Bayes, NB)以其简单、高效和高效的特点一直是十大数据挖掘算法之一,但在现实中,朴素贝叶斯对属性独立的假设很少成立。属性加权对于克服NB中不切实际的假设是有效的,但它得到的关注比它应有的要少。属性加权方法大致可以分为两类:过滤器和包装器。在本文中,我们主要关注包装器属性加权方法,因为它们通常比过滤器属性加权方法具有更高的分类性能。我们提出了一种权重调整的朴素贝叶斯方法,并将其简单地表示为WANB。在WANB中,学习训练数据集分类中每个属性的重要性,并更新反映该重要性的权重向量。我们使用基于目标函数的权值调整来找到最优的权值向量。我们将WANB与标准NB及其最先进的属性加权方法进行比较。对36个基准数据集的实证研究表明,WANB的分类性能明显优于NB和所有现有的用于比较的滤波器方法。然而与此同时,与现有的称为DEWANB的包装器方法相比,WANB更加高效和易于理解。
{"title":"Weight Adjusted Naive Bayes","authors":"Liangjun Yu, Liangxiao Jiang, Lungan Zhang, Dianhong Wang","doi":"10.1109/ICTAI.2018.00129","DOIUrl":"https://doi.org/10.1109/ICTAI.2018.00129","url":null,"abstract":"Naive Bayes (NB) continues to be one of the top 10 data mining algorithms due to its simplicity, efficiency and efficacy, but the assumption of independence for attributes in NB is rarely true in reality. Attribute weighting is effective for overcoming the unrealistic assumption in NB, but it has received less attention than it warrants. Attribute weighting approaches can be broadly divided into two categories: filters and wrappers. In this paper, we mainly focus on wrapper attribute weighting approaches because they have generally higher classification performance than filter attribute weighting approaches. We propose a weight adjusted naive Bayes approach and simply denote it WANB. In WANB, the importance of each attribute in the classification of a training data set is learned and the weight vector reflecting this importance is updated. We use weight adjustment based on objective functions to find the optimal weight vector. We compare WANB with standard NB and its state-of-the-art attribute weighting approaches. Empirical studies on a collection of 36 benchmark datasets show that the classification performance of WANB significantly outperforms NB and all the existing filter approaches used to compare. Yet at the same time, compared to the existing wrapper approach called DEWANB, WANB is much more efficient and comprehensible.","PeriodicalId":254686,"journal":{"name":"2018 IEEE 30th International Conference on Tools with Artificial Intelligence (ICTAI)","volume":"24 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124981751","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Fine-Grained Hierarchical Classification of Plant Leaf Images Using Fusion of Deep Models 基于深度模型融合的植物叶片图像细粒度分层分类
Voncarlos M. Araújo, A. Britto, André L. Brun, Alessandro Lameiras Koerich, Luiz Oliveira
A fine-grained plant leaf classification method based on the fusion of deep models is described. Complementary global and patch-based leaf features are combined at each hierarchical level (genus and species) by pre-trained CNNs. The deep models are adapted for plant recognition by using data augmentation techniques to face the problem of plant classes with very few samples for training in the available imbalanced dataset. Experimental results have shown that the proposed coarse-to-fine classification strategy is a very promising alternative to deal with the low inter-class and high intra-class variability inherent to the problem of plant identification. The proposed method was able to surpass other state-of-the-art approaches on the ImageCLEF 2015 plant recognition dataset in terms of average classification scores.
提出了一种基于深度模型融合的细粒度植物叶片分类方法。通过预训练的cnn在每个层次(属和种)上组合互补的全局和基于斑块的叶子特征。利用数据增强技术,将深度模型应用于植物识别,以解决在可用的不平衡数据集中训练样本很少的植物类别问题。实验结果表明,提出的从粗到细的分类策略是一种非常有前途的替代方法,可以解决植物识别问题固有的低类间和高类内变异性。该方法在ImageCLEF 2015植物识别数据集上的平均分类分数超过了其他最先进的方法。
{"title":"Fine-Grained Hierarchical Classification of Plant Leaf Images Using Fusion of Deep Models","authors":"Voncarlos M. Araújo, A. Britto, André L. Brun, Alessandro Lameiras Koerich, Luiz Oliveira","doi":"10.1109/ICTAI.2018.00011","DOIUrl":"https://doi.org/10.1109/ICTAI.2018.00011","url":null,"abstract":"A fine-grained plant leaf classification method based on the fusion of deep models is described. Complementary global and patch-based leaf features are combined at each hierarchical level (genus and species) by pre-trained CNNs. The deep models are adapted for plant recognition by using data augmentation techniques to face the problem of plant classes with very few samples for training in the available imbalanced dataset. Experimental results have shown that the proposed coarse-to-fine classification strategy is a very promising alternative to deal with the low inter-class and high intra-class variability inherent to the problem of plant identification. The proposed method was able to surpass other state-of-the-art approaches on the ImageCLEF 2015 plant recognition dataset in terms of average classification scores.","PeriodicalId":254686,"journal":{"name":"2018 IEEE 30th International Conference on Tools with Artificial Intelligence (ICTAI)","volume":"56 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123440580","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 10
HetEOTL: An Algorithm for Heterogeneous Online Transfer Learning 异构在线迁移学习的一种算法
Qian Chen, Yunshu Du, Ming Xu, Chongjun Wang
Transfer learning is an important topic in machine learning and has been broadly studied for many years. However, most existing transfer learning methods assume the training sets are prepared in advance, which is often not the case in practice. Fortunately, online transfer learning (OTL), which addresses the transfer learning tasks in an online fashion, has been proposed to solve the problem. This paper mainly focuses on the heterogeneous OTL, which is in general very challenging because the feature space of target domain is different from that of the source domain. In order to enhance the learning performance, we designed the algorithm called Heterogeneous Ensembled Online Transfer Learning (HetEOTL) using ensemble learning strategy. Finally, we evaluate our algorithm on some benchmark datasets, and the experimental results show that HetEOTL has better performance than some other existing online learning and transfer learning algorithms, which proves the effectiveness of HetEOTL.
迁移学习是机器学习中的一个重要课题,已被广泛研究多年。然而,大多数现有的迁移学习方法都假设训练集是预先准备好的,而在实践中往往不是这样。幸运的是,在线迁移学习(online transfer learning, OTL)已经被提出来解决这个问题,它以在线的方式处理迁移学习任务。本文主要研究的是异构OTL,由于目标域的特征空间与源域的特征空间不同,异构OTL具有很大的挑战性。为了提高学习性能,采用集成学习策略设计了异构集成在线迁移学习算法(HetEOTL)。最后,我们在一些基准数据集上对算法进行了评估,实验结果表明,HetEOTL比现有的一些在线学习和迁移学习算法具有更好的性能,证明了HetEOTL的有效性。
{"title":"HetEOTL: An Algorithm for Heterogeneous Online Transfer Learning","authors":"Qian Chen, Yunshu Du, Ming Xu, Chongjun Wang","doi":"10.1109/ICTAI.2018.00062","DOIUrl":"https://doi.org/10.1109/ICTAI.2018.00062","url":null,"abstract":"Transfer learning is an important topic in machine learning and has been broadly studied for many years. However, most existing transfer learning methods assume the training sets are prepared in advance, which is often not the case in practice. Fortunately, online transfer learning (OTL), which addresses the transfer learning tasks in an online fashion, has been proposed to solve the problem. This paper mainly focuses on the heterogeneous OTL, which is in general very challenging because the feature space of target domain is different from that of the source domain. In order to enhance the learning performance, we designed the algorithm called Heterogeneous Ensembled Online Transfer Learning (HetEOTL) using ensemble learning strategy. Finally, we evaluate our algorithm on some benchmark datasets, and the experimental results show that HetEOTL has better performance than some other existing online learning and transfer learning algorithms, which proves the effectiveness of HetEOTL.","PeriodicalId":254686,"journal":{"name":"2018 IEEE 30th International Conference on Tools with Artificial Intelligence (ICTAI)","volume":"95 1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128685053","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Constrainedness in Stable Matching 稳定匹配中的约束
Guillaume Escamocher, B. O’Sullivan
In constraint satisfaction problems, constrainedness provides a way to predict the number of solutions: for instances of a same size, the number of constraints is inversely correlated with the number of solutions. However, there is no obvious equivalent metric for stable matching problems. We introduce the contrarian score, a simple metric that is to matching problems what constrainedness is to constraint satisfaction problems. In addition to comparing the contrarian score against other potential tightness metrics, we test it for different instance sizes as well as extremely distinct versions of the stable matching problem. In all cases, we find that the correlation between contrarian score and number of solutions is very strong.
在约束满足问题中,约束提供了一种预测解决方案数量的方法:对于相同大小的实例,约束的数量与解决方案的数量呈负相关。然而,对于稳定匹配问题,没有明显的等效度量。我们引入了逆向得分,这是一个简单的度量,对于匹配问题就像约束对于约束满足问题一样。除了将反向得分与其他潜在的紧密性指标进行比较外,我们还针对不同的实例大小以及稳定匹配问题的极端不同版本对其进行了测试。在所有情况下,我们发现逆向得分与解决方案数量之间的相关性非常强。
{"title":"Constrainedness in Stable Matching","authors":"Guillaume Escamocher, B. O’Sullivan","doi":"10.1109/ICTAI.2018.00112","DOIUrl":"https://doi.org/10.1109/ICTAI.2018.00112","url":null,"abstract":"In constraint satisfaction problems, constrainedness provides a way to predict the number of solutions: for instances of a same size, the number of constraints is inversely correlated with the number of solutions. However, there is no obvious equivalent metric for stable matching problems. We introduce the contrarian score, a simple metric that is to matching problems what constrainedness is to constraint satisfaction problems. In addition to comparing the contrarian score against other potential tightness metrics, we test it for different instance sizes as well as extremely distinct versions of the stable matching problem. In all cases, we find that the correlation between contrarian score and number of solutions is very strong.","PeriodicalId":254686,"journal":{"name":"2018 IEEE 30th International Conference on Tools with Artificial Intelligence (ICTAI)","volume":"44 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128939655","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Recursive Structure Similarity: A Novel Algorithm for Graph Clustering 递归结构相似度:一种新的图聚类算法
Yixin Fang, R. Jin, Wei Xiong, Xiaoning Qian, D. Dou, HaiNhat Phan
A various number of graph clustering algorithms have been proposed and applied in real-world applications such as network analysis, bio-informatics, social computing, and etc. However, existing algorithms usually focus on optimizing specified quality measures at the global network level, without carefully considering the destruction of local structures which could be informative and significant in practice. In this paper, we propose a novel clustering algorithm for undirected graphs based on a new structure similarity measure which is computed in a recursive procedure. Our method can provide robust and high-quality clustering results, while preserving informative local structures in the original graph. Rigorous experiments conducted on a variety of benchmark and protein datasets show that our algorithm consistently outperforms existing algorithms.
各种各样的图聚类算法已经被提出并应用于现实世界的应用,如网络分析、生物信息学、社会计算等。然而,现有的算法通常侧重于在全局网络层面上优化特定的质量度量,而没有仔细考虑在实践中可能具有信息和意义的局部结构的破坏。本文提出了一种新的无向图聚类算法,该算法基于递归计算的结构相似性度量。我们的方法可以提供鲁棒性和高质量的聚类结果,同时保留原始图中信息丰富的局部结构。在各种基准和蛋白质数据集上进行的严格实验表明,我们的算法始终优于现有算法。
{"title":"Recursive Structure Similarity: A Novel Algorithm for Graph Clustering","authors":"Yixin Fang, R. Jin, Wei Xiong, Xiaoning Qian, D. Dou, HaiNhat Phan","doi":"10.1109/ICTAI.2018.00068","DOIUrl":"https://doi.org/10.1109/ICTAI.2018.00068","url":null,"abstract":"A various number of graph clustering algorithms have been proposed and applied in real-world applications such as network analysis, bio-informatics, social computing, and etc. However, existing algorithms usually focus on optimizing specified quality measures at the global network level, without carefully considering the destruction of local structures which could be informative and significant in practice. In this paper, we propose a novel clustering algorithm for undirected graphs based on a new structure similarity measure which is computed in a recursive procedure. Our method can provide robust and high-quality clustering results, while preserving informative local structures in the original graph. Rigorous experiments conducted on a variety of benchmark and protein datasets show that our algorithm consistently outperforms existing algorithms.","PeriodicalId":254686,"journal":{"name":"2018 IEEE 30th International Conference on Tools with Artificial Intelligence (ICTAI)","volume":"72 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124184906","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
GT-Net: A Deep Learning Network for Gastric Tumor Diagnosis GT-Net:用于胃肿瘤诊断的深度学习网络
Yuexiang Li, Xinpeng Xie, Shaoxiong Liu, Xuechen Li, L. Shen
Gastric cancer is one of the most common cancers, which causes the second largest number of deaths in the world. Traditional diagnosis approach requires pathologists to manually annotate the gastric tumor in gastric slice for cancer identification, which is laborious and time-consuming. In this paper, we proposed a deep learning based framework, namely GT-Net, for automatic segmentation of gastric tumor. The proposed GT-Net adopts different architectures for shallow and deep layers for better feature extraction. We evaluate the proposed framework on publicly available BOT gastric slice dataset. The experimental results show that our GT-Net performs better than state-of-the-art networks like FCN-8s, U-net, and achieved a new state-of-the-art F1 score of 90.88% for gastric tumor segmentation.
胃癌是最常见的癌症之一,是世界上导致死亡人数第二多的癌症。传统的诊断方法需要病理学家在胃切片上手工标注胃肿瘤进行肿瘤鉴别,费时费力。本文提出了一种基于深度学习的胃肿瘤自动分割框架,即GT-Net。为了更好地提取特征,本文提出的GT-Net对浅层和深层采用了不同的体系结构。我们在公开可用的BOT胃切片数据集上评估了所提出的框架。实验结果表明,我们的GT-Net比FCN-8s、U-net等最先进的网络性能更好,在胃肿瘤分割上取得了新的最先进的F1分数90.88%。
{"title":"GT-Net: A Deep Learning Network for Gastric Tumor Diagnosis","authors":"Yuexiang Li, Xinpeng Xie, Shaoxiong Liu, Xuechen Li, L. Shen","doi":"10.1109/ICTAI.2018.00014","DOIUrl":"https://doi.org/10.1109/ICTAI.2018.00014","url":null,"abstract":"Gastric cancer is one of the most common cancers, which causes the second largest number of deaths in the world. Traditional diagnosis approach requires pathologists to manually annotate the gastric tumor in gastric slice for cancer identification, which is laborious and time-consuming. In this paper, we proposed a deep learning based framework, namely GT-Net, for automatic segmentation of gastric tumor. The proposed GT-Net adopts different architectures for shallow and deep layers for better feature extraction. We evaluate the proposed framework on publicly available BOT gastric slice dataset. The experimental results show that our GT-Net performs better than state-of-the-art networks like FCN-8s, U-net, and achieved a new state-of-the-art F1 score of 90.88% for gastric tumor segmentation.","PeriodicalId":254686,"journal":{"name":"2018 IEEE 30th International Conference on Tools with Artificial Intelligence (ICTAI)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126907954","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 12
Inducing Readable Oblique Decision Trees 诱导可读的倾斜决策树
Antonin Leroux, M. Boussard, R. Dès
Although machine learning models are found in more and more practical applications, stakeholders can be suspicious about the fact that they are not hard-coded and fully specified. To foster trust, it is crucial to provide models whose predictions are explainable. Decision Trees can be understood by humans if they are simple enough, but they suffer in accuracy when compared to other common machine learning methods. Oblique Decision Trees can provide better accuracy and smaller trees, but their decision rules are more complex. This article presents MUST (Multivariate Understandable Statistical Tree), an Oblique Decision Tree split algorithm based on Linear Discriminant Analysis that aims to preserve explainability by limiting the number of variables that appear in decision rules.
尽管机器学习模型在越来越多的实际应用中被发现,但利益相关者可能会怀疑它们没有硬编码和完全指定的事实。为了培养信任,提供预测可解释的模型至关重要。如果决策树足够简单,人类是可以理解的,但与其他常见的机器学习方法相比,它们的准确性会受到影响。倾斜决策树可以提供更好的准确性和更小的树,但它们的决策规则更复杂。本文提出了MUST(多元可理解统计树),这是一种基于线性判别分析的倾斜决策树分割算法,旨在通过限制决策规则中出现的变量数量来保持可解释性。
{"title":"Inducing Readable Oblique Decision Trees","authors":"Antonin Leroux, M. Boussard, R. Dès","doi":"10.1109/ICTAI.2018.00069","DOIUrl":"https://doi.org/10.1109/ICTAI.2018.00069","url":null,"abstract":"Although machine learning models are found in more and more practical applications, stakeholders can be suspicious about the fact that they are not hard-coded and fully specified. To foster trust, it is crucial to provide models whose predictions are explainable. Decision Trees can be understood by humans if they are simple enough, but they suffer in accuracy when compared to other common machine learning methods. Oblique Decision Trees can provide better accuracy and smaller trees, but their decision rules are more complex. This article presents MUST (Multivariate Understandable Statistical Tree), an Oblique Decision Tree split algorithm based on Linear Discriminant Analysis that aims to preserve explainability by limiting the number of variables that appear in decision rules.","PeriodicalId":254686,"journal":{"name":"2018 IEEE 30th International Conference on Tools with Artificial Intelligence (ICTAI)","volume":"25 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127245462","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
期刊
2018 IEEE 30th International Conference on Tools with Artificial Intelligence (ICTAI)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1