首页 > 最新文献

2019 IEEE International Conference on Big Knowledge (ICBK)最新文献

英文 中文
Sparse Tensor Decomposition for Multi-task Interaction Selection 稀疏张量分解用于多任务交互选择
Pub Date : 2019-11-01 DOI: 10.1109/ICBK.2019.00022
Jun-Yong Jeong, C. Jun
Multi-task learning aims to improve the generalization performance of related tasks based on simultaneous learning where prediction models share information. Recently, identifying significant feature interaction attracts more interests because of its practical importance. We propose a second-order interaction method for multi-task learning, which identifies significant linear and interaction terms. We develop a sparse tensor decomposition based on a feature augmentation and a symmetrization trick to express the prediction models of related tasks as the linear combinations of the shared parameters. We show that the proposed method could generate diverse relationships between linear and interaction terms. In minimizing the resulting multiconvex objective function, we select an initial value by deriving unbiased estimators and proposing a tensor decomposition. Experiments on synthetic and benchmark datasets demonstrate the effectiveness of the proposed method.
多任务学习的目的是在预测模型共享信息的同时学习的基础上提高相关任务的泛化性能。近年来,识别显著特征交互因其重要的现实意义而受到越来越多的关注。我们提出了一种用于多任务学习的二阶交互方法,该方法可以识别重要的线性项和交互项。本文提出了一种基于特征增强和对称化的稀疏张量分解方法,将相关任务的预测模型表示为共享参数的线性组合。结果表明,该方法可以生成线性项和交互项之间的多种关系。在最小化所得到的多凸目标函数时,我们通过推导无偏估计量和提出张量分解来选择初始值。在综合数据集和基准数据集上的实验证明了该方法的有效性。
{"title":"Sparse Tensor Decomposition for Multi-task Interaction Selection","authors":"Jun-Yong Jeong, C. Jun","doi":"10.1109/ICBK.2019.00022","DOIUrl":"https://doi.org/10.1109/ICBK.2019.00022","url":null,"abstract":"Multi-task learning aims to improve the generalization performance of related tasks based on simultaneous learning where prediction models share information. Recently, identifying significant feature interaction attracts more interests because of its practical importance. We propose a second-order interaction method for multi-task learning, which identifies significant linear and interaction terms. We develop a sparse tensor decomposition based on a feature augmentation and a symmetrization trick to express the prediction models of related tasks as the linear combinations of the shared parameters. We show that the proposed method could generate diverse relationships between linear and interaction terms. In minimizing the resulting multiconvex objective function, we select an initial value by deriving unbiased estimators and proposing a tensor decomposition. Experiments on synthetic and benchmark datasets demonstrate the effectiveness of the proposed method.","PeriodicalId":383917,"journal":{"name":"2019 IEEE International Conference on Big Knowledge (ICBK)","volume":"70 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116590213","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
A Performance Comparison of Cloud-Based Container Orchestration Tools 基于云的容器编排工具的性能比较
Pub Date : 2019-11-01 DOI: 10.1109/ICBK.2019.00033
Yao Pan, Ian Chen, F. Brasileiro, G. Jayaputera, R. Sinnott
Compared to the traditional approach of using virtual machines as the basis for the development and deployment of applications running in Cloud-based infrastructures, container technology provides developers with a higher degree of portability and availability, allowing developers to build and deploy their applications in a much more efficient and flexible manner. A number of tools have been proposed to orchestrate complex applications comprising multiple containers requiring continuous monitoring and management actions to meet application-oriented and non-functional requirements. Different container orchestration tools provide different features that incur different overheads. As such, it is not always easy for developers to choose the orchestration tool that will best suit their needs. In this paper we compare the benefits and overheads incurred by the most popular open source container orchestration tools currently available, namely: Kubernetes and Docker in Swarm mode. We undertake a number of benchmarking exercises from well-known benchmarking tools to evaluate the performance overheads of container orchestration tools and identify their pros and cons more generally. The results show that the overall performance of Kubernetes is slightly worse than that of Docker in Swarm mode. However, Docker in Swarm mode is not as flexible or powerful as Kubernetes in more complex situations.
与使用虚拟机作为在基于云的基础设施中开发和部署应用程序的基础的传统方法相比,容器技术为开发人员提供了更高程度的可移植性和可用性,允许开发人员以更高效、更灵活的方式构建和部署应用程序。已经提出了许多工具来编排包含多个容器的复杂应用程序,这些容器需要持续的监视和管理操作,以满足面向应用程序和非功能需求。不同的容器编排工具提供不同的特性,导致不同的开销。因此,对于开发人员来说,选择最适合他们需求的编排工具并不总是那么容易。在本文中,我们比较了目前最流行的开源容器编排工具带来的好处和开销,即:Kubernetes和Docker在Swarm模式下。我们从知名的基准测试工具中进行了许多基准测试练习,以评估容器编排工具的性能开销,并更普遍地确定它们的优缺点。结果表明,在Swarm模式下,Kubernetes的整体性能略差于Docker。然而,在更复杂的情况下,Docker在Swarm模式下并不像Kubernetes那样灵活或强大。
{"title":"A Performance Comparison of Cloud-Based Container Orchestration Tools","authors":"Yao Pan, Ian Chen, F. Brasileiro, G. Jayaputera, R. Sinnott","doi":"10.1109/ICBK.2019.00033","DOIUrl":"https://doi.org/10.1109/ICBK.2019.00033","url":null,"abstract":"Compared to the traditional approach of using virtual machines as the basis for the development and deployment of applications running in Cloud-based infrastructures, container technology provides developers with a higher degree of portability and availability, allowing developers to build and deploy their applications in a much more efficient and flexible manner. A number of tools have been proposed to orchestrate complex applications comprising multiple containers requiring continuous monitoring and management actions to meet application-oriented and non-functional requirements. Different container orchestration tools provide different features that incur different overheads. As such, it is not always easy for developers to choose the orchestration tool that will best suit their needs. In this paper we compare the benefits and overheads incurred by the most popular open source container orchestration tools currently available, namely: Kubernetes and Docker in Swarm mode. We undertake a number of benchmarking exercises from well-known benchmarking tools to evaluate the performance overheads of container orchestration tools and identify their pros and cons more generally. The results show that the overall performance of Kubernetes is slightly worse than that of Docker in Swarm mode. However, Docker in Swarm mode is not as flexible or powerful as Kubernetes in more complex situations.","PeriodicalId":383917,"journal":{"name":"2019 IEEE International Conference on Big Knowledge (ICBK)","volume":"253 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121135955","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 21
Edge Sign Prediction Based on Orthogonal Graph Regularized Nonnegative Matrix Factorization for Transfer Learning 基于正交图正则化非负矩阵分解的迁移学习边号预测
Pub Date : 2019-11-01 DOI: 10.1109/ICBK.2019.00050
Junwu Yu, Shuyin Xia, Guoyin Wang
In a signed graph, the edges have binary labels that indicate positive or negative relationships. In scenarios where some of the edge signs are unavailable, conventional learning methods will be ineffective. In contrast, transfer learning methods can improve the learning performance by using another network with adequate signs. In a social network, the problem often facedis that the network dimension is too high. Nonnegative Matrix Factorization (NMF) is a widely used matrix decomposition method to decrease the high dimensionality. However, the matrix that is generated may not be sparse enough, which can impact its representation ability. To address this problem, we propose Orthogonal Graph Regularized Nonnegative Matrix Factorization (OGNMF) to extract latent features from social networks and prove its convergence theoretically. Based on TrAdaBoost, a classical transfer learning algorithm, the experimental results using benchmark datasets demonstrate that our method has superior performance to the other baseline methods.
在有符号图中,边有二元标记,表示正或负关系。在某些边缘符号不可用的情况下,传统的学习方法将无效。相比之下,迁移学习方法可以通过使用另一个具有足够符号的网络来提高学习性能。在社交网络中,经常面临的问题是网络维度过高。非负矩阵分解(NMF)是一种用于降低高维数的矩阵分解方法。然而,生成的矩阵可能不够稀疏,这会影响其表示能力。为了解决这个问题,我们提出了正交图正则化非负矩阵分解(OGNMF)从社交网络中提取潜在特征,并从理论上证明了其收敛性。基于经典迁移学习算法TrAdaBoost,使用基准数据集的实验结果表明,该方法具有优于其他基准方法的性能。
{"title":"Edge Sign Prediction Based on Orthogonal Graph Regularized Nonnegative Matrix Factorization for Transfer Learning","authors":"Junwu Yu, Shuyin Xia, Guoyin Wang","doi":"10.1109/ICBK.2019.00050","DOIUrl":"https://doi.org/10.1109/ICBK.2019.00050","url":null,"abstract":"In a signed graph, the edges have binary labels that indicate positive or negative relationships. In scenarios where some of the edge signs are unavailable, conventional learning methods will be ineffective. In contrast, transfer learning methods can improve the learning performance by using another network with adequate signs. In a social network, the problem often facedis that the network dimension is too high. Nonnegative Matrix Factorization (NMF) is a widely used matrix decomposition method to decrease the high dimensionality. However, the matrix that is generated may not be sparse enough, which can impact its representation ability. To address this problem, we propose Orthogonal Graph Regularized Nonnegative Matrix Factorization (OGNMF) to extract latent features from social networks and prove its convergence theoretically. Based on TrAdaBoost, a classical transfer learning algorithm, the experimental results using benchmark datasets demonstrate that our method has superior performance to the other baseline methods.","PeriodicalId":383917,"journal":{"name":"2019 IEEE International Conference on Big Knowledge (ICBK)","volume":"81 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132979130","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Co-training Based on Semi-Supervised Ensemble Classification Approach for Multi-label Data Stream 基于半监督集成分类方法的多标签数据流协同训练
Pub Date : 2019-11-01 DOI: 10.1109/ICBK.2019.00016
Zhe Chu, Peipei Li, Xuegang Hu
A large amount of data streams in the form of texts and images has been emerging in many real-world applications. These data streams often present the characteristics such as multi-labels, label missing and new class emerging, which makes the existing data stream classification algorithm face the challenges in precision space and time performance. This is because, on the one hand, it is known that data stream classification algorithms are mostly trained on all labeled single-class data, while there are a large amount of unlabeled data and few labeled data due to it is difficult to obtain labels in the real world. On the other hand, many of existing multi-label data stream classification algorithms mostly focused on the classification with all labeled data and without emerging new classes, and there are few semi-supervised methods. Therefore, this paper proposes a semi-supervised ensemble classification algorithm for multi-label data streams based on co-training. Firstly, the algorithm uses the sliding window mechanism to partition the data stream into data chunks. On the former w data chucks, the multi-label semi-supervised classification algorithm COINS based on co-training is used to training a base classifier on each chunk, and then an ensemble model with w COINS classifiers is generated ensemble model to adapt to the environment of data stream with a large number of unlabeled data. Meanwhile, a new class emerging detection mechanism is introduced, and the w+1 data chunk is predicted by the ensemble model to detect whether there is a new class emerging. When a new label is detected, the classifier is retrained on the current data chunk, and the ensemble model is updated. Finally, experimental results on five real data sets show that: as compared with the classical algorithms, the proposed approach can improve the classification accuracy of multi-label data streams with a large number of missing labels and new labels emerging.
大量文本和图像形式的数据流已经出现在许多实际应用中。这些数据流往往呈现出多标签、标签缺失和新类别出现等特点,使得现有的数据流分类算法在精度、空间性能和时间性能方面面临挑战。这是因为,一方面,我们知道数据流分类算法大多是对所有标记的单类数据进行训练,而由于现实世界中很难获得标签,因此存在大量未标记的数据和很少的标记数据。另一方面,现有的多标签数据流分类算法大多集中在对所有标记数据的分类上,没有出现新的类,半监督的方法很少。为此,本文提出了一种基于协同训练的多标签数据流半监督集成分类算法。该算法首先利用滑动窗口机制将数据流划分为数据块;在前w个数据卡上,采用基于协同训练的多标签半监督分类算法COINS在每个数据块上训练一个基分类器,然后生成一个包含w个COINS分类器的集成模型,以适应具有大量无标签数据的数据流环境。同时,引入了一种新的类出现检测机制,通过集成模型对w+1数据块进行预测,检测是否有新类出现。当检测到新标签时,在当前数据块上重新训练分类器,并更新集成模型。最后,在5个真实数据集上的实验结果表明:与经典算法相比,本文提出的方法能够提高存在大量缺失标签和新标签出现的多标签数据流的分类精度。
{"title":"Co-training Based on Semi-Supervised Ensemble Classification Approach for Multi-label Data Stream","authors":"Zhe Chu, Peipei Li, Xuegang Hu","doi":"10.1109/ICBK.2019.00016","DOIUrl":"https://doi.org/10.1109/ICBK.2019.00016","url":null,"abstract":"A large amount of data streams in the form of texts and images has been emerging in many real-world applications. These data streams often present the characteristics such as multi-labels, label missing and new class emerging, which makes the existing data stream classification algorithm face the challenges in precision space and time performance. This is because, on the one hand, it is known that data stream classification algorithms are mostly trained on all labeled single-class data, while there are a large amount of unlabeled data and few labeled data due to it is difficult to obtain labels in the real world. On the other hand, many of existing multi-label data stream classification algorithms mostly focused on the classification with all labeled data and without emerging new classes, and there are few semi-supervised methods. Therefore, this paper proposes a semi-supervised ensemble classification algorithm for multi-label data streams based on co-training. Firstly, the algorithm uses the sliding window mechanism to partition the data stream into data chunks. On the former w data chucks, the multi-label semi-supervised classification algorithm COINS based on co-training is used to training a base classifier on each chunk, and then an ensemble model with w COINS classifiers is generated ensemble model to adapt to the environment of data stream with a large number of unlabeled data. Meanwhile, a new class emerging detection mechanism is introduced, and the w+1 data chunk is predicted by the ensemble model to detect whether there is a new class emerging. When a new label is detected, the classifier is retrained on the current data chunk, and the ensemble model is updated. Finally, experimental results on five real data sets show that: as compared with the classical algorithms, the proposed approach can improve the classification accuracy of multi-label data streams with a large number of missing labels and new labels emerging.","PeriodicalId":383917,"journal":{"name":"2019 IEEE International Conference on Big Knowledge (ICBK)","volume":"24 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123579648","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 10
A Multi-granularity Genetic Algorithm 一种多粒度遗传算法
Pub Date : 2019-11-01 DOI: 10.1109/ICBK.2019.00027
Caoxiao Li, Shuyin Xia, Zizhong Chen, Guoyin Wang
The genetic algorithm is a classical evolutionary algorithm that mainly consists of mutation and crossover operations. Existing genetic algorithms implement these two operations on the current population and rarely use the spatial information that has been traversed. To address this problem, this paper proposes an improved genetic algorithm that divides the feasible region into multiple granularities. It is called the multi-granularity genetic algorithm (MGGA). This algorithm adopts a multi-granularity space strategy based on a random tree, which accelerates the searching speed of the algorithm in the multi-granular space. Firstly, a hierarchical strategy is applied to the current population to accelerate the generation of good individuals. Then, the multi-granularity space strategy is used to increase the search intensity of the sparse space and the subspace, where the current optimal solution is located. The experimental results on six classical functions demonstrate that the proposed MGGA can improve the convergence speed and solution accuracy and reduce the number of calculations required for the fitness value.
遗传算法是一种经典的进化算法,主要由变异和交叉操作组成。现有的遗传算法在当前种群上实现这两种操作,很少使用已经遍历的空间信息。为了解决这一问题,本文提出了一种改进的遗传算法,将可行区域划分为多个粒度。它被称为多粒度遗传算法(MGGA)。该算法采用基于随机树的多粒度空间策略,加快了算法在多粒度空间中的搜索速度。首先,对现有种群采用分层策略,加速优秀个体的产生;然后,采用多粒度空间策略增加稀疏空间和当前最优解所在子空间的搜索强度;在6个经典函数上的实验结果表明,该算法提高了收敛速度和求解精度,减少了适应度值的计算次数。
{"title":"A Multi-granularity Genetic Algorithm","authors":"Caoxiao Li, Shuyin Xia, Zizhong Chen, Guoyin Wang","doi":"10.1109/ICBK.2019.00027","DOIUrl":"https://doi.org/10.1109/ICBK.2019.00027","url":null,"abstract":"The genetic algorithm is a classical evolutionary algorithm that mainly consists of mutation and crossover operations. Existing genetic algorithms implement these two operations on the current population and rarely use the spatial information that has been traversed. To address this problem, this paper proposes an improved genetic algorithm that divides the feasible region into multiple granularities. It is called the multi-granularity genetic algorithm (MGGA). This algorithm adopts a multi-granularity space strategy based on a random tree, which accelerates the searching speed of the algorithm in the multi-granular space. Firstly, a hierarchical strategy is applied to the current population to accelerate the generation of good individuals. Then, the multi-granularity space strategy is used to increase the search intensity of the sparse space and the subspace, where the current optimal solution is located. The experimental results on six classical functions demonstrate that the proposed MGGA can improve the convergence speed and solution accuracy and reduce the number of calculations required for the fitness value.","PeriodicalId":383917,"journal":{"name":"2019 IEEE International Conference on Big Knowledge (ICBK)","volume":"86 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122646813","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Approximate Query Answering in Complex Gaussian Mixture Models 复杂高斯混合模型的近似查询应答
Pub Date : 2019-11-01 DOI: 10.1109/ICBK.2019.00019
Mattis Hartwig, M. Gehrke, R. Möller
Gaussian mixture models are widely used in a diverse range of research fields. If the number of components and dimensions grow high, the computational costs for answering queries become unreasonably high for practical use. Therefore approximation approaches are necessary to make complex Gaussian mixture models more usable. The need for approximation approaches is also driven by the relatively recent representations that theoretically allow unlimited number of mixture components (e.g. nonparametric Bayesian networks or infinite mixture models). In this paper we introduce an approximate inference algorithm that splits the existing algorithm for query answering into two steps and uses the knowledge from the first step to reduce unnecessary calculations in the second step while maintaining a defined error bound. In highly complex mixture models we observed significant time savings even with low error bounds.
高斯混合模型广泛应用于各种研究领域。如果组件和维度的数量增长得很高,那么回答查询的计算成本对于实际使用来说就会变得不合理地高。因此,近似方法是必要的,以使复杂的高斯混合模型更可用。对近似方法的需求也受到相对较新的表示的推动,理论上允许无限数量的混合成分(例如非参数贝叶斯网络或无限混合模型)。本文介绍了一种近似推理算法,该算法将现有的查询回答算法分成两步,并利用第一步的知识减少第二步的不必要计算,同时保持定义的误差范围。在高度复杂的混合模型中,即使误差范围很低,我们也观察到显著的时间节省。
{"title":"Approximate Query Answering in Complex Gaussian Mixture Models","authors":"Mattis Hartwig, M. Gehrke, R. Möller","doi":"10.1109/ICBK.2019.00019","DOIUrl":"https://doi.org/10.1109/ICBK.2019.00019","url":null,"abstract":"Gaussian mixture models are widely used in a diverse range of research fields. If the number of components and dimensions grow high, the computational costs for answering queries become unreasonably high for practical use. Therefore approximation approaches are necessary to make complex Gaussian mixture models more usable. The need for approximation approaches is also driven by the relatively recent representations that theoretically allow unlimited number of mixture components (e.g. nonparametric Bayesian networks or infinite mixture models). In this paper we introduce an approximate inference algorithm that splits the existing algorithm for query answering into two steps and uses the knowledge from the first step to reduce unnecessary calculations in the second step while maintaining a defined error bound. In highly complex mixture models we observed significant time savings even with low error bounds.","PeriodicalId":383917,"journal":{"name":"2019 IEEE International Conference on Big Knowledge (ICBK)","volume":"19 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133545568","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
U-Net Based Defects Inspection in Photovoltaic Electroluminecscence Images 基于U-Net的光伏电致发光图像缺陷检测
Pub Date : 2019-11-01 DOI: 10.1109/ICBK.2019.00036
Muhammad Rameez Ur Rahman, Haiyong Chen, Wen Xi
Efficient defects segmentation from photovoltaic (PV) electroluminescence (EL) images is a crucial process due to the random inhomogeneous background and unbalanced crack non-crack pixel distribution. The automatic defect inspection of solar cells greatly influences the quality of photovoltaic cells, so it is necessary to examine defects efficiently and accurately. In this paper we propose a novel end to end deep learning-based architecture for defects segmentation. In the proposed architecture we introduce a novel global attention to extract rich context information. Further, we modified the U-net by adding dilated convolution at both encoder and decoder side with skip connections from early layers to later layers at encoder side. Then the proposed global attention is incorporated into the modified U-net. The model is trained and tested on Photovoltaic electroluminescence 512x512 images dataset and the results are recorded using mean Intersection over union (IOU). In experiments, we reported the results and made comparison between the proposed model and other state of the art methods. The mean IOU of proposed method is 0.6477 with pixel accuracy 0.9738 which is better than the state-of-the-art methods. We demonstrate that the proposed method can give effective results with smaller dataset and is computationally efficient.
由于光伏电致发光图像背景随机不均匀、裂纹非裂纹像元分布不平衡等特点,对其进行有效的缺陷分割至关重要。太阳能电池的缺陷自动检测对光伏电池的质量影响很大,因此对缺陷进行高效、准确的检测是十分必要的。本文提出了一种新的基于端到端深度学习的缺陷分割体系结构。在提出的架构中,我们引入了一种新的全局关注来提取丰富的上下文信息。此外,我们通过在编码器和解码器侧添加扩展卷积来修改U-net,并在编码器侧从早期层到后期层进行跳过连接。然后将建议的全球关注纳入改进的U-net中。在光伏电致发光512x512图像数据集上对该模型进行训练和测试,并使用平均交汇超过联合(Intersection over union, IOU)记录结果。在实验中,我们报告了结果,并将所提出的模型与其他最先进的方法进行了比较。该方法的平均IOU为0.6477,像素精度为0.9738,优于现有方法。结果表明,该方法可以在较小的数据集上得到有效的结果,并且计算效率高。
{"title":"U-Net Based Defects Inspection in Photovoltaic Electroluminecscence Images","authors":"Muhammad Rameez Ur Rahman, Haiyong Chen, Wen Xi","doi":"10.1109/ICBK.2019.00036","DOIUrl":"https://doi.org/10.1109/ICBK.2019.00036","url":null,"abstract":"Efficient defects segmentation from photovoltaic (PV) electroluminescence (EL) images is a crucial process due to the random inhomogeneous background and unbalanced crack non-crack pixel distribution. The automatic defect inspection of solar cells greatly influences the quality of photovoltaic cells, so it is necessary to examine defects efficiently and accurately. In this paper we propose a novel end to end deep learning-based architecture for defects segmentation. In the proposed architecture we introduce a novel global attention to extract rich context information. Further, we modified the U-net by adding dilated convolution at both encoder and decoder side with skip connections from early layers to later layers at encoder side. Then the proposed global attention is incorporated into the modified U-net. The model is trained and tested on Photovoltaic electroluminescence 512x512 images dataset and the results are recorded using mean Intersection over union (IOU). In experiments, we reported the results and made comparison between the proposed model and other state of the art methods. The mean IOU of proposed method is 0.6477 with pixel accuracy 0.9738 which is better than the state-of-the-art methods. We demonstrate that the proposed method can give effective results with smaller dataset and is computationally efficient.","PeriodicalId":383917,"journal":{"name":"2019 IEEE International Conference on Big Knowledge (ICBK)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121712107","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
[Copyright notice] (版权)
Pub Date : 2019-11-01 DOI: 10.1109/icbk.2019.00003
{"title":"[Copyright notice]","authors":"","doi":"10.1109/icbk.2019.00003","DOIUrl":"https://doi.org/10.1109/icbk.2019.00003","url":null,"abstract":"","PeriodicalId":383917,"journal":{"name":"2019 IEEE International Conference on Big Knowledge (ICBK)","volume":"82 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124407366","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Ensemble Classification Method Based on Truth Discovery 基于真相发现的集成分类方法
Pub Date : 2019-11-01 DOI: 10.1109/ICBK.2019.00024
Yuxin Jin, Ze Yang, Ying He, Xianyu Bao, Gongqing Wu
Classification is a hot topic in such fields as machine learning and data mining. The traditional approach of machine learning is to find a classifier closest to the real classification function, while ensemble classification is to integrate the results of base classifiers, then make an overall prediction. Compared to using a single classifier, ensemble classification can significantly improve the generalization of the learning system in most cases. However, the existing ensemble classification methods rarely consider the weight of the classifier, and there are few methods to consider updating the weights dynamically. In this paper, we are inspired by the idea of truth discovery and propose a new ensemble classification method based on the truth discovery (named ECTD). As far as we know, we are the first to apply the idea of truth discovery in the field of ensemble learning. Experimental results demonstrate that the proposed method performs well in ensemble classification.
分类是机器学习和数据挖掘等领域的热门话题。传统的机器学习方法是寻找最接近真实分类函数的分类器,而集成分类是将基分类器的结果进行整合,然后进行整体预测。与使用单一分类器相比,集成分类在大多数情况下可以显著提高学习系统的泛化能力。然而,现有的集成分类方法很少考虑分类器的权值,也很少考虑权值的动态更新。本文受真值发现思想的启发,提出了一种新的基于真值发现的集成分类方法(ECTD)。据我们所知,我们是第一个将真理发现的思想应用于集成学习领域的。实验结果表明,该方法具有较好的集成分类效果。
{"title":"Ensemble Classification Method Based on Truth Discovery","authors":"Yuxin Jin, Ze Yang, Ying He, Xianyu Bao, Gongqing Wu","doi":"10.1109/ICBK.2019.00024","DOIUrl":"https://doi.org/10.1109/ICBK.2019.00024","url":null,"abstract":"Classification is a hot topic in such fields as machine learning and data mining. The traditional approach of machine learning is to find a classifier closest to the real classification function, while ensemble classification is to integrate the results of base classifiers, then make an overall prediction. Compared to using a single classifier, ensemble classification can significantly improve the generalization of the learning system in most cases. However, the existing ensemble classification methods rarely consider the weight of the classifier, and there are few methods to consider updating the weights dynamically. In this paper, we are inspired by the idea of truth discovery and propose a new ensemble classification method based on the truth discovery (named ECTD). As far as we know, we are the first to apply the idea of truth discovery in the field of ensemble learning. Experimental results demonstrate that the proposed method performs well in ensemble classification.","PeriodicalId":383917,"journal":{"name":"2019 IEEE International Conference on Big Knowledge (ICBK)","volume":"4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132681392","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
ICDM/ICBK 2019 Panel: Marketing Intelligence – Let Marketing Drive Efficiency and Innovation ICDM/ICBK 2019专题讨论:营销智能——让营销驱动效率和创新
Pub Date : 2019-11-01 DOI: 10.1109/icbk.2019.00008
Xindong Wu
Marketing connects product/service providers and customers. It runs through the whole life cycle of an organization (such as a manufacturing enterprise or a public safety department), including market opportunities, market penetration, market developments, product/service innovation, and possibly market renovation. Marketing intelligence (MI) seeks to facilitate a positive cycle among market opportunities, market penetration, and market developments, not just intelligent marketing. It applies AI, Big Data and CRM technologies to analyze huge amounts of heterogeneous multi-source data, and supports intelligent decision-making by mining operational patterns from production and consumption data, and providing data insights, customer profiling, brand analysis, personalized advertising, product/service recommendations, supply chain integration and inventory management.
市场营销连接产品/服务供应商和客户。它贯穿于一个组织(如制造企业或公共安全部门)的整个生命周期,包括市场机会、市场渗透、市场开发、产品/服务创新,以及可能的市场革新。营销情报(MI)寻求促进市场机会、市场渗透和市场发展之间的良性循环,而不仅仅是智能营销。它应用AI、大数据和CRM技术,分析海量异构多源数据,通过从生产和消费数据中挖掘运营模式,提供数据洞察、客户分析、品牌分析、个性化广告、产品/服务推荐、供应链集成和库存管理,支持智能决策。
{"title":"ICDM/ICBK 2019 Panel: Marketing Intelligence – Let Marketing Drive Efficiency and Innovation","authors":"Xindong Wu","doi":"10.1109/icbk.2019.00008","DOIUrl":"https://doi.org/10.1109/icbk.2019.00008","url":null,"abstract":"Marketing connects product/service providers and customers. It runs through the whole life cycle of an organization (such as a manufacturing enterprise or a public safety department), including market opportunities, market penetration, market developments, product/service innovation, and possibly market renovation. Marketing intelligence (MI) seeks to facilitate a positive cycle among market opportunities, market penetration, and market developments, not just intelligent marketing. It applies AI, Big Data and CRM technologies to analyze huge amounts of heterogeneous multi-source data, and supports intelligent decision-making by mining operational patterns from production and consumption data, and providing data insights, customer profiling, brand analysis, personalized advertising, product/service recommendations, supply chain integration and inventory management.","PeriodicalId":383917,"journal":{"name":"2019 IEEE International Conference on Big Knowledge (ICBK)","volume":"99 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117230477","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
2019 IEEE International Conference on Big Knowledge (ICBK)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1