首页 > 最新文献

Knowledge-Based Systems最新文献

英文 中文
Domain generalization via geometric adaptation over augmented data
IF 7.2 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-11-26 DOI: 10.1016/j.knosys.2024.112765
Ali Atghaei, Mohammad Rahmati
This article addresses the challenge of adapting deep learning models trained on specific datasets to effectively generalize to similar-class dataset with different underlying distributions. We introduce a novel deep representation learning method that takes into account both statistical and geometric properties of features for domain generalization. Our approach utilizes Fourier augmentation and Nyström estimation to evaluate the similarity between graphs derived from original and augmented data features. Furthermore, we employ a contrastive loss function to maintain proximity among samples belonging to the same class while ensuring separation between samples from different classes in the feature space. By minimizing these loss functions, our method aims to enhance model generalizability across diverse domains. Comprehensive experiments conducted on real-world benchmark datasets, including PACS, Office-Home, VLCS, Digits-DG and UTKFace, demonstrate the effectiveness of the proposed method. The results consistently indicate superior performance compared to other approaches under various conditions, underscoring its robustness in achieving improved generalization across domains.
{"title":"Domain generalization via geometric adaptation over augmented data","authors":"Ali Atghaei,&nbsp;Mohammad Rahmati","doi":"10.1016/j.knosys.2024.112765","DOIUrl":"10.1016/j.knosys.2024.112765","url":null,"abstract":"<div><div>This article addresses the challenge of adapting deep learning models trained on specific datasets to effectively generalize to similar-class dataset with different underlying distributions. We introduce a novel deep representation learning method that takes into account both statistical and geometric properties of features for domain generalization. Our approach utilizes Fourier augmentation and Nyström estimation to evaluate the similarity between graphs derived from original and augmented data features. Furthermore, we employ a contrastive loss function to maintain proximity among samples belonging to the same class while ensuring separation between samples from different classes in the feature space. By minimizing these loss functions, our method aims to enhance model generalizability across diverse domains. Comprehensive experiments conducted on real-world benchmark datasets, including PACS, Office-Home, VLCS, Digits-DG and UTKFace, demonstrate the effectiveness of the proposed method. The results consistently indicate superior performance compared to other approaches under various conditions, underscoring its robustness in achieving improved generalization across domains.</div></div>","PeriodicalId":49939,"journal":{"name":"Knowledge-Based Systems","volume":"309 ","pages":"Article 112765"},"PeriodicalIF":7.2,"publicationDate":"2024-11-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142748612","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Gradient consistency strategy cooperative meta-feature learning for mixed domain generalized machine fault diagnosis
IF 7.2 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-11-26 DOI: 10.1016/j.knosys.2024.112771
Shushuai Xie , Wei Cheng , Ji Xing , Xuefeng Chen , Zelin Nie , Qian Huang , Rongyong Zhang
Recently, fault diagnosis methods based on domain generalization (DG) have been developed to improve the diagnostic performance of unseen target domains by multi-source domain knowledge transfer. However, existing methods assume that the source domains are discrete and that domain labels are known a priori, which is difficult to satisfy in complex and changing industrial systems. In addition, the gradient update conflict caused by the specific information of source domains leads to the degradation of the DG performance. Therefore, in this study, we relax the discrete domain assumption to the mixed domain setting and propose a novel gradient-consistency strategy cooperative meta-feature learning for mixed-domain generalized machine fault diagnosis. First, a domain feature-guided adaptive normalization module is proposed to normalize the underlying distribution of multi-source domains, and the mixed-source domains are divided into potential domain clusters. Then, a novel meta-feature encoding method is proposed to explicitly encode the overall fault feature structure, which is used to learn the generalized fault feature representation. Finally, a novel gradient consistency update strategy is designed to reduce the impact of domain-specific differences on model generalization. The effectiveness and superiority of the proposed method are verified on many DG diagnostic tasks on two public bearing datasets and the nuclear circulating water pump planetary gearbox dataset.
{"title":"Gradient consistency strategy cooperative meta-feature learning for mixed domain generalized machine fault diagnosis","authors":"Shushuai Xie ,&nbsp;Wei Cheng ,&nbsp;Ji Xing ,&nbsp;Xuefeng Chen ,&nbsp;Zelin Nie ,&nbsp;Qian Huang ,&nbsp;Rongyong Zhang","doi":"10.1016/j.knosys.2024.112771","DOIUrl":"10.1016/j.knosys.2024.112771","url":null,"abstract":"<div><div>Recently, fault diagnosis methods based on domain generalization (DG) have been developed to improve the diagnostic performance of unseen target domains by multi-source domain knowledge transfer. However, existing methods assume that the source domains are discrete and that domain labels are known a priori, which is difficult to satisfy in complex and changing industrial systems. In addition, the gradient update conflict caused by the specific information of source domains leads to the degradation of the DG performance. Therefore, in this study, we relax the discrete domain assumption to the mixed domain setting and propose a novel gradient-consistency strategy cooperative meta-feature learning for mixed-domain generalized machine fault diagnosis. First, a domain feature-guided adaptive normalization module is proposed to normalize the underlying distribution of multi-source domains, and the mixed-source domains are divided into potential domain clusters. Then, a novel meta-feature encoding method is proposed to explicitly encode the overall fault feature structure, which is used to learn the generalized fault feature representation. Finally, a novel gradient consistency update strategy is designed to reduce the impact of domain-specific differences on model generalization. The effectiveness and superiority of the proposed method are verified on many DG diagnostic tasks on two public bearing datasets and the nuclear circulating water pump planetary gearbox dataset.</div></div>","PeriodicalId":49939,"journal":{"name":"Knowledge-Based Systems","volume":"309 ","pages":"Article 112771"},"PeriodicalIF":7.2,"publicationDate":"2024-11-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142748708","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Blind image quality assessment for in-the-wild images by integrating distorted patch selection and multi-scale-and-granularity fusion
IF 7.2 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-11-26 DOI: 10.1016/j.knosys.2024.112772
Jili Xia , Lihuo He , Xinbo Gao , Bo Hu
Images taken in natural environments often exhibit complicated distortions, posing significant challenges for assessing their quality. Although current methods prioritize the perception of image contents and distortions, few explicitly investigate local distortions, a crucial factor affecting human visual perception. To mitigate this, this paper proposes a novel blind image quality assessment (IQA) method for in-the-wild images, termed DPSF, which integrates Distorted Patch Selection and multi-scale and multi-granularity feature Fusion. Specifically, it is first explained that the distributions of the mean subtracted contrast normalized coefficients of distorted patches differ from those of undistorted patches. Building upon this, an effective strategy for distorted patch selection is devised. Subsequently, a hybrid Transformer-convolutional neural network (CNN) module is proposed to exploit the benefits of both Transformer and CNN for distortion perception, in which the long-range dependencies of the selected patches are considered. Finally, an effective fusion module is employed for image quality evaluation, amalgamating finer and richer semantic and distortion features from multiple scales and granularities. Experimental results on five authentic IQA databases demonstrate that the proposed method yields more precise quality predictions compared with the state-of-the-art methods.
{"title":"Blind image quality assessment for in-the-wild images by integrating distorted patch selection and multi-scale-and-granularity fusion","authors":"Jili Xia ,&nbsp;Lihuo He ,&nbsp;Xinbo Gao ,&nbsp;Bo Hu","doi":"10.1016/j.knosys.2024.112772","DOIUrl":"10.1016/j.knosys.2024.112772","url":null,"abstract":"<div><div>Images taken in natural environments often exhibit complicated distortions, posing significant challenges for assessing their quality. Although current methods prioritize the perception of image contents and distortions, few explicitly investigate local distortions, a crucial factor affecting human visual perception. To mitigate this, this paper proposes a novel blind image quality assessment (IQA) method for in-the-wild images, termed DPSF, which integrates Distorted Patch Selection and multi-scale and multi-granularity feature Fusion. Specifically, it is first explained that the distributions of the mean subtracted contrast normalized coefficients of distorted patches differ from those of undistorted patches. Building upon this, an effective strategy for distorted patch selection is devised. Subsequently, a hybrid Transformer-convolutional neural network (CNN) module is proposed to exploit the benefits of both Transformer and CNN for distortion perception, in which the long-range dependencies of the selected patches are considered. Finally, an effective fusion module is employed for image quality evaluation, amalgamating finer and richer semantic and distortion features from multiple scales and granularities. Experimental results on five authentic IQA databases demonstrate that the proposed method yields more precise quality predictions compared with the state-of-the-art methods.</div></div>","PeriodicalId":49939,"journal":{"name":"Knowledge-Based Systems","volume":"309 ","pages":"Article 112772"},"PeriodicalIF":7.2,"publicationDate":"2024-11-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142757412","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A deep unfolding network based on intrinsic image decomposition for pansharpening 基于内在图像分解的深度展开网络,用于平锐化
IF 7.2 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-11-23 DOI: 10.1016/j.knosys.2024.112764
Yufei Ge , Xiaoli Zhang , Bo Huang , Xiongfei Li , Siwei Ma
Pansharpening aims to obtain high-resolution multispectral images by fusing panchromatic and low-resolution multispectral images. However, current deep learning-based methods lack reasonable interpretability and suffer from certain spectral and spatial distortions. To address these issues, we propose an interpretable deep unfolding network based on intrinsic image decomposition, called DUN-IID. IID decomposes the multispectral image into a reflectance component and a shading component to formulate a novel variational optimization function. The reflectance and shading components effectively reflect spectral and spatial information, respectively. This decoupling strategy enhances feature fidelity by preventing interference between spectral and spatial information, enabling independent optimization of spatial reconstruction and spectral correction during fusion. The optimization function is solved by the half-quadratic splitting method and unfolded into the end-to-end DUN-IID, which consists of two primal update blocks for prior learning and two dual update blocks for reconstruction. To alleviate the effects of information loss across intermediate stages, we introduce the source images into two primal update blocks for information enhancement. Therefore, the reflectance and shading primal update blocks are customized as a multi-scale structure and a band-aware construction, respectively. Besides, the multi-dimension attention mechanism is adopted to improve feature representation. Extensive experiments validate that our method is superior to other state-of-the-art methods.
全色锐化旨在通过融合全色和低分辨率多光谱图像来获得高分辨率多光谱图像。然而,目前基于深度学习的方法缺乏合理的可解释性,并受到某些光谱和空间失真的影响。为了解决这些问题,我们提出了一种基于内在图像分解的可解释深度展开网络,称为 DUN-IID。IID 将多光谱图像分解为反射分量和阴影分量,从而形成一个新颖的变分优化函数。反射分量和阴影分量分别有效地反映了光谱信息和空间信息。这种解耦策略通过防止光谱和空间信息之间的干扰来提高特征保真度,从而在融合过程中实现空间重建和光谱校正的独立优化。优化函数采用半二次分裂法求解,并展开为端到端 DUN-IID 算法,其中包括两个用于先验学习的原始更新块和两个用于重构的双重更新块。为了减轻中间阶段信息丢失的影响,我们将源图像引入两个原始更新块进行信息增强。因此,反射和阴影基元更新块分别被定制为多尺度结构和带感知结构。此外,我们还采用了多维度关注机制来改进特征表示。广泛的实验验证了我们的方法优于其他最先进的方法。
{"title":"A deep unfolding network based on intrinsic image decomposition for pansharpening","authors":"Yufei Ge ,&nbsp;Xiaoli Zhang ,&nbsp;Bo Huang ,&nbsp;Xiongfei Li ,&nbsp;Siwei Ma","doi":"10.1016/j.knosys.2024.112764","DOIUrl":"10.1016/j.knosys.2024.112764","url":null,"abstract":"<div><div>Pansharpening aims to obtain high-resolution multispectral images by fusing panchromatic and low-resolution multispectral images. However, current deep learning-based methods lack reasonable interpretability and suffer from certain spectral and spatial distortions. To address these issues, we propose an interpretable deep unfolding network based on intrinsic image decomposition, called DUN-IID. IID decomposes the multispectral image into a reflectance component and a shading component to formulate a novel variational optimization function. The reflectance and shading components effectively reflect spectral and spatial information, respectively. This decoupling strategy enhances feature fidelity by preventing interference between spectral and spatial information, enabling independent optimization of spatial reconstruction and spectral correction during fusion. The optimization function is solved by the half-quadratic splitting method and unfolded into the end-to-end DUN-IID, which consists of two primal update blocks for prior learning and two dual update blocks for reconstruction. To alleviate the effects of information loss across intermediate stages, we introduce the source images into two primal update blocks for information enhancement. Therefore, the reflectance and shading primal update blocks are customized as a multi-scale structure and a band-aware construction, respectively. Besides, the multi-dimension attention mechanism is adopted to improve feature representation. Extensive experiments validate that our method is superior to other state-of-the-art methods.</div></div>","PeriodicalId":49939,"journal":{"name":"Knowledge-Based Systems","volume":"308 ","pages":"Article 112764"},"PeriodicalIF":7.2,"publicationDate":"2024-11-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142720584","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
CABiLSTM-BERT: Aspect-based sentiment analysis model based on deep implicit feature extraction
IF 7.2 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-11-23 DOI: 10.1016/j.knosys.2024.112782
Bo He, Ruoyu Zhao, Dali Tang
Aspect-based sentiment analysis (ABSA) models typically focus on learning contextual syntactic information and dependency relations. However, these models often struggle with losing or forgetting implicit feature information from shallow and intermediate layers during the learning process, potentially compromising classification performance. We consider the implicit feature information in each layer of the model to be equally important for processing. So, this paper proposes the CABiLSTM-BERT model, which aims to fully leverage implicit features at each layer to address this information loss problem and improve accuracy. The CABiLSTM-BERT model employs a frozen BERT pre-trained model to extract text word vector features, reducing overfitting and accelerating training. These word vectors are then processed through CABiLSTM, which preserves implicit feature representations of input sequences and LSTMs in each direction and layer. The model applies convolution to merge all features into a set of embedding representations after highlighting important features through multi-head self-attention calculations for each feature group. This approach minimizes information loss and maximizes utilization of important implicit feature information at each layer. Finally, the feature representations undergo average pooling before passing through the sentiment classification layer for polarity prediction. The effectiveness of the CABiLSTM-BERT model is validated using five publicly available real-world datasets and evaluated using metrics such as accuracy and Macro-F1. Results demonstrate the model's efficacy in addressing ABSA tasks.
{"title":"CABiLSTM-BERT: Aspect-based sentiment analysis model based on deep implicit feature extraction","authors":"Bo He,&nbsp;Ruoyu Zhao,&nbsp;Dali Tang","doi":"10.1016/j.knosys.2024.112782","DOIUrl":"10.1016/j.knosys.2024.112782","url":null,"abstract":"<div><div>Aspect-based sentiment analysis (ABSA) models typically focus on learning contextual syntactic information and dependency relations. However, these models often struggle with losing or forgetting implicit feature information from shallow and intermediate layers during the learning process, potentially compromising classification performance. We consider the implicit feature information in each layer of the model to be equally important for processing. So, this paper proposes the CABiLSTM-BERT model, which aims to fully leverage implicit features at each layer to address this information loss problem and improve accuracy. The CABiLSTM-BERT model employs a frozen BERT pre-trained model to extract text word vector features, reducing overfitting and accelerating training. These word vectors are then processed through CABiLSTM, which preserves implicit feature representations of input sequences and LSTMs in each direction and layer. The model applies convolution to merge all features into a set of embedding representations after highlighting important features through multi-head self-attention calculations for each feature group. This approach minimizes information loss and maximizes utilization of important implicit feature information at each layer. Finally, the feature representations undergo average pooling before passing through the sentiment classification layer for polarity prediction. The effectiveness of the CABiLSTM-BERT model is validated using five publicly available real-world datasets and evaluated using metrics such as accuracy and Macro-F1. Results demonstrate the model's efficacy in addressing ABSA tasks.</div></div>","PeriodicalId":49939,"journal":{"name":"Knowledge-Based Systems","volume":"309 ","pages":"Article 112782"},"PeriodicalIF":7.2,"publicationDate":"2024-11-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142748615","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A new graph-based clustering method with dual-feature regularization and Laplacian rank constraint
IF 7.2 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-11-23 DOI: 10.1016/j.knosys.2024.112738
Hengdong Zhu , Yingshan Shen , Choujun Zhan , Fu Lee Wang , Heng Weng , Tianyong Hao
The performance of graph-based clustering is commonly limited by two-stage processing (Constructing and dividing similarity graph) and the quality of similar graphs. To this end, we propose a new graph-based clustering method with dual-feature regularization and Laplacian rank constraint. Specifically, our method reveals the clustering structure and unifies the two-stage process. It imposes a Laplacian rank constraint on the similarity graph to ensure that it has C connected components. In addition, a method based on dual-feature regularization is designed to capture local data feature information from both feature extraction and adaptive regression, and is applied to an accurate distance metric learning. A reweighting optimization is integrated to learn a high-quality robust similarity graph. Comprehensive experiments on Ecoli, Yale and Yeast datasets show that our method outperforms the existing graph-based clustering methods with an average improvement of about 4%, 5% and 7% on the evaluation metrics ACC, NMI and RI, respectively.
{"title":"A new graph-based clustering method with dual-feature regularization and Laplacian rank constraint","authors":"Hengdong Zhu ,&nbsp;Yingshan Shen ,&nbsp;Choujun Zhan ,&nbsp;Fu Lee Wang ,&nbsp;Heng Weng ,&nbsp;Tianyong Hao","doi":"10.1016/j.knosys.2024.112738","DOIUrl":"10.1016/j.knosys.2024.112738","url":null,"abstract":"<div><div>The performance of graph-based clustering is commonly limited by two-stage processing (Constructing and dividing similarity graph) and the quality of similar graphs. To this end, we propose a new graph-based clustering method with dual-feature regularization and Laplacian rank constraint. Specifically, our method reveals the clustering structure and unifies the two-stage process. It imposes a Laplacian rank constraint on the similarity graph to ensure that it has <span><math><mi>C</mi></math></span> connected components. In addition, a method based on dual-feature regularization is designed to capture local data feature information from both feature extraction and adaptive regression, and is applied to an accurate distance metric learning. A reweighting optimization is integrated to learn a high-quality robust similarity graph. Comprehensive experiments on Ecoli, Yale and Yeast datasets show that our method outperforms the existing graph-based clustering methods with an average improvement of about 4%, 5% and 7% on the evaluation metrics ACC, NMI and RI, respectively.</div></div>","PeriodicalId":49939,"journal":{"name":"Knowledge-Based Systems","volume":"309 ","pages":"Article 112738"},"PeriodicalIF":7.2,"publicationDate":"2024-11-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142748704","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A novel multiscale adaptive graph adversarial network for mechanical fault diagnosis
IF 7.2 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-11-23 DOI: 10.1016/j.knosys.2024.112787
Jiayang Liu , Chaobing Wang , Rui Wang , Qian Xiao , Xiaosun Wang , Shijing Wu , Long Zhang
The measured signals of mechanical equipment may exhibit distribution discrepancies due to object variations or operational conditions. The multiscale framework has been demonstrated to be effective in enriching features. However, existing methods overlook the structural discrepancies and collaborative contributions at specific scales. Meanwhile, most current domain adaptation strategies predominantly consider class and domain labels, neglecting feature structural shifts, especially in subdomain feature structures. Based on that, a new cross-domain fault diagnosis method for mechanical is proposed in this paper, called multiscale adaptive graph adversarial network(MAGAN). MAGAN consists of a feature extractor, domain adaptation module, and classifier. In the feature extractor, a hierarchical residual multiscale graph learning module is employed to obtain rich features, an adaptive graph learning module is utilized to learn differentiated representations of specific scale structures, and a multiscale fusion module is applied to facilitate the collaboration of different scale features. After that, the domain adaptation module aids the feature extractor in learning transferable features by constructing a measure of subdomain feature structure discrepancy and adversarial domain discriminator. The classifier is then utilized for cross-domain fault diagnosis on the extracted transferable features. Finally, the proposed MAGAN is evaluated using 3 cross-machine transfer scenarios based on a scaled-down test rig for wind turbine gearbox and 12 cross-operating conditions transfer scenarios based on a published bearing dataset. The results validate the transferability and generalization of MAGAN.
{"title":"A novel multiscale adaptive graph adversarial network for mechanical fault diagnosis","authors":"Jiayang Liu ,&nbsp;Chaobing Wang ,&nbsp;Rui Wang ,&nbsp;Qian Xiao ,&nbsp;Xiaosun Wang ,&nbsp;Shijing Wu ,&nbsp;Long Zhang","doi":"10.1016/j.knosys.2024.112787","DOIUrl":"10.1016/j.knosys.2024.112787","url":null,"abstract":"<div><div>The measured signals of mechanical equipment may exhibit distribution discrepancies due to object variations or operational conditions. The multiscale framework has been demonstrated to be effective in enriching features. However, existing methods overlook the structural discrepancies and collaborative contributions at specific scales. Meanwhile, most current domain adaptation strategies predominantly consider class and domain labels, neglecting feature structural shifts, especially in subdomain feature structures. Based on that, a new cross-domain fault diagnosis method for mechanical is proposed in this paper, called multiscale adaptive graph adversarial network(MAGAN). MAGAN consists of a feature extractor, domain adaptation module, and classifier. In the feature extractor, a hierarchical residual multiscale graph learning module is employed to obtain rich features, an adaptive graph learning module is utilized to learn differentiated representations of specific scale structures, and a multiscale fusion module is applied to facilitate the collaboration of different scale features. After that, the domain adaptation module aids the feature extractor in learning transferable features by constructing a measure of subdomain feature structure discrepancy and adversarial domain discriminator. The classifier is then utilized for cross-domain fault diagnosis on the extracted transferable features. Finally, the proposed MAGAN is evaluated using 3 cross-machine transfer scenarios based on a scaled-down test rig for wind turbine gearbox and 12 cross-operating conditions transfer scenarios based on a published bearing dataset. The results validate the transferability and generalization of MAGAN.</div></div>","PeriodicalId":49939,"journal":{"name":"Knowledge-Based Systems","volume":"309 ","pages":"Article 112787"},"PeriodicalIF":7.2,"publicationDate":"2024-11-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142748705","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Automated message selection for robust Heterogeneous Graph Contrastive Learning 为稳健的异构图对比学习自动选择信息
IF 7.2 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-11-22 DOI: 10.1016/j.knosys.2024.112739
Rui Bing , Guan Yuan , Yanmei Zhang , Yong Zhou , Qiuyan Yan
Heterogeneous Graph Contrastive Learning (HGCL) has attracted lots of attentions because of eliminating the requirement of node labels. The encoders used in HGCL mainly are message-passing based Heterogeneous Graph Neural Networks, which are vulnerable to edge perturbations. Recently, a few HGCL models replace the polluted graph with KNN graph or threshold graph, which both have flaws: (1) each node in KNN graph have the same degree K, it is irrational and loses original structural features; (2) the threshold is selected artificially, which hinders both effectiveness and interpretability. To tackle the above issues, we propose an Automated Message Selection based Heterogeneous Graph Contrastive Learning (AMS-HGCL) model. We first set relation view and meta-path view for contrast. Then, a robust encoder is proposed to defend structural attacks for both views by automatically selecting harmless messages, without setting all nodes having the same number of neighbors. The learned probabilities of messages can show harmful features directly, which makes our model interpretable. Finally, we design a novel cross-view contrastive loss to optimize AMS-HGCL and output robust node representations. The experimental results on four real datasets demonstrate that AMS-HGCL is feasible and effective.
异构图对比学习(Heterogeneous Graph Contrastive Learning,HGCL)因无需节点标签而备受关注。HGCL 中使用的编码器主要是基于消息传递的异构图神经网络,这种网络容易受到边缘扰动的影响。最近,一些 HGCL 模型用 KNN 图或阈值图取代了污染图,但这两种图都存在缺陷:(1)KNN 图中的每个节点都有相同的度 K,不合理,失去了原有的结构特征;(2)阈值是人为选择的,既影响了有效性,也影响了可解释性。针对上述问题,我们提出了基于异构图对比学习(AMS-HGCL)的自动信息选择模型。我们首先设置了关系视图和元路径视图进行对比。然后,我们提出了一种鲁棒编码器,通过自动选择无害信息来防御这两种视图的结构性攻击,而无需设置所有节点都有相同数量的邻居。学习到的信息概率可以直接显示有害特征,这使得我们的模型具有可解释性。最后,我们设计了一种新颖的跨视图对比损失来优化 AMS-HGCL,并输出稳健的节点表示。在四个真实数据集上的实验结果表明,AMS-HGCL 是可行且有效的。
{"title":"Automated message selection for robust Heterogeneous Graph Contrastive Learning","authors":"Rui Bing ,&nbsp;Guan Yuan ,&nbsp;Yanmei Zhang ,&nbsp;Yong Zhou ,&nbsp;Qiuyan Yan","doi":"10.1016/j.knosys.2024.112739","DOIUrl":"10.1016/j.knosys.2024.112739","url":null,"abstract":"<div><div>Heterogeneous Graph Contrastive Learning (HGCL) has attracted lots of attentions because of eliminating the requirement of node labels. The encoders used in HGCL mainly are message-passing based Heterogeneous Graph Neural Networks, which are vulnerable to edge perturbations. Recently, a few HGCL models replace the polluted graph with <em>KNN</em> graph or threshold graph, which both have flaws: (1) each node in <em>KNN</em> graph have the same degree <em>K</em>, it is irrational and loses original structural features; (2) the threshold is selected artificially, which hinders both effectiveness and interpretability. To tackle the above issues, we propose an <u>A</u>utomated <u>M</u>essage <u>S</u>election based <u>H</u>eterogeneous <u>G</u>raph <u>C</u>ontrastive <u>L</u>earning (AMS-HGCL) model. We first set relation view and meta-path view for contrast. Then, a robust encoder is proposed to defend structural attacks for both views by automatically selecting harmless messages, without setting all nodes having the same number of neighbors. The learned probabilities of messages can show harmful features directly, which makes our model interpretable. Finally, we design a novel cross-view contrastive loss to optimize AMS-HGCL and output robust node representations. The experimental results on four real datasets demonstrate that AMS-HGCL is feasible and effective.</div></div>","PeriodicalId":49939,"journal":{"name":"Knowledge-Based Systems","volume":"307 ","pages":"Article 112739"},"PeriodicalIF":7.2,"publicationDate":"2024-11-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142698129","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Multi-LoRA continual learning based instruction tuning framework for universal information extraction 基于持续学习的指令调整框架(Multi-LoRA),用于通用信息提取
IF 7.2 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-11-22 DOI: 10.1016/j.knosys.2024.112750
Yu Jin, Jie Liu, Shaowei Chen
Universal information extraction (Universal IE) aims to develop one model capable of solving multiple IE target tasks. Previous works have enhanced extraction performance of target tasks through auxiliary tasks. However, there are still limitations in terms of learning strategies. From one aspect, joint learning-based universal IE approaches, which simply mix auxiliary tasks with target tasks, fail to enable the model to master basic knowledge from auxiliary tasks before learning target tasks. From another aspect, continual learning-based universal IE approaches, which sequentially update all the model parameters on auxiliary tasks and target tasks, tend to cause catastrophic forgetting. In this study, we design a multi-LoRA continual learning-based instruction fine-tuning framework for universal IE. Specifically, we design unique LoRA modules for learning auxiliary tasks and target tasks. We first freeze pre-trained weights and update additional parameters on auxiliary tasks through one LoRA module. Subsequently, we keep the weights frozen and further adjust parameters through another LoRA module to adapt the model to the target tasks. Finally, we merge the frozen weights with learned weights, thereby enabling the model to better leverage the acquired abilities during the inference phase. Therefore, our model masters basic extraction abilities before learning target tasks and does not forget this basic knowledge during the target learning process. Moreover, we regard extraction, classification, and recognition as basic abilities and further design auxiliary tasks based on these basic abilities. Experimental results on 37 datasets across 3 tasks show that our approach reaches state-of-the-art performance.
通用信息提取(Universal IE)旨在开发一种能够解决多种信息提取目标任务的模型。以往的研究通过辅助任务提高了目标任务的提取性能。然而,在学习策略方面仍存在局限性。一方面,基于联合学习的通用 IE 方法只是将辅助任务与目标任务混合在一起,无法让模型在学习目标任务之前掌握辅助任务的基本知识。另一方面,基于持续学习的通用 IE 方法会依次更新辅助任务和目标任务的所有模型参数,容易造成灾难性遗忘。在本研究中,我们为通用智能教育设计了一个基于持续学习的多 LoRA 指令微调框架。具体来说,我们为学习辅助任务和目标任务设计了独特的 LoRA 模块。我们首先冻结预先训练的权重,并通过一个 LoRA 模块更新辅助任务的附加参数。随后,我们保持权重冻结,并通过另一个 LoRA 模块进一步调整参数,使模型适应目标任务。最后,我们将冻结的权重与学习到的权重合并,从而使模型在推理阶段更好地利用已获得的能力。因此,我们的模型在学习目标任务之前就掌握了基本的提取能力,并且在目标学习过程中不会遗忘这些基本知识。此外,我们将提取、分类和识别视为基本能力,并在这些基本能力的基础上进一步设计辅助任务。在 3 个任务的 37 个数据集上的实验结果表明,我们的方法达到了最先进的性能。
{"title":"Multi-LoRA continual learning based instruction tuning framework for universal information extraction","authors":"Yu Jin,&nbsp;Jie Liu,&nbsp;Shaowei Chen","doi":"10.1016/j.knosys.2024.112750","DOIUrl":"10.1016/j.knosys.2024.112750","url":null,"abstract":"<div><div>Universal information extraction (Universal IE) aims to develop one model capable of solving multiple IE target tasks. Previous works have enhanced extraction performance of target tasks through auxiliary tasks. However, there are still limitations in terms of learning strategies. From one aspect, joint learning-based universal IE approaches, which simply mix auxiliary tasks with target tasks, fail to enable the model to master basic knowledge from auxiliary tasks before learning target tasks. From another aspect, continual learning-based universal IE approaches, which sequentially update all the model parameters on auxiliary tasks and target tasks, tend to cause catastrophic forgetting. In this study, we design a multi-LoRA continual learning-based instruction fine-tuning framework for universal IE. Specifically, we design unique LoRA modules for learning auxiliary tasks and target tasks. We first freeze pre-trained weights and update additional parameters on auxiliary tasks through one LoRA module. Subsequently, we keep the weights frozen and further adjust parameters through another LoRA module to adapt the model to the target tasks. Finally, we merge the frozen weights with learned weights, thereby enabling the model to better leverage the acquired abilities during the inference phase. Therefore, our model masters basic extraction abilities before learning target tasks and does not forget this basic knowledge during the target learning process. Moreover, we regard extraction, classification, and recognition as basic abilities and further design auxiliary tasks based on these basic abilities. Experimental results on 37 datasets across 3 tasks show that our approach reaches state-of-the-art performance.</div></div>","PeriodicalId":49939,"journal":{"name":"Knowledge-Based Systems","volume":"308 ","pages":"Article 112750"},"PeriodicalIF":7.2,"publicationDate":"2024-11-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142720586","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Extended ARMA graph neural networks for the prognosis of complex systems
IF 7.2 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-11-22 DOI: 10.1016/j.knosys.2024.112762
Zhizhen Wang , Liu Fu , Meng Ma , Zhi Zhai , Hui Chen
With the advancement of intelligent sensing techniques, massive monitoring signals are collected and accumulated from industrial systems. Given that sensors are often correlated and constructed to reflect graph topology, the signals can be conceptualized as graph data. The polynomial filter-based graph neural networks (GNNs) are commonly employed to exploit information from nodes features and graph topology for graph data analysis. However, the polynomial filter-based GNNs encounter difficulty in accurately modeling sharp changes and the coefficients can vary a lot, making them hard to learn. To address this problem, a novel graph neural network named extended auto-regressive moving average graph neural network (eAGNN) is proposed. Compared with auto-regressive moving average (ARMA) neural network, the order restriction are removed, allowing for the inference of a more general neural network, which enables the modeling of filters with more different shapes. Furthermore, both low-frequency and high-frequency information are explicitly and separately extracted so as to alleviate the burden of the learning process and further enhance the learning capability. Finally, several experiments including public node classification and fault diagnosis were conducted. The results demonstrate that the proposed eAGNN exhibits high performance compared to alternative methods.
{"title":"Extended ARMA graph neural networks for the prognosis of complex systems","authors":"Zhizhen Wang ,&nbsp;Liu Fu ,&nbsp;Meng Ma ,&nbsp;Zhi Zhai ,&nbsp;Hui Chen","doi":"10.1016/j.knosys.2024.112762","DOIUrl":"10.1016/j.knosys.2024.112762","url":null,"abstract":"<div><div>With the advancement of intelligent sensing techniques, massive monitoring signals are collected and accumulated from industrial systems. Given that sensors are often correlated and constructed to reflect graph topology, the signals can be conceptualized as graph data. The polynomial filter-based graph neural networks (GNNs) are commonly employed to exploit information from nodes features and graph topology for graph data analysis. However, the polynomial filter-based GNNs encounter difficulty in accurately modeling sharp changes and the coefficients can vary a lot, making them hard to learn. To address this problem, a novel graph neural network named extended auto-regressive moving average graph neural network (eAGNN) is proposed. Compared with auto-regressive moving average (ARMA) neural network, the order restriction are removed, allowing for the inference of a more general neural network, which enables the modeling of filters with more different shapes. Furthermore, both low-frequency and high-frequency information are explicitly and separately extracted so as to alleviate the burden of the learning process and further enhance the learning capability. Finally, several experiments including public node classification and fault diagnosis were conducted. The results demonstrate that the proposed eAGNN exhibits high performance compared to alternative methods.</div></div>","PeriodicalId":49939,"journal":{"name":"Knowledge-Based Systems","volume":"309 ","pages":"Article 112762"},"PeriodicalIF":7.2,"publicationDate":"2024-11-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142748613","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Knowledge-Based Systems
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1