首页 > 最新文献

IEEE Transactions on Big Data最新文献

英文 中文
Streaming Local Community Detection Through Approximate Conductance 通过近似电导率进行流式本地群落检测
IF 7.2 3区 计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2023-08-31 DOI: 10.1109/TBDATA.2023.3310251
Meng Wang;Yanhao Yang;David Bindel;Kun He
Community is a universal structure in various complex networks, and community detection is a fundamental task for network analysis. With the rapid growth of network scale, networks are massive, changing rapidly, and could naturally be modeled as graph streams. Due to the limited memory and access constraint in graph streams, existing non-streaming community detection methods are no longer applicable. This raises an emerging need for online approaches. In this work, we consider the problem of uncovering the local community containing a few query nodes in graph streams, termed streaming local community detection. This new problem raised recently is more challenging for community detection, and only a few works address this online setting. Correspondingly, we design an online single-pass streaming local community detection approach. Inspired by the local property of communities, our method samples the local structure around the query nodes in graph streams and extracts the target community on the sampled subgraph using our proposed metric called approximate conductance. Comprehensive experiments show that our method remarkably outperforms the streaming baseline on both effectiveness and efficiency, and even achieves similar accuracy compared to the state-of-the-art non-streaming local community detection methods that use static and complete graphs.
社群是各种复杂网络中的一种普遍结构,社群检测是网络分析的一项基本任务。随着网络规模的快速增长,网络规模庞大、变化迅速,自然可以被建模为图流。由于图流的内存和访问限制有限,现有的非流式社群检测方法已不再适用。这就提出了对在线方法的新需求。在这项工作中,我们考虑的问题是发现图流中包含几个查询节点的本地社区,即流本地社区检测。最近提出的这一新问题对社区检测来说更具挑战性,只有少数作品涉及这一在线设置。因此,我们设计了一种在线单程流本地社区检测方法。受社群局部属性的启发,我们的方法对图流中查询节点周围的局部结构进行采样,并使用我们提出的近似传导率指标在采样子图上提取目标社群。综合实验表明,我们的方法在效果和效率上都明显优于流式基线方法,甚至与使用静态和完整图的最先进非流式本地社区检测方法相比,也达到了类似的精度。
{"title":"Streaming Local Community Detection Through Approximate Conductance","authors":"Meng Wang;Yanhao Yang;David Bindel;Kun He","doi":"10.1109/TBDATA.2023.3310251","DOIUrl":"10.1109/TBDATA.2023.3310251","url":null,"abstract":"Community is a universal structure in various complex networks, and community detection is a fundamental task for network analysis. With the rapid growth of network scale, networks are massive, changing rapidly, and could naturally be modeled as graph streams. Due to the limited memory and access constraint in graph streams, existing non-streaming community detection methods are no longer applicable. This raises an emerging need for online approaches. In this work, we consider the problem of uncovering the local community containing a few query nodes in graph streams, termed streaming local community detection. This new problem raised recently is more challenging for community detection, and only a few works address this online setting. Correspondingly, we design an online single-pass streaming local community detection approach. Inspired by the local property of communities, our method samples the local structure around the query nodes in graph streams and extracts the target community on the sampled subgraph using our proposed metric called approximate conductance. Comprehensive experiments show that our method remarkably outperforms the streaming baseline on both effectiveness and efficiency, and even achieves similar accuracy compared to the state-of-the-art non-streaming local community detection methods that use static and complete graphs.","PeriodicalId":13106,"journal":{"name":"IEEE Transactions on Big Data","volume":"10 1","pages":"12-22"},"PeriodicalIF":7.2,"publicationDate":"2023-08-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"89772759","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Transfer Learning With Document-Level Data Augmentation for Aspect-Level Sentiment Classification 面向方面级情感分类的文档级数据增强迁移学习
IF 7.2 3区 计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2023-08-30 DOI: 10.1109/TBDATA.2023.3310267
Xiaosai Huang;Jing Li;Jia Wu;Jun Chang;Donghua Liu
Aspect-level sentiment classification (ASC) seeks to reveal the emotional tendency of a designated aspect of a text. Some researchers have recently tried to exploit large amounts of document-level sentiment classification (DSC) data available to help improve the performance of ASC models through transfer learning. However, these studies often ignore the difference in sentiment distribution between document-level and aspect-level data without preprocessing the document-level knowledge. Our study provides a transfer learning with document-level data augmentation (TL-DDA) framework to transfer more accurate document-level knowledge to the ASC model by means of document-level data augmentation and attention fusion. First, we use document data selection and text concatenation to produce document-level data with various sentiment distributions. The augmented document data is then utilized for pre-training a well-designed DSC model. Finally, after attention adjustment, we fuse the word attention obtained from this DSC model into the ASC model. Results of experiments utilizing two publicly available datasets suggest that TL-DDA is reliable.
方面级情感分类(ASC)旨在揭示文本中指定方面的情感倾向。最近,一些研究人员试图利用大量的文档级情感分类(DSC)数据,通过迁移学习来帮助提高ASC模型的性能。然而,这些研究往往忽略了文档级和方面级数据之间情感分布的差异,没有对文档级知识进行预处理。我们的研究提供了一个具有文档级数据增强(TL-DDA)框架的迁移学习,通过文档级数据增强和注意力融合将更准确的文档级知识转移到ASC模型中。首先,我们使用文档数据选择和文本连接来生成具有各种情感分布的文档级数据。然后利用增强的文档数据对设计良好的DSC模型进行预训练。最后,经过注意调整,我们将DSC模型得到的单词注意融合到ASC模型中。利用两个公开数据集的实验结果表明,TL-DDA是可靠的。
{"title":"Transfer Learning With Document-Level Data Augmentation for Aspect-Level Sentiment Classification","authors":"Xiaosai Huang;Jing Li;Jia Wu;Jun Chang;Donghua Liu","doi":"10.1109/TBDATA.2023.3310267","DOIUrl":"10.1109/TBDATA.2023.3310267","url":null,"abstract":"Aspect-level sentiment classification (ASC) seeks to reveal the emotional tendency of a designated aspect of a text. Some researchers have recently tried to exploit large amounts of document-level sentiment classification (DSC) data available to help improve the performance of ASC models through transfer learning. However, these studies often ignore the difference in sentiment distribution between document-level and aspect-level data without preprocessing the document-level knowledge. Our study provides a transfer learning with document-level data augmentation (TL-DDA) framework to transfer more accurate document-level knowledge to the ASC model by means of \u0000<italic>document-level data augmentation</i>\u0000 and \u0000<italic>attention fusion</i>\u0000. First, we use \u0000<italic>document data selection</i>\u0000 and \u0000<italic>text concatenation</i>\u0000 to produce document-level data with various sentiment distributions. The augmented document data is then utilized for pre-training a well-designed DSC model. Finally, after \u0000<italic>attention adjustment</i>\u0000, we \u0000<italic>fuse the word attention</i>\u0000 obtained from this DSC model into the ASC model. Results of experiments utilizing two publicly available datasets suggest that TL-DDA is reliable.","PeriodicalId":13106,"journal":{"name":"IEEE Transactions on Big Data","volume":"9 6","pages":"1643-1657"},"PeriodicalIF":7.2,"publicationDate":"2023-08-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"62972763","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
TS-RTPM-Net: Data-Driven Tensor Sketching for Efficient CP Decomposition TS-RTPM-Net:数据驱动张量素描,实现高效 CP 分解
IF 7.2 3区 计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2023-08-30 DOI: 10.1109/TBDATA.2023.3310254
Xingyu Cao;Xiangtao Zhang;Ce Zhu;Jiani Liu;Yipeng Liu
Tensor decomposition is widely used in feature extraction, data analysis, and other fields. As a means of tensor decomposition, the robust tensor power method based on tensor sketch (TS-RTPM) can quickly mine the potential features of tensor, but in some cases, its approximation performance is limited. In this paper, we propose a data-driven framework called TS-RTPM-Net, which improves the estimation accuracy of TS-RTPM by jointly training the TS value matrices with the RTPM initial matrices. It also uses two greedy initialization algorithms to optimize the TS location matrices. In addition, TS-RTPM-Net accelerates TS-RTPM by using fast power iteration modules. Comparative experiments on real-world datasets verify that TS-RTPM-Net outperforms TS-RTPM in terms of estimation accuracy, running speed, and memory consumption.
张量分解被广泛应用于特征提取、数据分析等领域。作为张量分解的一种手段,基于张量素描的鲁棒张量幂方法(TS-RTPM)能快速挖掘张量的潜在特征,但在某些情况下,其近似性能有限。本文提出了一种名为 TS-RTPM-Net 的数据驱动框架,它通过联合训练 TS 值矩阵和 RTPM 初始矩阵来提高 TS-RTPM 的估计精度。它还使用两种贪婪初始化算法来优化 TS 位置矩阵。此外,TS-RTPM-Net 还通过使用快速幂迭代模块来加速 TS-RTPM。实际数据集的对比实验验证了 TS-RTPM-Net 在估计精度、运行速度和内存消耗方面都优于 TS-RTPM。
{"title":"TS-RTPM-Net: Data-Driven Tensor Sketching for Efficient CP Decomposition","authors":"Xingyu Cao;Xiangtao Zhang;Ce Zhu;Jiani Liu;Yipeng Liu","doi":"10.1109/TBDATA.2023.3310254","DOIUrl":"10.1109/TBDATA.2023.3310254","url":null,"abstract":"Tensor decomposition is widely used in feature extraction, data analysis, and other fields. As a means of tensor decomposition, the robust tensor power method based on tensor sketch (TS-RTPM) can quickly mine the potential features of tensor, but in some cases, its approximation performance is limited. In this paper, we propose a data-driven framework called TS-RTPM-Net, which improves the estimation accuracy of TS-RTPM by jointly training the TS value matrices with the RTPM initial matrices. It also uses two greedy initialization algorithms to optimize the TS location matrices. In addition, TS-RTPM-Net accelerates TS-RTPM by using fast power iteration modules. Comparative experiments on real-world datasets verify that TS-RTPM-Net outperforms TS-RTPM in terms of estimation accuracy, running speed, and memory consumption.","PeriodicalId":13106,"journal":{"name":"IEEE Transactions on Big Data","volume":"10 1","pages":"1-11"},"PeriodicalIF":7.2,"publicationDate":"2023-08-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"62972644","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Improved Box Embeddings for Fine-Grained Entity Typing 改进的细粒度实体类型的盒嵌入
IF 7.2 3区 计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2023-08-30 DOI: 10.1109/TBDATA.2023.3310239
Yixiu Qin;Yizhao Wang;Jiawei Li;Shun Mao;He Wang;Yuncheng Jiang
Different from traditional vector-based fine-grained entity typing methods, the box-based method is more effective in capturing the complex relationships between entity mentions and entity types. The box-based fine-grained entity typing method projects entity types and entity mentions into high-dimensional box space, where entity types and entity mentions are embedded as d-dimensional hyperrectangles. However, the impacts of entity types are not considered during classification in high-dimensional box space, and the model cannot be optimized precisely when two boxes are completely separated or overlapped in high-dimensional box space. Based on the above shortcomings, an Improved Box Embeddings (IBE) method for fine-grained entity typing is proposed in this work. The IBE not only introduces the impacts of entity types during classification in high-dimensional box space, but also proposes a distance based module to optimize the model precisely when two boxes are completely separated or overlapped in high-dimensional box space. Experimental results on four fine-grained entity typing datasets verify the effectiveness of the proposed IBE, demonstrating that IBE is a state-of-the-art method for fine-grained entity typing.
与传统的基于矢量的细粒度实体类型方法不同,基于框的方法在捕获实体提及和实体类型之间的复杂关系方面更有效。基于盒的细粒度实体类型方法将实体类型和实体提及投射到高维盒空间中,其中实体类型和实体提及被嵌入为d维超矩形。然而,在高维盒空间中,分类时没有考虑实体类型的影响,当两个盒子在高维盒空间中完全分离或重叠时,无法精确优化模型。基于上述不足,本文提出了一种改进的细粒度实体分类盒嵌入方法。IBE不仅引入了实体类型对高维盒空间分类的影响,而且提出了一个基于距离的模块,在高维盒空间中,当两个盒完全分离或重叠时精确优化模型。在四个细粒度实体类型数据集上的实验结果验证了所提出的IBE的有效性,表明IBE是一种最先进的细粒度实体类型方法。
{"title":"Improved Box Embeddings for Fine-Grained Entity Typing","authors":"Yixiu Qin;Yizhao Wang;Jiawei Li;Shun Mao;He Wang;Yuncheng Jiang","doi":"10.1109/TBDATA.2023.3310239","DOIUrl":"10.1109/TBDATA.2023.3310239","url":null,"abstract":"Different from traditional vector-based fine-grained entity typing methods, the box-based method is more effective in capturing the complex relationships between entity mentions and entity types. The box-based fine-grained entity typing method projects entity types and entity mentions into high-dimensional box space, where entity types and entity mentions are embedded as \u0000<italic>d</i>\u0000-dimensional hyperrectangles. However, the impacts of entity types are not considered during classification in high-dimensional box space, and the model cannot be optimized precisely when two boxes are completely separated or overlapped in high-dimensional box space. Based on the above shortcomings, an \u0000<bold>I</b>\u0000mproved \u0000<bold>B</b>\u0000ox \u0000<bold>E</b>\u0000mbeddings (IBE) method for fine-grained entity typing is proposed in this work. The IBE not only introduces the impacts of entity types during classification in high-dimensional box space, but also proposes a distance based module to optimize the model precisely when two boxes are completely separated or overlapped in high-dimensional box space. Experimental results on four fine-grained entity typing datasets verify the effectiveness of the proposed IBE, demonstrating that IBE is a state-of-the-art method for fine-grained entity typing.","PeriodicalId":13106,"journal":{"name":"IEEE Transactions on Big Data","volume":"9 6","pages":"1631-1642"},"PeriodicalIF":7.2,"publicationDate":"2023-08-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"62972523","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
PredLife: Predicting Fine-Grained Future Activity Patterns PredLife:预测细粒度的未来活动模式
IF 7.2 3区 计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2023-08-30 DOI: 10.1109/TBDATA.2023.3310241
Wenjing Li;Xiaodan Shi;Dou Huang;Xudong Shen;Jinyu Chen;Hill Hiroki Kobayashi;Haoran Zhang;Xuan Song;Ryosuke Shibasaki
Activity pattern prediction is a critical part of urban computing, urban planning, intelligent transportation, and so on. Based on a dataset with more than 10 million GPS trajectory records collected by mobile sensors, this research proposed a CNN-BiLSTM-VAE-ATT-based encoder-decoder model for fine-grained individual activity sequence prediction. The model combines the long-term and short-term dependencies crosswise and also considers randomness, diversity, and uncertainty of individual activity patterns. The proposed results show higher accuracy compared to the ten baselines. The model can generate high diversity results while approximating the original activity patterns distribution. Moreover, the model also has interpretability in revealing the time dependency importance of the activity pattern prediction.
活动模式预测是城市计算、城市规划、智能交通等领域的重要组成部分。基于移动传感器收集的1000多万条GPS轨迹数据集,提出了一种基于cnn - bilstm - vae - at的编码器-解码器模型,用于细粒度个体活动序列预测。该模型横向结合了长期和短期依赖关系,并考虑了个体活动模式的随机性、多样性和不确定性。与10个基线相比,所提出的结果具有更高的精度。该模型在接近原始活动模式分布的情况下,可以得到较高的多样性结果。此外,该模型在揭示活动模式预测的时间依赖性重要性方面也具有可解释性。
{"title":"PredLife: Predicting Fine-Grained Future Activity Patterns","authors":"Wenjing Li;Xiaodan Shi;Dou Huang;Xudong Shen;Jinyu Chen;Hill Hiroki Kobayashi;Haoran Zhang;Xuan Song;Ryosuke Shibasaki","doi":"10.1109/TBDATA.2023.3310241","DOIUrl":"10.1109/TBDATA.2023.3310241","url":null,"abstract":"Activity pattern prediction is a critical part of urban computing, urban planning, intelligent transportation, and so on. Based on a dataset with more than 10 million GPS trajectory records collected by mobile sensors, this research proposed a CNN-BiLSTM-VAE-ATT-based encoder-decoder model for fine-grained individual activity sequence prediction. The model combines the long-term and short-term dependencies crosswise and also considers randomness, diversity, and uncertainty of individual activity patterns. The proposed results show higher accuracy compared to the ten baselines. The model can generate high diversity results while approximating the original activity patterns distribution. Moreover, the model also has interpretability in revealing the time dependency importance of the activity pattern prediction.","PeriodicalId":13106,"journal":{"name":"IEEE Transactions on Big Data","volume":"9 6","pages":"1658-1669"},"PeriodicalIF":7.2,"publicationDate":"2023-08-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"62972573","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Cosine Multilinear Principal Component Analysis for Recognition 余弦多线性主成分分析识别
IF 7.2 3区 计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2023-08-02 DOI: 10.1109/TBDATA.2023.3301389
Feng Han;Chengcai Leng;Bing Li;Anup Basu;Licheng Jiao
Existing two-dimensional principal component analysis methods can only handle second-order tensors (i.e., matrices). However, with the advancement of technology, tensors of order three and higher are gradually increasing. This brings new challenges to dimensionality reduction. Thus, a multilinear method called MPCA was proposed. Although MPCA can be applied to all tensors, using the square of the F-norm makes it very sensitive to outliers. Several two-dimensional methods, such as Angle 2DPCA, have good robustness but cannot be applied to all tensors. We extend the robust Angle 2DPCA method to a multilinear method and propose Cosine Multilinear Principal Component Analysis (CosMPCA) for tensor representation. Our CosMPCA method considers the relationship between the reconstruction error and projection scatter and selects the cosine metric. In addition, our method naturally uses the F-norm to reduce the impact of outliers. We introduce an iterative algorithm to solve CosMPCA. We provide detailed theoretical analysis in both the proposed method and the analysis of the algorithm. Experiments show that our method is robust to outliers and is suitable for tensors of any order.
现有的二维主成分分析方法只能处理二阶张量(即矩阵)。然而,随着技术的进步,三阶及以上的张量逐渐增加。这给降维带来了新的挑战。因此,提出了一种称为MPCA的多线性方法。尽管MPCA可以应用于所有张量,但使用f范数的平方使其对异常值非常敏感。一些二维方法,如角2DPCA,具有良好的鲁棒性,但不能适用于所有张量。我们将鲁棒角2DPCA方法扩展到多线性方法,并提出了余弦多线性主成分分析(CosMPCA)用于张量表示。我们的CosMPCA方法考虑了重建误差与投影散射之间的关系,并选择了余弦度量。此外,我们的方法自然地使用f范数来减少异常值的影响。介绍了一种求解CosMPCA的迭代算法。我们对所提出的方法和算法进行了详细的理论分析。实验表明,该方法对异常值具有较强的鲁棒性,适用于任意阶张量。
{"title":"Cosine Multilinear Principal Component Analysis for Recognition","authors":"Feng Han;Chengcai Leng;Bing Li;Anup Basu;Licheng Jiao","doi":"10.1109/TBDATA.2023.3301389","DOIUrl":"10.1109/TBDATA.2023.3301389","url":null,"abstract":"Existing two-dimensional principal component analysis methods can only handle second-order tensors (i.e., matrices). However, with the advancement of technology, tensors of order three and higher are gradually increasing. This brings new challenges to dimensionality reduction. Thus, a multilinear method called MPCA was proposed. Although MPCA can be applied to all tensors, using the square of the F-norm makes it very sensitive to outliers. Several two-dimensional methods, such as Angle 2DPCA, have good robustness but cannot be applied to all tensors. We extend the robust Angle 2DPCA method to a multilinear method and propose Cosine Multilinear Principal Component Analysis (CosMPCA) for tensor representation. Our CosMPCA method considers the relationship between the reconstruction error and projection scatter and selects the cosine metric. In addition, our method naturally uses the F-norm to reduce the impact of outliers. We introduce an iterative algorithm to solve CosMPCA. We provide detailed theoretical analysis in both the proposed method and the analysis of the algorithm. Experiments show that our method is robust to outliers and is suitable for tensors of any order.","PeriodicalId":13106,"journal":{"name":"IEEE Transactions on Big Data","volume":"9 6","pages":"1620-1630"},"PeriodicalIF":7.2,"publicationDate":"2023-08-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"62972600","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Adaptive Powerball Stochastic Conjugate Gradient for Large-Scale Learning 大规模学习的自适应强力球随机共轭梯度
IF 7.2 3区 计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2023-08-01 DOI: 10.1109/TBDATA.2023.3300546
Zhuang Yang
The extreme success of stochastic optimization (SO) in large-scale machine learning problems, information retrieval, bioinformatics, etc., has been widely reported, especially in recent years. As an effective tactic, conjugate gradient (CG) has been gaining its popularity in accelerating SO algorithms. This paper develops a novel type of stochastic conjugate gradient descent (SCG) algorithms from the perspective of the Powerball strategy and the hypergradient descent (HD) technique. The crucial idea behind the resulting methods is inspired by pursuing the equilibrium of ordinary differential equations (ODEs). We elucidate the effect of the Powerball strategy in SCG algorithms. The introduction of HD, on the other side, makes the resulting methods work with an online learning rate. Meanwhile, we provide a comprehension of the theoretical results for the resulting algorithms under non-convex assumptions. As a byproduct, we bridge the gap between the learning rate and powered stochastic optimization (PSO) algorithms, which is still an open problem. Resorting to numerical experiments on numerous benchmark datasets, we test the parameter sensitivity of the proposed methods and demonstrate the superior performance of our new algorithms over state-of-the-art algorithms.
随机优化(SO)在大规模机器学习问题、信息检索、生物信息学等领域的巨大成功已经被广泛报道,尤其是近年来。共轭梯度(CG)作为一种有效的策略,在加速SO算法中得到了广泛的应用。从强力球策略和超梯度下降技术的角度出发,提出了一种新的随机共轭梯度下降(SCG)算法。结果方法背后的关键思想是由追求常微分方程(ode)的平衡所启发的。我们阐明了强力球策略在SCG算法中的作用。另一方面,HD的引入使最终的方法与在线学习率一起工作。同时,我们提供了在非凸假设下所得算法的理论结果的理解。作为一个副产品,我们弥合了学习率和动力随机优化(PSO)算法之间的差距,这仍然是一个悬而未决的问题。通过在众多基准数据集上进行数值实验,我们测试了所提出方法的参数敏感性,并证明了我们的新算法比最先进的算法具有优越的性能。
{"title":"Adaptive Powerball Stochastic Conjugate Gradient for Large-Scale Learning","authors":"Zhuang Yang","doi":"10.1109/TBDATA.2023.3300546","DOIUrl":"10.1109/TBDATA.2023.3300546","url":null,"abstract":"The extreme success of stochastic optimization (SO) in large-scale machine learning problems, information retrieval, bioinformatics, etc., has been widely reported, especially in recent years. As an effective tactic, conjugate gradient (CG) has been gaining its popularity in accelerating SO algorithms. This paper develops a novel type of stochastic conjugate gradient descent (SCG) algorithms from the perspective of the Powerball strategy and the hypergradient descent (HD) technique. The crucial idea behind the resulting methods is inspired by pursuing the equilibrium of ordinary differential equations (ODEs). We elucidate the effect of the Powerball strategy in SCG algorithms. The introduction of HD, on the other side, makes the resulting methods work with an online learning rate. Meanwhile, we provide a comprehension of the theoretical results for the resulting algorithms under non-convex assumptions. As a byproduct, we bridge the gap between the learning rate and powered stochastic optimization (PSO) algorithms, which is still an open problem. Resorting to numerical experiments on numerous benchmark datasets, we test the parameter sensitivity of the proposed methods and demonstrate the superior performance of our new algorithms over state-of-the-art algorithms.","PeriodicalId":13106,"journal":{"name":"IEEE Transactions on Big Data","volume":"9 6","pages":"1598-1606"},"PeriodicalIF":7.2,"publicationDate":"2023-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"62972533","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Rethinking Missing Data: Aleatoric Uncertainty-Aware Recommendation 重新思考缺失的数据:有意识的不确定性建议
IF 7.2 3区 计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2023-08-01 DOI: 10.1109/TBDATA.2023.3300547
Chenxu Wang;Fuli Feng;Yang Zhang;Qifan Wang;Xunhan Hu;Xiangnan He
Historical interactions are the default choice for recommender model training, which typically exhibit high sparsity, i.e., most user-item pairs are unobserved missing data. A standard choice is treating the missing data as negative training samples and estimating interaction likelihood between user-item pairs along with the observed interactions. In this way, some potential interactions are inevitably mislabeled during training, which will hurt the model fidelity, hindering the model to recall the mislabeled items, especially the long-tail ones. In this work, we investigate the mislabeling issue from a new perspective of aleatoric uncertainty, which describes the inherent randomness of missing data. The randomness pushes us to go beyond merely the interaction likelihood and embrace aleatoric uncertainty modeling. Towards this end, we propose a new Aleatoric Uncertainty-aware Recommendation (AUR) framework that consists of a new uncertainty estimator along with a normal recommender model. According to the theory of aleatoric uncertainty, we derive a new recommendation objective to learn the estimator. As the chance of mislabeling reflects the potential of a pair, AUR makes recommendations according to the uncertainty, which is demonstrated to improve the recommendation performance of less popular items without sacrificing the overall performance. We instantiate AUR on three representative recommender models: Matrix Factorization (MF), LightGCN, and VAE from mainstream model architectures. Extensive results on four real-world datasets validate the effectiveness of AUR w.r.t. better recommendation results, especially on long-tail items.
历史交互是推荐模型训练的默认选择,它通常表现出高稀疏性,即大多数用户-项目对都是未观察到的缺失数据。一种标准的选择是将缺失的数据作为负训练样本,并根据观察到的交互估计用户-物品对之间的交互可能性。这样,在训练过程中不可避免地会出现一些潜在的交互误标注,这会影响模型的保真度,阻碍模型对错误标注的项目,特别是长尾项目的召回。在这项工作中,我们从任意不确定性的新角度研究了错误标记问题,它描述了丢失数据的固有随机性。随机性促使我们超越仅仅是相互作用的可能性,而拥抱任意的不确定性模型。为此,我们提出了一个新的任意不确定性感知推荐(AUR)框架,该框架由一个新的不确定性估计器和一个正常的推荐模型组成。根据任意不确定性理论,提出了一种新的推荐目标来学习估计量。由于错误标注的概率反映了一对商品的潜力,AUR根据不确定性进行推荐,在不牺牲整体性能的情况下提高了不太受欢迎商品的推荐性能。我们在三个代表性的推荐模型上实例化AUR:矩阵分解(MF)、LightGCN和主流模型体系结构中的VAE。在四个真实数据集上的广泛结果验证了AUR w.r.t.的有效性,获得了更好的推荐结果,特别是在长尾项目上。
{"title":"Rethinking Missing Data: Aleatoric Uncertainty-Aware Recommendation","authors":"Chenxu Wang;Fuli Feng;Yang Zhang;Qifan Wang;Xunhan Hu;Xiangnan He","doi":"10.1109/TBDATA.2023.3300547","DOIUrl":"https://doi.org/10.1109/TBDATA.2023.3300547","url":null,"abstract":"Historical interactions are the default choice for recommender model training, which typically exhibit high sparsity, i.e., most user-item pairs are unobserved missing data. A standard choice is treating the missing data as negative training samples and estimating interaction likelihood between user-item pairs along with the observed interactions. In this way, some potential interactions are inevitably mislabeled during training, which will hurt the model fidelity, hindering the model to recall the mislabeled items, especially the long-tail ones. In this work, we investigate the mislabeling issue from a new perspective of \u0000<italic>aleatoric uncertainty</i>\u0000, which describes the inherent randomness of missing data. The randomness pushes us to go beyond merely the interaction likelihood and embrace aleatoric uncertainty modeling. Towards this end, we propose a new \u0000<italic>Aleatoric Uncertainty-aware Recommendation</i>\u0000 (AUR) framework that consists of a new uncertainty estimator along with a normal recommender model. According to the theory of aleatoric uncertainty, we derive a new recommendation objective to learn the estimator. As the chance of mislabeling reflects the potential of a pair, AUR makes recommendations according to the uncertainty, which is demonstrated to improve the recommendation performance of less popular items without sacrificing the overall performance. We instantiate AUR on three representative recommender models: Matrix Factorization (MF), LightGCN, and VAE from mainstream model architectures. Extensive results on four real-world datasets validate the effectiveness of AUR w.r.t. better recommendation results, especially on long-tail items.","PeriodicalId":13106,"journal":{"name":"IEEE Transactions on Big Data","volume":"9 6","pages":"1607-1619"},"PeriodicalIF":7.2,"publicationDate":"2023-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138138211","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
A Black-Box Adversarial Attack Method via Nesterov Accelerated Gradient and Rewiring Towards Attacking Graph Neural Networks 一种基于Nesterov加速梯度和重布线的攻击图神经网络的黑盒对抗攻击方法
IF 7.2 3区 计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2023-07-19 DOI: 10.1109/TBDATA.2023.3296936
Shu Zhao;Wenyu Wang;Ziwei Du;Jie Chen;Zhen Duan
Recent studies have shown that Graph Neural Networks (GNNs) are vulnerable to well-designed and imperceptible adversarial attack. Attacks utilizing gradient information are widely used in the field of attack due to their simplicity and efficiency. However, several challenges are faced by gradient-based attacks: 1) Generate perturbations use white-box attacks (i.e., requiring access to the full knowledge of the model), which is not practical in the real world; 2) It is easy to drop into local optima; and 3) The perturbation budget is not limited and might be detected even if the number of modified edges is small. Faced with the above challenges, this article proposes a black-box adversarial attack method, named NAG-R, which consists of two modules known as Nesterov Accelerated Gradient attack module and Rewiring optimization module. Specifically, inspired by adversarial attacks on images, the first module generates perturbations by introducing Nesterov Accelerated Gradient (NAG) to avoid falling into local optima. The second module keeps the fundamental properties of the graph (e.g., the total degree of the graph) unchanged through a rewiring operation, thus ensuring that perturbations are imperceptible. Intensive experiments show that our method has significant attack success and transferability over existing state-of-the-art gradient-based attack methods.
最近的研究表明,图神经网络(gnn)容易受到设计良好且难以察觉的对抗性攻击。利用梯度信息的攻击以其简单、高效的特点在攻击领域得到了广泛的应用。然而,基于梯度的攻击面临着几个挑战:1)使用白盒攻击生成扰动(即需要访问模型的全部知识),这在现实世界中是不实际的;2)容易陷入局部最优;3)扰动预算不受限制,即使修改边的数量很少也可以检测到扰动预算。面对上述挑战,本文提出了一种黑箱对抗攻击方法,命名为NAG-R,该方法由Nesterov加速梯度攻击模块和Rewiring优化模块两个模块组成。具体来说,受图像对抗性攻击的启发,第一个模块通过引入Nesterov加速梯度(NAG)来产生扰动,以避免陷入局部最优。第二个模块通过重新布线操作保持图的基本属性(例如,图的总度)不变,从而确保扰动是不可察觉的。大量的实验表明,与现有的基于梯度的攻击方法相比,我们的方法具有显著的攻击成功率和可移植性。
{"title":"A Black-Box Adversarial Attack Method via Nesterov Accelerated Gradient and Rewiring Towards Attacking Graph Neural Networks","authors":"Shu Zhao;Wenyu Wang;Ziwei Du;Jie Chen;Zhen Duan","doi":"10.1109/TBDATA.2023.3296936","DOIUrl":"10.1109/TBDATA.2023.3296936","url":null,"abstract":"Recent studies have shown that Graph Neural Networks (GNNs) are vulnerable to well-designed and imperceptible adversarial attack. Attacks utilizing gradient information are widely used in the field of attack due to their simplicity and efficiency. However, several challenges are faced by gradient-based attacks: 1) Generate perturbations use white-box attacks (i.e., requiring access to the full knowledge of the model), which is not practical in the real world; 2) It is easy to drop into local optima; and 3) The perturbation budget is not limited and might be detected even if the number of modified edges is small. Faced with the above challenges, this article proposes a black-box adversarial attack method, named NAG-R, which consists of two modules known as \u0000<bold>N</b>\u0000esterov \u0000<bold>A</b>\u0000ccelerated \u0000<bold>G</b>\u0000radient attack module and \u0000<bold>R</b>\u0000ewiring optimization module. Specifically, inspired by adversarial attacks on images, the first module generates perturbations by introducing Nesterov Accelerated Gradient (NAG) to avoid falling into local optima. The second module keeps the fundamental properties of the graph (e.g., the total degree of the graph) unchanged through a rewiring operation, thus ensuring that perturbations are imperceptible. Intensive experiments show that our method has significant attack success and transferability over existing state-of-the-art gradient-based attack methods.","PeriodicalId":13106,"journal":{"name":"IEEE Transactions on Big Data","volume":"9 6","pages":"1586-1597"},"PeriodicalIF":7.2,"publicationDate":"2023-07-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"62972856","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Multi-Discriminator Active Adversarial Network for Multi-Center Brain Disease Diagnosis 多鉴别器主动对抗网络多中心脑疾病诊断
IF 7.2 3区 计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2023-07-11 DOI: 10.1109/TBDATA.2023.3294000
Qi Zhu;Qiming Yang;Mingming Wang;Xiangyu Xu;Yuwu Lu;Wei Shao;Daoqiang Zhang
Multi-center analysis has attracted increasing attention in brain disease diagnosis, because it provides effective approaches to improve disease diagnostic performance by making use of the information from different centers. However, in practical multi-center applications, data uncertainty is more common than that in single center, which brings challenge to robust modeling of diagnosis. In this article, we proposed a multi-discriminator active adversarial network (MDAAN) to alleviate the uncertainties at the center, feature, and label levels for multi-center brain disease diagnosis. First, we extract the latent invariant representation of the source center and target center to reduce domain shift by adversarial learning strategy. Second, the proposed method adaptively evaluates the contribution of different source centers in fusion by measuring data distribution difference between source and target center. Moreover, only the hard learning samples in target center are identified to label with low sample annotation cost. Finally, we treat the selected samples as the auxiliary domain to alleviate the negative transfer and improve the robustness of the multi-center model. We extensively compare the proposed approach with several state-of-the-art multi-center methods on the five-center schizophrenia dataset, and the results demonstrate that our method is superior to the previous methods in identifying brain disease.
多中心分析在脑疾病诊断中越来越受到关注,因为它通过利用来自不同中心的信息,为提高疾病诊断性能提供了有效的途径。然而,在实际的多中心应用中,数据的不确定性比单中心更普遍,这给诊断的鲁棒建模带来了挑战。在本文中,我们提出了一种多鉴别器主动对抗网络(MDAAN)来缓解多中心脑疾病诊断在中心、特征和标签层面的不确定性。首先,我们提取源中心和目标中心的潜在不变表示,通过对抗学习策略减少域漂移。其次,通过测量源中心和目标中心之间的数据分布差异,自适应评估不同源中心对融合的贡献;并且,仅识别目标中心的难学习样本进行标注,样本标注成本较低。最后,我们将选择的样本作为辅助域,以减轻负迁移,提高多中心模型的鲁棒性。在五中心精神分裂症数据集上,我们将所提出的方法与几种最先进的多中心方法进行了广泛的比较,结果表明我们的方法在识别脑部疾病方面优于先前的方法。
{"title":"Multi-Discriminator Active Adversarial Network for Multi-Center Brain Disease Diagnosis","authors":"Qi Zhu;Qiming Yang;Mingming Wang;Xiangyu Xu;Yuwu Lu;Wei Shao;Daoqiang Zhang","doi":"10.1109/TBDATA.2023.3294000","DOIUrl":"10.1109/TBDATA.2023.3294000","url":null,"abstract":"Multi-center analysis has attracted increasing attention in brain disease diagnosis, because it provides effective approaches to improve disease diagnostic performance by making use of the information from different centers. However, in practical multi-center applications, data uncertainty is more common than that in single center, which brings challenge to robust modeling of diagnosis. In this article, we proposed a multi-discriminator active adversarial network (MDAAN) to alleviate the uncertainties at the center, feature, and label levels for multi-center brain disease diagnosis. First, we extract the latent invariant representation of the source center and target center to reduce domain shift by adversarial learning strategy. Second, the proposed method adaptively evaluates the contribution of different source centers in fusion by measuring data distribution difference between source and target center. Moreover, only the hard learning samples in target center are identified to label with low sample annotation cost. Finally, we treat the selected samples as the auxiliary domain to alleviate the negative transfer and improve the robustness of the multi-center model. We extensively compare the proposed approach with several state-of-the-art multi-center methods on the five-center schizophrenia dataset, and the results demonstrate that our method is superior to the previous methods in identifying brain disease.","PeriodicalId":13106,"journal":{"name":"IEEE Transactions on Big Data","volume":"9 6","pages":"1575-1585"},"PeriodicalIF":7.2,"publicationDate":"2023-07-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"62972246","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
IEEE Transactions on Big Data
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1