Large Language Models (LLMs) usually rely on extensive training datasets. In the financial domain, creating numerical reasoning datasets that mix tables and long text often incurs substantial manual annotation expense. To address the limited data resources and reduce the annotation cost, we introduce FinLLMs, a method for generating financial question-answering (QA) data from common financial formulas using LLMs. First, we compile a list of common financial formulas and construct a graph based on the variables these formulas employ. We then augment the formula set by combining formulas that share variables into new elements. Specifically, we start from the manually annotated formulas and merge those with shared variables by traversing the constructed graph. Finally, building on the collected formula set, we use LLMs to generate financial QA data that encompasses both tabular information and long textual content. Our experiments demonstrate that the synthetic data generated by FinLLMs effectively enhances the performance of various numerical reasoning models in the financial domain, including both pre-trained language models (PLMs) and fine-tuned LLMs, surpassing the performance obtained with two established benchmark financial QA datasets.
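The formula-graph construction described above can be sketched roughly as follows; the formula names and variables here are illustrative placeholders of ours, not taken from the paper. Formulas become nodes, an edge links two formulas that share a variable, and traversing edges yields merged formulas over the union of their variables.

```python
from itertools import combinations

# Illustrative formula set (names and variables are ours, not the paper's):
# each formula maps to the set of variables it uses.
formulas = {
    "gross_profit": {"revenue", "cogs"},
    "gross_margin": {"gross_profit", "revenue"},
    "net_margin": {"net_income", "revenue"},
}

# Graph construction: formulas are nodes; connect two formulas
# whenever they share at least one variable.
edges = {
    (a, b)
    for a, b in combinations(sorted(formulas), 2)
    if formulas[a] & formulas[b]
}

# Traversal/merge: each connected pair yields an augmented formula
# over the union of its parents' variables.
augmented = {f"{a}+{b}": formulas[a] | formulas[b] for a, b in edges}
```

Each merged entry can then seed an LLM prompt that asks for a table, a passage, and a question whose answer requires the combined formula.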
Title: FinLLMs: A Framework for Financial Reasoning Dataset Generation With Large Language Models
Authors: Ziqiang Yuan;Kaiyuan Wang;Shoutai Zhu;Ye Yuan;Jingya Zhou;Yanlin Zhu;Wenqi Wei
Pub Date : 2024-12-30 DOI: 10.1109/TBDATA.2024.3524083
IEEE Transactions on Big Data, vol. 11, no. 5, pp. 2264-2277
Pub Date : 2024-12-30 DOI: 10.1109/TBDATA.2024.3524081
Jinsong Chen;Chang Liu;Kaiyuan Gao;Gaichao Li;Kun He
Graph Transformers, an emerging architecture for graph representation learning, suffer from quadratic complexity and can only handle graphs with at most thousands of nodes. To this end, we propose the Neighborhood Aggregation Graph Transformer (NAGphormer), which treats each node as a sequence of tokens constructed by our proposed Hop2Token module. For each node, Hop2Token aggregates the neighborhood features from different hops into separate representations, producing a sequence of token vectors as one input. In this way, NAGphormer can be trained in a mini-batch manner and thus scales to large graphs with millions of nodes. To further enhance the model's generalization, we propose NAGphormer+, an extension of NAGphormer with a novel data augmentation method called Neighborhood Augmentation (NrAug). Based on the output of Hop2Token, NrAug augments neighborhood features from both global and local views. In this way, NAGphormer+ can fully utilize the neighborhood information of multiple nodes, leading to more comprehensive training and improved generalization. Extensive experiments on benchmark datasets from small to large demonstrate the superiority of NAGphormer+ over existing graph Transformers and mainstream GNNs, as well as over the original NAGphormer.
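The Hop2Token idea can be sketched as below. This toy version uses a row-normalized propagation matrix with self-loops and a hand-made three-node graph; both are simplifying assumptions of ours, not the paper's exact formulation.

```python
import numpy as np

def hop2token(A, X, K):
    """For each node, build a (K+1)-token sequence where token k
    aggregates features from the node's k-hop neighborhood."""
    A_hat = A + np.eye(len(A))              # add self-loops
    d = A_hat.sum(axis=1)
    A_norm = A_hat / d[:, None]             # row-normalized propagation
    tokens = [X]                            # hop 0 = the node's own features
    for _ in range(K):
        tokens.append(A_norm @ tokens[-1])  # hop k built from hop k-1
    # Shape (num_nodes, K+1, feature_dim): one independent token sequence
    # per node, which is what makes mini-batch training possible.
    return np.stack(tokens, axis=1)

# Tiny path graph 0-1-2 with 2-dim features.
A = np.array([[0, 1, 0],
              [1, 0, 1],
              [0, 1, 0]], dtype=float)
X = np.array([[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]])
seq = hop2token(A, X, K=2)
```

A Transformer then attends over each node's (K+1)-token sequence in isolation, so batches of nodes can be sampled freely without loading the whole graph.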
Title: NAGphormer+: A Tokenized Graph Transformer With Neighborhood Augmentation for Node Classification in Large Graphs
IEEE Transactions on Big Data, vol. 11, no. 4, pp. 2085-2098
Multi-view multi-label classification is a crucial machine learning paradigm aimed at building robust multi-label predictors by integrating heterogeneous features from various sources while addressing multiple correlated labels. However, in real-world applications, concerns over data confidentiality and security often prevent data exchange or fusion across different sources, leading to the challenging issue of data islands. To tackle this problem, we propose a general federated multi-view multi-label classification method, FMVML, which integrates a novel multi-view multi-label classification technique into a federated learning framework. This approach enables cross-view feature fusion and multi-label semantic classification while preserving the data privacy of each independent source. Within this federated framework, we first extract view-specific information from each individual client to capture unique characteristics and then consolidate consensus information from different views on the global server to represent shared features. Unlike previous methods, our approach enhances cross-view fusion and semantic expression by jointly capturing both feature and semantic aspects of specificity and commonality. The final label predictions are generated by combining the view-specific predictions from individual clients and the consensus predictions from the global server. Extensive experiments across various applications demonstrate that FMVML fully leverages multi-view data in a privacy-preserving manner and consistently outperforms state-of-the-art methods.
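The final fusion step can be sketched as a simple convex combination of the averaged view-specific client predictions and the server's consensus prediction; the weighting rule and names below are an illustrative stand-in for the paper's actual fusion scheme.

```python
import numpy as np

def fuse_predictions(client_preds, consensus_pred, alpha=0.5):
    """Combine view-specific client predictions with the global server's
    consensus prediction. The convex combination here is illustrative,
    not the paper's exact rule."""
    specific = np.mean(client_preds, axis=0)  # average over clients/views
    return alpha * specific + (1 - alpha) * np.asarray(consensus_pred)

# Two clients' label-probability vectors and a server consensus.
client_preds = np.array([[0.8, 0.2],
                         [0.6, 0.4]])
consensus = np.array([0.5, 0.5])
fused = fuse_predictions(client_preds, consensus)
```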
Title: Federated Multi-View Multi-Label Classification
Authors: Hongdao Meng;Yongjian Deng;Qiyu Zhong;Yipeng Wang;Zhen Yang;Gengyu Lyu
Pub Date : 2024-12-26 DOI: 10.1109/TBDATA.2024.3522812
IEEE Transactions on Big Data, vol. 11, no. 4, pp. 2072-2084
Semantic expertise remains a reliable foundation for industrial decision-making, and Large Language Models (LLMs) can augment often-limited empirical knowledge by generating domain-specific insights, though the quality of this generated knowledge is uncertain. Integrating LLMs with the collective wisdom of multiple stakeholders could enhance the quality and scale of knowledge, yet such integration might inadvertently raise privacy concerns for stakeholders. In response to this challenge, Federated Learning (FL) is harnessed to improve knowledge-base quality by leveraging other stakeholders' knowledge in encrypted form, where each knowledge base is represented as a Knowledge Graph (KG). Initially, a multi-field hyperbolic (MFH) graph embedding method vectorizes entities, furnishing mathematical representations rather than solely semantic meanings. The FL framework then identifies and fuses common entities under encryption, whereby the updated entity embeddings can refine other private entity embeddings locally, thus enhancing overall KG quality. Finally, the KG complement method refines and clarifies triplets to improve the overall quality of the KG. Experiments assess the proposed approach across different industrial KGs, confirming its effectiveness as a viable solution for collaborative KG creation while maintaining data security.
Title: Unlocking Large Language Model Power in Industry: Privacy-Preserving Collaborative Creation of Knowledge Graph
Authors: Liqiao Xia;Junming Fan;Ajith Parlikad;Xiao Huang;Pai Zheng
Pub Date : 2024-12-26 DOI: 10.1109/TBDATA.2024.3522814
IEEE Transactions on Big Data, vol. 11, no. 4, pp. 2046-2060
Pub Date : 2024-12-26 DOI: 10.1109/TBDATA.2024.3522804
Jiajun Sun;Dianliang Wu
The promising applications of mobile crowdsensing (MCS) have attracted much research interest recently, especially in posted-pricing settings. However, existing works mainly focus on stationary MCS, whether in a stochastic or adversarial environment, where the distribution of each price (or arm) remains identical over time. In many realistic MCS applications, such as environment monitoring and recommendation systems, stationary bandits do not model posted-pricing sequential decision problems in which the reward distribution of each price (arm) and the cost distribution vary over time due to changes in light intensity and mobile devices' remaining energy. In this paper, we study a more general submodular crowdsensing setting to address non-stationary sequential pricing problems, and construct a monotonic submodular function that merges the marginal reward and the temporal-difference errors (TD-errors) of deep reinforcement learning (DRL). Moreover, we design a weighted budget-limited non-stationary pricing mechanism based on the deep deterministic policy gradient (DDPG) method for submodular MCS, considering both hard-drop and soft-drop weights. Our mechanism readily extends to non-submodular MCS and other MCS settings. Extensive simulations demonstrate that our mechanism outperforms existing benchmarks.
Title: Online Non-Stationary Pricing Incentives for Budget-Limited Crowdsensing
IEEE Transactions on Big Data, vol. 11, no. 4, pp. 2025-2035
Pub Date : 2024-12-26 DOI: 10.1109/TBDATA.2024.3522805
Liner Yang;Jiaxin Yuan;Cunliang Kong;Jingsi Yu;Ruining Chong;Zhenghao Liu;Erhong Yang
The task of complexity-controllable definition generation is to provide definitions of different readability levels for words in specific contexts. This task can help language learners eliminate reading barriers and facilitate language acquisition. However, the available training data for this task remains scarce due to the difficulty of obtaining reliable definition data and the high cost of data standardization. To tackle these challenges, we introduce a general solution from both data-driven and method-driven perspectives. We construct a large-scale standard Chinese dataset, COMPILING, which contains both difficult and simple definitions and can serve as a benchmark for future research. In addition, we propose SimpDefiner, a multitasking framework for unsupervised controllable definition generation. By designing a parameter-sharing scheme between two decoders, the framework can extract complexity information from a non-parallel corpus. Moreover, we propose the SimpDefiner-guided prompting (SGP) method, in which simple definitions generated by SimpDefiner are used to construct prompts for GPT-4, yielding more realistic and contextually appropriate definitions. The results demonstrate SimpDefiner's strong controllable-generation ability, with further gains when GPT-4 is incorporated.
Title: Tailored Definitions With Easy Reach: Complexity-Controllable Definition Generation
IEEE Transactions on Big Data, vol. 11, no. 4, pp. 2061-2071
Pub Date : 2024-12-26 DOI: 10.1109/TBDATA.2024.3522817
Shicheng Cui;Deqiang Li;Jing Zhang
Graph Neural Networks (GNNs) have proven useful for learning graph-based knowledge. However, one drawback of GNN techniques is that they may get stuck in the over-squashing problem. Recent studies attribute this to the message-passing paradigm, which may amplify certain local relations and distort long-range information in a given GNN. To alleviate this phenomenon, we propose a novel and general GNN framework, dubbed MC-GNN, which introduces a multi-channel neural architecture to learn and fuse multi-view graph-based information. The purpose of MC-GNN is to extract distinct channel-based graph features and adaptively adjust their importance. To this end, we use the Hilbert-Schmidt Independence Criterion (HSIC) to enlarge the disparity between the embeddings encoded by each channel, and employ an attention mechanism to fuse the embeddings with adaptive weight adjustment. MC-GNN can be instantiated with multiple GNN backbones, providing a solution for learning structural relations from a multi-view perspective. Experimental results demonstrate that the proposed MC-GNN is superior to the compared state-of-the-art GNN methods.
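A minimal empirical HSIC, using the standard biased estimator trace(KHLH)/(n-1)^2 with RBF kernels; the bandwidth and the toy data are our choices, not the paper's. Larger values indicate stronger dependence between two embedding sets, so an MC-GNN-style objective would push this term down across channels to keep their embeddings dissimilar.

```python
import numpy as np

def hsic(X, Y, sigma=1.0):
    """Biased empirical HSIC between two sets of embeddings (rows = samples)."""
    n = X.shape[0]
    def rbf(Z):
        # Pairwise squared distances -> Gaussian (RBF) kernel matrix.
        sq = ((Z[:, None, :] - Z[None, :, :]) ** 2).sum(-1)
        return np.exp(-sq / (2.0 * sigma ** 2))
    K, L = rbf(X), rbf(Y)
    H = np.eye(n) - np.ones((n, n)) / n  # centering matrix
    return np.trace(K @ H @ L @ H) / (n - 1) ** 2

rng = np.random.default_rng(0)
X = rng.normal(size=(256, 1))
dependent = hsic(X, X)                        # identical channels
independent = hsic(X, rng.normal(size=(256, 1)))
```

Identical inputs give a large HSIC, independent draws give a value near zero, and a constant input gives exactly zero because centering annihilates its kernel matrix.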
Title: MC-GNN: Multi-Channel Graph Neural Networks With Hilbert-Schmidt Independence Criterion
IEEE Transactions on Big Data, vol. 11, no. 4, pp. 2036-2045
Pub Date : 2024-12-25 DOI: 10.1109/TBDATA.2024.3522801
Otabek Sattarov;Jaeyoung Choi
With recent advancements in social network platforms, an overwhelming amount of information spreads rapidly, and it can be increasingly difficult to discern what is false or true. If false information proliferates widely, it can lead to undesirable outcomes. Hence, when we receive a piece of information, we can pose two questions: (i) Is the information true? (ii) If not, who initially spread it? The first is the rumor detection problem; the second is the rumor source detection problem. Rumor detection involves identifying and mitigating false or misleading information spread via various communication channels, particularly online platforms and social media. Rumors range from harmless ones to deliberately misleading content aimed at deceiving or manipulating audiences. Detecting misinformation is crucial for maintaining the integrity of information ecosystems and preventing harmful effects such as the spread of false beliefs, polarization, and even societal harm. It is therefore important to quickly identify such misinformation while simultaneously finding its source to block it from spreading through the network. However, most existing surveys have analyzed these two issues separately. In this work, we survey existing research on rumor detection and rumor source detection, together with joint detection approaches. Treating the two problems together makes their relationship observable and shows how they are similar and different. The limitations of rumor detection, rumor source detection, and their combination are also explained, and some challenges to be addressed in future work are presented.
Title: Detection of Rumors and Their Sources in Social Networks: A Comprehensive Survey
IEEE Transactions on Big Data, vol. 11, no. 3, pp. 1528-1547
Pub Date : 2024-12-25 DOI: 10.1109/TBDATA.2024.3522816
Delong Ma;Ye Yuan;Yanfeng Zhang;Chunze Cao;Yuliang Ma
Counting triangles is an important topic in many practical applications, such as anomaly detection, community search, and recommendation systems. For triangle counting in large and dynamic graphs, recent work has focused on distributed streaming algorithms. These works assume that the graph is processed in a single location, while in reality the graph stream may be generated and processed in geographically distributed datacenters. This raises new challenges for existing triangle counting algorithms, due to multi-level heterogeneity in network bandwidth and communication prices across geo-distributed datacenters. In this article, we propose a cost-aware framework named GeoTri, based on the Master-Worker-Aggregator architecture, which takes both cost and performance objectives into consideration for triangle counting in geo-distributed datacenters. The two core parts of this framework are the cost-aware node assignment strategy in the master, which is critical for determining node placement and distributing edges so as to reduce both time and monetary cost, and the cost-aware neighbor transfer strategy among workers, which further eliminates redundancy in data transfers. Additionally, we conduct extensive experiments on seven real-world graphs, and the results demonstrate that GeoTri significantly lowers both runtime and monetary cost while maintaining high accuracy and scalability.
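For intuition, the sequential baseline that such distributed frameworks parallelize can be sketched as a common-neighbor triangle count; the edge list below is illustrative.

```python
def count_triangles(edges):
    """Exact sequential triangle count. For each edge (u, v), the triangles
    through it are the common neighbors of u and v; each triangle is
    counted once per edge, hence the division by 3."""
    adj = {}
    for u, v in edges:
        adj.setdefault(u, set()).add(v)
        adj.setdefault(v, set()).add(u)
    return sum(len(adj[u] & adj[v]) for u, v in edges) // 3

# Illustrative edge list containing triangles {0, 1, 2} and {1, 2, 3}.
edges = [(0, 1), (1, 2), (0, 2), (2, 3), (1, 3)]
```

In a geo-distributed setting, the expensive part is not this computation but deciding which datacenter holds which nodes and which neighbor sets must cross links, since bandwidth and prices differ per link.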
{"title":"Cost-Aware Triangle Counting Over Geo-Distributed Datacenters","authors":"Delong Ma;Ye Yuan;Yanfeng Zhang;Chunze Cao;Yuliang Ma","doi":"10.1109/TBDATA.2024.3522816","DOIUrl":"https://doi.org/10.1109/TBDATA.2024.3522816","url":null,"abstract":"Counting triangles is an important topic in many practical applications, such as anomaly detection, community search, and recommendation systems. For triangle counting in large and dynamic graphs, recent work has focused on distributed streaming algorithms. These works assume that the graph is processed in a single location, while in reality the graph stream may be generated and processed in datacenters that are geographically distributed. This raises new challenges for existing triangle counting algorithms, due to the multi-level heterogeneity in network bandwidth and communication prices across geo-distributed datacenters. In this article, we propose a cost-aware framework named <inline-formula><tex-math>${\sf GeoTri}$</tex-math></inline-formula> based on the Master-Worker-Aggregator architecture, which takes both cost and performance objectives into consideration for triangle counting in geo-distributed datacenters. The two core parts of this framework are the cost-aware node assignment strategy in the master, which is critical for obtaining node positions and distributing edges reasonably to reduce cost (i.e., time cost and monetary cost), and the cost-aware neighbor transfer strategy among workers, which further eliminates redundancy in data transfers. 
Additionally, we conduct extensive experiments on seven real-world graphs, and the results demonstrate that <inline-formula><tex-math>${\sf GeoTri}$</tex-math></inline-formula> significantly lowers both runtime and monetary cost while exhibiting good accuracy and scalability.","PeriodicalId":13106,"journal":{"name":"IEEE Transactions on Big Data","volume":"11 4","pages":"2008-2024"},"PeriodicalIF":7.5,"publicationDate":"2024-12-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144597619","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
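The abstract above deals with counting triangles over a graph that arrives as an edge stream distributed across workers. The paper ships no code here, so as a minimal single-machine sketch of the core incremental step only (not the GeoTri framework itself; all names are illustrative assumptions): each new edge (u, v) closes one triangle per common neighbor of u and v.

```python
from collections import defaultdict

def stream_triangle_count(edge_stream):
    """Exact incremental triangle counting over an edge stream.

    On arrival of edge (u, v), every neighbor shared by u and v
    closes exactly one new triangle.
    """
    neighbors = defaultdict(set)
    triangles = 0
    for u, v in edge_stream:
        if u == v or v in neighbors[u]:
            continue  # skip self-loops and duplicate edges
        triangles += len(neighbors[u] & neighbors[v])
        neighbors[u].add(v)
        neighbors[v].add(u)
    return triangles

# A 4-clique contains exactly 4 triangles.
edges = [(0, 1), (0, 2), (0, 3), (1, 2), (1, 3), (2, 3)]
print(stream_triangle_count(edges))  # → 4
```

In the geo-distributed setting described by the paper, the interesting part is precisely what this sketch omits: deciding which datacenter holds each node's neighbor set so that the intersection step minimizes cross-datacenter traffic and its monetary cost.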
Pub Date: 2024-12-13 DOI: 10.1109/TBDATA.2024.3517313
Xinzhi Wang;Hang Yu;Jiayu Guo;Pengbo Li;Xiangfeng Luo
The massive volume of data in the modern business world requires fraud detection to be automated. Hence, researchers have modeled fraud scenarios as graph data and proposed graph-based fraud detection methods. These methods treat fraud detection as a binary node classification task; however, they ignore the differences among nodes of the same class. In this paper, we distinguish behavioral differences among nodes of the same class to improve the model’s ability to detect deviation, i.e., we perform a fine-grained classification of user behavior (into what we call prototypes) and propose an adaptive prototype-based graph neural network (APGNN) for fraud detection. APGNN learns node behavior representations by extracting both neighborhood and global information, supplying preliminary knowledge for the adaptive creation of several prototypes, each representing a distinct behavior pattern. Subsequently, a new loss function is employed to enhance the prototypes’ capacity to capture these behavior patterns and to amplify the feature differences between different prototypes. Nodes are then projected onto these prototypes to derive the final behavior patterns. Extensive experiments on four real-world datasets show that this method provides better fraud detection as well as more understandable results.
{"title":"Towards Fraud Detection via Fine-Grained Classification of User Behavior","authors":"Xinzhi Wang;Hang Yu;Jiayu Guo;Pengbo Li;Xiangfeng Luo","doi":"10.1109/TBDATA.2024.3517313","DOIUrl":"https://doi.org/10.1109/TBDATA.2024.3517313","url":null,"abstract":"The massive volume of data in the modern business world requires fraud detection to be automated. Hence, researchers have modeled fraud scenarios as graph data and proposed graph-based fraud detection methods. These methods treat fraud detection as a binary node classification task; however, they ignore the differences among nodes of the same class. In this paper, we distinguish behavioral differences among nodes of the same class to improve the model’s ability to detect deviation, i.e., we perform a fine-grained classification of user behavior (into what we call prototypes) and propose an adaptive prototype-based graph neural network (APGNN) for fraud detection. APGNN learns node behavior representations by extracting both neighborhood and global information, supplying preliminary knowledge for the adaptive creation of several prototypes, each representing a distinct behavior pattern. Subsequently, a new loss function is employed to enhance the prototypes’ capacity to capture these behavior patterns and to amplify the feature differences between different prototypes. Nodes are then projected onto these prototypes to derive the final behavior patterns. 
Extensive experiments on four real-world datasets show that this method provides better fraud detection as well as more understandable results.","PeriodicalId":13106,"journal":{"name":"IEEE Transactions on Big Data","volume":"11 4","pages":"1994-2007"},"PeriodicalIF":7.5,"publicationDate":"2024-12-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144597738","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
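APGNN's final step, as described above, projects each node's learned representation onto the nearest of several prototype vectors to assign it a behavior pattern. A minimal NumPy sketch of that nearest-prototype projection (a generic illustration under assumed Euclidean distance; the actual prototype learning, loss, and distance choice in APGNN are not reproduced here):

```python
import numpy as np

def assign_prototypes(embeddings, prototypes):
    """Assign each node embedding to its nearest prototype.

    embeddings: (n, d) array of node behavior representations
    prototypes: (k, d) array of learned behavior-pattern centers
    Returns an (n,) array of prototype indices.
    """
    # Squared Euclidean distance between every node and every prototype,
    # computed via broadcasting: result shape is (n, k).
    dists = ((embeddings[:, None, :] - prototypes[None, :, :]) ** 2).sum(axis=-1)
    return dists.argmin(axis=1)

# Two assumed prototypes (e.g., "normal" vs. "suspicious" behavior patterns).
protos = np.array([[0.0, 0.0], [10.0, 10.0]])
nodes = np.array([[0.5, -0.2], [9.0, 11.0], [0.1, 0.3]])
print(assign_prototypes(nodes, protos))  # → [0 1 0]
```

The point of the fine-grained scheme is that fraud and non-fraud each split into several such prototypes, so two nodes with the same binary label can still land on different behavior patterns.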