Pub Date: 2024-07-06 | DOI: 10.1007/s11280-024-01279-y
Hyper-relational knowledge graph neural network for next POI recommendation
Jixiao Zhang, Yongkang Li, Ruotong Zou, Jingyuan Zhang, Renhe Jiang, Zipei Fan, Xuan Song
With the advancement of mobile technology, Point of Interest (POI) recommendation systems in Location-Based Social Networks (LBSN) have brought numerous benefits to both users and companies. Many existing works employ Knowledge Graphs (KGs) to alleviate the data sparsity issue in LBSN. These approaches primarily focus on modeling the pair-wise relations in LBSN to enrich the semantics and thereby relieve the data sparsity issue. However, existing approaches seldom consider the hyper-relations in LBSN, such as the mobility relation (a 3-ary relation: user-POI-time), which makes it hard for models to exploit the semantics accurately. In addition, prior works overlook the rich structural information inherent in a KG, which consists of higher-order relations and can further alleviate the impact of data sparsity. To this end, we propose a Hyper-Relational Knowledge Graph Neural Network (HKGNN) model. In HKGNN, a Hyper-Relational Knowledge Graph (HKG) that models the LBSN data is constructed to maintain and exploit the rich semantics of hyper-relations. We then propose a Hypergraph Neural Network to utilize the structural information of the HKG in a cohesive way. In addition, a self-attention network is used to leverage sequential information and make personalized recommendations. Furthermore, side information, essential in reducing data sparsity by providing background knowledge of POIs, is not fully utilized in current methods. In light of this, we extend the current datasets with available side information to further lessen the impact of data sparsity. Results of experiments on four real-world LBSN datasets demonstrate the effectiveness of our approach compared to existing state-of-the-art methods. Our implementation is available at https://github.com/aeroplanepaper/HKG.
Pub Date: 2024-07-02 | DOI: 10.1007/s11280-024-01283-2
Bridging distribution gaps: invariant pattern discovery for dynamic graph learning
Yucheng Jin, Maoyi Wang, Yun Xiong, Zhizhou Ren, Cuiying Huo, Feng Zhu, Jiawei Zhang, Guangzhong Wang, Haoran Chen
Temporal graph networks (TGNs) have been proposed to facilitate learning on dynamic graphs, which are composed of interaction events among nodes. However, existing TGNs suffer from poor generalization under distribution shifts that occur over time. To improve generalization, it is vital to discover invariant patterns with stable predictive power across distributions. Invariant pattern discovery on dynamic graphs is non-trivial: TGNs compress the long-term history of interaction events into memory in an entangled way, which makes invariant patterns hard to isolate. Furthermore, TGNs process interaction events chronologically in batches to obtain up-to-date representations, and each batch, consisting of chronologically close events, lacks the diversity needed to identify invariance under distribution shifts. To tackle these challenges, we propose a novel method called Smile, which stands for Structural teMporal Invariant LEarning. Specifically, we first propose the disentangled graph memory network, which selectively extracts pattern information from long-term history through the disentangled memory gating and attention network. The interaction history approximator is further introduced to provide diverse interaction distributions efficiently. Smile guarantees prediction stability under diverse temporal-dynamic distributions by regularizing invariance under cross-time distribution interventions. Experimental results on real-world datasets demonstrate that Smile outperforms baselines, yielding substantial performance improvements.
Pub Date: 2024-07-02 | DOI: 10.1007/s11280-024-01282-3
Efficient base station deployment in specialized regions with splitting particle swarm optimization algorithm
Jiaying Shen, Donglin Zhu, Rui Li, Xingyun Zhu, Yuemai Zhang, Weijie Li, Changjun Zhou, Jun Zhang, Shi Cheng
Signal coverage quality and intensity distribution in complex environments pose a critical challenge, particularly in high-density personnel areas and specialized regions with intricate geographic features. Under the strain of communication congestion, the traditional two-dimensional base station model proves inadequate for this challenge. Addressing the intricacies of the scenario, this paper focuses on the conditionally constrained deployment of base stations in special areas. It introduces a Splitting Particle Swarm Optimization (SPSO) algorithm, which enhances the global optimization capability of the base algorithm by incorporating splitting and parameter adjustments. This refinement aims to meet the communication requirements of customers in complex scenarios. To better align with the real-world communication needs of base stations, simulation experiments are conducted in which the special region is either assigned fixed coordinates or generated at a random position. In these experiments, SPSO achieves maximum coverage rates of 99.24% and 99.00% with fewer target points, and 93.56% and 96.16% with more target points. These results validate the optimization capability of the SPSO algorithm, demonstrating its feasibility and effectiveness. Ablation experiments and comparisons with other algorithms further illustrate the advantages of SPSO.
Pub Date: 2024-07-01 | DOI: 10.1007/s11280-024-01284-1
Editorial on the Special Issue of the World Wide Web journal with selected papers from the 22nd International Conference on Web Information Systems Engineering (WISE)
Richard Chbeir, Helen Huang, Yannis Manolopoulos, Fabrizio Silvestri
Pub Date: 2024-06-28 | DOI: 10.1007/s11280-024-01276-1
When large language models meet personalization: perspectives of challenges and opportunities
Jin Chen, Zheng Liu, Xu Huang, Chenwang Wu, Qi Liu, Gangwei Jiang, Yuanhao Pu, Yuxuan Lei, Xiaolong Chen, Xingmei Wang, Kai Zheng, Defu Lian, Enhong Chen
The advent of large language models marks a revolutionary breakthrough in artificial intelligence. With the unprecedented scale of training data and model parameters, the capability of large language models has improved dramatically, leading to human-like performance in understanding, language synthesis, common-sense reasoning, and more. Such a major leap in general AI capacity will fundamentally change how personalization is conducted. For one thing, it will reform the way humans interact with personalization systems. Instead of being a passive medium of information filtering, like conventional recommender systems and search engines, large language models provide the foundation for active user engagement: users' requests can be proactively explored, and the information they require can be delivered in a natural, interactive, and explainable way. For another, it will considerably expand the scope of personalization, growing it from the sole function of collecting personalized information to the compound function of providing personalized services. By leveraging large language models as a general-purpose interface, personalization systems may compile users' requests into plans, call the functions of external tools (e.g., search engines, calculators, service APIs, etc.) to execute the plans, and integrate the tools' outputs to complete end-to-end personalization tasks. Today, large language models are still developing rapidly, whereas their application in personalization remains largely unexplored. We therefore consider this the right time to review the challenges in personalization and the opportunities to address them with large language models. In particular, we dedicate this perspective paper to discussing the following aspects: the development of and challenges for existing personalization systems, the newly emerged capabilities of large language models, and the potential ways of using large language models for personalization.
Pub Date: 2024-06-26 | DOI: 10.1007/s11280-024-01249-4
Federated learning for supervised cross-modal retrieval
Ang Li, Yawen Li, Yingxia Shao
In the last decade, the explosive surge in multi-modal data has propelled cross-modal retrieval to the forefront of information retrieval research. Exceptional cross-modal retrieval algorithms are crucial for meeting user requirements effectively and for supporting downstream tasks such as cross-modal recommendation and multi-modal content generation. Previous methods for cross-modal retrieval typically search for a single common subspace, neglecting the possibility of multiple common subspaces that may mutually reinforce each other in reality, which results in poor retrieval performance. To address this issue, we propose a Federated Supervised Cross-Modal Retrieval approach (FedSCMR), which leverages competition to learn the optimal common subspace and adaptively aggregates the common subspaces of multiple clients for dynamic global aggregation. To reduce the differences between modalities, FedSCMR minimizes the semantic discrimination and consistency in the common subspace, in addition to modeling semantic discrimination in the label space. Additionally, it minimizes modal discrimination and semantic invariance across common subspaces to strengthen cross-subspace constraints and promote learning of the optimal common subspace. In the aggregation stage for federated learning, we design an adaptive model aggregation scheme that dynamically and collaboratively evaluates each client's contribution based on data volume, data category, model loss, and mean average precision, to adaptively aggregate multi-party common subspaces. Experimental results on two publicly available datasets demonstrate that FedSCMR surpasses state-of-the-art cross-modal retrieval methods.
Pub Date: 2024-06-22 | DOI: 10.1007/s11280-024-01281-4
DPHM-Net: de-redundant multi-period hybrid modeling network for long-term series forecasting
Chengdong Zheng, Yuliang Shi, Wu Lee, Lin Cheng, Xinjun Wang, Zhongmin Yan, Fanyu Kong
Deep learning models have been widely applied to long-term forecasting and have achieved significant success; incorporating inductive biases such as periodicity to model multi-granularity representations of time series is a commonly employed design in forecasting methods. However, existing methods still face information redundancy when extracting the inductive bias and when learning multi-granularity features. Redundant information can impede the model's acquisition of a comprehensive temporal representation, thereby adversely impacting its predictive performance. To address these issues, we propose a De-redundant multi-Period Hybrid Modeling Network (DPHM-Net) that effectively eliminates redundant information from both the series inductive bias extraction mechanism and the multi-granularity series features in time series representation learning. In DPHM-Net, we propose an efficient time series representation learning process based on a period inductive bias and introduce the concept of de-redundancy among multiple time series into the representation learning process for a single time series. Additionally, we design a specialized gated unit to dynamically balance the elimination weights between series features and redundant semantic information. Extensive experiments on real-world datasets demonstrate the advanced performance and high efficiency of our method in long-term forecasting tasks against previous state-of-the-art methods.
Pub Date: 2024-06-20 | DOI: 10.1007/s11280-024-01262-7
Exploring highly concise and accurate text matching model with tiny weights
Yangchun Li, Danfeng Yan, Wei Jiang, Yuanqiang Cai, Zhihong Tian
In this paper, we propose a simple and general lightweight approach named AL-RE2 for text matching models, and conduct experiments on three well-studied benchmark datasets across the tasks of natural language inference and paraphrase identification. First, we explore the feasibility of dimensional compression of word embedding vectors using principal component analysis, and analyze how the information retained at different dimensions affects model accuracy. Balancing compression efficiency against information loss, we choose 128 dimensions to represent each word, which brings the model down to 1.6M parameters. Finally, we analyze in detail the feasibility of applying depthwise separable convolution instead of standard convolution to text matching. The experimental results show that our model's inference speed is at least 1.5 times faster and its parameter count 42.76% smaller compared to similarly performing models, while its accuracy on the SciTail dataset is state-of-the-art among all lightweight models.
Pub Date: 2024-06-11 | DOI: 10.1007/s11280-024-01277-0
Using knowledge graphs for audio retrieval: a case study on copyright infringement detection
Marco Montanaro, Antonio Maria Rinaldi, Cristiano Russo, Cristian Tommasino
Identifying cases of intellectual property violation in multimedia files poses significant challenges for the Internet infrastructure, especially when dealing with extensive document collections. Techniques used to tackle such issues typically fall into one of two groups: proactive and reactive approaches. This article introduces an approach combining both proactive and reactive solutions to remove illegal uploads from a platform without blocking legal uploads or modified versions of audio tracks, such as parodies, remixes, or other types of edits. To achieve this, we have developed a rule-based focused crawler specifically designed to detect copyright infringement in audio files, coupled with a visualization environment that maps the retrieved data onto a knowledge graph representing information extracted from the audio files. Our system automatically scans multimedia files uploaded to a public collection when a user submits a search query, performing an audio information retrieval task only on files deemed legal. We present experimental results from user queries on a large music collection: a subset of 25,000 songs and audio snippets obtained from the Free Music Archive library. Each returned audio track carries a Similarity Score, a metric we use to determine the quality of the adversarial searches executed by the system. We then discuss the effectiveness and efficiency of different settings of our proposed system.