HetFS: a method for fast similarity search with ad-hoc meta-paths on heterogeneous information networks
Pub Date : 2024-09-18, DOI: 10.1007/s11280-024-01303-1
Xuqi Mao, Zhenyi Chen, Zhenying He, Yinan Jing, Kai Zhang, X. Sean Wang
Numerous real-world information networks form Heterogeneous Information Networks (HINs) with diverse objects and relations represented as nodes and edges in heterogeneous graphs. Similarity between nodes quantifies how closely two nodes resemble each other, mainly depending on the similarity of the nodes they are connected to, recursively. Users may be interested in only specific types of connections in the similarity definition, represented as meta-paths, i.e., sequences of node and edge types. Existing Heterogeneous Graph Neural Network (HGNN)-based similarity search methods may accommodate meta-paths, but require retraining for different meta-paths. Conversely, existing path-based similarity search methods may switch flexibly between meta-paths but often suffer from lower accuracy, as they rely solely on path information. This paper proposes HetFS, a Fast Similarity method for ad-hoc queries with user-given meta-paths on Heterogeneous information networks. HetFS provides similarity results based on path information that satisfies the meta-path restriction, as well as node content. Extensive experiments demonstrate the effectiveness and efficiency of HetFS in addressing ad-hoc queries, outperforming state-of-the-art HGNNs and path-based approaches, and showing strong performance in downstream applications, including link prediction, node classification, and clustering.
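To make the path-based side of such a query concrete, the following is a minimal sketch of meta-path similarity in the spirit of PathSim: adjacency matrices are composed along a user-given meta-path and the resulting path counts are normalized. The matrices and the meta-path below are illustrative; HetFS additionally incorporates node content, which this sketch omits.

```python
# Minimal PathSim-style meta-path similarity sketch (illustrative only;
# HetFS also uses node content, not modeled here).
import numpy as np

def meta_path_matrix(adjacency_list):
    """Compose adjacency matrices along a meta-path to get path counts."""
    m = adjacency_list[0]
    for a in adjacency_list[1:]:
        m = m @ a
    return m

def pathsim(meta_path_adj, i, j):
    """PathSim: 2 * paths(i, j) / (paths(i, i) + paths(j, j))."""
    m = meta_path_adj
    denom = m[i, i] + m[j, j]
    return 0.0 if denom == 0 else 2.0 * m[i, j] / denom

# Toy example with a hypothetical Author-Paper-Author (APA) meta-path.
A_PA = np.array([[1, 0], [1, 1], [0, 1]])   # 3 papers x 2 authors
M_APA = meta_path_matrix([A_PA.T, A_PA])    # 2 x 2 author-author path counts
print(pathsim(M_APA, 0, 1))
```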
{"title":"HetFS: a method for fast similarity search with ad-hoc meta-paths on heterogeneous information networks","authors":"Xuqi Mao, Zhenyi Chen, Zhenying He, Yinan Jing, Kai Zhang, X. Sean Wang","doi":"10.1007/s11280-024-01303-1","DOIUrl":"https://doi.org/10.1007/s11280-024-01303-1","url":null,"abstract":"<p>Numerous real-world information networks form <b>H</b>eterogeneous <b>I</b>nformation <b>N</b>etworks (HINs) with diverse objects and relations represented as nodes and edges in heterogeneous graphs. Similarity between nodes quantifies how closely two nodes resemble each other, mainly depending on the similarity of the nodes they are connected to, recursively. Users may be interested in only specific types of connections in the similarity definition, represented as meta-paths, i.e., a sequence of node and edge types. Existing <b>H</b>eterogeneous <b>G</b>raph <b>N</b>eural <b>N</b>etwork (HGNN)-based similarity search methods may accommodate meta-paths, but require retraining for different meta-paths. Conversely, existing path-based similarity search methods may switch flexibly between meta-paths but often suffer from lower accuracy, as they rely solely on path information. This paper proposes HetFS, a <b>F</b>ast <b>S</b>imilarity method for ad-hoc queries with user-given meta-paths on <b>Het</b>erogeneous information networks. HetFS provides similarity results based on path information that satisfies the meta-path restriction, as well as node content. Extensive experiments demonstrate the effectiveness and efficiency of HetFS in addressing ad-hoc queries, outperforming state-of-the-art HGNNs and path-based approaches, and showing strong performance in downstream applications, including link prediction, node classification, and clustering.</p>","PeriodicalId":501180,"journal":{"name":"World Wide Web","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142260570","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
A SHAP-based controversy analysis through communities on Twitter
Pub Date : 2024-09-14, DOI: 10.1007/s11280-024-01278-z
Samy Benslimane, Thomas Papastergiou, Jérôme Azé, Sandra Bringay, Maximilien Servajean, Caroline Mollevi
Controversy encompasses content that draws diverse perspectives, along with positive and negative feedback on a specific event, resulting in the formation of distinct user communities. We explore the explainability of controversy through the lens of the SHAP (SHapley Additive exPlanations) method, aiming to provide a fair assessment of the individual contributions of different text features of tweets to controversy detection. We conduct an analysis of topic discussions on Twitter from a community perspective, investigating the role of text in accurately classifying tweets into their respective communities. To achieve this, we introduce a SHAP-based pipeline designed to quantify the influence of impactful text features on the predictions of three tweet classifiers. Text content alone achieves interesting controversy detection accuracy, as it contains predictive features for controversy detection. For instance, negative connotations, pejorative tendencies, and positive qualifying adjectives tend to influence the controversy detection model.
{"title":"A SHAP-based controversy analysis through communities on Twitter","authors":"Samy Benslimane, Thomas Papastergiou, Jérôme Azé, Sandra Bringay, Maximilien Servajean, Caroline Mollevi","doi":"10.1007/s11280-024-01278-z","DOIUrl":"https://doi.org/10.1007/s11280-024-01278-z","url":null,"abstract":"<p>Controversy encompasses content that draws diverse perspectives, along with positive and negative feedback on a specific event, resulting in the formation of distinct user communities. we explore the explainability of controversy through the lens of SHAP (SHapley Additive exPlanations) method, aiming to provide a fair assessment of the individual contributions of different text features of tweets to controversy detection. We conduct an analysis of topic discussions on Twitter from a community perspective, investigating the role of text in accurately classifying tweets into their respective communities. To achieve this, we introduce a SHAP-based pipeline designed to quantify the influence of impactful text features on the predictions of three tweet classifiers. Text content alone offers interesting controversy detection accuracy. It can contain predictive features for controversy detection. For instance, negative connotations, pejorative tendencies and positive qualifying adjectives tend to impact the controversy model detection.</p>","PeriodicalId":501180,"journal":{"name":"World Wide Web","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-09-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142260571","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
pFind: Privacy-preserving lost object finding in vehicular crowdsensing
Pub Date : 2024-09-13, DOI: 10.1007/s11280-024-01300-4
Yinggang Sun, Haining Yu, Xiang Li, Yizheng Yang, Xiangzhan Yu
Web 3.0 makes crowdsensing services more popular because of its decentralisation and interoperability. Lost Object Finding (LOF) in vehicular crowdsensing is an emerging paradigm in which vehicles act as detectors to find lost objects for their owners. To enjoy LOF services, object owners need to submit the tag IDs of their lost objects, and detectors need to report their detection results together with their locations. However, identity and location information are usually sensitive and can be used to infer the locations of lost objects or to track participating detectors, raising serious privacy concerns. In this paper, we study the privacy leakages associated with object finding and propose a privacy-preserving scheme, named pFind, for locating lost objects. The scheme allows owners to retrieve the locations of their lost objects while providing strong privacy protection for object owners, lost objects, and detectors. In pFind, we design an oblivious object detection protocol using the RBS cryptosystem, which simultaneously provides confidentiality, authentication, and integrity for lost object detection. Meanwhile, we propose a private location retrieval protocol to compute the approximate location of a lost object over encrypted data. We further propose two optimizations for pFind to enhance functionality and performance. Theoretical analysis and experimental evaluations show that pFind is secure, accurate, and efficient.
{"title":"pFind: Privacy-preserving lost object finding in vehicular crowdsensing","authors":"Yinggang Sun, Haining Yu, Xiang Li, Yizheng Yang, Xiangzhan Yu","doi":"10.1007/s11280-024-01300-4","DOIUrl":"https://doi.org/10.1007/s11280-024-01300-4","url":null,"abstract":"<p>Web 3.0 makes crowdsensing services more popular, because of its decentralisation and interoperability. Lost Object Finding (LOF) in vehicular crowdsensing is an emerging paradigm in which vehicles act as detectors to find lost objects for their owners. To enjoy LOF services, object owners need to submit the tag ID of his lost object, and then detectors need to update their detecting results together with their locations. But the identity and location information are usually sensitive, which can be used to infer the locations of lost objects, or track participant detectors. This raises serious privacy concerns. In this paper, we study the privacy leakages associated with object finding, and propose a privacy-preserving scheme, named pFind, for locating lost objects. This scheme allows owners to retrieve the locations of their lost objects and provides strong privacy protection for the object owners, lost objects, and detectors. In pFind, we design an oblivious object detection protocol by using RBS cryptosystem, which simultaneously provides confidentiality, authentication and integrity for lost objects detection. Meanwhile, we propose a private location retrieval protocol to compute the approximate location of a lost object over encrypted data. We further propose two optimizations for pFind to enhance functionality and performance. Theoretical analysis and experimental evaluations show that pFind is secure, accurate and efficient.</p>","PeriodicalId":501180,"journal":{"name":"World Wide Web","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-09-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142204977","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Use of prompt-based learning for code-mixed and code-switched text classification
Pub Date : 2024-09-09, DOI: 10.1007/s11280-024-01302-2
Pasindu Udawatta, Indunil Udayangana, Chathulanka Gamage, Ravi Shekhar, Surangika Ranathunga
Code-mixing and code-switching (CMCS) are prevalent phenomena observed in social media conversations and various other modes of communication. When developing applications such as sentiment analysers and hate-speech detectors that operate on this social media data, CMCS text poses challenges. Recent studies have demonstrated that prompt-based learning of pre-trained language models outperforms full fine-tuning across various tasks. Despite the growing interest in classifying CMCS text, the effectiveness of prompt-based learning for the task remains unexplored. This paper presents an extensive exploration of prompt-based learning for CMCS text classification and the first comprehensive analysis of the impact of the script on classifying CMCS text. Our study reveals that the performance in classifying CMCS text is significantly influenced by the inclusion of multiple scripts and the intensity of code-mixing. In response, we introduce a novel method, Dynamic+AdapterPrompt, which employs distinct models for each script, integrated with adapters. While DynamicPrompt captures the script-specific representation of the text, AdapterPrompt emphasizes capturing the task-oriented functionality. Our experiments on Sinhala-English, Kannada-English, and Hindi-English datasets for sentiment classification, hate-speech detection, and humour detection tasks show that our method outperforms strong fine-tuning baselines and basic prompting strategies.
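For readers unfamiliar with prompt-based learning, the sketch below shows the basic mechanism on code-mixed text: a template with a mask token is filled by a multilingual masked language model, and label (verbalizer) words are scored at the mask position. The model name, template, and verbalizer are assumptions; the paper's Dynamic+AdapterPrompt additionally uses script-specific models and adapters, which are not reproduced here.

```python
# Minimal prompt-based classification sketch for code-mixed text
# (model, template and verbalizer are illustrative assumptions).
import torch
from transformers import AutoTokenizer, AutoModelForMaskedLM

model_name = "xlm-roberta-base"                    # multilingual masked LM (assumption)
tok = AutoTokenizer.from_pretrained(model_name)
mlm = AutoModelForMaskedLM.from_pretrained(model_name)
verbalizer = {"positive": "good", "negative": "bad"}   # hypothetical label words

def classify(text):
    prompt = f"{text} Overall it was {tok.mask_token}."
    inputs = tok(prompt, return_tensors="pt")
    with torch.no_grad():
        logits = mlm(**inputs).logits
    mask_pos = (inputs["input_ids"][0] == tok.mask_token_id).nonzero()[0].item()
    scores = {}
    for label, word in verbalizer.items():
        word_id = tok.convert_tokens_to_ids(tok.tokenize(word))[0]
        scores[label] = logits[0, mask_pos, word_id].item()
    return max(scores, key=scores.get)

print(classify("machan the match was really bad"))  # toy code-mixed input
```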
{"title":"Use of prompt-based learning for code-mixed and code-switched text classification","authors":"Pasindu Udawatta, Indunil Udayangana, Chathulanka Gamage, Ravi Shekhar, Surangika Ranathunga","doi":"10.1007/s11280-024-01302-2","DOIUrl":"https://doi.org/10.1007/s11280-024-01302-2","url":null,"abstract":"<p>Code-mixing and code-switching (CMCS) are prevalent phenomena observed in social media conversations and various other modes of communication. When developing applications such as sentiment analysers and hate-speech detectors that operate on this social media data, CMCS text poses challenges. Recent studies have demonstrated that prompt-based learning of pre-trained language models outperforms full fine-tuning across various tasks. Despite the growing interest in classifying CMCS text, the effectiveness of prompt-based learning for the task remains unexplored. This paper presents an extensive exploration of prompt-based learning for CMCS text classification and the first comprehensive analysis of the impact of the script on classifying CMCS text. Our study reveals that the performance in classifying CMCS text is significantly influenced by the inclusion of multiple scripts and the intensity of code-mixing. In response, we introduce a novel method, <i>Dynamic+AdapterPrompt</i>, which employs distinct models for each script, integrated with adapters. While DynamicPrompt captures the script-specific representation of the text, AdapterPrompt emphasizes capturing the task-oriented functionality. Our experiments on Sinhala-English, Kannada-English, and Hindi-English datasets for sentiment classification, hate-speech detection, and humour detection tasks show that our method outperforms strong fine-tuning baselines and basic prompting strategies.</p>","PeriodicalId":501180,"journal":{"name":"World Wide Web","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-09-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142204976","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Drug traceability system based on semantic blockchain and on a reputation method
Pub Date : 2024-09-06, DOI: 10.1007/s11280-024-01301-3
Petar Kochovski, Maroua Masmoudi, Redouane Bouhamoum, Vlado Stankovski, Hajer Baazaoui, Chirine Ghedira, Dan Vodislav, Thamer Mecharnia
Drug traceability is a critical process involving monitoring and validation of the origin, quality, and safety of pharmaceutical products throughout their supply chain to prevent the distribution of counterfeit, substandard, or expired drugs that could harm patients. Traditional centralized solutions for drug traceability, relying on intermediaries and central authorities, introduce risks of data manipulation, corruption, and single points of failure. This work presents the design and implementation of a novel solution for decentralized drug traceability based on blockchain technology and on a reputation mechanism that operates on top of a trustworthy decentralized knowledge base, thus integrating three core technologies: blockchain, semantic, and reputation methods. Blockchain technologies ensure transparent and secure supply chain processes while providing a trustworthy estimation of the reputation of supply chain participants. Semantic technologies address drug data heterogeneity by ensuring interoperability and creating mappings between various data sources, including verifying the identities of the various users. Additionally, the reputation mechanism promotes transparency and accountability, as stakeholders contribute feedback on drug quality, authenticity, and reliability. This fosters a culture of trust and reliability, offering the drug supply chain an effective tool for continuous improvement and informed decision-making based on aggregated feedback, ultimately enhancing overall quality and safety throughout the distribution network. The design and implementation of the system, along with several evaluations, show the feasibility of the new semantic blockchain system in real-world scenarios and the improvement of the entities with a high reputation score. Our solution is more trustworthy, discouraging fraudulent activities as security is based on the various properties included in the semantic model.
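As a rough illustration of the feedback-based reputation idea, the sketch below maintains a smoothed reputation score per supply-chain participant from accumulated ratings; the smoothing scheme and field names are assumptions, not the paper's actual reputation formula.

```python
# Illustrative feedback-based reputation score (assumed formula, not the paper's).
from dataclasses import dataclass, field

@dataclass
class Participant:
    name: str
    feedback: list = field(default_factory=list)   # stakeholder ratings in [0, 1]

    def reputation(self, prior: float = 0.5, prior_weight: int = 5) -> float:
        """Smoothed mean rating: a prior keeps new participants from being
        judged on only one or two reviews."""
        total = prior * prior_weight + sum(self.feedback)
        return total / (prior_weight + len(self.feedback))

p = Participant("distributor-A", [1.0, 0.9, 0.8])
print(round(p.reputation(), 3))
```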
{"title":"Drug traceability system based on semantic blockchain and on a reputation method","authors":"Petar Kochovski, Maroua Masmoudi, Redouane Bouhamoum, Vlado Stankovski, Hajer Baazaoui, Chirine Ghedira, Dan Vodislav, Thamer Mecharnia","doi":"10.1007/s11280-024-01301-3","DOIUrl":"https://doi.org/10.1007/s11280-024-01301-3","url":null,"abstract":"<p>Drug traceability is a critical process involving monitoring and validation of the origin, quality, and safety of pharmaceutical products throughout their supply chain to prevent the distribution of counterfeit, substandard, or expired drugs that could harm patients. Traditional centralized solutions for drug traceability, relying on intermediaries and central authorities, introduce risks of data manipulation, corruption, and single points of failure. This work presents the design and implementation of a novel solution for decentralized drug traceability based on blockchain technology and on a reputation mechanism that operates on top of a trustworthy decentralized knowledge base, thus integrating three core technologies: blockchain, semantic, and reputation methods. Blockchain technologies ensure transparent and secure supply chain processes while providing a trustworthy estimation of the reputation of supply chain participants. Semantic technologies address drug data heterogeneity by ensuring interoperability and creating mappings between various data sources, including verifying the identities of the various users. Additionally, the reputation mechanism promotes transparency and accountability, as stakeholders contribute feedback on drug quality, authenticity, and reliability. This fosters a culture of trust and reliability, offering the drug supply chain an effective tool for continuous improvement and informed decision-making based on aggregated feedback, ultimately enhancing overall quality and safety throughout the distribution network. The design and implementation of the system, along with several evaluations, show the feasibility of the new semantic blockchain system in real-world scenarios and the improvement of the entities with a high reputation score. Our solution is more trustworthy, discouraging fraudulent activities as security is based on the various properties included in the semantic model.</p>","PeriodicalId":501180,"journal":{"name":"World Wide Web","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-09-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142204978","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Category-aware self-supervised graph neural network for session-based recommendation
Pub Date : 2024-09-02, DOI: 10.1007/s11280-024-01299-8
Dongjing Wang, Ruijie Du, Qimeng Yang, Dongjin Yu, Feng Wan, Xiaojun Gong, Guandong Xu, Shuiguang Deng
Session-based recommendation, which focuses on predicting the next behavior from anonymous sessions of behavior records, plays an important role in real-world applications. Most previous session-based recommendation approaches capture the preferences of users by modeling the behavior records between users and items within the current session. However, items' category information is not fully exploited, and existing works still suffer from the severe issue of data sparsity. In this work, we propose a novel session-based recommendation model, namely Category-aware Self-supervised Graph Neural Network (CSGNN), which adopts a pre-training layer to capture the features of items and categories as well as the correlations among them. In particular, we build a category-aware heterogeneous hypergraph composed of item nodes and category nodes, which enhances information learning in the current session. We then design item-level and category-level self-attention models to represent item and category information, respectively, and integrate users' global and local preferences for session-based recommendation. Finally, we incorporate self-supervised learning by constructing a category-aware session graph to further enhance the performance of CSGNN and alleviate the data sparsity problem. Comprehensive experiments are conducted on three real-world datasets, Nowplaying, Diginetica, and Tmall, and the results show that the proposed CSGNN achieves better performance than several state-of-the-art session-based recommendation baselines.
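To illustrate the category-aware hypergraph idea, the sketch below builds an incidence matrix in which each hyperedge connects the items of one anonymous session together with those items' category nodes; the exact hyperedge construction in CSGNN may differ, so treat this only as an illustrative assumption.

```python
# Illustrative category-aware hypergraph incidence matrix from sessions
# (the hyperedge definition is an assumption, not CSGNN's exact construction).
import numpy as np

sessions = [[0, 1, 2], [1, 3]]                     # item ids per anonymous session
item_category = {0: "A", 1: "A", 2: "B", 3: "B"}   # toy item-to-category map

items = sorted({i for s in sessions for i in s})
categories = sorted(set(item_category.values()))
node_index = {("item", i): k for k, i in enumerate(items)}
node_index.update({("cat", c): len(items) + k for k, c in enumerate(categories)})

H = np.zeros((len(node_index), len(sessions)), dtype=int)  # nodes x hyperedges
for e, sess in enumerate(sessions):
    for i in sess:
        H[node_index[("item", i)], e] = 1
        H[node_index[("cat", item_category[i])], e] = 1

print(H)
```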
{"title":"Category-aware self-supervised graph neural network for session-based recommendation","authors":"Dongjing Wang, Ruijie Du, Qimeng Yang, Dongjin Yu, Feng Wan, Xiaojun Gong, Guandong Xu, Shuiguang Deng","doi":"10.1007/s11280-024-01299-8","DOIUrl":"https://doi.org/10.1007/s11280-024-01299-8","url":null,"abstract":"<p>Session-based recommendation which focuses on predicting the next behavior according to anonymous sessions of behavior records plays an important role in real-world applications. Most previous session-based recommendation approaches capture the preferences of users by modeling the behavior records between users and items within current session. However, items’ category information is not fully exploited, while existing works are still suffering from the severe issue of data sparsity. In this work, we propose a novel session-based recommendation model, namely <b>C</b>ategory-aware <b>S</b>elf-supervised <b>G</b>raph <b>N</b>eural <b>N</b>etwork (namely CSGNN), which adopts a pre-training layer for capturing the features of items and categories, as well as the correlations among them. Especially, we build a category-aware heterogeneous hypergraph composed of item nodes and category nodes, which enhances the information learning in the current session. Then we design item-level and category-level self-attention models to represent the information of item and category, respectively, and integrate global and local preference of user for session-based recommendation. Finally, we combine self-supervised learning by constructing a category-aware session graph to further enhance the performance CSGNN and alleviate the data sparsity problem. Comprehensive experiments are conducted on three real-world datasets, Nowplaying, Diginetica, and Tmall, and the results show that the proposed model CSGNN achieves better performance than session-based recommendation baselines with several state-of-the-art approaches.</p>","PeriodicalId":501180,"journal":{"name":"World Wide Web","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-09-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142204998","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
MvHAAN: multi-view hierarchical attention adversarial network for person re-identification
Pub Date : 2024-08-22, DOI: 10.1007/s11280-024-01298-9
Lei Zhu, Weiren Yu, Xinghui Zhu, Chengyuan Zhang, Yangding Li, Shichao Zhang
Person re-identification (re-id) aims to recognize pedestrians across different camera views and has recently gained popularity in the computer vision area. Notwithstanding the progress existing methods have made in raising the matching rate, prevailing solutions still suffer from two issues that cannot be ignored: (I) multi-grained, view-consistent discriminative feature learning across multiple views has barely been explored, and (II) the latent non-linear correlation between multiple views is insufficiently captured. To this end, this article proposes a novel end-to-end unsupervised framework for the person re-id task, dubbed Multi-view Hierarchical Attention Adversarial Network (MvHAAN). The framework has two merits: first, a hierarchical attention mechanism in multi-view networks is presented to learn multi-grained, view-consistent discriminative features; second, a multi-view adversarial correlation learning strategy is employed to excavate complex non-linear correlations from all views simultaneously. To the best of our knowledge, this is an early attempt to marry multi-view deep correlation learning with adversarial learning to further reduce multi-view heterogeneity. Extensive evaluations on three person re-id benchmark datasets verify that the proposed method delivers superior unsupervised person re-id performance.
{"title":"MvHAAN: multi-view hierarchical attention adversarial network for person re-identification","authors":"Lei Zhu, Weiren Yu, Xinghui Zhu, Chengyuan Zhang, Yangding Li, Shichao Zhang","doi":"10.1007/s11280-024-01298-9","DOIUrl":"https://doi.org/10.1007/s11280-024-01298-9","url":null,"abstract":"<p>Person re-identification (re-id) aims to recognize pedestrians across different camera views, which enjoys popularity in computer vision area recently. Notwithstanding the progress achieved by existing methods in rising matching rate, the prevailing solutions still suffer from two impossible-to-ignore issues: (I) multi-grained view-consistent discriminative feature learning across multiple views has barely been explored, and (II) latent non-linear correlation between multiple views is insufficient captured. To this end, this article proposes a novel end-to-end unsupervised framework for person re-id task, dubbed Multi-view Hierarchical Attention Adversarial Network (MvHAAN). This framework enjoys two merits: First, a hierarchical attention mechanism in multi-view networks is present to learn multi-grained view-consistent discriminative features; Second, a multi-view adversarial correlation learning strategy is involved to excavate complex non-linear correlation from all views simultaneously. To the best of our knowledge, it is the early attempt of marrying multi-view deep correlation learning with adversarial learning to further reduce multi-view heterogeneity. Extensive evaluations on three person re-id benchmark datasets verify that the proposed method delivers superior performance of unsupervised person re-id.</p>","PeriodicalId":501180,"journal":{"name":"World Wide Web","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-08-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142204980","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
A survey on large language models for recommendation
Pub Date : 2024-08-22, DOI: 10.1007/s11280-024-01291-2
Likang Wu, Zhi Zheng, Zhaopeng Qiu, Hao Wang, Hongchao Gu, Tingjia Shen, Chuan Qin, Chen Zhu, Hengshu Zhu, Qi Liu, Hui Xiong, Enhong Chen
Large Language Models (LLMs) have emerged as powerful tools in the field of Natural Language Processing (NLP) and have recently gained significant attention in the domain of Recommendation Systems (RS). These models, trained on massive amounts of data using self-supervised learning, have demonstrated remarkable success in learning universal representations and have the potential to enhance various aspects of recommendation systems through effective transfer techniques such as fine-tuning and prompt tuning. The crucial aspect of harnessing the power of language models to enhance recommendation quality is the use of their high-quality representations of textual features and their extensive coverage of external knowledge to establish correlations between items and users. To provide a comprehensive understanding of existing LLM-based recommendation systems, this survey presents a taxonomy that categorizes these models into two major paradigms, Discriminative LLM for Recommendation (DLLM4Rec) and Generative LLM for Recommendation (GLLM4Rec), with the latter being systematically surveyed for the first time. Furthermore, we systematically review and analyze existing LLM-based recommendation systems within each paradigm, providing insights into their methodologies, techniques, and performance. Additionally, we identify key challenges and several valuable findings to provide researchers and practitioners with inspiration. We have also created a GitHub repository to index relevant papers and resources on LLMs for recommendation (https://github.com/WLiK/LLM4Rec-Awesome-Papers).
{"title":"A survey on large language models for recommendation","authors":"Likang Wu, Zhi Zheng, Zhaopeng Qiu, Hao Wang, Hongchao Gu, Tingjia Shen, Chuan Qin, Chen Zhu, Hengshu Zhu, Qi Liu, Hui Xiong, Enhong Chen","doi":"10.1007/s11280-024-01291-2","DOIUrl":"https://doi.org/10.1007/s11280-024-01291-2","url":null,"abstract":"<p>Large Language Models (LLMs) have emerged as powerful tools in the field of Natural Language Processing (NLP) and have recently gained significant attention in the domain of Recommendation Systems (RS). These models, trained on massive amounts of data using self-supervised learning, have demonstrated remarkable success in learning universal representations and have the potential to enhance various aspects of recommendation systems by some effective transfer techniques such as fine-tuning, prompt tuning, etc. The crucial aspect of harnessing the power of language models in enhancing recommendation quality is the utilization of their high-quality representations of textual features and their extensive coverage of external knowledge to establish correlations between items and users. To provide a comprehensive understanding of the existing LLM-based recommendation systems, this survey presents a taxonomy that categorizes these models into two major paradigms, respectively Discriminative LLM for Recommendation (DLLM4Rec) and Generative LLM for Recommendation (GLLM4Rec), with the latter being systematically sorted out for the first time. Furthermore, we systematically review and analyze existing LLM-based recommendation systems within each paradigm, providing insights into their methodologies, techniques, and performance. Additionally, we identify key challenges and several valuable findings to provide researchers and practitioners with inspiration. We have also created a GitHub repository to index relevant papers and resources on LLMs for recommendation (https://github.com/WLiK/LLM4Rec-Awesome-Papers).</p>","PeriodicalId":501180,"journal":{"name":"World Wide Web","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-08-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142204994","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Multi-stage enhanced representation learning for document reranking based on query view
Pub Date : 2024-08-21, DOI: 10.1007/s11280-024-01296-x
Hai Liu, Xiaozhi Zhu, Yong Tang, Chaobo He, Tianyong Hao
Large language models are able to implicitly extract informative semantic features from queries and candidate documents to achieve impressive reranking performance. However, such models rely on their large number of parameters to do so, and it is not known exactly what semantic information has been learned. In this paper, we propose a multi-stage enhanced representation learning method based on query view (MERL), with an intra-query stage and an inter-query stage, to guide the model to explicitly learn the semantic relationship between the query and documents. In the intra-query training stage, a content-based contrastive learning module that does not rely on the special [CLS] token of BERT is utilized to optimize the semantic similarity of a query and its relevant documents. In the inter-query training stage, an entity-oriented masked query prediction task for establishing a semantic relation between query-document pairs and an inter-query contrastive learning module for extracting similar matching patterns of query-relevant documents are employed. Extensive experiments on the MS MARCO passage ranking and TREC DL datasets show that the MERL method obtains significant improvements with a low number of parameters compared to the baseline models.
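As background for the contrastive modules, the sketch below shows a standard in-batch InfoNCE-style objective that pulls a query embedding toward its relevant document and pushes it away from the other documents in the batch; MERL's intra-query module is content-based and avoids the [CLS] token, so this is only a generic illustration.

```python
# Generic in-batch contrastive (InfoNCE-style) loss for query-document pairs.
import torch
import torch.nn.functional as F

def contrastive_loss(query_emb, doc_emb, temperature=0.05):
    """query_emb, doc_emb: (batch, dim); row i of doc_emb is the positive
    document for row i of query_emb, all other rows act as negatives."""
    q = F.normalize(query_emb, dim=-1)
    d = F.normalize(doc_emb, dim=-1)
    logits = q @ d.t() / temperature            # (batch, batch) similarities
    targets = torch.arange(q.size(0))           # diagonal entries are positives
    return F.cross_entropy(logits, targets)

loss = contrastive_loss(torch.randn(8, 768), torch.randn(8, 768))
print(loss.item())
```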
{"title":"Multi-stage enhanced representation learning for document reranking based on query view","authors":"Hai Liu, Xiaozhi Zhu, Yong Tang, Chaobo He, Tianyong Hao","doi":"10.1007/s11280-024-01296-x","DOIUrl":"https://doi.org/10.1007/s11280-024-01296-x","url":null,"abstract":"<p>The large-size language model is able to implicitly extract informative semantic features from queries and candidate documents to achieve impressive reranking performance. However, the large model relies on its own large number of parameters to achieve it and it is not known exactly what semantic information has been learned. In this paper, we propose a multi-stage enhanced representation learning method based on Query-View (MERL) with Intra-query stage and Inter-query stage to guide the model to explicitly learn the semantic relationship between the query and documents. In the Intra-query training stage, a content-based contrastive learning module without considering the special token [CLS] of BERT is utilized to optimize the semantic similarity of query and relevant documents. In the Inter-query training stage, an entity-oriented masked query prediction for establish a semantic relation of query-document pairs and an Inter-query contrastive learning module for extracting similar matching pattern of query-relevant documents are employed. Extensive experiments on MS MARCO passage ranking and TREC DL datasets show that the MERL method obtain significant improvements with a low number of parameters compared to the baseline models.</p>","PeriodicalId":501180,"journal":{"name":"World Wide Web","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-08-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142204995","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
LLMs for knowledge graph construction and reasoning: recent capabilities and future opportunities
Pub Date : 2024-08-21, DOI: 10.1007/s11280-024-01297-w
Yuqi Zhu, Xiaohan Wang, Jing Chen, Shuofei Qiao, Yixin Ou, Yunzhi Yao, Shumin Deng, Huajun Chen, Ningyu Zhang
This paper presents an exhaustive quantitative and qualitative evaluation of Large Language Models (LLMs) for Knowledge Graph (KG) construction and reasoning. We conduct experiments across eight diverse datasets, focusing on four representative tasks encompassing entity and relation extraction, event extraction, link prediction, and question answering, thereby thoroughly exploring LLMs' performance in the domain of KG construction and inference. Empirically, our findings suggest that LLMs, represented by GPT-4, are better suited as inference assistants than as few-shot information extractors. Specifically, while GPT-4 exhibits good performance in tasks related to KG construction, it excels further in reasoning tasks, surpassing fine-tuned models in certain cases. Moreover, our investigation extends to the potential generalization ability of LLMs for information extraction, leading to the proposition of a Virtual Knowledge Extraction task and the development of the corresponding VINE dataset. Based on these empirical findings, we further propose AutoKG, a multi-agent approach employing LLMs and external sources for KG construction and reasoning. We anticipate that this research can provide invaluable insights for future undertakings in the field of knowledge graphs.
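As a rough illustration of the kind of zero-shot extraction the paper evaluates, the sketch below prompts an LLM for (head, relation, tail) triples and parses the response; the prompt wording and the call_llm helper are hypothetical placeholders for whichever chat-completion client is actually used.

```python
# Illustrative zero-shot triple extraction via an LLM prompt
# (prompt wording and call_llm are hypothetical placeholders).
PROMPT_TEMPLATE = """Extract knowledge triples from the text.
Return one (head, relation, tail) triple per line.

Text: {text}
Triples:"""

def extract_triples(text, call_llm):
    """call_llm: a function str -> str wrapping the chosen LLM API."""
    response = call_llm(PROMPT_TEMPLATE.format(text=text))
    triples = []
    for line in response.splitlines():
        parts = [p.strip(" ()") for p in line.split(",")]
        if len(parts) == 3:
            triples.append(tuple(parts))
    return triples

# Usage with any backend, e.g. extract_triples(sentence, my_gpt4_client)
```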
{"title":"LLMs for knowledge graph construction and reasoning: recent capabilities and future opportunities","authors":"Yuqi Zhu, Xiaohan Wang, Jing Chen, Shuofei Qiao, Yixin Ou, Yunzhi Yao, Shumin Deng, Huajun Chen, Ningyu Zhang","doi":"10.1007/s11280-024-01297-w","DOIUrl":"https://doi.org/10.1007/s11280-024-01297-w","url":null,"abstract":"<p>This paper presents an exhaustive quantitative and qualitative evaluation of Large Language Models (LLMs) for Knowledge Graph (KG) construction and reasoning. We engage in experiments across eight diverse datasets, focusing on four representative tasks encompassing entity and relation extraction, event extraction, link prediction, and question-answering, thereby thoroughly exploring LLMs’ performance in the domain of construction and inference. Empirically, our findings suggest that LLMs, represented by GPT-4, are more suited as inference assistants rather than few-shot information extractors. Specifically, while GPT-4 exhibits good performance in tasks related to KG construction, it excels further in reasoning tasks, surpassing fine-tuned models in certain cases. Moreover, our investigation extends to the potential generalization ability of LLMs for information extraction, leading to the proposition of a Virtual Knowledge Extraction task and the development of the corresponding VINE dataset. Based on these empirical findings, we further propose <b>AutoKG</b>, a multi-agent-based approach employing LLMs and external sources for KG construction and reasoning. We anticipate that this research can provide invaluable insights for future undertakings in the field of knowledge graphs.</p>","PeriodicalId":501180,"journal":{"name":"World Wide Web","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-08-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142204996","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}