首页 > 最新文献

IEEE Transactions on Computational Social Systems最新文献

英文 中文
Counteracting Popularity Bias in Multimedia Web API Recommendation 抵制多媒体Web API推荐中的流行偏见
IF 4.5 2区 计算机科学 Q1 COMPUTER SCIENCE, CYBERNETICS Pub Date : 2025-01-30 DOI: 10.1109/TCSS.2024.3517601
Dengshuai Zhai;Chao Yan;Weiyi Zhong;Shaoqi Ding;Lianyong Qi;Xiaokang Zhou
With the widespread adoption of multimedia web APIs (API) in web and mobile applications, a substantial proliferation of these APIs is observed. These APIs have streamlined development processes, reducing both time and costs. Nevertheless, identifying the required APIs from the vast array of options has emerged as a significant challenge. Collaborative filtering (CF)-based recommendation technologies have demonstrated their efficiency in presenting developers with potentially useful APIs. However, these methods often suffer from popularity bias, i.e., popular APIs tend to dominate the recommendation lists. This imbalance in recommendation opportunities among APIs hinders the growth of the multimedia API ecosystem. To mitigate the popularity bias produced by CF-based API recommendation methods, this article introduces a novel debiasing strategy that combines a log postprocessing adjustment (LPA) with determinant point process (DPP). Specifically, the LPA is employed during the prediction phase to yield a more balanced set of candidate APIs. Then, DPP is utilized to generate recommendation lists that are not just relevant but also diverse in terms of API popularity. Experimental results reveal that our proposed method surpasses existing state-of-the-art approaches in multimedia API recommendation, excelling in both accuracy and the capability to mitigate popularity bias effectively.
随着多媒体网络API (API)在网络和移动应用程序中的广泛采用,这些API的大量扩散被观察到。这些api简化了开发过程,减少了时间和成本。然而,从大量选项中确定所需的api已经成为一项重大挑战。基于协同过滤(CF)的推荐技术在向开发人员提供潜在有用的api方面已经证明了它们的效率。然而,这些方法经常受到流行偏差的影响,即流行的api往往会主导推荐列表。API之间推荐机会的不平衡阻碍了多媒体API生态系统的发展。为了减轻基于cf的API推荐方法产生的流行偏差,本文介绍了一种结合日志后处理调整(LPA)和决定点过程(DPP)的新型去偏策略。具体来说,在预测阶段使用LPA来产生一组更平衡的候选api。然后,利用DPP生成推荐列表,这些推荐列表不仅具有相关性,而且在API流行度方面也具有多样性。实验结果表明,我们提出的方法在多媒体API推荐中超越了现有的最先进的方法,在准确性和有效减轻流行偏差的能力方面都表现出色。
{"title":"Counteracting Popularity Bias in Multimedia Web API Recommendation","authors":"Dengshuai Zhai;Chao Yan;Weiyi Zhong;Shaoqi Ding;Lianyong Qi;Xiaokang Zhou","doi":"10.1109/TCSS.2024.3517601","DOIUrl":"https://doi.org/10.1109/TCSS.2024.3517601","url":null,"abstract":"With the widespread adoption of multimedia web APIs (API) in web and mobile applications, a substantial proliferation of these APIs is observed. These APIs have streamlined development processes, reducing both time and costs. Nevertheless, identifying the required APIs from the vast array of options has emerged as a significant challenge. Collaborative filtering (CF)-based recommendation technologies have demonstrated their efficiency in presenting developers with potentially useful APIs. However, these methods often suffer from popularity bias, i.e., popular APIs tend to dominate the recommendation lists. This imbalance in recommendation opportunities among APIs hinders the growth of the multimedia API ecosystem. To mitigate the popularity bias produced by CF-based API recommendation methods, this article introduces a novel debiasing strategy that combines a log postprocessing adjustment (LPA) with determinant point process (DPP). Specifically, the LPA is employed during the prediction phase to yield a more balanced set of candidate APIs. Then, DPP is utilized to generate recommendation lists that are not just relevant but also diverse in terms of API popularity. Experimental results reveal that our proposed method surpasses existing state-of-the-art approaches in multimedia API recommendation, excelling in both accuracy and the capability to mitigate popularity bias effectively.","PeriodicalId":13044,"journal":{"name":"IEEE Transactions on Computational Social Systems","volume":"12 5","pages":"3858-3868"},"PeriodicalIF":4.5,"publicationDate":"2025-01-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145230020","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
IEEE Transactions on Computational Social Systems Publication Information IEEE计算社会系统汇刊信息
IF 4.5 2区 计算机科学 Q1 COMPUTER SCIENCE, CYBERNETICS Pub Date : 2025-01-28 DOI: 10.1109/TCSS.2025.3531587
{"title":"IEEE Transactions on Computational Social Systems Publication Information","authors":"","doi":"10.1109/TCSS.2025.3531587","DOIUrl":"https://doi.org/10.1109/TCSS.2025.3531587","url":null,"abstract":"","PeriodicalId":13044,"journal":{"name":"IEEE Transactions on Computational Social Systems","volume":"12 1","pages":"C2-C2"},"PeriodicalIF":4.5,"publicationDate":"2025-01-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10856570","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143361026","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
IEEE Systems, Man, and Cybernetics Society Information IEEE系统、人与控制论学会信息
IF 4.5 2区 计算机科学 Q1 COMPUTER SCIENCE, CYBERNETICS Pub Date : 2025-01-28 DOI: 10.1109/TCSS.2025.3531589
{"title":"IEEE Systems, Man, and Cybernetics Society Information","authors":"","doi":"10.1109/TCSS.2025.3531589","DOIUrl":"https://doi.org/10.1109/TCSS.2025.3531589","url":null,"abstract":"","PeriodicalId":13044,"journal":{"name":"IEEE Transactions on Computational Social Systems","volume":"12 1","pages":"C3-C3"},"PeriodicalIF":4.5,"publicationDate":"2025-01-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10856572","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143106470","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
IEEE Transactions on Computational Social Systems Information for Authors IEEE计算社会系统信息汇刊
IF 4.5 2区 计算机科学 Q1 COMPUTER SCIENCE, CYBERNETICS Pub Date : 2025-01-28 DOI: 10.1109/TCSS.2025.3531591
{"title":"IEEE Transactions on Computational Social Systems Information for Authors","authors":"","doi":"10.1109/TCSS.2025.3531591","DOIUrl":"https://doi.org/10.1109/TCSS.2025.3531591","url":null,"abstract":"","PeriodicalId":13044,"journal":{"name":"IEEE Transactions on Computational Social Systems","volume":"12 1","pages":"C4-C4"},"PeriodicalIF":4.5,"publicationDate":"2025-01-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10856569","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143106471","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Creating Healthier Living Environments: The Role of Soundscapes in Promoting Mental Health and Well-Being 创造更健康的生活环境:声景在促进心理健康和幸福中的作用
IF 4.5 2区 计算机科学 Q1 COMPUTER SCIENCE, CYBERNETICS Pub Date : 2025-01-28 DOI: 10.1109/TCSS.2025.3530618
Ziwen Sun;Jian Kang;Kun Qian;Björn W. Schuller;Bin Hu
{"title":"Creating Healthier Living Environments: The Role of Soundscapes in Promoting Mental Health and Well-Being","authors":"Ziwen Sun;Jian Kang;Kun Qian;Björn W. Schuller;Bin Hu","doi":"10.1109/TCSS.2025.3530618","DOIUrl":"https://doi.org/10.1109/TCSS.2025.3530618","url":null,"abstract":"","PeriodicalId":13044,"journal":{"name":"IEEE Transactions on Computational Social Systems","volume":"12 1","pages":"2-10"},"PeriodicalIF":4.5,"publicationDate":"2025-01-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10856573","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143360871","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Understanding Inciting Speech as New Malice 理解煽动性言论是一种新的恶意
IF 4.5 2区 计算机科学 Q1 COMPUTER SCIENCE, CYBERNETICS Pub Date : 2025-01-27 DOI: 10.1109/TCSS.2024.3504357
Vaibhav Garg;Ganning Xu;Munindar P. Singh
Inciting speech seeks to instill hostility or anger in readers or motivate them to take action against a target group. Whereas hate speech in social media has garnered much attention, inciting speech has not been well studied in domains such as religion. We address two aspects of religious incitement: 1) what rhetorical strategies are used in it?; and 2) do the same strategies apply across disparate social contexts and targets? We identify inciting speech against Muslims but demonstrate the generality of the construct vis à vis other targets. We adopt existing datasets of Islamophobic WhatsApp posts and hateful and offensive posts (Twitter and Gab) against other targets. Our methods include: 1) qualitative analysis revealing rhetorical strategies; and 2) an iterative process to label the data, yielding a tool to detect incitement. Incitement applies three rhetorical strategies focused, respectively, on the target group's identity, their imputed misdeeds, and an exhortation to act against them. These strategies carry distinct textual signatures. Our tool (with additional verification) reveals that inciting sentences appear in non-Islamophobic posts and in other contexts (e.g., posts against certain gender identities), indicating the generality of incitement as a concept. Incitement reflects a wide swath of malicious speech omitted from traditional analyses. Understanding and identifying incitement can facilitate online moderation and thus concomitantly reduce harm in real life.
煽动性言论旨在向读者灌输敌意或愤怒,或激励他们对目标群体采取行动。尽管社交媒体上的仇恨言论引起了广泛关注,但在宗教等领域,煽动言论还没有得到很好的研究。我们讨论宗教煽动的两个方面:1)在宗教煽动中使用了什么修辞策略?2)同样的策略是否适用于不同的社会背景和目标?我们确定了针对穆斯林的煽动性言论,但展示了针对其他目标的结构的普遍性。我们采用针对其他目标的仇视伊斯兰的WhatsApp帖子和仇恨和攻击性帖子(Twitter和Gab)的现有数据集。我们的研究方法包括:1)定性分析揭示修辞策略;2)标记数据的迭代过程,从而产生检测刺激的工具。煽动运用了三种修辞策略,分别聚焦于目标群体的身份,他们的罪责,以及对他们采取行动的劝告。这些策略带有不同的文本签名。我们的工具(经过额外验证)显示,煽动语句出现在非伊斯兰恐惧症的帖子和其他背景下(例如,反对某些性别认同的帖子),表明煽动作为一个概念的普遍性。煽动性反映了传统分析中遗漏的大量恶意言论。了解和识别煽动可以促进在线节制,从而减少现实生活中的伤害。
{"title":"Understanding Inciting Speech as New Malice","authors":"Vaibhav Garg;Ganning Xu;Munindar P. Singh","doi":"10.1109/TCSS.2024.3504357","DOIUrl":"https://doi.org/10.1109/TCSS.2024.3504357","url":null,"abstract":"Inciting speech seeks to instill hostility or anger in readers or motivate them to take action against a target group. Whereas hate speech in social media has garnered much attention, inciting speech has not been well studied in domains such as religion. We address two aspects of religious incitement: 1) what rhetorical strategies are used in it?; and 2) do the same strategies apply across disparate social contexts and targets? We identify inciting speech against Muslims but demonstrate the generality of the construct vis à vis other targets. We adopt existing datasets of Islamophobic WhatsApp posts and hateful and offensive posts (Twitter and Gab) against other targets. Our methods include: 1) qualitative analysis revealing rhetorical strategies; and 2) an iterative process to label the data, yielding a tool to detect incitement. Incitement applies three rhetorical strategies focused, respectively, on the target group's identity, their imputed misdeeds, and an exhortation to act against them. These strategies carry distinct textual signatures. Our tool (with additional verification) reveals that inciting sentences appear in non-Islamophobic posts and in other contexts (e.g., posts against certain gender identities), indicating the generality of incitement as a concept. Incitement reflects a wide swath of malicious speech omitted from traditional analyses. Understanding and identifying incitement can facilitate online moderation and thus concomitantly reduce harm in real life.","PeriodicalId":13044,"journal":{"name":"IEEE Transactions on Computational Social Systems","volume":"12 3","pages":"947-956"},"PeriodicalIF":4.5,"publicationDate":"2025-01-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144179157","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Graph-Based Methods for Multimodal Indoor Activity Recognition: A Comprehensive Survey 基于图的多模式室内活动识别方法综述
IF 4.5 2区 计算机科学 Q1 COMPUTER SCIENCE, CYBERNETICS Pub Date : 2025-01-16 DOI: 10.1109/TCSS.2024.3523240
Saeedeh Javadi;Daniele Riboni;Luigi Borzì;Samaneh Zolfaghari
This survey article explores graph-based approaches to multimodal human activity recognition in indoor environments, emphasizing their relevance to advancing multimodal representation and reasoning. With the growing importance of integrating diverse data sources such as sensor events, contextual information, and spatial data, effective human activity recognition methods are essential for applications in smart homes, digital health, and more. We review various graph-based techniques, highlighting their strengths in encoding complex relationships and improving activity recognition performance. Furthermore, we discuss the computational efficiencies and generalization capabilities of these methods across different environments. By providing a comprehensive overview of the state-of-the-art in graph-based human activity recognition, this article aims to contribute to the development of more accurate, interpretable, and robust multimodal systems for understanding human activities in indoor settings.
这篇调查文章探讨了室内环境中基于图的多模态人类活动识别方法,强调了它们与推进多模态表示和推理的相关性。随着集成各种数据源(如传感器事件、上下文信息和空间数据)的重要性日益增加,有效的人类活动识别方法对于智能家居、数字健康等应用至关重要。我们回顾了各种基于图形的技术,强调了它们在编码复杂关系和提高活动识别性能方面的优势。此外,我们还讨论了这些方法在不同环境下的计算效率和泛化能力。通过对基于图形的人类活动识别技术的全面概述,本文旨在为开发更准确、可解释和健壮的多模式系统做出贡献,以理解室内环境中的人类活动。
{"title":"Graph-Based Methods for Multimodal Indoor Activity Recognition: A Comprehensive Survey","authors":"Saeedeh Javadi;Daniele Riboni;Luigi Borzì;Samaneh Zolfaghari","doi":"10.1109/TCSS.2024.3523240","DOIUrl":"https://doi.org/10.1109/TCSS.2024.3523240","url":null,"abstract":"This survey article explores graph-based approaches to multimodal human activity recognition in indoor environments, emphasizing their relevance to advancing multimodal representation and reasoning. With the growing importance of integrating diverse data sources such as sensor events, contextual information, and spatial data, effective human activity recognition methods are essential for applications in smart homes, digital health, and more. We review various graph-based techniques, highlighting their strengths in encoding complex relationships and improving activity recognition performance. Furthermore, we discuss the computational efficiencies and generalization capabilities of these methods across different environments. By providing a comprehensive overview of the state-of-the-art in graph-based human activity recognition, this article aims to contribute to the development of more accurate, interpretable, and robust multimodal systems for understanding human activities in indoor settings.","PeriodicalId":13044,"journal":{"name":"IEEE Transactions on Computational Social Systems","volume":"12 5","pages":"3728-3746"},"PeriodicalIF":4.5,"publicationDate":"2025-01-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10843961","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145255997","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Incorporating Molecular Knowledge in Large Language Models via Multimodal Modeling 通过多模态建模将分子知识纳入大型语言模型
IF 4.5 2区 计算机科学 Q1 COMPUTER SCIENCE, CYBERNETICS Pub Date : 2025-01-13 DOI: 10.1109/TCSS.2024.3506158
Zekun Yang;Kun Lv;Jian Shu;Zheng Li;Ping Xiao
In recent years, large language models (LLMs) represented by GPT-4 have achieved tremendous success in natural language-centered tasks. Nevertheless, LLMs face inherent challenges in tasks involving both natural language and molecular modalities. Although there has been some research progress on these tasks, two challenges remain unresolved: modeling the differences in representation format between natural language and molecular modalities, and capturing subtle differences under an instruction-tuned paradigm. To address these two challenges, this article proposes a two-stage training framework to build molecular knowledge-enhanced LLM, named Mol-LLM. The first stage utilizes the multitask instruction tuning method to tackle the modality differences between natural language and molecular sequence. The second stage is the direct preference optimization training strategy with three random preference actions to capture subtle differences under the instruction-tuned paradigm. Extensive experiments have demonstrated the state-of-the-art performances of the model Mol-LLM proposed in this study, including mol2mol, text2text, mol2text, and text2mol task. The effectiveness of the module has been further verified through ablation studies, and the generalizability has been confirmed by additional supportive experiments.
近年来,以GPT-4为代表的大型语言模型(llm)在以自然语言为中心的任务中取得了巨大成功。然而,法学硕士在涉及自然语言和分子模式的任务中面临着固有的挑战。尽管在这些任务上已经取得了一些研究进展,但仍有两个挑战尚未解决:自然语言和分子模式在表示格式上的差异建模,以及在指令调整范式下捕捉细微差异。为了解决这两个挑战,本文提出了一个两阶段的训练框架来构建分子知识增强的LLM,称为Mol-LLM。第一阶段利用多任务指令调整方法解决自然语言和分子序列之间的模态差异。第二阶段是直接偏好优化训练策略,采用三个随机偏好动作捕捉指令调整范式下的细微差异。大量的实验证明了本研究提出的模型mol2mol - llm的最先进性能,包括mol2mol, text2text, mol2text和text2mol任务。通过烧蚀研究进一步验证了该模块的有效性,并通过附加的支持性实验证实了该模块的通用性。
{"title":"Incorporating Molecular Knowledge in Large Language Models via Multimodal Modeling","authors":"Zekun Yang;Kun Lv;Jian Shu;Zheng Li;Ping Xiao","doi":"10.1109/TCSS.2024.3506158","DOIUrl":"https://doi.org/10.1109/TCSS.2024.3506158","url":null,"abstract":"In recent years, large language models (LLMs) represented by GPT-4 have achieved tremendous success in natural language-centered tasks. Nevertheless, LLMs face inherent challenges in tasks involving both natural language and molecular modalities. Although there has been some research progress on these tasks, two challenges remain unresolved: modeling the differences in representation format between natural language and molecular modalities, and capturing subtle differences under an instruction-tuned paradigm. To address these two challenges, this article proposes a two-stage training framework to build molecular knowledge-enhanced LLM, named Mol-LLM. The first stage utilizes the multitask instruction tuning method to tackle the modality differences between natural language and molecular sequence. The second stage is the direct preference optimization training strategy with three random preference actions to capture subtle differences under the instruction-tuned paradigm. Extensive experiments have demonstrated the state-of-the-art performances of the model Mol-LLM proposed in this study, including <italic>mol2mol</i>, <italic>text2text</i>, <italic>mol2text</i>, and <italic>text2mol</i> task. The effectiveness of the module has been further verified through ablation studies, and the generalizability has been confirmed by additional supportive experiments.","PeriodicalId":13044,"journal":{"name":"IEEE Transactions on Computational Social Systems","volume":"12 5","pages":"3660-3670"},"PeriodicalIF":4.5,"publicationDate":"2025-01-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145230027","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
SGG-MVAR: Cross-Modal Retrieval With Scene Graph Generation and Multiview Attribute Relationship Guidance 基于场景图生成和多视图属性关系指导的跨模态检索
IF 4.5 2区 计算机科学 Q1 COMPUTER SCIENCE, CYBERNETICS Pub Date : 2025-01-13 DOI: 10.1109/TCSS.2024.3524297
Suping Wang;Fei Zhou;Ming Yang;Lei Shi;Chaohong Tan
Cross-modal retrieval is crucial for achieving accurate and efficient information retrieval by establishing semantic correlations between heterogeneous images and text. However, traditional image-text training sets suffer from information asymmetry, which includes short lengths and limited sentence structures. This phenomenon often results in insufficient representations of essential visual information. We introduce RichDataset, which offers extensive semantic information. It includes diverse real-life image-text pairs and AI-generated content across domains such as news, entertainment, education, and posters. Compared with classic benchmarks such as Flickr30k and MS-COCO, RichDataset exhibits a novel and balanced distribution. Existing cross-modal retrieval models face challenges in extracting distinct features from the emerging data, leading to low retrieval accuracy. We propose SGG-MVAR, a comprehensive retrieval model guided by multiview scene information and semantic relationships. Leveraging a scene knowledge database, our model parses scene graphs and identifies differences in attributes and relationships. We conduct extensive experiments to evaluate our proposed dataset and model. All experimental results consistently demonstrate a significant improvement in recall for cross-modal retrieval.
跨模态检索通过在异构图像和文本之间建立语义关联,是实现准确、高效信息检索的关键。然而,传统的图像-文本训练集存在信息不对称的问题,包括长度短和句子结构有限。这种现象通常会导致基本视觉信息的表达不足。我们介绍了RichDataset,它提供了广泛的语义信息。它包括各种现实生活中的图像-文本对和人工智能生成的内容,涵盖新闻、娱乐、教育和海报等领域。与经典的基准测试如Flickr30k和MS-COCO相比,RichDataset呈现出一种新颖而均衡的分布。现有的跨模态检索模型难以从新出现的数据中提取出鲜明的特征,导致检索精度较低。提出了基于多视图场景信息和语义关系的综合检索模型SGG-MVAR。利用场景知识库,我们的模型解析场景图并识别属性和关系的差异。我们进行了大量的实验来评估我们提出的数据集和模型。所有的实验结果都一致证明了跨模态检索在召回方面的显著改善。
{"title":"SGG-MVAR: Cross-Modal Retrieval With Scene Graph Generation and Multiview Attribute Relationship Guidance","authors":"Suping Wang;Fei Zhou;Ming Yang;Lei Shi;Chaohong Tan","doi":"10.1109/TCSS.2024.3524297","DOIUrl":"https://doi.org/10.1109/TCSS.2024.3524297","url":null,"abstract":"Cross-modal retrieval is crucial for achieving accurate and efficient information retrieval by establishing semantic correlations between heterogeneous images and text. However, traditional image-text training sets suffer from information asymmetry, which includes short lengths and limited sentence structures. This phenomenon often results in insufficient representations of essential visual information. We introduce RichDataset, which offers extensive semantic information. It includes diverse real-life image-text pairs and AI-generated content across domains such as news, entertainment, education, and posters. Compared with classic benchmarks such as Flickr30k and MS-COCO, RichDataset exhibits a novel and balanced distribution. Existing cross-modal retrieval models face challenges in extracting distinct features from the emerging data, leading to low retrieval accuracy. We propose SGG-MVAR, a comprehensive retrieval model guided by multiview scene information and semantic relationships. Leveraging a scene knowledge database, our model parses scene graphs and identifies differences in attributes and relationships. We conduct extensive experiments to evaluate our proposed dataset and model. All experimental results consistently demonstrate a significant improvement in recall for cross-modal retrieval.","PeriodicalId":13044,"journal":{"name":"IEEE Transactions on Computational Social Systems","volume":"12 5","pages":"3671-3683"},"PeriodicalIF":4.5,"publicationDate":"2025-01-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145230079","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Novel Graph-Based Approach to Identify Opinion Leaders in Twitter 一种基于图表的Twitter意见领袖识别方法
IF 4.5 2区 计算机科学 Q1 COMPUTER SCIENCE, CYBERNETICS Pub Date : 2025-01-13 DOI: 10.1109/TCSS.2024.3455415
Marco Furini;Luca Mariotti;Riccardo Martoglia;Manuela Montangero
This study explores the influence of social media on health-related discourse amid the COVID-19 pandemic, focusing on Italian-language tweets posted on Twitter from March 2020 to December 2021. Analyzing a dataset comprising 13 million tweets, the research addresses three key questions: who emerged as opinion leaders on Twitter during the pandemic in Italy?; did health institutions in Italy successfully establish themselves as opinion leaders?; and how did the content of COVID-19-related tweets in Italy evolve over time? Employing a custom-designed graph and the personalized PageRank algorithm, the study identifies opinion leaders on Twitter. Additionally, psycholinguistic analysis provides insights into the content, themes, and emotional undertones of the tweets. The findings of this research contribute to a deeper understanding of social media's influence on public opinion and behavior during the pandemic. Furthermore, they offer valuable insights for public health officials and policymakers seeking to address health-related issues on social media platforms.
本研究探讨了2019冠状病毒病大流行期间社交媒体对健康相关话语的影响,重点关注2020年3月至2021年12月在推特上发布的意大利语推文。该研究分析了一个包含1300万条推文的数据集,解决了三个关键问题:在意大利疫情期间,谁成为推特上的意见领袖?意大利的卫生机构是否成功地确立了自己的意见领袖地位?意大利与covid -19相关的推文内容是如何随着时间的推移而演变的?该研究采用定制设计的图表和个性化的PageRank算法,确定了Twitter上的意见领袖。此外,心理语言学分析提供了对推文内容、主题和情感暗示的见解。这项研究的结果有助于更深入地了解社交媒体在大流行期间对公众舆论和行为的影响。此外,它们为寻求在社交媒体平台上解决与健康相关问题的公共卫生官员和政策制定者提供了宝贵的见解。
{"title":"A Novel Graph-Based Approach to Identify Opinion Leaders in Twitter","authors":"Marco Furini;Luca Mariotti;Riccardo Martoglia;Manuela Montangero","doi":"10.1109/TCSS.2024.3455415","DOIUrl":"https://doi.org/10.1109/TCSS.2024.3455415","url":null,"abstract":"This study explores the influence of social media on health-related discourse amid the COVID-19 pandemic, focusing on Italian-language tweets posted on Twitter from March 2020 to December 2021. Analyzing a dataset comprising 13 million tweets, the research addresses three key questions: who emerged as opinion leaders on Twitter during the pandemic in Italy?; did health institutions in Italy successfully establish themselves as opinion leaders?; and how did the content of COVID-19-related tweets in Italy evolve over time? Employing a custom-designed graph and the personalized PageRank algorithm, the study identifies opinion leaders on Twitter. Additionally, psycholinguistic analysis provides insights into the content, themes, and emotional undertones of the tweets. The findings of this research contribute to a deeper understanding of social media's influence on public opinion and behavior during the pandemic. Furthermore, they offer valuable insights for public health officials and policymakers seeking to address health-related issues on social media platforms.","PeriodicalId":13044,"journal":{"name":"IEEE Transactions on Computational Social Systems","volume":"12 3","pages":"1268-1278"},"PeriodicalIF":4.5,"publicationDate":"2025-01-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144186007","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
IEEE Transactions on Computational Social Systems
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1