首页 > 最新文献

arXiv - CS - Social and Information Networks最新文献

英文 中文
Community Shaping in the Digital Age: A Temporal Fusion Framework for Analyzing Discourse Fragmentation in Online Social Networks 数字时代的社区塑造:分析在线社交网络话语分裂的时空融合框架
Pub Date : 2024-09-18 DOI: arxiv-2409.11665
Amirhossein Dezhboro, Jose Emmanuel Ramirez-Marquez, Aleksandra Krstikj
This research presents a framework for analyzing the dynamics of onlinecommunities in social media platforms, utilizing a temporal fusion of text andnetwork data. By combining text classification and dynamic social networkanalysis, we uncover mechanisms driving community formation and evolution,revealing the influence of real-world events. We introduced fourteen keyelements based on social science theories to evaluate social media dynamics,validating our framework through a case study of Twitter data during major U.S.events in 2020. Our analysis centers on discrimination discourse, identifyingsexism, racism, xenophobia, ableism, homophobia, and religious intolerance asmain fragments. Results demonstrate rapid community emergence and dissolutioncycles representative of discourse fragments. We reveal how real-worldcircumstances impact discourse dominance and how social media contributes toecho chamber formation and societal polarization. Our comprehensive approachprovides insights into discourse fragmentation, opinion dynamics, andstructural aspects of online communities, offering a methodology forunderstanding the complex interplay between online interactions and societaltrends.
本研究提出了一个分析社交媒体平台中在线社区动态的框架,利用了文本和网络数据的时间融合。通过结合文本分类和动态社交网络分析,我们揭示了驱动社区形成和演变的机制,揭示了现实世界事件的影响。我们引入了 14 个基于社会科学理论的关键要素来评估社交媒体动态,并通过对 2020 年美国重大事件期间 Twitter 数据的案例研究来验证我们的框架。我们的分析以歧视话语为中心,将性别主义、种族主义、仇外心理、能力主义、仇视同性恋和宗教不容忍作为主要片段。分析结果表明,代表话语片段的社区出现和解散周期很快。我们揭示了现实世界的环境如何影响话语的主导地位,以及社交媒体如何促进echo chamber 的形成和社会极化。我们的综合方法提供了对网络社区的话语碎片、舆论动态和结构方面的见解,为理解网络互动与社会趋势之间复杂的相互作用提供了方法论。
{"title":"Community Shaping in the Digital Age: A Temporal Fusion Framework for Analyzing Discourse Fragmentation in Online Social Networks","authors":"Amirhossein Dezhboro, Jose Emmanuel Ramirez-Marquez, Aleksandra Krstikj","doi":"arxiv-2409.11665","DOIUrl":"https://doi.org/arxiv-2409.11665","url":null,"abstract":"This research presents a framework for analyzing the dynamics of online\u0000communities in social media platforms, utilizing a temporal fusion of text and\u0000network data. By combining text classification and dynamic social network\u0000analysis, we uncover mechanisms driving community formation and evolution,\u0000revealing the influence of real-world events. We introduced fourteen key\u0000elements based on social science theories to evaluate social media dynamics,\u0000validating our framework through a case study of Twitter data during major U.S.\u0000events in 2020. Our analysis centers on discrimination discourse, identifying\u0000sexism, racism, xenophobia, ableism, homophobia, and religious intolerance as\u0000main fragments. Results demonstrate rapid community emergence and dissolution\u0000cycles representative of discourse fragments. We reveal how real-world\u0000circumstances impact discourse dominance and how social media contributes to\u0000echo chamber formation and societal polarization. Our comprehensive approach\u0000provides insights into discourse fragmentation, opinion dynamics, and\u0000structural aspects of online communities, offering a methodology for\u0000understanding the complex interplay between online interactions and societal\u0000trends.","PeriodicalId":501032,"journal":{"name":"arXiv - CS - Social and Information Networks","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142263396","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
"It Might be Technically Impressive, But It's Practically Useless to Us": Practices, Challenges, and Opportunities for Cross-Functional Collaboration around AI within the News Industry "技术上可能令人印象深刻,但实际上对我们毫无用处":新闻行业围绕人工智能开展跨职能合作的实践、挑战和机遇
Pub Date : 2024-09-18 DOI: arxiv-2409.12000
Qing Xiao, Xianzhe Fan, Felix M. Simon, Bingbing Zhang, Motahhare Eslami
Recently, an increasing number of news organizations have integratedartificial intelligence (AI) into their workflows, leading to a further influxof AI technologists and data workers into the news industry. This has initiatedcross-functional collaborations between these professionals and journalists.While prior research has explored the impact of AI-related roles entering thenews industry, there is a lack of studies on how cross-functional collaborationunfolds between AI professionals and journalists. Through interviews with 17journalists, 6 AI technologists, and 3 AI workers with cross-functionalexperience from leading news organizations, we investigate the currentpractices, challenges, and opportunities for cross-functional collaborationaround AI in today's news industry. We first study how journalists and AIprofessionals perceive existing cross-collaboration strategies. We furtherexplore the challenges of cross-functional collaboration and providerecommendations for enhancing future cross-functional collaboration around AIin the news industry.
最近,越来越多的新闻机构将人工智能(AI)整合到其工作流程中,导致人工智能技术人员和数据工作者进一步涌入新闻行业。虽然之前的研究已经探讨了人工智能相关角色进入新闻行业的影响,但缺乏对人工智能专业人员与新闻记者之间如何开展跨职能合作的研究。通过采访 17 名记者、6 名人工智能技术专家和 3 名具有跨职能经验的领先新闻机构的人工智能工作者,我们调查了当今新闻行业围绕人工智能开展跨职能合作的现行做法、挑战和机遇。我们首先研究了记者和人工智能专业人员如何看待现有的跨部门合作战略。我们进一步探讨了跨职能合作所面临的挑战,并为加强新闻行业未来围绕人工智能的跨职能合作提出了建议。
{"title":"\"It Might be Technically Impressive, But It's Practically Useless to Us\": Practices, Challenges, and Opportunities for Cross-Functional Collaboration around AI within the News Industry","authors":"Qing Xiao, Xianzhe Fan, Felix M. Simon, Bingbing Zhang, Motahhare Eslami","doi":"arxiv-2409.12000","DOIUrl":"https://doi.org/arxiv-2409.12000","url":null,"abstract":"Recently, an increasing number of news organizations have integrated\u0000artificial intelligence (AI) into their workflows, leading to a further influx\u0000of AI technologists and data workers into the news industry. This has initiated\u0000cross-functional collaborations between these professionals and journalists.\u0000While prior research has explored the impact of AI-related roles entering the\u0000news industry, there is a lack of studies on how cross-functional collaboration\u0000unfolds between AI professionals and journalists. Through interviews with 17\u0000journalists, 6 AI technologists, and 3 AI workers with cross-functional\u0000experience from leading news organizations, we investigate the current\u0000practices, challenges, and opportunities for cross-functional collaboration\u0000around AI in today's news industry. We first study how journalists and AI\u0000professionals perceive existing cross-collaboration strategies. We further\u0000explore the challenges of cross-functional collaboration and provide\u0000recommendations for enhancing future cross-functional collaboration around AI\u0000in the news industry.","PeriodicalId":501032,"journal":{"name":"arXiv - CS - Social and Information Networks","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142263400","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A novel DFS/BFS approach towards link prediction 用于链路预测的新型 DFS/BFS 方法
Pub Date : 2024-09-18 DOI: arxiv-2409.11687
Jens Dörpinghaus, Tobias Hübenthal, Denis Stepanov
Knowledge graphs have been shown to play a significant role in currentknowledge mining fields, including life sciences, bioinformatics, computationalsocial sciences, and social network analysis. The problem of link predictionbears many applications and has been extensively studied. However, most methodsare restricted to dimension reduction, probabilistic model, or similarity-basedapproaches and are inherently biased. In this paper, we provide a definition ofgraph prediction for link prediction and outline related work to support ournovel approach, which integrates centrality measures with classical machinelearning methods. We examine our experimental results in detail and identifyareas for potential further research. Our method shows promise, particularlywhen utilizing randomly selected nodes and degree centrality.
知识图谱在当前的知识挖掘领域(包括生命科学、生物信息学、计算社会科学和社会网络分析)发挥着重要作用。链接预测问题有很多应用,并已得到广泛研究。然而,大多数方法都局限于降维、概率模型或基于相似性的方法,本身存在偏差。在本文中,我们为链接预测提供了图预测的定义,并概述了相关工作以支持我们的新方法,该方法将中心性度量与经典机器学习方法相结合。我们详细研究了我们的实验结果,并确定了潜在的进一步研究领域。我们的方法很有前途,尤其是在利用随机选择的节点和度中心性时。
{"title":"A novel DFS/BFS approach towards link prediction","authors":"Jens Dörpinghaus, Tobias Hübenthal, Denis Stepanov","doi":"arxiv-2409.11687","DOIUrl":"https://doi.org/arxiv-2409.11687","url":null,"abstract":"Knowledge graphs have been shown to play a significant role in current\u0000knowledge mining fields, including life sciences, bioinformatics, computational\u0000social sciences, and social network analysis. The problem of link prediction\u0000bears many applications and has been extensively studied. However, most methods\u0000are restricted to dimension reduction, probabilistic model, or similarity-based\u0000approaches and are inherently biased. In this paper, we provide a definition of\u0000graph prediction for link prediction and outline related work to support our\u0000novel approach, which integrates centrality measures with classical machine\u0000learning methods. We examine our experimental results in detail and identify\u0000areas for potential further research. Our method shows promise, particularly\u0000when utilizing randomly selected nodes and degree centrality.","PeriodicalId":501032,"journal":{"name":"arXiv - CS - Social and Information Networks","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142263395","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Skill matching at scale: freelancer-project alignment for efficient multilingual candidate retrieval 大规模技能匹配:自由职业者-项目对齐,实现高效多语言候选人检索
Pub Date : 2024-09-18 DOI: arxiv-2409.12097
Warren Jouanneau, Marc Palyart, Emma Jouffroy
Finding the perfect match between a job proposal and a set of freelancers isnot an easy task to perform at scale, especially in multiple languages. In thispaper, we propose a novel neural retriever architecture that tackles thisproblem in a multilingual setting. Our method encodes project descriptions andfreelancer profiles by leveraging pre-trained multilingual language models. Thelatter are used as backbone for a custom transformer architecture that aims tokeep the structure of the profiles and project. This model is trained with acontrastive loss on historical data. Thanks to several experiments, we showthat this approach effectively captures skill matching similarity andfacilitates efficient matching, outperforming traditional methods.
在一份工作提案和一组自由职业者之间找到完美的匹配并不是一件容易的事情,尤其是在多语言环境下。在本文中,我们提出了一种新颖的神经检索器架构,可以在多语言环境中解决这一问题。我们的方法通过利用预先训练好的多语言语言模型,对项目描述和自由职业者简介进行编码。这些模型被用作定制转换器架构的骨干,旨在保持配置文件和项目的结构。该模型在历史数据上进行了对比损失训练。通过多次实验,我们证明这种方法能有效捕捉技能匹配的相似性,并促进高效匹配,其性能优于传统方法。
{"title":"Skill matching at scale: freelancer-project alignment for efficient multilingual candidate retrieval","authors":"Warren Jouanneau, Marc Palyart, Emma Jouffroy","doi":"arxiv-2409.12097","DOIUrl":"https://doi.org/arxiv-2409.12097","url":null,"abstract":"Finding the perfect match between a job proposal and a set of freelancers is\u0000not an easy task to perform at scale, especially in multiple languages. In this\u0000paper, we propose a novel neural retriever architecture that tackles this\u0000problem in a multilingual setting. Our method encodes project descriptions and\u0000freelancer profiles by leveraging pre-trained multilingual language models. The\u0000latter are used as backbone for a custom transformer architecture that aims to\u0000keep the structure of the profiles and project. This model is trained with a\u0000contrastive loss on historical data. Thanks to several experiments, we show\u0000that this approach effectively captures skill matching similarity and\u0000facilitates efficient matching, outperforming traditional methods.","PeriodicalId":501032,"journal":{"name":"arXiv - CS - Social and Information Networks","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142263397","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
My Views Do Not Reflect Those of My Employer: Differences in Behavior of Organizations' Official and Personal Social Media Accounts 我的观点并不反映我雇主的观点:组织官方和个人社交媒体账户的行为差异
Pub Date : 2024-09-18 DOI: arxiv-2409.11759
Esa Palosaari, Ted Hsuan Yun Chen, Arttu Malkamäki, Mikko Kivelä
On social media, the boundaries between people's private and public livesoften blur. The need to navigate both roles, which are governed by distinctnorms, impacts how individuals conduct themselves online, and presentsmethodological challenges for researchers. We conduct a systematic explorationon how an organization's official Twitter accounts and its members' personalaccounts differ. Using a climate change Twitter data set as our case, we findsubstantial differences in activity and connectivity across the organizationallevels we examined. The levels differed considerably in their overall retweetnetwork structures, and accounts within each level were more likely to havesimilar connections than accounts at different levels. We illustrate theimplications of these differences for applied research by showing that thelevels closer to the core of the organization display more sectoral homophilybut less triadic closure, and how each level consists of very different groupstructures. Our results show that the common practice of solely analyzingaccounts from a single organizational level, grouping together all levels, orexcluding certain levels can lead to a skewed understanding of howorganizations are represented on social media.
在社交媒体上,人们的私人生活和公共生活之间的界限常常变得模糊。人们需要同时扮演两种角色,而这两种角色又受不同规范的约束,这影响了个人在网上的行为方式,也给研究人员带来了方法论上的挑战。我们对一个组织的官方 Twitter 账户和其成员的个人账户有何不同进行了系统性探索。以气候变化 Twitter 数据集为例,我们发现不同组织级别的活动和连接性存在巨大差异。各层级的转发网络结构差异很大,而且各层级的账户比不同层级的账户更有可能拥有相似的连接。我们说明了这些差异对应用研究的影响,显示出更接近组织核心的层级显示出更多的部门同质性,但较少的三元组封闭性,以及每个层级如何由非常不同的群体结构组成。我们的研究结果表明,仅分析来自单一组织层级的账户、将所有层级分组或排除某些层级的常见做法会导致对组织如何在社交媒体上得到体现的理解出现偏差。
{"title":"My Views Do Not Reflect Those of My Employer: Differences in Behavior of Organizations' Official and Personal Social Media Accounts","authors":"Esa Palosaari, Ted Hsuan Yun Chen, Arttu Malkamäki, Mikko Kivelä","doi":"arxiv-2409.11759","DOIUrl":"https://doi.org/arxiv-2409.11759","url":null,"abstract":"On social media, the boundaries between people's private and public lives\u0000often blur. The need to navigate both roles, which are governed by distinct\u0000norms, impacts how individuals conduct themselves online, and presents\u0000methodological challenges for researchers. We conduct a systematic exploration\u0000on how an organization's official Twitter accounts and its members' personal\u0000accounts differ. Using a climate change Twitter data set as our case, we find\u0000substantial differences in activity and connectivity across the organizational\u0000levels we examined. The levels differed considerably in their overall retweet\u0000network structures, and accounts within each level were more likely to have\u0000similar connections than accounts at different levels. We illustrate the\u0000implications of these differences for applied research by showing that the\u0000levels closer to the core of the organization display more sectoral homophily\u0000but less triadic closure, and how each level consists of very different group\u0000structures. Our results show that the common practice of solely analyzing\u0000accounts from a single organizational level, grouping together all levels, or\u0000excluding certain levels can lead to a skewed understanding of how\u0000organizations are represented on social media.","PeriodicalId":501032,"journal":{"name":"arXiv - CS - Social and Information Networks","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142263392","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Inside Alameda Research: A Multi-Token Network Analysis 阿拉米达研究中心内部:多代币网络分析
Pub Date : 2024-09-17 DOI: arxiv-2409.10949
Célestin Coquidé, Rémy Cazabet, Natkamon Tovanich
We analyze the token transfer network on Ethereum, focusing on accountsassociated with Alameda Research, a cryptocurrency trading firm implicated inthe misuse of FTX customer funds. Using a multi-token network representation,we examine node centralities and the network backbone to identify criticalaccounts, tokens, and activity groups. The temporal evolution of Alamedaaccounts reveals shifts in token accumulation and distribution patterns leadingup to its bankruptcy in November 2022. Through network analysis, our workoffers insights into the activities and dynamics that shape the DeFi ecosystem.
我们分析了以太坊上的代币转移网络,重点关注与 Alameda Research 相关的账户,Alameda Research 是一家加密货币交易公司,卷入了 FTX 客户资金滥用事件。利用多代币网络表示法,我们研究了节点中心性和网络主干,以识别关键账户、代币和活动组。Alameda 账户的时间演化揭示了代币积累和分配模式的变化,这些变化导致其于 2022 年 11 月破产。通过网络分析,我们的研究深入揭示了形成 DeFi 生态系统的活动和动态。
{"title":"Inside Alameda Research: A Multi-Token Network Analysis","authors":"Célestin Coquidé, Rémy Cazabet, Natkamon Tovanich","doi":"arxiv-2409.10949","DOIUrl":"https://doi.org/arxiv-2409.10949","url":null,"abstract":"We analyze the token transfer network on Ethereum, focusing on accounts\u0000associated with Alameda Research, a cryptocurrency trading firm implicated in\u0000the misuse of FTX customer funds. Using a multi-token network representation,\u0000we examine node centralities and the network backbone to identify critical\u0000accounts, tokens, and activity groups. The temporal evolution of Alameda\u0000accounts reveals shifts in token accumulation and distribution patterns leading\u0000up to its bankruptcy in November 2022. Through network analysis, our work\u0000offers insights into the activities and dynamics that shape the DeFi ecosystem.","PeriodicalId":501032,"journal":{"name":"arXiv - CS - Social and Information Networks","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142263399","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Property Encoder for Graph Neural Networks 图神经网络的属性编码器
Pub Date : 2024-09-17 DOI: arxiv-2409.11554
Anwar Said, Xenofon Koutsoukos
Graph machine learning, particularly using graph neural networks,fundamentally relies on node features. Nevertheless, numerous real-worldsystems, such as social and biological networks, often lack node features dueto various reasons, including privacy concerns, incomplete or missing data, andlimitations in data collection. In such scenarios, researchers typically resortto methods like structural and positional encoding to construct node features.However, the length of such features is contingent on the maximum value withinthe property being encoded, for example, the highest node degree, which can beexceedingly large in applications like scale-free networks. Furthermore, theseencoding schemes are limited to categorical data and might not be able toencode metrics returning other type of values. In this paper, we introduce anovel, universally applicable encoder, termed PropEnc, which constructsexpressive node embedding from any given graph metric. PropEnc leverageshistogram construction combined with reverse index encoding, offering aflexible method for node features initialization. It supports flexible encodingin terms of both dimensionality and type of input, demonstrating itseffectiveness across diverse applications. PropEnc allows encoding metrics inlow-dimensional space which effectively avoids the issue of sparsity andenhances the efficiency of the models. We show that emph{PropEnc} canconstruct node features that either exactly replicate one-hot encoding orclosely approximate indices under various settings. Our extensive evaluationsin graph classification setting across multiple social networks that lack nodefeatures support our hypothesis. The empirical results conclusively demonstratethat PropEnc is both an efficient and effective mechanism for constructing nodefeatures from diverse set of graph metrics.
图机器学习,尤其是使用图神经网络的机器学习,从根本上依赖于节点特征。然而,现实世界中的许多系统,如社会和生物网络,往往由于各种原因而缺乏节点特征,包括隐私问题、数据不完整或缺失以及数据收集的限制。在这种情况下,研究人员通常采用结构编码和位置编码等方法来构建节点特征。然而,这些特征的长度取决于被编码属性的最大值,例如最高节点度,而在无标度网络等应用中,最高节点度可能会非常大。此外,这些编码方案仅限于分类数据,可能无法对返回其他类型值的度量进行编码。在本文中,我们介绍了一种新的、普遍适用的编码器,称为 PropEnc,它可以从任何给定的图度量中构建具有表达力的节点嵌入。PropEnc 利用直方图构造与反向索引编码相结合,为节点特征初始化提供了一种灵活的方法。它支持对输入的维度和类型进行灵活编码,在各种应用中都证明了它的有效性。PropEnc 允许在低维空间中对指标进行编码,从而有效避免了稀疏性问题,并提高了模型的效率。我们的研究表明,PropEnc 可以在各种设置下构建节点特征,这些特征可以完全复制单次热编码,也可以近似于指数。我们在缺乏节点特征的多个社交网络的图分类设置中进行的广泛评估支持了我们的假设。实证结果最终证明,PropEnc 是一种高效且有效的机制,可以从不同的图度量集合中构建节点特征。
{"title":"A Property Encoder for Graph Neural Networks","authors":"Anwar Said, Xenofon Koutsoukos","doi":"arxiv-2409.11554","DOIUrl":"https://doi.org/arxiv-2409.11554","url":null,"abstract":"Graph machine learning, particularly using graph neural networks,\u0000fundamentally relies on node features. Nevertheless, numerous real-world\u0000systems, such as social and biological networks, often lack node features due\u0000to various reasons, including privacy concerns, incomplete or missing data, and\u0000limitations in data collection. In such scenarios, researchers typically resort\u0000to methods like structural and positional encoding to construct node features.\u0000However, the length of such features is contingent on the maximum value within\u0000the property being encoded, for example, the highest node degree, which can be\u0000exceedingly large in applications like scale-free networks. Furthermore, these\u0000encoding schemes are limited to categorical data and might not be able to\u0000encode metrics returning other type of values. In this paper, we introduce a\u0000novel, universally applicable encoder, termed PropEnc, which constructs\u0000expressive node embedding from any given graph metric. PropEnc leverages\u0000histogram construction combined with reverse index encoding, offering a\u0000flexible method for node features initialization. It supports flexible encoding\u0000in terms of both dimensionality and type of input, demonstrating its\u0000effectiveness across diverse applications. PropEnc allows encoding metrics in\u0000low-dimensional space which effectively avoids the issue of sparsity and\u0000enhances the efficiency of the models. We show that emph{PropEnc} can\u0000construct node features that either exactly replicate one-hot encoding or\u0000closely approximate indices under various settings. Our extensive evaluations\u0000in graph classification setting across multiple social networks that lack node\u0000features support our hypothesis. The empirical results conclusively demonstrate\u0000that PropEnc is both an efficient and effective mechanism for constructing node\u0000features from diverse set of graph metrics.","PeriodicalId":501032,"journal":{"name":"arXiv - CS - Social and Information Networks","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142263398","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Capturing Differences in Character Representations Between Communities: An Initial Study with Fandom 捕捉不同社群之间角色表现形式的差异:关于粉丝的初步研究
Pub Date : 2024-09-17 DOI: arxiv-2409.11170
Bianca N. Y. Kang
Sociolinguistic theories have highlighted how narratives are often retold,co-constructed and reconceptualized in collaborative settings. This workingpaper focuses on the re-interpretation of characters, an integral part of thenarrative story-world, and attempts to study how this may be computationallycompared between online communities. Using online fandom - a highly communalphenomenon that has been largely studied qualitatively - as data, computationalmethods were applied to explore shifts in character representations between twocommunities and the original text. Specifically, text from the Harry Potternovels, r/HarryPotter subreddit, and fanfiction on Archive of Our Own wereanalyzed for changes in character mentions, centrality measures fromco-occurrence networks, and semantic associations. While fandom elevatessecondary characters as found in past work, the two fan communities prioritizedifferent subsets of characters. Word embedding tests reveal starkly differentassociations of the same characters between communities on the genderedconcepts of femininity/masculinity, cruelty, and beauty. Furthermore,fanfiction descriptions of a male character analyzed between romance pairingsscored higher for feminine-coded characteristics in male-male romance, matchingpast qualitative theorizing. The results high-light the potential forcomputational methods to assist in capturing the re-conceptualization ofnarrative elements across communities and in supporting qualitative research onfandom.
社会语言学理论强调了叙事如何在协作环境中经常被重述、共同构建和重新概念化。本工作文件的重点是对人物的重新诠释,这也是叙事故事世界不可或缺的一部分,并试图研究如何在网络社区之间进行计算比较。以网络粉丝(一种高度社区化的现象,主要以定性研究为主)为数据,计算方法被应用于探索两个社区和原始文本之间角色表述的变化。具体来说,我们分析了《哈利-波特》小说、r/HarryPotter subreddit 和 Archive of Our Own 上的同人小说中人物提及的变化、共同发生网络的中心度量以及语义关联。正如过去的研究发现的那样,虽然粉丝会提升次要角色的地位,但这两个粉丝社区优先考虑的角色子集却不同。词语嵌入测试显示,在女性/男性、残忍和美丽等性别概念上,两个社群对相同角色的关联截然不同。此外,通过分析恋人配对之间对男性角色的粉丝小说描述,男性与男性恋人之间的女性编码特征得分更高,这与过去的定性理论相吻合。这些结果凸显了计算方法的潜力,有助于捕捉不同社区对叙事元素的重新概念化,并为有关粉丝的定性研究提供支持。
{"title":"Capturing Differences in Character Representations Between Communities: An Initial Study with Fandom","authors":"Bianca N. Y. Kang","doi":"arxiv-2409.11170","DOIUrl":"https://doi.org/arxiv-2409.11170","url":null,"abstract":"Sociolinguistic theories have highlighted how narratives are often retold,\u0000co-constructed and reconceptualized in collaborative settings. This working\u0000paper focuses on the re-interpretation of characters, an integral part of the\u0000narrative story-world, and attempts to study how this may be computationally\u0000compared between online communities. Using online fandom - a highly communal\u0000phenomenon that has been largely studied qualitatively - as data, computational\u0000methods were applied to explore shifts in character representations between two\u0000communities and the original text. Specifically, text from the Harry Potter\u0000novels, r/HarryPotter subreddit, and fanfiction on Archive of Our Own were\u0000analyzed for changes in character mentions, centrality measures from\u0000co-occurrence networks, and semantic associations. While fandom elevates\u0000secondary characters as found in past work, the two fan communities prioritize\u0000different subsets of characters. Word embedding tests reveal starkly different\u0000associations of the same characters between communities on the gendered\u0000concepts of femininity/masculinity, cruelty, and beauty. Furthermore,\u0000fanfiction descriptions of a male character analyzed between romance pairings\u0000scored higher for feminine-coded characteristics in male-male romance, matching\u0000past qualitative theorizing. The results high-light the potential for\u0000computational methods to assist in capturing the re-conceptualization of\u0000narrative elements across communities and in supporting qualitative research on\u0000fandom.","PeriodicalId":501032,"journal":{"name":"arXiv - CS - Social and Information Networks","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142263434","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Hyperedge Modeling in Hypergraph Neural Networks by using Densest Overlapping Subgraphs 利用最密集重叠子图在超图神经网络中建立超edge 模型
Pub Date : 2024-09-16 DOI: arxiv-2409.10340
Mehrad Soltani, Luis Rueda
Hypergraphs tackle the limitations of traditional graphs by introducing {emhyperedges}. While graph edges connect only two nodes, hyperedges connect anarbitrary number of nodes along their edges. Also, the underlyingmessage-passing mechanisms in Hypergraph Neural Networks (HGNNs) are in theform of vertex-hyperedge-vertex, which let HGNNs capture and utilize richer andmore complex structural information than traditional Graph Neural Networks(GNNs). More recently, the idea of overlapping subgraphs has emerged. Thesesubgraphs can capture more information about subgroups of vertices withoutlimiting one vertex belonging to just one group, allowing vertices to belong tomultiple groups or subgraphs. In addition, one of the most important problemsin graph clustering is to find densest overlapping subgraphs (DOS). In thispaper, we propose a solution to the DOS problem via Agglomerative GreedyEnumeration (DOSAGE) algorithm as a novel approach to enhance the process ofgenerating the densest overlapping subgraphs and, hence, a robust constructionof the hypergraphs. Experiments on standard benchmarks show that the DOSAGEalgorithm significantly outperforms the HGNNs and six other methods on the nodeclassification task.
超图通过引入{emhyperedges}解决了传统图的局限性。图的边只连接两个节点,而超图则沿边连接任意数量的节点。此外,超图神经网络(HGNN)的基本信息传递机制是顶点-超边-顶点的形式,这使得 HGNN 能够捕捉和利用比传统图神经网络(GNN)更丰富、更复杂的结构信息。最近,出现了重叠子图的概念。这些子图可以捕捉更多的顶点子群信息,而不会限制一个顶点只属于一个群组,从而允许顶点属于多个群组或子图。此外,图聚类中最重要的问题之一是找到最密集的重叠子图(DOS)。在本文中,我们提出了一种通过聚合贪婪枚举(Agglomerative GreedyEnumeration,简称 "DABA")算法解决 DOS 问题的方法,这是一种新颖的方法,可以增强最密集重叠子图的生成过程,从而稳健地构建超图。在标准基准上进行的实验表明,在节点分类任务上,该算法明显优于 HGNN 和其他六种方法。
{"title":"Hyperedge Modeling in Hypergraph Neural Networks by using Densest Overlapping Subgraphs","authors":"Mehrad Soltani, Luis Rueda","doi":"arxiv-2409.10340","DOIUrl":"https://doi.org/arxiv-2409.10340","url":null,"abstract":"Hypergraphs tackle the limitations of traditional graphs by introducing {em\u0000hyperedges}. While graph edges connect only two nodes, hyperedges connect an\u0000arbitrary number of nodes along their edges. Also, the underlying\u0000message-passing mechanisms in Hypergraph Neural Networks (HGNNs) are in the\u0000form of vertex-hyperedge-vertex, which let HGNNs capture and utilize richer and\u0000more complex structural information than traditional Graph Neural Networks\u0000(GNNs). More recently, the idea of overlapping subgraphs has emerged. These\u0000subgraphs can capture more information about subgroups of vertices without\u0000limiting one vertex belonging to just one group, allowing vertices to belong to\u0000multiple groups or subgraphs. In addition, one of the most important problems\u0000in graph clustering is to find densest overlapping subgraphs (DOS). In this\u0000paper, we propose a solution to the DOS problem via Agglomerative Greedy\u0000Enumeration (DOSAGE) algorithm as a novel approach to enhance the process of\u0000generating the densest overlapping subgraphs and, hence, a robust construction\u0000of the hypergraphs. Experiments on standard benchmarks show that the DOSAGE\u0000algorithm significantly outperforms the HGNNs and six other methods on the node\u0000classification task.","PeriodicalId":501032,"journal":{"name":"arXiv - CS - Social and Information Networks","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-09-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142263439","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
ES-KT-24: A Multimodal Knowledge Tracing Benchmark Dataset with Educational Game Playing Video and Synthetic Text Generation ES-KT-24:包含教育游戏视频和合成文本生成的多模态知识追踪基准数据集
Pub Date : 2024-09-16 DOI: arxiv-2409.10244
Dohee Kim, Unggi Lee, Sookbun Lee, Jiyeong Bae, Taekyung Ahn, Jaekwon Park, Gunho Lee, Hyeoncheol Kim
This paper introduces ES-KT-24, a novel multimodal Knowledge Tracing (KT)dataset for intelligent tutoring systems in educational game contexts. AlthoughKT is crucial in adaptive learning, existing datasets often lack game-based andmultimodal elements. ES-KT-24 addresses these limitations by incorporatingeducational game-playing videos, synthetically generated question text, anddetailed game logs. The dataset covers Mathematics, English, Indonesian, andMalaysian subjects, emphasizing diversity and including non-English content.The synthetic text component, generated using a large language model,encompasses 28 distinct knowledge concepts and 182 questions, featuring 15,032users and 7,782,928 interactions. Our benchmark experiments demonstrate thedataset's utility for KT research by comparing Deep learning-based KT modelswith Language Model-based Knowledge Tracing (LKT) approaches. Notably, LKTmodels showed slightly higher performance than traditional DKT models,highlighting the potential of language model-based approaches in this field.Furthermore, ES-KT-24 has the potential to significantly advance research inmultimodal KT models and learning analytics. By integrating game-playing videosand detailed game logs, this dataset offers a unique approach to dissectingstudent learning patterns through advanced data analysis and machine-learningtechniques. It has the potential to unearth new insights into the learningprocess and inspire further exploration in the field.
本文介绍了 ES-KT-24,这是一个新颖的多模态知识追踪(Knowledge Tracing,KT)数据集,用于教育游戏背景下的智能辅导系统。虽然知识追踪在自适应学习中至关重要,但现有数据集往往缺乏基于游戏的多模态元素。ES-KT-24 通过整合教育游戏视频、合成生成的问题文本和详细的游戏日志,解决了这些局限性。该数据集涵盖数学、英语、印尼语和马来西亚语科目,强调多样性并包含非英语内容。合成文本部分由大型语言模型生成,包含 28 个不同的知识概念和 182 个问题,有 15,032 名用户和 7,782,928 次互动。通过比较基于深度学习的知识跟踪模型和基于语言模型的知识跟踪(LKT)方法,我们的基准实验证明了该数据集在知识跟踪研究中的实用性。值得注意的是,LKT 模型的性能略高于传统的 DKT 模型,这凸显了基于语言模型的方法在该领域的潜力。通过整合游戏视频和详细的游戏日志,该数据集提供了一种通过先进的数据分析和机器学习技术剖析学生学习模式的独特方法。它有可能揭示学习过程的新见解,并激发该领域的进一步探索。
{"title":"ES-KT-24: A Multimodal Knowledge Tracing Benchmark Dataset with Educational Game Playing Video and Synthetic Text Generation","authors":"Dohee Kim, Unggi Lee, Sookbun Lee, Jiyeong Bae, Taekyung Ahn, Jaekwon Park, Gunho Lee, Hyeoncheol Kim","doi":"arxiv-2409.10244","DOIUrl":"https://doi.org/arxiv-2409.10244","url":null,"abstract":"This paper introduces ES-KT-24, a novel multimodal Knowledge Tracing (KT)\u0000dataset for intelligent tutoring systems in educational game contexts. Although\u0000KT is crucial in adaptive learning, existing datasets often lack game-based and\u0000multimodal elements. ES-KT-24 addresses these limitations by incorporating\u0000educational game-playing videos, synthetically generated question text, and\u0000detailed game logs. The dataset covers Mathematics, English, Indonesian, and\u0000Malaysian subjects, emphasizing diversity and including non-English content.\u0000The synthetic text component, generated using a large language model,\u0000encompasses 28 distinct knowledge concepts and 182 questions, featuring 15,032\u0000users and 7,782,928 interactions. Our benchmark experiments demonstrate the\u0000dataset's utility for KT research by comparing Deep learning-based KT models\u0000with Language Model-based Knowledge Tracing (LKT) approaches. Notably, LKT\u0000models showed slightly higher performance than traditional DKT models,\u0000highlighting the potential of language model-based approaches in this field.\u0000Furthermore, ES-KT-24 has the potential to significantly advance research in\u0000multimodal KT models and learning analytics. By integrating game-playing videos\u0000and detailed game logs, this dataset offers a unique approach to dissecting\u0000student learning patterns through advanced data analysis and machine-learning\u0000techniques. It has the potential to unearth new insights into the learning\u0000process and inspire further exploration in the field.","PeriodicalId":501032,"journal":{"name":"arXiv - CS - Social and Information Networks","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-09-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142263437","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
arXiv - CS - Social and Information Networks
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1