首页 > 最新文献

EPJ Data Science最新文献

英文 中文
Not your mean green: beyond averages in mapping socio-spatial inequities in urban greenery for smart cities. 不是你的平均绿色:超越平均水平,在智慧城市的城市绿化中绘制社会空间不平等。
IF 2.5 2区 计算机科学 Q1 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2026-01-01 Epub Date: 2026-02-09 DOI: 10.1140/epjds/s13688-026-00627-4
Jenny Martinez, Javier Argota Sánchez-Vaquerizo, Sachit Mahajan

As data-driven "smart city" agendas expand across Latin America, most urban performance metrics remain focused on infrastructure, connectivity, and aggregate efficiency, often neglecting who truly benefits. Urban greenery, a vital determinant of health and climate resilience, is one such blind spot. While some frameworks now consider "green space," they do so at a coarse, citywide scale, overlooking how access is distributed across neighborhoods and social groups. This obscures critical equity gaps, particularly in cities marked by deep socio-spatial segregation. In this study, we develop a fully reproducible geospatial pipeline that integrates high-resolution canopy height models, public park data, gridded population estimates, and socioeconomic strata to assess how greenery is distributed, not just how much exists. Applied to Bogotá and Medellín, the method reveals stark disparities: population-weighted canopy coverage rises significantly between the lowest and highest strata, while access to public parks also shows measurable inequality, especially in high-density, underserved neighborhoods. These inequities persist despite progressive greening policies, revealing the limits of optimization when legacy segregation is ignored. Our open-source pipeline enables finer-grained, justice-oriented audits that go beyond averages to identify where greenery and its benefits are most lacking. By enabling fine-grained equity assessments, this approach underscores the importance of greenery distribution, not just quantity, as a critical indicator for inclusive and equitable smart cities.

随着数据驱动的“智慧城市”议程在拉丁美洲的扩展,大多数城市绩效指标仍然集中在基础设施、连通性和总效率上,往往忽视了谁是真正的受益者。城市绿化是健康和气候适应能力的重要决定因素,也是这样一个盲点。虽然现在一些框架考虑了“绿色空间”,但它们是以粗糙的、全市范围的尺度来考虑的,忽视了如何在社区和社会群体之间分配通道。这掩盖了严重的公平差距,特别是在社会空间隔离严重的城市。在这项研究中,我们开发了一个完全可复制的地理空间管道,该管道集成了高分辨率树冠高度模型、公共公园数据、网格化人口估计和社会经济阶层,以评估绿化如何分布,而不仅仅是存在多少。应用于波哥大和Medellín,该方法揭示了明显的差异:人口加权树冠覆盖率在最低层和最高层之间显著上升,而进入公园的机会也显示出可衡量的不平等,特别是在高密度、服务不足的社区。这些不平等仍然存在,尽管进步的绿化政策,揭示了优化的局限性,当遗留的隔离被忽视。我们的开源管道使更细粒度、以公正为导向的审计能够超越平均水平,以确定最缺乏绿化及其效益的地方。通过实现细粒度的公平评估,该方法强调了绿化分布的重要性,而不仅仅是数量,作为包容性和公平的智慧城市的关键指标。
{"title":"Not your mean green: beyond averages in mapping socio-spatial inequities in urban greenery for smart cities.","authors":"Jenny Martinez, Javier Argota Sánchez-Vaquerizo, Sachit Mahajan","doi":"10.1140/epjds/s13688-026-00627-4","DOIUrl":"https://doi.org/10.1140/epjds/s13688-026-00627-4","url":null,"abstract":"<p><p>As data-driven \"smart city\" agendas expand across Latin America, most urban performance metrics remain focused on infrastructure, connectivity, and aggregate efficiency, often neglecting who truly benefits. Urban greenery, a vital determinant of health and climate resilience, is one such blind spot. While some frameworks now consider \"green space,\" they do so at a coarse, citywide scale, overlooking how access is distributed across neighborhoods and social groups. This obscures critical equity gaps, particularly in cities marked by deep socio-spatial segregation. In this study, we develop a fully reproducible geospatial pipeline that integrates high-resolution canopy height models, public park data, gridded population estimates, and socioeconomic strata to assess how greenery is distributed, not just how much exists. Applied to Bogotá and Medellín, the method reveals stark disparities: population-weighted canopy coverage rises significantly between the lowest and highest strata, while access to public parks also shows measurable inequality, especially in high-density, underserved neighborhoods. These inequities persist despite progressive greening policies, revealing the limits of optimization when legacy segregation is ignored. Our open-source pipeline enables finer-grained, justice-oriented audits that go beyond averages to identify where greenery and its benefits are most lacking. By enabling fine-grained equity assessments, this approach underscores the importance of greenery distribution, not just quantity, as a critical indicator for inclusive and equitable smart cities.</p>","PeriodicalId":11887,"journal":{"name":"EPJ Data Science","volume":"15 1","pages":"25"},"PeriodicalIF":2.5,"publicationDate":"2026-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12979267/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147467364","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An agent-based model to investigate the effects of urban segregation around the clock on inequalities in health behaviour. 一个基于主体的模型,用于调查城市昼夜隔离对健康行为不平等的影响。
IF 2.5 2区 计算机科学 Q1 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2026-01-01 Epub Date: 2025-12-11 DOI: 10.1140/epjds/s13688-025-00603-4
Clémentine Cottineau-Mugadza, Julien Perret, Romain Reuillon, Sébastien Rey-Coyrehourcq, Julie Vallée

Social segregation in cities refers to the uneven spatial distribution of individuals from unequal social groups, such as affluent and economically vulnerable people. Social segregation may, in turn, produce social inequalities through contextual effects, since neighbourhood mixing or concentration plays a role in shaping individuals' opinions and behaviours in multiple life domains, including health. Because segregation and contextual effects occur at the places of residence as well as throughout the day, as people move between locations in a city, we aim to understand the social effect of urban segregation 'around the clock' on health behaviours (such as the choice of a healthy diet), using an empirical agent-based model initialised on the Paris region with a synthetic population. We built this synthetic population by pulling together data from two health & nutrition surveys conducted 6 years apart, data from the French census and data from an origin-destination survey. We then combined scenarios of residential patterns (random allocation vs. census-based allocation reflecting the empirical level of residential segregation) with scenarios of daily mobility (no daily moves, random moves or survey-based daily moves reflecting the empirical level of daytime segregation in Paris) to assess the effect of spatio-temporal segregation on the diffusion of health behaviours. While the same upward trend of healthy behaviours is obtained in all scenarios simulated, we find contrasted results with respect to social inequalities: 1/ when the agents' residence is allocated at random, social inequalities of health decrease in the long run; 2/ randomizing daily mobility can mitigate the increase in social inequalities in dietary behaviours induced by effective residential segregation, with this mitigation effect appearing as soon as a small proportion of daily moves are random; 3/ daytime segregation as it exists in Paris slightly reinforces the unequal distribution of health behaviours between the most and least educated groups compared with the sole effect of residential segregation.

Supplementary information: The online version contains supplementary material available at 10.1140/epjds/s13688-025-00603-4.

城市的社会隔离是指来自不平等社会群体(如富裕人群和经济弱势人群)的个体在空间上分布不均。社会隔离反过来又可能通过环境影响产生社会不平等,因为邻里混合或集中在包括健康在内的多个生活领域影响个人的意见和行为。由于隔离和环境影响发生在居住地以及全天,当人们在城市中的不同地点之间移动时,我们的目标是了解城市隔离“全天候”对健康行为(例如选择健康饮食)的社会影响,使用基于经验主体的模型初始化巴黎地区的合成人口。我们将相隔6年的两次健康与营养调查数据、法国人口普查数据和出发地调查数据汇总在一起,建立了这个综合人口。然后,我们将居住模式情景(随机分配vs.基于人口普查的分配,反映了居住隔离的经验水平)与日常流动性情景(无日常移动、随机移动或基于调查的日常移动,反映了巴黎白天隔离的经验水平)相结合,以评估时空隔离对健康行为扩散的影响。虽然在所有模拟情景中都获得了相同的健康行为上升趋势,但我们发现关于社会不平等的对比结果:1/当代理人的住所随机分配时,从长远来看,健康的社会不平等减少;2/随机化日常流动性可以缓解因有效的居住隔离而导致的饮食行为社会不平等的加剧,只要有一小部分日常流动性是随机的,这种缓解效果就会出现;3 .与居住隔离的唯一影响相比,巴黎存在的白天隔离稍微加剧了受教育程度最高和最低的群体之间保健行为的不平等分布。补充信息:在线版本包含补充资料,获取地址:10.1140/epjds/s13688-025-00603-4。
{"title":"An agent-based model to investigate the effects of urban segregation around the clock on inequalities in health behaviour.","authors":"Clémentine Cottineau-Mugadza, Julien Perret, Romain Reuillon, Sébastien Rey-Coyrehourcq, Julie Vallée","doi":"10.1140/epjds/s13688-025-00603-4","DOIUrl":"10.1140/epjds/s13688-025-00603-4","url":null,"abstract":"<p><p>Social segregation in cities refers to the uneven spatial distribution of individuals from unequal social groups, such as affluent and economically vulnerable people. Social segregation may, in turn, produce social inequalities through contextual effects, since neighbourhood mixing or concentration plays a role in shaping individuals' opinions and behaviours in multiple life domains, including health. Because segregation and contextual effects occur at the places of residence as well as throughout the day, as people move between locations in a city, we aim to understand the social effect of urban segregation 'around the clock' on health behaviours (such as the choice of a healthy diet), using an empirical agent-based model initialised on the Paris region with a synthetic population. We built this synthetic population by pulling together data from two health & nutrition surveys conducted 6 years apart, data from the French census and data from an origin-destination survey. We then combined scenarios of residential patterns (random allocation vs. census-based allocation reflecting the empirical level of residential segregation) with scenarios of daily mobility (no daily moves, random moves or survey-based daily moves reflecting the empirical level of daytime segregation in Paris) to assess the effect of spatio-temporal segregation on the diffusion of health behaviours. While the same upward trend of healthy behaviours is obtained in all scenarios simulated, we find contrasted results with respect to social inequalities: 1/ when the agents' residence is allocated at random, social inequalities of health decrease in the long run; 2/ randomizing daily mobility can mitigate the increase in social inequalities in dietary behaviours induced by effective residential segregation, with this mitigation effect appearing as soon as a small proportion of daily moves are random; 3/ daytime segregation as it exists in Paris slightly reinforces the unequal distribution of health behaviours between the most and least educated groups compared with the sole effect of residential segregation.</p><p><strong>Supplementary information: </strong>The online version contains supplementary material available at 10.1140/epjds/s13688-025-00603-4.</p>","PeriodicalId":11887,"journal":{"name":"EPJ Data Science","volume":"15 1","pages":"5"},"PeriodicalIF":2.5,"publicationDate":"2026-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12804204/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145997753","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Generative AI voting: fair collective choice is resilient to LLM biases and inconsistencies. 生成式人工智能投票:公平的集体选择对法学硕士的偏见和不一致性具有弹性。
IF 2.5 2区 计算机科学 Q1 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2026-01-01 Epub Date: 2026-02-09 DOI: 10.1140/epjds/s13688-025-00612-3
Srijoni Majumdar, Edith Elkind, Evangelos Pournaras

Recent breakthroughs in generative artificial intelligence (AI) and large language models (LLMs) unravel new capabilities for AI personal assistants to overcome cognitive bandwidth limitations of humans, providing decision support or even direct representation of abstained human voters at large scale. However, the quality of this representation and what underlying biases manifest when delegating collective decision making to LLMs is an alarming and timely challenge to tackle. By rigorously emulating more than >50K LLM voting personas in 363 real-world voting elections, we disentangle how AI-generated choices differ from human choices and how this affects collective decision outcomes. Complex preferential ballot formats show significant inconsistencies compared to simpler majoritarian elections, which demonstrate higher consistency. Strikingly, proportional ballot aggregation methods such as equal shares prove to be a win-win: fairer voting outcomes for humans and fairer AI representation, especially for voters likely to abstain. This novel underlying relationship proves paramount for building democratic resilience in scenarios of low voters turnout by voter fatigue: abstained voters are mitigated via AI representatives that recover representative and fair voting outcomes. These interdisciplinary insights provide decision support to policymakers and citizens for developing safeguards and policies for risks of using AI in democratic innovations.

Supplementary information: The online version contains supplementary material available at 10.1140/epjds/s13688-025-00612-3.

最近在生成人工智能(AI)和大型语言模型(llm)方面的突破揭示了人工智能个人助理克服人类认知带宽限制的新能力,提供决策支持,甚至大规模地直接代表弃权的人类选民。然而,这种代表性的质量以及在将集体决策委托给法学硕士时所表现出的潜在偏见是一个值得警惕和及时解决的挑战。通过严格模拟363个现实世界投票选举中超过50万个法学硕士投票人物,我们弄清了人工智能生成的选择与人类选择的不同之处,以及这如何影响集体决策结果。与简单的多数选举相比,复杂的优惠选票形式表现出明显的不一致性,后者表现出更高的一致性。引人注目的是,平等份额等比例选票汇总方法被证明是一种双赢:对人类来说,更公平的投票结果,对人工智能来说,更公平的代表,尤其是对可能弃权的选民来说。这种新颖的潜在关系对于在选民疲劳导致投票率低的情况下建立民主弹性至关重要:弃权的选民通过人工智能代表得到缓解,人工智能代表恢复了代表性和公平的投票结果。这些跨学科的见解为决策者和公民提供决策支持,为在民主创新中使用人工智能的风险制定保障措施和政策。补充信息:在线版本包含补充资料,可在10.1140/epjds/s13688-025-00612-3获得。
{"title":"Generative AI voting: fair collective choice is resilient to LLM biases and inconsistencies.","authors":"Srijoni Majumdar, Edith Elkind, Evangelos Pournaras","doi":"10.1140/epjds/s13688-025-00612-3","DOIUrl":"10.1140/epjds/s13688-025-00612-3","url":null,"abstract":"<p><p>Recent breakthroughs in generative artificial intelligence (AI) and large language models (LLMs) unravel new capabilities for AI personal assistants to overcome cognitive bandwidth limitations of humans, providing decision support or even direct representation of abstained human voters at large scale. However, the quality of this representation and what underlying biases manifest when delegating collective decision making to LLMs is an alarming and timely challenge to tackle. By rigorously emulating more than >50K LLM voting personas in 363 real-world voting elections, we disentangle how AI-generated choices differ from human choices and how this affects collective decision outcomes. Complex preferential ballot formats show significant inconsistencies compared to simpler majoritarian elections, which demonstrate higher consistency. Strikingly, proportional ballot aggregation methods such as equal shares prove to be a win-win: fairer voting outcomes for humans and fairer AI representation, especially for voters likely to abstain. This novel underlying relationship proves paramount for building democratic resilience in scenarios of low voters turnout by voter fatigue: abstained voters are mitigated via AI representatives that recover representative and fair voting outcomes. These interdisciplinary insights provide decision support to policymakers and citizens for developing safeguards and policies for risks of using AI in democratic innovations.</p><p><strong>Supplementary information: </strong>The online version contains supplementary material available at 10.1140/epjds/s13688-025-00612-3.</p>","PeriodicalId":11887,"journal":{"name":"EPJ Data Science","volume":"15 1","pages":"24"},"PeriodicalIF":2.5,"publicationDate":"2026-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12963128/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147376551","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Whose voice matters? Word embeddings reveal identity bias in news quotes. 谁的声音重要?词语嵌入揭示了新闻引用中的身份偏见。
IF 3 2区 计算机科学 Q1 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2025-01-01 Epub Date: 2025-04-17 DOI: 10.1140/epjds/s13688-025-00541-1
Nnaemeka Ohamadike, Kevin Durrheim, Mpho Primus

This paper investigates identity bias (gender and race) in the South African news selection and representation of COVID-19 vaccination quotes. Social bias studies have qualitatively examined race and gender bias in South African news, given South Africa's apartheid history; yet, studies that examine and quantify these biases at the speaker level using news quotes from a representative South African news corpus remain limited. To address this gap, we examined race and gender bias in news selection and framing of quotes. We used word embedding trained on 22,627 vaccination quotes from 76 South African news sources between 2020 and 2023. These large-scale processing embeddings are unbiased by design but can learn and uncover biases hidden in language. Our findings reveal gender and race bias in the news selection and framing of quotes - journalists privilege White voices as more authoritative and connected to global and technical vaccination discourse but confine black voices to primarily localised contexts. They also quote male speakers more frequently in the news than females. In an era where human biases are becoming increasingly implicit, we argue that embeddings offer a robust tool to unearth, monitor, and evaluate these biases at the micro or speaker level in the news.

Supplementary information: The online version contains supplementary material available at 10.1140/epjds/s13688-025-00541-1.

本文调查了南非新闻选择和代表COVID-19疫苗接种引用中的身份偏见(性别和种族)。鉴于南非的种族隔离历史,社会偏见研究定性地考察了南非新闻中的种族和性别偏见;然而,使用具有代表性的南非新闻语料库的新闻引用来检查和量化说话者水平上的这些偏见的研究仍然有限。为了解决这一差距,我们研究了新闻选择和引用框架中的种族和性别偏见。我们对2020年至2023年间76个南非新闻来源的22,627个疫苗接种引用进行了单词嵌入训练。这些大规模的处理嵌入在设计上是无偏的,但可以学习和发现隐藏在语言中的偏见。我们的研究结果揭示了新闻选择和引用框架中的性别和种族偏见——记者将白人的声音视为更权威的,与全球和技术疫苗话语联系在一起,但将黑人的声音主要限制在当地语境中。他们在新闻中也比女性更频繁地引用男性说话者的话。在一个人类偏见变得越来越含蓄的时代,我们认为嵌入提供了一个强大的工具,可以在新闻的微观或说话者层面上发现、监测和评估这些偏见。补充信息:在线版本包含补充资料,可在10.1140/epjds/s13688-025-00541-1获得。
{"title":"Whose voice matters? Word embeddings reveal identity bias in news quotes.","authors":"Nnaemeka Ohamadike, Kevin Durrheim, Mpho Primus","doi":"10.1140/epjds/s13688-025-00541-1","DOIUrl":"https://doi.org/10.1140/epjds/s13688-025-00541-1","url":null,"abstract":"<p><p>This paper investigates identity bias (gender and race) in the South African news selection and representation of COVID-19 vaccination quotes. Social bias studies have qualitatively examined race and gender bias in South African news, given South Africa's apartheid history; yet, studies that examine and quantify these biases at the speaker level using news quotes from a representative South African news corpus remain limited. To address this gap, we examined race and gender bias in news selection and framing of quotes. We used word embedding trained on 22,627 vaccination quotes from 76 South African news sources between 2020 and 2023. These large-scale processing embeddings are unbiased by design but can learn and uncover biases hidden in language. Our findings reveal gender and race bias in the news selection and framing of quotes - journalists privilege White voices as more authoritative and connected to global and technical vaccination discourse but confine black voices to primarily localised contexts. They also quote male speakers more frequently in the news than females. In an era where human biases are becoming increasingly implicit, we argue that embeddings offer a robust tool to unearth, monitor, and evaluate these biases at the micro or speaker level in the news.</p><p><strong>Supplementary information: </strong>The online version contains supplementary material available at 10.1140/epjds/s13688-025-00541-1.</p>","PeriodicalId":11887,"journal":{"name":"EPJ Data Science","volume":"14 1","pages":"30"},"PeriodicalIF":3.0,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12006212/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143974850","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Safe spaces or toxic places? Content moderation and social dynamics of online eating disorder communities. 安全的地方还是有毒的地方?在线饮食失调社区的内容节制和社会动态。
IF 2.5 2区 计算机科学 Q1 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2025-01-01 Epub Date: 2025-07-25 DOI: 10.1140/epjds/s13688-025-00575-5
Kristina Lerman, Minh Duc Chu, Charles Bickham, Luca Luceri, Emilio Ferrara

Social media platforms have become critical spaces for discussing mental health concerns, including eating disorders. While these platforms can provide valuable support networks, they may also amplify harmful content that glorifies disordered cognition and self-destructive behaviors. While social media platforms have implemented various content moderation strategies, from stringent to laissez-faire approaches, we lack a comprehensive understanding of how these different moderation practices interact with user engagement in online communities around these sensitive mental health topics. This study addresses this knowledge gap through a comparative analysis of eating disorder discussions across Twitter/X (2.6M tweets), Reddit (178K submissions), and TikTok (14K videos) spanning from 2019-2023. Our findings reveal that while users across all platforms engage similarly in expressing concerns and seeking support, platforms with weaker moderation (like Twitter/X) enable the formation of toxic echo chambers that amplify pro-anorexia rhetoric. These results demonstrate how moderation strategies significantly influence the development and impact of online communities, particularly in contexts involving mental health and self-harm.

社交媒体平台已经成为讨论包括饮食失调在内的心理健康问题的关键空间。虽然这些平台可以提供有价值的支持网络,但它们也可能放大美化无序认知和自我毁灭行为的有害内容。虽然社交媒体平台已经实施了各种内容审核策略,从严格的到自由放任的方法,但我们对这些不同的审核实践如何与在线社区中围绕这些敏感的心理健康话题的用户参与相互作用缺乏全面的了解。这项研究通过对2019年至2023年期间Twitter/X(260万条推文)、Reddit(17.8万份提交)和TikTok (14K视频)上关于饮食失调的讨论进行比较分析,解决了这一知识差距。我们的研究结果显示,虽然所有平台的用户都在表达担忧和寻求支持方面表现相似,但适度程度较弱的平台(如Twitter/X)会形成有毒的回音室,放大支持厌食症的言论。这些结果表明,适度策略如何显著影响在线社区的发展和影响,特别是在涉及心理健康和自我伤害的情况下。
{"title":"Safe spaces or toxic places? Content moderation and social dynamics of online eating disorder communities.","authors":"Kristina Lerman, Minh Duc Chu, Charles Bickham, Luca Luceri, Emilio Ferrara","doi":"10.1140/epjds/s13688-025-00575-5","DOIUrl":"10.1140/epjds/s13688-025-00575-5","url":null,"abstract":"<p><p>Social media platforms have become critical spaces for discussing mental health concerns, including eating disorders. While these platforms can provide valuable support networks, they may also amplify harmful content that glorifies disordered cognition and self-destructive behaviors. While social media platforms have implemented various content moderation strategies, from stringent to laissez-faire approaches, we lack a comprehensive understanding of how these different moderation practices interact with user engagement in online communities around these sensitive mental health topics. This study addresses this knowledge gap through a comparative analysis of eating disorder discussions across Twitter/X (2.6M tweets), Reddit (178K submissions), and TikTok (14K videos) spanning from 2019-2023. Our findings reveal that while users across all platforms engage similarly in expressing concerns and seeking support, platforms with weaker moderation (like Twitter/X) enable the formation of toxic echo chambers that amplify pro-anorexia rhetoric. These results demonstrate how moderation strategies significantly influence the development and impact of online communities, particularly in contexts involving mental health and self-harm.</p>","PeriodicalId":11887,"journal":{"name":"EPJ Data Science","volume":"14 1","pages":"55"},"PeriodicalIF":2.5,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12296748/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144728944","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Milgram's experiment in the knowledge space: individual navigation strategies. 米尔格拉姆在知识空间的实验:个人导航策略。
IF 3 2区 计算机科学 Q1 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2025-01-01 Epub Date: 2025-06-05 DOI: 10.1140/epjds/s13688-025-00558-6
Manran Zhu, János Kertész

Data deluge characteristic for our times has led to information overload, posing a significant challenge to effectively finding our way through the digital landscape. Addressing this issue requires an in-depth understanding of how we navigate through the abundance of information. Previous research has discovered multiple patterns in how individuals navigate in the geographic, social, and information spaces, yet individual differences in strategies for navigation in the knowledge space has remained largely unexplored. To bridge the gap, we conducted an online experiment where participants played a navigation game on Wikipedia and completed questionnaires about their personal information. Utilizing the hierarchical structure of the English Wikipedia and a graph embedding trained on it, we identified two navigation strategies and found that there are significant individual differences in the choices of them. Older, white and female participants tend to adopt a proximity-driven strategy, while younger participants prefer a hub-driven strategy. Our study connects social navigation to knowledge navigation: individuals' differing tendencies to use geographical and occupational information about the target person to navigate in the social space can be understood as different choices between the hub-driven and proximity-driven strategies in the knowledge space.

我们这个时代的数据泛滥特征导致了信息超载,对我们在数字环境中有效地找到道路提出了重大挑战。解决这个问题需要深入了解我们如何在丰富的信息中导航。先前的研究已经发现了个体在地理、社会和信息空间中导航的多种模式,但在知识空间中导航策略的个体差异仍未得到很大程度的探索。为了缩小差距,我们进行了一个在线实验,参与者在维基百科上玩一个导航游戏,并完成关于他们个人信息的问卷调查。利用英文维基百科的层次结构和在其上训练的图嵌入,我们确定了两种导航策略,并发现它们的选择存在显著的个体差异。年龄较大的、白人和女性参与者倾向于采用“就近驱动”策略,而年轻的参与者则倾向于采用“中心驱动”策略。我们的研究将社会导航与知识导航联系起来:个体在社会空间中使用目标人的地理和职业信息进行导航的不同倾向可以理解为知识空间中中心驱动策略和邻近驱动策略的不同选择。
{"title":"Milgram's experiment in the knowledge space: individual navigation strategies.","authors":"Manran Zhu, János Kertész","doi":"10.1140/epjds/s13688-025-00558-6","DOIUrl":"10.1140/epjds/s13688-025-00558-6","url":null,"abstract":"<p><p>Data deluge characteristic for our times has led to information overload, posing a significant challenge to effectively finding our way through the digital landscape. Addressing this issue requires an in-depth understanding of how we navigate through the abundance of information. Previous research has discovered multiple patterns in how individuals navigate in the geographic, social, and information spaces, yet individual differences in strategies for navigation in the knowledge space has remained largely unexplored. To bridge the gap, we conducted an online experiment where participants played a navigation game on Wikipedia and completed questionnaires about their personal information. Utilizing the hierarchical structure of the English Wikipedia and a graph embedding trained on it, we identified two navigation strategies and found that there are significant individual differences in the choices of them. Older, white and female participants tend to adopt a proximity-driven strategy, while younger participants prefer a hub-driven strategy. Our study connects social navigation to knowledge navigation: individuals' differing tendencies to use geographical and occupational information about the target person to navigate in the social space can be understood as different choices between the hub-driven and proximity-driven strategies in the knowledge space.</p>","PeriodicalId":11887,"journal":{"name":"EPJ Data Science","volume":"14 1","pages":"42"},"PeriodicalIF":3.0,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12141110/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144247072","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Detection of anomalous spatio-temporal patterns of app traffic in response to catastrophic events. 检测响应灾难性事件的应用程序流量的异常时空模式。
IF 3 2区 计算机科学 Q1 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2025-01-01 Epub Date: 2025-05-06 DOI: 10.1140/epjds/s13688-025-00546-w
Sofia Medina, Shazia'Ayn Babul, Timothy LaRock, Rohit Sahasrabuddhe, Renaud Lambiotte, Nicola Pedreschi

In this work, we uncover patterns of usage mobile phone applications and information spread in response to perturbations caused by unprecedented events. We focus on categorizing patterns of response in both space and time, tracking their relaxation over time. To this end, we use the NetMob2023 Data Challenge dataset, which provides mobile phone applications traffic volume data for several cities in France at a spatial resolution of 100 m 2 and a time resolution of 15 minutes for a time period ranging from March to May 2019. We analyze the spread of information before, during, and after the catastrophic Notre-Dame fire on April 15th and a bombing that took place in the city centre of Lyon on May 24th using volume of data uploaded and downloaded to different mobile applications as a proxy of information transfer dynamics. We identify different clusters of information transfer dynamics in response to the Notre-Dame fire within the city of Paris as well as in other major French cities. We find a clear pattern of significantly above-baseline usage of the application Twitter (currently known as X) in Paris that radially spreads from the area surrounding the Notre-Dame cathedral to the rest of the city. We detect a similar pattern in the city of Lyon in response to the bombing. Further, we present a null model of radial information spread and develop methods of tracking radial patterns over time. Overall, we illustrate novel analytical methods we devise, showing how they enable a new perspective on mobile phone user response to unplanned catastrophic events and giving insight into how information spreads during a catastrophe in both time and space.

Supplementary information: The online version contains supplementary material available at 10.1140/epjds/s13688-025-00546-w.

在这项工作中,我们揭示了使用手机应用程序和信息传播的模式,以应对前所未有的事件引起的扰动。我们专注于在空间和时间上对反应模式进行分类,跟踪它们随时间的放松。为此,我们使用了NetMob2023数据挑战数据集,该数据集提供了2019年3月至5月期间法国几个城市的手机应用流量数据,空间分辨率为100 m2,时间分辨率为15分钟。我们分析了4月15日巴黎圣母院灾难性火灾和5月24日里昂市中心爆炸事件发生之前、期间和之后的信息传播,使用上传和下载到不同移动应用程序的数据量作为信息传递动态的代理。我们确定了巴黎圣母院火灾以及法国其他主要城市中不同的信息传递动态集群。我们发现,在巴黎,应用程序Twitter(目前称为X)的使用率明显高于基线,它从圣母院周围的区域呈放射状扩散到城市的其他地方。我们在里昂发现了对爆炸案的类似反应。此外,我们提出了径向信息传播的零模型,并开发了随时间跟踪径向模式的方法。总体而言,我们阐述了我们设计的新颖分析方法,展示了它们如何为手机用户对计划外灾难性事件的反应提供了新的视角,并深入了解了信息在灾难期间如何在时间和空间上传播。补充信息:在线版本包含补充资料,可在10.1140/epjds/s13688-025-00546-w获得。
{"title":"Detection of anomalous spatio-temporal patterns of app traffic in response to catastrophic events.","authors":"Sofia Medina, Shazia'Ayn Babul, Timothy LaRock, Rohit Sahasrabuddhe, Renaud Lambiotte, Nicola Pedreschi","doi":"10.1140/epjds/s13688-025-00546-w","DOIUrl":"10.1140/epjds/s13688-025-00546-w","url":null,"abstract":"<p><p>In this work, we uncover patterns of usage mobile phone applications and information spread in response to perturbations caused by unprecedented events. We focus on categorizing patterns of response in both space and time, tracking their relaxation over time. To this end, we use the NetMob2023 Data Challenge dataset, which provides mobile phone applications traffic volume data for several cities in France at a spatial resolution of 100 <math><msup><mi>m</mi> <mn>2</mn></msup> </math> and a time resolution of 15 minutes for a time period ranging from March to May 2019. We analyze the spread of information before, during, and after the catastrophic Notre-Dame fire on April 15th and a bombing that took place in the city centre of Lyon on May 24th using volume of data uploaded and downloaded to different mobile applications as a proxy of information transfer dynamics. We identify different clusters of information transfer dynamics in response to the Notre-Dame fire within the city of Paris as well as in other major French cities. We find a clear pattern of significantly above-baseline usage of the application Twitter (currently known as X) in Paris that radially spreads from the area surrounding the Notre-Dame cathedral to the rest of the city. We detect a similar pattern in the city of Lyon in response to the bombing. Further, we present a null model of radial information spread and develop methods of tracking radial patterns over time. Overall, we illustrate novel analytical methods we devise, showing how they enable a new perspective on mobile phone user response to unplanned catastrophic events and giving insight into how information spreads during a catastrophe in both time and space.</p><p><strong>Supplementary information: </strong>The online version contains supplementary material available at 10.1140/epjds/s13688-025-00546-w.</p>","PeriodicalId":11887,"journal":{"name":"EPJ Data Science","volume":"14 1","pages":"35"},"PeriodicalIF":3.0,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12055615/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143990977","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Socioeconomic disparities in mobility behavior during the COVID-19 pandemic in developing countries. COVID-19大流行期间发展中国家流动行为的社会经济差异。
IF 3 2区 计算机科学 Q1 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2025-01-01 Epub Date: 2025-03-24 DOI: 10.1140/epjds/s13688-025-00532-2
Lorenzo Lucchini, Ollin D Langle-Chimal, Lorenzo Candeago, Lucio Melito, Alex Chunet, Aleister Montfort, Bruno Lepri, Nancy Lozano-Gracia, Samuel P Fraiberger

Mobile phone data have played a key role in quantifying human mobility during the COVID-19 pandemic. Existing studies on mobility patterns have primarily focused on regional aggregates in high-income countries, obfuscating the accentuated impact of the pandemic on the most vulnerable populations. Leveraging geolocation data from mobile-phone users and population census for 6 middle-income countries across 3 continents between March and December 2020, we uncovered common disparities in the behavioral response to the pandemic across socioeconomic groups. Users living in low-wealth neighborhoods were less likely to respond by self-isolating, relocating to rural areas, or refraining from commuting to work. The gap in the behavioral responses between socioeconomic groups persisted during the entire observation period. Among users living in low-wealth neighborhoods, those who commute to work in high-wealth neighborhoods pre-pandemic were particularly at risk of experiencing economic stress, facing both the reduction in economic activity in the high-wealth neighborhood and being more likely to be affected by public transport closures due to their longer commute distances. While confinement policies were predominantly country-wide, these results suggest that, when data to identify vulnerable individuals are not readily available, GPS-based analytics could help design targeted place-based policies to aid the most vulnerable.

Supplementary information: The online version contains supplementary material available at 10.1140/epjds/s13688-025-00532-2.

在2019冠状病毒病大流行期间,手机数据在量化人类流动性方面发挥了关键作用。现有的关于人口流动模式的研究主要集中在高收入国家的区域总量上,混淆了疫情对最脆弱人群的严重影响。利用2020年3月至12月期间来自三大洲6个中等收入国家的移动电话用户地理位置数据和人口普查数据,我们发现了不同社会经济群体对大流行的行为反应的共同差异。生活在低财富社区的用户不太可能通过自我隔离、搬迁到农村地区或不上下班来应对。在整个观察期间,社会经济群体之间的行为反应差距持续存在。在生活在低财富社区的用户中,那些在大流行前在高财富社区上班的人特别容易遭受经济压力,既面临高财富社区经济活动的减少,又更有可能受到公共交通关闭的影响,因为他们的通勤距离更长。虽然限制政策主要是在全国范围内实施的,但这些结果表明,当识别弱势群体的数据不容易获得时,基于gps的分析可以帮助设计有针对性的基于地方的政策,以帮助最弱势群体。补充信息:在线版本包含补充资料,可在10.1140/epjds/s13688-025-00532-2获得。
{"title":"Socioeconomic disparities in mobility behavior during the COVID-19 pandemic in developing countries.","authors":"Lorenzo Lucchini, Ollin D Langle-Chimal, Lorenzo Candeago, Lucio Melito, Alex Chunet, Aleister Montfort, Bruno Lepri, Nancy Lozano-Gracia, Samuel P Fraiberger","doi":"10.1140/epjds/s13688-025-00532-2","DOIUrl":"10.1140/epjds/s13688-025-00532-2","url":null,"abstract":"<p><p>Mobile phone data have played a key role in quantifying human mobility during the COVID-19 pandemic. Existing studies on mobility patterns have primarily focused on regional aggregates in high-income countries, obfuscating the accentuated impact of the pandemic on the most vulnerable populations. Leveraging geolocation data from mobile-phone users and population census for 6 middle-income countries across 3 continents between March and December 2020, we uncovered common disparities in the behavioral response to the pandemic across socioeconomic groups. Users living in low-wealth neighborhoods were less likely to respond by self-isolating, relocating to rural areas, or refraining from commuting to work. The gap in the behavioral responses between socioeconomic groups persisted during the entire observation period. Among users living in low-wealth neighborhoods, those who commute to work in high-wealth neighborhoods pre-pandemic were particularly at risk of experiencing economic stress, facing both the reduction in economic activity in the high-wealth neighborhood and being more likely to be affected by public transport closures due to their longer commute distances. While confinement policies were predominantly country-wide, these results suggest that, when data to identify vulnerable individuals are not readily available, GPS-based analytics could help design targeted place-based policies to aid the most vulnerable.</p><p><strong>Supplementary information: </strong>The online version contains supplementary material available at 10.1140/epjds/s13688-025-00532-2.</p>","PeriodicalId":11887,"journal":{"name":"EPJ Data Science","volume":"14 1","pages":"25"},"PeriodicalIF":3.0,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11933202/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143717971","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Mapping global value chains at the product level. 在产品层面绘制全球价值链。
IF 3 2区 计算机科学 Q1 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2025-01-01 Epub Date: 2025-03-12 DOI: 10.1140/epjds/s13688-025-00521-5
Lea Karbevska, César A Hidalgo

Value chain data is crucial for navigating economic disruptions. Yet, despite its importance, we lack publicly available product-level value chain datasets, since resources such as the "World Input-Output Database", "Inter-Country Input-Output Tables", "EXIOBASE", and "EORA", lack information about products (e.g. Radio Receivers, Telephones, Electrical Capacitors, LCDs, etc.) and instead rely on aggregate industrial sectors (e.g. Electrical Equipment, Telecommunications). Here, we introduce a method that leverages ideas from machine learning and trade theory to infer product-level value chain relationships from fine-grained international trade data. We apply our method to data summarizing the exports and imports of 1200+ products and 250+ world regions (e.g. states in the U.S., prefectures in Japan, etc.) to infer value chain information implicit in their trade patterns. In short, we leverage the idea that due to global value chains, regions specialized in the export of a product will tend to specialize in the import of its inputs. We use this idea to develop a novel proportional allocation model to estimate product-level trade flows between regions and countries. This contributes a method to approximate value chain data at the product level that should be of interest to people working in logistics, trade, and sustainable development.

Supplementary information: The online version contains supplementary material available at 10.1140/epjds/s13688-025-00521-5.

价值链数据对于应对经济动荡至关重要。然而,尽管它很重要,但我们缺乏公开可用的产品级价值链数据集,因为诸如“世界投入产出数据库”、“国家间投入产出表”、“EXIOBASE”和“EORA”等资源缺乏有关产品的信息(例如无线电接收器、电话、电容器、lcd等),而是依赖于总工业部门(例如电气设备、电信)。在这里,我们引入了一种方法,利用机器学习和贸易理论的思想,从细粒度的国际贸易数据中推断产品级价值链关系。我们将我们的方法应用于汇总1200多种产品和250多个世界地区(如美国各州,日本县等)的进出口数据,以推断其贸易模式中隐含的价值链信息。简而言之,我们利用了这样一种观点,即由于全球价值链的存在,专门从事某种产品出口的地区将倾向于专门从事该产品投入的进口。我们利用这一思想开发了一个新的比例分配模型来估计地区和国家之间的产品级贸易流量。这提供了一种在产品层面近似价值链数据的方法,对于从事物流、贸易和可持续发展工作的人来说应该是感兴趣的。补充信息:在线版本包含补充资料,可在10.1140/epjds/s13688-025-00521-5获得。
{"title":"Mapping global value chains at the product level.","authors":"Lea Karbevska, César A Hidalgo","doi":"10.1140/epjds/s13688-025-00521-5","DOIUrl":"https://doi.org/10.1140/epjds/s13688-025-00521-5","url":null,"abstract":"<p><p>Value chain data is crucial for navigating economic disruptions. Yet, despite its importance, we lack publicly available product-level value chain datasets, since resources such as the \"World Input-Output Database\", \"Inter-Country Input-Output Tables\", \"EXIOBASE\", and \"EORA\", lack information about products (e.g. Radio Receivers, Telephones, Electrical Capacitors, LCDs, etc.) and instead rely on aggregate industrial sectors (e.g. Electrical Equipment, Telecommunications). Here, we introduce a method that leverages ideas from machine learning and trade theory to infer product-level value chain relationships from fine-grained international trade data. We apply our method to data summarizing the exports and imports of 1200+ products and 250+ world regions (e.g. states in the U.S., prefectures in Japan, etc.) to infer value chain information implicit in their trade patterns. In short, we leverage the idea that due to global value chains, regions specialized in the export of a product will tend to specialize in the import of its inputs. We use this idea to develop a novel proportional allocation model to estimate product-level trade flows between regions and countries. This contributes a method to approximate value chain data at the product level that should be of interest to people working in logistics, trade, and sustainable development.</p><p><strong>Supplementary information: </strong>The online version contains supplementary material available at 10.1140/epjds/s13688-025-00521-5.</p>","PeriodicalId":11887,"journal":{"name":"EPJ Data Science","volume":"14 1","pages":"21"},"PeriodicalIF":3.0,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11903633/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143647657","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Personalisation and profiling using algorithms and not-so-popular Colombian music: goal-directed mechanisms in music emotion recognition. 使用算法和不太流行的哥伦比亚音乐的个性化和分析:音乐情感识别中的目标导向机制。
IF 2.5 2区 计算机科学 Q1 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2025-01-01 Epub Date: 2025-11-13 DOI: 10.1140/epjds/s13688-025-00595-1
Juan Sebastián Gómez-Cañón, Thomas Magnus Lennie, Tuomas Eerola, Pablo Aragón, Estefanía Cano, Perfecto Herrera, Emilia Gómez

This work investigates how personalised Music Emotion Recognition (MER) systems may lead to sensitive profiling when applied to musically induced emotions in politically charged contexts. We focus on traditional Colombian music with explicit political content, including (1) vallenatos and social songs aligned with the left-wing guerrilla Fuerzas Armadas Revolucionarias de Colombia (FARC), and (2) corridos linked to sympathisers of the right-wing paramilitary group Autodefensas Unidas de Colombia (AUC). Using data from 49 participants with diverse political leanings, we train personalised machine learning models to predict induced emotional responses - particularly negative emotions. Our findings reveal that political identity plays a significant role in shaping emotional experiences of music with explicit political content, and that emotion recognition models can capture this variation to a certain extent. These results raise critical concerns about the potential misuse of emotion recognition technologies. What is often framed as a tool for wellbeing and emotional regulation could, in politically sensitive contexts, be repurposed for user profiling. This work highlights the ethical risks of deploying AI-driven emotion analysis without safeguards, particularly among populations that are politically or socially vulnerable. We argue that subjective emotional responses may constitute sensitive personal data, and that failing to account for their sociopolitical context could amplify harm and exclusion.

Supplementary information: The online version contains supplementary material available at 10.1140/epjds/s13688-025-00595-1.

这项工作研究了个性化的音乐情感识别(MER)系统如何在政治环境中应用于音乐诱发的情绪时导致敏感的分析。我们专注于具有明确政治内容的传统哥伦比亚音乐,包括(1)与左翼游击队“哥伦比亚革命武装力量”(FARC)结盟的valenatos和社会歌曲,以及(2)与右翼准军事组织“哥伦比亚自治联盟”(AUC)同情者有关的走廊。使用来自49名具有不同政治倾向的参与者的数据,我们训练个性化的机器学习模型来预测诱发的情绪反应——尤其是负面情绪。我们的研究结果表明,政治认同在塑造具有明确政治内容的音乐的情感体验中起着重要作用,并且情感识别模型可以在一定程度上捕捉这种变化。这些结果引起了人们对情绪识别技术可能被滥用的严重担忧。在政治敏感的背景下,通常被视为健康和情绪调节工具的东西,可能会被重新用于用户分析。这项工作强调了在没有保障措施的情况下部署人工智能驱动的情感分析的伦理风险,特别是在政治或社会弱势群体中。我们认为,主观情绪反应可能构成敏感的个人数据,如果不考虑其社会政治背景,可能会放大伤害和排斥。补充信息:在线版本包含补充资料,可在10.1140/epjds/s13688-025-00595-1获得。
{"title":"Personalisation and profiling using algorithms and not-so-popular Colombian music: goal-directed mechanisms in music emotion recognition.","authors":"Juan Sebastián Gómez-Cañón, Thomas Magnus Lennie, Tuomas Eerola, Pablo Aragón, Estefanía Cano, Perfecto Herrera, Emilia Gómez","doi":"10.1140/epjds/s13688-025-00595-1","DOIUrl":"10.1140/epjds/s13688-025-00595-1","url":null,"abstract":"<p><p>This work investigates how personalised Music Emotion Recognition (MER) systems may lead to sensitive profiling when applied to musically induced emotions in politically charged contexts. We focus on traditional Colombian music with explicit political content, including (1) vallenatos and social songs aligned with the left-wing guerrilla Fuerzas Armadas Revolucionarias de Colombia (FARC), and (2) corridos linked to sympathisers of the right-wing paramilitary group Autodefensas Unidas de Colombia (AUC). Using data from 49 participants with diverse political leanings, we train personalised machine learning models to predict induced emotional responses - particularly negative emotions. Our findings reveal that political identity plays a significant role in shaping emotional experiences of music with explicit political content, and that emotion recognition models can capture this variation to a certain extent. These results raise critical concerns about the potential misuse of emotion recognition technologies. What is often framed as a tool for wellbeing and emotional regulation could, in politically sensitive contexts, be repurposed for user profiling. This work highlights the ethical risks of deploying AI-driven emotion analysis without safeguards, particularly among populations that are politically or socially vulnerable. We argue that subjective emotional responses may constitute sensitive personal data, and that failing to account for their sociopolitical context could amplify harm and exclusion.</p><p><strong>Supplementary information: </strong>The online version contains supplementary material available at 10.1140/epjds/s13688-025-00595-1.</p>","PeriodicalId":11887,"journal":{"name":"EPJ Data Science","volume":"14 1","pages":"80"},"PeriodicalIF":2.5,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12615516/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145539583","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
EPJ Data Science
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1