首页 > 最新文献

EPJ Data Science最新文献

英文 中文
Language and the use of law are predictive of judge gender and seniority 语言和法律的使用可预测法官的性别和资历
IF 3.6 2区 计算机科学 Q1 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2024-09-02 DOI: 10.1140/epjds/s13688-024-00494-x
Lluc Font-Pomarol, Angelo Piga, Sergio Nasarre-Aznar, Marta Sales-Pardo, Roger Guimerà

There are examples of how unconscious bias can influence actions of people. In the judiciary, however, despite some examples there is no general theory on whether different demographic attributes such as gender, seniority or ethnicity affect case sentencing. We aim to gain insight into this issue by analyzing over 100k decisions of three different areas of law with the goal of understanding whether judge identity or judge attributes such as gender and seniority can be inferred from decision documents. We find that stylistic features of decisions are predictive of judge identities, their gender and their seniority, a finding that is aligned with results from analysis of written texts outside the judiciary. Surprisingly, we find that features based on legislation cited are also predictive of judge identities and attributes. While own content reuse by judges can explain our ability to predict judge identities, no specific reduced set of features can explain the differences we find in the legislation cited of decisions when we group judges by gender or seniority. Our findings open the door for further research on how these differences translate into how judges apply the law and, ultimately, to promote a more transparent and fair judiciary system.

无意识的偏见会影响人们的行为,这方面的例子不胜枚举。然而,在司法领域,尽管有一些例子,但对于性别、资历或种族等不同的人口属性是否会影响案件判决,却没有普遍的理论。我们分析了三个不同法律领域的 10 多万份判决,旨在了解是否可以从判决文件中推断出法官身份或法官属性(如性别和资历),从而深入了解这一问题。我们发现,判决书的文体特征可以预测法官身份、性别和资历,这一发现与司法机构以外的书面文本分析结果一致。令人惊讶的是,我们发现基于所引用立法的特征也能预测法官的身份和属性。虽然法官重复使用自己的内容可以解释我们预测法官身份的能力,但当我们按性别或资历对法官进行分组时,没有一组特定的缩减特征可以解释我们发现的判决所引用立法的差异。我们的发现为进一步研究这些差异如何转化为法官如何适用法律打开了大门,并最终促进司法系统更加透明和公平。
{"title":"Language and the use of law are predictive of judge gender and seniority","authors":"Lluc Font-Pomarol, Angelo Piga, Sergio Nasarre-Aznar, Marta Sales-Pardo, Roger Guimerà","doi":"10.1140/epjds/s13688-024-00494-x","DOIUrl":"https://doi.org/10.1140/epjds/s13688-024-00494-x","url":null,"abstract":"<p>There are examples of how unconscious bias can influence actions of people. In the judiciary, however, despite some examples there is no general theory on whether different demographic attributes such as gender, seniority or ethnicity affect case sentencing. We aim to gain insight into this issue by analyzing over 100k decisions of three different areas of law with the goal of understanding whether judge identity or judge attributes such as gender and seniority can be inferred from decision documents. We find that stylistic features of decisions are predictive of judge identities, their gender and their seniority, a finding that is aligned with results from analysis of written texts outside the judiciary. Surprisingly, we find that features based on legislation cited are also predictive of judge identities and attributes. While own content reuse by judges can explain our ability to predict judge identities, no specific reduced set of features can explain the differences we find in the legislation cited of decisions when we group judges by gender or seniority. Our findings open the door for further research on how these differences translate into how judges apply the law and, ultimately, to promote a more transparent and fair judiciary system.</p>","PeriodicalId":11887,"journal":{"name":"EPJ Data Science","volume":"13 1","pages":""},"PeriodicalIF":3.6,"publicationDate":"2024-09-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142189232","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Connection between climatic change and international food prices: evidence from robust long-range cross-correlation and variable-lag transfer entropy with sliding windows approach 气候变化与国际粮食价格之间的联系:稳健的长程交叉相关性和滑动窗口法的变滞后转移熵证据
IF 3.6 2区 计算机科学 Q1 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2024-08-14 DOI: 10.1140/epjds/s13688-024-00482-1
Zouhaier Dhifaoui

As nations progress, the impact of climate change on food prices becomes increasingly substantial. While the influence of climate change on the yields of major agricultural products is widely recognized, its specific effect on food prices remains uncertain. This study delves into the impact of the North Atlantic Oscillation (NAO) index, a well-established climate indicator, on global food prices. To accomplish this, a robust bivariate Hurst exponent (robust bHe) is applied. The study employs a sliding windows approach across various time scales to produce a color map of this coefficient, presenting a time-varying version. Furthermore, variable-lag transfer entropy with a sliding windows approach is utilized to discern causal relationships between the NAO index and international food prices. The findings reveal that significant increases in the NAO index are correlated with noteworthy upswings in various international food prices over both short and long-term periods. Additionally, variable-lag transfer entropy confirms the causal role of the NAO index in influencing international food prices.

随着国家的进步,气候变化对粮食价格的影响越来越大。虽然气候变化对主要农产品产量的影响已得到广泛认可,但其对粮食价格的具体影响仍不确定。本研究深入探讨了北大西洋涛动指数(NAO)这一成熟的气候指标对全球粮食价格的影响。为此,采用了稳健双变量赫斯特指数(稳健 bHe)。该研究采用了一种跨越不同时间尺度的滑动窗口方法,绘制出该系数的彩色地图,呈现出一个随时间变化的版本。此外,还利用滑动窗口法的可变滞后转移熵来判别西北农林业大学指数与国际粮食价格之间的因果关系。研究结果表明,在短期和长期内,NAO 指数的大幅上升与各种国际粮食价格的显著上升相关。此外,可变滞后转移熵也证实了西北农林业大学指数在影响国际粮食价格方面的因果作用。
{"title":"Connection between climatic change and international food prices: evidence from robust long-range cross-correlation and variable-lag transfer entropy with sliding windows approach","authors":"Zouhaier Dhifaoui","doi":"10.1140/epjds/s13688-024-00482-1","DOIUrl":"https://doi.org/10.1140/epjds/s13688-024-00482-1","url":null,"abstract":"<p>As nations progress, the impact of climate change on food prices becomes increasingly substantial. While the influence of climate change on the yields of major agricultural products is widely recognized, its specific effect on food prices remains uncertain. This study delves into the impact of the North Atlantic Oscillation (NAO) index, a well-established climate indicator, on global food prices. To accomplish this, a robust bivariate Hurst exponent (robust bHe) is applied. The study employs a sliding windows approach across various time scales to produce a color map of this coefficient, presenting a time-varying version. Furthermore, variable-lag transfer entropy with a sliding windows approach is utilized to discern causal relationships between the NAO index and international food prices. The findings reveal that significant increases in the NAO index are correlated with noteworthy upswings in various international food prices over both short and long-term periods. Additionally, variable-lag transfer entropy confirms the causal role of the NAO index in influencing international food prices.</p>","PeriodicalId":11887,"journal":{"name":"EPJ Data Science","volume":"34 1","pages":""},"PeriodicalIF":3.6,"publicationDate":"2024-08-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142189235","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Keep your friends close, and your enemies closer: structural properties of negative relationships on Twitter 亲近朋友,亲近敌人:推特上负面关系的结构特性
IF 3.6 2区 计算机科学 Q1 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2024-08-09 DOI: 10.1140/epjds/s13688-024-00485-y
Jack Tacchi, Chiara Boldrini, Andrea Passarella, Marco Conti

The Ego Network Model (ENM) is a model for the structural organisation of relationships, rooted in evolutionary anthropology, that is found ubiquitously in social contexts. It takes the perspective of a single user (Ego) and organises their contacts (Alters) into a series of (typically 5) concentric circles of decreasing intimacy and increasing size. Alters are sorted based on their tie strength to the Ego, however, this is difficult to measure directly. Traditionally, the interaction frequency has been used as a proxy but this misses the qualitative aspects of connections, such as signs (i.e. polarity), which have been shown to provide extremely useful information. However, the sign of an online social relationship is usually an implicit piece of information, which needs to be estimated by interaction data from Online Social Networks (OSNs), making sign prediction in OSNs a research challenge in and of itself. This work aims to bring the ENM into the signed networks domain by investigating the interplay of signed connections with the ENM. This paper delivers 2 main contributions. Firstly, a new and data-efficient method of signing relationships between individuals using sentiment analysis and, secondly, we provide an in-depth look at the properties of Signed Ego Networks (SENs), using 9 Twitter datasets of various categories of users. We find that negative connections are generally over-represented in the active part of the Ego Networks, suggesting that Twitter greatly over-emphasises negative relationships with respect to “offline” social networks. Further, users who use social networks for professional reasons have an even greater share of negative connections. Despite this, we also found weak signs that less negative users tend to allocate more cognitive effort to individual relationships and thus have smaller ego networks on average. All in all, even though structurally ENMs are known to be similar in both offline and online social networks, our results indicate that relationships on Twitter tend to nurture more negativity than offline contexts.

自我网络模型(ENM)是一种关系结构组织模型,植根于进化人类学,在社会环境中随处可见。它从单个用户(自我)的角度出发,将他们的联系人(Alters)组织成一系列(通常为 5 个)同心圆,这些同心圆的亲密程度依次递减,规模依次增大。联系人根据与自我的联系强度进行排序,但这很难直接测量。传统上,互动频率被用作一种替代指标,但这忽略了联系的质量方面,如标志(即极性),而这些标志已被证明能提供极为有用的信息。然而,在线社交关系的符号通常是一种隐含信息,需要通过在线社交网络(OSN)中的交互数据来估算,因此在 OSN 中进行符号预测本身就是一项研究挑战。这项工作旨在通过研究签名连接与 ENM 的相互作用,将 ENM 引入签名网络领域。本文有两大贡献。首先,我们提出了一种新的、数据效率高的方法,利用情感分析对个人之间的关系进行签名;其次,我们利用 9 个不同类别用户的 Twitter 数据集深入研究了签名自我网络(SEN)的特性。我们发现,在自我网络的活跃部分,负面联系的比例普遍过高,这表明与 "离线 "社交网络相比,Twitter 过度强调负面关系。此外,因职业原因而使用社交网络的用户的负面连接比例更高。尽管如此,我们也发现了一些微弱的迹象,表明负面关系较少的用户倾向于将更多的认知努力分配给个人关系,因此平均而言,他们的自我网络较小。总而言之,尽管众所周知线下和线上社交网络的ENM在结构上是相似的,但我们的研究结果表明,Twitter上的人际关系往往比线下更容易滋生消极情绪。
{"title":"Keep your friends close, and your enemies closer: structural properties of negative relationships on Twitter","authors":"Jack Tacchi, Chiara Boldrini, Andrea Passarella, Marco Conti","doi":"10.1140/epjds/s13688-024-00485-y","DOIUrl":"https://doi.org/10.1140/epjds/s13688-024-00485-y","url":null,"abstract":"<p>The Ego Network Model (ENM) is a model for the structural organisation of relationships, rooted in evolutionary anthropology, that is found ubiquitously in social contexts. It takes the perspective of a single user (Ego) and organises their contacts (Alters) into a series of (typically 5) concentric circles of decreasing intimacy and increasing size. Alters are sorted based on their tie strength to the Ego, however, this is difficult to measure directly. Traditionally, the interaction frequency has been used as a proxy but this misses the qualitative aspects of connections, such as signs (i.e. polarity), which have been shown to provide extremely useful information. However, the sign of an online social relationship is usually an implicit piece of information, which needs to be estimated by interaction data from Online Social Networks (OSNs), making sign prediction in OSNs a research challenge in and of itself. This work aims to bring the ENM into the signed networks domain by investigating the interplay of signed connections with the ENM. This paper delivers 2 main contributions. Firstly, a new and data-efficient method of signing relationships between individuals using sentiment analysis and, secondly, we provide an in-depth look at the properties of Signed Ego Networks (SENs), using 9 Twitter datasets of various categories of users. We find that negative connections are generally over-represented in the active part of the Ego Networks, suggesting that Twitter greatly over-emphasises negative relationships with respect to “offline” social networks. Further, users who use social networks for professional reasons have an even greater share of negative connections. Despite this, we also found weak signs that less negative users tend to allocate more cognitive effort to <i>individual</i> relationships and thus have smaller ego networks on average. All in all, even though <i>structurally</i> ENMs are known to be similar in both offline and online social networks, our results indicate that relationships on Twitter tend to nurture more negativity than offline contexts.</p>","PeriodicalId":11887,"journal":{"name":"EPJ Data Science","volume":"89 1","pages":""},"PeriodicalIF":3.6,"publicationDate":"2024-08-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141936633","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Analyzing user ideologies and shared news during the 2019 argentinian elections 分析 2019 年阿根廷大选期间的用户意识形态和共享新闻
IF 3.6 2区 计算机科学 Q1 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2024-08-08 DOI: 10.1140/epjds/s13688-024-00493-y
Sofía M. del Pozo, Sebastián Pinto, Matteo Serafino, Lucio Garcia, Hernán A. Makse, Pablo Balenzuela

The extensive data generated on social media platforms allow us to gain insights over trending topics and public opinions. Additionally, it offers a window into user behavior, including their content engagement and news sharing habits. In this study, we analyze the relationship between users’ political ideologies and the news they share during Argentina’s 2019 election period. Our findings reveal that users predominantly share news that aligns with their political beliefs, despite accessing media outlets with diverse political leanings. Moreover, we observe a consistent pattern of users sharing articles related to topics biased to their preferred candidates, highlighting a deeper level of political alignment in online discussions. We believe that this systematic analysis framework can be applied to similar scenarios in different countries, especially those marked by significant political polarization, akin to Argentina.

社交媒体平台上产生的大量数据使我们能够深入了解热门话题和公众意见。此外,它还提供了一个了解用户行为的窗口,包括他们的内容参与和新闻分享习惯。在本研究中,我们分析了用户的政治意识形态与他们在 2019 年阿根廷大选期间分享的新闻之间的关系。我们的研究结果表明,尽管用户访问的媒体具有不同的政治倾向,但他们主要分享与其政治信仰一致的新闻。此外,我们还观察到一种一致的模式,即用户分享与其偏好的候选人相关的主题文章,这凸显了在线讨论中更深层次的政治一致性。我们相信,这一系统分析框架可应用于不同国家的类似情况,尤其是那些政治两极分化严重的国家,如阿根廷。
{"title":"Analyzing user ideologies and shared news during the 2019 argentinian elections","authors":"Sofía M. del Pozo, Sebastián Pinto, Matteo Serafino, Lucio Garcia, Hernán A. Makse, Pablo Balenzuela","doi":"10.1140/epjds/s13688-024-00493-y","DOIUrl":"https://doi.org/10.1140/epjds/s13688-024-00493-y","url":null,"abstract":"<p>The extensive data generated on social media platforms allow us to gain insights over trending topics and public opinions. Additionally, it offers a window into user behavior, including their content engagement and news sharing habits. In this study, we analyze the relationship between users’ political ideologies and the news they share during Argentina’s 2019 election period. Our findings reveal that users predominantly share news that aligns with their political beliefs, despite accessing media outlets with diverse political leanings. Moreover, we observe a consistent pattern of users sharing articles related to topics biased to their preferred candidates, highlighting a deeper level of political alignment in online discussions. We believe that this systematic analysis framework can be applied to similar scenarios in different countries, especially those marked by significant political polarization, akin to Argentina.</p>","PeriodicalId":11887,"journal":{"name":"EPJ Data Science","volume":"57 1","pages":""},"PeriodicalIF":3.6,"publicationDate":"2024-08-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141936634","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Empirically measuring online social influence 实证衡量网络社交影响力
IF 3.6 2区 计算机科学 Q1 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2024-08-05 DOI: 10.1140/epjds/s13688-024-00492-z
Rohit Ram, Marian-Andrei Rizoiu

Social influence pervades our everyday lives and lays the foundation for complex social phenomena, such as the spread of misinformation and the polarization of communities. A disconnect appears between psychology approaches, generally performed and tested in controlled lab experiments, and quantitative methods, which are usually data-driven and rely on network and event analysis. The former are slow, expensive to deploy, and typically do not generalize well to topical issues; the latter often oversimplify the complexities of social influence and ignore psychosocial literature. This work bridges this gap by introducing a human-in-the-loop active learning method that empirically quantifies social influence by crowdsourcing pairwise influence comparisons. We develop simulation and fitting tools, allowing us to estimate the required budget based on the design features and the worker’s decision accuracy. We perform a series of pilot studies to quantify the impact of design features on worker accuracy. We deploy our method to estimate the influence ranking of 500 X/Twitter users. We validate our measure by showing that the obtained empirical influence is tightly linked with agency and communion, the Big Two of social cognition, with agency being the most important dimension for influence formation.

社会影响充斥着我们的日常生活,并为错误信息的传播和社区两极分化等复杂的社会现象奠定了基础。心理学方法通常在受控实验室实验中执行和测试,而定量方法通常是数据驱动的,依赖于网络和事件分析,两者之间出现了脱节。前者速度慢,部署成本高,通常不能很好地概括热点问题;后者往往过于简化社会影响的复杂性,忽略社会心理学文献。这项研究通过引入一种人在回路中的主动学习方法弥补了这一不足,该方法通过众包配对影响力比较来量化社会影响力。我们开发了模拟和拟合工具,使我们能够根据设计特点和工作人员的决策准确性估算出所需预算。我们进行了一系列试点研究,以量化设计特征对工人准确性的影响。我们采用我们的方法估算了 500 名 X/Twitter 用户的影响力排名。我们验证了我们的测量方法,结果表明所获得的经验影响力与社会认知的两大要素--代入感和交际密切相关,其中代入感是影响力形成的最重要维度。
{"title":"Empirically measuring online social influence","authors":"Rohit Ram, Marian-Andrei Rizoiu","doi":"10.1140/epjds/s13688-024-00492-z","DOIUrl":"https://doi.org/10.1140/epjds/s13688-024-00492-z","url":null,"abstract":"<p>Social influence pervades our everyday lives and lays the foundation for complex social phenomena, such as the spread of misinformation and the polarization of communities. A disconnect appears between psychology approaches, generally performed and tested in controlled lab experiments, and quantitative methods, which are usually data-driven and rely on network and event analysis. The former are slow, expensive to deploy, and typically do not generalize well to topical issues; the latter often oversimplify the complexities of social influence and ignore psychosocial literature. This work bridges this gap by introducing a human-in-the-loop active learning method that empirically quantifies social influence by crowdsourcing pairwise influence comparisons. We develop simulation and fitting tools, allowing us to estimate the required budget based on the design features and the worker’s decision accuracy. We perform a series of pilot studies to quantify the impact of design features on worker accuracy. We deploy our method to estimate the influence ranking of 500 X/Twitter users. We validate our measure by showing that the obtained empirical influence is tightly linked with agency and communion, the Big Two of social cognition, with agency being the most important dimension for influence formation.</p>","PeriodicalId":11887,"journal":{"name":"EPJ Data Science","volume":"3 1","pages":""},"PeriodicalIF":3.6,"publicationDate":"2024-08-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141936631","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Efficiency and resilience: key drivers of distribution network growth 效率和复原力:配电网增长的主要驱动力
IF 3.6 2区 计算机科学 Q1 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2024-08-01 DOI: 10.1140/epjds/s13688-024-00484-z
Ambra Amico, Giacomo Vaccario, Frank Schweitzer

Networks to distribute goods, from raw materials to food and medicines, are the backbone of a functioning economy. They are shaped by several supply relations connecting manufacturers, distributors, and final buyers worldwide. We present a network-based model to describe the mechanisms underlying the emergence and growth of distribution networks. In our model, firms consider two practices when establishing new supply relations: centralization, the tendency to choose highly connected partners, and multi-sourcing, the preference for multiple suppliers. Centralization enhances network efficiency by leveraging short distribution paths; multi-sourcing fosters resilience by providing multiple distribution paths connecting final buyers to the manufacturer. We validate the proposed model using data on drug shipments in the US. Drawing on these data, we reconstruct 22 nationwide pharmaceutical distribution networks. We demonstrate that the proposed model successfully replicates several structural features of the empirical networks, including their out-degree and path length distributions as well as their resilience and efficiency properties. These findings suggest that the proposed firm-level practices effectively capture the network growth process that leads to the observed structures.

从原材料到食品和药品,商品流通网络是经济运行的支柱。它们是由连接全球制造商、分销商和最终买家的若干供应关系形成的。我们提出了一个基于网络的模型,用以描述分销网络出现和发展的内在机制。在我们的模型中,企业在建立新的供应关系时会考虑两种做法:集中化(倾向于选择联系紧密的合作伙伴)和多重采购(倾向于选择多个供应商)。集中化通过利用短分销路径来提高网络效率;多源化通过提供连接最终买家和制造商的多条分销路径来提高弹性。我们利用美国的药品运输数据验证了所提出的模型。根据这些数据,我们重建了 22 个全国性的药品分销网络。我们证明,所提出的模型成功地复制了经验网络的几个结构特征,包括它们的外度和路径长度分布,以及它们的弹性和效率特性。这些研究结果表明,所提出的企业级实践有效地捕捉到了导致观察到的结构的网络增长过程。
{"title":"Efficiency and resilience: key drivers of distribution network growth","authors":"Ambra Amico, Giacomo Vaccario, Frank Schweitzer","doi":"10.1140/epjds/s13688-024-00484-z","DOIUrl":"https://doi.org/10.1140/epjds/s13688-024-00484-z","url":null,"abstract":"<p>Networks to distribute goods, from raw materials to food and medicines, are the backbone of a functioning economy. They are shaped by several supply relations connecting manufacturers, distributors, and final buyers worldwide. We present a network-based model to describe the mechanisms underlying the emergence and growth of distribution networks. In our model, firms consider two practices when establishing new supply relations: centralization, the tendency to choose highly connected partners, and multi-sourcing, the preference for multiple suppliers. Centralization enhances network efficiency by leveraging short distribution paths; multi-sourcing fosters resilience by providing multiple distribution paths connecting final buyers to the manufacturer. We validate the proposed model using data on drug shipments in the US. Drawing on these data, we reconstruct 22 nationwide pharmaceutical distribution networks. We demonstrate that the proposed model successfully replicates several structural features of the empirical networks, including their out-degree and path length distributions as well as their resilience and efficiency properties. These findings suggest that the proposed firm-level practices effectively capture the network growth process that leads to the observed structures.</p>","PeriodicalId":11887,"journal":{"name":"EPJ Data Science","volume":"21 1","pages":""},"PeriodicalIF":3.6,"publicationDate":"2024-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141881844","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Measuring corporate digital divide through websites: insights from Italian firms 通过网站衡量企业数字鸿沟:意大利企业的启示
IF 3.6 2区 计算机科学 Q1 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2024-07-30 DOI: 10.1140/epjds/s13688-024-00491-0
Leonardo Mazzoni, Fabio Pinelli, Massimo Riccaboni

With the increasing pervasiveness of Information and Communication Technology (ICT) in the fabric of economic activities, the corporate digital divide has become a crucial issue for the assessment of Information Technology (IT) competencies and the digital gap between firms and territories. With little granular data available to measure the phenomenon, most studies have used survey data. To address this empirical gap, we scanned the homepages of 182,705 Italian companies and extracted ten characteristics related to their digital footprint to develop a new index for the corporate digital assessment. Our results show a significant digital divide between Italian companies according to size, sector and geographical location, opening new perspectives for monitoring and data-driven analysis.

随着信息与传播技术(ICT)在经济活动中的日益普及,企业数字鸿沟已成为评估信息技术(IT)能力以及企业和地区之间数字差距的一个关键问题。由于可用于衡量这一现象的详细数据很少,大多数研究都使用了调查数据。为了弥补这一经验上的不足,我们对 182705 家意大利公司的主页进行了扫描,并提取了与其数字足迹相关的十个特征,从而为企业数字评估制定了一个新的指数。我们的研究结果表明,根据规模、行业和地理位置的不同,意大利公司之间存在明显的数字鸿沟,这为监测和数据驱动分析开辟了新的视角。
{"title":"Measuring corporate digital divide through websites: insights from Italian firms","authors":"Leonardo Mazzoni, Fabio Pinelli, Massimo Riccaboni","doi":"10.1140/epjds/s13688-024-00491-0","DOIUrl":"https://doi.org/10.1140/epjds/s13688-024-00491-0","url":null,"abstract":"<p>With the increasing pervasiveness of Information and Communication Technology (ICT) in the fabric of economic activities, the corporate digital divide has become a crucial issue for the assessment of Information Technology (IT) competencies and the digital gap between firms and territories. With little granular data available to measure the phenomenon, most studies have used survey data. To address this empirical gap, we scanned the homepages of 182,705 Italian companies and extracted ten characteristics related to their digital footprint to develop a new index for the corporate digital assessment. Our results show a significant digital divide between Italian companies according to size, sector and geographical location, opening new perspectives for monitoring and data-driven analysis.</p>","PeriodicalId":11887,"journal":{"name":"EPJ Data Science","volume":"47 1","pages":""},"PeriodicalIF":3.6,"publicationDate":"2024-07-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141868741","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The structural evolution of temporal hypergraphs through the lens of hyper-cores 通过超核透视时空超图的结构演化
IF 3.6 2区 计算机科学 Q1 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2024-07-25 DOI: 10.1140/epjds/s13688-024-00490-1
Marco Mancastroppa, Iacopo Iacopini, Giovanni Petri, Alain Barrat

The richness of many complex systems stems from the interactions among their components. The higher-order nature of these interactions, involving many units at once, and their temporal dynamics constitute crucial properties that shape the behaviour of the system itself. An adequate description of these systems is offered by temporal hypergraphs, that integrate these features within the same framework. However, tools for their temporal and topological characterization are still scarce. Here we develop a series of methods specifically designed to analyse the structural properties of temporal hypergraphs at multiple scales. Leveraging the hyper-core decomposition of hypergraphs, we follow the evolution of the hyper-cores through time, characterizing the hypergraph structure and its temporal dynamics at different topological scales, and quantifying the multi-scale structural stability of the system. We also define two static hypercoreness centrality measures that provide an overall description of the nodes aggregated structural behaviour. We apply the characterization methods to several data sets, establishing connections between structural properties and specific activities within the systems. Finally, we show how the proposed method can be used as a model-validation tool for synthetic temporal hypergraphs, distinguishing the higher-order structures and dynamics generated by different models from the empirical ones, and thus identifying the essential model mechanisms to reproduce the empirical hypergraph structure and evolution. Our work opens several research directions, from the understanding of dynamic processes on temporal higher-order networks to the design of new models of time-varying hypergraphs.

许多复杂系统的丰富性源于其各组成部分之间的相互作用。这些相互作用的高阶性质(同时涉及许多单元)及其时间动态构成了塑造系统本身行为的关键属性。时空超图可以充分描述这些系统,并将这些特征整合到同一个框架中。然而,用于描述这些系统的时间和拓扑特征的工具仍然很少。在此,我们开发了一系列专门用于分析多尺度时空超图结构特性的方法。利用超图的超核分解,我们跟踪超核随时间的演变,在不同拓扑尺度上表征超图结构及其时间动态,并量化系统的多尺度结构稳定性。我们还定义了两种静态超核中心性度量,可全面描述节点的聚合结构行为。我们将特征描述方法应用于多个数据集,建立了结构属性与系统内特定活动之间的联系。最后,我们展示了如何将所提出的方法用作合成时空超图的模型验证工具,将不同模型生成的高阶结构和动态与经验模型区分开来,从而确定重现经验超图结构和演化的基本模型机制。我们的工作开辟了多个研究方向,从理解时间高阶网络的动态过程到设计时变超图的新模型。
{"title":"The structural evolution of temporal hypergraphs through the lens of hyper-cores","authors":"Marco Mancastroppa, Iacopo Iacopini, Giovanni Petri, Alain Barrat","doi":"10.1140/epjds/s13688-024-00490-1","DOIUrl":"https://doi.org/10.1140/epjds/s13688-024-00490-1","url":null,"abstract":"<p>The richness of many complex systems stems from the interactions among their components. The higher-order nature of these interactions, involving many units at once, and their temporal dynamics constitute crucial properties that shape the behaviour of the system itself. An adequate description of these systems is offered by temporal hypergraphs, that integrate these features within the same framework. However, tools for their temporal and topological characterization are still scarce. Here we develop a series of methods specifically designed to analyse the structural properties of temporal hypergraphs at multiple scales. Leveraging the hyper-core decomposition of hypergraphs, we follow the evolution of the hyper-cores through time, characterizing the hypergraph structure and its temporal dynamics at different topological scales, and quantifying the multi-scale structural stability of the system. We also define two static hypercoreness centrality measures that provide an overall description of the nodes aggregated structural behaviour. We apply the characterization methods to several data sets, establishing connections between structural properties and specific activities within the systems. Finally, we show how the proposed method can be used as a model-validation tool for synthetic temporal hypergraphs, distinguishing the higher-order structures and dynamics generated by different models from the empirical ones, and thus identifying the essential model mechanisms to reproduce the empirical hypergraph structure and evolution. Our work opens several research directions, from the understanding of dynamic processes on temporal higher-order networks to the design of new models of time-varying hypergraphs.</p>","PeriodicalId":11887,"journal":{"name":"EPJ Data Science","volume":"16 1","pages":""},"PeriodicalIF":3.6,"publicationDate":"2024-07-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141772499","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The role of transport systems in housing insecurity: a mobility-based analysis 交通系统在住房不安全中的作用:基于流动性的分析
IF 3.6 2区 计算机科学 Q1 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2024-07-17 DOI: 10.1140/epjds/s13688-024-00489-8
Nandini Iyer, Ronaldo Menezes, Hugo Barbosa

With trends of urbanisation on the rise, providing adequate housing to individuals remains a complex issue to be addressed. Often, the slow output of relevant housing policies, coupled with quickly increasing housing costs, leaves individuals with the burden of finding housing that is affordable and in a safe location. In this paper, we unveil how transit service to employment hubs, not just housing policies, can prevent individuals from improving their housing conditions. We approach this question in three steps, applying the workflow to 20 cities in the United States of America. First, we propose a comprehensive framework to quantify housing insecurity and assign a housing demographic to each neighbourhood. Second, we use transit-pedestrian networks and public transit timetables (GTFS feeds) to estimate the time it takes to travel between two neighbourhoods using public transportation. Third, we apply geospatial autocorrelation to identify employment hotspots for each housing demographic. Finally, we use stochastic modelling to highlight how commuting to areas associated with better housing conditions results in transit commute times of over an hour in 15 cities. Ultimately, we consider the compounded burdens that come with housing insecurity, by having poor transit access to employment areas. In doing so, we highlight the importance of understanding how negative outcomes of housing insecurity coincide with various urban mechanisms, particularly emphasising the role that public transportation plays in locking vulnerable demographics into a cycle of poverty.

随着城市化趋势的加剧,为个人提供适当的住房仍然是一个需要解决的复杂问题。通常情况下,由于相关住房政策出台缓慢,加上住房成本快速增长,个人不得不承担寻找负担得起且位置安全的住房的重担。在本文中,我们将揭示通往就业中心的交通服务,而不仅仅是住房政策,是如何阻碍个人改善住房条件的。我们分三步解决这一问题,并将工作流程应用于美国的 20 个城市。首先,我们提出了一个量化住房不安全的综合框架,并为每个街区分配了一个住房人口统计。其次,我们利用公交行人网络和公共交通时刻表(GTFS feeds)来估算使用公共交通往返于两个街区所需的时间。第三,我们利用地理空间自相关性来确定每个住房人口的就业热点。最后,我们利用随机建模来强调在 15 个城市中,通勤到与较好住房条件相关的地区如何导致公交通勤时间超过一小时。最后,我们考虑了住房不安全所带来的复合负担,即通往就业地区的交通不便。在此过程中,我们强调了理解住房无保障的负面结果如何与各种城市机制相吻合的重要性,特别强调了公共交通在将弱势人口锁定在贫困循环中的作用。
{"title":"The role of transport systems in housing insecurity: a mobility-based analysis","authors":"Nandini Iyer, Ronaldo Menezes, Hugo Barbosa","doi":"10.1140/epjds/s13688-024-00489-8","DOIUrl":"https://doi.org/10.1140/epjds/s13688-024-00489-8","url":null,"abstract":"<p>With trends of urbanisation on the rise, providing adequate housing to individuals remains a complex issue to be addressed. Often, the slow output of relevant housing policies, coupled with quickly increasing housing costs, leaves individuals with the burden of finding housing that is affordable and in a safe location. In this paper, we unveil how transit service to employment hubs, not just housing policies, can prevent individuals from improving their housing conditions. We approach this question in three steps, applying the workflow to 20 cities in the United States of America. First, we propose a comprehensive framework to quantify housing insecurity and assign a housing demographic to each neighbourhood. Second, we use transit-pedestrian networks and public transit timetables (GTFS feeds) to estimate the time it takes to travel between two neighbourhoods using public transportation. Third, we apply geospatial autocorrelation to identify employment hotspots for each housing demographic. Finally, we use stochastic modelling to highlight how commuting to areas associated with better housing conditions results in transit commute times of over an hour in 15 cities. Ultimately, we consider the compounded burdens that come with housing insecurity, by having poor transit access to employment areas. In doing so, we highlight the importance of understanding how negative outcomes of housing insecurity coincide with various urban mechanisms, particularly emphasising the role that public transportation plays in locking vulnerable demographics into a cycle of poverty.</p>","PeriodicalId":11887,"journal":{"name":"EPJ Data Science","volume":"26 1","pages":""},"PeriodicalIF":3.6,"publicationDate":"2024-07-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141744744","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Cycling into the workshop: e-bike and m-bike mobility patterns for predictive maintenance in Barcelona’s bike-sharing system 骑车进车间:巴塞罗那共享单车系统中用于预测性维护的电动自行车和移动自行车流动模式
IF 3.6 2区 计算机科学 Q1 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2024-07-11 DOI: 10.1140/epjds/s13688-024-00486-x
Jordi Grau-Escolano, Aleix Bassolas, Julian Vicens

Bike-sharing systems have emerged as a significant element of urban mobility, providing an environmentally friendly transportation alternative. With the increasing integration of electric bikes alongside mechanical bikes, it is crucial to illuminate distinct usage patterns and their impact on maintenance. Accordingly, this research aims to develop a comprehensive understanding of mobility dynamics, distinguishing between different mobility modes, and introducing a novel predictive maintenance system tailored for bikes. By utilising a combination of trip information and maintenance data from Barcelona’s bike-sharing system, Bicing, this study conducts an extensive analysis of mobility patterns and their relationship to failures of bike components. To accurately predict maintenance needs for essential bike parts, this research delves into various mobility metrics and applies statistical and machine learning survival models, including deep learning models. Due to their complexity, and with the objective of bolstering confidence in the system’s predictions, interpretability techniques explain the main predictors of maintenance needs. The analysis reveals marked differences in the usage patterns of mechanical bikes and electric bikes, with a growing user preference for the latter despite their extra costs. These differences in mobility were found to have a considerable impact on the maintenance needs within the bike-sharing system. Moreover, the predictive maintenance models proved effective in forecasting these maintenance needs, capable of operating across an entire bike fleet. Despite challenges such as approximated bike usage metrics and data imbalances, the study successfully showcases the feasibility of an accurate predictive maintenance system capable of improving operational costs, bike availability, and security.

共享单车系统已成为城市交通的重要组成部分,提供了一种环保的替代交通方式。随着电动自行车与机械自行车的日益融合,阐明不同的使用模式及其对维护的影响至关重要。因此,本研究旨在全面了解交通动态,区分不同的交通模式,并引入一种专为自行车量身定制的新型预测性维护系统。通过综合利用巴塞罗那共享单车系统 Bicing 的出行信息和维护数据,本研究对流动模式及其与自行车部件故障的关系进行了广泛分析。为了准确预测自行车重要部件的维护需求,本研究深入研究了各种流动性指标,并应用了统计和机器学习生存模型,包括深度学习模型。由于其复杂性,并为了增强对系统预测的信心,可解释性技术解释了维护需求的主要预测因素。分析揭示了机械自行车和电动自行车在使用模式上的明显差异,用户越来越倾向于使用电动自行车,尽管其成本更高。这些流动性上的差异对共享单车系统内的维护需求产生了相当大的影响。此外,预测性维护模型在预测这些维护需求方面被证明是有效的,能够在整个自行车车队中运行。尽管存在近似自行车使用指标和数据不平衡等挑战,这项研究还是成功展示了精确预测性维护系统的可行性,该系统能够改善运营成本、自行车可用性和安全性。
{"title":"Cycling into the workshop: e-bike and m-bike mobility patterns for predictive maintenance in Barcelona’s bike-sharing system","authors":"Jordi Grau-Escolano, Aleix Bassolas, Julian Vicens","doi":"10.1140/epjds/s13688-024-00486-x","DOIUrl":"https://doi.org/10.1140/epjds/s13688-024-00486-x","url":null,"abstract":"<p>Bike-sharing systems have emerged as a significant element of urban mobility, providing an environmentally friendly transportation alternative. With the increasing integration of electric bikes alongside mechanical bikes, it is crucial to illuminate distinct usage patterns and their impact on maintenance. Accordingly, this research aims to develop a comprehensive understanding of mobility dynamics, distinguishing between different mobility modes, and introducing a novel predictive maintenance system tailored for bikes. By utilising a combination of trip information and maintenance data from Barcelona’s bike-sharing system, Bicing, this study conducts an extensive analysis of mobility patterns and their relationship to failures of bike components. To accurately predict maintenance needs for essential bike parts, this research delves into various mobility metrics and applies statistical and machine learning survival models, including deep learning models. Due to their complexity, and with the objective of bolstering confidence in the system’s predictions, interpretability techniques explain the main predictors of maintenance needs. The analysis reveals marked differences in the usage patterns of mechanical bikes and electric bikes, with a growing user preference for the latter despite their extra costs. These differences in mobility were found to have a considerable impact on the maintenance needs within the bike-sharing system. Moreover, the predictive maintenance models proved effective in forecasting these maintenance needs, capable of operating across an entire bike fleet. Despite challenges such as approximated bike usage metrics and data imbalances, the study successfully showcases the feasibility of an accurate predictive maintenance system capable of improving operational costs, bike availability, and security.</p>","PeriodicalId":11887,"journal":{"name":"EPJ Data Science","volume":"3 1","pages":""},"PeriodicalIF":3.6,"publicationDate":"2024-07-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141610702","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
EPJ Data Science
全部 Geobiology Appl. Clay Sci. Geochim. Cosmochim. Acta J. Hydrol. Org. Geochem. Carbon Balance Manage. Contrib. Mineral. Petrol. Int. J. Biometeorol. IZV-PHYS SOLID EART+ J. Atmos. Chem. Acta Oceanolog. Sin. Acta Geophys. ACTA GEOL POL ACTA PETROL SIN ACTA GEOL SIN-ENGL AAPG Bull. Acta Geochimica Adv. Atmos. Sci. Adv. Meteorol. Am. J. Phys. Anthropol. Am. J. Sci. Am. Mineral. Annu. Rev. Earth Planet. Sci. Appl. Geochem. Aquat. Geochem. Ann. Glaciol. Archaeol. Anthropol. Sci. ARCHAEOMETRY ARCT ANTARCT ALP RES Asia-Pac. J. Atmos. Sci. ATMOSPHERE-BASEL Atmos. Res. Aust. J. Earth Sci. Atmos. Chem. Phys. Atmos. Meas. Tech. Basin Res. Big Earth Data BIOGEOSCIENCES Geostand. Geoanal. Res. GEOLOGY Geosci. J. Geochem. J. Geochem. Trans. Geosci. Front. Geol. Ore Deposits Global Biogeochem. Cycles Gondwana Res. Geochem. Int. Geol. J. Geophys. Prospect. Geosci. Model Dev. GEOL BELG GROUNDWATER Hydrogeol. J. Hydrol. Earth Syst. Sci. Hydrol. Processes Int. J. Climatol. Int. J. Earth Sci. Int. Geol. Rev. Int. J. Disaster Risk Reduct. Int. J. Geomech. Int. J. Geog. Inf. Sci. Isl. Arc J. Afr. Earth. Sci. J. Adv. Model. Earth Syst. J APPL METEOROL CLIM J. Atmos. Oceanic Technol. J. Atmos. Sol. Terr. Phys. J. Clim. J. Earth Sci. J. Earth Syst. Sci. J. Environ. Eng. Geophys. J. Geog. Sci. Mineral. Mag. Miner. Deposita Mon. Weather Rev. Nat. Hazards Earth Syst. Sci. Nat. Clim. Change Nat. Geosci. Ocean Dyn. Ocean and Coastal Research npj Clim. Atmos. Sci. Ocean Modell. Ocean Sci. Ore Geol. Rev. OCEAN SCI J Paleontol. J. PALAEOGEOGR PALAEOCL PERIOD MINERAL PETROLOGY+ Phys. Chem. Miner. Polar Sci. Prog. Oceanogr. Quat. Sci. Rev. Q. J. Eng. Geol. Hydrogeol. RADIOCARBON Pure Appl. Geophys. Resour. Geol. Rev. Geophys. Sediment. Geol.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1