首页 > 最新文献

Big Data Research最新文献

英文 中文
Unmasking hate in the pandemic: A cross-platform study of the COVID-19 infodemic 在大流行中揭开仇恨的面纱:COVID-19 信息大流行的跨平台研究
IF 3.5 3区 计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-08-28 Epub Date: 2024-06-25 DOI: 10.1016/j.bdr.2024.100481
Fatima Zahrah , Jason R.C. Nurse , Michael Goldsmith

The past few decades have established how digital technologies and platforms have provided an effective medium for spreading hateful content, which has been linked to several catastrophic consequences. Recent academic studies have also highlighted how online hate is a phenomenon that strategically makes use of multiple online platforms. In this article, we seek to advance the current research landscape by harnessing a cross-platform approach to computationally analyse content relating to the 2020 COVID-19 pandemic. More specifically, we analyse content on hate-specific environments from Twitter, Reddit, 4chan and Stormfront. Our findings show how content and posting activity can change across platforms, and how the psychological components of online content can differ depending on the platform being used. Through this, we provide unique insight into the cross-platform behaviours of online hate. We further define several avenues for future research within this field so as to gain a more comprehensive understanding of the global hate ecosystem.

过去几十年来,数字技术和平台为仇恨内容的传播提供了有效的媒介,而仇恨内容的传播与一些灾难性后果有关。最近的学术研究也强调了网络仇恨是如何战略性地利用多种网络平台的现象。在本文中,我们试图利用跨平台方法对与 2020 年 COVID-19 大流行相关的内容进行计算分析,从而推进当前的研究工作。更具体地说,我们分析了 Twitter、Reddit、4chan 和 Stormfront 上特定仇恨环境的内容。我们的研究结果表明了内容和发帖活动在不同平台上的变化,以及网络内容的心理成分如何因所使用的平台而不同。通过这些研究,我们对网络仇恨的跨平台行为有了独特的见解。我们进一步确定了该领域未来研究的几个方向,以便更全面地了解全球仇恨生态系统。
{"title":"Unmasking hate in the pandemic: A cross-platform study of the COVID-19 infodemic","authors":"Fatima Zahrah ,&nbsp;Jason R.C. Nurse ,&nbsp;Michael Goldsmith","doi":"10.1016/j.bdr.2024.100481","DOIUrl":"https://doi.org/10.1016/j.bdr.2024.100481","url":null,"abstract":"<div><p>The past few decades have established how digital technologies and platforms have provided an effective medium for spreading hateful content, which has been linked to several catastrophic consequences. Recent academic studies have also highlighted how online hate is a phenomenon that strategically makes use of multiple online platforms. In this article, we seek to advance the current research landscape by harnessing a cross-platform approach to computationally analyse content relating to the 2020 COVID-19 pandemic. More specifically, we analyse content on hate-specific environments from Twitter, Reddit, 4chan and Stormfront. Our findings show how content and posting activity can change across platforms, and how the psychological components of online content can differ depending on the platform being used. Through this, we provide unique insight into the cross-platform behaviours of online hate. We further define several avenues for future research within this field so as to gain a more comprehensive understanding of the global hate ecosystem.</p></div>","PeriodicalId":56017,"journal":{"name":"Big Data Research","volume":"37 ","pages":"Article 100481"},"PeriodicalIF":3.5,"publicationDate":"2024-08-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2214579624000558/pdfft?md5=a8e2330701051448866927c6cb877d10&pid=1-s2.0-S2214579624000558-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141480176","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Intelligent geological interpretation of AMT data based on machine learning 基于机器学习的 AMT 数据智能地质解释
IF 3.5 3区 计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-08-28 Epub Date: 2024-06-13 DOI: 10.1016/j.bdr.2024.100475
Shuo Wang , Xiang Yu , Dan Zhao , Guocai Ma , Wei Ren , Shuxin Duan

AMT (Audio Magnetotelluric) is widely used for obtaining geological settings related to sandstone-type Uranium deposits, such as the range of buried sand body and the top boundary of baserock. However, these geological settings are hard to interpret via survey sections without conducting geological interpretation, which highly relies on experience and cognition. On the other hand, with the development of 3D technology, artificial geological interpretation shows low efficiency and reliability. In this paper, a machine learning model constructed using U-net was used for the geological interpretation of AMT data in the Naren-Yihegaole area. To train the model, a training dataset was built based on simulated data from random models. The issue of insufficient data samples has been addressed. In the prediction stage, sand bodies and baserock were delineated from the inversion resistivity images. The comparison between two interpretations, one by machine learning method, showed high consistency with the artificial one, but with better time-saving. It indicates that this technology is more individualized and effective than the traditional way.

AMT(音频磁法)被广泛用于获取与砂岩型铀矿床相关的地质环境,如砂体埋藏范围和基岩顶界。然而,如果不进行地质解释,就很难通过勘测断面解释这些地质环境,而地质解释在很大程度上依赖于经验和认知。另一方面,随着三维技术的发展,人工地质解释的效率和可靠性都很低。本文利用 U-net 构建了一个机器学习模型,用于那仁-义合高勒地区 AMT 数据的地质解释。为了训练该模型,根据随机模型的模拟数据建立了一个训练数据集。数据样本不足的问题已得到解决。在预测阶段,根据反演电阻率图像划分了砂体和基岩。对两种解释进行了比较,其中一种解释采用了机器学习方法,结果显示与人工解释高度一致,但更节省时间。这表明该技术比传统方法更加个性化和有效。
{"title":"Intelligent geological interpretation of AMT data based on machine learning","authors":"Shuo Wang ,&nbsp;Xiang Yu ,&nbsp;Dan Zhao ,&nbsp;Guocai Ma ,&nbsp;Wei Ren ,&nbsp;Shuxin Duan","doi":"10.1016/j.bdr.2024.100475","DOIUrl":"10.1016/j.bdr.2024.100475","url":null,"abstract":"<div><p>AMT (Audio Magnetotelluric) is widely used for obtaining geological settings related to sandstone-type Uranium deposits, such as the range of buried sand body and the top boundary of baserock. However, these geological settings are hard to interpret via survey sections without conducting geological interpretation, which highly relies on experience and cognition. On the other hand, with the development of 3D technology, artificial geological interpretation shows low efficiency and reliability. In this paper, a machine learning model constructed using U-net was used for the geological interpretation of AMT data in the Naren-Yihegaole area. To train the model, a training dataset was built based on simulated data from random models. The issue of insufficient data samples has been addressed. In the prediction stage, sand bodies and baserock were delineated from the inversion resistivity images. The comparison between two interpretations, one by machine learning method, showed high consistency with the artificial one, but with better time-saving. It indicates that this technology is more individualized and effective than the traditional way.</p></div>","PeriodicalId":56017,"journal":{"name":"Big Data Research","volume":"37 ","pages":"Article 100475"},"PeriodicalIF":3.5,"publicationDate":"2024-08-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141408443","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Two-dimensional data partitioning for non-negative matrix tri-factorization 非负矩阵三因子化的二维数据分区
IF 3.5 3区 计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-08-28 Epub Date: 2024-06-19 DOI: 10.1016/j.bdr.2024.100473
Jiaxing Yan , Hai Liu , Zhiqi Lei , Yanghui Rao , Guan Liu , Haoran Xie , Xiaohui Tao , Fu Lee Wang

As a two-sided clustering and dimensionality reduction paradigm, Non-negative Matrix Tri-Factorization (NMTF) has attracted much attention in machine learning and data mining researchers due to its excellent performance and reliable theoretical support. Unlike Non-negative Matrix Factorization (NMF) methods applicable to one-sided clustering only, NMTF introduces an additional factor matrix and uses the inherent duality of data to realize the mutual promotion of sample clustering and feature clustering, thus showing great advantages in many scenarios (e.g., text co-clustering). However, the existing methods for solving NMTF usually involve intensive matrix multiplication, which is characterized by high time and space complexities, that is, there are limitations of slow convergence of the multiplicative update rules and high memory overhead. In order to solve the above problems, this paper develops a distributed parallel algorithm with a 2-dimensional data partition scheme for NMTF (i.e., PNMTF-2D). Experiments on multiple text datasets show that the proposed PNMTF-2D can substantially improve the computational efficiency of NMTF (e.g., the average iteration time is reduced by up to 99.7% on Amazon) while ensuring the effectiveness of convergence and co-clustering.

作为一种双面聚类和降维范式,非负矩阵三因式分解(NMTF)以其优异的性能和可靠的理论支持吸引了机器学习和数据挖掘研究人员的广泛关注。与只适用于单边聚类的非负矩阵因式分解(NMF)方法不同,NMTF 引入了额外的因式矩阵,利用数据固有的二元性实现了样本聚类和特征聚类的相互促进,因此在很多场景(如文本共聚类)中都显示出巨大的优势。然而,现有的 NMTF 求解方法通常涉及密集的矩阵乘法,具有时间和空间复杂度高的特点,即存在乘法更新规则收敛慢和内存开销大的局限性。为了解决上述问题,本文针对 NMTF 开发了一种具有二维数据分区方案的分布式并行算法(即 PNMTF-2D)。在多个文本数据集上的实验表明,所提出的 PNMTF-2D 可以大幅提高 NMTF 的计算效率(例如,在亚马逊上平均迭代时间最多可缩短 99.7%),同时确保收敛和共聚类的有效性。
{"title":"Two-dimensional data partitioning for non-negative matrix tri-factorization","authors":"Jiaxing Yan ,&nbsp;Hai Liu ,&nbsp;Zhiqi Lei ,&nbsp;Yanghui Rao ,&nbsp;Guan Liu ,&nbsp;Haoran Xie ,&nbsp;Xiaohui Tao ,&nbsp;Fu Lee Wang","doi":"10.1016/j.bdr.2024.100473","DOIUrl":"https://doi.org/10.1016/j.bdr.2024.100473","url":null,"abstract":"<div><p>As a two-sided clustering and dimensionality reduction paradigm, Non-negative Matrix Tri-Factorization (NMTF) has attracted much attention in machine learning and data mining researchers due to its excellent performance and reliable theoretical support. Unlike Non-negative Matrix Factorization (NMF) methods applicable to one-sided clustering only, NMTF introduces an additional factor matrix and uses the inherent duality of data to realize the mutual promotion of sample clustering and feature clustering, thus showing great advantages in many scenarios (e.g., text co-clustering). However, the existing methods for solving NMTF usually involve intensive matrix multiplication, which is characterized by high time and space complexities, that is, there are limitations of slow convergence of the multiplicative update rules and high memory overhead. In order to solve the above problems, this paper develops a distributed parallel algorithm with a 2-dimensional data partition scheme for NMTF (i.e., PNMTF-2D). Experiments on multiple text datasets show that the proposed PNMTF-2D can substantially improve the computational efficiency of NMTF (e.g., the average iteration time is reduced by up to 99.7% on Amazon) while ensuring the effectiveness of convergence and co-clustering.</p></div>","PeriodicalId":56017,"journal":{"name":"Big Data Research","volume":"37 ","pages":"Article 100473"},"PeriodicalIF":3.5,"publicationDate":"2024-08-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141480175","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Semi-supervised topic representation through sentiment analysis and semantic networks 通过情感分析和语义网络进行半监督式主题表示
IF 3.5 3区 计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-08-28 Epub Date: 2024-06-13 DOI: 10.1016/j.bdr.2024.100474
Marco Ortu, Maurizio Romano, Andrea Carta

This paper proposes a novel approach to topic detection aimed at improving the semi-supervised clustering of customer reviews in the context of customers' services. The proposed methodology, named SeMi-supervised clustering for Assessment of Reviews using Topic and Sentiment (SMARTS) for Topic-Community Representation with Semantic Networks, combines semantic and sentiment analysis of words to derive topics related to positive and negative reviews of specific services. To achieve this, a semantic network of words is constructed based on word embedding semantic similarity to identify relationships between words used in the reviews. The resulting network is then used to derive the topics present in users' reviews, which are grouped by positive and negative sentiment based on words related to specific services. Clusters of words, obtained from the network's communities, are used to extract topics related to particular services and to improve the interpretation of users' assessments of those services. The proposed methodology is applied to tourism review data from Booking.com, and the results demonstrate the efficacy of the approach in enhancing the interpretability of the topics obtained by semi-supervised clustering. The methodology has the potential to provide valuable insights into the sentiment of customers toward tourism services, which could be utilized by service providers and decision-makers to enhance the quality of their services.

本文提出了一种新颖的主题检测方法,旨在改进客户服务背景下的客户评论半监督聚类。所提出的方法名为 "利用主题和情感评估评论的 SeMi-supervised clustering(SMARTS)",即利用语义网络进行主题-社群表示,该方法结合了对词语的语义分析和情感分析,以得出与特定服务的正面和负面评论相关的主题。为了实现这一目标,我们根据词语嵌入语义相似性构建了词语语义网络,以识别评论中使用的词语之间的关系。然后,利用生成的网络推导出用户评论中的主题,并根据与特定服务相关的词语按正面和负面情绪进行分组。从网络社区中获得的词群用于提取与特定服务相关的主题,并改进对用户对这些服务评价的解释。我们将所提出的方法应用于 Booking.com 的旅游评论数据,结果表明该方法在提高通过半监督聚类获得的主题的可解释性方面非常有效。该方法有可能为客户对旅游服务的情感提供有价值的见解,服务提供商和决策者可以利用这些见解来提高服务质量。
{"title":"Semi-supervised topic representation through sentiment analysis and semantic networks","authors":"Marco Ortu,&nbsp;Maurizio Romano,&nbsp;Andrea Carta","doi":"10.1016/j.bdr.2024.100474","DOIUrl":"10.1016/j.bdr.2024.100474","url":null,"abstract":"<div><p>This paper proposes a novel approach to topic detection aimed at improving the semi-supervised clustering of customer reviews in the context of customers' services. The proposed methodology, named SeMi-supervised clustering for Assessment of Reviews using Topic and Sentiment (SMARTS) for Topic-Community Representation with Semantic Networks, combines semantic and sentiment analysis of words to derive topics related to positive and negative reviews of specific services. To achieve this, a semantic network of words is constructed based on word embedding semantic similarity to identify relationships between words used in the reviews. The resulting network is then used to derive the topics present in users' reviews, which are grouped by positive and negative sentiment based on words related to specific services. Clusters of words, obtained from the network's communities, are used to extract topics related to particular services and to improve the interpretation of users' assessments of those services. The proposed methodology is applied to tourism review data from Booking.com, and the results demonstrate the efficacy of the approach in enhancing the interpretability of the topics obtained by semi-supervised clustering. The methodology has the potential to provide valuable insights into the sentiment of customers toward tourism services, which could be utilized by service providers and decision-makers to enhance the quality of their services.</p></div>","PeriodicalId":56017,"journal":{"name":"Big Data Research","volume":"37 ","pages":"Article 100474"},"PeriodicalIF":3.5,"publicationDate":"2024-08-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2214579624000509/pdfft?md5=46a689f4478007ad8db7233af95c8c2e&pid=1-s2.0-S2214579624000509-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141401445","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Distributed Heterogeneous Transfer Learning 分布式异构迁移学习
IF 3.3 3区 计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-08-28 Epub Date: 2024-05-14 DOI: 10.1016/j.bdr.2024.100456
Paolo Mignone , Gianvito Pio , Michelangelo Ceci

Transfer learning has proved to be effective for building predictive models even in complex conditions with a low amount of available labeled data, by constructing a predictive model for a target domain also using the knowledge coming from a separate domain, called source domain. However, several existing transfer learning methods assume identical feature spaces between the source and the target domains. This assumption limits the possible real-world applications of such methods, since two separate, although related, domains could be described by totally different feature spaces. Heterogeneous transfer learning methods aim to overcome this limitation, but they usually i) make other assumptions on the features, such as requiring the same number of features, ii) are not generally able to distribute the workload over multiple computational nodes, iii) cannot work in the Positive-Unlabeled (PU) learning setting, which we also considered in this study, or iv) their applicability is limited to specific application domains, i.e., they are not general-purpose methods.

In this manuscript, we present a novel distributed heterogeneous transfer learning method, implemented in Apache Spark, that overcomes all the above-mentioned limitations. Specifically, it is able to work also in the PU learning setting by resorting to a clustering-based approach, and can align totally heterogeneous feature spaces, without exploiting peculiarities of specific application domains. Moreover, our distributed approach allows us to process large source and target datasets.

Our experimental evaluation was performed in three different application domains that can benefit from transfer learning approaches, namely the reconstruction of the human gene regulatory network, the prediction of cerebral stroke in hospital patients, and the prediction of customer energy consumption in power grids. The results show that the proposed approach is able to outperform 4 state-of-the-art heterogeneous transfer learning approaches and 3 baselines, and exhibits ideal performances in terms of scalability.

事实证明,迁移学习可以有效地构建预测模型,即使在可用标注数据较少的复杂条件下,也能利用来自另一个领域(称为源领域)的知识构建目标领域的预测模型。然而,现有的几种迁移学习方法都假设源域和目标域的特征空间完全相同。这一假设限制了此类方法在现实世界中的应用,因为两个独立的领域虽然相关,但可能由完全不同的特征空间来描述。异构迁移学习方法旨在克服这一限制,但它们通常 i) 对特征做出其他假设,如要求特征数量相同;ii) 通常无法在多个计算节点上分配工作量;iii) 无法在正向无标记(PU)学习环境中工作,我们在本研究中也考虑了这一点;或者 iv) 它们的适用性仅限于特定的应用领域,也就是说,它们不是通用方法、在本手稿中,我们介绍了一种在 Apache Spark 中实现的新型分布式异构迁移学习方法,它克服了上述所有局限。具体来说,它通过采用基于聚类的方法,也能在 PU 学习环境中工作,并能对齐完全异构的特征空间,而无需利用特定应用领域的特殊性。此外,我们的分布式方法允许我们处理大型源数据集和目标数据集。我们在三个不同的应用领域进行了实验评估,这些应用领域可以从迁移学习方法中获益,即人类基因调控网络的重建、医院病人脑中风的预测以及电网客户能源消耗的预测。结果表明,所提出的方法能够超越 4 种最先进的异构迁移学习方法和 3 种基线方法,并且在可扩展性方面表现理想。
{"title":"Distributed Heterogeneous Transfer Learning","authors":"Paolo Mignone ,&nbsp;Gianvito Pio ,&nbsp;Michelangelo Ceci","doi":"10.1016/j.bdr.2024.100456","DOIUrl":"10.1016/j.bdr.2024.100456","url":null,"abstract":"<div><p>Transfer learning has proved to be effective for building predictive models even in complex conditions with a low amount of available labeled data, by constructing a predictive model for a target domain also using the knowledge coming from a separate domain, called source domain. However, several existing transfer learning methods assume identical feature spaces between the source and the target domains. This assumption limits the possible real-world applications of such methods, since two separate, although related, domains could be described by totally different feature spaces. Heterogeneous transfer learning methods aim to overcome this limitation, but they usually <em>i)</em> make other assumptions on the features, such as requiring the same number of features, <em>ii)</em> are not generally able to distribute the workload over multiple computational nodes, <em>iii)</em> cannot work in the Positive-Unlabeled (PU) learning setting, which we also considered in this study, or <em>iv)</em> their applicability is limited to specific application domains, i.e., they are not general-purpose methods.</p><p>In this manuscript, we present a novel distributed heterogeneous transfer learning method, implemented in Apache Spark, that overcomes all the above-mentioned limitations. Specifically, it is able to work also in the PU learning setting by resorting to a clustering-based approach, and can align totally heterogeneous feature spaces, without exploiting peculiarities of specific application domains. Moreover, our distributed approach allows us to process large source and target datasets.</p><p>Our experimental evaluation was performed in three different application domains that can benefit from transfer learning approaches, namely the reconstruction of the human gene regulatory network, the prediction of cerebral stroke in hospital patients, and the prediction of customer energy consumption in power grids. The results show that the proposed approach is able to outperform 4 state-of-the-art heterogeneous transfer learning approaches and 3 baselines, and exhibits ideal performances in terms of scalability.</p></div>","PeriodicalId":56017,"journal":{"name":"Big Data Research","volume":"37 ","pages":"Article 100456"},"PeriodicalIF":3.3,"publicationDate":"2024-08-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2214579624000327/pdfft?md5=33cf99e10874514291bfc635b26d260f&pid=1-s2.0-S2214579624000327-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141025163","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Optimizing image captioning: The effectiveness of vision transformers and VGG networks for remote sensing 优化图像字幕:遥感视觉转换器和 VGG 网络的有效性
IF 3.5 3区 计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-08-28 Epub Date: 2024-06-13 DOI: 10.1016/j.bdr.2024.100477
Huimin Han , Bouba oumarou Aboubakar , Mughair Bhatti , Bandeh Ali Talpur , Yasser A. Ali , Muna Al-Razgan , Yazeed Yasid Ghadi

This study presents a comprehensive evaluation of two prominent deep learning models, Vision Transformer (ViT) and VGG16, within the domain of image captioning for remote sensing data. By leveraging the BLEU score, a widely accepted metric for assessing the quality of text generated by machine learning models against a set of reference captions, this research aims to dissect and understand the capabilities and performance nuances of these models across various sample sizes: 25, 50, 75, and 100 samples. Our findings reveal that the Vision Transformer model generally outperforms the VGG16 model across all evaluated sample sizes, achieving its peak performance at 50 samples with a BLEU score of 0.5507. This performance shows that ViT benefits from its ability to capture global dependencies within the data, providing a more nuanced understanding of the images. However, the performance slightly decreases as the sample size increases beyond 50, indicating potential challenges in scalability or overfitting to the training data. Conversely, the VGG16 model shows a different performance trajectory, starting with a lower BLEU score for smaller sample sizes but demonstrating a consistent improvement as the sample size increases, culminating in its highest BLEU score of 0.4783 for 100 samples. This pattern suggests that VGG16 may require a larger dataset to adequately learn and generalize from the data, although it achieves a more modest performance ceiling compared to ViT. Through a detailed analysis of these findings, the study underscores the strengths and limitations of each model in the context of image captioning. The Vision Transformer's superior performance highlights its potential for applications requiring high accuracy in text generation from images. In contrast, the gradual improvement exhibited by VGG16 suggests its utility in scenarios where large datasets are available, and scalability is a priority. This study contributes to the ongoing discourse in the AI community regarding the selection and optimization of deep learning models for complex tasks such as image captioning, offering insights that could guide future research and application development in this field.

本研究对遥感数据图像标题领域的两个著名深度学习模型--Vision Transformer(ViT)和 VGG16 进行了全面评估。BLEU 分数是评估机器学习模型根据一组参考标题生成的文本质量的一个广为接受的指标,本研究利用这一指标,旨在剖析和了解这些模型在不同样本量(25、50、75 和 100 个样本)下的能力和性能细微差别。我们的研究结果表明,在所有评估的样本量中,Vision Transformer 模型的性能普遍优于 VGG16 模型,在 50 个样本时达到最高性能,BLEU 得分为 0.5507。这一性能表明,ViT 能够捕捉数据中的全局依赖关系,从而提供对图像更细致入微的理解。不过,随着样本量增加到 50 个以上,性能略有下降,这表明在可扩展性或过度拟合训练数据方面存在潜在挑战。相反,VGG16 模型则显示出不同的性能轨迹,开始时样本量较小,BLEU 分数较低,但随着样本量的增加,BLEU 分数不断提高,最终在 100 个样本时达到最高的 0.4783。这种模式表明,VGG16 可能需要更大的数据集才能从数据中充分学习和泛化,尽管与 ViT 相比,它的性能上限更低。通过对这些发现的详细分析,本研究强调了每个模型在图像字幕方面的优势和局限性。Vision Transformer 的卓越性能凸显了它在要求高精度图像文本生成的应用中的潜力。相比之下,VGG16 所表现出的渐进式改进表明,它适用于有大型数据集的场景,而且可扩展性是优先考虑的问题。这项研究为人工智能界正在进行的有关为图像字幕等复杂任务选择和优化深度学习模型的讨论做出了贡献,并提供了可指导该领域未来研究和应用开发的见解。
{"title":"Optimizing image captioning: The effectiveness of vision transformers and VGG networks for remote sensing","authors":"Huimin Han ,&nbsp;Bouba oumarou Aboubakar ,&nbsp;Mughair Bhatti ,&nbsp;Bandeh Ali Talpur ,&nbsp;Yasser A. Ali ,&nbsp;Muna Al-Razgan ,&nbsp;Yazeed Yasid Ghadi","doi":"10.1016/j.bdr.2024.100477","DOIUrl":"10.1016/j.bdr.2024.100477","url":null,"abstract":"<div><p>This study presents a comprehensive evaluation of two prominent deep learning models, Vision Transformer (ViT) and VGG16, within the domain of image captioning for remote sensing data. By leveraging the BLEU score, a widely accepted metric for assessing the quality of text generated by machine learning models against a set of reference captions, this research aims to dissect and understand the capabilities and performance nuances of these models across various sample sizes: 25, 50, 75, and 100 samples. Our findings reveal that the Vision Transformer model generally outperforms the VGG16 model across all evaluated sample sizes, achieving its peak performance at 50 samples with a BLEU score of 0.5507. This performance shows that ViT benefits from its ability to capture global dependencies within the data, providing a more nuanced understanding of the images. However, the performance slightly decreases as the sample size increases beyond 50, indicating potential challenges in scalability or overfitting to the training data. Conversely, the VGG16 model shows a different performance trajectory, starting with a lower BLEU score for smaller sample sizes but demonstrating a consistent improvement as the sample size increases, culminating in its highest BLEU score of 0.4783 for 100 samples. This pattern suggests that VGG16 may require a larger dataset to adequately learn and generalize from the data, although it achieves a more modest performance ceiling compared to ViT. Through a detailed analysis of these findings, the study underscores the strengths and limitations of each model in the context of image captioning. The Vision Transformer's superior performance highlights its potential for applications requiring high accuracy in text generation from images. In contrast, the gradual improvement exhibited by VGG16 suggests its utility in scenarios where large datasets are available, and scalability is a priority. This study contributes to the ongoing discourse in the AI community regarding the selection and optimization of deep learning models for complex tasks such as image captioning, offering insights that could guide future research and application development in this field.</p></div>","PeriodicalId":56017,"journal":{"name":"Big Data Research","volume":"37 ","pages":"Article 100477"},"PeriodicalIF":3.5,"publicationDate":"2024-08-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141415449","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Research on the legal system of economic-ecological synergistic compensation in carbon neutral marine cities with a background in big data 以大数据为背景的碳中和海洋城市经济生态协同补偿法律制度研究
IF 3.5 3区 计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-08-28 Epub Date: 2024-06-13 DOI: 10.1016/j.bdr.2024.100476

With the increasingly severe global carbon emissions problem and the serious threat ecosystems face, carbon neutrality has gradually attracted widespread attention. This study provides an in-depth analysis of practical cases of international carbon neutrality initiatives and relevant experiences of marine cities, focusing on the construction and implementation of a legal system for economic, ecologically coordinated compensation. To evaluate the actual effectiveness of the legal system in marine cities, this study used a multiple linear regression model, considering factors such as the strictness of the legal system, enforcement efforts, and the level of participation of local enterprises and residents. The research results indicate that carbon emissions have significantly decreased in cities where legal systems are effectively enforced, from an average of 1.5 million tons per year to 1 million tons. At the same time, the economic growth rate of these cities has also significantly improved, increasing by about 2.5 percentage points from the original annual average of 4 % to 6.5 %. The study also found that the biodiversity index of these cities increased by 15 %, far higher than the average increase of 5 % in other cities, indicating the positive role of legal systems in protecting biodiversity. The public's participation rate in environmental protection activities has also increased from 25 % to 45 %, and the growth rate of green investment has reached an average of 8 % per year, far exceeding the 3 % growth rate of other cities. In terms of the ecosystem, data shows that the distribution of the ecosystem is stable, with an average ecological index of 508, which is in a relatively ideal state. The annual average growth rate of ecosystem restoration is about 3.5 %, further proving the effectiveness of ecological protection measures. Comprehensive empirical analysis shows that implementing the new legal system effectively reduces carbon emissions, enhances biodiversity, and promotes sustainable economic development. The economic growth rate increased from an average of 4.2 % to 5.1 % per year after implementing the new legal system, fully demonstrating the important role of the economic, ecologically coordinated compensation legal system in promoting carbon neutrality goals in marine cities.

随着全球碳排放问题日益严峻,生态系统面临严重威胁,碳中和逐渐受到广泛关注。本研究深入分析了国际碳中和倡议的实践案例和海洋城市的相关经验,重点探讨了经济、生态协调补偿法律制度的构建与实施。为评价海洋城市法律制度的实际效果,本研究采用多元线性回归模型,综合考虑了法律制度的严格程度、执行力度、当地企业和居民的参与程度等因素。研究结果表明,在法律制度得到有效执行的城市,碳排放量明显下降,从平均每年 1.同时,这些城市的经济增长率也显著提高,从原来的年均 4% 提高到 6.5%,提高了约 2.5 个百分点。研究还发现,这些城市的生物多样性指数提高了 15%,远高于其他城市 5%的平均增幅,说明法律制度在保护生物多样性方面发挥了积极作用。公众参与环保活动的比例也从 25% 提高到 45%,绿色投资的年均增长率达到 8%,远远超过其他城市 3% 的增长率。在生态系统方面,数据显示生态系统分布稳定,平均生态指数为 508,处于较为理想的状态。生态系统恢复的年均增长率约为 3.5%,进一步证明了生态保护措施的有效性。综合实证分析表明,新法律体系的实施有效减少了碳排放,提高了生物多样性,促进了经济的可持续发展。实施新法律体系后,经济增长率从平均每年 4.2% 提高到 5.1%,充分证明了经济、生态协调补偿法律体系在促进海洋城市碳中和目标中的重要作用。
{"title":"Research on the legal system of economic-ecological synergistic compensation in carbon neutral marine cities with a background in big data","authors":"","doi":"10.1016/j.bdr.2024.100476","DOIUrl":"10.1016/j.bdr.2024.100476","url":null,"abstract":"<div><p>With the increasingly severe global carbon emissions problem and the serious threat ecosystems face, carbon neutrality has gradually attracted widespread attention. This study provides an in-depth analysis of practical cases of international carbon neutrality initiatives and relevant experiences of marine cities, focusing on the construction and implementation of a legal system for economic, ecologically coordinated compensation. To evaluate the actual effectiveness of the legal system in marine cities, this study used a multiple linear regression model, considering factors such as the strictness of the legal system, enforcement efforts, and the level of participation of local enterprises and residents. The research results indicate that carbon emissions have significantly decreased in cities where legal systems are effectively enforced, from an average of 1.5 million tons per year to 1 million tons. At the same time, the economic growth rate of these cities has also significantly improved, increasing by about 2.5 percentage points from the original annual average of 4 % to 6.5 %. The study also found that the biodiversity index of these cities increased by 15 %, far higher than the average increase of 5 % in other cities, indicating the positive role of legal systems in protecting biodiversity. The public's participation rate in environmental protection activities has also increased from 25 % to 45 %, and the growth rate of green investment has reached an average of 8 % per year, far exceeding the 3 % growth rate of other cities. In terms of the ecosystem, data shows that the distribution of the ecosystem is stable, with an average ecological index of 508, which is in a relatively ideal state. The annual average growth rate of ecosystem restoration is about 3.5 %, further proving the effectiveness of ecological protection measures. Comprehensive empirical analysis shows that implementing the new legal system effectively reduces carbon emissions, enhances biodiversity, and promotes sustainable economic development. The economic growth rate increased from an average of 4.2 % to 5.1 % per year after implementing the new legal system, fully demonstrating the important role of the economic, ecologically coordinated compensation legal system in promoting carbon neutrality goals in marine cities.</p></div>","PeriodicalId":56017,"journal":{"name":"Big Data Research","volume":"37 ","pages":"Article 100476"},"PeriodicalIF":3.5,"publicationDate":"2024-08-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141412546","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
SD-SLAM: A semantic SLAM approach for dynamic scenes based on LiDAR point clouds SD-SLAM:基于激光雷达点云的动态场景语义 SLAM 方法
IF 3.3 3区 计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-05-28 Epub Date: 2024-05-08 DOI: 10.1016/j.bdr.2024.100463
Feiya Li , Chunyun Fu , Dongye Sun , Jian Li , Jianwen Wang

Point cloud maps generated via LiDAR sensors using extensive remotely sensed data are commonly used by autonomous vehicles and robots for localization and navigation. However, dynamic objects contained in point cloud maps not only downgrade localization accuracy and navigation performance but also jeopardize the map quality. In response to this challenge, we propose in this paper a novel semantic SLAM approach for dynamic scenes based on LiDAR point clouds, referred to as SD-SLAM hereafter. The main contributions of this work are in three aspects: 1) introducing a semantic SLAM framework dedicatedly for dynamic scenes based on LiDAR point clouds, 2) employing semantics and Kalman filtering to effectively differentiate between dynamic and semi-static landmarks, and 3) making full use of semi-static and pure static landmarks with semantic information in the SD-SLAM process to improve localization and mapping performance. To evaluate the proposed SD-SLAM, tests were conducted using the widely adopted KITTI odometry dataset. Results demonstrate that the proposed SD-SLAM effectively mitigates the adverse effects of dynamic objects on SLAM, improving vehicle localization and mapping performance in dynamic scenes, and simultaneously constructing a static semantic map with multiple semantic classes for enhanced environment understanding.

通过使用大量遥感数据的激光雷达传感器生成的点云图通常被自动驾驶车辆和机器人用于定位和导航。然而,点云图中包含的动态物体不仅会降低定位精度和导航性能,还会损害地图质量。为了应对这一挑战,我们在本文中提出了一种基于激光雷达点云的新型动态场景语义 SLAM 方法,以下简称 SD-SLAM。这项工作的主要贡献体现在三个方面:1)基于激光雷达点云为动态场景引入专用的语义 SLAM 框架;2)采用语义学和卡尔曼滤波技术有效区分动态和半静态地标;3)在 SD-SLAM 过程中充分利用半静态和纯静态地标的语义信息,提高定位和绘图性能。为了评估所提出的 SD-SLAM,我们使用广泛采用的 KITTI 测速数据集进行了测试。结果表明,所提出的 SD-SLAM 能有效减轻动态物体对 SLAM 的不利影响,提高车辆在动态场景中的定位和映射性能,并同时构建具有多个语义类别的静态语义地图,以增强对环境的理解。
{"title":"SD-SLAM: A semantic SLAM approach for dynamic scenes based on LiDAR point clouds","authors":"Feiya Li ,&nbsp;Chunyun Fu ,&nbsp;Dongye Sun ,&nbsp;Jian Li ,&nbsp;Jianwen Wang","doi":"10.1016/j.bdr.2024.100463","DOIUrl":"https://doi.org/10.1016/j.bdr.2024.100463","url":null,"abstract":"<div><p>Point cloud maps generated via LiDAR sensors using extensive remotely sensed data are commonly used by autonomous vehicles and robots for localization and navigation. However, dynamic objects contained in point cloud maps not only downgrade localization accuracy and navigation performance but also jeopardize the map quality. In response to this challenge, we propose in this paper a novel semantic SLAM approach for dynamic scenes based on LiDAR point clouds, referred to as SD-SLAM hereafter. The main contributions of this work are in three aspects: 1) introducing a semantic SLAM framework dedicatedly for dynamic scenes based on LiDAR point clouds, 2) employing semantics and Kalman filtering to effectively differentiate between dynamic and semi-static landmarks, and 3) making full use of semi-static and pure static landmarks with semantic information in the SD-SLAM process to improve localization and mapping performance. To evaluate the proposed SD-SLAM, tests were conducted using the widely adopted KITTI odometry dataset. Results demonstrate that the proposed SD-SLAM effectively mitigates the adverse effects of dynamic objects on SLAM, improving vehicle localization and mapping performance in dynamic scenes, and simultaneously constructing a static semantic map with multiple semantic classes for enhanced environment understanding.</p></div>","PeriodicalId":56017,"journal":{"name":"Big Data Research","volume":"36 ","pages":"Article 100463"},"PeriodicalIF":3.3,"publicationDate":"2024-05-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141083349","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Remote sensing-enhanced transfer learning approach for agricultural damage and change detection: A deep learning perspective 遥感增强转移学习法用于农业损害和变化检测:深度学习视角
IF 3.3 3区 计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-05-28 Epub Date: 2024-03-20 DOI: 10.1016/j.bdr.2024.100449
Zehua Liu , Jiuhao Li , Mahmood Ashraf , M.S. Syam , Muhammad Asif , Emad Mahrous Awwad , Muna Al-Razgan , Uzair Aslam Bhatti

With the continuous advancement of science and technology, there has been a growing awareness of safety among people worldwide. Natural disasters such as wildfires, earthquakes, and floods pose persistent threats to both lives and property on our planet, which serves as our fundamental habitat. While it is impossible to prevent or entirely avert these calamities, rapid identification of affected areas and prompt damage assessment post-disaster can significantly aid in the formulation of effective rescue strategies, ultimately saving more lives. This article delves into the application of transfer learning in satellite image damage assessment—a methodology that involves transferring previously acquired knowledge to enhance a model's adaptability to new tasks. Given the limited availability of datasets for satellite image analysis, transfer learning proves to be an effective approach. Specifically, the study proposes a transfer learning method based on YOLOv5 for satellite image damage assessment. Initially, a general convolutional neural network model is trained using a substantial dataset of natural images. Subsequently, the early layers of this model are frozen, while the later layers undergo training to adapt to satellite image data. Fine-tuning is then employed to further enhance the overall model performance. The results demonstrate that this approach yields a high accuracy rate in satellite image damage assessment. Moreover, compared to conventional deep learning methods, the proposed method effectively leverages pre-trained models' knowledge, thereby reducing data dependency. Additionally, it displays robust generalization capabilities across diverse tasks and datasets, underscoring its potential for facilitating transfer learning across various domains.

随着科学技术的不断进步,全世界人民的安全意识日益增强。野火、地震和洪水等自然灾害对我们赖以生存的地球的生命和财产构成了持续的威胁。虽然我们不可能预防或完全避免这些灾难,但灾后快速识别受灾地区并及时进行损失评估,可大大有助于制定有效的救援策略,最终挽救更多生命。本文深入探讨了迁移学习在卫星图像损害评估中的应用--这种方法涉及迁移以前获得的知识,以增强模型对新任务的适应性。鉴于用于卫星图像分析的数据集有限,迁移学习被证明是一种有效的方法。具体来说,本研究提出了一种基于 YOLOv5 的迁移学习方法,用于卫星图像损伤评估。首先,使用大量自然图像数据集训练一个通用卷积神经网络模型。随后,该模型的早期层被冻结,而后期层则接受训练以适应卫星图像数据。然后再进行微调,以进一步提高模型的整体性能。结果表明,这种方法在卫星图像损坏评估方面具有很高的准确率。此外,与传统的深度学习方法相比,所提出的方法有效地利用了预训练模型的知识,从而降低了数据依赖性。此外,该方法在不同的任务和数据集上都表现出了强大的泛化能力,凸显了其促进跨领域迁移学习的潜力。
{"title":"Remote sensing-enhanced transfer learning approach for agricultural damage and change detection: A deep learning perspective","authors":"Zehua Liu ,&nbsp;Jiuhao Li ,&nbsp;Mahmood Ashraf ,&nbsp;M.S. Syam ,&nbsp;Muhammad Asif ,&nbsp;Emad Mahrous Awwad ,&nbsp;Muna Al-Razgan ,&nbsp;Uzair Aslam Bhatti","doi":"10.1016/j.bdr.2024.100449","DOIUrl":"10.1016/j.bdr.2024.100449","url":null,"abstract":"<div><p>With the continuous advancement of science and technology, there has been a growing awareness of safety among people worldwide. Natural disasters such as wildfires, earthquakes, and floods pose persistent threats to both lives and property on our planet, which serves as our fundamental habitat. While it is impossible to prevent or entirely avert these calamities, rapid identification of affected areas and prompt damage assessment post-disaster can significantly aid in the formulation of effective rescue strategies, ultimately saving more lives. This article delves into the application of transfer learning in satellite image damage assessment—a methodology that involves transferring previously acquired knowledge to enhance a model's adaptability to new tasks. Given the limited availability of datasets for satellite image analysis, transfer learning proves to be an effective approach. Specifically, the study proposes a transfer learning method based on YOLOv5 for satellite image damage assessment. Initially, a general convolutional neural network model is trained using a substantial dataset of natural images. Subsequently, the early layers of this model are frozen, while the later layers undergo training to adapt to satellite image data. Fine-tuning is then employed to further enhance the overall model performance. The results demonstrate that this approach yields a high accuracy rate in satellite image damage assessment. Moreover, compared to conventional deep learning methods, the proposed method effectively leverages pre-trained models' knowledge, thereby reducing data dependency. Additionally, it displays robust generalization capabilities across diverse tasks and datasets, underscoring its potential for facilitating transfer learning across various domains.</p></div>","PeriodicalId":56017,"journal":{"name":"Big Data Research","volume":"36 ","pages":"Article 100449"},"PeriodicalIF":3.3,"publicationDate":"2024-05-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140275813","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Graph Spatial-Temporal Transformer Network for Traffic Prediction 用于交通预测的图时空变换器网络
IF 3.3 3区 计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-05-28 Epub Date: 2024-01-26 DOI: 10.1016/j.bdr.2024.100427
Zhenzhen Zhao , Guojiang Shen , Lei Wang , Xiangjie Kong

Traffic information can reflect the operating status of a city, and accurate traffic forecasting is critical in intelligent transportation systems (ITS) and urban planning. However, traffic information has complex nonlinearity and dynamic spatial-temporal dependencies due to human mobility, bringing new traffic forecasting challenges. This paper proposed a graph spatial-temporal transformer network for traffic prediction (GSTTN) to cope with the above problems. Specifically, the proposed framework explores spatial characteristics of the across-road network of traffic information hidden in human behavior patterns via a multi-view graph convolutional network (GCN). Furthermore, the transformer network with a multi-head attention mechanism is adopted to capture the random disturbance in the time series characteristics of traffic information. As a result, these two components can be used to model spatial relations and temporal trends. Finally, we examine real-world datasets, and the experiments show that the proposed framework outperforms the current state-of-the-art baselines.

交通信息可以反映一个城市的运行状况,准确的交通预测对智能交通系统(ITS)和城市规划至关重要。然而,由于人的流动性,交通信息具有复杂的非线性和动态时空依赖性,给交通预测带来了新的挑战。本文提出了一种用于交通预测的图时空变换网络(GSTTN)来应对上述问题。具体来说,本文提出的框架通过多视角图卷积网络(GCN)探索了隐藏在人类行为模式中的跨道路交通信息网络的空间特征。此外,还采用了具有多头关注机制的变压器网络来捕捉交通信息时间序列特征中的随机干扰。因此,这两个组件可用于空间关系和时间趋势建模。最后,我们对真实世界的数据集进行了研究,实验结果表明,所提出的框架优于目前最先进的基线框架。
{"title":"Graph Spatial-Temporal Transformer Network for Traffic Prediction","authors":"Zhenzhen Zhao ,&nbsp;Guojiang Shen ,&nbsp;Lei Wang ,&nbsp;Xiangjie Kong","doi":"10.1016/j.bdr.2024.100427","DOIUrl":"10.1016/j.bdr.2024.100427","url":null,"abstract":"<div><p><span>Traffic information can reflect the operating status of a city, and accurate traffic forecasting is critical in intelligent transportation systems (ITS) and urban planning. However, traffic information has complex nonlinearity and dynamic spatial-temporal dependencies due to human mobility, bringing new traffic forecasting challenges. This paper proposed a graph spatial-temporal transformer network for </span>traffic prediction<span> (GSTTN) to cope with the above problems. Specifically, the proposed framework explores spatial characteristics of the across-road network of traffic information hidden in human behavior patterns via a multi-view graph convolutional network<span> (GCN). Furthermore, the transformer network with a multi-head attention mechanism is adopted to capture the random disturbance in the time series characteristics of traffic information. As a result, these two components can be used to model spatial relations and temporal trends. Finally, we examine real-world datasets, and the experiments show that the proposed framework outperforms the current state-of-the-art baselines.</span></span></p></div>","PeriodicalId":56017,"journal":{"name":"Big Data Research","volume":"36 ","pages":"Article 100427"},"PeriodicalIF":3.3,"publicationDate":"2024-05-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139582754","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Big Data Research
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1