Big Data Research最新文献_第7页

End-PolarT: Polar Representation for End-to-End Scene Text Detection End-PolarT:端到端场景文本检测的极性表示

IF 3.3 3区计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Big Data Research

Pub Date : 2023-09-15 DOI: 10.1016/j.bdr.2023.100410

Yirui Wu , Qiran Kong , Cheng Qian , Michele Nappi , Shaohua Wan

Deep learning has achieved great success in text detection, where recent methods adopt inspirations from segmentation to detect scene texts. However, most segmentation based methods have high computation cost in pixel-level classification and post refinements. Moreover, they still faces challenges like arbitrary directions, curved texts, illumination and so on. Aim to improve detection accuracy and computation cost, we propose an end-to-end and single-stage method named as End-PolarT network by generating contour points in polar coordinates for text detection. End-PolarT not only regress locations of contour points instead of pixels to relieve high computation cost, but also fits with intrinsic characteristics of text instances by centers and contours to suppress mislabeling boundary pixels. To cope with polar representation, we further propose polar IoU and centerness as key parts of loss functions to generate effective paradigms for text detection. Compared with the existing methods, End-PolarT achieves superior results by testing on several public datasets, thus keeping balance between efficiency and effectiveness in complicated scenes.

深度学习在文本检测方面取得了巨大成功，最近的方法借鉴了分割的灵感来检测场景文本。然而，大多数基于分割的方法在像素级分类和后细化方面具有较高的计算成本。此外，它们还面临着任意方向、弯曲文本、照明等挑战。为了提高检测精度和计算成本，我们提出了一种端到端、单阶段的方法，称为end PolarT网络，通过在极坐标中生成轮廓点来进行文本检测。End PolarT不仅回归了轮廓点的位置而不是像素的位置以降低高昂的计算成本，而且通过中心和轮廓来适应文本实例的内在特征，以抑制边界像素的错误标记。为了处理极性表示，我们进一步提出极性IoU和中心性作为损失函数的关键部分，以生成有效的文本检测范式。与现有方法相比，End PolarT通过在多个公共数据集上进行测试，取得了优异的结果，从而在复杂场景中保持了效率和有效性的平衡。

{"title":"End-PolarT: Polar Representation for End-to-End Scene Text Detection","authors":"Yirui Wu , Qiran Kong , Cheng Qian , Michele Nappi , Shaohua Wan","doi":"10.1016/j.bdr.2023.100410","DOIUrl":"https://doi.org/10.1016/j.bdr.2023.100410","url":null,"abstract":"<div><p>Deep learning has achieved great success in text detection, where recent methods adopt inspirations from segmentation to detect scene texts. However, most segmentation based methods have high computation cost in pixel-level classification and post refinements. Moreover, they still faces challenges like arbitrary directions, curved texts, illumination and so on. Aim to improve detection accuracy and computation cost, we propose an end-to-end and single-stage method named as End-PolarT network by generating contour points in polar coordinates for text detection. End-PolarT not only regress locations of contour points instead of pixels to relieve high computation cost, but also fits with intrinsic characteristics of text instances by centers and contours to suppress mislabeling boundary pixels. To cope with polar representation, we further propose polar IoU and centerness as key parts of loss functions to generate effective paradigms for text detection. Compared with the existing methods, End-PolarT achieves superior results by testing on several public datasets, thus keeping balance between efficiency and effectiveness in complicated scenes.</p></div>","PeriodicalId":56017,"journal":{"name":"Big Data Research","volume":"34 ","pages":"Article 100410"},"PeriodicalIF":3.3,"publicationDate":"2023-09-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49733799","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Study on the Temporal and Spatial Evolution Characteristics of Chinese Public's Cognition and Attitude to “Double Reduction” Policy Based on Big Data 基于大数据的中国公众对“双减”政策认知与态度时空演化特征研究

IF 3.3 3区计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Big Data Research

Pub Date : 2023-09-11 DOI: 10.1016/j.bdr.2023.100411

Jiahui Liu , Wei Liu , Chun Yan , Xinhong Liu

The “double reduction” policy is a policy innovation of China's comprehensive education reform to build a high-quality education system. The public's cognition and attitude toward it are of great significance to its actual implementation. A total of 98396 texts related to “double reduction” collected from Sina-Weibo by web crawler technology are investigated to explore the public's cognition and attitude towards the “double reduction” policy as well as its spatio-temporal evolution characteristics. Guided by life cycle theory, the evolution of the public's attitude is studied by sentiment analysis based on the ERINE algorithm and DUTIR. Topics are selected with the adoption of TF-IDF and LDA models to perform spatio-temporal evolution of public cognition and analyze group differences. The results are as follows: the evolution of public concern about the “double reduction” policy is phased and the period of high incidence is closely related to time nodes such as policy release, the new school term, and holidays. There are temporal and spatial differences in the evolution of public attitudes between different stages and groups. Although the public holds a relatively negative attitude, with more information about the “double reduction” policy available, the public's attitude is gradually easing. Topics of public concern vary in different periods, and different groups show different emotional attitudes and have distinctive evolution characteristics of cognitive themes. Compared with other age groups, teenagers pay more attention to topics related to their studies and life. The government's official micro-blog not only shoulders the responsibility of publicizing relevant policies, but also pays close attention to the implementation of relevant policies around the country. The influential groups hold a relatively firm attitude and stable emotions and often can orient public opinions. The regional attention to the “double reduction” policy is positively correlated with the level of local economic development. The research results can help government departments learn about the public's cognition and attitude towards the “double reduction” policy to provide decision-making support, and serve as an important basis for solving existing contradictions and promoting the effective implementation of policies.

“双减”政策是我国教育综合改革建设高质量教育体系的政策创新。公众对它的认知和态度对它的实际实施具有重要意义。通过网络爬虫技术从新浪微博收集的98396篇与“双减”相关的文本，探讨公众对“双减政策”的认知和态度及其时空演变特征。在生命周期理论的指导下，基于ERINE算法和DUTIR，通过情绪分析研究了公众态度的演变。选题采用TF-IDF和LDA模型对公众认知进行时空演化，分析群体差异。结果表明：公众对“双减”政策的关注是阶段性的，高发期与政策发布、新学期、假期等时间节点密切相关。不同阶段和群体的公众态度演变存在时间和空间差异。尽管公众持相对消极的态度，但随着“双减”政策的信息越来越多，公众的态度正在逐渐缓和。公众关注的主题在不同时期各不相同，不同群体表现出不同的情感态度，具有鲜明的认知主题演变特征。与其他年龄组相比，青少年更关注与他们的学习和生活相关的话题。政府官方微博不仅肩负着宣传相关政策的责任，还密切关注各地相关政策的落实情况。有影响力的群体持有相对坚定的态度和稳定的情绪，往往能够引导公众舆论。区域对“双减”政策的重视程度与当地经济发展水平呈正相关。研究结果可以帮助政府部门了解公众对“双减”政策的认知和态度，为决策提供支持，并作为解决现有矛盾、促进政策有效实施的重要依据。

{"title":"Study on the Temporal and Spatial Evolution Characteristics of Chinese Public's Cognition and Attitude to “Double Reduction” Policy Based on Big Data","authors":"Jiahui Liu , Wei Liu , Chun Yan , Xinhong Liu","doi":"10.1016/j.bdr.2023.100411","DOIUrl":"https://doi.org/10.1016/j.bdr.2023.100411","url":null,"abstract":"<div><p><span><span>The “double reduction” policy is a policy innovation of China's comprehensive education reform to build a high-quality education system. The public's cognition and attitude toward it are of great significance to its actual implementation. A total of 98396 texts related to “double reduction” collected from Sina-Weibo by web crawler technology are investigated to explore the public's cognition and attitude towards the “double reduction” policy as well as its spatio-temporal evolution characteristics. Guided by life cycle theory, the evolution of the public's attitude is studied by </span>sentiment analysis based on the ERINE algorithm and DUTIR. Topics are selected with the adoption of TF-IDF and </span>LDA models to perform spatio-temporal evolution of public cognition and analyze group differences. The results are as follows: the evolution of public concern about the “double reduction” policy is phased and the period of high incidence is closely related to time nodes such as policy release, the new school term, and holidays. There are temporal and spatial differences in the evolution of public attitudes between different stages and groups. Although the public holds a relatively negative attitude, with more information about the “double reduction” policy available, the public's attitude is gradually easing. Topics of public concern vary in different periods, and different groups show different emotional attitudes and have distinctive evolution characteristics of cognitive themes. Compared with other age groups, teenagers pay more attention to topics related to their studies and life. The government's official micro-blog not only shoulders the responsibility of publicizing relevant policies, but also pays close attention to the implementation of relevant policies around the country. The influential groups hold a relatively firm attitude and stable emotions and often can orient public opinions. The regional attention to the “double reduction” policy is positively correlated with the level of local economic development. The research results can help government departments learn about the public's cognition and attitude towards the “double reduction” policy to provide decision-making support, and serve as an important basis for solving existing contradictions and promoting the effective implementation of policies.</p></div>","PeriodicalId":56017,"journal":{"name":"Big Data Research","volume":"34 ","pages":"Article 100411"},"PeriodicalIF":3.3,"publicationDate":"2023-09-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49733798","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

An Improved CycleGAN for Data Augmentation in Person Re-Identification 一种用于人再识别数据增强的改进CycleGAN

IF 3.3 3区计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Big Data Research

Pub Date : 2023-09-09 DOI: 10.1016/j.bdr.2023.100409

Zhenzhen Yang , Jing Shao , Yongpeng Yang

Person re-identification (ReID) has attracted more and more attention, which is to retrieve interested persons across multiple non-overlapping cameras. Matching the same person between different camera styles has always been an enormous challenge. In the existing work, cross-camera styles images generated by the cycle-consistent generative adversarial network (CycleGAN) only transfer the camera resolution and ambient lighting. The generated images produce considerable redundancy and inappropriate pictures at the same time. Although the data is added to prevent over-fitting, it also makes significant noise, so the accuracy is not significantly improved. In this paper, an improved CycleGAN is proposed to generate images for achieving improved data augmentation. The transfer of pedestrian posture is added at the same time as transferring the image style. It not only increases the diversity of pedestrian posture but also reduces the domain gap caused by the style change between cameras. Besides, through the multi-pseudo regularized label (MpRL), the generated images are assigned virtual tags dynamically in training. Through many experimental evaluations, we have achieved a very high identification accuracy on Market-1501, DukeMTMC-reID, and CUHK03-NP datasets. On the three datasets, the quantitative results of mAP are 96.20%, 93.72%, and 86.65%, and the quantitative results of rank-1 are 98.27%, 95.37%, and 90.71%, respectively. The experimental results fully show the superiority of our proposed method.

人物重新识别（ReID）越来越受到人们的关注，它是通过多个不重叠的摄像机来检索感兴趣的人。在不同的相机风格之间匹配同一个人一直是一个巨大的挑战。在现有的工作中，由循环一致性生成对抗性网络（CycleGAN）生成的跨相机风格的图像仅传递相机分辨率和环境光照。生成的图像同时产生相当大的冗余和不合适的图片。虽然添加数据是为了防止过度拟合，但它也会产生显著的噪声，因此精度没有显著提高。本文提出了一种改进的CycleGAN来生成图像，以实现改进的数据增强。行人姿势的转移是在转移图像样式的同时添加的。它不仅增加了行人姿势的多样性，还减少了相机之间风格变化造成的领域差距。此外，通过多伪正则化标签（MpRL），生成的图像在训练中被动态分配虚拟标签。通过多次实验评估，我们在Market-1501、DukeMTMC-reID和CUHK03-NP数据集上实现了非常高的识别精度。在三个数据集上，mAP的定量结果分别为96.20%、93.72%和86.65%，秩-1的定量结果为98.27%、95.37%和90.71%。实验结果充分证明了该方法的优越性。

{"title":"An Improved CycleGAN for Data Augmentation in Person Re-Identification","authors":"Zhenzhen Yang , Jing Shao , Yongpeng Yang","doi":"10.1016/j.bdr.2023.100409","DOIUrl":"https://doi.org/10.1016/j.bdr.2023.100409","url":null,"abstract":"<div><p>Person re-identification (ReID) has attracted more and more attention, which is to retrieve interested persons across multiple non-overlapping cameras. Matching the same person between different camera styles has always been an enormous challenge. In the existing work, cross-camera styles images generated by the cycle-consistent generative adversarial network<span> (CycleGAN) only transfer the camera resolution and ambient lighting. The generated images produce considerable redundancy and inappropriate pictures at the same time. Although the data is added to prevent over-fitting, it also makes significant noise, so the accuracy is not significantly improved. In this paper, an improved CycleGAN is proposed to generate images for achieving improved data augmentation. The transfer of pedestrian posture is added at the same time as transferring the image style. It not only increases the diversity of pedestrian posture but also reduces the domain gap caused by the style change between cameras. Besides, through the multi-pseudo regularized label (MpRL), the generated images are assigned virtual tags dynamically in training. Through many experimental evaluations, we have achieved a very high identification accuracy on Market-1501, DukeMTMC-reID, and CUHK03-NP datasets. On the three datasets, the quantitative results of mAP are 96.20%, 93.72%, and 86.65%, and the quantitative results of rank-1 are 98.27%, 95.37%, and 90.71%, respectively. The experimental results fully show the superiority of our proposed method.</span></p></div>","PeriodicalId":56017,"journal":{"name":"Big Data Research","volume":"34 ","pages":"Article 100409"},"PeriodicalIF":3.3,"publicationDate":"2023-09-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49711263","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Classifier-Based Nonuniform Time Slicing Method for Local Community Evolution Analysis 基于分类器的非均匀时间切片局部群落演化分析方法

IF 3.3 3区计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Big Data Research

Pub Date : 2023-09-09 DOI: 10.1016/j.bdr.2023.100408

Xiangyu Luo , Tian Wang , Gang Xin , Yan Lu , Ke Yan , Ying Liu

With the rapid expansion of the scale of a dynamic network, local community evolution analysis attracts much attention because of its efficiency and accuracy. It concentrates on a particularly interested community rather than considering all communities together. A fundamental problem is how to divide time into slices so that a dynamic network is represented as a sequence of snapshots which accurately capture the evolutionary events of the interested community. Existing time slicing methods lead to inaccurate evolution analysis results. The reason is that they usually rely on a linear strategy while the community evolution is a nonlinear process. This paper investigates the problem and proposes a classifier-based time slicing method for local community evolution analysis. First, a classifier is trained for judging whether there is a community in the given network snapshot is identified as the continuing of the community defined by the given node subset. The features for classification include internal cohesion degree and external coupling degree. Second, a time slicing method is proposed based on the trained classifier. As the network evolves, the method continuously uses the classifier to predict whether there is a community in the newest network identified as the continuing of the interested community. Whenever the answer is negative, an evolutionary event is presumed to have occurred and a new time slice is generated. Experimental results show that compared with existing time slicing methods, our proposed method achieves higher recognition rate for given redundancy ratio.

随着动态网络规模的迅速扩大，局部社区进化分析因其高效性和准确性而备受关注。它专注于一个特别感兴趣的社区，而不是将所有社区放在一起考虑。一个基本问题是如何将时间划分为切片，以便将动态网络表示为一系列快照，准确捕捉感兴趣社区的进化事件。现有的时间切片方法导致进化分析结果不准确。原因是它们通常依赖于线性策略，而群落进化是一个非线性过程。本文研究了这个问题，并提出了一种基于分类器的时间切片方法来进行局部社区进化分析。首先，训练分类器来判断给定网络中是否存在社区。快照被识别为给定节点子集定义的社区的延续。用于分类的特征包括内部内聚度和外部耦合度。其次，提出了一种基于训练分类器的时间切片方法。随着网络的发展，该方法不断使用分类器来预测在最新的网络中是否存在被识别为感兴趣社区的延续的社区。只要答案是否定的，就认为发生了进化事件，并生成了新的时间片。实验结果表明，与现有的时间切片方法相比，在给定冗余率的情况下，该方法具有更高的识别率。

{"title":"Classifier-Based Nonuniform Time Slicing Method for Local Community Evolution Analysis","authors":"Xiangyu Luo , Tian Wang , Gang Xin , Yan Lu , Ke Yan , Ying Liu","doi":"10.1016/j.bdr.2023.100408","DOIUrl":"https://doi.org/10.1016/j.bdr.2023.100408","url":null,"abstract":"<div><p>With the rapid expansion of the scale of a dynamic network, local community evolution analysis attracts much attention because of its efficiency and accuracy. It concentrates on a particularly interested community rather than considering all communities together. A fundamental problem is how to divide time into slices so that a dynamic network is represented as a sequence of snapshots which accurately capture the evolutionary events of the interested community. Existing time slicing methods lead to inaccurate evolution analysis results. The reason is that they usually rely on a linear strategy while the community evolution is a nonlinear process. This paper investigates the problem and proposes a classifier-based time slicing method for local community evolution analysis. First, a classifier is trained for judging whether there is a community in the given network snapshot is identified as the continuing of the community defined by the given node subset. The features for classification include internal cohesion degree and external coupling degree. Second, a time slicing method is proposed based on the trained classifier. As the network evolves, the method continuously uses the classifier to predict whether there is a community in the newest network identified as the continuing of the interested community. Whenever the answer is negative, an evolutionary event is presumed to have occurred and a new time slice is generated. Experimental results show that compared with existing time slicing methods, our proposed method achieves higher recognition rate for given redundancy ratio.</p></div>","PeriodicalId":56017,"journal":{"name":"Big Data Research","volume":"34 ","pages":"Article 100408"},"PeriodicalIF":3.3,"publicationDate":"2023-09-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49733790","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

A Multi-View Filter for Relation-Free Knowledge Graph Completion 一种用于无关系知识图补全的多视图滤波器

IF 3.3 3区计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Big Data Research

Pub Date : 2023-08-28 DOI: 10.1016/j.bdr.2023.100397

Juan Li , Wen Zhang , Hongtao Yu

As knowledge graphs are often incomplete, knowledge graph completion methods have been widely proposed to infer missing facts by predicting the missing element of a triple given the other two elements. However, the assumption that the two elements have to be correlated is strong. Thus in this paper, we investigate relation-free knowledge graph completion to predict relation-tail(r-t) pairs given a head entity. Considering the large scale of candidate relation-tail pairs, previous work proposed to filter r-t pairs before ranking them relying on entity types, which fails when entity types are missing or insufficient. To tackle the limitation, we propose a relation-free knowledge graph completion method that can cope with knowledge graphs without additional ontological information, such as entity types. Specifically, we propose a multi-view filter, including two intra-view modules and an inter-view module, to filter r-t pairs. For the intra-view modules, we construct head-relation and tail-relation graphs based on triples. Two graph neural networks are respectively trained on these two graphs to capture the correlations between the head entities and the relations, as well as the tail entities and the relations. The inter-view module is learned to bridge the embeddings of entities that appeared in the two graphs. In terms of ranking, existing knowledge graph embedding models are applied to score and rank the filtered candidate r-t pairs. Experimental results show the efficiency of our method in preserving higher-quality candidate r-t pairs for knowledge graphs and resulting in better relation-free knowledge graph completion.

由于知识图通常是不完整的，因此已经广泛提出了知识图补全方法，通过在给定其他两个元素的情况下预测三元组的缺失元素来推断缺失事实。然而，这两个要素必须相互关联的假设是强有力的。因此，在本文中，我们研究了在给定头部实体的情况下，无关系知识图完备度来预测关系尾（r-t）对。考虑到候选关系尾对的规模很大，以前的工作提出在根据实体类型对r-t对进行排序之前对其进行过滤，但当实体类型缺失或不足时，这种方法会失败。为了解决这一限制，我们提出了一种无关系的知识图完成方法，该方法可以在没有额外本体信息（如实体类型）的情况下处理知识图。具体来说，我们提出了一种多视图过滤器，包括两个视图内模块和一个视图间模块，用于过滤r-t对。对于视图内模块，我们构造了基于三元组的头关系图和尾关系图。在这两个图上分别训练两个图神经网络，以捕捉头部实体与关系以及尾部实体与关系之间的相关性。视图间模块用于桥接两个图中出现的实体的嵌入。在排序方面，应用现有的知识图嵌入模型对过滤后的候选r-t对进行评分和排序。实验结果表明，我们的方法在为知识图保留更高质量的候选r-t对方面是有效的，并导致更好的无关系知识图完成。

{"title":"A Multi-View Filter for Relation-Free Knowledge Graph Completion","authors":"Juan Li , Wen Zhang , Hongtao Yu","doi":"10.1016/j.bdr.2023.100397","DOIUrl":"https://doi.org/10.1016/j.bdr.2023.100397","url":null,"abstract":"<div><p>As knowledge graphs are often incomplete, knowledge graph completion methods have been widely proposed to infer missing facts by predicting the missing element of a triple given the other two elements. However, the assumption that the two elements have to be correlated is strong. Thus in this paper, we investigate <em>relation-free knowledge graph completion</em> to predict relation-tail(r-t) pairs given a head entity. Considering the large scale of candidate relation-tail pairs, previous work proposed to filter r-t pairs before ranking them relying on entity types, which fails when entity types are missing or insufficient. To tackle the limitation, we propose a relation-free knowledge graph completion method that can cope with knowledge graphs without additional ontological information, such as entity types. Specifically, we propose a multi-view filter, including two intra-view modules and an inter-view module, to filter r-t pairs. For the intra-view modules, we construct <em>head-relation</em> and <em>tail-relation</em><span> graphs based on triples. Two graph neural networks are respectively trained on these two graphs to capture the correlations between the head entities and the relations, as well as the tail entities and the relations. The inter-view module is learned to bridge the embeddings of entities that appeared in the two graphs. In terms of ranking, existing knowledge graph embedding models are applied to score and rank the filtered candidate r-t pairs. Experimental results show the efficiency of our method in preserving higher-quality candidate r-t pairs for knowledge graphs and resulting in better relation-free knowledge graph completion.</span></p></div>","PeriodicalId":56017,"journal":{"name":"Big Data Research","volume":"33 ","pages":"Article 100397"},"PeriodicalIF":3.3,"publicationDate":"2023-08-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49711261","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 1

Meta-Learning Based Dynamic Adaptive Relation Learning for Few-Shot Knowledge Graph Completion 基于元学习的动态自适应关系学习在少镜头知识图完成中的应用

IF 3.3 3区计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Big Data Research

Pub Date : 2023-08-28 DOI: 10.1016/j.bdr.2023.100394

Linqin Cai, Lingjun Wang, Rongdi Yuan, Tingjie Lai

As artificial intelligence gradually steps into cognitive intelligence stage, knowledge graphs (KGs) play an increasingly important role in many natural language processing tasks. Due to the prevalence of long-tail relations in KGs, few-shot knowledge graph completion (KGC) for link prediction of long-tail relations has gradually become a hot research topic. Current few-shot KGC methods mainly focus on the static representation of surrounding entities to explore the potential semantic features of entities, while ignoring the dynamic properties among entities and the special influence of the long-tail relation on link prediction. In this paper, a new meta-learning based dynamic adaptive relation learning model (DARL) is proposed for few-shot KGC. For obtaining better semantic information of the meta knowledge, the proposed DARL model applies a dynamic neighbor encoder to incorporate neighbor relations into entity embedding. In addition, DARL builds attention mechanism based fusion strategy for different attributes of the same relation to further enhance the relation-meta learning ability. We evaluate our DARL model on two public benchmark datasets NELL-One and WIKI-One for link prediction. Extensive experimental results indicate that our DARL outperforms the state-of-the-art models with an average relative improvement about 23.37%, 32.46% in MRR and Hits@1 on NELL-One, respectively.

随着人工智能逐渐进入认知智能阶段，知识图在许多自然语言处理任务中发挥着越来越重要的作用。由于长尾关系在KGs中的普遍性，用于长尾关系链接预测的少镜头知识图完成（KGC）逐渐成为研究热点。目前的少镜头KGC方法主要关注周围实体的静态表示，以探索实体的潜在语义特征，而忽略了实体之间的动态特性以及长尾关系对链接预测的特殊影响。本文针对少镜头KGC提出了一种新的基于元学习的动态自适应关系学习模型（DARL）。为了获得更好的元知识语义信息，所提出的DARL模型应用动态邻居编码器将邻居关系纳入实体嵌入。此外，DARL针对同一关系的不同属性构建了基于注意力机制的融合策略，以进一步增强关系元学习能力。我们在两个公共基准数据集NELL One和WIKI One上评估了我们的DARL模型，用于链路预测。大量的实验结果表明，我们的DARL优于最先进的模型，平均相对改进约为23.37%，MRR为32.46%Hits@1分别在NELL One上。

{"title":"Meta-Learning Based Dynamic Adaptive Relation Learning for Few-Shot Knowledge Graph Completion","authors":"Linqin Cai, Lingjun Wang, Rongdi Yuan, Tingjie Lai","doi":"10.1016/j.bdr.2023.100394","DOIUrl":"https://doi.org/10.1016/j.bdr.2023.100394","url":null,"abstract":"<div><p>As artificial intelligence<span> gradually steps into cognitive intelligence stage, knowledge graphs (KGs) play an increasingly important role in many natural language processing<span><span> tasks. Due to the prevalence of long-tail relations in KGs, few-shot knowledge graph completion (KGC) for link prediction of long-tail relations has gradually become a hot research topic. Current few-shot KGC methods mainly focus on the static representation of surrounding entities to explore the potential semantic features<span> of entities, while ignoring the dynamic properties among entities and the special influence of the long-tail relation on link prediction. In this paper, a new meta-learning based dynamic adaptive relation learning model (DARL) is proposed for few-shot KGC. For obtaining better semantic information of the meta knowledge, the proposed DARL model applies a dynamic neighbor encoder to incorporate neighbor relations into entity embedding. In addition, DARL builds </span></span>attention mechanism based fusion strategy for different attributes of the same relation to further enhance the relation-meta learning ability. We evaluate our DARL model on two public benchmark datasets NELL-One and WIKI-One for link prediction. Extensive experimental results indicate that our DARL outperforms the state-of-the-art models with an average relative improvement about 23.37%, 32.46% in MRR and Hits@1 on NELL-One, respectively.</span></span></p></div>","PeriodicalId":56017,"journal":{"name":"Big Data Research","volume":"33 ","pages":"Article 100394"},"PeriodicalIF":3.3,"publicationDate":"2023-08-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49711677","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 1

Task-Oriented Collaborative Graph Embedding Using Explicit High-Order Proximity for Recommendation 基于显式高阶接近度推荐的面向任务的协同图嵌入

IF 3.3 3区计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Big Data Research

Pub Date : 2023-08-28 DOI: 10.1016/j.bdr.2023.100382

Mintae Kim, Wooju Kim

A recommender or recommendation system is a subclass of information filtering systems that seeks to predict the “rating” or “preference” that a user would assign to an item. Although many collaborative filtering (CF) approaches based on neural matrix factorization (NMF) have been successful, significant scope for improvement in recommendation systems exists. The primary challenge in recommender systems is to extract high-quality user–item interaction information from sparse data. However, most studies have focused on additional review text or metadata instead of fully used high-order relationships between users and items. In this paper, we propose a novel model—Cross Neighborhood Attention Network (CNAN)—that solves this problem by designing high-order neighborhood selection and neighborhood attention networks to learn user–item interaction efficiently. Our CNAN performs rating prediction using an architecture considering only user–item interaction data. Furthermore, the proposed model uses only user–item interaction (from the user–item ratings matrix) information without additional information such as review text or metadata. We evaluated the effectiveness of the proposed model by performing experiments on five datasets with review text and three datasets with metadata. Consequently, the CNAN model demonstrated a performance improvement of up to 7.59% over the model using review text and up to 1.99% over the model using metadata. Experimental results show that CNAN achieves better recommendation performance through higher-order neighborhood information integration with neighborhood selection and attention. The results show that our model delivers higher prediction performance via efficient structural improvement without using additional information.

推荐器或推荐系统是信息过滤系统的一个子类，旨在预测用户对某个项目的“评分”或“偏好”。尽管许多基于神经矩阵分解（NMF）的协同过滤（CF）方法已经取得了成功，但在推荐系统中仍存在显著的改进空间。推荐系统的主要挑战是从稀疏数据中提取高质量的用户-项目交互信息。然而，大多数研究都集中在额外的评论文本或元数据上，而不是完全使用用户和项目之间的高阶关系。在本文中，我们提出了一种新的模型——跨邻域注意力网络（CNAN），通过设计高阶邻域选择和邻域注意力网络来有效地学习用户-项目交互，从而解决了这个问题。我们的CNAN使用只考虑用户-项目交互数据的架构来执行评级预测。此外，所提出的模型仅使用用户-项目交互（来自用户-项目评级矩阵）信息，而不使用诸如评论文本或元数据之类的附加信息。我们通过在五个包含评论文本的数据集和三个包含元数据的数据集上进行实验来评估所提出的模型的有效性。因此，与使用评审文本的模型相比，CNAN模型的性能提高了7.59%，与使用元数据的模型相比提高了1.99%。实验结果表明，CNAN通过与邻域选择和关注相结合的高阶邻域信息集成，获得了更好的推荐性能。结果表明，我们的模型在不使用额外信息的情况下，通过有效的结构改进提供了更高的预测性能。

{"title":"Task-Oriented Collaborative Graph Embedding Using Explicit High-Order Proximity for Recommendation","authors":"Mintae Kim, Wooju Kim","doi":"10.1016/j.bdr.2023.100382","DOIUrl":"https://doi.org/10.1016/j.bdr.2023.100382","url":null,"abstract":"<div><p><span><span>A recommender or recommendation system is a subclass<span> of information filtering systems that seeks to predict the “rating” or “preference” that a user would assign to an item. Although many collaborative filtering (CF) approaches based on neural matrix factorization (NMF) have been successful, significant scope for improvement in recommendation systems exists. The primary challenge in </span></span>recommender systems<span> is to extract high-quality user–item interaction information from sparse data. However, most studies have focused on additional review text or metadata instead of fully used high-order relationships between users and items. In this paper, we propose a novel model—Cross Neighborhood Attention Network (CNAN)—that solves this problem by designing high-order neighborhood selection and neighborhood attention networks to learn user–item interaction efficiently. Our CNAN performs rating prediction using an architecture considering only user–item interaction data. Furthermore, the proposed model uses only user–item interaction (from the user–item ratings matrix) information without additional information such as review text or metadata. We evaluated the effectiveness of the proposed model by performing experiments on five datasets with review text and three datasets with metadata. Consequently, the CNAN model demonstrated a performance improvement of up to 7.59% over the model using review text and up to 1.99% over the model using metadata. Experimental results show that CNAN achieves better recommendation performance through higher-order neighborhood </span></span>information integration with neighborhood selection and attention. The results show that our model delivers higher prediction performance via efficient structural improvement without using additional information.</p></div>","PeriodicalId":56017,"journal":{"name":"Big Data Research","volume":"33 ","pages":"Article 100382"},"PeriodicalIF":3.3,"publicationDate":"2023-08-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49711464","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Botnet DGA Domain Name Classification Using Transformer Network with Hybrid Embedding 基于混合嵌入变压器网络的Botnet DGA域名分类

IF 3.3 3区计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Big Data Research

Pub Date : 2023-08-28 DOI: 10.1016/j.bdr.2023.100395

Ling Ding , Peng Du , Haiwei Hou , Jian Zhang , Di Jin , Shifei Ding

One of the severest threats to cyber security is botnet, which typically uses domain names generated by Domain Generation Algorithms (DGAs) to communicate with their Command and Control (C&C) infrastructure. DGA detection and classification play an important role of assisting cyber security researchers to detect botnet C&C servers. However, many of the existing DGA detection models only focus on single scale word embedding method, and very few models are specially designed to extract more effective features for DGA detection from multiple scales word embedding. To alleviate above questions, first we propose a hybrid word embedding method, which combines character level embedding and bigram level embedding to make full use of the domain names information, and then, we design a deep neural network with hybrid embedding method to distinguish DGA domains from known legitimate domains. Finally, we evaluate our hybrid embedding method and the proposed model on ONIST dataset and compare our methods with several state-of-the-art DGA classification methods.

网络安全面临的最严重威胁之一是僵尸网络，它通常使用域生成算法（DGA）生成的域名与其指挥与控制（C&；C）基础设施进行通信。DGA检测和分类在协助网络安全研究人员检测僵尸网络C&；C服务器。然而，现有的DGA检测模型大多只关注单尺度词嵌入方法，很少有模型专门设计用于从多尺度词嵌入中提取更有效的DGA特征。为了缓解上述问题，我们首先提出了一种混合单词嵌入方法，该方法将字符级嵌入和双字符级嵌入相结合，以充分利用域名信息，然后，我们设计了一种具有混合嵌入方法的深度神经网络，以区分DGA域和已知合法域。最后，我们在ONIST数据集上评估了我们的混合嵌入方法和所提出的模型，并将我们的方法与几种最先进的DGA分类方法进行了比较。

{"title":"Botnet DGA Domain Name Classification Using Transformer Network with Hybrid Embedding","authors":"Ling Ding , Peng Du , Haiwei Hou , Jian Zhang , Di Jin , Shifei Ding","doi":"10.1016/j.bdr.2023.100395","DOIUrl":"https://doi.org/10.1016/j.bdr.2023.100395","url":null,"abstract":"<div><p><span>One of the severest threats to cyber security is botnet, which typically uses domain names generated by Domain Generation Algorithms (DGAs) to communicate with their Command and Control (C&C) infrastructure. </span>DGA detection<span> and classification play an important role of assisting cyber security researchers to detect botnet C&C servers. However, many of the existing DGA detection models only focus on single scale word embedding<span> method, and very few models are specially designed to extract more effective features for DGA detection from multiple scales word embedding. To alleviate above questions, first we propose a hybrid word embedding method, which combines character level embedding and bigram level embedding to make full use of the domain names information, and then, we design a deep neural network with hybrid embedding method to distinguish DGA domains from known legitimate domains. Finally, we evaluate our hybrid embedding method and the proposed model on ONIST dataset and compare our methods with several state-of-the-art DGA classification methods.</span></span></p></div>","PeriodicalId":56017,"journal":{"name":"Big Data Research","volume":"33 ","pages":"Article 100395"},"PeriodicalIF":3.3,"publicationDate":"2023-08-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49711678","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

A Big Data Framework to Address Building Sum Insured Misestimation 解决建筑保额误估的大数据框架

IF 3.3 3区计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Big Data Research

Pub Date : 2023-08-28 DOI: 10.1016/j.bdr.2023.100396

Callum Roberts, Adrian Gepp, James Todd

In the insurance industry, the accumulation of complex problems and volume of data creates a large scope for actuaries to apply big data techniques to investigate and provide unique solutions for millions of policyholders. With much of the actuarial focus on traditional problems like price optimisation or improving claims management, there is an opportunity to tackle other known product inefficiencies with a data-driven approach. The purpose of this paper is to build a framework that exploits big data technologies to measure and explain Australian policyholder Sum Insured Misestimation (SIM). Big data clustering and dimension reduction techniques are leveraged to measure SIM for a national home insurance portfolio. We then design predictive and prescriptive models to explore the relationship between socioeconomic and demographic factors with SIM. Real-world results from a national home insurance portfolio provide actionable business insight on SIM and facilitate solutions for stakeholders, being government and insurers.

在保险业，复杂问题和大量数据的积累为精算师应用大数据技术进行调查创造了巨大的空间，并为数百万投保人提供了独特的解决方案。由于精算师主要关注价格优化或改进索赔管理等传统问题，因此有机会通过数据驱动的方法解决其他已知的产品效率低下问题。本文的目的是建立一个利用大数据技术来衡量和解释澳大利亚投保人投保金额估计错误（SIM）的框架。利用大数据聚类和降维技术来衡量全国家庭保险投资组合的SIM。然后，我们设计了预测和规定模型，以探索社会经济和人口因素与SIM之间的关系。国家家庭保险投资组合的真实世界结果为SIM提供了可操作的商业见解，并为政府和保险公司等利益相关者提供了解决方案。

引用次数: 0

Parallel Framework for Memory-Efficient Computation of Image Descriptors for Megapixel Images 百万像素图像描述符内存高效计算的并行框架

IF 3.3 3区计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Big Data Research

Pub Date : 2023-08-28 DOI: 10.1016/j.bdr.2023.100398

Amr M. Abdeltif , Khalid M. Hosny , Mohamed M. Darwish , Ahmad Salah , Kenli Li

Image moments are image descriptors widely utilized in several image processing, pattern recognition, computer vision, and multimedia security applications. In the era of big data, the computation of image moments yields a huge memory demand, especially for large moment order and/or high-resolution images (i.e., megapixel images). The state-of-the-art moment computation methods successfully accelerate the image moment computation for digital images of a resolution smaller than 1K × 1K pixels. For digital images of higher resolutions, image moment computation is problematic. Researchers utilized GPU-based parallel processing to overcome this problem. In practice, the parallel computation of image moments using GPUs encounters the non-extended memory problem, which is the main challenge. This paper proposed a recurrent-based method for computing the Polar Complex Exponent Transform (PCET) moments of fractional orders. The proposed method utilized the symmetry of the image kernel to reduce kernel computation. In the proposed method, once a kernel value is computed in one quaternion, the other three corresponding values in the remaining three quaternions can be trivially computed. Moreover, the proposed method utilized recurrence equations to compute kernels. Thus, the required memory to store the pre-computed memory is saved. Finally, we implemented the proposed method on the GPU parallel architecture. The proposed method overcomes the memory limit due to saving the kernel's memory. The experiments show that the proposed parallel-friendly and memory-efficient method is superior to the state-of-the-art moment computation methods in memory consumption and runtimes. The proposed method computes the PCET moment of order 50 for an image of size 2K × 2K pixels in 3.5 seconds while the state-of-the-art method of comparison needs 7.0 seconds to process the same image, the memory requirements for the proposed method and the method of comparison for the were 67.0 MB and 3.4 GB, respectively. The method of comparison could not compute the image moment for any image with a resolution higher than 2K × 2K pixels. In contrast, the proposed method managed to compute the image moment up to 16K × 16K pixels image.

图像矩是广泛应用于图像处理、模式识别、计算机视觉和多媒体安全应用中的图像描述符。在大数据时代，图像矩的计算产生了巨大的内存需求，尤其是对于大矩阶和/或高分辨率图像（即百万像素图像）。最先进的矩计算方法成功地加速了分辨率小于1K的数字图像的图像矩计算 × 1K像素。对于更高分辨率的数字图像，图像矩计算是有问题的。研究人员利用基于GPU的并行处理来克服这个问题。在实践中，使用GPU并行计算图像矩遇到了非扩展内存问题，这是主要的挑战。本文提出了一种基于递归的分数阶极复指数变换矩的计算方法。该方法利用图像核的对称性来减少核计算量。在所提出的方法中，一旦在一个四元数中计算出核值，就可以平凡地计算其余三个四元数来的其他三个对应值。此外，该方法还利用递推方程来计算核。因此，保存了存储预先计算的存储器所需的存储器。最后，我们在GPU并行架构上实现了所提出的方法。所提出的方法由于节省了内核的内存而克服了内存限制。实验表明，该方法在内存消耗和运行时间方面优于现有的矩计算方法。所提出的方法计算大小为2K的图像的PCET阶矩50 × 2K像素，而最先进的比较方法需要7.0秒来处理同一图像，所提出的方法和的比较方法的内存需求分别为67.0 MB和3.4 GB。该比较方法无法计算任何分辨率高于2K的图像的图像矩 × 2K像素。相比之下，所提出的方法成功地计算了高达16K的图像力矩 × 16K像素图像。

{"title":"Parallel Framework for Memory-Efficient Computation of Image Descriptors for Megapixel Images","authors":"Amr M. Abdeltif , Khalid M. Hosny , Mohamed M. Darwish , Ahmad Salah , Kenli Li","doi":"10.1016/j.bdr.2023.100398","DOIUrl":"https://doi.org/10.1016/j.bdr.2023.100398","url":null,"abstract":"<div><p><span>Image moments are image descriptors widely utilized in several image processing, pattern recognition, computer vision, and multimedia security applications. In the era of big data, the computation of image moments yields a huge memory demand, especially for large moment order and/or high-resolution images (i.e., megapixel images). The state-of-the-art moment computation methods successfully accelerate the image moment computation for digital images of a resolution smaller than 1K × 1K pixels. For digital images of higher resolutions, image moment computation is problematic. Researchers utilized GPU-based </span>parallel processing<span> to overcome this problem. In practice, the parallel computation of image moments using GPUs encounters the non-extended memory problem, which is the main challenge. This paper proposed a recurrent-based method for computing the Polar Complex Exponent Transform (PCET) moments of fractional orders. The proposed method utilized the symmetry of the image kernel to reduce kernel computation. In the proposed method, once a kernel value is computed in one quaternion, the other three corresponding values in the remaining three quaternions can be trivially computed. Moreover, the proposed method utilized recurrence equations to compute kernels. Thus, the required memory to store the pre-computed memory is saved. Finally, we implemented the proposed method on the GPU parallel architecture. The proposed method overcomes the memory limit due to saving the kernel's memory. The experiments show that the proposed parallel-friendly and memory-efficient method is superior to the state-of-the-art moment computation methods in memory consumption and runtimes. The proposed method computes the PCET moment of order 50 for an image of size 2K × 2K pixels in 3.5 seconds while the state-of-the-art method of comparison needs 7.0 seconds to process the same image, the memory requirements for the proposed method and the method of comparison for the were 67.0 MB and 3.4 GB, respectively. The method of comparison could not compute the image moment for any image with a resolution higher than 2K × 2K pixels. In contrast, the proposed method managed to compute the image moment up to 16K × 16K pixels image.</span></p></div>","PeriodicalId":56017,"journal":{"name":"Big Data Research","volume":"33 ","pages":"Article 100398"},"PeriodicalIF":3.3,"publicationDate":"2023-08-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49711262","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 1