Frontiers of Computer Science最新文献

英文中文

Discriminative explicit instance selection for implicit discourse relation classification 用于隐式话语关系分类的辨证显式实例选择

IF 4.2 3区计算机科学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS

Frontiers of Computer Science

Pub Date : 2023-12-23 DOI: 10.1007/s11704-023-3058-2

Wei Song, Hongfei Han, Xu Han, Miaomiao Cheng, Jiefu Gong, Shijin Wang, Ting Liu

Discourse relation classification is a fundamental task for discourse analysis, which is essential for understanding the structure and connection of texts. Implicit discourse relation classification aims to determine the relationship between adjacent sentences and is very challenging because it lacks explicit discourse connectives as linguistic cues and sufficient annotated training data. In this paper, we propose a discriminative instance selection method to construct synthetic implicit discourse relation data from easy-to-collect explicit discourse relations. An expanded instance consists of an argument pair and its sense label. We introduce the argument pair type classification task, which aims to distinguish between implicit and explicit argument pairs and select the explicit argument pairs that are most similar to natural implicit argument pairs for data expansion. We also propose a simple label-smoothing technique to assign robust sense labels for the selected argument pairs. We evaluate our method on PDTB 2.0 and PDTB 3.0. The results show that our method can consistently improve the performance of the baseline model, and achieve competitive results with the state-of-the-art models.

话语关系分类是话语分析的一项基本任务，对于理解文本的结构和联系至关重要。隐式话语关系分类旨在确定相邻句子之间的关系，由于缺乏显式话语连接词作为语言线索和足够的注释训练数据，因此非常具有挑战性。在本文中，我们提出了一种判别实例选择方法，从易于收集的显式话语关系中构建合成的隐式话语关系数据。扩展实例由参数对及其意义标签组成。我们介绍了论据对类型分类任务，该任务旨在区分隐式和显式论据对，并选择与自然隐式论据对最相似的显式论据对进行数据扩展。我们还提出了一种简单的标签平滑技术，为所选参数对分配稳健的意义标签。我们在 PDTB 2.0 和 PDTB 3.0 上对我们的方法进行了评估。结果表明，我们的方法可以持续提高基线模型的性能，并取得与最先进模型相当的结果。

{"title":"Discriminative explicit instance selection for implicit discourse relation classification","authors":"Wei Song, Hongfei Han, Xu Han, Miaomiao Cheng, Jiefu Gong, Shijin Wang, Ting Liu","doi":"10.1007/s11704-023-3058-2","DOIUrl":"https://doi.org/10.1007/s11704-023-3058-2","url":null,"abstract":"Discourse relation classification is a fundamental task for discourse analysis, which is essential for understanding the structure and connection of texts. Implicit discourse relation classification aims to determine the relationship between adjacent sentences and is very challenging because it lacks explicit discourse connectives as linguistic cues and sufficient annotated training data. In this paper, we propose a discriminative instance selection method to construct synthetic implicit discourse relation data from easy-to-collect explicit discourse relations. An expanded instance consists of an argument pair and its sense label. We introduce the argument pair type classification task, which aims to distinguish between implicit and explicit argument pairs and select the explicit argument pairs that are most similar to natural implicit argument pairs for data expansion. We also propose a simple label-smoothing technique to assign robust sense labels for the selected argument pairs. We evaluate our method on PDTB 2.0 and PDTB 3.0. The results show that our method can consistently improve the performance of the baseline model, and achieve competitive results with the state-of-the-art models.","PeriodicalId":12640,"journal":{"name":"Frontiers of Computer Science","volume":"30 1","pages":""},"PeriodicalIF":4.2,"publicationDate":"2023-12-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139025427","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

How graph convolutions amplify popularity bias for recommendation? 图卷积如何放大推荐时的人气偏差？

IF 4.2 3区计算机科学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS

Frontiers of Computer Science

Pub Date : 2023-12-23 DOI: 10.1007/s11704-023-2655-2

Abstract

Graph convolutional networks (GCNs) have become prevalent in recommender system (RS) due to their superiority in modeling collaborative patterns. Although improving the overall accuracy, GCNs unfortunately amplify popularity bias — tail items are less likely to be recommended. This effect prevents the GCN-based RS from making precise and fair recommendations, decreasing the effectiveness of recommender systems in the long run.

In this paper, we investigate how graph convolutions amplify the popularity bias in RS. Through theoretical analyses, we identify two fundamental factors: (1) with graph convolution (i.e., neighborhood aggregation), popular items exert larger influence than tail items on neighbor users, making the users move towards popular items in the representation space; (2) after multiple times of graph convolution, popular items would affect more high-order neighbors and become more influential. The two points make popular items get closer to almost users and thus being recommended more frequently. To rectify this, we propose to estimate the amplified effect of popular nodes on each node’s representation, and intervene the effect after each graph convolution. Specifically, we adopt clustering to discover highly-influential nodes and estimate the amplification effect of each node, then remove the effect from the node embeddings at each graph convolution layer. Our method is simple and generic — it can be used in the inference stage to correct existing models rather than training a new model from scratch, and can be applied to various GCN models. We demonstrate our method on two representative GCN backbones LightGCN and UltraGCN, verifying its ability in improving the recommendations of tail items without sacrificing the performance of popular items. Codes are open-sourced ¹⁾.

摘要图形卷积网络（GCN）由于其在协作模式建模方面的优势，已在推荐系统（RS）中得到广泛应用。虽然 GCNs 提高了整体准确性，但不幸的是，它放大了流行偏差--尾部项目不太可能被推荐。这种效应使基于 GCN 的 RS 无法做出精确、公平的推荐，从而降低了推荐系统的长期有效性。在本文中，我们研究了图卷积是如何放大 RS 中的流行度偏差的。通过理论分析，我们发现了两个基本因素：(1）在图卷积（即邻域聚合）的作用下，热门条目比尾部条目对邻居用户的影响更大，从而使用户在表示空间中向热门条目移动；（2）经过多次图卷积后，热门条目会影响更多的高阶邻居，变得更具影响力。这两点使得热门条目更接近用户，从而更频繁地被推荐。为了解决这个问题，我们建议估算热门节点对每个节点表示的放大效应，并在每次图卷积后对该效应进行干预。具体来说，我们采用聚类来发现高影响力节点，并估算每个节点的放大效应，然后在每个图卷积层从节点嵌入中去除该效应。我们的方法简单而通用--它可以在推理阶段用于修正现有模型，而不是从头开始训练一个新模型，而且可以应用于各种 GCN 模型。我们在两个具有代表性的 GCN 主干网 LightGCN 和 UltraGCN 上演示了我们的方法，验证了它在不牺牲热门项目性能的情况下改进尾部项目推荐的能力。代码开源 1).

{"title":"How graph convolutions amplify popularity bias for recommendation?","authors":"","doi":"10.1007/s11704-023-2655-2","DOIUrl":"https://doi.org/10.1007/s11704-023-2655-2","url":null,"abstract":"<h3>Abstract</h3> Graph convolutional networks (GCNs) have become prevalent in recommender system (RS) due to their superiority in modeling collaborative patterns. Although improving the overall accuracy, GCNs unfortunately amplify popularity bias — tail items are less likely to be recommended. This effect prevents the GCN-based RS from making precise and fair recommendations, decreasing the effectiveness of recommender systems in the long run. In this paper, we investigate how graph convolutions amplify the popularity bias in RS. Through theoretical analyses, we identify two fundamental factors: (1) with graph convolution (i.e., neighborhood aggregation), popular items exert larger influence than tail items on neighbor users, making the users move towards popular items in the representation space; (2) after multiple times of graph convolution, popular items would affect more high-order neighbors and become more influential. The two points make popular items get closer to almost users and thus being recommended more frequently. To rectify this, we propose to estimate the amplified effect of popular nodes on each node’s representation, and intervene the effect after each graph convolution. Specifically, we adopt clustering to discover highly-influential nodes and estimate the amplification effect of each node, then remove the effect from the node embeddings at each graph convolution layer. Our method is simple and generic — it can be used in the inference stage to correct existing models rather than training a new model from scratch, and can be applied to various GCN models. We demonstrate our method on two representative GCN backbones LightGCN and UltraGCN, verifying its ability in improving the recommendations of tail items without sacrificing the performance of popular items. Codes are open-sourced 1).","PeriodicalId":12640,"journal":{"name":"Frontiers of Computer Science","volume":"19 1","pages":""},"PeriodicalIF":4.2,"publicationDate":"2023-12-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139026894","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Blockchain based federated learning for intrusion detection for Internet of Things 基于区块链的联合学习，用于物联网入侵检测

IF 4.2 3区计算机科学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS

Frontiers of Computer Science

Pub Date : 2023-12-23 DOI: 10.1007/s11704-023-3026-8

Nan Sun, Wei Wang, Yongxin Tong, Kexin Liu

In Internet of Things (IoT), data sharing among different devices can improve manufacture efficiency and reduce workload, and yet make the network systems be more vulnerable to various intrusion attacks. There has been realistic demand to develop an efficient intrusion detection algorithm for connected devices. Most of existing intrusion detection methods are trained in a centralized manner and are incapable to identify new unlabeled attack types. In this paper, a distributed federated intrusion detection method is proposed, utilizing the information contained in the labeled data as the prior knowledge to discover new unlabeled attack types. Besides, the blockchain technique is introduced in the federated learning process for the consensus of the entire framework. Experimental results are provided to show that our approach can identify the malicious entities, while outperforming the existing methods in discovering new intrusion attack types.

在物联网（IoT）中，不同设备之间的数据共享可以提高生产效率、减少工作量，但也使网络系统更容易受到各种入侵攻击。为联网设备开发一种高效的入侵检测算法已成为现实需求。现有的入侵检测方法大多采用集中式训练，无法识别新的无标记攻击类型。本文提出了一种分布式联合入侵检测方法，利用标记数据中包含的信息作为先验知识，发现新的未标记攻击类型。此外，在联合学习过程中引入了区块链技术，以达成整个框架的共识。实验结果表明，我们的方法可以识别恶意实体，同时在发现新的入侵攻击类型方面优于现有方法。

引用次数: 0

Optimizing B+-tree for hybrid memory with in-node hotspot cache and eADR awareness 利用节点内热点缓存和 eADR 感知优化混合内存的 B+ 树

IF 4.2 3区计算机科学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS

Frontiers of Computer Science

Pub Date : 2023-12-23 DOI: 10.1007/s11704-023-3344-x

Peiquan Jin, Zhaole Chu, Gaocong Liu, Yongping Luo, Shouhong Wan

The advance in Non-Volatile Memory (NVM) has changed the traditional DRAM-only memory system. Compared to DRAM, NVM has the advantages of non-volatility and large capacity. However, as the read/write speed of NVM is still lower than that of DRAM, building DRAM/NVM-based hybrid memory systems is a feasible way of adding NVM into the current computer architecture. This paper aims to optimize the well-known B⁺-tree for hybrid memory. The novelty of this study is two-fold. First, we observed that the space utilization of internal nodes in B⁺-tree is generally below 70%. Inspired by this observation, we propose to maintain hot keys in the free space within internal nodes, yielding a new index named HATree (Hotness-Aware Tree). The new idea of HATree is to use the unused space of the parent of leaf nodes (PLNs) as the hotspot data cache. Thus, no extra space is needed, and the in-node hotspot cache can efficiently improve query performance. Second, to further improve the update performance of HATree, we propose to utilize the eADR technology supported by the third-generation Intel Xeon Scalable Processors to enhance HATree with instant log persistence, which results in the new HATree-Log structure. We conduct extensive experiments on real hybrid memory architecture involving DRAM and Intel Optane Persistent Memory to evaluate the performance of HATree and HATree-Log. Three state-of-the-art indices for hybrid memory, namely NBTree, LBTree, and FPTree, are included in the experiments, and the results suggest the efficiency of HATree and HATree-Log.

非易失性存储器（NVM）的发展改变了传统的 DRAM 存储系统。与 DRAM 相比，NVM 具有非易失性和大容量的优点。然而，由于 NVM 的读/写速度仍低于 DRAM，因此构建基于 DRAM/NVM 的混合内存系统是将 NVM 添加到当前计算机体系结构中的一种可行方法。本文旨在优化著名的混合内存 B+ 树。这项研究有两方面的新意。首先，我们发现 B+ 树内部节点的空间利用率一般低于 70%。受这一观察结果的启发，我们提出在内部节点的空闲空间中维护热键，从而产生了一种名为 HATree（热度感知树）的新索引。HATree 的新思路是利用叶节点（PLN）父节点的闲置空间作为热点数据缓存。因此，不需要额外的空间，节点内的热点缓存可以有效地提高查询性能。其次，为了进一步提高 HATree 的更新性能，我们建议利用第三代英特尔至强可扩展处理器支持的 eADR 技术来增强 HATree 的即时日志持久性，从而形成新的 HATree-Log 结构。我们在涉及 DRAM 和英特尔 Optane 持久内存的实际混合内存架构上进行了大量实验，以评估 HATree 和 HATree-Log 的性能。实验包括三种最先进的混合内存指数，即 NBTree、LBTree 和 FPTree，结果表明 HATree 和 HATree-Log 非常高效。

{"title":"Optimizing B+-tree for hybrid memory with in-node hotspot cache and eADR awareness","authors":"Peiquan Jin, Zhaole Chu, Gaocong Liu, Yongping Luo, Shouhong Wan","doi":"10.1007/s11704-023-3344-x","DOIUrl":"https://doi.org/10.1007/s11704-023-3344-x","url":null,"abstract":"The advance in Non-Volatile Memory (NVM) has changed the traditional DRAM-only memory system. Compared to DRAM, NVM has the advantages of non-volatility and large capacity. However, as the read/write speed of NVM is still lower than that of DRAM, building DRAM/NVM-based hybrid memory systems is a feasible way of adding NVM into the current computer architecture. This paper aims to optimize the well-known B+-tree for hybrid memory. The novelty of this study is two-fold. First, we observed that the space utilization of internal nodes in B+-tree is generally below 70%. Inspired by this observation, we propose to maintain hot keys in the free space within internal nodes, yielding a new index named HATree (Hotness-Aware Tree). The new idea of HATree is to use the unused space of the parent of leaf nodes (PLNs) as the hotspot data cache. Thus, no extra space is needed, and the in-node hotspot cache can efficiently improve query performance. Second, to further improve the update performance of HATree, we propose to utilize the eADR technology supported by the third-generation Intel Xeon Scalable Processors to enhance HATree with instant log persistence, which results in the new HATree-Log structure. We conduct extensive experiments on real hybrid memory architecture involving DRAM and Intel Optane Persistent Memory to evaluate the performance of HATree and HATree-Log. Three state-of-the-art indices for hybrid memory, namely NBTree, LBTree, and FPTree, are included in the experiments, and the results suggest the efficiency of HATree and HATree-Log.","PeriodicalId":12640,"journal":{"name":"Frontiers of Computer Science","volume":"112 1","pages":""},"PeriodicalIF":4.2,"publicationDate":"2023-12-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139025409","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Route selection for opportunity-sensing and prediction of waterlogging 机会感应和内涝预测的路线选择

IF 4.2 3区计算机科学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS

Frontiers of Computer Science

Pub Date : 2023-12-18 DOI: 10.1007/s11704-023-2714-8

Abstract

Accurate monitoring of urban waterlogging contributes to the city’s normal operation and the safety of residents’ daily travel. However, due to feedback delays or high costs, existing methods make large-scale, fine-grained waterlogging monitoring impossible. A common method is to forecast the city’s global waterlogging status using its partial waterlogging data. This method has two challenges: first, existing predictive algorithms are either driven by knowledge or data alone; and second, the partial waterlogging data is not collected selectively, resulting in poor predictions. To overcome the aforementioned challenges, this paper proposes a framework for large-scale and fine-grained spatiotemporal waterlogging monitoring based on the opportunistic sensing of limited bus routes. This framework follows the Sparse Crowdsensing and mainly comprises a pair of iterative predictor and selector. The predictor uses the collected waterlogging status and the predicted status of the uncollected area to train the graph convolutional neural network. It combines both knowledge-driven and data-driven approaches and can be used to forecast waterlogging status in all regions for the upcoming term. The selector consists of a two-stage selection procedure that can select valuable bus routes while satisfying budget constraints. The experimental results on real waterlogging and bus routes in Shenzhen show that the proposed framework could easily perform urban waterlogging monitoring with low cost, high accuracy, wide coverage, and fine granularity.

摘要城市内涝的准确监测有助于城市的正常运行和居民的日常出行安全。然而，由于反馈延迟或成本高昂，现有方法无法实现大规模、精细化的内涝监测。一种常见的方法是利用部分内涝数据预测城市的整体内涝状况。这种方法存在两个挑战：首先，现有的预测算法要么仅由知识驱动，要么仅由数据驱动；其次，没有选择性地收集部分内涝数据，导致预测结果不佳。为了克服上述挑战，本文提出了一种基于有限公交线路机会感知的大规模精细时空内涝监测框架。该框架遵循稀疏人群感知原理，主要由一对迭代预测器和选择器组成。预测器使用收集到的内涝状况和未收集区域的预测状况来训练图卷积神经网络。它结合了知识驱动和数据驱动两种方法，可用于预测所有地区下一年度的内涝状况。选择器由两阶段选择程序组成，可在满足预算限制的前提下选择有价值的公交线路。在深圳实际内涝和公交线路上的实验结果表明，所提出的框架可以轻松实现低成本、高精度、广覆盖和细粒度的城市内涝监测。

{"title":"Route selection for opportunity-sensing and prediction of waterlogging","authors":"","doi":"10.1007/s11704-023-2714-8","DOIUrl":"https://doi.org/10.1007/s11704-023-2714-8","url":null,"abstract":"<h3>Abstract</h3> Accurate monitoring of urban waterlogging contributes to the city’s normal operation and the safety of residents’ daily travel. However, due to feedback delays or high costs, existing methods make large-scale, fine-grained waterlogging monitoring impossible. A common method is to forecast the city’s global waterlogging status using its partial waterlogging data. This method has two challenges: first, existing predictive algorithms are either driven by knowledge or data alone; and second, the partial waterlogging data is not collected selectively, resulting in poor predictions. To overcome the aforementioned challenges, this paper proposes a framework for large-scale and fine-grained spatiotemporal waterlogging monitoring based on the opportunistic sensing of limited bus routes. This framework follows the Sparse Crowdsensing and mainly comprises a pair of iterative predictor and selector. The predictor uses the collected waterlogging status and the predicted status of the uncollected area to train the graph convolutional neural network. It combines both knowledge-driven and data-driven approaches and can be used to forecast waterlogging status in all regions for the upcoming term. The selector consists of a two-stage selection procedure that can select valuable bus routes while satisfying budget constraints. The experimental results on real waterlogging and bus routes in Shenzhen show that the proposed framework could easily perform urban waterlogging monitoring with low cost, high accuracy, wide coverage, and fine granularity.","PeriodicalId":12640,"journal":{"name":"Frontiers of Computer Science","volume":"45 1","pages":""},"PeriodicalIF":4.2,"publicationDate":"2023-12-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138743332","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Multi-user reinforcement learning based task migration in mobile edge computing 移动边缘计算中基于多用户强化学习的任务迁移

IF 4.2 3区计算机科学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS

Frontiers of Computer Science

Pub Date : 2023-12-18 DOI: 10.1007/s11704-023-1346-3

Yuya Cui, Degan Zhang, Jie Zhang, Ting Zhang, Lixiang Cao, Lu Chen

Mobile Edge Computing (MEC) is a promising approach. Dynamic service migration is a key technology in MEC. In order to maintain the continuity of services in a dynamic environment, mobile users need to migrate tasks between multiple servers in real time. Due to the uncertainty of movement, frequent migration will increase delays and costs and non-migration will lead to service interruption. Therefore, it is very challenging to design an optimal migration strategy. In this paper, we investigate the multi-user task migration problem in a dynamic environment and minimizes the average service delay while meeting the migration cost. In order to optimize the service delay and migration cost, we propose an adaptive weight deep deterministic policy gradient (AWDDPG) algorithm. And distributed execution and centralized training are adopted to solve the high-dimensional problem. Experiments show that the proposed algorithm can greatly reduce the migration cost and service delay compared with the other related algorithms.

移动边缘计算（MEC）是一种前景广阔的方法。动态服务迁移是 MEC 的一项关键技术。为了在动态环境中保持服务的连续性，移动用户需要在多个服务器之间实时迁移任务。由于移动的不确定性，频繁迁移会增加延迟和成本，而不迁移则会导致服务中断。因此，设计一种最佳迁移策略非常具有挑战性。本文研究了动态环境下的多用户任务迁移问题，并在满足迁移成本的前提下使平均服务延迟最小化。为了优化服务延迟和迁移成本，我们提出了一种自适应权重深度确定性策略梯度（AWDDPG）算法。并采用分布式执行和集中式训练来解决高维问题。实验表明，与其他相关算法相比，所提出的算法可以大大降低迁移成本和服务延迟。

引用次数: 0

HACAN: a hierarchical answer-aware and context-aware network for question generation HACAN：用于生成问题的分层答案感知和上下文感知网络

IF 4.2 3区计算机科学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS

Frontiers of Computer Science

Pub Date : 2023-12-18 DOI: 10.1007/s11704-023-2246-2

Ruijun Sun, Hanqin Tao, Yanmin Chen, Qi Liu

Question Generation (QG) is the task of generating questions according to the given contexts. Most of the existing methods are based on Recurrent Neural Networks (RNNs) for generating questions with passage-level input for providing more details, which seriously suffer from such problems as gradient vanishing and ineffective information utilization. In fact, reasonably extracting useful information from a given context is more in line with our actual needs during questioning especially in the education scenario. To that end, in this paper, we propose a novel Hierarchical Answer-Aware and Context-Aware Network (HACAN) to construct a high-quality passage representation and judge the balance between the sentences and the whole passage. Specifically, a Hierarchical Passage Encoder (HPE) is proposed to construct an answer-aware and context-aware passage representation, with a strategy of utilizing multi-hop reasoning. Then, we draw inspiration from the actual human questioning process and design a Hierarchical Passage-aware Decoder (HPD) which determines when to utilize the passage information. We conduct extensive experiments on the SQuAD dataset, where the results verify the effectiveness of our model in comparison with several baselines.

问题生成（QG）是一项根据给定语境生成问题的任务。现有的方法大多基于循环神经网络（RNN）生成问题，并通过段落级输入提供更多细节，这些方法存在梯度消失和信息利用率低等严重问题。事实上，从给定上下文中合理提取有用信息更符合我们在提问过程中的实际需求，尤其是在教育场景中。为此，我们在本文中提出了一种新颖的分层答案感知和上下文感知网络（HACAN）来构建高质量的语段表示，并判断句子和整个语段之间的平衡。具体来说，我们提出了一个分层段落编码器（HPE），以构建一个答案感知和上下文感知的段落表示，并利用多跳推理策略。然后，我们从实际的人类提问过程中汲取灵感，设计了分层段落感知解码器（HPD），用于确定何时利用段落信息。我们在 SQuAD 数据集上进行了广泛的实验，实验结果验证了我们的模型与几种基线相比的有效性。

{"title":"HACAN: a hierarchical answer-aware and context-aware network for question generation","authors":"Ruijun Sun, Hanqin Tao, Yanmin Chen, Qi Liu","doi":"10.1007/s11704-023-2246-2","DOIUrl":"https://doi.org/10.1007/s11704-023-2246-2","url":null,"abstract":"Question Generation (QG) is the task of generating questions according to the given contexts. Most of the existing methods are based on Recurrent Neural Networks (RNNs) for generating questions with passage-level input for providing more details, which seriously suffer from such problems as gradient vanishing and ineffective information utilization. In fact, reasonably extracting useful information from a given context is more in line with our actual needs during questioning especially in the education scenario. To that end, in this paper, we propose a novel Hierarchical Answer-Aware and Context-Aware Network (HACAN) to construct a high-quality passage representation and judge the balance between the sentences and the whole passage. Specifically, a Hierarchical Passage Encoder (HPE) is proposed to construct an answer-aware and context-aware passage representation, with a strategy of utilizing multi-hop reasoning. Then, we draw inspiration from the actual human questioning process and design a Hierarchical Passage-aware Decoder (HPD) which determines when to utilize the passage information. We conduct extensive experiments on the SQuAD dataset, where the results verify the effectiveness of our model in comparison with several baselines.","PeriodicalId":12640,"journal":{"name":"Frontiers of Computer Science","volume":"6 1","pages":""},"PeriodicalIF":4.2,"publicationDate":"2023-12-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138743206","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Embracing connected intelligence with the YuanOS architecture: one OS kit for all 利用 YuanOS 架构实现互联智能：一个操作系统套件满足所有需求

IF 4.2 3区计算机科学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS

Frontiers of Computer Science

Pub Date : 2023-12-18 DOI: 10.1007/s11704-023-3997-5

Haibo Chen, Ning Jia, Jie Yin

引用次数: 0

Semantic-aware entity alignment for low resource language knowledge graph 针对低资源语言知识图谱的语义感知实体对齐

IF 4.2 3区计算机科学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS

Frontiers of Computer Science

Pub Date : 2023-12-18 DOI: 10.1007/s11704-023-2542-x

Junfei Tang, Ran Song, Yuxin Huang, Shengxiang Gao, Zhengtao Yu

Entity alignment (EA) is an important technique aiming to find the same real entity between two different source knowledge graphs (KGs). Current methods typically learn the embedding of entities for EA from the structure of KGs for EA. Most EA models are designed for rich-resource languages, requiring sufficient resources such as a parallel corpus and pre-trained language models. However, low-resource language KGs have received less attention, and current models demonstrate poor performance on those low-resource KGs. Recently, researchers have fused relation information and attributes for entity representations to enhance the entity alignment performance, but the relation semantics are often ignored. To address these issues, we propose a novel Semantic-aware Graph Neural Network (SGNN) for entity alignment. First, we generate pseudo sentences according to the relation triples and produce representations using pre-trained models. Second, our approach explores semantic information from the connected relations by a graph neural network. Our model captures expanded feature information from KGs. Experimental results using three low-resource languages demonstrate that our proposed SGNN approach out performs better than state-of-the-art alignment methods on three proposed datasets and three public datasets.

实体对齐（EA）是一项重要技术，旨在从两个不同的源知识图谱（KG）中找到相同的真实实体。目前的方法通常是从用于 EA 的知识图谱结构中学习用于 EA 的实体嵌入。大多数 EA 模型都是为资源丰富的语言设计的，需要足够的资源，如并行语料库和预训练的语言模型。然而，低资源语言的 KGs 受到的关注较少，而且当前的模型在这些低资源 KGs 上表现不佳。最近，研究人员融合了关系信息和实体表征的属性，以提高实体配准性能，但关系语义往往被忽视。为了解决这些问题，我们提出了一种用于实体配准的新型语义感知图神经网络（SGNN）。首先，我们根据关系三元组生成伪句子，并使用预训练模型生成表示。其次，我们的方法通过图神经网络从连接关系中挖掘语义信息。我们的模型可以捕捉 KG 中的扩展特征信息。使用三种低资源语言的实验结果表明，我们提出的 SGNN 方法在三个拟议数据集和三个公开数据集上的表现优于最先进的对齐方法。

{"title":"Semantic-aware entity alignment for low resource language knowledge graph","authors":"Junfei Tang, Ran Song, Yuxin Huang, Shengxiang Gao, Zhengtao Yu","doi":"10.1007/s11704-023-2542-x","DOIUrl":"https://doi.org/10.1007/s11704-023-2542-x","url":null,"abstract":"Entity alignment (EA) is an important technique aiming to find the same real entity between two different source knowledge graphs (KGs). Current methods typically learn the embedding of entities for EA from the structure of KGs for EA. Most EA models are designed for rich-resource languages, requiring sufficient resources such as a parallel corpus and pre-trained language models. However, low-resource language KGs have received less attention, and current models demonstrate poor performance on those low-resource KGs. Recently, researchers have fused relation information and attributes for entity representations to enhance the entity alignment performance, but the relation semantics are often ignored. To address these issues, we propose a novel Semantic-aware Graph Neural Network (SGNN) for entity alignment. First, we generate pseudo sentences according to the relation triples and produce representations using pre-trained models. Second, our approach explores semantic information from the connected relations by a graph neural network. Our model captures expanded feature information from KGs. Experimental results using three low-resource languages demonstrate that our proposed SGNN approach out performs better than state-of-the-art alignment methods on three proposed datasets and three public datasets.","PeriodicalId":12640,"journal":{"name":"Frontiers of Computer Science","volume":"108 1","pages":""},"PeriodicalIF":4.2,"publicationDate":"2023-12-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138715450","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

On the upper bounds of (1,0)-super solutions for the regular balanced random (k,2s)-SAT problem 论常规平衡随机 (k,2s)-SAT 问题的 (1,0)- 超级解的上限

IF 4.2 3区计算机科学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS

Frontiers of Computer Science

Pub Date : 2023-12-18 DOI: 10.1007/s11704-023-2752-2

Yongping Wang, Daoyun Xu, Jincheng Zhou

This paper explores the conditions which make a regular balanced random (k,2s)-CNF formula (1,0)-unsatisfiable with high probability. The conditions also make a random instance of the regular balanced (k − 1,2(k − 1)s)-SAT problem unsatisfiable with high probability, where the instance obeys a distribution which differs from the distribution obeyed by a regular balanced random (k − 1,2(k − 1)s)-CNF formula. Let F be a regular balanced random (k,2s)-CNF formula where k ⩾ 3, then there exists a number s₀ such that F is (1,0)-unsatisfiable with high probability if s > s₀. A numerical solution of the number s₀ when k ∈ {5, 6,…, 14} is given to conduct simulated experiments. The simulated experiments verify the theoretical result. Besides, the experiments also suggest that F is (1,0)-satisfiable with high probability if s is less than a certain value.

本文探讨了使正则平衡随机 (k,2s)-CNF 公式 (1,0)- 以高概率不可满足的条件。这些条件还使得正则平衡（k - 1,2(k - 1)s）-SAT 问题的随机实例以高概率不可满足，其中该实例服从的分布与正则平衡随机（k - 1,2(k - 1)s）-CNF 公式服从的分布不同。设 F 是一个正则平衡随机 (k,2s)-CNF 公式，其中 k ⩾ 3，则存在一个数字 s0，如果 s > s0，则 F 以高概率是 (1,0)- 不可满足的。给出了当 k∈{5, 6,..., 14} 时数字 s0 的数值解，并进行了模拟实验。模拟实验验证了理论结果。此外，实验还表明，如果 s 小于某一特定值，则 F 很有可能可满足 (1,0)。

引用次数: 0

首页上一页

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

Frontiers of Computer Science

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀