Science China Information Sciences最新文献_第10页

Logit prototype learning with active multimodal representation for robust open-set recognition 利用主动多模态表示的对数原型学习，实现稳健的开放集识别

IF 8.8 2区计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS

Science China Information Sciences

Pub Date : 2024-05-17 DOI: 10.1007/s11432-023-3924-x

Yimin Fu, Zhunga Liu, Zicheng Wang

Robust open-set recognition (OSR) performance has become a prerequisite for pattern recognition systems in real-world applications. However, the existing OSR methods are primarily implemented on the basis of single-modal perception, and their performance is limited when single-modal data fail to provide sufficient descriptions of the objects. Although multimodal data can provide more comprehensive information than single-modal data, the learning of decision boundaries can be affected by the feature representation gap between different modalities. To effectively integrate multimodal data for robust OSR performance, we propose logit prototype learning (LPL) with active multimodal representation. In LPL, the input multimodal data are transformed into the logit space, enabling a direct exploration of intermodal correlations without the impact of scale inconsistency. Then, the fusion weights of each modality are determined using an entropybased uncertainty estimation method. This approach realizes adaptive adjustment of the fusion strategy to provide comprehensive descriptions in the presence of external disturbances. Moreover, the single-modal and multimodal representations are jointly optimized interactively to learn discriminative decision boundaries. Finally, a stepwise recognition rule is employed to reduce the misclassification risk and facilitate the distinction between known and unknown classes. Extensive experiments on three multimodal datasets have been done to demonstrate the effectiveness of the proposed method.

强大的开放集识别（OSR）性能已成为实际应用中模式识别系统的先决条件。然而，现有的开放集识别方法主要是基于单模态感知实现的，当单模态数据无法提供足够的物体描述时，其性能就会受到限制。虽然多模态数据能提供比单模态数据更全面的信息，但决策边界的学习会受到不同模态之间特征表示差距的影响。为了有效地整合多模态数据，实现稳健的 OSR 性能，我们提出了具有主动多模态表征的 logit 原型学习（LPL）。在 LPL 中，输入的多模态数据被转换到 logit 空间，从而可以直接探索模态间的相关性，而不会受到尺度不一致的影响。然后，使用基于熵的不确定性估计方法确定每种模态的融合权重。这种方法实现了融合策略的自适应调整，从而在存在外部干扰的情况下提供全面的描述。此外，对单模态和多模态表征进行交互式联合优化，以学习判别决策边界。最后，采用逐步识别规则来降低误分类风险，并促进已知和未知类别之间的区分。我们在三个多模态数据集上进行了广泛的实验，以证明所提方法的有效性。

{"title":"Logit prototype learning with active multimodal representation for robust open-set recognition","authors":"Yimin Fu, Zhunga Liu, Zicheng Wang","doi":"10.1007/s11432-023-3924-x","DOIUrl":"https://doi.org/10.1007/s11432-023-3924-x","url":null,"abstract":"Robust open-set recognition (OSR) performance has become a prerequisite for pattern recognition systems in real-world applications. However, the existing OSR methods are primarily implemented on the basis of single-modal perception, and their performance is limited when single-modal data fail to provide sufficient descriptions of the objects. Although multimodal data can provide more comprehensive information than single-modal data, the learning of decision boundaries can be affected by the feature representation gap between different modalities. To effectively integrate multimodal data for robust OSR performance, we propose logit prototype learning (LPL) with active multimodal representation. In LPL, the input multimodal data are transformed into the logit space, enabling a direct exploration of intermodal correlations without the impact of scale inconsistency. Then, the fusion weights of each modality are determined using an entropybased uncertainty estimation method. This approach realizes adaptive adjustment of the fusion strategy to provide comprehensive descriptions in the presence of external disturbances. Moreover, the single-modal and multimodal representations are jointly optimized interactively to learn discriminative decision boundaries. Finally, a stepwise recognition rule is employed to reduce the misclassification risk and facilitate the distinction between known and unknown classes. Extensive experiments on three multimodal datasets have been done to demonstrate the effectiveness of the proposed method.","PeriodicalId":21618,"journal":{"name":"Science China Information Sciences","volume":"8 1","pages":""},"PeriodicalIF":8.8,"publicationDate":"2024-05-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141256012","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Pyramid-resolution person restoration for cross-resolution person re-identification 用于交叉分辨率人物再识别的金字塔分辨率人物还原

IF 8.8 2区计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS

Science China Information Sciences

Pub Date : 2024-05-16 DOI: 10.1007/s11432-023-4023-y

Chunlei Peng, Bo Wang, Decheng Liu, Nannan Wang, Xinbo Gao

In this study, we present a simple yet effective pyramid-resolution person restoration method for cross-resolution person re-identification. Our method involves a pyramid resolution restoration network that enhances pyramid resolution images, and utilizes feature distance fusion to leverage valuable and complementary information from these pyramid images. Extensive experiments demonstrate the effectiveness of our method on both real-world cross-resolution datasets and simulated datasets.

在这项研究中，我们提出了一种简单而有效的金字塔分辨率人物还原方法，用于跨分辨率人物再识别。我们的方法涉及一个金字塔分辨率还原网络，该网络可增强金字塔分辨率图像，并利用特征距离融合来利用这些金字塔图像中有价值的互补信息。广泛的实验证明了我们的方法在真实世界交叉分辨率数据集和模拟数据集上的有效性。

引用次数: 0

Finite-time composite learning control for nonlinear teleoperation systems under networked time-varying delays 网络时变延迟下非线性远程操纵系统的有限时间复合学习控制

IF 8.8 2区计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS

Science China Information Sciences

Pub Date : 2024-05-16 DOI: 10.1007/s11432-023-3931-0

Yana Yang, Huixin Jiang, Changchun Hua, Junpeng Li

The robust finite-time synchronization control problem is investigated for master-slave networked nonlinear telerobotics systems (NNTSs) in this article. Although there have been some research achievements on finite-time control for the NNTSs, these studies are based on the strong assumptions of communication time delays or can only achieve finite-time bounded convergence even when the external forces are zero. Accordingly and in view of the importance of these issues, a novel robust composite learning adaptive control scheme rendering the finite-time master-slave synchronization is proposed in this paper. In particular, the influence of time delays on finite-time convergence of the system is analyzed by employing the multi-dimension finite-time small-gain framework. Meanwhile, in order to achieve accurate and fast estimation of uncertain parameters of the system, both the online historical and the instantaneous data of the estimation data are explored to derive the new parameter adaptive law under a more realizable interval-excitation (IE) condition. Therefore, the convergence of the position/force synchronization errors and the adaptive parameter estimation errors is obtained in finite time, and enhanced robustness of the closed-loop system will also be ensured. Finally, the superior performance of the proposed control algorithms is validated by numerical simulations and hardware experiments.

本文研究了主从联网非线性远程机器人系统（NNTS）的鲁棒性有限时间同步控制问题。虽然已有一些关于 NNTS 有限时间控制的研究成果，但这些研究都基于通信时间延迟的强假设，或者即使在外力为零时也只能实现有限时间有界收敛。因此，鉴于这些问题的重要性，本文提出了一种渲染有限时间主从同步的新型鲁棒复合学习自适应控制方案。其中，通过采用多维有限时间小增益框架，分析了时间延迟对系统有限时间收敛性的影响。同时，为了实现对系统不确定参数的精确、快速估计，本文同时探讨了估计数据的在线历史数据和瞬时数据，在更可实现的区间激励（IE）条件下推导出新的参数自适应规律。因此，位置/力同步误差和自适应参数估计误差将在有限时间内收敛，闭环系统的鲁棒性也将得到增强。最后，通过数值模拟和硬件实验验证了所提控制算法的优越性能。

{"title":"Finite-time composite learning control for nonlinear teleoperation systems under networked time-varying delays","authors":"Yana Yang, Huixin Jiang, Changchun Hua, Junpeng Li","doi":"10.1007/s11432-023-3931-0","DOIUrl":"https://doi.org/10.1007/s11432-023-3931-0","url":null,"abstract":"The robust finite-time synchronization control problem is investigated for master-slave networked nonlinear telerobotics systems (NNTSs) in this article. Although there have been some research achievements on finite-time control for the NNTSs, these studies are based on the strong assumptions of communication time delays or can only achieve finite-time bounded convergence even when the external forces are zero. Accordingly and in view of the importance of these issues, a novel robust composite learning adaptive control scheme rendering the finite-time master-slave synchronization is proposed in this paper. In particular, the influence of time delays on finite-time convergence of the system is analyzed by employing the multi-dimension finite-time small-gain framework. Meanwhile, in order to achieve accurate and fast estimation of uncertain parameters of the system, both the online historical and the instantaneous data of the estimation data are explored to derive the new parameter adaptive law under a more realizable interval-excitation (IE) condition. Therefore, the convergence of the position/force synchronization errors and the adaptive parameter estimation errors is obtained in finite time, and enhanced robustness of the closed-loop system will also be ensured. Finally, the superior performance of the proposed control algorithms is validated by numerical simulations and hardware experiments.","PeriodicalId":21618,"journal":{"name":"Science China Information Sciences","volume":"46 1","pages":""},"PeriodicalIF":8.8,"publicationDate":"2024-05-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141062379","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Meta label associated loss for fine-grained visual recognition 用于细粒度视觉识别的元标签相关损失

IF 8.8 2区计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS

Science China Information Sciences

Pub Date : 2024-05-15 DOI: 10.1007/s11432-023-3922-2

Yanchao Li, Fu Xiao, Hao Li, Qun Li, Shui Yu

Recently, intensive attempts have been made to design robust models for fine-grained visual recognition, most notably are the impressive gains for training with noisy labels by incorporating a reweighting strategy into a meta-learning framework. However, it is limited to up or downweighting the contribution of an instance for label reweighting approaches in the learning process. To solve this issue, a novel noise-tolerant method with auxiliary web data is proposed. Specifically, first, the associations made from embeddings of well-labeled data with those of web data and back at the same class are measured. Next, its association probability is employed as a weighting fusion strategy into angular margin-based loss, which makes the trained model robust to noisy datasets. To reduce the influence of the gap between the well-labeled and noisy web data, a bridge schema is proposed via the corresponding loss that encourages the learned embeddings to be coherent. Lastly, the formulation is encapsulated into the meta-learning framework, which can reduce the overfitting of models and learn the network parameters to be noise-tolerant. Extensive experiments are performed on benchmark datasets, and the results clearly show the superiority of the proposed method over existing state-of-the-art approaches.

最近，人们在设计用于细粒度视觉识别的稳健模型方面进行了大量尝试，其中最引人注目的是通过在元学习框架中加入重新加权策略，在使用噪声标签进行训练时获得了令人印象深刻的收益。然而，这种方法仅限于在学习过程中提高或降低标签重权方法的实例贡献权重。为了解决这个问题，我们提出了一种利用辅助网络数据的新型容噪方法。具体地说，首先，测量了标签良好的数据嵌入与网络数据嵌入之间的关联，并将其返回到同一类别。然后，将其关联概率作为一种加权融合策略，纳入基于角度余量的损失中，从而使训练好的模型对噪声数据集具有鲁棒性。为了减少标记良好的网络数据与噪声网络数据之间差距的影响，我们通过相应的损失提出了一种桥接方案，鼓励学习到的嵌入数据保持一致。最后，该公式被封装到元学习框架中，可以减少模型的过拟合，并学习网络参数，使其具有噪声耐受性。我们在基准数据集上进行了广泛的实验，结果清楚地表明，所提出的方法优于现有的最先进方法。

{"title":"Meta label associated loss for fine-grained visual recognition","authors":"Yanchao Li, Fu Xiao, Hao Li, Qun Li, Shui Yu","doi":"10.1007/s11432-023-3922-2","DOIUrl":"https://doi.org/10.1007/s11432-023-3922-2","url":null,"abstract":"Recently, intensive attempts have been made to design robust models for fine-grained visual recognition, most notably are the impressive gains for training with noisy labels by incorporating a reweighting strategy into a meta-learning framework. However, it is limited to up or downweighting the contribution of an instance for label reweighting approaches in the learning process. To solve this issue, a novel noise-tolerant method with auxiliary web data is proposed. Specifically, first, the associations made from embeddings of well-labeled data with those of web data and back at the same class are measured. Next, its association probability is employed as a weighting fusion strategy into angular margin-based loss, which makes the trained model robust to noisy datasets. To reduce the influence of the gap between the well-labeled and noisy web data, a bridge schema is proposed via the corresponding loss that encourages the learned embeddings to be coherent. Lastly, the formulation is encapsulated into the meta-learning framework, which can reduce the overfitting of models and learn the network parameters to be noise-tolerant. Extensive experiments are performed on benchmark datasets, and the results clearly show the superiority of the proposed method over existing state-of-the-art approaches.","PeriodicalId":21618,"journal":{"name":"Science China Information Sciences","volume":"132 1","pages":""},"PeriodicalIF":8.8,"publicationDate":"2024-05-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141062386","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

On equivalence of state-based potential games 论基于状态的潜在博弈的等价性

IF 8.8 2区计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS

Science China Information Sciences

Pub Date : 2024-05-15 DOI: 10.1007/s11432-023-3995-5

Han Wu, Jinhuan Wang

In this paper, we explore state-based potential games using the semi-tensor product of matrices. First, applying the potential equation, we derive both a necessary and sufficient condition as well as a sufficient condition to verify whether a state-based game qualifies as a potential game. Next, we present two static equivalence conditions of state-based potential games. We then delve into dynamic equivalence. We propose a criterion that allows us to identify state-based games that are dynamically equivalent to state-based potential games and share similar dynamic properties. Ultimately, we introduce the concept of state-based networked evolutionary games. We provide a necessary and sufficient condition to ensure that a state-based networked evolutionary game can be classified as a state-based potential game.

在本文中，我们利用矩阵的半张积来探索基于状态的潜在博弈。首先，我们运用势等式，推导出一个必要条件和充分条件，以及一个验证基于状态的博弈是否符合势博弈的充分条件。接下来，我们提出了基于状态的潜在博弈的两个静态等价条件。然后，我们深入探讨动态等价性。我们提出了一个标准，它能让我们识别出动态等价于基于状态的潜在博弈且具有相似动态属性的基于状态的博弈。最后，我们引入了基于状态的网络演化博弈概念。我们提供了一个必要条件和充分条件，以确保基于状态的网络演化博弈可以归类为基于状态的潜在博弈。

引用次数: 0

Contact engineering for temperature stability improvement of Bi-contacted MoS2 field effect transistors 接触工程改善双接触 MoS2 场效应晶体管的温度稳定性

IF 8.8 2区计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS

Science China Information Sciences

Pub Date : 2024-05-15 DOI: 10.1007/s11432-023-3942-2

Zizheng Liu, Qing Zhang, Xiaohe Huang, Chunsen Liu, Peng Zhou

Semimetallic bismuth (Bi) is one of the most effective strategies for reducing the contact resistance of two-dimensional transition metal dichalcogenide field effect transistors (FETs). However, the low melting point of Bi contact (271.5° C) limits its reliable applications. In this study, we demonstrated that the temperature stability of Bi-contacted electrodes could be improved by inserting a high-melting point semimetallic antimony (Sb) between the Bi contacting layer and the gold (Au) capping layer. The proposed Bi/Sb/Au contact electrodes tended to form a metal mixture with a continuous surface during the heating process (Voids appeared on the surface of the Bi/Au contact electrodes after heating at 120° C). Because of the improved contacting layer formed by the semimetal Bi/Sb alloy, the fabricated Bi/Sb/Au-contacted molybdenum sulfide (MoS₂) FETs with different gate lengths demonstrated higher on-state current stability after heating treatment than the Bi/Au contact. Because of the Bi/Sb/Au contact and poly (methyl methacrylate) package, the MoS₂ FETs demonstrated time stability of at least two months from the almost unchanged transfer characteristics. The electrical stability indicates that the insertion of semimetallic Sb is a promising technology for reliable Bi-based contact.

半金属铋（Bi）是降低二维过渡金属二卤化场效应晶体管（FET）接触电阻的最有效策略之一。然而，铋触点的低熔点（271.5° C）限制了它的可靠应用。在这项研究中，我们证明了在铋接触层和金（Au）封端层之间插入高熔点半金属锑（Sb）可以提高铋接触电极的温度稳定性。在加热过程中，拟议的 Bi/Sb/Au 接触电极往往会形成表面连续的金属混合物（在 120 摄氏度加热后，Bi/Au 接触电极的表面会出现空洞）。由于半金属 Bi/Sb 合金形成的接触层得到了改善，因此在加热处理后，制造出的不同栅极长度的 Bi/Sb/Au 接触硫化钼 (MoS2) FET 比 Bi/Au 接触具有更高的通态电流稳定性。由于采用了 Bi/Sb/Au 触点和聚（甲基丙烯酸甲酯）封装，MoS2 FET 从几乎不变的传输特性来看，具有至少两个月的时间稳定性。这种电气稳定性表明，插入半金属锑是一种可靠的铋基接触技术。

{"title":"Contact engineering for temperature stability improvement of Bi-contacted MoS2 field effect transistors","authors":"Zizheng Liu, Qing Zhang, Xiaohe Huang, Chunsen Liu, Peng Zhou","doi":"10.1007/s11432-023-3942-2","DOIUrl":"https://doi.org/10.1007/s11432-023-3942-2","url":null,"abstract":"Semimetallic bismuth (Bi) is one of the most effective strategies for reducing the contact resistance of two-dimensional transition metal dichalcogenide field effect transistors (FETs). However, the low melting point of Bi contact (271.5° C) limits its reliable applications. In this study, we demonstrated that the temperature stability of Bi-contacted electrodes could be improved by inserting a high-melting point semimetallic antimony (Sb) between the Bi contacting layer and the gold (Au) capping layer. The proposed Bi/Sb/Au contact electrodes tended to form a metal mixture with a continuous surface during the heating process (Voids appeared on the surface of the Bi/Au contact electrodes after heating at 120° C). Because of the improved contacting layer formed by the semimetal Bi/Sb alloy, the fabricated Bi/Sb/Au-contacted molybdenum sulfide (MoS2) FETs with different gate lengths demonstrated higher on-state current stability after heating treatment than the Bi/Au contact. Because of the Bi/Sb/Au contact and poly (methyl methacrylate) package, the MoS2 FETs demonstrated time stability of at least two months from the almost unchanged transfer characteristics. The electrical stability indicates that the insertion of semimetallic Sb is a promising technology for reliable Bi-based contact.","PeriodicalId":21618,"journal":{"name":"Science China Information Sciences","volume":"133 1","pages":""},"PeriodicalIF":8.8,"publicationDate":"2024-05-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141062639","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Understanding adversarial attacks on observations in deep reinforcement learning 了解深度强化学习中对观察结果的对抗性攻击

IF 8.8 2区计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS

Science China Information Sciences

Pub Date : 2024-04-26 DOI: 10.1007/s11432-021-3688-y

You Qiaoben, Chengyang Ying, Xinning Zhou, Hang Su, Jun Zhu, Bo Zhang

Deep reinforcement learning models are vulnerable to adversarial attacks that can decrease the cumulative expected reward of a victim by manipulating its observations. Despite the efficiency of previous optimization-based methods for generating adversarial noise in supervised learning, such methods might not achieve the lowest cumulative reward since they do not generally explore the environmental dynamics. Herein, a framework is provided to better understand the existing methods by reformulating the problem of adversarial attacks on reinforcement learning in the function space. The reformulation approach adopted herein generates an optimal adversary in the function space of targeted attacks, repelling them via a generic two-stage framework. In the first stage, a deceptive policy is trained by hacking the environment and discovering a set of trajectories routing to the lowest reward or the worst-case performance. Next, the adversary misleads the victim to imitate the deceptive policy by perturbing the observations. Compared to existing approaches, it is theoretically shown that our adversary is strong under an appropriate noise level. Extensive experiments demonstrate the superiority of the proposed method in terms of efficiency and effectiveness, achieving state-of-the-art performance in both Atari and MuJoCo environments.

深度强化学习模型很容易受到对抗性攻击的影响，对抗性攻击可以通过操纵受害者的观察结果来降低其累积预期奖励。尽管之前基于优化的方法在监督学习中生成对抗噪声的效率很高，但由于这些方法一般不会探索环境动态，因此可能无法实现最低累积奖励。本文提供了一个框架，通过在函数空间中重新表述对强化学习的对抗性攻击问题，更好地理解现有方法。本文采用的重新表述方法可在目标攻击的函数空间中生成一个最佳对手，并通过一个通用的两阶段框架击退它们。在第一阶段，通过黑客攻击环境并发现一组通向最低奖励或最差性能的轨迹来训练欺骗性策略。接下来，对手通过扰动观测数据误导受害者模仿欺骗性策略。与现有方法相比，理论证明我们的对手在适当的噪声水平下是强大的。广泛的实验证明了所提方法在效率和效果方面的优越性，在 Atari 和 MuJoCo 环境中均达到了最先进的性能。

{"title":"Understanding adversarial attacks on observations in deep reinforcement learning","authors":"You Qiaoben, Chengyang Ying, Xinning Zhou, Hang Su, Jun Zhu, Bo Zhang","doi":"10.1007/s11432-021-3688-y","DOIUrl":"https://doi.org/10.1007/s11432-021-3688-y","url":null,"abstract":"Deep reinforcement learning models are vulnerable to adversarial attacks that can decrease the cumulative expected reward of a victim by manipulating its observations. Despite the efficiency of previous optimization-based methods for generating adversarial noise in supervised learning, such methods might not achieve the lowest cumulative reward since they do not generally explore the environmental dynamics. Herein, a framework is provided to better understand the existing methods by reformulating the problem of adversarial attacks on reinforcement learning in the function space. The reformulation approach adopted herein generates an optimal adversary in the function space of targeted attacks, repelling them via a generic two-stage framework. In the first stage, a deceptive policy is trained by hacking the environment and discovering a set of trajectories routing to the lowest reward or the worst-case performance. Next, the adversary misleads the victim to imitate the deceptive policy by perturbing the observations. Compared to existing approaches, it is theoretically shown that our adversary is strong under an appropriate noise level. Extensive experiments demonstrate the superiority of the proposed method in terms of efficiency and effectiveness, achieving state-of-the-art performance in both Atari and MuJoCo environments.","PeriodicalId":21618,"journal":{"name":"Science China Information Sciences","volume":"8 1","pages":""},"PeriodicalIF":8.8,"publicationDate":"2024-04-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140842157","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Sampling-efficient path planning and improved actor-critic-based obstacle avoidance for autonomous robots 自主机器人的采样高效路径规划和基于行为批判的改进型避障技术

IF 8.8 2区计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS

Science China Information Sciences

Pub Date : 2024-04-26 DOI: 10.1007/s11432-022-3904-9

Yefeng Yang, Tao Huang, Tianqi Wang, Wenyu Yang, Han Chen, Boyang Li, Chih-yung Wen

Autonomous robots have garnered extensive utilization in diverse fields. Among the critical concerns for autonomous systems, path planning holds paramount importance. Notwithstanding considerable efforts in its development over the years, path planning for autonomous systems continues to grapple with challenges related to low planning efficiency and inadequate obstacle avoidance response in a timely manner. This study proposes a novel and systematic solution to the path planning problem within intricate office buildings. The solution consists of a global planner and a local planner. To handle the global planning aspect, an adaptive clustering-based dynamic programming rapidly exploring random tree (ACDP-RRT) algorithm is proposed. ACDP-RRT effectively identifies obstacles on the map by leveraging geometric features. These obstacles are then represented as a collection of sequentially arranged convex polygons, optimizing the sampling region and significantly enhancing sampling efficiency. For local planning, a network decoupling actor-critic (ND-AC) algorithm is employed. The proposed ND-AC simplifies the local planner design process by integrating planning and control loops into a neural network (NN) trained via an end-to-end model-free deep reinforcement learning (DRL) framework. Moreover, the adoption of network decoupling (ND) techniques leads to an improved obstacle avoidance success rate when compared to conventional actor-critic (AC)-based methods. Extensive simulations and experiments are conducted to demonstrate the effectiveness and robustness of the proposed approach.

自主机器人在各个领域都得到了广泛应用。在自主系统的关键问题中，路径规划至关重要。尽管多年来自主系统在路径规划方面做出了巨大努力，但仍然面临着规划效率低、避障反应不及时等挑战。本研究针对错综复杂的办公建筑内的路径规划问题提出了一种新颖而系统的解决方案。该解决方案由全局规划器和局部规划器组成。为了处理全局规划方面的问题，提出了一种基于聚类的自适应动态编程快速探索随机树（ACDP-RRT）算法。ACDP-RRT 利用几何特征有效识别地图上的障碍物。然后，将这些障碍物表示为顺序排列的凸多边形集合，从而优化采样区域，显著提高采样效率。在局部规划方面，采用了网络解耦演员批判（ND-AC）算法。所提出的 ND-AC 将规划和控制回路集成到通过端到端无模型深度强化学习（DRL）框架训练的神经网络（NN）中，从而简化了局部规划器的设计过程。此外，与传统的基于行为批判（AC）的方法相比，采用网络解耦（ND）技术提高了避障成功率。为了证明所提方法的有效性和鲁棒性，我们进行了大量的模拟和实验。

{"title":"Sampling-efficient path planning and improved actor-critic-based obstacle avoidance for autonomous robots","authors":"Yefeng Yang, Tao Huang, Tianqi Wang, Wenyu Yang, Han Chen, Boyang Li, Chih-yung Wen","doi":"10.1007/s11432-022-3904-9","DOIUrl":"https://doi.org/10.1007/s11432-022-3904-9","url":null,"abstract":"Autonomous robots have garnered extensive utilization in diverse fields. Among the critical concerns for autonomous systems, path planning holds paramount importance. Notwithstanding considerable efforts in its development over the years, path planning for autonomous systems continues to grapple with challenges related to low planning efficiency and inadequate obstacle avoidance response in a timely manner. This study proposes a novel and systematic solution to the path planning problem within intricate office buildings. The solution consists of a global planner and a local planner. To handle the global planning aspect, an adaptive clustering-based dynamic programming rapidly exploring random tree (ACDP-RRT) algorithm is proposed. ACDP-RRT effectively identifies obstacles on the map by leveraging geometric features. These obstacles are then represented as a collection of sequentially arranged convex polygons, optimizing the sampling region and significantly enhancing sampling efficiency. For local planning, a network decoupling actor-critic (ND-AC) algorithm is employed. The proposed ND-AC simplifies the local planner design process by integrating planning and control loops into a neural network (NN) trained via an end-to-end model-free deep reinforcement learning (DRL) framework. Moreover, the adoption of network decoupling (ND) techniques leads to an improved obstacle avoidance success rate when compared to conventional actor-critic (AC)-based methods. Extensive simulations and experiments are conducted to demonstrate the effectiveness and robustness of the proposed approach.","PeriodicalId":21618,"journal":{"name":"Science China Information Sciences","volume":"99 1","pages":""},"PeriodicalIF":8.8,"publicationDate":"2024-04-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140836008","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

A credible traffic prediction method based on self-supervised causal discovery 基于自监督因果发现的可信交通预测方法

IF 8.8 2区计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS

Science China Information Sciences

Pub Date : 2024-04-26 DOI: 10.1007/s11432-023-3899-1

Dan Wang, Yingjie Liu, Bin Song

Next-generation wireless network aims to support low-latency, high-speed data transmission services by incorporating artificial intelligence (AI) technologies. To fulfill this promise, AI-based network traffic prediction is essential for pre-allocating resources, such as bandwidth and computing power. This can help reduce network congestion and improve the quality of service (QoS) for users. Most studies achieve future traffic prediction by exploiting deep learning and reinforcement learning, to mine spatio-temporal correlated variables. Nevertheless, the prediction results obtained only by the spatio-temporal correlated variables cannot reflect real traffic changes. This phenomenon prevents the true prediction variables from being inferred, making the prediction algorithm perform poorly. Inspired by causal science, we propose a novel network traffic prediction method based on self-supervised spatio-temporal causal discovery (SSTCD). We first introduce the Granger causal discovery algorithm to build a causal graph among prediction variables and obtain spatio-temporal causality in the observed data, which reflects the real reasons affecting traffic changes. Next, a graph neural network (GNN) is adopted to incorporate causality for traffic prediction. Furthermore, we propose a self-supervised method to implement causal discovery to to address the challenge of lacking ground-truth causal graphs in the observed data. Experimental results demonstrate the effectiveness of the SSTCD method.

下一代无线网络旨在通过采用人工智能（AI）技术来支持低延迟、高速数据传输服务。要实现这一目标，基于人工智能的网络流量预测对于预先分配带宽和计算能力等资源至关重要。这有助于减少网络拥塞，提高用户的服务质量（QoS）。大多数研究通过利用深度学习和强化学习挖掘时空相关变量来实现未来流量预测。然而，仅通过时空相关变量获得的预测结果无法反映真实的流量变化。这种现象导致无法推断出真正的预测变量，从而使预测算法表现不佳。受因果科学的启发，我们提出了一种基于自监督时空因果发现（SSTCD）的新型网络流量预测方法。首先，我们引入格兰杰因果发现算法，在预测变量之间建立因果图，获得观测数据的时空因果关系，从而反映出影响流量变化的真正原因。接下来，我们采用图神经网络（GNN）将因果关系纳入交通预测。此外，我们还提出了一种自监督方法来实现因果发现，以解决观测数据中缺乏地面真实因果图的难题。实验结果证明了 SSTCD 方法的有效性。

{"title":"A credible traffic prediction method based on self-supervised causal discovery","authors":"Dan Wang, Yingjie Liu, Bin Song","doi":"10.1007/s11432-023-3899-1","DOIUrl":"https://doi.org/10.1007/s11432-023-3899-1","url":null,"abstract":"Next-generation wireless network aims to support low-latency, high-speed data transmission services by incorporating artificial intelligence (AI) technologies. To fulfill this promise, AI-based network traffic prediction is essential for pre-allocating resources, such as bandwidth and computing power. This can help reduce network congestion and improve the quality of service (QoS) for users. Most studies achieve future traffic prediction by exploiting deep learning and reinforcement learning, to mine spatio-temporal correlated variables. Nevertheless, the prediction results obtained only by the spatio-temporal correlated variables cannot reflect real traffic changes. This phenomenon prevents the true prediction variables from being inferred, making the prediction algorithm perform poorly. Inspired by causal science, we propose a novel network traffic prediction method based on self-supervised spatio-temporal causal discovery (SSTCD). We first introduce the Granger causal discovery algorithm to build a causal graph among prediction variables and obtain spatio-temporal causality in the observed data, which reflects the real reasons affecting traffic changes. Next, a graph neural network (GNN) is adopted to incorporate causality for traffic prediction. Furthermore, we propose a self-supervised method to implement causal discovery to to address the challenge of lacking ground-truth causal graphs in the observed data. Experimental results demonstrate the effectiveness of the SSTCD method.","PeriodicalId":21618,"journal":{"name":"Science China Information Sciences","volume":"18 1","pages":""},"PeriodicalIF":8.8,"publicationDate":"2024-04-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140835895","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Heterogeneous integration of 2D materials on Si charge-coupled devices as optical memory 在硅电荷耦合器件上异质集成二维材料作为光存储器

IF 8.8 2区计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS

Science China Information Sciences

Pub Date : 2024-04-26 DOI: 10.1007/s11432-024-3993-5

Zheng Bian, Feng Tian, Zongwen Li, Xiangwei Su, Tianjiao Zhang, Jialei Miao, Bin Yu, Yang Xu, Yuda Zhao

Optical memory integrates the function of optical sensing in memory devices, remarkably promoting the interconnection between sensory and memory terminals. Silicon charge-coupled photodetectors and floating gate memory have been widely used in imaging and storage technologies, respectively. However, the heterogeneous integration of the two devices requires technological innovation and complex electrical connections. In this work, we adopt a three-dimensional layer stacking method to design a novel optical memory device. On the top of Si charge-coupled photodetectors, we successively deposit two-dimensional graphene, hexagonal boron nitride, and molybdenum disulfide as a floating gate layer, a tunneling layer, and a readout layer, respectively. By applying a gate bias on lightly doped Si, a deep depletion layer is formed with a high voltage potential drop. Under dark conditions, the depletion layer cannot be filled, and the electric field across the h-BN tunnel barrier is relatively small. Under light irradiation, the deep depletion layer is gradually filled, and the h-BN tunneling layer withstands the increasing electric field, resulting in charge storage in the floating gate layer. Based on this mechanism, the device exhibits a gate voltage-dependent operation mode, including an integrated optical sensing-memory mode and an electrically driven storage mode. Under moderate gate voltage, the device can effectively detect the optical information with varied intensity and store the optical information in the floating gate, displaying optically controlled memory characteristics. Our work demonstrates a compact device structure for optical memory and displays excellent optically controlled memory performance, which can be applied in artificial vision systems.

光学存储器将光学传感功能集成到存储器件中，极大地促进了传感终端与存储器终端之间的互联。硅电荷耦合光电探测器和浮栅存储器已分别在成像和存储技术中得到广泛应用。然而，这两种器件的异构集成需要技术创新和复杂的电气连接。在这项工作中，我们采用三维层叠方法设计了一种新型光存储器件。在硅电荷耦合光电探测器的顶部，我们依次沉积了二维石墨烯、六方氮化硼和二硫化钼，分别作为浮动栅极层、隧道层和读出层。通过在轻掺杂的硅上施加栅极偏压，可以形成具有高电压势降的深耗尽层。在黑暗条件下，耗尽层无法填充，h-BN 隧道势垒上的电场相对较小。在光照条件下，深耗尽层逐渐被填满，h-BN 隧道层承受住了不断增大的电场，从而在浮动栅极层中存储了电荷。基于这种机制，该器件呈现出一种与栅极电压有关的工作模式，包括集成光传感记忆模式和电驱动存储模式。在适度的栅极电压下，该器件能有效地检测不同强度的光学信息，并将光学信息存储在浮动栅极中，显示出光控存储器的特性。我们的研究成果展示了一种结构紧凑的光学存储器器件，并显示出卓越的光控存储器性能，可应用于人工视觉系统。

{"title":"Heterogeneous integration of 2D materials on Si charge-coupled devices as optical memory","authors":"Zheng Bian, Feng Tian, Zongwen Li, Xiangwei Su, Tianjiao Zhang, Jialei Miao, Bin Yu, Yang Xu, Yuda Zhao","doi":"10.1007/s11432-024-3993-5","DOIUrl":"https://doi.org/10.1007/s11432-024-3993-5","url":null,"abstract":"Optical memory integrates the function of optical sensing in memory devices, remarkably promoting the interconnection between sensory and memory terminals. Silicon charge-coupled photodetectors and floating gate memory have been widely used in imaging and storage technologies, respectively. However, the heterogeneous integration of the two devices requires technological innovation and complex electrical connections. In this work, we adopt a three-dimensional layer stacking method to design a novel optical memory device. On the top of Si charge-coupled photodetectors, we successively deposit two-dimensional graphene, hexagonal boron nitride, and molybdenum disulfide as a floating gate layer, a tunneling layer, and a readout layer, respectively. By applying a gate bias on lightly doped Si, a deep depletion layer is formed with a high voltage potential drop. Under dark conditions, the depletion layer cannot be filled, and the electric field across the h-BN tunnel barrier is relatively small. Under light irradiation, the deep depletion layer is gradually filled, and the h-BN tunneling layer withstands the increasing electric field, resulting in charge storage in the floating gate layer. Based on this mechanism, the device exhibits a gate voltage-dependent operation mode, including an integrated optical sensing-memory mode and an electrically driven storage mode. Under moderate gate voltage, the device can effectively detect the optical information with varied intensity and store the optical information in the floating gate, displaying optically controlled memory characteristics. Our work demonstrates a compact device structure for optical memory and displays excellent optically controlled memory performance, which can be applied in artificial vision systems.","PeriodicalId":21618,"journal":{"name":"Science China Information Sciences","volume":"4 1","pages":""},"PeriodicalIF":8.8,"publicationDate":"2024-04-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140835915","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0