Pub Date : 2024-05-17DOI: 10.1007/s11432-023-3924-x
Yimin Fu, Zhunga Liu, Zicheng Wang
Robust open-set recognition (OSR) performance has become a prerequisite for pattern recognition systems in real-world applications. However, the existing OSR methods are primarily implemented on the basis of single-modal perception, and their performance is limited when single-modal data fail to provide sufficient descriptions of the objects. Although multimodal data can provide more comprehensive information than single-modal data, the learning of decision boundaries can be affected by the feature representation gap between different modalities. To effectively integrate multimodal data for robust OSR performance, we propose logit prototype learning (LPL) with active multimodal representation. In LPL, the input multimodal data are transformed into the logit space, enabling a direct exploration of intermodal correlations without the impact of scale inconsistency. Then, the fusion weights of each modality are determined using an entropybased uncertainty estimation method. This approach realizes adaptive adjustment of the fusion strategy to provide comprehensive descriptions in the presence of external disturbances. Moreover, the single-modal and multimodal representations are jointly optimized interactively to learn discriminative decision boundaries. Finally, a stepwise recognition rule is employed to reduce the misclassification risk and facilitate the distinction between known and unknown classes. Extensive experiments on three multimodal datasets have been done to demonstrate the effectiveness of the proposed method.
{"title":"Logit prototype learning with active multimodal representation for robust open-set recognition","authors":"Yimin Fu, Zhunga Liu, Zicheng Wang","doi":"10.1007/s11432-023-3924-x","DOIUrl":"https://doi.org/10.1007/s11432-023-3924-x","url":null,"abstract":"<p>Robust open-set recognition (OSR) performance has become a prerequisite for pattern recognition systems in real-world applications. However, the existing OSR methods are primarily implemented on the basis of single-modal perception, and their performance is limited when single-modal data fail to provide sufficient descriptions of the objects. Although multimodal data can provide more comprehensive information than single-modal data, the learning of decision boundaries can be affected by the feature representation gap between different modalities. To effectively integrate multimodal data for robust OSR performance, we propose logit prototype learning (LPL) with active multimodal representation. In LPL, the input multimodal data are transformed into the logit space, enabling a direct exploration of intermodal correlations without the impact of scale inconsistency. Then, the fusion weights of each modality are determined using an entropybased uncertainty estimation method. This approach realizes adaptive adjustment of the fusion strategy to provide comprehensive descriptions in the presence of external disturbances. Moreover, the single-modal and multimodal representations are jointly optimized interactively to learn discriminative decision boundaries. Finally, a stepwise recognition rule is employed to reduce the misclassification risk and facilitate the distinction between known and unknown classes. Extensive experiments on three multimodal datasets have been done to demonstrate the effectiveness of the proposed method.</p>","PeriodicalId":21618,"journal":{"name":"Science China Information Sciences","volume":"8 1","pages":""},"PeriodicalIF":8.8,"publicationDate":"2024-05-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141256012","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-05-16DOI: 10.1007/s11432-023-4023-y
Chunlei Peng, Bo Wang, Decheng Liu, Nannan Wang, Xinbo Gao
In this study, we present a simple yet effective pyramid-resolution person restoration method for cross-resolution person re-identification. Our method involves a pyramid resolution restoration network that enhances pyramid resolution images, and utilizes feature distance fusion to leverage valuable and complementary information from these pyramid images. Extensive experiments demonstrate the effectiveness of our method on both real-world cross-resolution datasets and simulated datasets.
{"title":"Pyramid-resolution person restoration for cross-resolution person re-identification","authors":"Chunlei Peng, Bo Wang, Decheng Liu, Nannan Wang, Xinbo Gao","doi":"10.1007/s11432-023-4023-y","DOIUrl":"https://doi.org/10.1007/s11432-023-4023-y","url":null,"abstract":"<p>In this study, we present a simple yet effective pyramid-resolution person restoration method for cross-resolution person re-identification. Our method involves a pyramid resolution restoration network that enhances pyramid resolution images, and utilizes feature distance fusion to leverage valuable and complementary information from these pyramid images. Extensive experiments demonstrate the effectiveness of our method on both real-world cross-resolution datasets and simulated datasets.</p>","PeriodicalId":21618,"journal":{"name":"Science China Information Sciences","volume":"35 1","pages":""},"PeriodicalIF":8.8,"publicationDate":"2024-05-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141062384","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-05-16DOI: 10.1007/s11432-023-3931-0
Yana Yang, Huixin Jiang, Changchun Hua, Junpeng Li
The robust finite-time synchronization control problem is investigated for master-slave networked nonlinear telerobotics systems (NNTSs) in this article. Although there have been some research achievements on finite-time control for the NNTSs, these studies are based on the strong assumptions of communication time delays or can only achieve finite-time bounded convergence even when the external forces are zero. Accordingly and in view of the importance of these issues, a novel robust composite learning adaptive control scheme rendering the finite-time master-slave synchronization is proposed in this paper. In particular, the influence of time delays on finite-time convergence of the system is analyzed by employing the multi-dimension finite-time small-gain framework. Meanwhile, in order to achieve accurate and fast estimation of uncertain parameters of the system, both the online historical and the instantaneous data of the estimation data are explored to derive the new parameter adaptive law under a more realizable interval-excitation (IE) condition. Therefore, the convergence of the position/force synchronization errors and the adaptive parameter estimation errors is obtained in finite time, and enhanced robustness of the closed-loop system will also be ensured. Finally, the superior performance of the proposed control algorithms is validated by numerical simulations and hardware experiments.
{"title":"Finite-time composite learning control for nonlinear teleoperation systems under networked time-varying delays","authors":"Yana Yang, Huixin Jiang, Changchun Hua, Junpeng Li","doi":"10.1007/s11432-023-3931-0","DOIUrl":"https://doi.org/10.1007/s11432-023-3931-0","url":null,"abstract":"<p>The robust finite-time synchronization control problem is investigated for master-slave networked nonlinear telerobotics systems (NNTSs) in this article. Although there have been some research achievements on finite-time control for the NNTSs, these studies are based on the strong assumptions of communication time delays or can only achieve finite-time bounded convergence even when the external forces are zero. Accordingly and in view of the importance of these issues, a novel robust composite learning adaptive control scheme rendering the finite-time master-slave synchronization is proposed in this paper. In particular, the influence of time delays on finite-time convergence of the system is analyzed by employing the multi-dimension finite-time small-gain framework. Meanwhile, in order to achieve accurate and fast estimation of uncertain parameters of the system, both the online historical and the instantaneous data of the estimation data are explored to derive the new parameter adaptive law under a more realizable interval-excitation (IE) condition. Therefore, the convergence of the position/force synchronization errors and the adaptive parameter estimation errors is obtained in finite time, and enhanced robustness of the closed-loop system will also be ensured. Finally, the superior performance of the proposed control algorithms is validated by numerical simulations and hardware experiments.</p>","PeriodicalId":21618,"journal":{"name":"Science China Information Sciences","volume":"46 1","pages":""},"PeriodicalIF":8.8,"publicationDate":"2024-05-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141062379","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-05-15DOI: 10.1007/s11432-023-3922-2
Yanchao Li, Fu Xiao, Hao Li, Qun Li, Shui Yu
Recently, intensive attempts have been made to design robust models for fine-grained visual recognition, most notably are the impressive gains for training with noisy labels by incorporating a reweighting strategy into a meta-learning framework. However, it is limited to up or downweighting the contribution of an instance for label reweighting approaches in the learning process. To solve this issue, a novel noise-tolerant method with auxiliary web data is proposed. Specifically, first, the associations made from embeddings of well-labeled data with those of web data and back at the same class are measured. Next, its association probability is employed as a weighting fusion strategy into angular margin-based loss, which makes the trained model robust to noisy datasets. To reduce the influence of the gap between the well-labeled and noisy web data, a bridge schema is proposed via the corresponding loss that encourages the learned embeddings to be coherent. Lastly, the formulation is encapsulated into the meta-learning framework, which can reduce the overfitting of models and learn the network parameters to be noise-tolerant. Extensive experiments are performed on benchmark datasets, and the results clearly show the superiority of the proposed method over existing state-of-the-art approaches.
{"title":"Meta label associated loss for fine-grained visual recognition","authors":"Yanchao Li, Fu Xiao, Hao Li, Qun Li, Shui Yu","doi":"10.1007/s11432-023-3922-2","DOIUrl":"https://doi.org/10.1007/s11432-023-3922-2","url":null,"abstract":"<p>Recently, intensive attempts have been made to design robust models for fine-grained visual recognition, most notably are the impressive gains for training with noisy labels by incorporating a reweighting strategy into a meta-learning framework. However, it is limited to up or downweighting the contribution of an instance for label reweighting approaches in the learning process. To solve this issue, a novel noise-tolerant method with auxiliary web data is proposed. Specifically, first, the associations made from embeddings of well-labeled data with those of web data and back at the same class are measured. Next, its association probability is employed as a weighting fusion strategy into angular margin-based loss, which makes the trained model robust to noisy datasets. To reduce the influence of the gap between the well-labeled and noisy web data, a bridge schema is proposed via the corresponding loss that encourages the learned embeddings to be coherent. Lastly, the formulation is encapsulated into the meta-learning framework, which can reduce the overfitting of models and learn the network parameters to be noise-tolerant. Extensive experiments are performed on benchmark datasets, and the results clearly show the superiority of the proposed method over existing state-of-the-art approaches.</p>","PeriodicalId":21618,"journal":{"name":"Science China Information Sciences","volume":"132 1","pages":""},"PeriodicalIF":8.8,"publicationDate":"2024-05-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141062386","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-05-15DOI: 10.1007/s11432-023-3995-5
Han Wu, Jinhuan Wang
In this paper, we explore state-based potential games using the semi-tensor product of matrices. First, applying the potential equation, we derive both a necessary and sufficient condition as well as a sufficient condition to verify whether a state-based game qualifies as a potential game. Next, we present two static equivalence conditions of state-based potential games. We then delve into dynamic equivalence. We propose a criterion that allows us to identify state-based games that are dynamically equivalent to state-based potential games and share similar dynamic properties. Ultimately, we introduce the concept of state-based networked evolutionary games. We provide a necessary and sufficient condition to ensure that a state-based networked evolutionary game can be classified as a state-based potential game.
{"title":"On equivalence of state-based potential games","authors":"Han Wu, Jinhuan Wang","doi":"10.1007/s11432-023-3995-5","DOIUrl":"https://doi.org/10.1007/s11432-023-3995-5","url":null,"abstract":"<p>In this paper, we explore state-based potential games using the semi-tensor product of matrices. First, applying the potential equation, we derive both a necessary and sufficient condition as well as a sufficient condition to verify whether a state-based game qualifies as a potential game. Next, we present two static equivalence conditions of state-based potential games. We then delve into dynamic equivalence. We propose a criterion that allows us to identify state-based games that are dynamically equivalent to state-based potential games and share similar dynamic properties. Ultimately, we introduce the concept of state-based networked evolutionary games. We provide a necessary and sufficient condition to ensure that a state-based networked evolutionary game can be classified as a state-based potential game.</p>","PeriodicalId":21618,"journal":{"name":"Science China Information Sciences","volume":"39 1","pages":""},"PeriodicalIF":8.8,"publicationDate":"2024-05-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141062411","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Semimetallic bismuth (Bi) is one of the most effective strategies for reducing the contact resistance of two-dimensional transition metal dichalcogenide field effect transistors (FETs). However, the low melting point of Bi contact (271.5° C) limits its reliable applications. In this study, we demonstrated that the temperature stability of Bi-contacted electrodes could be improved by inserting a high-melting point semimetallic antimony (Sb) between the Bi contacting layer and the gold (Au) capping layer. The proposed Bi/Sb/Au contact electrodes tended to form a metal mixture with a continuous surface during the heating process (Voids appeared on the surface of the Bi/Au contact electrodes after heating at 120° C). Because of the improved contacting layer formed by the semimetal Bi/Sb alloy, the fabricated Bi/Sb/Au-contacted molybdenum sulfide (MoS2) FETs with different gate lengths demonstrated higher on-state current stability after heating treatment than the Bi/Au contact. Because of the Bi/Sb/Au contact and poly (methyl methacrylate) package, the MoS2 FETs demonstrated time stability of at least two months from the almost unchanged transfer characteristics. The electrical stability indicates that the insertion of semimetallic Sb is a promising technology for reliable Bi-based contact.
半金属铋(Bi)是降低二维过渡金属二卤化场效应晶体管(FET)接触电阻的最有效策略之一。然而,铋触点的低熔点(271.5° C)限制了它的可靠应用。在这项研究中,我们证明了在铋接触层和金(Au)封端层之间插入高熔点半金属锑(Sb)可以提高铋接触电极的温度稳定性。在加热过程中,拟议的 Bi/Sb/Au 接触电极往往会形成表面连续的金属混合物(在 120 摄氏度加热后,Bi/Au 接触电极的表面会出现空洞)。由于半金属 Bi/Sb 合金形成的接触层得到了改善,因此在加热处理后,制造出的不同栅极长度的 Bi/Sb/Au 接触硫化钼 (MoS2) FET 比 Bi/Au 接触具有更高的通态电流稳定性。由于采用了 Bi/Sb/Au 触点和聚(甲基丙烯酸甲酯)封装,MoS2 FET 从几乎不变的传输特性来看,具有至少两个月的时间稳定性。这种电气稳定性表明,插入半金属锑是一种可靠的铋基接触技术。
{"title":"Contact engineering for temperature stability improvement of Bi-contacted MoS2 field effect transistors","authors":"Zizheng Liu, Qing Zhang, Xiaohe Huang, Chunsen Liu, Peng Zhou","doi":"10.1007/s11432-023-3942-2","DOIUrl":"https://doi.org/10.1007/s11432-023-3942-2","url":null,"abstract":"<p>Semimetallic bismuth (Bi) is one of the most effective strategies for reducing the contact resistance of two-dimensional transition metal dichalcogenide field effect transistors (FETs). However, the low melting point of Bi contact (271.5° C) limits its reliable applications. In this study, we demonstrated that the temperature stability of Bi-contacted electrodes could be improved by inserting a high-melting point semimetallic antimony (Sb) between the Bi contacting layer and the gold (Au) capping layer. The proposed Bi/Sb/Au contact electrodes tended to form a metal mixture with a continuous surface during the heating process (Voids appeared on the surface of the Bi/Au contact electrodes after heating at 120° C). Because of the improved contacting layer formed by the semimetal Bi/Sb alloy, the fabricated Bi/Sb/Au-contacted molybdenum sulfide (MoS<sub>2</sub>) FETs with different gate lengths demonstrated higher on-state current stability after heating treatment than the Bi/Au contact. Because of the Bi/Sb/Au contact and poly (methyl methacrylate) package, the MoS<sub>2</sub> FETs demonstrated time stability of at least two months from the almost unchanged transfer characteristics. The electrical stability indicates that the insertion of semimetallic Sb is a promising technology for reliable Bi-based contact.</p>","PeriodicalId":21618,"journal":{"name":"Science China Information Sciences","volume":"133 1","pages":""},"PeriodicalIF":8.8,"publicationDate":"2024-05-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141062639","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-04-26DOI: 10.1007/s11432-021-3688-y
You Qiaoben, Chengyang Ying, Xinning Zhou, Hang Su, Jun Zhu, Bo Zhang
Deep reinforcement learning models are vulnerable to adversarial attacks that can decrease the cumulative expected reward of a victim by manipulating its observations. Despite the efficiency of previous optimization-based methods for generating adversarial noise in supervised learning, such methods might not achieve the lowest cumulative reward since they do not generally explore the environmental dynamics. Herein, a framework is provided to better understand the existing methods by reformulating the problem of adversarial attacks on reinforcement learning in the function space. The reformulation approach adopted herein generates an optimal adversary in the function space of targeted attacks, repelling them via a generic two-stage framework. In the first stage, a deceptive policy is trained by hacking the environment and discovering a set of trajectories routing to the lowest reward or the worst-case performance. Next, the adversary misleads the victim to imitate the deceptive policy by perturbing the observations. Compared to existing approaches, it is theoretically shown that our adversary is strong under an appropriate noise level. Extensive experiments demonstrate the superiority of the proposed method in terms of efficiency and effectiveness, achieving state-of-the-art performance in both Atari and MuJoCo environments.
{"title":"Understanding adversarial attacks on observations in deep reinforcement learning","authors":"You Qiaoben, Chengyang Ying, Xinning Zhou, Hang Su, Jun Zhu, Bo Zhang","doi":"10.1007/s11432-021-3688-y","DOIUrl":"https://doi.org/10.1007/s11432-021-3688-y","url":null,"abstract":"<p>Deep reinforcement learning models are vulnerable to adversarial attacks that can decrease the cumulative expected reward of a victim by manipulating its observations. Despite the efficiency of previous optimization-based methods for generating adversarial noise in supervised learning, such methods might not achieve the lowest cumulative reward since they do not generally explore the environmental dynamics. Herein, a framework is provided to better understand the existing methods by reformulating the problem of adversarial attacks on reinforcement learning in the function space. The reformulation approach adopted herein generates an optimal adversary in the function space of targeted attacks, repelling them via a generic two-stage framework. In the first stage, a deceptive policy is trained by hacking the environment and discovering a set of trajectories routing to the lowest reward or the worst-case performance. Next, the adversary misleads the victim to imitate the deceptive policy by perturbing the observations. Compared to existing approaches, it is theoretically shown that our adversary is strong under an appropriate noise level. Extensive experiments demonstrate the superiority of the proposed method in terms of efficiency and effectiveness, achieving state-of-the-art performance in both Atari and MuJoCo environments.</p>","PeriodicalId":21618,"journal":{"name":"Science China Information Sciences","volume":"8 1","pages":""},"PeriodicalIF":8.8,"publicationDate":"2024-04-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140842157","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-04-26DOI: 10.1007/s11432-022-3904-9
Yefeng Yang, Tao Huang, Tianqi Wang, Wenyu Yang, Han Chen, Boyang Li, Chih-yung Wen
Autonomous robots have garnered extensive utilization in diverse fields. Among the critical concerns for autonomous systems, path planning holds paramount importance. Notwithstanding considerable efforts in its development over the years, path planning for autonomous systems continues to grapple with challenges related to low planning efficiency and inadequate obstacle avoidance response in a timely manner. This study proposes a novel and systematic solution to the path planning problem within intricate office buildings. The solution consists of a global planner and a local planner. To handle the global planning aspect, an adaptive clustering-based dynamic programming rapidly exploring random tree (ACDP-RRT) algorithm is proposed. ACDP-RRT effectively identifies obstacles on the map by leveraging geometric features. These obstacles are then represented as a collection of sequentially arranged convex polygons, optimizing the sampling region and significantly enhancing sampling efficiency. For local planning, a network decoupling actor-critic (ND-AC) algorithm is employed. The proposed ND-AC simplifies the local planner design process by integrating planning and control loops into a neural network (NN) trained via an end-to-end model-free deep reinforcement learning (DRL) framework. Moreover, the adoption of network decoupling (ND) techniques leads to an improved obstacle avoidance success rate when compared to conventional actor-critic (AC)-based methods. Extensive simulations and experiments are conducted to demonstrate the effectiveness and robustness of the proposed approach.
{"title":"Sampling-efficient path planning and improved actor-critic-based obstacle avoidance for autonomous robots","authors":"Yefeng Yang, Tao Huang, Tianqi Wang, Wenyu Yang, Han Chen, Boyang Li, Chih-yung Wen","doi":"10.1007/s11432-022-3904-9","DOIUrl":"https://doi.org/10.1007/s11432-022-3904-9","url":null,"abstract":"<p>Autonomous robots have garnered extensive utilization in diverse fields. Among the critical concerns for autonomous systems, path planning holds paramount importance. Notwithstanding considerable efforts in its development over the years, path planning for autonomous systems continues to grapple with challenges related to low planning efficiency and inadequate obstacle avoidance response in a timely manner. This study proposes a novel and systematic solution to the path planning problem within intricate office buildings. The solution consists of a global planner and a local planner. To handle the global planning aspect, an adaptive clustering-based dynamic programming rapidly exploring random tree (ACDP-RRT) algorithm is proposed. ACDP-RRT effectively identifies obstacles on the map by leveraging geometric features. These obstacles are then represented as a collection of sequentially arranged convex polygons, optimizing the sampling region and significantly enhancing sampling efficiency. For local planning, a network decoupling actor-critic (ND-AC) algorithm is employed. The proposed ND-AC simplifies the local planner design process by integrating planning and control loops into a neural network (NN) trained via an end-to-end model-free deep reinforcement learning (DRL) framework. Moreover, the adoption of network decoupling (ND) techniques leads to an improved obstacle avoidance success rate when compared to conventional actor-critic (AC)-based methods. Extensive simulations and experiments are conducted to demonstrate the effectiveness and robustness of the proposed approach.</p>","PeriodicalId":21618,"journal":{"name":"Science China Information Sciences","volume":"99 1","pages":""},"PeriodicalIF":8.8,"publicationDate":"2024-04-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140836008","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-04-26DOI: 10.1007/s11432-023-3899-1
Dan Wang, Yingjie Liu, Bin Song
Next-generation wireless network aims to support low-latency, high-speed data transmission services by incorporating artificial intelligence (AI) technologies. To fulfill this promise, AI-based network traffic prediction is essential for pre-allocating resources, such as bandwidth and computing power. This can help reduce network congestion and improve the quality of service (QoS) for users. Most studies achieve future traffic prediction by exploiting deep learning and reinforcement learning, to mine spatio-temporal correlated variables. Nevertheless, the prediction results obtained only by the spatio-temporal correlated variables cannot reflect real traffic changes. This phenomenon prevents the true prediction variables from being inferred, making the prediction algorithm perform poorly. Inspired by causal science, we propose a novel network traffic prediction method based on self-supervised spatio-temporal causal discovery (SSTCD). We first introduce the Granger causal discovery algorithm to build a causal graph among prediction variables and obtain spatio-temporal causality in the observed data, which reflects the real reasons affecting traffic changes. Next, a graph neural network (GNN) is adopted to incorporate causality for traffic prediction. Furthermore, we propose a self-supervised method to implement causal discovery to to address the challenge of lacking ground-truth causal graphs in the observed data. Experimental results demonstrate the effectiveness of the SSTCD method.
{"title":"A credible traffic prediction method based on self-supervised causal discovery","authors":"Dan Wang, Yingjie Liu, Bin Song","doi":"10.1007/s11432-023-3899-1","DOIUrl":"https://doi.org/10.1007/s11432-023-3899-1","url":null,"abstract":"<p>Next-generation wireless network aims to support low-latency, high-speed data transmission services by incorporating artificial intelligence (AI) technologies. To fulfill this promise, AI-based network traffic prediction is essential for pre-allocating resources, such as bandwidth and computing power. This can help reduce network congestion and improve the quality of service (QoS) for users. Most studies achieve future traffic prediction by exploiting deep learning and reinforcement learning, to mine spatio-temporal correlated variables. Nevertheless, the prediction results obtained only by the spatio-temporal correlated variables cannot reflect real traffic changes. This phenomenon prevents the true prediction variables from being inferred, making the prediction algorithm perform poorly. Inspired by causal science, we propose a novel network traffic prediction method based on self-supervised spatio-temporal causal discovery (SSTCD). We first introduce the Granger causal discovery algorithm to build a causal graph among prediction variables and obtain spatio-temporal causality in the observed data, which reflects the real reasons affecting traffic changes. Next, a graph neural network (GNN) is adopted to incorporate causality for traffic prediction. Furthermore, we propose a self-supervised method to implement causal discovery to to address the challenge of lacking ground-truth causal graphs in the observed data. Experimental results demonstrate the effectiveness of the SSTCD method.</p>","PeriodicalId":21618,"journal":{"name":"Science China Information Sciences","volume":"18 1","pages":""},"PeriodicalIF":8.8,"publicationDate":"2024-04-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140835895","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-04-26DOI: 10.1007/s11432-024-3993-5
Zheng Bian, Feng Tian, Zongwen Li, Xiangwei Su, Tianjiao Zhang, Jialei Miao, Bin Yu, Yang Xu, Yuda Zhao
Optical memory integrates the function of optical sensing in memory devices, remarkably promoting the interconnection between sensory and memory terminals. Silicon charge-coupled photodetectors and floating gate memory have been widely used in imaging and storage technologies, respectively. However, the heterogeneous integration of the two devices requires technological innovation and complex electrical connections. In this work, we adopt a three-dimensional layer stacking method to design a novel optical memory device. On the top of Si charge-coupled photodetectors, we successively deposit two-dimensional graphene, hexagonal boron nitride, and molybdenum disulfide as a floating gate layer, a tunneling layer, and a readout layer, respectively. By applying a gate bias on lightly doped Si, a deep depletion layer is formed with a high voltage potential drop. Under dark conditions, the depletion layer cannot be filled, and the electric field across the h-BN tunnel barrier is relatively small. Under light irradiation, the deep depletion layer is gradually filled, and the h-BN tunneling layer withstands the increasing electric field, resulting in charge storage in the floating gate layer. Based on this mechanism, the device exhibits a gate voltage-dependent operation mode, including an integrated optical sensing-memory mode and an electrically driven storage mode. Under moderate gate voltage, the device can effectively detect the optical information with varied intensity and store the optical information in the floating gate, displaying optically controlled memory characteristics. Our work demonstrates a compact device structure for optical memory and displays excellent optically controlled memory performance, which can be applied in artificial vision systems.
{"title":"Heterogeneous integration of 2D materials on Si charge-coupled devices as optical memory","authors":"Zheng Bian, Feng Tian, Zongwen Li, Xiangwei Su, Tianjiao Zhang, Jialei Miao, Bin Yu, Yang Xu, Yuda Zhao","doi":"10.1007/s11432-024-3993-5","DOIUrl":"https://doi.org/10.1007/s11432-024-3993-5","url":null,"abstract":"<p>Optical memory integrates the function of optical sensing in memory devices, remarkably promoting the interconnection between sensory and memory terminals. Silicon charge-coupled photodetectors and floating gate memory have been widely used in imaging and storage technologies, respectively. However, the heterogeneous integration of the two devices requires technological innovation and complex electrical connections. In this work, we adopt a three-dimensional layer stacking method to design a novel optical memory device. On the top of Si charge-coupled photodetectors, we successively deposit two-dimensional graphene, hexagonal boron nitride, and molybdenum disulfide as a floating gate layer, a tunneling layer, and a readout layer, respectively. By applying a gate bias on lightly doped Si, a deep depletion layer is formed with a high voltage potential drop. Under dark conditions, the depletion layer cannot be filled, and the electric field across the h-BN tunnel barrier is relatively small. Under light irradiation, the deep depletion layer is gradually filled, and the h-BN tunneling layer withstands the increasing electric field, resulting in charge storage in the floating gate layer. Based on this mechanism, the device exhibits a gate voltage-dependent operation mode, including an integrated optical sensing-memory mode and an electrically driven storage mode. Under moderate gate voltage, the device can effectively detect the optical information with varied intensity and store the optical information in the floating gate, displaying optically controlled memory characteristics. Our work demonstrates a compact device structure for optical memory and displays excellent optically controlled memory performance, which can be applied in artificial vision systems.</p>","PeriodicalId":21618,"journal":{"name":"Science China Information Sciences","volume":"4 1","pages":""},"PeriodicalIF":8.8,"publicationDate":"2024-04-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140835915","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}