Science China Information Sciences最新文献

An STP look at logical blocking of finite state machines: formulation, detection, and search 有限状态机逻辑阻塞的 STP 观察：表述、检测和搜索

IF 8.8 2区计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS

Science China Information Sciences

Pub Date : 2024-09-13 DOI: 10.1007/s11432-022-4124-7

Yongyi Yan, Penglei Hao, Jumei Yue, Haitao Li, Jun-E Feng

The logical blocking of finite state machines (FSMs) is examined at the three levels of formulation, detection, and search from an STP viewpoint (semi-tensor product of matrices). The research idea regards an FSM as a logical system. The realizing method treats the event sequence exciting an FSM as the input signal of a logical system and treats the current states of an FSM as the states of a logical system. Based on a recently developed bilinear dynamic model of FSMs, a difference equation-like model is first proposed to describe the logical blocking. By defining a loop structure of FSMs and using the difference equation-like model, a criterion is built by which whether a given FSM is blocking can be easily judged. If it is, several algorithms are designed to find all the logical blocking of the FSM. Further, these results are extended to apply to the case of nondeterministic FSMs and, thus, to networks of FSMs. The proposed STP approach may provide a new angle for considering the problems of FSMs, and the presented results may strengthen the links between systems governed by human-designed rules and systems governed by natural laws.

从 STP（矩阵的半张量积）的观点出发，从表述、检测和搜索三个层面对有限状态机（FSM）的逻辑阻塞进行了研究。研究思路将 FSM 视为一个逻辑系统。实现方法将 FSM 的事件序列视为逻辑系统的输入信号，并将 FSM 的当前状态视为逻辑系统的状态。基于最近开发的 FSM 双线性动态模型，首先提出了一种类似差分方程的模型来描述逻辑阻塞。通过定义 FSM 的循环结构和使用类似差分方程的模型，建立了一个标准，通过这个标准可以很容易地判断给定的 FSM 是否阻塞。如果是，则设计几种算法来找出 FSM 的所有逻辑阻塞。此外，这些结果还被扩展应用于非确定性 FSM 的情况，从而应用于 FSM 网络。所提出的 STP 方法可为考虑 FSM 问题提供一个新的角度，所提出的结果可加强由人类设计的规则所支配的系统与由自然规律所支配的系统之间的联系。

{"title":"An STP look at logical blocking of finite state machines: formulation, detection, and search","authors":"Yongyi Yan, Penglei Hao, Jumei Yue, Haitao Li, Jun-E Feng","doi":"10.1007/s11432-022-4124-7","DOIUrl":"https://doi.org/10.1007/s11432-022-4124-7","url":null,"abstract":"The logical blocking of finite state machines (FSMs) is examined at the three levels of formulation, detection, and search from an STP viewpoint (semi-tensor product of matrices). The research idea regards an FSM as a logical system. The realizing method treats the event sequence exciting an FSM as the input signal of a logical system and treats the current states of an FSM as the states of a logical system. Based on a recently developed bilinear dynamic model of FSMs, a difference equation-like model is first proposed to describe the logical blocking. By defining a loop structure of FSMs and using the difference equation-like model, a criterion is built by which whether a given FSM is blocking can be easily judged. If it is, several algorithms are designed to find all the logical blocking of the FSM. Further, these results are extended to apply to the case of nondeterministic FSMs and, thus, to networks of FSMs. The proposed STP approach may provide a new angle for considering the problems of FSMs, and the presented results may strengthen the links between systems governed by human-designed rules and systems governed by natural laws.","PeriodicalId":21618,"journal":{"name":"Science China Information Sciences","volume":"192 1","pages":""},"PeriodicalIF":8.8,"publicationDate":"2024-09-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142252193","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

TSCompiler: efficient compilation framework for dynamic-shape models TSCompiler：动态形状模型的高效编译框架

IF 8.8 2区计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS

Science China Information Sciences

Pub Date : 2024-09-13 DOI: 10.1007/s11432-024-4071-6

Xiang Luo, Chen Zhang, Chenbo Geng, Yanzhi Yi, Jiahui Hu, Renwei Zhang, Zhen Zhang, Gianpietro Consolaro, Fan Yang, Tun Lu, Ning Gu, Li Shang

Today’s deep learning models face an increasing demand to handle dynamic shape tensors and computation whose shape information remains unknown at compile time and varies in a nearly infinite range at runtime. This shape dynamism brings tremendous challenges for existing compilation pipelines designed for static models which optimize tensor programs relying on exact shape values. This paper presents TSCompiler, an end-to-end compilation framework for dynamic shape models. TSCompiler first proposes a symbolic shape propagation algorithm to recover symbolic shape information at compile time to enable subsequent optimizations. TSCompiler then partitions the shape-annotated computation graph into multiple subgraphs and fine-tunes the backbone operators from the subgraph within a hardware-aligned search space to find a collection of high-performance schedules. TSCompiler can propagate the explored backbone schedule to other fusion groups within the same subgraph to generate a set of parameterized tensor programs for fused cases based on dependence analysis. At runtime, TSCompiler utilizes an occupancy-targeted cost model to select from pre-compiled tensor programs for varied tensor shapes. Extensive evaluations show that TSCompiler can achieve state-of-the-art speedups for dynamic shape models. For example, we can improve kernel efficiency by up to 3.97× on NVIDIA RTX3090, and 10.30 × on NVIDIA A100 and achieve up to five orders of magnitude speedups on end-to-end latency.

当今的深度学习模型越来越需要处理动态形状张量和计算，其形状信息在编译时是未知的，而在运行时变化范围几乎是无限的。这种形状动态性给为静态模型设计的现有编译流水线带来了巨大挑战，这些流水线依赖精确的形状值来优化张量程序。本文介绍了用于动态形状模型的端到端编译框架 TSCompiler。TSCompiler 首先提出了一种符号形状传播算法，用于在编译时恢复符号形状信息，以便进行后续优化。然后，TSCompiler 将形状标注的计算图分割成多个子图，并在硬件对齐的搜索空间内对子图中的骨干算子进行微调，以找到高性能的时间表集合。TSCompiler 可将探索出的骨干计划传播到同一子图中的其他融合组，从而根据依赖性分析为融合案例生成一组参数化的张量程序。在运行时，TSCompiler 利用占用目标成本模型，从预先编译的张量程序中选择适合不同张量形状的程序。广泛的评估表明，TSCompiler 可为动态形状模型实现最先进的提速。例如，我们可以在英伟达 RTX3090 上将内核效率提高 3.97 倍，在英伟达 A100 上提高 10.30 倍，并在端到端延迟方面实现高达 5 个数量级的提速。

{"title":"TSCompiler: efficient compilation framework for dynamic-shape models","authors":"Xiang Luo, Chen Zhang, Chenbo Geng, Yanzhi Yi, Jiahui Hu, Renwei Zhang, Zhen Zhang, Gianpietro Consolaro, Fan Yang, Tun Lu, Ning Gu, Li Shang","doi":"10.1007/s11432-024-4071-6","DOIUrl":"https://doi.org/10.1007/s11432-024-4071-6","url":null,"abstract":"Today’s deep learning models face an increasing demand to handle dynamic shape tensors and computation whose shape information remains unknown at compile time and varies in a nearly infinite range at runtime. This shape dynamism brings tremendous challenges for existing compilation pipelines designed for static models which optimize tensor programs relying on exact shape values. This paper presents TSCompiler, an end-to-end compilation framework for dynamic shape models. TSCompiler first proposes a symbolic shape propagation algorithm to recover symbolic shape information at compile time to enable subsequent optimizations. TSCompiler then partitions the shape-annotated computation graph into multiple subgraphs and fine-tunes the backbone operators from the subgraph within a hardware-aligned search space to find a collection of high-performance schedules. TSCompiler can propagate the explored backbone schedule to other fusion groups within the same subgraph to generate a set of parameterized tensor programs for fused cases based on dependence analysis. At runtime, TSCompiler utilizes an occupancy-targeted cost model to select from pre-compiled tensor programs for varied tensor shapes. Extensive evaluations show that TSCompiler can achieve state-of-the-art speedups for dynamic shape models. For example, we can improve kernel efficiency by up to 3.97× on NVIDIA RTX3090, and 10.30 × on NVIDIA A100 and achieve up to five orders of magnitude speedups on end-to-end latency.","PeriodicalId":21618,"journal":{"name":"Science China Information Sciences","volume":"39 1","pages":""},"PeriodicalIF":8.8,"publicationDate":"2024-09-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142252190","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

State and parameter identification of linearized water wave equation via adjoint method 通过邻接法识别线性化水波方程的状态和参数

IF 8.8 2区计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS

Science China Information Sciences

Pub Date : 2024-09-13 DOI: 10.1007/s11432-023-4094-4

Yang Yu, Cheng-Zhong Xu, Hai-Long Pei, Jinpeng Yu

In this paper, we focus on the state and parameter identification problem of a hydrodynamical system. This system is modeled as a linearized water wave equation (LWWE), a hyperbolic state-space model coupled with a Laplace equation. We assume that the wave elevation at two distinct points is the only measurement of water waves. We show that the state and water depth can be reconstructed from this point measurement records. The identification problem is recast as an optimization problem over an infinite-dimensional space. We propose the adjoint method-based identification algorithm to generate an estimated state and water depth. We then performed a numerical simulation to show the effectiveness of our designed algorithm by comparing it with existing studies.

本文重点讨论水动力系统的状态和参数识别问题。该系统的模型是线性化水波方程（LWWE），这是一个与拉普拉斯方程耦合的双曲状态空间模型。我们假设两个不同点的波高是水波的唯一测量值。我们的研究表明，状态和水深可以从这个点的测量记录中重建。识别问题被重构为无限维空间上的优化问题。我们提出了基于邻接法的识别算法，以生成估计的状态和水深。然后，我们进行了数值模拟，通过与现有研究的比较，展示了我们设计的算法的有效性。

引用次数: 0

NeurDB: an AI-powered autonomous data system NeurDB：人工智能驱动的自主数据系统

IF 8.8 2区计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS

Science China Information Sciences

Pub Date : 2024-09-13 DOI: 10.1007/s11432-024-4125-9

Beng Chin Ooi, Shaofeng Cai, Gang Chen, Yanyan Shen, Kian-Lee Tan, Yuncheng Wu, Xiaokui Xiao, Naili Xing, Cong Yue, Lingze Zeng, Meihui Zhang, Zhanhao Zhao

In the wake of rapid advancements in artificial intelligence (AI), we stand on the brink of a transformative leap in data systems. The imminent fusion of AI and DB (AI×DB) promises a new generation of data systems, which will relieve the burden on end-users across all industry sectors by featuring AI-enhanced functionalities, such as personalized and automated in-database AI-powered analytics, and self-driving capabilities for improved system performance. In this paper, we explore the evolution of data systems with a focus on deepening the fusion of AI and DB. We present NeurDB, an AI-powered autonomous data system designed to fully embrace AI design in each major system component and provide in-database AI-powered analytics. We outline the conceptual and architectural overview of NeurDB, discuss its design choices and key components, and report its current development and future plan.

随着人工智能（AI）的飞速发展，我们正站在数据系统变革性飞跃的边缘。人工智能和数据库（AI×DB）即将融合，这将带来新一代数据系统，通过人工智能增强的功能，如个性化和自动化的数据库内人工智能分析，以及提高系统性能的自驱动功能，减轻所有行业领域终端用户的负担。在本文中，我们将探索数据系统的演变，重点是深化人工智能与数据库的融合。我们介绍了 NeurDB，这是一个人工智能驱动的自主数据系统，旨在在每个主要系统组件中全面采用人工智能设计，并提供数据库内人工智能驱动的分析。我们概述了 NeurDB 的概念和架构，讨论了其设计选择和关键组件，并报告了其当前发展和未来计划。

引用次数: 0

Envisioning future deep learning theories: some basic concepts and characteristics 展望未来的深度学习理论：一些基本概念和特征

IF 8.8 2区计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS

Science China Information Sciences

Pub Date : 2024-09-13 DOI: 10.1007/s11432-023-4129-1

Weijie J. Su

To advance deep learning methodologies in the next decade, a theoretical framework for reasoning about modern neural networks is needed. While efforts are increasing toward demystifying why deep learning is so effective, a comprehensive picture remains lacking, suggesting that a better theory is possible. We argue that a future deep learning theory should inherit three characteristics: a hierarchically structured network architecture, parameters iteratively optimized using stochastic gradient-based methods, and information from the data that evolves compressively. As an instantiation, we integrate these characteristics into a graphical model called neurashed. This model effectively explains some common empirical patterns in deep learning. In particular, neurashed enables insights into implicit regularization, information bottleneck, and local elasticity. Finally, we discuss how neurashed can guide the development of deep learning theories.

为了在下一个十年推进深度学习方法，我们需要一个理论框架来推理现代神经网络。虽然越来越多的人在努力揭开深度学习为何如此有效的神秘面纱，但仍然缺乏一个全面的图景，这表明有可能出现更好的理论。我们认为，未来的深度学习理论应继承三个特点：分层结构的网络架构、使用基于随机梯度的方法迭代优化的参数以及压缩演化的数据信息。作为实例化，我们将这些特征整合到一个名为 neurashed 的图形模型中。该模型能有效解释深度学习中一些常见的经验模式。特别是，neurashed 能让我们深入了解隐式正则化、信息瓶颈和局部弹性。最后，我们将讨论 neurashed 如何指导深度学习理论的发展。

引用次数: 0

Weighted sum power maximization for STAR-RIS-aided SWIPT systems with nonlinear energy harvesting 具有非线性能量采集功能的 STAR-RIS 辅助 SWIPT 系统的加权和功率最大化

IF 8.8 2区计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS

Science China Information Sciences

Pub Date : 2024-09-13 DOI: 10.1007/s11432-024-4102-3

Weiping Shi, Cunhua Pan, Feng Shu, Yongpeng Wu, Jiangzhou Wang, Yongqiang Bao, Jin Tian

The conventional reconfigurable intelligent surface (RIS) is limited to reflecting incident signals, thereby imposing constraints on the placement of the transmitter and receiver, which hinders achieving comprehensive signal coverage across an entire area. This paper investigates a simultaneously transmitting and reflecting (STAR)-RIS-aided simultaneous wireless information and power transfer (SWIPT) system with a nonlinear energy harvesting model under three different RIS transmission protocols: energy splitting (ES), time switching (TS), and mode switching (MS). The objective of this paper is to maximize the weighted sum power (WSP) of all energy harvesting receivers (EHRs) while ensuring fairness in the collected power among them. This is achieved by jointly optimizing the transmit beamforming at the base station (BS) and the transmission and reflection coefficients at the STAR-RIS, subject to rate constraints for information decoding receivers (IDRs), transmit power constraint at the BS, and coefficient constraints of each element at the STAR-RIS corresponding to the three protocols. Solving this optimization problem poses challenges because of the complicated objective function and numerous coupled optimization variables of the ES STAR-RIS. To address this complexity, an efficient alternating optimization (AO) approach is proposed as an iterative solution method that achieves suboptimal results. The AO algorithm is then extended to MS STAR-RIS and TS STAR-RIS. Specifically, for MS STRA-RIS, binary constraints in the STAR-RIS coefficient optimization subproblem are handled using the first-order approximation technique along with the penalty function method. For TS STAR-RIS, apart from optimizing BS transmit beamforming and STAR-RIS coefficients subproblems, the transmission and reflection time allocation of STAR-RIS also needs optimization. Numerical findings demonstrate that compared to conventional RIS-aided systems, utilizing three different protocols in a STAR-RIS-aided system can enhance power collection at EHRs while expanding the receiver placement range. Furthermore, TS STAR-RIS performs best when the IDRs do not require high achieved rates. Otherwise, ES is the best choice.

传统的可重构智能表面（RIS）仅限于反射入射信号，因此对发射器和接收器的位置造成了限制，从而阻碍了信号在整个区域的全面覆盖。本文研究了一种同时发射和反射（STAR）-RIS 辅助同步无线信息和功率传输（SWIPT）系统，该系统在三种不同的 RIS 传输协议（能量分割（ES）、时间切换（TS）和模式切换（MS））下采用非线性能量收集模型。本文的目标是最大化所有能量收集接收器（EHR）的加权总功率（WSP），同时确保它们之间收集功率的公平性。要实现这一目标，需要联合优化基站（BS）的发射波束成形以及 STAR-RIS 的发射和反射系数，同时还要考虑信息解码接收器（IDR）的速率限制、BS 的发射功率限制以及 STAR-RIS 中与三种协议相对应的每个元素的系数限制。由于 ES STAR-RIS 的目标函数复杂，耦合优化变量众多，因此解决这一优化问题是一项挑战。为解决这一复杂问题，我们提出了一种高效的交替优化（AO）方法，作为一种迭代求解方法，可获得次优结果。随后，AO 算法被扩展到 MS STAR-RIS 和 TS STAR-RIS。具体来说，对于 MS STRA-RIS，在 STAR-RIS 系数优化子问题中使用一阶近似技术和惩罚函数法处理二进制约束。对于 TS STAR-RIS，除了优化 BS 发射波束成形和 STAR-RIS 系数子问题外，还需要优化 STAR-RIS 的发射和反射时间分配。数值研究结果表明，与传统的 RIS 辅助系统相比，在 STAR-RIS 辅助系统中使用三种不同的协议可以增强 EHR 的功率收集，同时扩大接收器的放置范围。此外，当 IDR 不需要很高的实现率时，TS STAR-RIS 的性能最佳。否则，ES 是最佳选择。

{"title":"Weighted sum power maximization for STAR-RIS-aided SWIPT systems with nonlinear energy harvesting","authors":"Weiping Shi, Cunhua Pan, Feng Shu, Yongpeng Wu, Jiangzhou Wang, Yongqiang Bao, Jin Tian","doi":"10.1007/s11432-024-4102-3","DOIUrl":"https://doi.org/10.1007/s11432-024-4102-3","url":null,"abstract":"The conventional reconfigurable intelligent surface (RIS) is limited to reflecting incident signals, thereby imposing constraints on the placement of the transmitter and receiver, which hinders achieving comprehensive signal coverage across an entire area. This paper investigates a simultaneously transmitting and reflecting (STAR)-RIS-aided simultaneous wireless information and power transfer (SWIPT) system with a nonlinear energy harvesting model under three different RIS transmission protocols: energy splitting (ES), time switching (TS), and mode switching (MS). The objective of this paper is to maximize the weighted sum power (WSP) of all energy harvesting receivers (EHRs) while ensuring fairness in the collected power among them. This is achieved by jointly optimizing the transmit beamforming at the base station (BS) and the transmission and reflection coefficients at the STAR-RIS, subject to rate constraints for information decoding receivers (IDRs), transmit power constraint at the BS, and coefficient constraints of each element at the STAR-RIS corresponding to the three protocols. Solving this optimization problem poses challenges because of the complicated objective function and numerous coupled optimization variables of the ES STAR-RIS. To address this complexity, an efficient alternating optimization (AO) approach is proposed as an iterative solution method that achieves suboptimal results. The AO algorithm is then extended to MS STAR-RIS and TS STAR-RIS. Specifically, for MS STRA-RIS, binary constraints in the STAR-RIS coefficient optimization subproblem are handled using the first-order approximation technique along with the penalty function method. For TS STAR-RIS, apart from optimizing BS transmit beamforming and STAR-RIS coefficients subproblems, the transmission and reflection time allocation of STAR-RIS also needs optimization. Numerical findings demonstrate that compared to conventional RIS-aided systems, utilizing three different protocols in a STAR-RIS-aided system can enhance power collection at EHRs while expanding the receiver placement range. Furthermore, TS STAR-RIS performs best when the IDRs do not require high achieved rates. Otherwise, ES is the best choice.","PeriodicalId":21618,"journal":{"name":"Science China Information Sciences","volume":"16 1","pages":""},"PeriodicalIF":8.8,"publicationDate":"2024-09-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142252170","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Automatically identifying imperfections and attacks in practical quantum key distribution systems via machine learning 通过机器学习自动识别实用量子密钥分发系统中的缺陷和攻击

IF 8.8 2区计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS

Science China Information Sciences

Pub Date : 2024-09-10 DOI: 10.1007/s11432-023-3988-x

Jiaxin Xu, Xiao Ma, Jingyang Liu, Chunhui Zhang, Hongwei Li, Xingyu Zhou, Qin Wang

The realistic security of quantum key distribution (QKD) systems is currently a hot research topic in the field of quantum communications. There are always defects in practical devices, and eavesdroppers can make use of the security risk points of various devices to obtain key information. To date, current types of security analysis tend to analyze each security risk point individually, thereby posing great challenges for the overall security evaluation of QKD systems. In this paper, for the first time, we employ machine learning algorithms to identify the defects of different devices and certain attacks in real time, with an accuracy of 98%. It provides a novel solution for the practical security evaluation of QKD systems, thereby addressing the bottleneck problem of multiple risk points being difficult to address simultaneously in QKD systems, thus paving the way for the future large-scale application of quantum communication networks.

量子密钥分发（QKD）系统的现实安全性是目前量子通信领域的热门研究课题。实际设备总是存在缺陷，窃听者可以利用各种设备的安全风险点获取密钥信息。迄今为止，现有类型的安全分析倾向于单独分析每个安全风险点，从而给 QKD 系统的整体安全评估带来了巨大挑战。本文首次采用机器学习算法实时识别不同设备的缺陷和某些攻击，准确率高达 98%。这为 QKD 系统的实际安全性评估提供了一种新颖的解决方案，从而解决了 QKD 系统中多个风险点难以同时解决的瓶颈问题，为量子通信网络未来的大规模应用铺平了道路。

引用次数: 0

Ultra-low power IGZO optoelectronic synaptic transistors for neuromorphic computing 用于神经形态计算的超低功耗 IGZO 光电突触晶体管

IF 8.8 2区计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS

Science China Information Sciences

Pub Date : 2024-09-10 DOI: 10.1007/s11432-023-3966-8

Li Zhu, Sixian Li, Junchen Lin, Yuanfeng Zhao, Xiang Wan, Huabin Sun, Shancheng Yan, Yong Xu, Zhihao Yu, Chee Leong Tan, Gang He

Inspired by biological visual systems, optoelectronic synapses with image perception, memory retention, and preprocessing capabilities offer a promising pathway for developing high-performance artificial perceptual vision computing systems. Among these, oxide-based optoelectronic synaptic transistors are well-known for their enduring photoconductive properties and ease of integration, which hold substantial potential in this regard. In this study, we utilized indium gallium zinc oxide as a semiconductor layer and high-k ZrAlO_x as a gate dielectric layer to engineer low-power high-performance synaptic transistors with photonic memory. Crucial biological synaptic functions, including excitatory postsynaptic currents, paired-pulse facilitation, and the transition from short-term to long-term plasticity, were replicated via optical pulse modulation. This simulation was sustained even at an operating voltage as low as 0.0001 V, exhibiting a conspicuous photonic synaptic response with energy consumption as low as 0.0845 fJ per synaptic event. Furthermore, an optoelectronic synaptic device was employed to model “learn-forget-relearn” behavior similar to that exhibited by the human brain, as well as Morse code encoding. Finally, a 3 × 3 device array was constructed to demonstrate its advantages in image recognition and storage. This study provides an effective strategy for developing readily integrable, ultralow-power optoelectronic synapses with substantial potential in the domains of morphological visual systems, biomimetic robotics, and artificial intelligence.

受生物视觉系统的启发，具有图像感知、记忆保持和预处理能力的光电突触为开发高性能人工感知视觉计算系统提供了一条大有可为的途径。其中，基于氧化物的光电突触晶体管以其持久的光电导特性和易于集成而著称，在这方面具有巨大的潜力。在这项研究中，我们利用铟镓锌氧化物作为半导体层，高k ZrAlOx 作为栅极电介质层，设计出了具有光子记忆功能的低功耗高性能突触晶体管。关键的生物突触功能，包括兴奋性突触后电流、成对脉冲促进以及从短期可塑性到长期可塑性的过渡，都通过光脉冲调制得以复制。即使在工作电压低至 0.0001 V 的情况下，这种模拟仍能持续，并表现出明显的光子突触反应，每次突触事件的能量消耗低至 0.0845 fJ。此外，还利用光电突触装置模拟了与人脑类似的 "学习-遗忘-再学习 "行为以及莫尔斯电码编码。最后，还构建了一个 3 × 3 设备阵列，以展示其在图像识别和存储方面的优势。这项研究为开发易于集成的超低功耗光电突触提供了有效策略，在形态视觉系统、仿生机器人和人工智能领域具有巨大潜力。

{"title":"Ultra-low power IGZO optoelectronic synaptic transistors for neuromorphic computing","authors":"Li Zhu, Sixian Li, Junchen Lin, Yuanfeng Zhao, Xiang Wan, Huabin Sun, Shancheng Yan, Yong Xu, Zhihao Yu, Chee Leong Tan, Gang He","doi":"10.1007/s11432-023-3966-8","DOIUrl":"https://doi.org/10.1007/s11432-023-3966-8","url":null,"abstract":"Inspired by biological visual systems, optoelectronic synapses with image perception, memory retention, and preprocessing capabilities offer a promising pathway for developing high-performance artificial perceptual vision computing systems. Among these, oxide-based optoelectronic synaptic transistors are well-known for their enduring photoconductive properties and ease of integration, which hold substantial potential in this regard. In this study, we utilized indium gallium zinc oxide as a semiconductor layer and high-k ZrAlOx as a gate dielectric layer to engineer low-power high-performance synaptic transistors with photonic memory. Crucial biological synaptic functions, including excitatory postsynaptic currents, paired-pulse facilitation, and the transition from short-term to long-term plasticity, were replicated via optical pulse modulation. This simulation was sustained even at an operating voltage as low as 0.0001 V, exhibiting a conspicuous photonic synaptic response with energy consumption as low as 0.0845 fJ per synaptic event. Furthermore, an optoelectronic synaptic device was employed to model “learn-forget-relearn” behavior similar to that exhibited by the human brain, as well as Morse code encoding. Finally, a 3 × 3 device array was constructed to demonstrate its advantages in image recognition and storage. This study provides an effective strategy for developing readily integrable, ultralow-power optoelectronic synapses with substantial potential in the domains of morphological visual systems, biomimetic robotics, and artificial intelligence.","PeriodicalId":21618,"journal":{"name":"Science China Information Sciences","volume":"73 1","pages":""},"PeriodicalIF":8.8,"publicationDate":"2024-09-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142189439","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

An efficient schedulability analysis based on worst-case interference time for real-time systems 基于最坏情况干扰时间的实时系统高效可调度性分析

IF 8.8 2区计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS

Science China Information Sciences

Pub Date : 2024-08-20 DOI: 10.1007/s11432-022-3891-4

Hongbiao Liu, Mengfei Yang, Lei Qiao, Xi Chen, Jian Gong

Real-time systems are widely implemented in the Internet of Things (IoT) and safety-critical systems, both of which have generated enormous social value. Aiming at the classic schedulability analysis problem in real-time systems, we proposed an exact Boolean analysis based on interference (EBAI) for schedulability analysis in real-time systems. EBAI is based on worst-case interference time (WCIT), which considers both the release jitter and blocking time of the task. We improved the efficiency of the three existing tests and provided a comprehensive summary of related research results in the field. Abundant experiments were conducted to compare EBAI with other related results. Our evaluation showed that in certain cases, the runtime gain achieved using our analysis method may exceed 73% compared to the state-of-the-art schedulability test. Furthermore, the benefits obtained from our tests grew with the number of tasks, reaching a level suitable for practical application. EBAI is oriented to the five-tuple real-time task model with stronger expression ability and possesses a low runtime overhead. These characteristics make it applicable in various real-time systems such as spacecraft, autonomous vehicles, industrial robots, and traffic command systems.

实时系统广泛应用于物联网（IoT）和安全关键型系统，两者都产生了巨大的社会价值。针对实时系统中经典的可调度性分析问题，我们提出了一种基于干扰的精确布尔分析法（EBAI），用于实时系统的可调度性分析。EBAI 基于最坏情况干扰时间（WCIT），它同时考虑了任务的释放抖动和阻塞时间。我们提高了现有三种测试的效率，并对该领域的相关研究成果进行了全面总结。我们还进行了大量实验，将 EBAI 与其他相关成果进行比较。我们的评估表明，在某些情况下，与最先进的可调度性测试相比，使用我们的分析方法所获得的运行时间收益可能超过 73%。此外，我们的测试所获得的收益随着任务数量的增加而增加，达到了适合实际应用的水平。EBAI 面向五元组实时任务模型，具有更强的表达能力和较低的运行时开销。这些特点使其适用于各种实时系统，如航天器、自动驾驶汽车、工业机器人和交通指挥系统。

{"title":"An efficient schedulability analysis based on worst-case interference time for real-time systems","authors":"Hongbiao Liu, Mengfei Yang, Lei Qiao, Xi Chen, Jian Gong","doi":"10.1007/s11432-022-3891-4","DOIUrl":"https://doi.org/10.1007/s11432-022-3891-4","url":null,"abstract":"Real-time systems are widely implemented in the Internet of Things (IoT) and safety-critical systems, both of which have generated enormous social value. Aiming at the classic schedulability analysis problem in real-time systems, we proposed an exact Boolean analysis based on interference (EBAI) for schedulability analysis in real-time systems. EBAI is based on worst-case interference time (WCIT), which considers both the release jitter and blocking time of the task. We improved the efficiency of the three existing tests and provided a comprehensive summary of related research results in the field. Abundant experiments were conducted to compare EBAI with other related results. Our evaluation showed that in certain cases, the runtime gain achieved using our analysis method may exceed 73% compared to the state-of-the-art schedulability test. Furthermore, the benefits obtained from our tests grew with the number of tasks, reaching a level suitable for practical application. EBAI is oriented to the five-tuple real-time task model with stronger expression ability and possesses a low runtime overhead. These characteristics make it applicable in various real-time systems such as spacecraft, autonomous vehicles, industrial robots, and traffic command systems.","PeriodicalId":21618,"journal":{"name":"Science China Information Sciences","volume":"66 1","pages":""},"PeriodicalIF":8.8,"publicationDate":"2024-08-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142189446","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Relative difficulty distillation for semantic segmentation 语义分割的相对难度提炼

IF 8.8 2区计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS

Science China Information Sciences

Pub Date : 2024-08-20 DOI: 10.1007/s11432-023-4061-2

Dong Liang, Yue Sun, Yun Du, Songcan Chen, Sheng-Jun Huang

Current knowledge distillation (KD) methods primarily focus on transferring various structured knowledge and designing corresponding optimization goals to encourage the student network to imitate the output of the teacher network. However, introducing too many additional optimization objectives may lead to unstable training, such as gradient conflicts. Moreover, these methods ignored the guidelines of relative learning difficulty between the teacher and student networks. Inspired by human cognitive science, in this paper, we redefine knowledge from a new perspective — the student and teacher networks’ relative difficulty of samples, and propose a pixel-level KD paradigm for semantic segmentation named relative difficulty distillation (RDD). We propose a two-stage RDD framework: teacher-full evaluated RDD (TFE-RDD) and teacher-student evaluated RDD (TSE-RDD). RDD allows the teacher network to provide effective guidance on learning focus without additional optimization goals, thus avoiding adjusting learning weights for multiple losses. Extensive experimental evaluations using a general distillation loss function on popular datasets such as Cityscapes, CamVid, Pascal VOC, and ADE20k demonstrate the effectiveness of RDD against state-of-the-art KD methods. Additionally, our research showcases that RDD can integrate with existing KD methods to improve their upper performance bound. Codes are available at https://github.com/sunyueue/RDD.git.

目前的知识提炼（KD）方法主要侧重于传输各种结构化知识并设计相应的优化目标，以鼓励学生网络模仿教师网络的输出。然而，引入过多的额外优化目标可能会导致训练不稳定，如梯度冲突。此外，这些方法忽略了教师网络和学生网络之间相对学习难度的准则。受人类认知科学的启发，本文从一个新的角度--学生和教师网络样本的相对难度--重新定义知识，并提出了一种像素级的语义分割 KD 范式，命名为相对难度提炼（RDD）。我们提出了一个两阶段 RDD 框架：教师全评估 RDD（TFE-RDD）和师生评估 RDD（TSE-RDD）。RDD 允许教师网络为学习重点提供有效指导，而无需额外的优化目标，从而避免了为多重损失调整学习权重。我们使用通用蒸馏损失函数对 Cityscapes、CamVid、Pascal VOC 和 ADE20k 等流行数据集进行了广泛的实验评估，证明了 RDD 与最先进的 KD 方法相比的有效性。此外，我们的研究还表明，RDD 可以与现有的 KD 方法相结合，以提高其性能上限。代码见 https://github.com/sunyueue/RDD.git。

{"title":"Relative difficulty distillation for semantic segmentation","authors":"Dong Liang, Yue Sun, Yun Du, Songcan Chen, Sheng-Jun Huang","doi":"10.1007/s11432-023-4061-2","DOIUrl":"https://doi.org/10.1007/s11432-023-4061-2","url":null,"abstract":"Current knowledge distillation (KD) methods primarily focus on transferring various structured knowledge and designing corresponding optimization goals to encourage the student network to imitate the output of the teacher network. However, introducing too many additional optimization objectives may lead to unstable training, such as gradient conflicts. Moreover, these methods ignored the guidelines of relative learning difficulty between the teacher and student networks. Inspired by human cognitive science, in this paper, we redefine knowledge from a new perspective — the student and teacher networks’ relative difficulty of samples, and propose a pixel-level KD paradigm for semantic segmentation named relative difficulty distillation (RDD). We propose a two-stage RDD framework: teacher-full evaluated RDD (TFE-RDD) and teacher-student evaluated RDD (TSE-RDD). RDD allows the teacher network to provide effective guidance on learning focus without additional optimization goals, thus avoiding adjusting learning weights for multiple losses. Extensive experimental evaluations using a general distillation loss function on popular datasets such as Cityscapes, CamVid, Pascal VOC, and ADE20k demonstrate the effectiveness of RDD against state-of-the-art KD methods. Additionally, our research showcases that RDD can integrate with existing KD methods to improve their upper performance bound. Codes are available at https://github.com/sunyueue/RDD.git.","PeriodicalId":21618,"journal":{"name":"Science China Information Sciences","volume":"51 1","pages":""},"PeriodicalIF":8.8,"publicationDate":"2024-08-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142189442","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0