首页 > 最新文献

Proceedings of the 10th ACM/IEEE International Conference on Cyber-Physical Systems最新文献

英文 中文
Toward multi-task support and security analyses in PLC program translation for verification: poster abstract 面向PLC程序翻译验证的多任务支持和安全性分析:海报摘要
Luis Garcia, Stefan Mitsch, André Platzer
In this poster, we will present new tool, HyPLC [30], which implements the translation of discrete control code of verified hybrid program models to PLC controller code and, vice versa, the translation of existing PLC code into the discrete control actions for a hybrid program given an additional input of the continuous dynamics of the system to be verified. This approach allows for the generation of real controller code while preserving, by compilation, the correctness of a valid and verified hybrid program. PLCs are common cyber-physical interfaces for safety-critical industrial control applications, and HyPLC serves as a pragmatic tool for bridging formal verification of complex cyber-physical systems at the algorithmic level of hybrid programs with the execution layer of concrete PLC implementations. This poster will also discuss future directions of HyPLC such as support of multiple tasks (and, by extension, multiple PLCs), interval arithmetic, as well as security analyses.
在这张海报中,我们将展示新工具HyPLC[30],它实现了将经过验证的混合程序模型的离散控制代码转换为PLC控制器代码,反之亦然,将现有PLC代码转换为混合程序的离散控制动作,给定待验证系统的连续动力学的额外输入。这种方法允许生成真实的控制器代码,同时通过编译保持有效和经过验证的混合程序的正确性。PLC是安全关键型工业控制应用的常见网络物理接口,HyPLC是一种实用的工具,用于在混合程序的算法级别连接复杂网络物理系统的正式验证与具体PLC实现的执行层。这张海报还将讨论HyPLC的未来发展方向,如支持多任务(以及扩展到多个plc)、区间算法以及安全性分析。
{"title":"Toward multi-task support and security analyses in PLC program translation for verification: poster abstract","authors":"Luis Garcia, Stefan Mitsch, André Platzer","doi":"10.1145/3302509.3313335","DOIUrl":"https://doi.org/10.1145/3302509.3313335","url":null,"abstract":"In this poster, we will present new tool, HyPLC [30], which implements the translation of discrete control code of verified hybrid program models to PLC controller code and, vice versa, the translation of existing PLC code into the discrete control actions for a hybrid program given an additional input of the continuous dynamics of the system to be verified. This approach allows for the generation of real controller code while preserving, by compilation, the correctness of a valid and verified hybrid program. PLCs are common cyber-physical interfaces for safety-critical industrial control applications, and HyPLC serves as a pragmatic tool for bridging formal verification of complex cyber-physical systems at the algorithmic level of hybrid programs with the execution layer of concrete PLC implementations. This poster will also discuss future directions of HyPLC such as support of multiple tasks (and, by extension, multiple PLCs), interval arithmetic, as well as security analyses.","PeriodicalId":413733,"journal":{"name":"Proceedings of the 10th ACM/IEEE International Conference on Cyber-Physical Systems","volume":"77 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-04-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132109859","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Deep intelligent network for device-free people tracking: WIP abstract 无设备人员跟踪的深度智能网络:WIP摘要
Yang Zhao, Ming-Ching Chang, P. Tu
Recent radio frequency (RF) sensing techniques use a network of RF sensors to detect and locate people that do not carry any devices and can operate in non line-of-sight environments. Model-based device-free RF sensing systems use statistical models to quantify human presence and motion based on the received RF signal measurements. However, such methods often require the fine tuning of multiple model-dependent parameters in order to achieve sub meter accuracy. In this work, we propose to use deep neural networks together with visual tracking systems to effectively generate training data so as to learn a general model. Our method can automatically produce human motion and occupancy images from RF sensor network measurements without the need for manual RF model parameter tuning.
最近的射频(RF)传感技术使用射频传感器网络来检测和定位不携带任何设备并且可以在非视线环境中操作的人。基于模型的无设备射频传感系统使用统计模型根据接收到的射频信号测量来量化人的存在和运动。然而,这种方法通常需要对多个模型相关参数进行微调,以达到亚米精度。在这项工作中,我们提出使用深度神经网络与视觉跟踪系统一起有效地生成训练数据,从而学习通用模型。我们的方法可以从射频传感器网络测量中自动生成人体运动和占用图像,而无需手动调整射频模型参数。
{"title":"Deep intelligent network for device-free people tracking: WIP abstract","authors":"Yang Zhao, Ming-Ching Chang, P. Tu","doi":"10.1145/3302509.3313312","DOIUrl":"https://doi.org/10.1145/3302509.3313312","url":null,"abstract":"Recent radio frequency (RF) sensing techniques use a network of RF sensors to detect and locate people that do not carry any devices and can operate in non line-of-sight environments. Model-based device-free RF sensing systems use statistical models to quantify human presence and motion based on the received RF signal measurements. However, such methods often require the fine tuning of multiple model-dependent parameters in order to achieve sub meter accuracy. In this work, we propose to use deep neural networks together with visual tracking systems to effectively generate training data so as to learn a general model. Our method can automatically produce human motion and occupancy images from RF sensor network measurements without the need for manual RF model parameter tuning.","PeriodicalId":413733,"journal":{"name":"Proceedings of the 10th ACM/IEEE International Conference on Cyber-Physical Systems","volume":"109 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-04-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121994525","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Robustness evaluation of computer-aided clinical trials for medical devices 医疗器械计算机辅助临床试验的稳健性评价
Kuk Jin Jang, Y. Pant, Bo Zhang, James Weimer, R. Mangharam
Medical cyber-physical systems, such as the implantable cardioverter defibrillator (ICD), require evaluation of safety and efficacy in the context of a patient population in a clinical trial. Advances in computer modeling and simulation allow for generation of a simulated cohort or virtual cohort which mimics a patient population and can be used as a source of prior information. A major obstacle to acceptance of simulation results as a source of prior information is the lack of a framework for explicitly modeling sources of uncertainty in simulation results and quantifying the effect on trial outcomes. In this work, we formulate the Computer-Aided Clinical Trial (CACT) within a Bayesian statistical framework allowing explicit modeling of assumptions and utilization of simulation results at all stages of a clinical trial. To quantify the robustness of the CACT outcome with respect to a simulation assumption, we define δ-robustness as the minimum perturbation of the base prior distribution resulting in a change of the CACT outcome and provide a method to estimate the δ-robustness. We demonstrate the utility of the framework and how the results of δ-robustness evaluation can be utilized at various stages of a clinical trial through an application to the Rhythm ID Goes Head-to-head Trial (RIGHT), which was a comparative evaluation of the safety and efficacy of specific software algorithms across different implantable cardiac devices. Finally, we introduce a hardware interface that allows for direct interaction with the physical device in order to validate and confirm the results of a CACT for implantable cardiac devices.
医疗信息物理系统,如植入式心律转复除颤器(ICD),需要在临床试验中对患者群体进行安全性和有效性评估。计算机建模和仿真的进步允许生成模拟患者群体的模拟队列或虚拟队列,并可作为先验信息的来源。接受模拟结果作为先验信息来源的主要障碍是缺乏明确建模模拟结果不确定性来源和量化对试验结果影响的框架。在这项工作中,我们在贝叶斯统计框架内制定了计算机辅助临床试验(CACT),允许在临床试验的所有阶段对假设和模拟结果进行明确建模。为了量化相对于模拟假设的CACT结果的稳健性,我们将δ-鲁棒性定义为导致CACT结果变化的基本先验分布的最小扰动,并提供了一种估计δ-鲁棒性的方法。我们展示了该框架的实用性,以及δ-稳健性评估的结果如何通过应用于Rhythm ID Goes Head-to-head试验(右)在临床试验的各个阶段得到利用,该试验是对不同植入式心脏装置的特定软件算法的安全性和有效性的比较评估。最后,我们介绍了一个硬件接口,允许与物理设备直接交互,以验证和确认植入式心脏设备的CACT结果。
{"title":"Robustness evaluation of computer-aided clinical trials for medical devices","authors":"Kuk Jin Jang, Y. Pant, Bo Zhang, James Weimer, R. Mangharam","doi":"10.1145/3302509.3311058","DOIUrl":"https://doi.org/10.1145/3302509.3311058","url":null,"abstract":"Medical cyber-physical systems, such as the implantable cardioverter defibrillator (ICD), require evaluation of safety and efficacy in the context of a patient population in a clinical trial. Advances in computer modeling and simulation allow for generation of a simulated cohort or virtual cohort which mimics a patient population and can be used as a source of prior information. A major obstacle to acceptance of simulation results as a source of prior information is the lack of a framework for explicitly modeling sources of uncertainty in simulation results and quantifying the effect on trial outcomes. In this work, we formulate the Computer-Aided Clinical Trial (CACT) within a Bayesian statistical framework allowing explicit modeling of assumptions and utilization of simulation results at all stages of a clinical trial. To quantify the robustness of the CACT outcome with respect to a simulation assumption, we define δ-robustness as the minimum perturbation of the base prior distribution resulting in a change of the CACT outcome and provide a method to estimate the δ-robustness. We demonstrate the utility of the framework and how the results of δ-robustness evaluation can be utilized at various stages of a clinical trial through an application to the Rhythm ID Goes Head-to-head Trial (RIGHT), which was a comparative evaluation of the safety and efficacy of specific software algorithms across different implantable cardiac devices. Finally, we introduce a hardware interface that allows for direct interaction with the physical device in order to validate and confirm the results of a CACT for implantable cardiac devices.","PeriodicalId":413733,"journal":{"name":"Proceedings of the 10th ACM/IEEE International Conference on Cyber-Physical Systems","volume":"50 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-04-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116226975","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Incident management and analysis dashboard for fire departments: ICCPS demo 用于消防部门的事件管理和分析仪表板:ICCPS演示
Geoffrey Pettet, Ayan Mukhopadhyay, Chinmaya Samal, A. Dubey, Yevgeniy Vorobeychik
This work presents a dashboard tool that helps emergency responders analyze and manage spatial-temporal incidents like crime and traffic accidents. It uses state-of-the-art statistical models to learn incident probabilities based on factors such as prior incidents, time and weather. The dashboard can then present historic and predicted incident distributions. It also allows responders to analyze how moving or adding depots (stations for emergency responders) affects average response times, and can make dispatching recommendations based on heuristics. Broadly, it is a one-stop tool that helps responders visualize historical data as well as plan for and respond to incidents.
这项工作提出了一个仪表板工具,帮助应急响应人员分析和管理时空事件,如犯罪和交通事故。它使用最先进的统计模型来学习基于先前事件、时间和天气等因素的事件概率。然后仪表板可以显示历史和预测的事件分布。它还允许响应人员分析移动或增加仓库(应急响应人员的站点)如何影响平均响应时间,并可以基于启发式方法提出调度建议。从广义上讲,它是一个一站式工具,可以帮助响应者可视化历史数据,以及计划和响应事件。
{"title":"Incident management and analysis dashboard for fire departments: ICCPS demo","authors":"Geoffrey Pettet, Ayan Mukhopadhyay, Chinmaya Samal, A. Dubey, Yevgeniy Vorobeychik","doi":"10.1145/3302509.3313329","DOIUrl":"https://doi.org/10.1145/3302509.3313329","url":null,"abstract":"This work presents a dashboard tool that helps emergency responders analyze and manage spatial-temporal incidents like crime and traffic accidents. It uses state-of-the-art statistical models to learn incident probabilities based on factors such as prior incidents, time and weather. The dashboard can then present historic and predicted incident distributions. It also allows responders to analyze how moving or adding depots (stations for emergency responders) affects average response times, and can make dispatching recommendations based on heuristics. Broadly, it is a one-stop tool that helps responders visualize historical data as well as plan for and respond to incidents.","PeriodicalId":413733,"journal":{"name":"Proceedings of the 10th ACM/IEEE International Conference on Cyber-Physical Systems","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-04-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131329716","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Availability attacks on computing systems through alteration of environmental control: smart malware approach 通过改变环境控制对计算系统的可用性攻击:智能恶意软件方法
Key-whan Chung, Z. Kalbarczyk, R. Iyer
In this paper, we demonstrate the feasibility of smart malware that advances state-of-the-art attacks by (i) indirectly attacking a computing infrastructure through a cyber-physical system (CPS) that manages the environment in which the computing enterprise operates, (ii) disguising its malicious actions as accidental failures, and (iii) self-learning attack strategies from cyber-physical system measurement data. We address all aspects of the malware, including the construction of the self-learning malware and the launch of a failure injection attack. We validate the attacks in a data-driven CPS simulation environment developed as part of this study.
在本文中,我们论证了智能恶意软件的可行性,它通过(i)通过管理计算企业运行环境的网络物理系统(CPS)间接攻击计算基础设施,(ii)将其恶意行为伪装成意外故障,以及(iii)从网络物理系统测量数据中自我学习攻击策略。我们解决了恶意软件的各个方面,包括自学习恶意软件的构建和失败注入攻击的启动。我们在作为本研究的一部分开发的数据驱动的CPS模拟环境中验证了攻击。
{"title":"Availability attacks on computing systems through alteration of environmental control: smart malware approach","authors":"Key-whan Chung, Z. Kalbarczyk, R. Iyer","doi":"10.1145/3302509.3311041","DOIUrl":"https://doi.org/10.1145/3302509.3311041","url":null,"abstract":"In this paper, we demonstrate the feasibility of smart malware that advances state-of-the-art attacks by (i) indirectly attacking a computing infrastructure through a cyber-physical system (CPS) that manages the environment in which the computing enterprise operates, (ii) disguising its malicious actions as accidental failures, and (iii) self-learning attack strategies from cyber-physical system measurement data. We address all aspects of the malware, including the construction of the self-learning malware and the launch of a failure injection attack. We validate the attacks in a data-driven CPS simulation environment developed as part of this study.","PeriodicalId":413733,"journal":{"name":"Proceedings of the 10th ACM/IEEE International Conference on Cyber-Physical Systems","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-04-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128561474","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 14
Sampling rate optimization for IEEE 802.11 wireless control systems IEEE 802.11无线控制系统的采样率优化
Dohwan Kim, Yuchang Won, Seunghyeon Kim, Y. Eun, Kyung-Joon Park, K. Johansson
Design of wireless control systems has been extensively studied, which is one of the fundamental issues in cyber-physical systems. In this paper, we empirically investigate heterogeneous sampling rate assignment with a testbed when multiple physical systems are controlled through an IEEE 802.11 network. Among the critical design variables in wireless control systems, we focus on the sampling rates because they are always key control knobs regardless of network protocols. There has been little experimental research on heterogeneous sampling rate optimization for IEEE 802.11 wireless control systems, where the sampling rates of each control loop may have different values. We first formulate the co-design problem in an optimization framework with respect to the heterogeneous sampling rates by explicitly taking into account the relations of the sampling rates with the control cost, network energy consumption, and network delay. We further relax the problem as convex optimization, which is provably solved in polynomial time. Our empirical study ensures that the approximate solution is tightly close to the original optimum. To validate the proposed optimization framework, we build a disk-levitation tube testbed, which wirelessly controls the height of 20 disks at the same time. Our empirical study confirms that our optimization formulation is highly effective in practice.
无线控制系统的设计是网络物理系统的基本问题之一,受到了广泛的研究。在本文中,我们通过IEEE 802.11网络对多个物理系统进行控制时,利用一个测试平台对异构采样率分配进行了实证研究。在无线控制系统的关键设计变量中,我们重点关注采样率,因为无论网络协议如何,采样率始终是关键的控制旋钮。对于IEEE 802.11无线控制系统的异构采样率优化,目前的实验研究很少,其中每个控制回路的采样率可能有不同的值。通过明确考虑采样率与控制成本、网络能耗和网络延迟之间的关系,我们首先在异构采样率的优化框架中制定了协同设计问题。我们进一步将该问题松弛为凸优化,并在多项式时间内证明了该问题的解。我们的实证研究保证了近似解与原始最优解紧密接近。为了验证所提出的优化框架,我们建立了一个磁盘悬浮管试验台,同时无线控制20个磁盘的高度。实证研究表明,本文提出的优化配方在实践中是非常有效的。
{"title":"Sampling rate optimization for IEEE 802.11 wireless control systems","authors":"Dohwan Kim, Yuchang Won, Seunghyeon Kim, Y. Eun, Kyung-Joon Park, K. Johansson","doi":"10.1145/3302509.3311045","DOIUrl":"https://doi.org/10.1145/3302509.3311045","url":null,"abstract":"Design of wireless control systems has been extensively studied, which is one of the fundamental issues in cyber-physical systems. In this paper, we empirically investigate heterogeneous sampling rate assignment with a testbed when multiple physical systems are controlled through an IEEE 802.11 network. Among the critical design variables in wireless control systems, we focus on the sampling rates because they are always key control knobs regardless of network protocols. There has been little experimental research on heterogeneous sampling rate optimization for IEEE 802.11 wireless control systems, where the sampling rates of each control loop may have different values. We first formulate the co-design problem in an optimization framework with respect to the heterogeneous sampling rates by explicitly taking into account the relations of the sampling rates with the control cost, network energy consumption, and network delay. We further relax the problem as convex optimization, which is provably solved in polynomial time. Our empirical study ensures that the approximate solution is tightly close to the original optimum. To validate the proposed optimization framework, we build a disk-levitation tube testbed, which wirelessly controls the height of 20 disks at the same time. Our empirical study confirms that our optimization formulation is highly effective in practice.","PeriodicalId":413733,"journal":{"name":"Proceedings of the 10th ACM/IEEE International Conference on Cyber-Physical Systems","volume":"149 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-04-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126915877","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 11
String stability of commercial adaptive cruise control vehicles: WIP abstract 商用自适应巡航控制车辆的串稳定性:WIP摘要
George Gunter, Yanbing Wang, Derek Gloudemans, Raphael E. Stern, D. Work, M. D. Monache, R. Bhadani, Matt Bunting, Roman L. Lysecky, J. Sprinkle, Benjamin Seibold, B. Piccoli
Adaptive cruise control (ACC) is the first wave of vehicle automation that will reach the mainstream. It has been shown in [3] that automation of a small fraction of vehicles in traffic (e.g., 5%) can change the emergent properties of the flow, for example by dissipating phantom jams. Substantial theoretical and experimental underpinnings of vehicle automation and platooning were established in the from the USDOT Automated Highway System effort [1]. However, it is not yet clear whether the ACC vehicles that are currently commercially available will dampen or amplify phantom jams.
自适应巡航控制(ACC)是第一波将成为主流的汽车自动化技术。b[3]的研究表明,交通中一小部分车辆(例如5%)的自动化可以改变交通流的紧急特性,例如通过消散虚幻堵塞。美国运输部自动化公路系统项目[1]为车辆自动化和队列控制奠定了坚实的理论和实验基础。然而,目前尚不清楚商用的ACC车辆是否会减轻或加剧幽灵堵塞。
{"title":"String stability of commercial adaptive cruise control vehicles: WIP abstract","authors":"George Gunter, Yanbing Wang, Derek Gloudemans, Raphael E. Stern, D. Work, M. D. Monache, R. Bhadani, Matt Bunting, Roman L. Lysecky, J. Sprinkle, Benjamin Seibold, B. Piccoli","doi":"10.1145/3302509.3313325","DOIUrl":"https://doi.org/10.1145/3302509.3313325","url":null,"abstract":"Adaptive cruise control (ACC) is the first wave of vehicle automation that will reach the mainstream. It has been shown in [3] that automation of a small fraction of vehicles in traffic (e.g., 5%) can change the emergent properties of the flow, for example by dissipating phantom jams. Substantial theoretical and experimental underpinnings of vehicle automation and platooning were established in the from the USDOT Automated Highway System effort [1]. However, it is not yet clear whether the ACC vehicles that are currently commercially available will dampen or amplify phantom jams.","PeriodicalId":413733,"journal":{"name":"Proceedings of the 10th ACM/IEEE International Conference on Cyber-Physical Systems","volume":"96 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-04-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133657889","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Hierarchical taxi dispatch system with local coordination among micro-level components: WIP abstract 微级部件局部协调的分层出租车调度系统:WIP抽象
Sota Takashima, Naomi Kuze, T. Ushio
Taxi drivers face to poor working conditions such as low wages and long working hours. To improve the conditions, a taxi dispatch system is very important. F. Miao et al. proposed a method for calculating an optimal taxi cruising path based on the model predictive control and the new parameter of supply demand ratio [1]. T. Ushio et al. proposed a method for calculating an optimal taxi cruising path based on a mixed logical dynamical model [2]. X. Chen et al. proposed a hierarchical taxi-dispatch system consisting of global and regional decision-makers [3]. The global decision-maker calculates the optimal taxi dispatch in a longer time period, while the regional ones determine schedules of taxis in the corresponding regions in a shorter time period. However, these previous mechanisms have lower scalability and real-time properties. To tackle these problems, we propose a novel hierarchical support system for a taxi dispatch system. In our proposal, a macro level component calculates the optimal taxi movement by using a taxi flow model, whereas micro level components determine the optimal taxi cruising in the corresponding regions. Both of them are conducted in the same time period.
出租车司机面临着工资低、工作时间长等恶劣的工作条件。为了改善这种状况,出租车调度系统是非常重要的。F. Miao等人提出了一种基于模型预测控制和新参数供需比的出租车最优巡航路径计算方法[1]。T. Ushio等人提出了一种基于混合逻辑动力学模型的出租车最优巡航路径计算方法[2]。X. Chen等人提出了一种由全局和区域决策者组成的分层出租车调度系统[3]。全局决策者在较长时间内计算出租车最优调度,区域决策者在较短时间内确定相应区域的出租车调度。然而,这些先前的机制具有较低的可伸缩性和实时性。为了解决这些问题,我们提出了一种新的出租车调度系统分层支持系统。在我们的建议中,宏观层面的组件通过使用出租车流模型来计算最优出租车运动,而微观层面的组件确定相应区域的最优出租车巡航。两者都是在同一时期进行的。
{"title":"Hierarchical taxi dispatch system with local coordination among micro-level components: WIP abstract","authors":"Sota Takashima, Naomi Kuze, T. Ushio","doi":"10.1145/3302509.3313314","DOIUrl":"https://doi.org/10.1145/3302509.3313314","url":null,"abstract":"Taxi drivers face to poor working conditions such as low wages and long working hours. To improve the conditions, a taxi dispatch system is very important. F. Miao et al. proposed a method for calculating an optimal taxi cruising path based on the model predictive control and the new parameter of supply demand ratio [1]. T. Ushio et al. proposed a method for calculating an optimal taxi cruising path based on a mixed logical dynamical model [2]. X. Chen et al. proposed a hierarchical taxi-dispatch system consisting of global and regional decision-makers [3]. The global decision-maker calculates the optimal taxi dispatch in a longer time period, while the regional ones determine schedules of taxis in the corresponding regions in a shorter time period. However, these previous mechanisms have lower scalability and real-time properties. To tackle these problems, we propose a novel hierarchical support system for a taxi dispatch system. In our proposal, a macro level component calculates the optimal taxi movement by using a taxi flow model, whereas micro level components determine the optimal taxi cruising in the corresponding regions. Both of them are conducted in the same time period.","PeriodicalId":413733,"journal":{"name":"Proceedings of the 10th ACM/IEEE International Conference on Cyber-Physical Systems","volume":"7 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-04-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134280828","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Reduced variance deep reinforcement learning with temporal logic specifications 基于时间逻辑规范的减少方差深度强化学习
Qitong Gao, Davood Hajinezhad, Yan Zhang, Y. Kantaros, M. Zavlanos
In this paper, we propose a model-free reinforcement learning method to synthesize control policies for mobile robots modeled as Markov Decision Process (MDP) with unknown transition probabilities that satisfy Linear Temporal Logic (LTL) specifications. Specifically, we develop a reduced variance deep Q-Learning technique that relies on Neural Networks (NN) to approximate the state-action values of the MDP and employs a reward function that depends on the accepting condition of the Deterministic Rabin Automaton (DRA) that captures the LTL specification. The key idea is to convert the deep Q-Learning problem into a nonconvex max-min optimization problem with a finite-sum structure, and develop an Arrow-Hurwicz-Uzawa type stochastic reduced variance algorithm with constant stepsize to solve it. Unlike Stochastic Gradient Descent (SGD) methods that are often used in deep reinforcement learning, our method can estimate the gradients of an unknown loss function more accurately and can improve the stability of the training process. Moreover, our method does not require learning the transition probabilities in the MDP, constructing a product MDP, or computing Accepting Maximal End Components (AMECs). This allows the robot to learn an optimal policy even if the environment cannot be modeled accurately or if AMECs do not exist. In the latter case, the resulting control policies minimize the frequency with which the system enters bad states in the DRA that violate the task specifications. To the best of our knowledge, this is the first model-free deep reinforcement learning algorithm that can synthesize policies that maximize the probability of satisfying an LTL specification even if AMECs do not exist. Rigorous convergence analysis and rate of convergence are provided for the proposed algorithm as well as numerical experiments that validate our method.
在本文中,我们提出了一种无模型强化学习方法来综合移动机器人的控制策略,该控制策略建模为马尔可夫决策过程(MDP),具有未知的转移概率,满足线性时间逻辑(LTL)规范。具体来说,我们开发了一种减少方差的深度q -学习技术,该技术依赖于神经网络(NN)来近似MDP的状态-动作值,并采用一个奖励函数,该函数依赖于捕获LTL规范的确定性拉宾自动机(DRA)的接受条件。其关键思想是将深度Q-Learning问题转化为具有有限和结构的非凸极大极小优化问题,并开发了一种Arrow-Hurwicz-Uzawa型的恒步长随机减方差算法来求解。与深度强化学习中经常使用的随机梯度下降(SGD)方法不同,我们的方法可以更准确地估计未知损失函数的梯度,并且可以提高训练过程的稳定性。此外,我们的方法不需要学习MDP中的转移概率,构建产品MDP,或计算接受最大终端组件(AMECs)。这使得机器人即使在环境不能准确建模或不存在amec的情况下也能学习到最优策略。在后一种情况下,产生的控制策略将系统在DRA中进入违反任务规范的坏状态的频率降至最低。据我们所知,这是第一个无模型深度强化学习算法,即使不存在amec,也可以综合策略,使满足LTL规范的概率最大化。给出了严格的收敛分析和收敛速度,并进行了数值实验验证。
{"title":"Reduced variance deep reinforcement learning with temporal logic specifications","authors":"Qitong Gao, Davood Hajinezhad, Yan Zhang, Y. Kantaros, M. Zavlanos","doi":"10.1145/3302509.3311053","DOIUrl":"https://doi.org/10.1145/3302509.3311053","url":null,"abstract":"In this paper, we propose a model-free reinforcement learning method to synthesize control policies for mobile robots modeled as Markov Decision Process (MDP) with unknown transition probabilities that satisfy Linear Temporal Logic (LTL) specifications. Specifically, we develop a reduced variance deep Q-Learning technique that relies on Neural Networks (NN) to approximate the state-action values of the MDP and employs a reward function that depends on the accepting condition of the Deterministic Rabin Automaton (DRA) that captures the LTL specification. The key idea is to convert the deep Q-Learning problem into a nonconvex max-min optimization problem with a finite-sum structure, and develop an Arrow-Hurwicz-Uzawa type stochastic reduced variance algorithm with constant stepsize to solve it. Unlike Stochastic Gradient Descent (SGD) methods that are often used in deep reinforcement learning, our method can estimate the gradients of an unknown loss function more accurately and can improve the stability of the training process. Moreover, our method does not require learning the transition probabilities in the MDP, constructing a product MDP, or computing Accepting Maximal End Components (AMECs). This allows the robot to learn an optimal policy even if the environment cannot be modeled accurately or if AMECs do not exist. In the latter case, the resulting control policies minimize the frequency with which the system enters bad states in the DRA that violate the task specifications. To the best of our knowledge, this is the first model-free deep reinforcement learning algorithm that can synthesize policies that maximize the probability of satisfying an LTL specification even if AMECs do not exist. Rigorous convergence analysis and rate of convergence are provided for the proposed algorithm as well as numerical experiments that validate our method.","PeriodicalId":413733,"journal":{"name":"Proceedings of the 10th ACM/IEEE International Conference on Cyber-Physical Systems","volume":"2 3 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-04-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123562373","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 48
Decentralized merging control in traffic networks: a control barrier function approach 交通网络中的分散合并控制:一种控制障碍函数方法
Wei Xiao, C. Belta, C. Cassandras
In this paper, we aim to optimize the process of Connected and Automated Vehicles (CAVs) merging at a traffic intersection while guaranteeing the state, control and safety constraints. We decompose the task of automatic merging for all the CAVs in a control zone around a merging point into same-lane safety constraints and different-lane safe merging, and implement these requirements using control barrier functions (CBFs). We consider two main objectives. First, to minimize travel time, we make the CAVs reach the road maximum speed with exponentially stabilizing control Lyapunov functions (CLF). Second, we penalize energy consumption as a cost in an optimization problem. We then decompose the merging problem into decentralized subproblems formulated as a sequence of quadratic programs (QP), which are solved in real time. Our simulations and comparisons show that the method proposed here outperforms ad hoc controllers used in traffic system simulators and provides comparable results to the optimal control solution of the merging problem in earlier work.
本文的目标是在保证状态、控制和安全约束的前提下,优化网联自动驾驶汽车(cav)在十字路口的合流过程。本文将合并点周围控制区内所有自动合并车辆的任务分解为同车道安全约束和不同车道安全合并,并利用控制屏障函数(CBFs)实现这些要求。我们考虑两个主要目标。首先,利用指数稳定控制李雅普诺夫函数(CLF)使自动驾驶汽车达到道路最大速度,使行驶时间最小。其次,我们将能耗作为优化问题中的成本进行惩罚。然后,我们将合并问题分解为分散的子问题,这些子问题被表述为一个二次规划序列(QP),并实时求解。我们的仿真和比较表明,本文提出的方法优于交通系统模拟器中使用的自组织控制器,并提供与早期工作中合并问题的最优控制方案相当的结果。
{"title":"Decentralized merging control in traffic networks: a control barrier function approach","authors":"Wei Xiao, C. Belta, C. Cassandras","doi":"10.1145/3302509.3311054","DOIUrl":"https://doi.org/10.1145/3302509.3311054","url":null,"abstract":"In this paper, we aim to optimize the process of Connected and Automated Vehicles (CAVs) merging at a traffic intersection while guaranteeing the state, control and safety constraints. We decompose the task of automatic merging for all the CAVs in a control zone around a merging point into same-lane safety constraints and different-lane safe merging, and implement these requirements using control barrier functions (CBFs). We consider two main objectives. First, to minimize travel time, we make the CAVs reach the road maximum speed with exponentially stabilizing control Lyapunov functions (CLF). Second, we penalize energy consumption as a cost in an optimization problem. We then decompose the merging problem into decentralized subproblems formulated as a sequence of quadratic programs (QP), which are solved in real time. Our simulations and comparisons show that the method proposed here outperforms ad hoc controllers used in traffic system simulators and provides comparable results to the optimal control solution of the merging problem in earlier work.","PeriodicalId":413733,"journal":{"name":"Proceedings of the 10th ACM/IEEE International Conference on Cyber-Physical Systems","volume":"13 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-04-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125093106","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 39
期刊
Proceedings of the 10th ACM/IEEE International Conference on Cyber-Physical Systems
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1