2018 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)最新文献

英文中文

Establishing Appropriate Trust via Critical States 通过关键状态建立适当的信任

2018 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)

Pub Date : 2018-10-01 DOI: 10.1109/IROS.2018.8593649

Sandy H. Huang, K. Bhatia, P. Abbeel, A. Dragan

In order to effectively interact with or supervise a robot, humans need to have an accurate mental model of its capabilities and how it acts. Learned neural network policies make that particularly challenging. We propose an approach for helping end-users build a mental model of such policies. Our key observation is that for most tasks, the essence of the policy is captured in a few critical states: states in which it is very important to take a certain action. Our user studies show that if the robot shows a human what its understanding of the task's critical states is, then the human can make a more informed decision about whether to deploy the policy, and if she does deploy it, when she needs to take control from it at execution time.

为了有效地与机器人互动或监督机器人，人类需要对机器人的能力和行为方式有一个准确的心理模型。习得的神经网络策略使得这一点尤其具有挑战性。我们提出了一种方法来帮助最终用户建立这种策略的心理模型。我们的关键观察是，对于大多数任务，策略的本质是在几个关键状态中捕获的:在这些状态中，采取某种操作非常重要。我们的用户研究表明，如果机器人向人类展示了它对任务关键状态的理解，那么人类就可以就是否部署策略做出更明智的决定，如果她部署了策略，那么当她需要在执行时从它那里获得控制权时。

引用次数: 82

Adaptive step rotation in biped walking 双足行走的自适应旋转

2018 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)

Pub Date : 2018-10-01 DOI: 10.1109/IROS.2018.8594431

N. Bohorquez, Pierre-Brice Wieber

We want to enable the robot to reorient its feet in order to face its direction of motion. Model Predictive Control schemes for biped walking usually assume fixed feet rotation since adapting them online leads to a nonlinear problem. Nonlinear solvers do not guarantee the satisfaction of nonlinear constraints at every iterate and this can be problematic for the real-time operation of robots. We propose to define safe linear constraints that are always inside the intersection of the nonlinear constraints. We make simulations of the robot walking on a crowd and compare the performance of the proposed method with respect to the original nonlinear problem solved as a Sequential Quadratic Program.

我们想让机器人重新定位它的脚，以便面对它的运动方向。两足行走的模型预测控制方案通常采用固定的足部旋转，因为对其进行在线调整会导致非线性问题。非线性求解器不能保证每次迭代都满足非线性约束，这可能会给机器人的实时运行带来问题。我们提出定义安全的线性约束，它总是在非线性约束的交点内。我们对机器人在人群中行走进行了仿真，并将所提出的方法与原非线性问题作为顺序二次规划求解进行了性能比较。

引用次数: 5

Evolving a Sensory-Motor Interconnection for Dynamic Quadruped Robot Locomotion Behavior 动态四足机器人运动行为的感觉-运动互联进化

2018 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)

Pub Date : 2018-10-01 DOI: 10.1109/IROS.2018.8593671

Azhar Aulia Saputra, W. Chin, János Botzheim, N. Kubota

In this paper, we present a novel biologically inspired evolving neural oscillator for quadruped robot locomotion to minimize constraints during the locomotion process. The proposed sensory-motor coordination model is formed by the interconnection between motor and sensory neurons. The model utilizes Bacterial Programming to reconstruct the number of joints and neurons in each joint based on environmental conditions. Bacterial Programming is inspired by the evolutionary process of bacteria that includes bacterial mutation and gene transfer process. In this system, either the number of joints, the number of neurons, or the interconnection structure are changing dynamically depending on the sensory information from sensors equipped on the robot. The proposed model is simulated in computer for realizing the optimization process and the optimized structure is then applied to a real quadruped robot for locomotion process. The optimizing process is based on tree structure optimization to simplify the sensory-motor interconnection structure. The proposed model was validated by series of real robot experiments in different environmental conditions.

在本文中，我们提出了一种新的生物启发的进化神经振荡器，用于四足机器人运动，以最小化运动过程中的约束。所提出的感觉-运动协调模型是由运动神经元和感觉神经元之间的相互连接形成的。该模型利用细菌编程，根据环境条件重建关节和每个关节中的神经元数量。细菌编程的灵感来自于细菌的进化过程，包括细菌突变和基因转移过程。在该系统中，关节的数量、神经元的数量或互连结构都是动态变化的，这取决于机器人上安装的传感器的感觉信息。在计算机上对所提出的模型进行仿真以实现优化过程，并将优化后的结构应用于实际四足机器人的运动过程。优化过程基于树形结构优化，以简化感觉-运动互连结构。通过一系列不同环境条件下的真实机器人实验验证了该模型的有效性。

{"title":"Evolving a Sensory-Motor Interconnection for Dynamic Quadruped Robot Locomotion Behavior","authors":"Azhar Aulia Saputra, W. Chin, János Botzheim, N. Kubota","doi":"10.1109/IROS.2018.8593671","DOIUrl":"https://doi.org/10.1109/IROS.2018.8593671","url":null,"abstract":"In this paper, we present a novel biologically inspired evolving neural oscillator for quadruped robot locomotion to minimize constraints during the locomotion process. The proposed sensory-motor coordination model is formed by the interconnection between motor and sensory neurons. The model utilizes Bacterial Programming to reconstruct the number of joints and neurons in each joint based on environmental conditions. Bacterial Programming is inspired by the evolutionary process of bacteria that includes bacterial mutation and gene transfer process. In this system, either the number of joints, the number of neurons, or the interconnection structure are changing dynamically depending on the sensory information from sensors equipped on the robot. The proposed model is simulated in computer for realizing the optimization process and the optimized structure is then applied to a real quadruped robot for locomotion process. The optimizing process is based on tree structure optimization to simplify the sensory-motor interconnection structure. The proposed model was validated by series of real robot experiments in different environmental conditions.","PeriodicalId":6640,"journal":{"name":"2018 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)","volume":"4 1","pages":"7089-7095"},"PeriodicalIF":0.0,"publicationDate":"2018-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"87502087","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 4

Electing an Approximate Center in a Huge Modular Robot with the k-BFS SumSweep Algorithm 基于k-BFS SumSweep算法的大型模块化机器人近似中心选择

2018 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)

Pub Date : 2018-10-01 DOI: 10.1109/IROS.2018.8593612

André Naz, Benoît Piranda, J. Bourgeois, S. Goldstein

Among the diversity of the existing modular robotic systems, we consider in this paper the subset of distributed modular robotic ensembles composed of resource-constrained identical modules that are organized in a lattice structure and which can only communicate with neighboring modules. These modular robotic ensembles form asynchronous distributed embedded systems. In many algorithms dedicated to distributed system coordination, a specific role has to be played by a leader, i.e., a single node in the system. This leader can be elected using various criteria. A possible strategy is to elect a center node, i.e., a node that has the minimum distance to all the other nodes. Indeed, this node is ideally located to communicate with all the others and this leads to better performance in many algorithms. The contribution of this paper is to propose the $k$-BFS SumSweep algorithm designed to elect an approximate-center node. We evaluated our algorithm both on hardware modular robots and in a simulator for large ensembles of robots. Experimental results show that k-BFS SumSweep is often the most accurate approximation algorithm (with an average relative accuracy between 90% to 100%) while using the fewest messages in large-scale systems, requiring only a modest amount of memory per node, and converging in a reasonable length of time.

在现有模块化机器人系统的多样性中，我们考虑了由资源受限的相同模块组成的分布式模块化机器人集成的子集，这些模块以晶格结构组织，并且只能与相邻模块通信。这些模块化机器人组成异步分布式嵌入式系统。在许多致力于分布式系统协调的算法中，领导者必须扮演特定的角色，即系统中的单个节点。这个领导人可以用不同的标准选出来。一种可能的策略是选择一个中心节点，即与所有其他节点的距离最小的节点。事实上，这个节点的理想位置是与所有其他节点进行通信，这在许多算法中导致更好的性能。本文的贡献在于提出了$k$-BFS SumSweep算法，该算法用于选择一个近似中心节点。我们在硬件模块化机器人和大型机器人集成模拟器上评估了我们的算法。实验结果表明，k-BFS SumSweep通常是最精确的近似算法(平均相对精度在90%到100%之间)，同时在大规模系统中使用最少的消息，每个节点只需要适量的内存，并在合理的时间内收敛。

{"title":"Electing an Approximate Center in a Huge Modular Robot with the k-BFS SumSweep Algorithm","authors":"André Naz, Benoît Piranda, J. Bourgeois, S. Goldstein","doi":"10.1109/IROS.2018.8593612","DOIUrl":"https://doi.org/10.1109/IROS.2018.8593612","url":null,"abstract":"Among the diversity of the existing modular robotic systems, we consider in this paper the subset of distributed modular robotic ensembles composed of resource-constrained identical modules that are organized in a lattice structure and which can only communicate with neighboring modules. These modular robotic ensembles form asynchronous distributed embedded systems. In many algorithms dedicated to distributed system coordination, a specific role has to be played by a leader, i.e., a single node in the system. This leader can be elected using various criteria. A possible strategy is to elect a center node, i.e., a node that has the minimum distance to all the other nodes. Indeed, this node is ideally located to communicate with all the others and this leads to better performance in many algorithms. The contribution of this paper is to propose the $k$-BFS SumSweep algorithm designed to elect an approximate-center node. We evaluated our algorithm both on hardware modular robots and in a simulator for large ensembles of robots. Experimental results show that k-BFS SumSweep is often the most accurate approximation algorithm (with an average relative accuracy between 90% to 100%) while using the fewest messages in large-scale systems, requiring only a modest amount of memory per node, and converging in a reasonable length of time.","PeriodicalId":6640,"journal":{"name":"2018 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)","volume":"47 1","pages":"4825-4832"},"PeriodicalIF":0.0,"publicationDate":"2018-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"87799380","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 3

Iterative Learning of Energy-Efficient Dynamic Walking Gaits 节能动态步行步态的迭代学习

2018 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)

Pub Date : 2018-10-01 DOI: 10.1109/IROS.2018.8593548

Felix H. Kong, I. Manchester

Dynamic walking robots have the potential for efficient and lifelike locomotion, but computing efficient gaits and tracking them is difficult in the presence of under-modeling. Iterative Learning Control (ILC) is a method to learn the control signal to track a periodic reference over several attempts, augmenting a model with online data. Terminal ILC (TILC), a variant of ILC, allows other performance objectives to be addressed at the cost of ignoring parts of the reference. However, dynamic walking robot gaits are not necessarily periodic in time. In this paper, we adapt TILC to jointly optimize final foot placement and energy efficiency on dynamic walking robots by indexing by a phase variable instead of time, yielding “phase-indexed TILC” (θ - TILC). When implemented on a five-link walker in simulation, θ- TILC learns a more energy-efficient walking motion compared to traditional time-indexed TILC.

动态步行机器人具有高效逼真运动的潜力，但在建模不足的情况下，计算有效步态并跟踪它们是困难的。迭代学习控制(ILC)是一种通过多次尝试学习控制信号来跟踪周期性参考点的方法，通过在线数据对模型进行扩充。终端ILC (TILC)是ILC的一种变体，允许以忽略参考部分的代价来解决其他性能目标。然而，动态步行机器人的步态在时间上并不一定具有周期性。在本文中，我们采用相位变量代替时间索引TILC来共同优化动态步行机器人的最终足部位置和能量效率，得到“相位索引TILC”(θ - TILC)。在五连杆步行机器人仿真中，θ- TILC比传统的时间索引TILC学习出更节能的步行动作。

引用次数: 0

Real-Time Segmentation with Appearance, Motion and Geometry 实时分割与外观，运动和几何

2018 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)

Pub Date : 2018-10-01 DOI: 10.1109/IROS.2018.8594088

Mennatullah Siam, Sara Elkerdawy, M. Gamal, Moemen Abdel-Razek, Martin Jägersand, Hong Zhang

Real-time Segmentation is of crucial importance to robotics related applications such as autonomous driving, driving assisted systems, and traffic monitoring from unmanned aerial vehicles imagery. We propose a novel two-stream convolutional network for motion segmentation, which exploits flow and geometric cues to balance the accuracy and computational efficiency trade-offs. The geometric cues take advantage of the domain knowledge of the application. In case of mostly planar scenes from high altitude unmanned aerial vehicles (UAVs), homography compensated flow is used. While in the case of urban scenes in autonomous driving, with GPS/IMU sensory data available, sparse projected depth estimates and odometry information are used. The network provides 4.7⨯ speedup over the state of the art networks in motion segmentation from 153ms to 36ms, at the expense of a reduction in the segmentation accuracy in terms of pixel boundaries. This enables the network to perform real-time on a Jetson T⨯2. In order to recuperate some of the accuracy loss, geometric priors is used while still achieving a much improved computational efficiency with respect to the state-of-the-art. The usage of geometric priors improved the segmentation in UAV imagery by 5.2 % using the metric of IoU over the baseline network. While on KITTI-MoSeg the sparse depth estimates improved the segmentation by 12.5 % over the baseline. Our proposed motion segmentation solution is verified on the popular KITTI and VIVID datasets, with additional labels we have produced. The code for our work is publicly available at11https://github.com/MSiam/RTMotSeg_Geom

实时分割对于自动驾驶、驾驶辅助系统和无人机图像交通监控等机器人相关应用至关重要。我们提出了一种新的双流卷积网络用于运动分割，它利用流和几何线索来平衡精度和计算效率的权衡。几何线索利用了应用程序的领域知识。对于高空无人机拍摄的多为平面场景，采用了单应性补偿流。而在自动驾驶的城市场景中，使用GPS/IMU传感数据，使用稀疏投影深度估计和里程计信息。该网络在运动分割方面提供了4.7个加速，从153ms到36ms，代价是在像素边界方面降低了分割精度。这使得网络能够在Jetson T上执行实时操作。为了恢复一些精度损失，使用几何先验，同时仍然实现了相对于最新技术的大大提高的计算效率。在基线网络上使用IoU度量，几何先验的使用将无人机图像的分割提高了5.2%。而在KITTI-MoSeg上，稀疏深度估计比基线分割率提高了12.5%。我们提出的运动分割解决方案在流行的KITTI和VIVID数据集上进行了验证，并附带了我们制作的附加标签。我们工作的代码可以在11https://github.com/MSiam/RTMotSeg_Geom上公开获得

{"title":"Real-Time Segmentation with Appearance, Motion and Geometry","authors":"Mennatullah Siam, Sara Elkerdawy, M. Gamal, Moemen Abdel-Razek, Martin Jägersand, Hong Zhang","doi":"10.1109/IROS.2018.8594088","DOIUrl":"https://doi.org/10.1109/IROS.2018.8594088","url":null,"abstract":"Real-time Segmentation is of crucial importance to robotics related applications such as autonomous driving, driving assisted systems, and traffic monitoring from unmanned aerial vehicles imagery. We propose a novel two-stream convolutional network for motion segmentation, which exploits flow and geometric cues to balance the accuracy and computational efficiency trade-offs. The geometric cues take advantage of the domain knowledge of the application. In case of mostly planar scenes from high altitude unmanned aerial vehicles (UAVs), homography compensated flow is used. While in the case of urban scenes in autonomous driving, with GPS/IMU sensory data available, sparse projected depth estimates and odometry information are used. The network provides 4.7⨯ speedup over the state of the art networks in motion segmentation from 153ms to 36ms, at the expense of a reduction in the segmentation accuracy in terms of pixel boundaries. This enables the network to perform real-time on a Jetson T⨯2. In order to recuperate some of the accuracy loss, geometric priors is used while still achieving a much improved computational efficiency with respect to the state-of-the-art. The usage of geometric priors improved the segmentation in UAV imagery by 5.2 % using the metric of IoU over the baseline network. While on KITTI-MoSeg the sparse depth estimates improved the segmentation by 12.5 % over the baseline. Our proposed motion segmentation solution is verified on the popular KITTI and VIVID datasets, with additional labels we have produced. The code for our work is publicly available at11https://github.com/MSiam/RTMotSeg_Geom","PeriodicalId":6640,"journal":{"name":"2018 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)","volume":"2021 1","pages":"5793-5800"},"PeriodicalIF":0.0,"publicationDate":"2018-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"86822247","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 12

Scan Context: Egocentric Spatial Descriptor for Place Recognition Within 3D Point Cloud Map 扫描环境:三维点云图中位置识别的自我中心空间描述符

2018 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)

Pub Date : 2018-10-01 DOI: 10.1109/IROS.2018.8593953

Giseop Kim, Ayoung Kim

Compared to diverse feature detectors and descriptors used for visual scenes, describing a place using structural information is relatively less reported. Recent advances in simultaneous localization and mapping (SLAM) provides dense 3D maps of the environment and the localization is proposed by diverse sensors. Toward the global localization based on the structural information, we propose Scan Context, a non-histogram-based global descriptor from 3D Light Detection and Ranging (LiDAR) scans. Unlike previously reported methods, the proposed approach directly records a 3D structure of a visible space from a sensor and does not rely on a histogram or on prior training. In addition, this approach proposes the use of a similarity score to calculate the distance between two scan contexts and also a two-phase search algorithm to efficiently detect a loop. Scan context and its search algorithm make loop-detection invariant to LiDAR viewpoint changes so that loops can be detected in places such as reverse revisit and corner. Scan context performance has been evaluated via various benchmark datasets of 3D LiDAR scans, and the proposed method shows a sufficiently improved performance.

与用于视觉场景的各种特征检测器和描述符相比，使用结构信息描述一个地方的报道相对较少。同步定位与制图(SLAM)技术的最新进展提供了密集的环境三维地图，并由不同的传感器提出定位。为了实现基于结构信息的全局定位，我们提出了扫描上下文，这是一种来自3D光探测和测距(LiDAR)扫描的非直方图全局描述符。与之前报道的方法不同，该方法直接记录来自传感器的可见空间的3D结构，不依赖于直方图或先前的训练。此外，该方法提出了使用相似度分数来计算两个扫描上下文之间的距离，并提出了一种两阶段搜索算法来有效地检测环路。扫描环境及其搜索算法使环路检测不受LiDAR视点变化的影响，从而可以在反向重访和拐角等位置检测到环路。通过3D激光雷达扫描的各种基准数据集对扫描上下文性能进行了评估，表明该方法的性能得到了充分提高。

{"title":"Scan Context: Egocentric Spatial Descriptor for Place Recognition Within 3D Point Cloud Map","authors":"Giseop Kim, Ayoung Kim","doi":"10.1109/IROS.2018.8593953","DOIUrl":"https://doi.org/10.1109/IROS.2018.8593953","url":null,"abstract":"Compared to diverse feature detectors and descriptors used for visual scenes, describing a place using structural information is relatively less reported. Recent advances in simultaneous localization and mapping (SLAM) provides dense 3D maps of the environment and the localization is proposed by diverse sensors. Toward the global localization based on the structural information, we propose Scan Context, a non-histogram-based global descriptor from 3D Light Detection and Ranging (LiDAR) scans. Unlike previously reported methods, the proposed approach directly records a 3D structure of a visible space from a sensor and does not rely on a histogram or on prior training. In addition, this approach proposes the use of a similarity score to calculate the distance between two scan contexts and also a two-phase search algorithm to efficiently detect a loop. Scan context and its search algorithm make loop-detection invariant to LiDAR viewpoint changes so that loops can be detected in places such as reverse revisit and corner. Scan context performance has been evaluated via various benchmark datasets of 3D LiDAR scans, and the proposed method shows a sufficiently improved performance.","PeriodicalId":6640,"journal":{"name":"2018 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)","volume":"404 1","pages":"4802-4809"},"PeriodicalIF":0.0,"publicationDate":"2018-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"86833804","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 353

FarSight: Long-Range Depth Estimation from Outdoor Images FarSight:从户外图像中进行远程深度估计

2018 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)

Pub Date : 2018-10-01 DOI: 10.1109/IROS.2018.8593971

Md. Alimoor Reza, J. Kosecka, P. David

This paper introduces the problem of long-range monocular depth estimation for outdoor urban environments. Range sensors and traditional depth estimation algorithms (both stereo and single view) predict depth for distances of less than 100 meters in outdoor settings and 10 meters in indoor settings. The shortcomings of outdoor single view methods that use learning approaches are, to some extent, due to the lack of long-range ground truth training data, which in turn is due to limitations of range sensors. To circumvent this, we first propose a novel strategy for generating synthetic long-range ground truth depth data. We utilize Google Earth images to reconstruct large-scale 3D models of different cities with proper scale. The acquired repository of 3D models and associated RGB views along with their long-range depth renderings are used as training data for depth prediction. We then train two deep neural network models for long-range depth estimation: i) a Convolutional Neural Network (CNN) and ii) a Generative Adversarial Network (GAN). We found in our experiments that the GAN model predicts depth more accurately. We plan to open-source the database and the baseline models for public use.

本文介绍了城市室外环境的远距离单目深度估计问题。距离传感器和传统的深度估计算法(包括立体和单视图)在室外环境中预测距离小于100米，在室内环境中预测距离小于10米。使用学习方法的室外单视图方法的缺点，在一定程度上是由于缺乏远程地面真值训练数据，而这又是由于距离传感器的限制。为了解决这个问题，我们首先提出了一种新的策略来生成合成的远程地真深度数据。我们利用谷歌地球图像，以适当的比例重建不同城市的大尺度三维模型。获得的3D模型存储库和相关的RGB视图以及它们的远程深度渲染用作深度预测的训练数据。然后，我们训练两个深度神经网络模型用于远程深度估计:i)卷积神经网络(CNN)和ii)生成对抗网络(GAN)。我们在实验中发现，GAN模型更准确地预测深度。我们计划开放数据库和基线模型供公众使用。

{"title":"FarSight: Long-Range Depth Estimation from Outdoor Images","authors":"Md. Alimoor Reza, J. Kosecka, P. David","doi":"10.1109/IROS.2018.8593971","DOIUrl":"https://doi.org/10.1109/IROS.2018.8593971","url":null,"abstract":"This paper introduces the problem of long-range monocular depth estimation for outdoor urban environments. Range sensors and traditional depth estimation algorithms (both stereo and single view) predict depth for distances of less than 100 meters in outdoor settings and 10 meters in indoor settings. The shortcomings of outdoor single view methods that use learning approaches are, to some extent, due to the lack of long-range ground truth training data, which in turn is due to limitations of range sensors. To circumvent this, we first propose a novel strategy for generating synthetic long-range ground truth depth data. We utilize Google Earth images to reconstruct large-scale 3D models of different cities with proper scale. The acquired repository of 3D models and associated RGB views along with their long-range depth renderings are used as training data for depth prediction. We then train two deep neural network models for long-range depth estimation: i) a Convolutional Neural Network (CNN) and ii) a Generative Adversarial Network (GAN). We found in our experiments that the GAN model predicts depth more accurately. We plan to open-source the database and the baseline models for public use.","PeriodicalId":6640,"journal":{"name":"2018 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)","volume":"39 1","pages":"4751-4757"},"PeriodicalIF":0.0,"publicationDate":"2018-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"87089678","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 6

Passive acoustic tracking for behavior mode classification between surface and underwater vehicles 水面和水下航行器行为模式分类的被动声跟踪

2018 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)

Pub Date : 2018-10-01 DOI: 10.1109/IROS.2018.8593981

E. Fischell, Oscar Viquez, H. Schmidt

Autonomous underwater vehicles (AUVs) pose significant communication challenges: vehicles are submerged for periods of time in which speed-of-light communication is impossible. This is a particular problem on low-cost AUV platforms, on which acoustic modems are not available to get vehicle state or provide re-deploy commands. We investigate one possible method of providing operators with a communication line to these vehicles by using noise underwater to both classify behavior of submerged vehicles and to command them. In this scheme, processing of data from hydrophone arrays provide operators with AUV mode estimates and AUVs with surface vehicle behavior updates. Simulation studies were used to characterize trajectories for simple transect versus loiter behaviors based on the bearing and time to intercept (TTI). A classifier based on K-nearest-neighbor with dynamic time warping as a distance metric was used to classify simulation data. The simulation-based classifier was then applied to classify bearing tracking data from passive tracking of a loitering AUV and bearing and TTI data from passive tracking of a transecting boat based on field array data. Experiment data was classified with 76 % accuracy using bearing-only data, 96% accuracy for TTI -only data and 99 % accuracy for combined classification. The techniques developed here could be used for AUV cuing by surface vessels and monitoring of AUV behavior.

自主水下航行器(auv)带来了重大的通信挑战:航行器在水下一段时间内无法进行光速通信。这在低成本的AUV平台上是一个特别的问题，在这种平台上，声学调制解调器无法获得车辆状态或提供重新部署命令。我们研究了一种可能的方法，通过使用水下噪声来分类水下车辆的行为并指挥它们，从而为操作员提供与这些车辆的通信线路。在该方案中，处理来自水听器阵列的数据为操作员提供AUV模式估计，并为AUV提供水面车辆行为更新。模拟研究用于描述基于方位和拦截时间(TTI)的简单样条与游荡行为的轨迹。采用基于k -最近邻的分类器，以动态时间规整为距离度量对仿真数据进行分类。然后，应用基于仿真的分类器对巡航水下机器人被动跟踪的方位跟踪数据和基于场阵数据的横截艇被动跟踪的方位和TTI数据进行分类。仅使用方位数据对实验数据进行分类，准确率为76%，仅使用TTI数据分类准确率为96%，组合分类准确率为99%。本研究开发的技术可用于水面舰艇对水下航行器的跟踪和对水下航行器行为的监测。

{"title":"Passive acoustic tracking for behavior mode classification between surface and underwater vehicles","authors":"E. Fischell, Oscar Viquez, H. Schmidt","doi":"10.1109/IROS.2018.8593981","DOIUrl":"https://doi.org/10.1109/IROS.2018.8593981","url":null,"abstract":"Autonomous underwater vehicles (AUVs) pose significant communication challenges: vehicles are submerged for periods of time in which speed-of-light communication is impossible. This is a particular problem on low-cost AUV platforms, on which acoustic modems are not available to get vehicle state or provide re-deploy commands. We investigate one possible method of providing operators with a communication line to these vehicles by using noise underwater to both classify behavior of submerged vehicles and to command them. In this scheme, processing of data from hydrophone arrays provide operators with AUV mode estimates and AUVs with surface vehicle behavior updates. Simulation studies were used to characterize trajectories for simple transect versus loiter behaviors based on the bearing and time to intercept (TTI). A classifier based on K-nearest-neighbor with dynamic time warping as a distance metric was used to classify simulation data. The simulation-based classifier was then applied to classify bearing tracking data from passive tracking of a loitering AUV and bearing and TTI data from passive tracking of a transecting boat based on field array data. Experiment data was classified with 76 % accuracy using bearing-only data, 96% accuracy for TTI -only data and 99 % accuracy for combined classification. The techniques developed here could be used for AUV cuing by surface vessels and monitoring of AUV behavior.","PeriodicalId":6640,"journal":{"name":"2018 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)","volume":"16 1","pages":"2383-2388"},"PeriodicalIF":0.0,"publicationDate":"2018-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"88024764","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 1

Achieving Flexible Assembly Using Autonomous Robotic Systems 使用自主机器人系统实现灵活装配

2018 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)

Pub Date : 2018-10-01 DOI: 10.1109/IROS.2018.8593852

Kieran Gilday, Josie Hughes, F. Iida

Prefabrication of structures is currently used in a limited capacity, due to the lack of flexibility, despite the potential cost and speed advantages. Autonomous flexible reassembly enables structures to be developed which can be continuously and iteratively dis-assembled and re-assembled providing far more flexibility in comparison to single shot pre-fabrication methods. Dis-assembly of structures should be considered when assembling, due to the asymmetry of assembly and dis-assembly processes, to ensure structures can be recycled and re-assembled. This allows for agile development, significantly reducing the time and resource usage during the build process. In this work, a framework for flexible re-assembly is developed and a robotic platform is developed to implement and test this framework with simple Lego bricks. The tradeoffs in terms of time, resource use and probability of success of this new assembly method can be understood by using a cost function to compare to alternative fabrication methods.

尽管具有潜在的成本和速度优势，但由于缺乏灵活性，预制结构目前的使用能力有限。自主柔性重组使结构能够不断迭代地拆卸和重新组装，与单次预制方法相比，提供了更大的灵活性。由于组装和拆卸过程的不对称性，在组装时应考虑结构的拆卸，以确保结构可以回收和重新组装。这允许敏捷开发，显著减少构建过程中的时间和资源使用。在这项工作中，开发了一个灵活的重新组装框架，并开发了一个机器人平台来实现和测试这个框架与简单的乐高积木。这种新装配方法在时间、资源使用和成功概率方面的权衡可以通过使用成本函数来与其他制造方法进行比较来理解。

引用次数: 3

首页上一页

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

2018 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀