Consider a mobile robot that must navigate as quickly as possible to the global maxima of a function (e.g. density of seabed litter, pollutant concentration, wireless signal strength) defined over its operating area. This objective function is initially unknown and is assumed to be Lipschitz continuous. The limited velocity of the robot restricts the next samples to neighboring positions, and to avoid wasting time and energy, the robot’s path must be adapted as new information becomes available. The paper proposes two methods that use an upper bound on the objective to iteratively change the position targeted by the robot as new samples are acquired. The first method is FTW, which Turns When the best value seen so far of the objective Function is larger than the bound of the current target position. The second is FTWD, an extension of FTW that takes into account the Distance to the target. Convergence guarantees are provided for both methods, and a convergence rate is proven to characterize how fast the FTW suboptimality decreases as the number of samples grows. In a numerical study, FTWD greatly improves performance compared to FTW, outperforms two representative source-seeking baselines, and obtains results similar to a much more computationally intensive method that does not guarantee convergence. The relationship between FTW and FTWD is also confirmed in real-robot experiments, where a TurtleBot3 seeks the darkest point on a 2D grayscale map.
This paper explores the problem of (generalized) Nash equilibrium search in multi-cluster games with heterogeneous dynamics and multiple constraints. Within this research framework, each agent acquires information solely through local interactions with its neighbors and forms clusters based on similarity of interests. These clusters manifest dual relationships of cooperation and competition: agents within the same cluster enhance decision-making capabilities through cooperation, while different clusters compete to maximize their respective benefits. To delve into these complex interactions among clusters and the learning and evolution processes among agents, four distributed control algorithms suitable for various scenario requirements are designed and implemented. These algorithms ensure that each agent converges to a Nash equilibrium (NE) or generalized Nash equilibrium (GNE) of the multi-cluster system within predefined time points. Finally, we apply these algorithms to the connectivity control problem of unmanned aerial vehicle swarms with diverse dynamics, validating the theoretical results through comprehensive simulations.