Driving safety is significant to building a people-oriented and harmonious society, Tires are one of the key components of a vehicle and the character information on the tire sidewall is critical to their storage and usage. However, due to the diverse and differentiated features of typographic fonts, simultaneously extracting comprehensive characteristics is an extremely challenging task. To effectively break through these performance degradation issues, a multi-scale tire sidewall text region detection algorithm based on YOLOv5 is introduced, called YOLOT, which fuses comprehensive feature information in both width and depth directions. In this study, we firstly propose the Width and Depth Awareness (WDA) module in the text region detection field and successfully integrated it with the FPN structure to form the WDA-FPN. The purpose of WDA-FPN is to empower the network to capture multi-scale and multi-shape features in images, thereby augmenting the algorithm’s abstraction and representation of image features and concurrently boosting its robustness and generalization performance. Experimental findings indicate that, compared to the primary algorithm, YOLOT achieves significant improvement in accuracy, providing a higher detection reliability. The dataset and code for the paper are available at: https://github.com/Cloude-dehua/YOLOT.
Unmanned vehicles have seen a significant increase in a wide variety of fields such as for logistics, agriculture and other commercial applications. Controlling swarms of unmanned vehicles is a challenging task that requires complex autonomous control systems. Reinforcement learning has been proposed as a solution to this challenge. We propose an approach based on agent masking to enable a simple Deep Q-Network algorithm to scale on large swarms while training on relatively smaller swarms. We train our approach using multiple swarm sizes and learning rates and compare our results using metrics such as the number of collisions. We also compare the ability of our approach to scale on swarms ranging from five to 25 agents using metrics and visual analysis. Our proposed solution was able to guide a swarm of up to 100 agents to a target while keeping a good swarm cohesion and avoiding collision.
The Mobile Robot Path Problem looks to find the optimal shortest path from the starting point to the target point with collision-free for a mobile robot. This is a popular issue in robotics and in this paper the environment is considered as static and represented as a bidirectional grid map. Besides, the novel optimal method Dhouib-Matrix-SPP (DM-SPP) is applied to create the optimal shortest path for a mobile robot in a static environment. DM-SPP is a greedy method based on a column row navigation in the distance matrix and characterized by its rapidity to solve sparse graphs. The comparative analysis is conducted by applying DM-SPP on thirteen test cases and comparing its results to the results given by four metaheuristics the Max-Min Ant System, the Ant System with punitive measures, the A* and the Improved Hybrid A*. The outcomes acquired from different scenarios indicate that the proposed DM-SPP method can rapidly outperform the four predefined artificial intelligence methods.
Knee joint rehabilitation exercise refers to a therapeutic procedure of a patient having dysfunctions in certain abilities to move knee joint due to some medical conditions like trauma or paralysis. The exercise is basically a series of repeated assistive physical movements within the range of motion (RoM) of the joint. Reflex action of limbs during RoM exercise causes inappropriate balance of load which may cause secondary injuries, such as damages of muscle or tendon tissues. Establishing correlation between impedance data and limb motions is important to solve this problem. This paper aims to design and modeling of a robotic arm with an original approach in control strategy which is developed based on the correlation in between the joint-impedances and joint-motion characteristics during exercise. The knee joint impedances are estimated based on the internal feedback of the system dynamics, that lead to design the torque compensator to improve the overall control signals in real time. This paper also demonstrates the characteristics of various responses of the system during exercise with human subject. Results have reflected good performances with low position and velocity tracking errors, and during hold phase; and and during motion phse. Though, the limitation of the prototype is its current RoM (limited to –), the system has potential in the application of RoM exercise for paraplegic or monoplegic patients.
In the realm of speech emotion recognition, researchers strive to refine representation methods for improved emotional information capture. Traditional one-dimensional time series classification falls short in expressing intricate emotional patterns present in speech signals, posing challenges in accuracy and robustness. This study introduces an innovative algorithm leveraging Hilbert curves to transform one-dimensional speech data into two-dimensional form, enhancing feature extraction accuracy. A tiling module based on Hilbert curve maximizes Hilbert curve arrangements for improved emotional information capture. Results reveal spatial efficiency gains up to 23,195 times pixel units, enhancing data storage. With an exceptional 98.73% accuracy, the proposed approach traditional methods, affirming its superior emotion classification performance on the same dataset. These empirical findings underscore the effectiveness of our proposed method in advancing speech emotion recognition.
These days, plastic trash is exponentially overwhelming our waterways. The catastrophe has attracted global attention at this point. As a result, protecting the environment on the water's surface has received increasing focus. Currently, manpower can be used to clean up contaminated water bodies like ponds, rivers, and oceans. Using the current cleaning approach results in low efficiency and hazard. The detection, collection, sorting, and removal of plastic trash from such water surfaces has been the subject of relatively little robotic research, despite the dire circumstances. From private sources, there are very few individual efforts to be found. In order to attain great efficiency without human assistance or operation, a fully autonomous water surface cleaning robot is proposed in this study. The robot was created to adapt to any type of water body found in the real world. An efficient object identification machine learning technique can be suggested for the creation of autonomous cleaning robots. This study improved the Single Short Detection (SSD) method to recognise objects accurately. Because of the enhanced detection techniques, the robot is able to collect trash on its own. With a mean average precision (mAP) of 94.099 % and a detection speed of up to 64.67 frames per second, experimental findings show that the enhanced SSD has exceptional detection speed and accuracy.
UAVs are often limited by limited resources when performing flight tasks, especially the contradiction between storage resources and computing resources when the huge YOLOv3 model is deployed on the edge UAVs. In this paper, we tend to compress YOLOv3 model in different aspects to achieve load availability at the edge. In this paper, deep separable convolution is introduced to reduce the computation of the model. Then, PR regularization term is used as the regularization term of sparse training to better distinguish scaling factors, and then the hybrid pruning combining channel pruning and layer pruning is carried out on the model according to scaling factors, in order to reduce the number of model parameters and the amount of calculation. Finally, since the training data is a 32-bit floating point number, DoReFa-Net quantization method is used to quantify the model, so as to compress the storage capacity of the model. The experimental results show that the compression scheme proposed in this paper can effectively reduce the number of parameters by 97.5 % and the calculation amount by 82.3 %, and can maintain the original detection efficiency of UAVs.

