NEUROCOMPUTING | Volume: 345 |
Obtaining fault tolerance avoidance behavior using deep reinforcement learning | |
Article | |
Aznar, Fidel [1]; Pujol, Mar [1]; Rizo, Ramon [1] | |
[1] Univ Alicante, Dept Comp Sci & Artificial Intelligence, E-03080 Alicante, Spain | |
Keywords: Deep reinforcement learning; Obstacle avoidance; Fault tolerance | |
DOI : 10.1016/j.neucom.2018.11.090 | |
Source: Elsevier | |
【 Abstract 】
In this article, a mapless movement policy for mobile agents, designed specifically to be fault-tolerant, is presented. The provided policy, which is learned using deep reinforcement learning, has advantages compared to the usual mapless policies: this policy is capable of handling a robot even when some of its sensors are broken. It is an end-to-end policy based on three neuronal models capable not only of moving the robot and maximizing the coverage of the environment but also of learning the best movement behavior to adapt it to its perception needs. A custom robot, for which none of the readings of the sensors overlap each other, has been used. This setup makes it possible to determine the operation of a robust failure policy, since the failure of a sensor unequivocally affects the perceptions. The proposed system exhibits several advantages in terms of robustness, extensibility and utility. The system has been trained and tested exhaustively in a simulator, obtaining very good results. It has also been transferred to real robots, verifying the generalization and the good functioning of our model in real environments. (C) 2019 Elsevier B.V. All rights reserved.
【 License 】
Free