Conference paper details
2017 Workshop on Materials and Engineering in Aeronautics
Construction of multi-agent mobile robots control system in the problem of persecution with using a modified reinforcement learning method based on neural networks
Materials science; Aerospace engineering
Patkin, M.L.^1 ; Rogachev, G.N.^1
Samara State Technical University, R-Samara, Russia^1
Keywords: Actor-Critic methods; Control problems; Management systems; Multi-agent interaction; Multiagent control; Observation vectors; Reinforcement learning method; Value functions
Others: https://iopscience.iop.org/article/10.1088/1757-899X/312/1/012018/pdf
DOI: 10.1088/1757-899X/312/1/012018
Source: IOP
【 Abstract 】

This paper considers a method for constructing a multi-agent control system for mobile robots based on reinforcement learning with deep neural networks. The control system is synthesized using reinforcement learning and a modified Actor-Critic method in which the Actor module is split into an Action Actor and a Communication Actor, so that each agent can simultaneously control its mobile robot and communicate with its partners. Communication consists of sending partners, at each step, a vector of real numbers that is added to their observation vector and affects their behaviour. The Actor and Critic functions are approximated by deep neural networks. The Critic's value function is trained using the TD-error method, and the Actor's function using DDPG. The Communication Actor's neural network is trained through gradients received from partner agents. An environment featuring cooperative multi-agent interaction was developed, and the method was applied in a computer simulation of a control problem in which two robots pursue two goals.
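
The data flow described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the class and function names (`Agent`, `td_error`, `linear`), the dimensions, and the random observations are all hypothetical. Plain linear maps stand in for the deep neural networks, "added to the observation vector" is assumed to mean concatenation, and no training is performed (the DDPG update and the gradients passed back from partner agents are omitted).

```python
import random


def linear(weights, x):
    """Apply a weight matrix (given as a list of rows) to the vector x."""
    return [sum(w * xi for w, xi in zip(row, x)) for row in weights]


def make_weights(rows, cols, rng):
    """Small random weight matrix; a stand-in for a deep network."""
    return [[rng.uniform(-0.1, 0.1) for _ in range(cols)] for _ in range(rows)]


class Agent:
    """Hypothetical agent whose Actor is split into two heads."""

    def __init__(self, obs_dim, msg_dim, act_dim, rng):
        in_dim = obs_dim + msg_dim  # own observation plus partner's message
        self.action_actor = make_weights(act_dim, in_dim, rng)
        self.comm_actor = make_weights(msg_dim, in_dim, rng)

    def step(self, obs, incoming_msg):
        # Assumption: the partner's message is concatenated to the observation.
        x = list(obs) + list(incoming_msg)
        action = linear(self.action_actor, x)      # Action Actor output
        outgoing_msg = linear(self.comm_actor, x)  # Communication Actor output
        return action, outgoing_msg


def td_error(reward, value, next_value, gamma=0.99, done=False):
    """One-step TD error: r + gamma * V(s') - V(s)."""
    target = reward if done else reward + gamma * next_value
    return target - value


rng = random.Random(0)
OBS_DIM, MSG_DIM, ACT_DIM = 4, 2, 2
agents = [Agent(OBS_DIM, MSG_DIM, ACT_DIM, rng) for _ in range(2)]
messages = [[0.0] * MSG_DIM for _ in agents]  # no message before the first step

for t in range(3):
    # Random placeholders for what a simulated environment would return.
    observations = [[rng.uniform(-1, 1) for _ in range(OBS_DIM)] for _ in agents]
    new_messages = []
    for i, agent in enumerate(agents):
        partner = 1 - i
        action, msg = agent.step(observations[i], messages[partner])
        new_messages.append(msg)
    messages = new_messages  # delivered to partners at the next step
```

In a full system the Critic would use `td_error` as its training signal, and each Communication Actor would be updated from the gradient of its partner's objective with respect to the received message; both steps would need an automatic differentiation library and are left out here.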
