会议论文详细信息
2019 2nd International Conference on Advanced Materials, Intelligent Manufacturing and Automation
Adaptive Q-learning Algorithm for AUV Route Planning
Zhu, Yingying^1 ; Dai, Changan^2 ; Xia, Jiahao^3 ; Wu, Haowei^4 ; An, Li^3
Basic Teaching and Research Department of Command and Training, Armed Police Special Police Academy, Beijing
102211, China^1
China National Aeronautical Radio Electronics Research Institute Department, Shanghai
200233, China^2
Science and Technology on Underwater Vehicles Laboratory, Harbin Engineering University, Harbin
150001, China^3
Beijing Institute of Astronautically Systems Engineering, Beijing
100076, China^4
关键词: Change trends;    Environmental complexity;    Exploration and exploitation;    Learning convergence;    Learning environments;    Local optimal solution;    Q-learning algorithms;    Route planning;   
Others  :  https://iopscience.iop.org/article/10.1088/1757-899X/569/5/052029/pdf
DOI  :  10.1088/1757-899X/569/5/052029
来源: IOP
PDF
【 摘 要 】

When applying Q-learning algorithm to AUV route planning, it is difficult to strike a balance between exploration and exploitation. Usually, the fixed greedy rate or the greedy rate with fixed change trend is adopted according to experience. However, the value of greedy rate in the above method cannot be matched with the learning environment, and often occurs a problem of falling into a local optimal solution or slow learning convergence. In order to solve the above problems, we proposed an adaptive Q-learning algorithm. It guides the greedy rate with environmental complexity, and the environmental complexity degree of AUV is evaluated by the environmental complexity mathematical model. By setting up the simulation experiment of two-dimensional environment, the fixed greedy rate and the greedy rate with fixed change trend is compared. The simulation results show that the adaptive Q-learning algorithm considering environmental complexity converges faster and is less prone to fall into local optimal solutions.

【 预 览 】
附件列表
Files Size Format View
Adaptive Q-learning Algorithm for AUV Route Planning 540KB PDF download
  文献评价指标  
  下载次数:18次 浏览次数:87次