期刊论文详细信息
Journal of the Brazilian Computer Society
Compulsory Flow Q-Learning: an RL algorithm for robot navigation based on partial-policy and macro-states
Costa, Anna Helena Reali1  Universidade de São Paulo, São Paulo, Brasil1  Silva, Valdinei Freire da1 
关键词: machine learning;    reinforcement learning;    abstraction;    partial-policy;    macro-states.;   
DOI  :  10.1007/BF03194507
学科分类:农业科学(综合)
来源: Springer U K
PDF
【 摘 要 】

Reinforcement Learning is carried out on-line, through trial-and-error interactions of the agent with the environment, which can be very time consuming when considering robots. In this paper we contribute a new learning algorithm, CFQ-Learning, which uses macro-states, a low-resolution discretisation of the state space, and a partial-policy to get around obstacles, both of them based on the complexity of the environment structure. The use of macro-states avoids convergence of algorithms, but can accelerate the learning process. In the other hand, partial-policies can guarantee that an agent fulfils its task, even through macro-state. Experiments show that the CFQ-Learning performs a good balance between policy quality and learning rate.

【 授权许可】

Unknown   

【 预 览 】
附件列表
Files Size Format View
RO201912010163974ZK.pdf 1115KB PDF download
  文献评价指标  
  下载次数:9次 浏览次数:16次