学位论文详细信息
Model-Based Bayesian Sparse Sampling for Data Efficient Control
machine learning;reinforcement learning;artificial intelligence
Tse, Timmy Rong Tianadvisor:Poupart, Pascal ; advisor:Law, Edith ; affiliation1:Faculty of Mathematics ; Poupart, Pascal ; Law, Edith ;
University of Waterloo
关键词: Master Thesis;    machine learning;    artificial intelligence;    reinforcement learning;   
Others  :  https://uwspace.uwaterloo.ca/bitstream/10012/14774/3/Tse_TimmyRongTian.pdf
瑞士|英语
来源: UWSPACE Waterloo Institutional Repository
PDF
【 摘 要 】

In this work, we propose a novel Bayesian-inspired model-based policy search algorithm for data efficient control. In contrast to other model-based approaches, our algorithm makes use of approximate Gaussian processes in the form of random Fourier features for fast online systems identification and computationally efficient posterior updates via rank one Cholesky updates. Furthermore, fast and tractable posterior updates permits policy optimization to leverage knowledge from posterior evolution tracking for a directed Bayesian approach to the exploration-exploitation dilemma. To address the optimization formulation involving belief monitoring as well as the potentiality of a loss surface with zero gradients everywhere, we leverage a blackbox optimizer in the form of covariance matrix adaptation evolution strategy (CMA-ES). We test our algorithm on four challenging control tasks and report the superior data efficiency as well as the exploration capabilities of our model.

【 预 览 】
附件列表
Files Size Format View
Model-Based Bayesian Sparse Sampling for Data Efficient Control 2256KB PDF download
  文献评价指标  
  下载次数:19次 浏览次数:26次