学位论文详细信息
A spiking neural network of state transition probabilities in model-based reinforcement learning
reinforcement learning;model-based reinforcement learning;spiking neural model;state transition probability;decision task
Shein, Mariahaffiliation1:Faculty of Mathematics ; advisor:Eliasmith, Chris ; Eliasmith, Chris ;
University of Waterloo
关键词: spiking neural model;    Master Thesis;    decision task;    model-based reinforcement learning;    state transition probability;    reinforcement learning;   
Others  :  https://uwspace.uwaterloo.ca/bitstream/10012/12574/3/Shein_Mariah.pdf
瑞士|英语
来源: UWSPACE Waterloo Institutional Repository
PDF
【 摘 要 】

The development of the field of reinforcement learning was based on psychological studies of the instrumental conditioning of humans and other animals. Recently, reinforcement learning algorithms have been applied to neuroscience to help characterize neural activity and animal behaviour in instrumental conditioning tasks. A specific example is the hybrid learner developed to match human behaviour on a two-stage decision task. This hybrid learner is composed of a model-free and a model-based system. The model presented in this thesis is an implementation of that model-based system where the state transition probabilities and Q-value calculations use biologically plausible spiking neurons. Two variants of the model demonstrate the behaviour when the state transition probabilities are encoded in the network at the beginning of the task, and when these probabilities are learned over the course of the task. Various parameters that affect the behaviour of the model are explored, and ranges of these parameters that produce characteristically model-based behaviour are found. This work provides an important first step toward understanding how a model-based system in the human brain could be implemented, and how this system contributes to human behaviour.

【 预 览 】
附件列表
Files Size Format View
A spiking neural network of state transition probabilities in model-based reinforcement learning 4868KB PDF download
  文献评价指标  
  下载次数:20次 浏览次数:32次