期刊论文

【摘要】

Developing general purpose algorithms for learning an accurate model of dynamical systems from example traces of the system is still a challenging research problem. Predictive State Representation (PSR) models represent the state of a dynamical system as a set of predictions about future events. Our work focuses on improving Temporal Difference Networks (TD Nets), a general class of predictive state models. We adapt the internal structure of the TD Net and we present an improved algorithm for learning a TD Net model from experience in the environment. The new algorithm accepts a set of known facts about the environment and uses those facts to accelerate the learning. These facts can come from another learning algorithm (as in this study) or from a designerâs prior knowledge about the environment. Experiments demonstrate that using the new structure and learning algorithm improves the accuracy of the TD Net models. When tested in an in finite environment, our new algorithm outperforms all of the standard PSR learning algorithms.

【授权许可】

Unknown

【预览】

附件列表
Files	Size	Format	View
RO201911300927428ZK.pdf	296KB	PDF	download

Journal of Computer Science
INCORPORATING PRIOR KNOWLEDGE INTO TEMPORAL DIFFERENCE NETWORKS \| Science Publications

James Harpe¹ Britton Wolfe¹
关键词: Predictive State; Temporal Difference; Modeling; Dynamical Systems;
DOI : 10.3844/jcssp.2014.2211.2219
学科分类：计算机科学（综合）
来源: Science Publications
PDF


	文献评价指标
	下载次数：16次	浏览次数：33次

【 摘 要 】

【 授权许可】

【 预 览 】

【摘要】

【授权许可】

【预览】