期刊论文详细信息
JOURNAL OF THEORETICAL BIOLOGY 卷:293
Evolution of cooperation facilitated by reinforcement learning with adaptive aspiration levels
Article
Tanabe, Shoma3  Masuda, Naold1,2 
[1] Univ Tokyo, Grad Sch Informat Sci & Technol, Bunkyo Ku, Tokyo 1138656, Japan
[2] Japan Sci & Technol Agcy, PRESTO, Kawaguchi, Saitama 3320012, Japan
[3] Univ Tokyo, Fac Engn, Bunkyo Ku, Tokyo 1138656, Japan
关键词: Iterated prisoner's dilemma;    Baldwin effect;   
DOI  :  10.1016/j.jtbi.2011.10.020
来源: Elsevier
PDF
【 摘 要 】

Repeated interaction between individuals is the main mechanism for maintaining cooperation in social dilemma situations. Variants of tit-for-tat (repeating the previous action of the opponent) and the win-stay lose-shift strategy are known as strong competitors in iterated social dilemma games. On the other hand, real repeated interaction generally allows plasticity (i.e., learning) of individuals based on the experience of the past. Although plasticity is relevant to various biological phenomena, its role in repeated social dilemma games is relatively unexplored. In particular, if experience-based learning plays a key role in promotion and maintenance of cooperation, learners should evolve in the contest with nonlearners under selection pressure. By modeling players using a simple reinforcement learning model, we numerically show that learning enables the evolution of cooperation. We also show that numerically estimated adaptive dynamics appositely predict the outcome of evolutionary simulations. The analysis of the adaptive dynamics enables us to capture the obtained results as an affirmative example of the Baldwin effect, where learning accelerates the evolution to optimality. (C) 2011 Elsevier Ltd. All rights reserved.

【 授权许可】

Free   

【 预 览 】
附件列表
Files Size Format View
10_1016_j_jtbi_2011_10_020.pdf 1254KB PDF download
  文献评价指标  
  下载次数:2次 浏览次数:0次