期刊论文详细信息
IEEE Access
A Hybrid Multi-Task Learning Approach for Optimizing Deep Reinforcement Learning Agents
Nelson Vithayathil Varghese1  Qusay H. Mahmoud1 
[1] Department of Electrical, Computer and Software Engineering, Ontario Tech University, Oshawa, Canada;
关键词: Machine learning;    deep reinforcement learning;    neural networks;    transfer learning;    actor-critic;    multi-task worker;   
DOI  :  10.1109/ACCESS.2021.3065710
来源: DOAJ
【 摘 要 】

Driven by recent technological advancements within the field of artificial intelligence (AI), deep learning (DL) has been emerged as a promising representation learning technique across different machine learning (ML) classes, especially within the reinforcement learning (RL) arena. This new direction has given rise to the evolution of a new technological domain named deep reinforcement learning (DRL) that combines the high representational learning capabilities of DL with existing RL methods. Performance optimization achieved by RL-based intelligent agents designed with model-free-based approaches was majorly limited to systems with RL algorithms focused on learning a single task. The aforementioned approach was found to be quite data inefficient, whenever DRL agents needed to interact with more complex, data-rich environments. This is primarily due to the limited applicability of DRL algorithms to many scenarios across related tasks from the same distribution. One of the possible approaches to mitigate this issue is by adopting the method of multi-task learning. The objective of this research paper is to present a hybrid multi-task learning-oriented approach for the optimization of DRL agents operating within different but semantically similar environments with related tasks. The proposed framework will be built with multiple, individual actor-critic models functioning within independent environments and transferring knowledge among themselves through a global network to optimize performance. The empirical results obtained by the hybrid multi-task learning model on OpenAI Gym based Atari 2600 video gaming environment demonstrates that the proposed model enhances the performance of the DRL agent relatively in the range of 15% to 20% margin.

【 授权许可】

Unknown   

  文献评价指标  
  下载次数:0次 浏览次数:1次