期刊论文详细信息
Sensors
Learning to Rapidly Re-Contact the Lost Plume in Chemical Plume Tracing
Meng-Li Cao1  Qing-Hao Meng1  Jia-Ying Wang1  Bing Luo1  Ya-Qi Jing1  Shu-Gen Ma1 
[1] Institute of Robotics and Autonomous Systems, Tianjin Key Laboratory of Process Measurement and Control, School of Electrical Engineering and Automation, Tianjin University, Tianjin 300072, China; E-Mails:
关键词: chemical plume tracing;    reinforcement learning;    collaborative learning;    behavior-based robotics;   
DOI  :  10.3390/s150407512
来源: mdpi
PDF
【 摘 要 】

Maintaining contact between the robot and plume is significant in chemical plume tracing (CPT). In the time immediately following the loss of chemical detection during the process of CPT, Track-Out activities bias the robot heading relative to the upwind direction, expecting to rapidly re-contact the plume. To determine the bias angle used in the Track-Out activity, we propose an online instance-based reinforcement learning method, namely virtual trail following (VTF). In VTF, action-value is generalized from recently stored instances of successful Track-Out activities. We also propose a collaborative VTF (cVTF) method, in which multiple robots store their own instances, and learn from the stored instances, in the same database. The proposed VTF and cVTF methods are compared with biased upwind surge (BUS) method, in which all Track-Out activities utilize an offline optimized universal bias angle, in an indoor environment with three different airflow fields. With respect to our experimental conditions, VTF and cVTF show stronger adaptability to different airflow environments than BUS, and furthermore, cVTF yields higher success rates and time-efficiencies than VTF.

【 授权许可】

CC BY   
© 2015 by the authors; licensee MDPI, Basel, Switzerland.

【 预 览 】
附件列表
Files Size Format View
RO202003190014774ZK.pdf 3679KB PDF download
  文献评价指标  
  下载次数:12次 浏览次数:3次