Effective lazy training method for deep Q-network in obstacle avoidance and path planning

Juan Wu, Seabyuk Shin, Cheong Gil Kim, Shin Dug Kim

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

21 Citations (Scopus)

Abstract

Deep reinforcement learning combines reinforcement learning with neural networks and is used in a variety of applications. This paper proposes an effective lazy training method for deep reinforcement learning, specifically for the deep Q-network, which combines a neural network with Q-learning, applied to obstacle avoidance and path planning. The proposed method reduces overall training time through two techniques: a lazy learning method and a method that removes unnecessary repetitions in the training step. Together, these two techniques eliminate a significant portion of the total execution time without losing any required accuracy. The method is evaluated on obstacle avoidance and path planning tasks in which an agent trapped in an unknown environment tries, through self-learning, to find the shortest collision-free path to the destination. The experimental results show that the proposed method reduces training time by 53.38% on average compared with the traditional method, with no performance loss, and makes the training procedure more stable.
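As a rough illustration of the task setting only, the sketch below trains an agent to find a shortest collision-free path on a small grid. It uses plain tabular Q-learning as a stand-in for a deep Q-network and does not implement the lazy training method, whose details are not given in the abstract. The grid layout, reward values, hyperparameters, and the function names `step`, `train`, and `greedy_path` are all assumptions made for this example.

```python
import random

# Hypothetical 4x4 grid (an assumption): 0 = free cell, 1 = obstacle.
GRID = [
    [0, 0, 0, 0],
    [0, 1, 1, 0],
    [0, 0, 0, 0],
    [0, 1, 0, 0],
]
START, GOAL = (0, 0), (3, 3)
ACTIONS = [(-1, 0), (1, 0), (0, -1), (0, 1)]  # up, down, left, right


def step(state, action):
    """Apply an action and return (next_state, reward, done)."""
    r, c = state[0] + action[0], state[1] + action[1]
    # Leaving the grid or entering an obstacle counts as a collision:
    # the agent stays in place and is penalized (assumed reward of -5).
    if not (0 <= r < 4 and 0 <= c < 4) or GRID[r][c] == 1:
        return state, -5.0, False
    if (r, c) == GOAL:
        return (r, c), 10.0, True
    return (r, c), -1.0, False  # per-step cost favors shorter paths


def train(episodes=2000, alpha=0.5, gamma=0.9, epsilon=0.2, seed=0):
    """Tabular Q-learning with epsilon-greedy exploration."""
    rng = random.Random(seed)
    q = {(r, c): [0.0] * 4 for r in range(4) for c in range(4)}
    for _ in range(episodes):
        state = START
        for _ in range(100):  # cap on episode length
            if rng.random() < epsilon:
                a = rng.randrange(4)
            else:
                a = max(range(4), key=lambda i: q[state][i])
            nxt, reward, done = step(state, ACTIONS[a])
            # Standard Q-learning target: r + gamma * max_a' Q(s', a').
            target = reward if done else reward + gamma * max(q[nxt])
            q[state][a] += alpha * (target - q[state][a])
            state = nxt
            if done:
                break
    return q


def greedy_path(q, max_steps=20):
    """Follow the learned greedy policy from START."""
    state, path = START, [START]
    for _ in range(max_steps):
        a = max(range(4), key=lambda i: q[state][i])
        state, _, done = step(state, ACTIONS[a])
        path.append(state)
        if done:
            break
    return path


if __name__ == "__main__":
    print(greedy_path(train()))
```

A deep Q-network would replace the table `q` with a neural network that approximates the same action values; the training-time reductions reported in the abstract concern that network-based setting.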

Original language: English
Title of host publication: 2017 IEEE International Conference on Systems, Man, and Cybernetics, SMC 2017
Publisher: Institute of Electrical and Electronics Engineers Inc.
Pages: 1799-1804
Number of pages: 6
ISBN (Electronic): 9781538616451
Publication status: Published - 2017 Nov 27
Event: 2017 IEEE International Conference on Systems, Man, and Cybernetics, SMC 2017 - Banff, Canada
Duration: 2017 Oct 5 → 2017 Oct 8

Publication series

Name: 2017 IEEE International Conference on Systems, Man, and Cybernetics, SMC 2017
Volume: 2017-January

Other

Other: 2017 IEEE International Conference on Systems, Man, and Cybernetics, SMC 2017
Country/Territory: Canada
City: Banff
Period: 17/10/5 → 17/10/8

Bibliographical note

Funding Information:
This work was partially supported by the Institute for Information and Communications Technology Promotion (IITP) grant funded by the Korea government (MSIP) (R0124-16-0002, Emotional Intelligence Technology to Infer Human Emotion and Carry on Dialogue Accordingly) and the Basic Science Research Program through the National Research Foundation of Korea (NRF) funded by the Ministry of Science, ICT and Future Planning (NRF-2015R1A2A2A01007668).

Publisher Copyright:
© 2017 IEEE.

All Science Journal Classification (ASJC) codes

  • Artificial Intelligence
  • Computer Science Applications
  • Human-Computer Interaction
  • Control and Optimization
