Modular Reinforcement Learning for Autonomous UAV Flight Control

Jongkwan Choi, Hyeon Min Kim, Ha Jun Hwang, Yong Duk Kim, Chang Ouk Kim

Research output: Contribution to journalArticlepeer-review

5 Citations (Scopus)

Abstract

Recently, research on unmanned aerial vehicles (UAVs) has increased significantly. UAVs do not require pilots for operation, and UAVs must possess autonomous flight capabilities to ensure that they can be controlled without a human pilot on the ground. Previous studies have mainly focused on rule-based methods, which require specialized personnel to create rules. Reinforcement learning has been applied to research on UAV autonomous flight; however, it does not include six-degree-of-freedom (6-DOF) environments and lacks realistic application, resulting in difficulties in performing complex tasks. This study proposes a method of efficient learning by connecting two different maneuvering methods using modular learning for autonomous UAV flights. The proposed method divides complex tasks into simpler tasks, learns them individually, and then connects them in order to achieve faster learning by transferring information from one module to another. Additionally, the curriculum learning concept was applied, and the difficulty level of individual tasks was gradually increased, which strengthened the learning stability. In conclusion, modular learning and curriculum learning methods were used to demonstrate that UAVs can effectively perform complex tasks in a realistic, 6-DOF environment.

Original languageEnglish
Article number418
JournalDrones
Volume7
Issue number7
DOIs
Publication statusPublished - 2023 Jul

Bibliographical note

Publisher Copyright:
© 2023 by the authors.

All Science Journal Classification (ASJC) codes

  • Control and Systems Engineering
  • Information Systems
  • Aerospace Engineering
  • Computer Science Applications
  • Artificial Intelligence

Fingerprint

Dive into the research topics of 'Modular Reinforcement Learning for Autonomous UAV Flight Control'. Together they form a unique fingerprint.

Cite this