NASA Logo

NTRS

NTRS - NASA Technical Reports Server

Back to Results
Collision Avoidance Approach Using Deep Reinforcement LearningA method to enable autonomous robots moving in a 2D space collision free motivates the purposed approach for collision avoidance for autonomous UAM vehicles. Challenges of autonomous collision free navigation for both problems are similar. Agents in each environment do not know the intent, or goal, of the other. Finding the time efficient paths require some level of anticipation with neighboring agents which is computationally expensive. In the original work, these obstacles were overcome with a novel application of deep reinforcement learning which offloads the online computation to an offline learning algorithm. A value network that encodes the estimated time to the goal given the agent’s state and the observable portion of the other agent’s state is trained on a baseline policy and further refined with reinforcement learning to promote time efficient collision free navigation. Online, the value network efficiently informs the agent’s decision making in the face of uncertainty of the other agent’s next move. In this paper, challenges extending this methodology to the 3D environment of autonomous UAM vehicles with kinematic constraints are discussed and initial results shown.
Document ID
20210025617
Acquisition Source
Langley Research Center
Document Type
Conference Paper
Authors
Barton J Bacon
(Langley Research Center Hampton, Virginia, United States)
Date Acquired
December 7, 2021
Subject Category
Cybernetics, Artificial Intelligence And Robotics
Meeting Information
Meeting: AIAA SciTech Forum
Location: San Diego, CA
Country: US
Start Date: January 3, 2022
End Date: January 7, 2022
Sponsors: American Institute of Aeronautics and Astronautics
Funding Number(s)
WBS: 109492.02.07.07
Distribution Limits
Public
Copyright
Work of the US Gov. Public Use Permitted.
Technical Review
NASA Peer Committee
Keywords
Collision Avoidance
Deep Reinforcement Learning
No Preview Available