Collision Avoidance Approach Using Deep Reinforcement Learning

Barton J Bacon

A method to enable autonomous robots moving in a 2D space collision free motivates the purposed approach for collision avoidance for autonomous UAM vehicles. Challenges of autonomous collision free navigation for both problems are similar. Agents in each environment do not know the intent, or goal, of the other. Finding the time efficient paths require some level of anticipation with neighboring agents which is computationally expensive. In the original work, these obstacles were overcome with a novel application of deep reinforcement learning which offloads the online computation to an offline learning algorithm. A value network that encodes the estimated time to the goal given the agent’s state and the observable portion of the other agent’s state is trained on a baseline policy and further refined with reinforcement learning to promote time efficient collision free navigation. Online, the value network efficiently informs the agent’s decision making in the face of uncertainty of the other agent’s next move. In this paper, challenges extending this methodology to the 3D environment of autonomous UAM vehicles with kinematic constraints are discussed and initial results shown.

Document ID

20210025617

Acquisition Source

Langley Research Center

Document Type

Conference Paper

Authors

Date Acquired

December 7, 2021

Subject Category

Meeting Information

Meeting: AIAA SciTech Forum

Location: San Diego, CA

Country: US

Start Date: January 3, 2022

End Date: January 7, 2022

Sponsors: American Institute of Aeronautics and Astronautics

Funding Number(s)

Distribution Limits

Public

Work of the US Gov. Public Use Permitted.

Technical Review

NASA Peer Committee

Keywords

Available Downloads

Name

Type

Collision Advoidance Approach using Deep Reinforcement Learning V3.pdf

STI

No Preview Available

NTRS

NTRS - NASA Technical Reports Server

Available Downloads

Related Records