NASA Logo

NTRS

NTRS - NASA Technical Reports Server

Back to Results
Interplanetary Low-Thrust Design Using Proximal Policy OptimizationThis paper aims to demonstrate a reinforcement learning technique for developing complex, decision-making policies capable of planning interplanetary transfers.Using Proximal Policy Optimization (PPO), a neural network agent is trained to produce a closed-loop controller capable of transfers between Earth and Mars.The agent is trained in an environment that utilizes a medium fidelity solar electric propulsion model and a real ephemeris model of the Earth and Mars. The results are compared against those generated by the Evolutionary Mission Trajectory Generator (EMTG) tool.
Document ID
20190029151
Acquisition Source
Goddard Space Flight Center
Document Type
Conference Paper
Authors
Miller, Daniel
(Massachusetts Institute of Technology (MIT) Cambridge, MA, United States)
Englander, Jacob A.
(NASA Goddard Space Flight Center Greenbelt, MD, United States)
Linares, Richard
(Massachusetts Institute of Technology (MIT) Cambridge, MA, United States)
Date Acquired
August 19, 2019
Publication Date
August 11, 2019
Subject Category
Cybernetics, Artificial Intelligence And Robotics
Astrodynamics
Report/Patent Number
GSFC-E-DAA-TN71225
Meeting Information
Meeting: 2019 AAS/AIAA Astrodynamics Specialist Conference
Location: Portland, ME
Country: United States
Start Date: August 11, 2019
End Date: August 15, 2019
Sponsors: American Astronautical Society (AAS-HQ)
Funding Number(s)
CONTRACT_GRANT: 80NSSC18K1141
Distribution Limits
Public
Copyright
Public Use Permitted.
Technical Review
Single Expert
Keywords
proximal policy optimization
reinforcement learning
No Preview Available