NASA Logo

NTRS

NTRS - NASA Technical Reports Server

Back to Results
Time-Extended Payoffs for Collectives of Autonomous AgentsA collective is a set of self-interested agents which try to maximize their own utilities, along with a a well-defined, time-extended world utility function which rates the performance of the entire system. In this paper, we use theory of collectives to design time-extended payoff utilities for agents that are both aligned with the world utility, and are "learnable", i.e., the agents can readily see how their behavior affects their utility. We show that in systems where each agent aims to optimize such payoff functions, coordination arises as a byproduct of the agents selfishly pursuing their own goals. A game theoretic analysis shows that such payoff functions have the net effect of aligning the Nash equilibrium, Pareto optimal solution and world utility optimum, thus eliminating undesirable behavior such as agents working at cross-purposes. We then apply collective-based payoff functions to the token collection in a gridworld problem where agents need to optimize the aggregate value of tokens collected across an episode of finite duration (i.e., an abstracted version of rovers on Mars collecting scientifically interesting rock samples, subject to power limitations). We show that, regardless of the initial token distribution, reinforcement learning agents using collective-based payoff functions significantly outperform both natural extensions of single agent algorithms and global reinforcement learning solutions based on "team games".
Document ID
20030107270
Acquisition Source
Ames Research Center
Document Type
Preprint (Draft being sent to journal)
Authors
Tumer, Kagan
(NASA Ames Research Center Moffett Field, CA, United States)
Agogino, Adrian K.
(Texas Univ. Austin, TX, United States)
Date Acquired
September 7, 2013
Publication Date
May 2, 2002
Subject Category
Cybernetics, Artificial Intelligence And Robotics
Distribution Limits
Public
Copyright
Public Use Permitted.
No Preview Available