NASA Logo

NTRS

NTRS - NASA Technical Reports Server

Back to Results
Optimal Wonderful Life Utility Functions in Multi-Agent SystemsThe mathematics of Collective Intelligence (COINs) is concerned with the design of multi-agent systems so as to optimize an overall global utility function when those systems lack centralized communication and control. Typically in COINs each agent runs a distinct Reinforcement Learning (RL) algorithm, so that much of the design problem reduces to how best to initialize/update each agent's private utility function, as far as the ensuing value of the global utility is concerned. Traditional team game solutions to this problem assign to each agent the global utility as its private utility function. In previous work we used the COIN framework to derive the alternative Wonderful Life Utility (WLU), and experimentally established that having the agents use it induces global utility performance up to orders of magnitude superior to that induced by use of the team game utility. The WLU has a free parameter (the clamping parameter) which we simply set to zero in that previous work. Here we derive the optimal value of the clamping parameter, and demonstrate experimentally that using that optimal value can result in significantly improved performance over that of clamping to zero, over and above the improvement beyond traditional approaches.
Document ID
20010071848
Acquisition Source
Ames Research Center
Document Type
Other
Authors
Wolpert, David H.
(NASA Ames Research Center Moffett Field, CA United States)
Tumer, Kagan
(NASA Ames Research Center Moffett Field, CA United States)
Swanson, Keith
Date Acquired
September 7, 2013
Publication Date
January 16, 2000
Subject Category
Computer Systems
Distribution Limits
Public
Copyright
Work of the US Gov. Public Use Permitted.
No Preview Available