NASA Logo

NTRS

NTRS - NASA Technical Reports Server

Back to Results
Unifying Temporal and Structural Credit Assignment ProblemsSingle-agent reinforcement learners in time-extended domains and multi-agent systems share a common dilemma known as the credit assignment problem. Multi-agent systems have the structural credit assignment problem of determining the contributions of a particular agent to a common task. Instead, time-extended single-agent systems have the temporal credit assignment problem of determining the contribution of a particular action to the quality of the full sequence of actions. Traditionally these two problems are considered different and are handled in separate ways. In this article we show how these two forms of the credit assignment problem are equivalent. In this unified frame-work, a single-agent Markov decision process can be broken down into a single-time-step multi-agent process. Furthermore we show that Monte-Carlo estimation or Q-learning (depending on whether the values of resulting actions in the episode are known at the time of learning) are equivalent to different agent utility functions in a multi-agent system. This equivalence shows how an often neglected issue in multi-agent systems is equivalent to a well-known deficiency in multi-time-step learning and lays the basis for solving time-extended multi-agent problems, where both credit assignment problems are present.
Document ID
20040068179
Acquisition Source
Ames Research Center
Document Type
Preprint (Draft being sent to journal)
Authors
Agogino, Adrian K.
(NASA Ames Research Center Moffett Field, CA, United States)
Tumer, Kagan
(NASA Ames Research Center Moffett Field, CA, United States)
Date Acquired
September 7, 2013
Publication Date
January 1, 2004
Subject Category
Mathematical And Computer Sciences (General)
Meeting Information
Meeting: Autonomous Agents and Multi-Agent Systems Conference
Location: New York, NY
Country: United States
Start Date: July 19, 2004
End Date: July 23, 2004
Distribution Limits
Public
Copyright
Work of the US Gov. Public Use Permitted.
No Preview Available