NASA Logo

NTRS

NTRS - NASA Technical Reports Server

Back to Results
Use of Inverse Reinforcement Learning for Identity PredictionWe adopt Markov Decision Processes (MDP) to model sequential decision problems, which have the characteristic that the current decision made by a human decision maker has an uncertain impact on future opportunity. We hypothesize that the individuality of decision makers can be modeled as differences in the reward function under a common MDP model. A machine learning technique, Inverse Reinforcement Learning (IRL), was used to learn an individual's reward function based on limited observation of his or her decision choices. This work serves as an initial investigation for using IRL to analyze decision making, conducted through a human experiment in a cyber shopping environment. Specifically, the ability to determine the demographic identity of users is conducted through prediction analysis and supervised learning. The results show that IRL can be used to correctly identify participants, at a rate of 68% for gender and 66% for one of three college major categories.
Document ID
20110012064
Acquisition Source
Langley Research Center
Document Type
Conference Paper
Authors
Hayes, Roy
(Virginia Univ. Charlottesville, VA, United States)
Bao, Jonathan
(Virginia Univ. Charlottesville, VA, United States)
Beling, Peter
(Virginia Univ. Charlottesville, VA, United States)
Horowitz, Barry
(Virginia Univ. Charlottesville, VA, United States)
Date Acquired
August 25, 2013
Publication Date
March 1, 2011
Publication Information
Publication: Selected Papers and Presentations Presented at MODSIM World 2010 Conference Expo
Subject Category
Systems Analysis And Operations Research
Distribution Limits
Public
Copyright
Work of the US Gov. Public Use Permitted.
No Preview Available