Research talk: Evaluating human-like navigation in 3D video games
- Raluca Georgescu, Ida Momennejad | Microsoft Research Cambridge, Microsoft Research NYC
- Microsoft Research Summit 2021 | Reinforcement Learning
On the path to developing agents that learn complex human-like behavior, a key challenge is the need to quickly and accurately quantify human-likeness. While human assessments of such behavior can be highly accurate, they are slow and difficult to scale. The researchers address these limitations through a novel automated Navigation Turing Test (NTT) that learns to predict human judgments of human-likeness. They demonstrate the effectiveness of their automated NTT on a navigation task in a complex 3D environment. They investigate six classification models to shed light on the types of architectures best suited to this task, and they validate them against data collected through a human NTT. The best models achieve high accuracy when distinguishing true human behavior from agent behavior. At the same time, the researchers show that predicting finer-grained human assessments of agents’ progress towards human-like behavior remains unsolved. Their work takes an important step towards agents that more effectively learn complex human-like behavior.
Learn more about the 2021 Microsoft Research Summit: https://Aka.ms/researchsummit
Raluca Stevenson
Senior Research Scientist
Ida Momennejad
Principal Researcher in Reinforcement Learning
Reinforcement Learning
Opening remarks: Reinforcement Learning
- Katja Hofmann
Research talk: Evaluating human-like navigation in 3D video games
- Raluca Georgescu
- Ida Momennejad
Research talk: Maia Chess: A human-like neural network chess engine
- Reid McIlroy-Young
Fireside chat: Opportunities and challenges in human-oriented AI
- Ashley Llorens
- Katja Hofmann
- Siddhartha Sen
Research talk: Making deep reinforcement learning industrially applicable
- Jiang Bian
- Tie-Yan Liu
Panel: Generalization in reinforcement learning
- Mingfei Sun
- Roberta Raileanu
- Wendelin Böhmer
Research talk: Project Dexter: Machine learning and automatic decision-making for robotic manipulation
- Andrey Kolobov
- Ching-An Cheng
Research talk: Breaking the deadly triad with a target network
- Shangtong Zhang
Panel: The future of reinforcement learning
- Geoff Gordon
- Emma Brunskill
- Craig Boutilier
Closing remarks: Reinforcement Learning
- John Langford