Research talk: Towards efficient generalization in continual RL using episodic memory
- Mandana Samiei | McGill University and Mila (Quebec AI Institute)
- Microsoft Research Summit 2021 | Reinforcement Learning
Reinforcement learning (RL) is a powerful, brain-inspired framework in artificial intelligence for training agents to make sequential decisions. In this talk, the researchers consider two scenarios in which RL can be challenging. The first is when non-stationarity plays an important role in the environment, and the second is when the data and compute available to the agent are limited. They then discuss mitigation principles inspired by the brain’s capacity for episodic memory, that is, the subjective memory of specific previous events. However, the classical implementation of episodic memory in RL is computationally inefficient at storing and retrieving information, and simple episodic memories do not generalize well to novel tasks. Although episodic memory has recently improved the speed of learning in RL, efficient generalization remains an open area for future exploration. The researchers propose that a more realistic view of episodic memory is one that incorporates predictive schemata into an external inference algorithm, which could theoretically help with generalization in RL.
Learn more about the 2021 Microsoft Research Summit: https://Aka.ms/researchsummit
Mandana Samiei
PhD Student
McGill University and Mila (Quebec AI Institute)
Reinforcement Learning
Opening remarks: Reinforcement Learning
- Katja Hofmann
Research talk: Evaluating human-like navigation in 3D video games
- Raluca Georgescu
- Ida Momennejad
Research talk: Maia Chess: A human-like neural network chess engine
- Reid McIlroy-Young
Fireside chat: Opportunities and challenges in human-oriented AI
- Ashley Llorens
- Katja Hofmann
- Siddhartha Sen
Research talk: Making deep reinforcement learning industrially applicable
- Jiang Bian
- Tie-Yan Liu
Panel: Generalization in reinforcement learning
- Mingfei Sun
- Roberta Raileanu
- Wendelin Böhmer
Research talk: Project Dexter: Machine learning and automatic decision-making for robotic manipulation
- Andrey Kolobov
- Ching-An Cheng
Research talk: Breaking the deadly triad with a target network
- Shangtong Zhang
Panel: The future of reinforcement learning
- Geoff Gordon
- Emma Brunskill
- Craig Boutilier
Closing remarks: Reinforcement Learning
- John Langford