Pinwheel graphic representing the Microsoft Research Summit

Return to Event: Microsoft Research Summit 2021

Microsoft Research Summit 2021 • Videos

Research talk: Towards efficient generalization in continual RL using episodic memory

October 20, 2021
Mandana Samiei | McGill University and Mila (Quebec AI Institute)
Microsoft Research Summit 2021 | Reinforcement Learning

Reinforcement learning (RL) is a powerful, brain-inspired framework to train agents for making sequential decisions in artificial intelligence. In this talk, the researchers consider two scenarios wherein RL can be challenging. The first is when non-stationarity plays an important role in the environment, and the second is when data and compute available to the agent are limited. We then discuss mitigation principles inspired by the brain’s capacity for episodic memory, that is, the subjective memory of specific previous events. However, the classical implementation of episodic memory in RL is computationally inefficient for storing and retrieving information. Besides that, simple episodic memories do not show good generalization to novel tasks. Despite the recent progress made by episodic memory in RL on the speed of learning, efficient generalization remains an open area for future explorations. The researchers propose that a more realistic view of episodic memory is one that incorporates predictive schemata into an external inference algorithm, which could theoretically help with generalization in RL.

Learn more about the 2021 Microsoft Research Summit: https://Aka.ms/researchsummit (opens in new tab)

- Mandana Samiei
  
  PhD Student
  
  McGill University and Mila (Quebec AI Institute)
Research Area
- Artificial intelligence
Research Lab
- Microsoft Research Lab – Montréal
Group
- Reinforcement Learning | Montréal
Event
- Microsoft Research Summit 2021

Reinforcement Learning

Opening remarks: Reinforcement Learning
October 20, 2021
Katja Hofmann
Keynote: Key research challenges for real world reinforcement learning
October 20, 2021
John Langford
Research talk: Reinforcement learning with preference feedback
October 20, 2021
Aadirupa Saha
Research talk: Safe reinforcement learning using advantage-based intervention
October 20, 2021
Nolan Wagener
Research talk: Evaluating human-like navigation in 3D video games
October 20, 2021
Raluca Georgescu,

Ida Momennejad
Research talk: Maia Chess: A human-like neural network chess engine
October 20, 2021
Reid McIlroy-Young
Fireside chat: Opportunities and challenges in human-oriented AI
October 20, 2021
Ashley Llorens,

Katja Hofmann,

Siddhartha Sen
Research talk: Making deep reinforcement learning industrially applicable
October 20, 2021
Jiang Bian,

Tie-Yan Liu
Panel: Generalization in reinforcement learning
October 20, 2021
Mingfei Sun,

Roberta Raileanu,

Wendelin Böhmer

, et. al.
Research talk: Project Dexter: Machine learning and automatic decision-making for robotic manipulation
October 20, 2021
Andrey Kolobov,

Ching-An Cheng
Research talk: Successor feature sets: Generalizing successor representations across policies
October 20, 2021
Kianté Brantley
Research talk: Towards efficient generalization in continual RL using episodic memory
October 20, 2021
Mandana Samiei
Research talk: Breaking the deadly triad with a target network
October 20, 2021
Shangtong Zhang
Panel: The future of reinforcement learning
October 20, 2021
Geoff Gordon,

Emma Brunskill,

Craig Boutilier

, et. al.
Closing remarks: Reinforcement Learning
October 20, 2021
John Langford