Policy Gradient Methods: Tutorial and New Frontiers
- John Schulman | UC Berkeley
- AI Summer School 2017
In this tutorial we discuss several recent advances in deep reinforcement learning involving policy gradient methods. These methods have shown significant success in a wide range of domains, including continuous-action domains such as manipulation, locomotion, and flight. They have also achieved the state of the art in discrete action domains such as Atari. We will provide a unifying overview of a variety of different policy gradient methods, and we will also discuss the formalism of stochastic computation graphs for computing gradients of expectations.
-
-
Scarlet Schwiderski-Grosche
Director
-
-
Series: Cambridge Lab PhD Summer School
-
The Malmo Collaborative AI Challenge
- Katja Hofmann
-
Counterfactual Multi-Agent Policy Gradients
- Shimon Whiteson
-
Design - On the Human Side
- Alex Taylor
-
Probabilistic Machine Learning and AI
- Zoubin Ghahramani
-
Policy Gradient Methods: Tutorial and New Frontiers
- John Schulman
-
Strategic Thinking for Researchers
- Andy Gordon
-
How to Write a Great Research Paper
- Simon Peyton Jones
-
Project Malmo – a platform for fundamental AI research
- Katja Hofmann
-
No Compromises: Distributed Transactions with Consistency, Availability, and Performance
- Aleksandar Dragojevic
-
The Evolution of Innovation
- Hermann Hauser
-
How to Give a Great Research Talk
- Simon Peyton Jones