Policy Gradient Methods: Tutorial and New Frontiers

July 3, 2017
John Schulman | UC Berkeley
AI Summer School 2017

In this tutorial, we discuss several recent advances in deep reinforcement learning involving policy gradient methods. These methods have shown significant success in a wide range of domains, including continuous-action domains such as manipulation, locomotion, and flight. They have also achieved state-of-the-art results in discrete-action domains such as Atari. We will provide a unifying overview of a variety of policy gradient methods and discuss the formalism of stochastic computation graphs for computing gradients of expectations.

- Scarlet Schwiderski-Grosche, Director
Research Lab
- Microsoft Research Lab - Cambridge
Event
- AI Summer School 2017

Series: Cambridge Lab PhD Summer School

- The Malmo Collaborative AI Challenge (Katja Hofmann, July 6, 2017)
- Counterfactual Multi-Agent Policy Gradients (Shimon Whiteson, July 6, 2017)
- Design - On the Human Side (Alex Taylor, July 5, 2017)
- Probabilistic Machine Learning and AI (Zoubin Ghahramani, July 5, 2017)
- Policy Gradient Methods: Tutorial and New Frontiers (John Schulman, July 3, 2017)
- Strategic Thinking for Researchers (Andy Gordon, August 1, 2016)
- How to Write a Great Research Paper (Simon Peyton Jones, July 8, 2016)
- Project Malmo – a platform for fundamental AI research (Katja Hofmann, July 7, 2016)
- No Compromises: Distributed Transactions with Consistency, Availability, and Performance (Aleksandar Dragojevic, July 5, 2016)
- The Evolution of Innovation (Hermann Hauser, July 5, 2016)
- How to Give a Great Research Talk (Simon Peyton Jones, July 5, 2016)