Dr Jekyll and Mr Hyde

Towards a generalized policy iteration theorem

We intend to advance the theoretical understanding of actor-critic algorithms under the lens of policy iteration.

论文与出版物