Theory and Methods for Reinforcement Learning
EE-618
Lecture 5
This page is part of the content downloaded from Lecture 5 on Wednesday, 25 December 2024, 15:49. Note that some content and any files larger than 50 MB are not downloaded.
Description
Policy gradient methods II: NPG, Sample Based NPG, TRPO, exploration in policy gradients