Notes on All Things Machine Learning and Mathematics

Advanced policy gradient methods & TRPO