Variance Reduction Techniques for Gradient Estimates in Reinforcement Learning
Peter L. Bartlett and Evan Greensmith and Jonathan Baxter
