Variance Reduction Techniques for Gradient Estimates in Reinforcement Learning
Peter L. Bartlett, Evan Greensmith, and Jonathan Baxter
