Reinforcement Learning in Finite MDPs: PAC Analysis
Alexander L. Strehl and Lihong Li and Michael L. Littman

No comments yet

Add a comment

10 day statistics (1 downloads)
Average Time 7 mins, 55 secs
Average Speed 0.57kB/s
Best Time 7 mins, 55 secs
Best Speed 0.57kB/s
Worst Time 7 mins, 55 secs
Worst Speed 0.57kB/s

Send Feedback