Near-optimal Regret Bounds for Reinforcement Learning
Peter Auer, Thomas Jaksch, and Ronald Ortner
