Near-optimal Regret Bounds for Reinforcement Learning
Peter Auer and Thomas Jaksch and Ronald Ortner


Send Feedback