The Sample Complexity of Exploration in the Multi-Armed Bandit Problem (Special Topic on Learning Theory)
John N. Tsitsiklis and Shie Mannor

No comments yet

Add a comment

Report