The Sample Complexity of Exploration in the Multi-Armed Bandit Problem (Special Topic on Learning Theory)
John N. Tsitsiklis and Shie Mannor

10 day statistics (1 downloads)
Average Time 30 mins, 01 secs
Average Speed 0.10kB/s
Best Time 30 mins, 01 secs
Best Speed 0.10kB/s
Worst Time 30 mins, 01 secs
Worst Speed 0.10kB/s

Send Feedback