Action Elimination and Stopping Conditions for the Multi-Armed Bandit and Reinforcement Learning Problems
Shie Mannor and Eyal Even-Dar and Yishay Mansour


Send Feedback