Reinforcement Learning in Finite MDPs: PAC Analysis
Alexander L. Strehl and Lihong Li and Michael L. Littman


Send Feedback