Group reading of Sutton and Barto Oct 2020 - Feb 2021
Slides covering the following chapters:
- 1&2: Introduction and Multi-armed Bandits
- 3: Finite Markov Decision Processes
- 4: Dynamic Programming
- 5: Monte Carlo Methods
- 6: Temporal-Difference Learning
- 7: n-step Bootstrapping
- 8: Planning and Learning with Tabular Methods
- 9: On-policy Prediction with Approximation
- 10: On-policy Control with Approximation
- 11: Off-policy Methods with Approximation
- 12: Eligibility Traces
- 13: Policy Gradient Methods