Course Objectives
- To introduce theory of Markov decision problems and reinforcement learning
- To introduce different reinforcement learning techniques
- To implement different reinforcement learning techniques to various problems
- To introduce performance evaluation of reinforcement learning techniques
Topics
- Reinforcement Learning Foundations
- Multi-Armed Bandit
- Markov Decision Processes
- Value Iteration – Policy Iteration
- Monte Carlo Methods
- Q-Learning, SARSA
- Eligibility Traces
- Exploration vs. Exploitation
- Function Approximation: Stochastic-gradient, Semi-gradient TD Update, Least-squares TD
- Value-based Deep RL: Q-network
- Policy-based Deep RL: REINFORCE