Signal Processing Reading Group (Spring 2018)
Time and Venue
Reading Materials
Lectures (Tentative; following Prof. Silver's slides)
Introduction, Markov Decision Process (Ahmad Zoubi, April 11)
Planning by Dynamic Programming (Zeyu You, April 25)
Model-free prediction and control (Falah Alanazi, May 2)
Value function approximation (Sharmin Kibria, May 9)
Policy gradient (Trung Viet Vu, May 16)
Integrating Learning and Planning (Henrique Dantas, May 23)
Exploration and Exploitation (Leonardo Cavalcanti, May 30)
Case Study: RL in Classic Games (June 6)
TBD (KEC 1005, June 13)
More discussion
|