Talk page

Title:
A Greedy Gradient Q-learning Approach for Constructing Optimal Policies in Infinite Time Horizon Settings

Speaker:
Ashkan Ertefaie

Link:
http://www.birs.ca/events/2018/5-day-workshops/18w5054/videos/watch/201801181643-Ertefaie.html

Workshop:
Birs- 18w5054: Workshop on the Interface of Machine Learning and Statistical Inference