VideoArxiv

Talk page

Title:

A Greedy Gradient Q-learning Approach for Constructing Optimal Policies in Infinite Time Horizon Settings

Speaker:

Ashkan Ertefaie

Link:

http://www.birs.ca/events/2018/5-day-workshops/18w5054/videos/watch/201801181643-Ertefaie.html

Workshop:

Birs- 18w5054: Workshop on the Interface of Machine Learning and Statistical Inference