VideoArxiv
Home
Workshops
Speakers
Talk page
Title:
A Greedy Gradient Q-learning Approach for Constructing Optimal Policies in Infinite Time Horizon Settings
Speaker:
Ashkan Ertefaie
Link:
http://www.birs.ca/events/2018/5-day-workshops/18w5054/videos/watch/201801181643-Ertefaie.html
Workshop:
Birs- 18w5054: Workshop on the Interface of Machine Learning and Statistical Inference