Ashkan Ertefaie

  1. Title:
    A Greedy Gradient Q-learning Approach for Constructing Optimal Policies in Infinite Time Horizon Settings

    Speaker:

    Link: