By the end of this course, students should be able to do the following:
·
Learn how to define deep RL tasks and the core principals behind deep RL, including policies, value functions,deriving Bellman equations;
·
Implement in code common algorithms following code standards and libraries used in deep RL;
·
Understand and work with value-based methods and approximate solutions (deep Q network based algorithms);
·
Learn the policy gradient methods and Actor-Critic methods;
·
Explore imitation learning tasks and solutions;
·
Recognize current advanced techniques and applications in deep RL.