Reward design

Loading...
강의 계획서 보기

검토

4.2(366개의 평가)
  • 5 stars
    56.01%
  • 4 stars
    23.77%
  • 3 stars
    9.28%
  • 2 stars
    4.64%
  • 1 star
    6.28%
LJ

Oct 07, 2019

Challenging (unlike many other courses on Coursera, it does not baby you and does not seem to be targeting as high a pass rate as possible), but very very rewarding.

HH

Jan 29, 2020

Very practical lecture. I strongly recommend this lecture. Programming assignments are little difficult, but not impossible :) Just do it!

수업에서
At the heart of RL: Dynamic Programming
This week we'll consider the reinforcement learning formalisms in a more rigorous, mathematical way. You'll learn how to effectively compute the return your agent gets for a particular action - and how to pick best actions based on that return.

강사:

  • Pavel Shvechikov

    Pavel Shvechikov

    Researcher at HSE and Sberbank AI Lab
  • Alexander Panin

    Alexander Panin

    Lecturer

Coursera 카탈로그 살펴보기

무료로 참여해 맞춤화된 추천, 업데이트 및 제안을 받아보세요.