Reward design

Reviews

4.1 (212 ratings)
  • 5 stars: 111 ratings
  • 4 stars: 49 ratings
  • 3 stars: 24 ratings
  • 2 stars: 10 ratings
  • 1 star: 18 ratings

VO

Mar 17, 2019

Well prepared and taught course. Will highly recommend as a primer for reinforcement learning.

AH

Aug 17, 2018

Learned a lot. The pace is quick and the assignment is challenging sometimes

From the lesson
At the heart of RL: Dynamic Programming
This week we'll consider the reinforcement learning formalisms in a more rigorous, mathematical way. You'll learn how to efficiently compute the return your agent gets for a particular action, and how to pick the best actions based on that return.

Instructors:

  • Pavel Shvechikov

    Researcher at HSE and Sberbank AI Lab
  • Alexander Panin

    Lecturer
