Reinforcement Learning
Reinforcement Learning. Instructor: Prof. Balaraman Ravindran, Department of Computer Science and Engineering, IIT Madras. Reinforcement learning is a paradigm that aims to model the trial-and-error learning process that is needed in many problem situations where explicit instructive signals are not available. It has roots in operations research, behavioral psychology and AI. The goal of the course is to introduce the basic mathematical foundations of reinforcement learning, as well as highlight some of the recent directions of research. (from nptel.ac.in)
Lecture 09 - Value Function based Methods
In this lecture we discuss one approach to solving immediate reinforcement learning problems. The approach maintains an estimate of each arm's expected return, and it is used extensively later in the course when we look at solutions to the full reinforcement learning problem.
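As a rough illustration of what maintaining per-arm estimates can look like, here is a minimal sketch in Python. It is not taken from the lecture: the function names, the Gaussian reward model, the epsilon-greedy selection rule, and all parameter values are assumptions chosen for the example. Each arm's estimate is kept as a running sample average of the rewards observed from that arm.

```python
import random


def update_estimate(estimate, count, reward):
    """Incremental sample average: new = old + (reward - old) / count.

    `count` is the number of rewards seen for this arm, including `reward`.
    """
    return estimate + (reward - estimate) / count


def run_epsilon_greedy(true_means, steps=5000, epsilon=0.1, seed=0):
    """Hypothetical bandit loop; the reward model is an assumption (unit-variance Gaussian)."""
    rng = random.Random(seed)
    n_arms = len(true_means)
    estimates = [0.0] * n_arms  # current estimate of each arm's expected return
    counts = [0] * n_arms       # number of times each arm has been pulled

    for _ in range(steps):
        if rng.random() < epsilon:
            # Explore: pick an arm uniformly at random.
            arm = rng.randrange(n_arms)
        else:
            # Exploit: pick the arm with the highest current estimate.
            arm = max(range(n_arms), key=lambda a: estimates[a])

        reward = rng.gauss(true_means[arm], 1.0)
        counts[arm] += 1
        estimates[arm] = update_estimate(estimates[arm], counts[arm], reward)

    return estimates, counts


if __name__ == "__main__":
    estimates, counts = run_epsilon_greedy([0.2, 0.5, 1.0])
    print("estimates:", estimates)
    print("pull counts:", counts)
```

The incremental update avoids storing past rewards: after the n-th pull of an arm, the estimate equals the average of the n rewards seen so far. Other selection rules could be substituted for epsilon-greedy without changing how the estimates are maintained.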