Policy Evaluation
Policy Evaluation (Prediction)
Computing the state-value function for a given policy . Also called the prediction problem.
Iterative update:
Converges to as . Used as a subroutine in Policy Iteration.
Policy Evaluation (Prediction)
Computing the state-value function for a given policy . Also called the prediction problem.
Iterative update:
Converges to as . Used as a subroutine in Policy Iteration.