Prediction Task and Control Task • Prediction Task – With policy given, we try to predict the value function or Q function using the policy. – Why? because we want to evaluate the policy. – What is a good policy? One that gets a good return for the agent. – How can we get the return? From the Q function – Thus, by predicting a Q function, we predict the (expected) return, and that will evaluate ..