Representation and General Value Functions: General Value Functions (GVFs)
阿新 • Published: 2022-05-12
Original source: https://sites.ualberta.ca/~pilarski/docs/theses/Sherstan_Craig_D_202009_PhD.pd
General value functions (GVFs) relax the value function definition we have already considered in two ways (Sutton, Modayil, et al., 2011). First, we are free to choose any signal available to the agent as the prediction target, not just reward.
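As a rough illustration of the first relaxation, the sketch below learns a prediction about a non-reward signal with tabular TD(0). Everything in it is an assumption made for illustration and does not come from the source: the five-position corridor, the `step` dynamics, the "bump" signal, the `GVF` dataclass, and the values of `gamma` and `alpha`. It is also on-policy, meaning the robot behaves according to the same policy the prediction is about.

```python
from dataclasses import dataclass
from typing import Callable, List, Tuple

# Hypothetical toy world: a corridor with positions 0..4 and a wall just past 4.
N_STATES = 5
WALL = N_STATES  # pseudo-state reached when the robot bumps into the wall


@dataclass
class GVF:
    """Illustrative container for a predictive question (names are assumptions)."""
    policy: Callable[[int], int]                 # state -> action
    gamma: float                                 # discount, sets the timescale
    cumulant: Callable[[int, int, int], float]   # (s, a, s_next) -> target signal


def step(state: int, action: int) -> Tuple[int, bool]:
    """Assumed dynamics: move by `action`; reaching the wall ends the episode."""
    next_state = state + action
    return next_state, next_state >= WALL


def learn_gvf(gvf: GVF, episodes: int = 2000, alpha: float = 0.1) -> List[float]:
    """Tabular TD(0), with the cumulant taking the place of reward."""
    v = [0.0] * N_STATES
    for _ in range(episodes):
        s, done = 0, False
        while not done:
            a = gvf.policy(s)
            s_next, done = step(s, a)
            c = gvf.cumulant(s, a, s_next)
            # No bootstrapping from the terminal pseudo-state.
            bootstrap = 0.0 if done else gvf.gamma * v[s_next]
            v[s] += alpha * (c + bootstrap - v[s])
            s = s_next
    return v


# Question: "if I keep driving forward, how soon (discounted) will I bump?"
bump_gvf = GVF(
    policy=lambda s: 1,                                   # always drive forward
    gamma=0.9,
    cumulant=lambda s, a, s_next: 1.0 if s_next >= WALL else 0.0,
)
values = learn_gvf(bump_gvf)
print([round(x, 3) for x in values])
```

Because this toy world is deterministic, the learned prediction for position `s` should approach `0.9 ** (4 - s)`: positions close to the wall predict a bump soon, and distant ones predict a heavily discounted bump. Swapping in a different `cumulant` or `policy` asks a different question of the same learner.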
Like a value function, a GVF is defined by three components: the policy, the timescale, and the prediction target. GVFs allow the agent to express representation elements in the form of predictive questions. Consider the following examples for a mobile robot: