Linear Function Approximation
Linear Function Approximation
The value function is approximated as a linear combination of features:
where is a feature vector and is a weight vector.
Gradient
The gradient is simply the feature vector:
This makes updates simple:
Convergence Guarantee
Why Linear Is Special
Semi-gradient TD(0) with linear FA converges to the TD Fixed Point:
This guarantee does not hold for non-linear (e.g., neural network) approximators.
Feature Construction
The power of linear FA depends entirely on the feature vector . See Feature Construction:
- Tile Coding: Binary features from overlapping tilings
- Polynomials:
- Fourier basis: Cosine functions at different frequencies
- Radial Basis Functions: Gaussian bumps centered at prototypes
- One-hot (tabular): Each state gets its own feature → recovers tabular case
Connections
- Special case of: Function Approximation
- Solved exactly by: LSTD
- Feature design: Feature Construction, Tile Coding
- Convergence: TD Fixed Point