Neural Network Function Approximation
Neural Network Function Approximation
Using neural networks as non-linear function approximators for value functions or policies: where is a neural network with parameters .
Advantages over Linear FA
- Automatic feature learning: No manual Feature Construction needed
- Representational power: Can approximate any continuous function (universal approximation theorem)
- Handles raw inputs: Can process pixels, text, etc. directly
Challenges in RL
- No convergence guarantees for Semi-Gradient Methods with non-linear FA
- Deadly Triad becomes more dangerous — non-linear + bootstrapping + off-policy
- Non-stationarity: Target values change as policy improves
- Catastrophic forgetting: Updating for new states can degrade performance on old states
Stabilization techniques: Experience Replay, Target Network (as in Deep Q-Network (DQN))