FLAIR Blog

Research blog posts from the Foerster Lab for AI Research (FLAIR) at the University of Oxford

Latest publications and findings from our lab

Reinforcement Learning · March 2025

Fixing TD Part II: Overcoming the Deadly Triad

In Part I of this series, we characterised the stability of temporal-difference (TD) learning through the TD Jacobian. In this part, we build on that analysis to better understand the causes of instability, then propose a surprisingly simple architectural solution that can stabilise TD.
