The Single-Policy Shortcut for Offline RL
In the wild world of learning from history, the recurring question is how to teach an agent to act well without real-time trial and error. Offline reinforcement learning studies exactly this: can a system become reliably capable by staring at a fixed stack of past experiences rather than roaming…