Scale Labs

Posts by Niklas Lauffer

Research · 12. 01 2026

Training Robust Multi-Turn LM Agents with On-Policy Expert Corrections

In our recent work, "Imitation Learning for Multi-Turn LM Agents via On-Policy Expert Corrections," we expose the problem of covariate shift in software engineering (SWE) LM agents and propose a simple, practical fix that significantly improves training efficiency and agent robustness.

Niklas Lauffer

Copyright 2026 Scale Inc. All rights reserved.
