Adaptive Guidance Accelerates Reinforcement Learning of Reasoning Models - Scale Labs | Scale Labs