Scale Labs
[PAPERS][BLOG][LEADERBOARDS][SHOWDOWN]
← All posts

Posts by Veronica Chatrath

Research05. 03 2026

VeRO: Can AI Agents Build Better AI Agents?

VeRO benchmarks whether coding agents can improve other AI agents by modifying their prompts, tools, and control logic. Across 105 optimization runs, results show modest gains on tool-use tasks but persistent limits in exploration, cross-model generalization, and deeper architectural changes.

Varun Ursekar, Apaar Shanker, Veronica Chatrath, Sam Denton

Scale Labs Newsletter

Research, benchmarks, and insights — delivered to your inbox.

Copyright 2026 Scale Inc. All rights reserved.

TermsPrivacy