Paper page - LLMs Improving LLMs: Agentic Discovery for Test-Time Scaling
…the discovery environment must make the control space tractable and provide cheap, frequent feedback for TTS search. As a concrete instantiation, we formulate width--depth TTS as controller synthesis over pre-collected…