anthropic.com › engineering Demystifying evals for AI agents …a roadmap to great evals for agents This section lays out our practical, field-tested advice for going from no evals to evals you can trust. Think of this as a roadmap… Jan 9, 2026