Skip to content
Sign up
Development tools
Testing & evals

Letta Evals

Introduction to Letta's evaluation framework for testing and measuring agent performance.

Systematic testing for stateful AI agents. Validate changes, prevent regressions, and ship with confidence.

Test agent memory, tool usage, multi-turn conversations, and state evolution with automated grading and pass/fail gates.

Understand the building blocks of evaluations:

Choose how to score your agents: