Supported Models

Overview

Letta routinely runs automated scans against available providers and models. These are the results of the latest scan.

Ran 2464 tests against 154 models across 7 providers on June 16th, 2025

anthropic

ModelBasicToken StreamingMultimodalContext WindowLast Scanned
claude-3-5-haiku-20241022200,0002025-06-16
claude-3-5-sonnet-20241022200,0002025-06-16
claude-3-7-sonnet-20250219200,0002025-06-16
claude-sonnet-4-20250514200,0002025-06-16
claude-opus-4-20250514⚠️200,0002025-06-16
claude-3-5-sonnet-20240620⚠️200,0002025-06-16
claude-3-haiku-20240307⚠️200,0002025-06-16
claude-3-opus-20240229⚠️200,0002025-06-16
claude-3-sonnet-20240229200,0002025-06-16

openai

ModelBasicToken StreamingMultimodalContext WindowLast Scanned
gpt-4.11,047,5762025-06-16
gpt-4.1-2025-04-141,047,5762025-06-16
gpt-4.1-nano-2025-04-141,047,5762025-06-16
gpt-4o128,0002025-06-16
gpt-4o-2024-05-13128,0002025-06-16
gpt-4-turbo⚠️8,1922025-06-16
gpt-4.1-mini⚠️1,047,5762025-06-16
gpt-4.5-preview⚠️128,0002025-06-16
gpt-4.5-preview-2025-02-27⚠️128,0002025-06-16
gpt-4o-2024-08-06⚠️128,0002025-06-16
gpt-4-06138,1922025-06-16
gpt-4-1106-preview128,0002025-06-16
gpt-4-turbo-2024-04-09⚠️128,0002025-06-16
gpt-4.1-mini-2025-04-14⚠️1,047,5762025-06-16
gpt-4.1-nano⚠️1,047,5762025-06-16
gpt-4o-2024-11-20⚠️8,1922025-06-16
gpt-4-turbo-preview⚠️128,0002025-06-16
gpt-4-0125-preview⚠️128,0002025-06-16
gpt-4o-mini⚠️⚠️⚠️128,0002025-06-16
gpt-4o-mini-2024-07-18⚠️⚠️128,0002025-06-16
gpt-4⚠️8,1922025-06-16
o1⚠️8,1922025-06-16
o1-2024-12-17⚠️8,1922025-06-16
o3⚠️8,1922025-06-16
o3-2025-04-16⚠️8,1922025-06-16
o3-mini⚠️8,1922025-06-16
o3-mini-2025-01-31⚠️8,1922025-06-16
o3-pro8,1922025-06-16
o3-pro-2025-06-108,1922025-06-16

google_ai

ModelBasicToken StreamingMultimodalContext WindowLast Scanned
gemini-1.5-pro2,000,0002025-06-16
gemini-1.5-pro-0022,000,0002025-06-16
gemini-1.5-pro-latest2,000,0002025-06-16
gemini-2.5-flash-preview-04-17-thinking1,048,5762025-06-16
gemini-2.5-pro-preview-03-251,048,5762025-06-16
gemini-2.5-pro-preview-05-061,048,5762025-06-16
gemini-2.5-flash-preview-05-20⚠️1,048,5762025-06-16
gemini-2.0-flash-thinking-exp⚠️1,048,5762025-06-16
gemini-2.0-flash-thinking-exp-1219⚠️1,048,5762025-06-16
gemini-2.0-flash-thinking-exp-01-21⚠️⚠️1,048,5762025-06-16
gemini-2.5-flash-preview-04-17⚠️⚠️1,048,5762025-06-16
gemini-2.5-pro-preview-06-05⚠️⚠️1,048,5762025-06-16
gemini-1.0-pro-vision-latest12,2882025-06-16
gemini-1.5-flash1,000,0002025-06-16
gemini-1.5-flash-0021,000,0002025-06-16
gemini-1.5-flash-8b1,000,0002025-06-16
gemini-1.5-flash-8b-0011,000,0002025-06-16
gemini-1.5-flash-8b-latest1,000,0002025-06-16
gemini-1.5-flash-latest1,000,0002025-06-16
gemini-2.0-flash1,048,5762025-06-16
gemini-2.0-flash-0011,048,5762025-06-16
gemini-2.0-flash-exp1,048,5762025-06-16
gemini-2.0-flash-exp-image-generation1,048,5762025-06-16
gemini-2.0-flash-lite1,048,5762025-06-16
gemini-2.0-flash-lite-0011,048,5762025-06-16
gemini-2.0-flash-lite-preview1,048,5762025-06-16
gemini-2.0-flash-lite-preview-02-051,048,5762025-06-16
gemini-2.0-flash-preview-image-generation32,7682025-06-16
gemini-2.0-pro-exp1,048,5762025-06-16
gemini-2.0-pro-exp-02-051,048,5762025-06-16
gemini-2.5-flash-preview-tts32,7682025-06-16
gemini-2.5-pro-exp-03-251,048,5762025-06-16
gemini-2.5-pro-preview-tts65,5362025-06-16
gemini-exp-12061,048,5762025-06-16
gemini-pro-vision12,2882025-06-16

letta

ModelBasicToken StreamingMultimodalContext WindowLast Scanned
letta-free⚠️8,1922025-06-16

together

ModelBasicToken StreamingMultimodalContext WindowLast Scanned
Qwen/Qwen2.5-72B-Instruct-Turbo⚠️131,0722025-06-16
arcee-ai/virtuoso-large⚠️131,0722025-06-16
Qwen/QwQ-32B⚠️⚠️131,0722025-06-16
Qwen/Qwen2.5-7B-Instruct-Turbo⚠️⚠️32,7682025-06-16
Qwen/Qwen2.5-Coder-32B-Instruct⚠️⚠️16,3842025-06-16
arcee-ai/coder-large⚠️⚠️32,7682025-06-16
arcee_ai/arcee-spotlight⚠️⚠️131,0722025-06-16
meta-llama/Llama-3.2-3B-Instruct-Turbo⚠️131,0722025-06-16
meta-llama/Llama-3.3-70B-Instruct-Turbo⚠️131,0722025-06-16
meta-llama/Llama-3.3-70B-Instruct-Turbo-Free⚠️131,0722025-06-16
meta-llama/Meta-Llama-3.1-405B-Instruct-Turbo⚠️130,8152025-06-16
meta-llama/Meta-Llama-3.1-70B-Instruct-Turbo⚠️131,0722025-06-16
nvidia/Llama-3.1-Nemotron-70B-Instruct-HF⚠️32,7682025-06-16
arcee-ai/virtuoso-medium-v2⚠️⚠️131,0722025-06-16
meta-llama/Llama-4-Maverick-17B-128E-Instruct-FP8⚠️1,048,5762025-06-16
Qwen/Qwen3-235B-A22B-fp8-tput⚠️⚠️40,9602025-06-16
deepseek-ai/DeepSeek-V3⚠️⚠️131,0722025-06-16
meta-llama/Llama-4-Scout-17B-16E-Instruct⚠️⚠️1,048,5762025-06-16
meta-llama/Meta-Llama-3.1-8B-Instruct-Turbo⚠️⚠️131,0722025-06-16
mistralai/Mixtral-8x7B-Instruct-v0.1⚠️⚠️32,7682025-06-16
arcee-ai/caller⚠️32,7682025-06-16
mistralai/Mistral-Small-24B-Instruct-2501⚠️32,7682025-06-16
NousResearch/Nous-Hermes-2-Mixtral-8x7B-DPO32,7682025-06-16
Qwen/Qwen2-72B-Instruct32,7682025-06-16
Qwen/Qwen2-VL-72B-Instruct32,7682025-06-16
Qwen/Qwen2.5-VL-72B-Instruct32,7682025-06-16
arcee-ai/arcee-blitz32,7682025-06-16
arcee-ai/maestro-reasoning131,0722025-06-16
deepseek-ai/DeepSeek-R1163,8402025-06-16
deepseek-ai/DeepSeek-R1-Distill-Llama-70B131,0722025-06-16
deepseek-ai/DeepSeek-R1-Distill-Llama-70B-free8,1922025-06-16
deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B131,0722025-06-16
deepseek-ai/DeepSeek-R1-Distill-Qwen-14B131,0722025-06-16
deepseek-ai/DeepSeek-V3-p-dp131,0722025-06-16
google/gemma-2-27b-it8,1922025-06-16
lgai/exaone-3-5-32b-instruct32,7682025-06-16
lgai/exaone-deep-32b32,7682025-06-16
marin-community/marin-8b-instruct131,0722025-06-16
meta-llama/Llama-3-70b-chat-hf8,1922025-06-16
meta-llama/Llama-3-8b-chat-hf8,1922025-06-16
meta-llama/Llama-3.2-11B-Vision-Instruct-Turbo131,0722025-06-16
meta-llama/Llama-3.2-90B-Vision-Instruct-Turbo131,0722025-06-16
meta-llama/Llama-Vision-Free131,0722025-06-16
meta-llama/Meta-Llama-3-70B-Instruct-Turbo8,1922025-06-16
meta-llama/Meta-Llama-3-8B-Instruct-Lite8,1922025-06-16
mistralai/Mistral-7B-Instruct-v0.132,7682025-06-16
mistralai/Mistral-7B-Instruct-v0.232,7682025-06-16
mistralai/Mistral-7B-Instruct-v0.332,7682025-06-16
perplexity-ai/r1-1776163,8402025-06-16
scb10x/scb10x-llama3-1-typhoon2-70b-instruct8,1922025-06-16
scb10x/scb10x-typhoon-2-1-gemma3-12b8,1922025-06-16
togethercomputer/MoA-132,7682025-06-16
togethercomputer/MoA-1-Turbo32,7682025-06-16
togethercomputer/Refuel-Llm-V216,3842025-06-16
togethercomputer/Refuel-Llm-V2-Small8,1922025-06-16

deepseek

ModelBasicToken StreamingMultimodalContext WindowLast Scanned
deepseek-chat64,0002025-06-16
deepseek-reasoner64,0002025-06-16

groq

ModelBasicToken StreamingMultimodalContext WindowLast Scanned
allam-2-7b8,1922025-06-16
compound-beta8,1922025-06-16
compound-beta-mini8,1922025-06-16
deepseek-r1-distill-llama-70b8,1922025-06-16
distil-whisper-large-v3-en8,1922025-06-16
gemma2-9b-it8,1922025-06-16
llama-3.1-8b-instant8,1922025-06-16
llama-3.3-70b-versatile8,1922025-06-16
llama-guard-3-8b8,1922025-06-16
llama3-70b-81928,1922025-06-16
llama3-8b-81928,1922025-06-16
meta-llama/llama-4-maverick-17b-128e-instruct8,1922025-06-16
meta-llama/llama-4-scout-17b-16e-instruct8,1922025-06-16
meta-llama/llama-guard-4-12b8,1922025-06-16
meta-llama/llama-prompt-guard-2-22m8,1922025-06-16
meta-llama/llama-prompt-guard-2-86m8,1922025-06-16
mistral-saba-24b8,1922025-06-16
playai-tts8,1922025-06-16
playai-tts-arabic8,1922025-06-16
qwen-qwq-32b8,1922025-06-16
qwen/qwen3-32b8,1922025-06-16
whisper-large-v38,1922025-06-16
whisper-large-v3-turbo8,1922025-06-16