Supported Models

Overview

Letta routinely runs automated scans against available providers and models. These are the results of the latest scan.

The latest scan ran 2,512 tests against 157 models across 7 providers on June 27, 2025.

anthropic

| Model | Basic | Token Streaming | Multimodal | Context Window | Last Scanned |
| --- | --- | --- | --- | --- | --- |
| claude-3-5-haiku-20241022 | | | | 200,000 | 2025-06-27 |
| claude-3-5-sonnet-20240620 | | | | 200,000 | 2025-06-27 |
| claude-3-5-sonnet-20241022 | | | | 200,000 | 2025-06-27 |
| claude-3-7-sonnet-20250219 | | | | 200,000 | 2025-06-27 |
| claude-opus-4-20250514 | | | | 200,000 | 2025-06-27 |
| claude-sonnet-4-20250514 | | | | 200,000 | 2025-06-27 |
| claude-3-opus-20240229 | | | | 200,000 | 2025-06-27 |
| claude-3-haiku-20240307 | | | | 200,000 | 2025-06-27 |
| claude-3-sonnet-20240229 | | | | 200,000 | 2025-06-27 |

openai

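| Model | Basic | Token Streaming | Multimodal | Context Window | Last Scanned |
| --- | --- | --- | --- | --- | --- |
| gpt-4-turbo | | | | 128,000 | 2025-06-27 |
| gpt-4-turbo-2024-04-09 | | | | 128,000 | 2025-06-27 |
| gpt-4.1 | | | | 1,047,576 | 2025-06-27 |
| gpt-4.1-2025-04-14 | | | | 1,047,576 | 2025-06-27 |
| gpt-4.1-mini | | | | 1,047,576 | 2025-06-27 |
| gpt-4.1-mini-2025-04-14 | | | | 1,047,576 | 2025-06-27 |
| gpt-4.1-nano | | | | 1,047,576 | 2025-06-27 |
| gpt-4.1-nano-2025-04-14 | | | | 1,047,576 | 2025-06-27 |
| gpt-4o | | | | 128,000 | 2025-06-27 |
| gpt-4o-2024-05-13 | | | | 128,000 | 2025-06-27 |
| gpt-4o-2024-08-06 | | | | 128,000 | 2025-06-27 |
| gpt-4o-2024-11-20 | | | | 128,000 | 2025-06-27 |
| gpt-4o-mini | | | | 128,000 | 2025-06-27 |
| gpt-4o-mini-2024-07-18 | | | | 128,000 | 2025-06-27 |
| gpt-4-0613 | | | | 8,192 | 2025-06-27 |
| gpt-4-1106-preview | | | | 128,000 | 2025-06-27 |
| gpt-4-turbo-preview | | | | 128,000 | 2025-06-27 |
| gpt-4-0125-preview | | | | 128,000 | 2025-06-27 |
| o1 | | | | 200,000 | 2025-06-27 |
| o1-2024-12-17 | | | | 200,000 | 2025-06-27 |
| o3 | | | | 200,000 | 2025-06-27 |
| o3-2025-04-16 | | | | 200,000 | 2025-06-27 |
| o4-mini | | | | 30,000 | 2025-06-27 |
| o4-mini-2025-04-16 | | | | 30,000 | 2025-06-27 |
| gpt-4 | | | | 8,192 | 2025-06-27 |
| o3-mini | | | | 200,000 | 2025-06-27 |
| o3-mini-2025-01-31 | | | | 200,000 | 2025-06-27 |
| o3-pro | | | | 30,000 | 2025-06-27 |
| o3-pro-2025-06-10 | | | | 30,000 | 2025-06-27 |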

google_ai

| Model | Basic | Token Streaming | Multimodal | Context Window | Last Scanned |
| --- | --- | --- | --- | --- | --- |
| gemini-1.5-pro | | | | 2,000,000 | 2025-06-27 |
| gemini-1.5-pro-002 | | | | 2,000,000 | 2025-06-27 |
| gemini-1.5-pro-latest | | | | 2,000,000 | 2025-06-27 |
| gemini-2.0-flash-thinking-exp | | | | 1,048,576 | 2025-06-27 |
| gemini-2.5-flash-preview-04-17 | | | | 1,048,576 | 2025-06-27 |
| gemini-2.5-pro | | | | 1,048,576 | 2025-06-27 |
| gemini-2.5-pro-preview-03-25 | | | | 1,048,576 | 2025-06-27 |
| gemini-2.5-pro-preview-05-06 | | | | 1,048,576 | 2025-06-27 |
| gemini-2.5-flash | | | | 1,048,576 | 2025-06-27 |
| gemini-2.0-flash-thinking-exp-1219 | | | | 1,048,576 | 2025-06-27 |
| gemini-2.5-flash-preview-04-17-thinking | | | | 1,048,576 | 2025-06-27 |
| gemini-2.5-flash-preview-05-20 | | | | 1,048,576 | 2025-06-27 |
| gemini-2.5-pro-preview-06-05 | | | | 1,048,576 | 2025-06-27 |
| gemini-2.0-flash-thinking-exp-01-21 | | | | 1,048,576 | 2025-06-27 |
| gemini-2.5-flash-lite-preview-06-17 | | | | 1,048,576 | 2025-06-27 |
| gemini-1.0-pro-vision-latest | | | | 12,288 | 2025-06-27 |
| gemini-1.5-flash | | | | 1,000,000 | 2025-06-27 |
| gemini-1.5-flash-002 | | | | 1,000,000 | 2025-06-27 |
| gemini-1.5-flash-8b | | | | 1,000,000 | 2025-06-27 |
| gemini-1.5-flash-8b-001 | | | | 1,000,000 | 2025-06-27 |
| gemini-1.5-flash-8b-latest | | | | 1,000,000 | 2025-06-27 |
| gemini-1.5-flash-latest | | | | 1,000,000 | 2025-06-27 |
| gemini-2.0-flash | | | | 1,048,576 | 2025-06-27 |
| gemini-2.0-flash-001 | | | | 1,048,576 | 2025-06-27 |
| gemini-2.0-flash-exp | | | | 1,048,576 | 2025-06-27 |
| gemini-2.0-flash-exp-image-generation | | | | 1,048,576 | 2025-06-27 |
| gemini-2.0-flash-lite | | | | 1,048,576 | 2025-06-27 |
| gemini-2.0-flash-lite-001 | | | | 1,048,576 | 2025-06-27 |
| gemini-2.0-flash-lite-preview | | | | 1,048,576 | 2025-06-27 |
| gemini-2.0-flash-lite-preview-02-05 | | | | 1,048,576 | 2025-06-27 |
| gemini-2.0-flash-preview-image-generation | | | | 32,768 | 2025-06-27 |
| gemini-2.0-pro-exp | | | | 1,048,576 | 2025-06-27 |
| gemini-2.0-pro-exp-02-05 | | | | 1,048,576 | 2025-06-27 |
| gemini-2.5-flash-preview-tts | | | | 32,768 | 2025-06-27 |
| gemini-2.5-pro-preview-tts | | | | 65,536 | 2025-06-27 |
| gemini-exp-1206 | | | | 1,048,576 | 2025-06-27 |
| gemini-pro-vision | | | | 12,288 | 2025-06-27 |

together

| Model | Basic | Token Streaming | Multimodal | Context Window | Last Scanned |
| --- | --- | --- | --- | --- | --- |
| arcee-ai/coder-large | | | | 32,768 | 2025-06-27 |
| meta-llama/Llama-4-Maverick-17B-128E-Instruct-FP8 | | | | 1,048,576 | 2025-06-27 |
| Qwen/Qwen2.5-Coder-32B-Instruct | | | | 32,768 | 2025-06-27 |
| meta-llama/Llama-3.3-70B-Instruct-Turbo | | | | 131,072 | 2025-06-27 |
| meta-llama/Llama-3.3-70B-Instruct-Turbo-Free | | | | 131,072 | 2025-06-27 |
| meta-llama/Meta-Llama-3.1-405B-Instruct-Turbo | | | | 130,815 | 2025-06-27 |
| meta-llama/Meta-Llama-3.1-8B-Instruct-Turbo | | | | 131,072 | 2025-06-27 |
| deepseek-ai/DeepSeek-V3 | | | | 131,072 | 2025-06-27 |
| meta-llama/Meta-Llama-3.1-70B-Instruct-Turbo | | | | 131,072 | 2025-06-27 |
| Qwen/Qwen2.5-72B-Instruct-Turbo | | | | 131,072 | 2025-06-27 |
| arcee-ai/virtuoso-large | | | | 131,072 | 2025-06-27 |
| arcee-ai/virtuoso-medium-v2 | | | | 131,072 | 2025-06-27 |
| meta-llama/Llama-4-Scout-17B-16E-Instruct | | | | 1,048,576 | 2025-06-27 |
| Qwen/Qwen3-235B-A22B-fp8-tput | | | | 40,960 | 2025-06-27 |
| nvidia/Llama-3.1-Nemotron-70B-Instruct-HF | | | | 32,768 | 2025-06-27 |
| scb10x/scb10x-llama3-1-typhoon2-70b-instruct | | | | 8,192 | 2025-06-27 |
| NousResearch/Nous-Hermes-2-Mixtral-8x7B-DPO | | | | 32,768 | 2025-06-27 |
| Qwen/QwQ-32B | | | | 131,072 | 2025-06-27 |
| google/gemma-3n-E4B-it | | | | 32,768 | 2025-06-27 |
| mistralai/Mistral-7B-Instruct-v0.2 | | | | 32,768 | 2025-06-27 |
| perplexity-ai/r1-1776 | | | | 163,840 | 2025-06-27 |
| Qwen/Qwen2-72B-Instruct | | | | 32,768 | 2025-06-27 |
| Qwen/Qwen2-VL-72B-Instruct | | | | 32,768 | 2025-06-27 |
| Qwen/Qwen2.5-7B-Instruct-Turbo | | | | 32,768 | 2025-06-27 |
| Qwen/Qwen2.5-VL-72B-Instruct | | | | 32,768 | 2025-06-27 |
| arcee-ai/AFM-4.5B-Preview | | | | 65,536 | 2025-06-27 |
| arcee-ai/arcee-blitz | | | | 32,768 | 2025-06-27 |
| arcee-ai/caller | | | | 32,768 | 2025-06-27 |
| arcee-ai/maestro-reasoning | | | | 131,072 | 2025-06-27 |
| arcee_ai/arcee-spotlight | | | | 131,072 | 2025-06-27 |
| deepseek-ai/DeepSeek-R1 | | | | 163,840 | 2025-06-27 |
| deepseek-ai/DeepSeek-R1-0528-tput | | | | 163,840 | 2025-06-27 |
| deepseek-ai/DeepSeek-R1-Distill-Llama-70B | | | | 131,072 | 2025-06-27 |
| deepseek-ai/DeepSeek-R1-Distill-Llama-70B-free | | | | 8,192 | 2025-06-27 |
| deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B | | | | 131,072 | 2025-06-27 |
| deepseek-ai/DeepSeek-R1-Distill-Qwen-14B | | | | 131,072 | 2025-06-27 |
| deepseek-ai/DeepSeek-V3-p-dp | | | | 131,072 | 2025-06-27 |
| google/gemma-2-27b-it | | | | 8,192 | 2025-06-27 |
| lgai/exaone-3-5-32b-instruct | | | | 32,768 | 2025-06-27 |
| lgai/exaone-deep-32b | | | | 32,768 | 2025-06-27 |
| meta-llama/Llama-3-70b-chat-hf | | | | 8,192 | 2025-06-27 |
| meta-llama/Llama-3-8b-chat-hf | | | | 8,192 | 2025-06-27 |
| meta-llama/Llama-3.2-11B-Vision-Instruct-Turbo | | | | 131,072 | 2025-06-27 |
| meta-llama/Llama-3.2-3B-Instruct-Turbo | | | | 131,072 | 2025-06-27 |
| meta-llama/Llama-3.2-90B-Vision-Instruct-Turbo | | | | 131,072 | 2025-06-27 |
| meta-llama/Llama-Vision-Free | | | | 131,072 | 2025-06-27 |
| meta-llama/Meta-Llama-3-70B-Instruct-Turbo | | | | 8,192 | 2025-06-27 |
| meta-llama/Meta-Llama-3-8B-Instruct-Lite | | | | 8,192 | 2025-06-27 |
| mistralai/Mistral-7B-Instruct-v0.1 | | | | 32,768 | 2025-06-27 |
| mistralai/Mistral-7B-Instruct-v0.3 | | | | 32,768 | 2025-06-27 |
| mistralai/Mistral-Small-24B-Instruct-2501 | | | | 32,768 | 2025-06-27 |
| mistralai/Mixtral-8x7B-Instruct-v0.1 | | | | 32,768 | 2025-06-27 |
| scb10x/scb10x-typhoon-2-1-gemma3-12b | | | | 131,072 | 2025-06-27 |
| togethercomputer/MoA-1 | | | | 32,768 | 2025-06-27 |
| togethercomputer/MoA-1-Turbo | | | | 32,768 | 2025-06-27 |
| togethercomputer/Refuel-Llm-V2 | | | | 16,384 | 2025-06-27 |
| togethercomputer/Refuel-Llm-V2-Small | | | | 8,192 | 2025-06-27 |

deepseek

| Model | Basic | Token Streaming | Multimodal | Context Window | Last Scanned |
| --- | --- | --- | --- | --- | --- |
| deepseek-chat | | | | 64,000 | 2025-06-27 |
| deepseek-reasoner | | | | 64,000 | 2025-06-27 |

groq

| Model | Basic | Token Streaming | Multimodal | Context Window | Last Scanned |
| --- | --- | --- | --- | --- | --- |
| allam-2-7b | | | | 30,000 | 2025-06-27 |
| compound-beta | | | | 30,000 | 2025-06-27 |
| compound-beta-mini | | | | 30,000 | 2025-06-27 |
| deepseek-r1-distill-llama-70b | | | | 30,000 | 2025-06-27 |
| distil-whisper-large-v3-en | | | | 30,000 | 2025-06-27 |
| gemma2-9b-it | | | | 30,000 | 2025-06-27 |
| llama-3.1-8b-instant | | | | 30,000 | 2025-06-27 |
| llama-3.3-70b-versatile | | | | 30,000 | 2025-06-27 |
| llama3-70b-8192 | | | | 30,000 | 2025-06-27 |
| llama3-8b-8192 | | | | 30,000 | 2025-06-27 |
| meta-llama/llama-4-maverick-17b-128e-instruct | | | | 30,000 | 2025-06-27 |
| meta-llama/llama-4-scout-17b-16e-instruct | | | | 30,000 | 2025-06-27 |
| meta-llama/llama-guard-4-12b | | | | 30,000 | 2025-06-27 |
| meta-llama/llama-prompt-guard-2-22m | | | | 30,000 | 2025-06-27 |
| meta-llama/llama-prompt-guard-2-86m | | | | 30,000 | 2025-06-27 |
| mistral-saba-24b | | | | 30,000 | 2025-06-27 |
| playai-tts | | | | 30,000 | 2025-06-27 |
| playai-tts-arabic | | | | 30,000 | 2025-06-27 |
| qwen-qwq-32b | | | | 30,000 | 2025-06-27 |
| qwen/qwen3-32b | | | | 30,000 | 2025-06-27 |
| whisper-large-v3 | | | | 30,000 | 2025-06-27 |
| whisper-large-v3-turbo | | | | 30,000 | 2025-06-27 |

letta

| Model | Basic | Token Streaming | Multimodal | Context Window | Last Scanned |
| --- | --- | --- | --- | --- | --- |
| letta-free | | | | 8,192 | 2025-06-27 |
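
Context window is often the deciding constraint when picking a model from these tables. The following is a minimal sketch of filtering scan results by a minimum context window. It assumes a handful of rows have been copied by hand into a Python list; `ScanRow`, `SCAN_ROWS`, and `models_with_context` are hypothetical names used for illustration and are not part of Letta.

```python
from typing import NamedTuple


class ScanRow(NamedTuple):
    """One row of the scan tables (hypothetical structure, for illustration)."""
    provider: str
    model: str
    context_window: int  # in tokens


# A few rows copied by hand from the 2025-06-27 scan tables.
SCAN_ROWS = [
    ScanRow("anthropic", "claude-sonnet-4-20250514", 200_000),
    ScanRow("openai", "gpt-4.1", 1_047_576),
    ScanRow("openai", "gpt-4o-mini", 128_000),
    ScanRow("google_ai", "gemini-2.5-pro", 1_048_576),
    ScanRow("deepseek", "deepseek-chat", 64_000),
    ScanRow("letta", "letta-free", 8_192),
]


def models_with_context(rows, min_tokens):
    """Return rows whose context window is at least min_tokens, largest first."""
    fits = [row for row in rows if row.context_window >= min_tokens]
    return sorted(fits, key=lambda row: row.context_window, reverse=True)


if __name__ == "__main__":
    for row in models_with_context(SCAN_ROWS, 150_000):
        print(f"{row.provider:<10} {row.model:<28} {row.context_window:>9,}")
```

With the sample rows above, a threshold of 150,000 tokens keeps only the Gemini, GPT-4.1, and Claude entries.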