
Headless mode

Run Letta Code non-interactively for scripting and automation

Headless mode allows you to run Letta Code non-interactively, making it easy to integrate into scripts, CI/CD pipelines, or compose with other UNIX tools.

Use the -p flag to pass a prompt directly:

Run a one-off prompt
letta -p "Look around this repo and write a README.md documenting it"

You can also pipe input to Letta Code:

Pipe input
echo "Explain this error" | letta -p

Letta Code supports three output formats in headless mode:

Text (the default) returns the agent’s response as plain text:

Terminal window
letta -p "What files are in this directory?"

JSON (--output-format json) returns a structured response with metadata:

Terminal window
letta -p "List all TypeScript files" --output-format json
Example output
{
  "type": "result",
  "result": "Found 15 TypeScript files...",
  "agent_id": "agent-abc123",
  "conversation_id": "conversation-xyz789",
  "usage": {
    "prompt_tokens": 1250,
    "completion_tokens": 89
  }
}
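The usage block makes it easy to track token consumption from scripts. A minimal sketch with jq, run over a captured copy of the example output above (in a real script you would capture the letta invocation instead):

```shell
# A captured JSON result (same shape as the example output above).
result='{"type":"result","result":"Found 15 TypeScript files...","agent_id":"agent-abc123","conversation_id":"conversation-xyz789","usage":{"prompt_tokens":1250,"completion_tokens":89}}'

# In a real script: result=$(letta -p "..." --output-format json)
total_tokens=$(echo "$result" | jq '.usage.prompt_tokens + .usage.completion_tokens')
echo "$total_tokens"
```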

Stream JSON (--output-format stream-json) returns line-delimited JSON events for real-time streaming. This is useful for preventing timeouts and for getting incremental progress:

Terminal window
letta -p "Explain this codebase" --output-format stream-json

Each line is a JSON event:

Example stream output
{"type":"system","subtype":"init","agent_id":"agent-...","conversation_id":"conversation-...","session_id":"agent-...","model":"claude-sonnet-4-5","tools":[...]}
{"type":"message","message_type":"reasoning_message","reasoning":"The user is asking...","otid":"...","seq_id":1}
{"type":"message","message_type":"assistant_message","content":"Here's an overview...","otid":"...","seq_id":5}
{"type":"message","message_type":"stop_reason","stop_reason":"end_turn"}
{"type":"message","message_type":"usage_statistics","prompt_tokens":294,"completion_tokens":97}
{"type":"result","subtype":"success","result":"Here's an overview...","agent_id":"...","conversation_id":"...","session_id":"...","uuid":"..."}

Messages are streamed at the token level: every chunk belonging to the same turn shares one otid (output turn ID) and carries an incrementing seq_id.
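Because chunks arrive in seq_id order, reassembling the assistant’s text is just a filter-and-concatenate. A sketch over a captured sample stream (event shapes assumed from the example above):

```shell
# Two token-level chunks of one assistant turn, as emitted by --output-format stream-json.
stream='{"type":"message","message_type":"assistant_message","content":"Here","otid":"t1","seq_id":1}
{"type":"message","message_type":"assistant_message","content":" you go.","otid":"t1","seq_id":2}
{"type":"message","message_type":"stop_reason","stop_reason":"end_turn"}'

# Keep only assistant_message events and join their content in arrival order.
text=$(echo "$stream" | jq -j 'select(.message_type == "assistant_message") | .content')
echo "$text"
```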

For programmatic control, use --input-format stream-json to enable bidirectional JSON communication over stdin/stdout. This allows external programs to send messages and receive responses in a structured format.

Start bidirectional mode
letta -p --input-format stream-json --output-format stream-json

Send JSON messages to stdin (one per line):

User message
{"type": "user", "message": {"role": "user", "content": "What files are here?"}}
Initialize control request
{"type": "control_request", "request_id": "init_1", "request": {"subtype": "initialize"}}
Interrupt control request
{"type": "control_request", "request_id": "int_1", "request": {"subtype": "interrupt"}}

The CLI emits JSON messages to stdout:

Init event (emitted at session start)
{"type": "system", "subtype": "init", "agent_id": "agent-xxx", "conversation_id": "conversation-xxx", "session_id": "agent-xxx", "model": "...", "tools": [...]}
Control response
{"type": "control_response", "response": {"subtype": "success", "request_id": "init_1", "response": {...}}}
Streaming messages
{"type": "message", "message_type": "assistant_message", "content": "Hello!", "session_id": "...", "uuid": "..."}
Result (emitted after each turn)
{"type": "result", "subtype": "success", "result": "Hello!", "session_id": "...", "agent_id": "...", "conversation_id": "..."}
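A consumer of this protocol usually only needs the control_response and result lines; the rest is progress. A sketch filtering a captured sample of stdout (event shapes assumed from the examples above):

```shell
# Sample stdout from a bidirectional session (shapes as documented above).
events='{"type": "control_response", "response": {"subtype": "success", "request_id": "init_1"}}
{"type": "message", "message_type": "assistant_message", "content": "Hello!"}
{"type": "result", "subtype": "success", "result": "Hello!"}'

# Extract only the per-turn result text.
final=$(echo "$events" | jq -r 'select(.type == "result") | .result')
echo "$final"
```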

The process stays alive until stdin closes, allowing multi-turn conversations:

Multi-turn example
(
  echo '{"type": "user", "message": {"role": "user", "content": "Remember: secret is BANANA"}}'
  sleep 5
  echo '{"type": "user", "message": {"role": "user", "content": "What was the secret?"}}'
) | letta -p --input-format stream-json --output-format stream-json

Add --include-partial-messages to receive token-level streaming events:

Terminal window
letta -p --input-format stream-json --output-format stream-json --include-partial-messages

This wraps each chunk in a stream_event:

{"type": "stream_event", "event": {"message_type": "assistant_message", "content": "Hel"}, "session_id": "...", "uuid": "..."}
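For a live display you can unwrap each stream_event and render its content incrementally. A sketch over one captured event (shape assumed from the example above):

```shell
# One partial-message event, as emitted with --include-partial-messages.
event='{"type": "stream_event", "event": {"message_type": "assistant_message", "content": "Hel"}}'

# Unwrap the chunk; a live UI would printf '%s' each chunk as it arrives.
chunk=$(echo "$event" | jq -r 'select(.type == "stream_event") | .event.content')
echo "$chunk"
```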

By default, headless mode uses the most recently used agent for the current directory and its “default” conversation. The agent retains memory across all runs, and the default conversation preserves message history between sessions.

To create a new conversation for parallel sessions, use --new:

Create a new conversation
letta -p "..." --new
Create a new agent
letta -p "..." --new-agent
Use a specific agent
letta -p "..." --agent <agent-id>
Resume last session (agent + conversation)
letta -p "..." --continue
Resume a specific conversation
letta -p "..." --conversation <conversation-id>

The JSON and stream-json output formats include a conversation_id field, which you can use to continue the same conversation in subsequent calls:

Get conversation ID from output
result=$(letta -p "Start a new task" --output-format json)
conv_id=$(echo "$result" | jq -r '.conversation_id')
# Continue the same conversation
letta -p "Continue where we left off" --conversation "$conv_id" --output-format json

Specify a model for the headless run:

Terminal window
letta -p "..." --model sonnet-4.5
letta -p "..." -m gpt-5-codex
letta -p "..." -m haiku

See Models for the full list of supported model IDs.

Use --yolo to bypass all permission prompts (use with caution):

Terminal window
letta -p "Refactor this file" --yolo

The --tools flag controls which tools are attached to the agent (removing them from the context window entirely):

Only load specific tools
letta -p "Analyze this codebase" --tools "Read,Glob,Grep"
No tools (conversation only)
letta -p "What do you think about this approach?" --tools ""

This is different from --allowedTools/--disallowedTools, which control permissions but keep the tools in context. See Permissions for more details.

Alternatively, scope what the agent may do with --permission-mode:

Auto-allow edits only
letta -p "Fix the type errors" --permission-mode acceptEdits
Read-only mode
letta -p "Review this PR" --permission-mode plan

Use -n or --name to resume an agent by name (case-insensitive); it matches pinned or recently used agents:

Terminal window
letta -p "Continue where we left off" --name myproject

Customize the agent’s system prompt when creating new agents:

Use a preset
letta -p "..." --new-agent --system letta-claude
Use a custom prompt
letta -p "..." --new-agent --system-custom "You are a Python expert who writes clean code."
Extend a preset with additional instructions
letta -p "..." --new-agent --system letta-claude --system-append "Always respond in Spanish."

Available presets:

  • default / letta-claude - Full Letta Code prompt (Claude-optimized)
  • letta-codex - Full Letta Code prompt (Codex-optimized)
  • letta-gemini - Full Letta Code prompt (Gemini-optimized)
  • claude - Basic Claude (no skills/memory instructions)
  • codex - Basic Codex
  • gemini - Basic Gemini

Customize which memory blocks the agent uses:

Specify which preset blocks to include
letta -p "..." --new-agent --init-blocks "persona,project"
Set values for preset blocks
letta -p "..." --new-agent --init-blocks "persona,project" \
  --block-value persona="You are a Go expert" \
  --block-value project="CLI tool for Docker"
Use completely custom memory blocks (JSON)
letta -p "..." --new-agent --memory-blocks '[
  {"label": "context", "value": "API documentation for Acme Corp..."},
  {"label": "rules", "value": "Always use TypeScript"}
]'
No optional blocks (only core skills blocks)
letta -p "..." --new-agent --init-blocks ""

Available preset blocks:

  • persona - Agent’s personality and behavior
  • human - Information about the user
  • project - Current project context

Core blocks (always included):

  • skills - Available skills directory
  • loaded_skills - Currently loaded skill instructions
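When block values come from shell variables, building the --memory-blocks JSON with jq avoids quoting pitfalls. A sketch, reusing the custom block labels from the example above (the letta invocation is shown as a comment):

```shell
# Build the JSON payload safely, even if values contain quotes or newlines.
project_notes='API documentation for "Acme Corp"...'
blocks=$(jq -n --arg ctx "$project_notes" '
  [{label: "context", value: $ctx},
   {label: "rules",   value: "Always use TypeScript"}]')

# Then: letta -p "..." --new-agent --memory-blocks "$blocks"
first_label=$(echo "$blocks" | jq -r '.[0].label')
echo "$first_label"
```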

Force a specific toolset instead of auto-detection based on model:

Terminal window
letta -p "..." --toolset codex # Codex-style tools
letta -p "..." --toolset gemini # Gemini-style tools
letta -p "..." --toolset default # Default Letta tools

When creating a new agent with --new-agent, specify which base tools to attach:

Specify which base tools to attach
letta -p "..." --new-agent --base-tools "memory,web_search"

Create an agent from an AgentFile template:

Terminal window
letta -p "..." --from-af ./my-agent.af

Customize the agent’s system prompt:

Replace system prompt entirely
letta -p "..." --system-custom "You are a helpful assistant that only responds in haiku."
Append to default system prompt
letta -p "..." --system-append "Always respond in JSON format."

Configure memory blocks when creating agents:

Set memory blocks via JSON
letta -p "..." --new-agent --memory-blocks '{"persona": "You are a code reviewer", "project": "React app"}'
Set individual block values
letta -p "..." --block-value "persona=You are a security auditor" --block-value "project=Backend API"
Some common automation patterns:

Run lint and fix errors
letta -p "Run the linter and fix any errors" --yolo

Use JSON output to parse results programmatically:

Terminal window
result=$(letta -p "What is the main entry point of this project?" --output-format json)
echo "$result" | jq '.result'

Use --tools to restrict the agent to read-only operations:

Terminal window
letta -p "Review this codebase for potential security issues" --tools "Read,Glob,Grep"

Run Letta Code on a schedule using cron:

Daily code review at 9am
0 9 * * * cd /path/to/project && letta -p "Review recent changes and summarize any issues" --tools "Read,Glob,Grep" --output-format json >> /var/log/letta-review.log 2>&1
Weekly dependency check
0 10 * * 1 cd /path/to/project && letta -p "Check for outdated dependencies and security vulnerabilities" --yolo >> /var/log/letta-deps.log 2>&1
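For unattended runs it helps to verify that a run actually succeeded before trusting the log. A sketch of a helper; the subtype field is taken from the stream-json examples above and is assumed absent (and so treated as success) in plain json output:

```shell
# Return 0 when a JSON result line reports success, 1 otherwise.
check_result() {
  echo "$1" | jq -e '.type == "result" and (.subtype // "success") == "success"' > /dev/null
}

# In a cron script:
#   result=$(letta -p "Review recent changes" --output-format json)
#   check_result "$result" || echo "letta run failed" >&2
if check_result '{"type":"result","subtype":"success","result":"ok"}'; then
  echo "ok"
fi
```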