Agents

List Agents

agents.list() -> SyncArrayPage[AgentState]

GET/v1/agents/

Create Agent

agents.create() -> AgentState

POST/v1/agents/

Update Agent

agents.update(, ) -> AgentState

PATCH/v1/agents/{agent_id}

Retrieve Agent

agents.retrieve(, ) -> AgentState

GET/v1/agents/{agent_id}

Delete Agent

agents.delete() -> object

DELETE/v1/agents/{agent_id}

Export Agent

agents.export_file(, ) -> AgentExportFileResponse

GET/v1/agents/{agent_id}/export

Import Agent

agents.import_file() -> AgentImportFileResponse

POST/v1/agents/import

Recompile Agent

agents.recompile(, ) -> AgentRecompileResponse

POST/v1/agents/{agent_id}/recompile

ModelsExpand Collapse

class AgentEnvironmentVariable: …

agent_id: str

The ID of the agent this environment variable belongs to.

key: str

The name of the environment variable.

value: str

The value of the environment variable.

id: Optional[str]

The human-friendly ID of the Agent-env

created_at: Optional[datetime]

The timestamp when the object was created.

formatdate-time

created_by_id: Optional[str]

The id of the user that made this object.

description: Optional[str]

An optional description of the environment variable.

last_updated_by_id: Optional[str]

The id of the user that made this object.

updated_at: Optional[datetime]

The timestamp when the object was last updated.

formatdate-time

value_enc: Optional[str]

Encrypted secret value (stored as encrypted string)

class AgentState: …

Representation of an agent’s state. This is the state of the agent at a given time, and is persisted in the DB backend. The state has all the information needed to recreate a persisted agent.

id: str

The id of the agent. Assigned by the database.

agent_type: AgentType

The type of agent.

One of the following:

"memgpt_agent"

"memgpt_v2_agent"

"letta_v1_agent"

"react_agent"

"workflow_agent"

"split_thread_agent"

"sleeptime_agent"

"voice_convo_agent"

"voice_sleeptime_agent"

blocks: List[Block]

The memory blocks used by the agent.

value: str

Value of the block.

id: Optional[str]

The human-friendly ID of the Block

base_template_id: Optional[str]

The base template id of the block.

created_by_id: Optional[str]

The id of the user that made this Block.

deployment_id: Optional[str]

The id of the deployment.

description: Optional[str]

Description of the block.

entity_id: Optional[str]

The id of the entity within the template.

hidden: Optional[bool]

If set to True, the block will be hidden.

is_template: Optional[bool]

Whether the block is a template (e.g. saved human/persona options).

label: Optional[str]

Label of the block (e.g. ‘human’, ‘persona’) in the context window.

last_updated_by_id: Optional[str]

The id of the user that last updated this Block.

limit: Optional[int]

Character limit of the block.

metadata: Optional[Dict[str, object]]

Metadata of the block.

preserve_on_migration: Optional[bool]

Preserve the block on template migration.

project_id: Optional[str]

The associated project id.

read_only: Optional[bool]

Whether the agent has read-only access to the block.

tags: Optional[List[str]]

The tags associated with the block.

template_id: Optional[str]

The id of the template.

template_name: Optional[str]

Name of the block if it is a template.

Deprecatedllm_config: LlmConfig

Deprecated: Use model field instead. The LLM configuration used by the agent.

context_window: int

The context window size for the model.

model: str

LLM model name.

model_endpoint_type: Literal["openai", "anthropic", "google_ai", 27 more]

The endpoint type for the model.

One of the following:

"openai"

"anthropic"

"google_ai"

"google_vertex"

"azure"

"groq"

"ollama"

"webui"

"webui-legacy"

"lmstudio"

"lmstudio-legacy"

"lmstudio-chatcompletions"

"llamacpp"

"koboldcpp"

"vllm"

"hugging-face"

"minimax"

"moonshot"

"moonshot_coding"

"mistral"

"together"

"bedrock"

"deepseek"

"xai"

"zai"

"zai_coding"

"baseten"

"fireworks"

"openrouter"

"chatgpt_oauth"

compatibility_type: Optional[Literal["gguf", "mlx"]]

The framework compatibility type for the model.

One of the following:

"gguf"

"mlx"

display_name: Optional[str]

A human-friendly display name for the model.

effort: Optional[Literal["low", "medium", "high", 2 more]]

The effort level for Anthropic models that support it (Opus 4.5+). Controls token spending and thinking behavior. Not setting this gives similar performance to ‘high’.

One of the following:

"low"

"medium"

"high"

"xhigh"

"max"

enable_reasoner: Optional[bool]

Whether or not the model should use extended thinking if it is a ‘reasoning’ style model

frequency_penalty: Optional[float]

Positive values penalize new tokens based on their existing frequency in the text so far, decreasing the model’s likelihood to repeat the same line verbatim. From OpenAI: Number between -2.0 and 2.0.

handle: Optional[str]

The handle for this config, in the format provider/model-name.

max_reasoning_tokens: Optional[int]

Configurable thinking budget for extended thinking. Used for enable_reasoner and also for Google Vertex models like Gemini 2.5 Flash. Minimum value is 1024 when used with enable_reasoner.

max_tokens: Optional[int]

The maximum number of tokens to generate. If not set, the model will use its default value.

model_endpoint: Optional[str]

The endpoint for the model.

model_wrapper: Optional[str]

The wrapper for the model.

Deprecatedparallel_tool_calls: Optional[bool]

Deprecated: Use model_settings to configure parallel tool calls instead. If set to True, enables parallel tool calling. Defaults to False.

provider_category: Optional[ProviderCategory]

The provider category for the model.

One of the following:

"base"

"byok"

provider_name: Optional[str]

The provider name for the model.

put_inner_thoughts_in_kwargs: Optional[bool]

Puts ‘inner_thoughts’ as a kwarg in the function call if this is set to True. This helps with function calling performance and also the generation of inner thoughts.

reasoning_effort: Optional[Literal["none", "minimal", "low", 3 more]]

The reasoning effort to use when generating text reasoning models

One of the following:

"none"

"minimal"

"low"

"medium"

"high"

"xhigh"

response_format: Optional[ResponseFormat]

The response format for the model’s output. Supports text, json_object, and json_schema (structured outputs). Can be set via model_settings.

One of the following:

class TextResponseFormat: …

Response format for plain text responses.

type: Optional[Literal["text"]]

The type of the response format.

class JsonSchemaResponseFormat: …

Response format for JSON schema-based responses.

json_schema: Dict[str, object]

The JSON schema of the response.

type: Optional[Literal["json_schema"]]

The type of the response format.

class JsonObjectResponseFormat: …

Response format for JSON object responses.

type: Optional[Literal["json_object"]]

The type of the response format.

return_logprobs: Optional[bool]

Whether to return log probabilities of the output tokens. Useful for RL training.

return_token_ids: Optional[bool]

Whether to return token IDs for all LLM generations via SGLang native endpoint. Required for multi-turn RL training with loss masking. Only works with SGLang provider.

strict: Optional[bool]

Enable strict mode for tool calling. When true, tool schemas include strict: true and additionalProperties: false, guaranteeing tool outputs match JSON schemas.

temperature: Optional[float]

The temperature to use when generating text with the model. A higher temperature will result in more random text.

tier: Optional[str]

The cost tier for the model (cloud only).

tool_call_parser: Optional[str]

SGLang tool call parser name (e.g. ‘glm47’, ‘qwen25’, ‘hermes’). Used by the SGLang native adapter to parse tool calls from raw model output.

top_logprobs: Optional[int]

Number of most likely tokens to return at each position (0-20). Requires return_logprobs=True.

verbosity: Optional[Literal["low", "medium", "high"]]

Soft control for how verbose model output should be, used for GPT-5 models.

One of the following:

"low"

"medium"

"high"

Deprecatedmemory: Memory

Deprecated: Use blocks field instead. The in-context memory of the agent.

blocks: List[Block]

Memory blocks contained in the agent’s in-context memory

value: str

Value of the block.

id: Optional[str]

The human-friendly ID of the Block

base_template_id: Optional[str]

The base template id of the block.

created_by_id: Optional[str]

The id of the user that made this Block.

deployment_id: Optional[str]

The id of the deployment.

description: Optional[str]

Description of the block.

entity_id: Optional[str]

The id of the entity within the template.

hidden: Optional[bool]

If set to True, the block will be hidden.

is_template: Optional[bool]

Whether the block is a template (e.g. saved human/persona options).

label: Optional[str]

Label of the block (e.g. ‘human’, ‘persona’) in the context window.

last_updated_by_id: Optional[str]

The id of the user that last updated this Block.

limit: Optional[int]

Character limit of the block.

metadata: Optional[Dict[str, object]]

Metadata of the block.

preserve_on_migration: Optional[bool]

Preserve the block on template migration.

project_id: Optional[str]

The associated project id.

read_only: Optional[bool]

Whether the agent has read-only access to the block.

tags: Optional[List[str]]

The tags associated with the block.

template_id: Optional[str]

The id of the template.

template_name: Optional[str]

Name of the block if it is a template.

agent_type: Optional[Union[AgentType, str, null]]

Agent type controlling prompt rendering.

One of the following:

Literal["memgpt_agent", "memgpt_v2_agent", "letta_v1_agent", 6 more]

One of the following:

"memgpt_agent"

"memgpt_v2_agent"

"letta_v1_agent"

"react_agent"

"workflow_agent"

"split_thread_agent"

"sleeptime_agent"

"voice_convo_agent"

"voice_sleeptime_agent"

str

file_blocks: Optional[List[MemoryFileBlock]]

Special blocks representing the agent’s in-context memory of an attached file

file_id: str

Unique identifier of the file.

is_open: bool

True if the agent currently has the file open.

Deprecatedsource_id: str

Deprecated: Use folder_id field instead. Unique identifier of the source.

value: str

Value of the block.

id: Optional[str]

The human-friendly ID of the Block

base_template_id: Optional[str]

The base template id of the block.

created_by_id: Optional[str]

The id of the user that made this Block.

deployment_id: Optional[str]

The id of the deployment.

description: Optional[str]

Description of the block.

entity_id: Optional[str]

The id of the entity within the template.

hidden: Optional[bool]

If set to True, the block will be hidden.

is_template: Optional[bool]

Whether the block is a template (e.g. saved human/persona options).

label: Optional[str]

Label of the block (e.g. ‘human’, ‘persona’) in the context window.

last_accessed_at: Optional[datetime]

UTC timestamp of the agent’s most recent access to this file. Any operations from the open, close, or search tools will update this field.

formatdate-time

last_updated_by_id: Optional[str]

The id of the user that last updated this Block.

limit: Optional[int]

Character limit of the block.

metadata: Optional[Dict[str, object]]

Metadata of the block.

preserve_on_migration: Optional[bool]

Preserve the block on template migration.

project_id: Optional[str]

The associated project id.

read_only: Optional[bool]

Whether the agent has read-only access to the block.

tags: Optional[List[str]]

The tags associated with the block.

template_id: Optional[str]

The id of the template.

template_name: Optional[str]

Name of the block if it is a template.

git_enabled: Optional[bool]

Whether this agent uses git-backed memory with structured labels.

prompt_template: Optional[str]

Deprecated. Ignored for performance.

The name of the agent.

Deprecatedsources: List[Source]

Deprecated: Use folders field instead. The sources used by the agent.

id: str

The human-friendly ID of the Source

embedding_config: EmbeddingConfig

The embedding configuration used by the source.

embedding_dim: int

The dimension of the embedding.

embedding_endpoint_type: Literal["openai", "anthropic", "bedrock", 16 more]

The endpoint type for the model.

One of the following:

"openai"

"anthropic"

"bedrock"

"google_ai"

"google_vertex"

"azure"

"groq"

"ollama"

"webui"

"webui-legacy"

"lmstudio"

"lmstudio-legacy"

"llamacpp"

"koboldcpp"

"vllm"

"hugging-face"

"mistral"

"together"

"pinecone"

embedding_model: str

The model for the embedding.

azure_deployment: Optional[str]

The Azure deployment for the model.

azure_endpoint: Optional[str]

The Azure endpoint for the model.

azure_version: Optional[str]

The Azure version for the model.

batch_size: Optional[int]

The maximum batch size for processing embeddings.

embedding_chunk_size: Optional[int]

The chunk size of the embedding.

embedding_endpoint: Optional[str]

The endpoint for the model (None if local).

handle: Optional[str]

The handle for this config, in the format provider/model-name.

The name of the source.

created_at: Optional[datetime]

The timestamp when the source was created.

formatdate-time

created_by_id: Optional[str]

The id of the user that made this Tool.

description: Optional[str]

The description of the source.

instructions: Optional[str]

Instructions for how to use the source.

last_updated_by_id: Optional[str]

The id of the user that made this Tool.

metadata: Optional[Dict[str, object]]

Metadata associated with the source.

updated_at: Optional[datetime]

The timestamp when the source was last updated.

formatdate-time

vector_db_provider: Optional[VectorDBProvider]

The vector database provider used for this source’s passages

One of the following:

"native"

"tpuf"

"pinecone"

system: str

The system prompt used by the agent.

tags: List[str]

The tags associated with the agent.

tools: List[Tool]

The tools used by the agent.

id: str

The human-friendly ID of the Tool

args_json_schema: Optional[Dict[str, object]]

The args JSON schema of the function.

created_by_id: Optional[str]

The id of the user that made this Tool.

default_requires_approval: Optional[bool]

Default value for whether or not executing this tool requires approval.

description: Optional[str]

The description of the tool.

enable_parallel_execution: Optional[bool]

If set to True, then this tool will potentially be executed concurrently with other tools. Default False.

json_schema: Optional[Dict[str, object]]

The JSON schema of the function.

last_updated_by_id: Optional[str]

The id of the user that made this Tool.

metadata: Optional[Dict[str, object]]

A dictionary of additional metadata for the tool.

The name of the function.

npm_requirements: Optional[List[NpmRequirement]]

Optional list of npm packages required by this tool.

Name of the npm package.

minLength1

version: Optional[str]

Optional version of the package, following semantic versioning.

pip_requirements: Optional[List[PipRequirement]]

Optional list of pip packages required by this tool.

Name of the pip package.

minLength1

version: Optional[str]

Optional version of the package, following semantic versioning.

project_id: Optional[str]

The project id of the tool.

return_char_limit: Optional[int]

The maximum number of characters in the response.

maximum1000000

minimum1

source_code: Optional[str]

The source code of the function.

source_type: Optional[str]

The type of the source code.

tags: Optional[List[str]]

Metadata tags.

tool_type: Optional[ToolType]

The type of the tool.

One of the following:

"custom"

"letta_core"

"letta_memory_core"

"letta_multi_agent_core"

"letta_sleeptime_core"

"letta_voice_sleeptime_core"

"letta_builtin"

"letta_files_core"

"external_langchain"

"external_composio"

"external_mcp"

base_template_id: Optional[str]

The base template id of the agent.

compaction_settings: Optional[CompactionSettings]

Configuration for conversation compaction / summarization.

Per-model settings (temperature, max tokens, etc.) are derived from the default configuration for that handle.

clip_chars: Optional[int]

The maximum length of the summary in characters. If none, no clipping is performed.

mode: Optional[Literal["all", "sliding_window", "self_compact_all", "self_compact_sliding_window"]]

The type of summarization technique use.

One of the following:

"all"

"sliding_window"

"self_compact_all"

"self_compact_sliding_window"

model: Optional[str]

Model handle to use for sliding_window/all summarization (format: provider/model-name). If None, uses lightweight provider-specific defaults.

model_settings: Optional[CompactionSettingsModelSettings]

Optional model settings used to override defaults for the summarizer model.

One of the following:

class OpenAIModelSettings: …

max_output_tokens: Optional[int]

The maximum number of tokens the model can generate.

parallel_tool_calls: Optional[bool]

Whether to enable parallel tool calling.

provider_type: Optional[Literal["openai"]]

The type of the provider.

reasoning: Optional[Reasoning]

The reasoning configuration for the model.

reasoning_effort: Optional[Literal["none", "minimal", "low", 3 more]]

The reasoning effort to use when generating text reasoning models

One of the following:

"none"

"minimal"

"low"

"medium"

"high"

"xhigh"

response_format: Optional[ResponseFormat]

The response format for the model.

One of the following:

class TextResponseFormat: …

Response format for plain text responses.

type: Optional[Literal["text"]]

The type of the response format.

class JsonSchemaResponseFormat: …

Response format for JSON schema-based responses.

json_schema: Dict[str, object]

The JSON schema of the response.

type: Optional[Literal["json_schema"]]

The type of the response format.

class JsonObjectResponseFormat: …

Response format for JSON object responses.

type: Optional[Literal["json_object"]]

The type of the response format.

strict: Optional[bool]

Enable strict mode for tool calling. When true, tool outputs are guaranteed to match JSON schemas.

temperature: Optional[float]

The temperature of the model.

class CompactionSettingsModelSettingsSgLangModelSettings: …

SGLang model configuration (OpenAI-compatible runtime with SGLang-specific parsing).

max_output_tokens: Optional[int]

The maximum number of tokens the model can generate.

parallel_tool_calls: Optional[bool]

Whether to enable parallel tool calling.

provider_type: Optional[Literal["sglang"]]

The type of the provider.

reasoning: Optional[CompactionSettingsModelSettingsSgLangModelSettingsReasoning]

The reasoning configuration for the model.

reasoning_effort: Optional[Literal["none", "minimal", "low", 3 more]]

The reasoning effort to use when generating text reasoning models

One of the following:

"none"

"minimal"

"low"

"medium"

"high"

"xhigh"

response_format: Optional[CompactionSettingsModelSettingsSgLangModelSettingsResponseFormat]

The response format for the model.

One of the following:

class TextResponseFormat: …

Response format for plain text responses.

type: Optional[Literal["text"]]

The type of the response format.

class JsonSchemaResponseFormat: …

Response format for JSON schema-based responses.

json_schema: Dict[str, object]

The JSON schema of the response.

type: Optional[Literal["json_schema"]]

The type of the response format.

class JsonObjectResponseFormat: …

Response format for JSON object responses.

type: Optional[Literal["json_object"]]

The type of the response format.

strict: Optional[bool]

Enable strict mode for tool calling. When true, tool outputs are guaranteed to match JSON schemas.

temperature: Optional[float]

The temperature of the model.

tool_call_parser: Optional[str]

SGLang tool call parser name (for example ‘glm47’, ‘qwen25’, or ‘hermes’).

class AnthropicModelSettings: …

effort: Optional[Literal["low", "medium", "high", 2 more]]

Effort level for supported Anthropic models (controls token spending). ‘xhigh’ and ‘max’ are available on Opus 4.6+. Not setting this gives similar performance to ‘high’.

One of the following:

"low"

"medium"

"high"

"xhigh"

"max"

max_output_tokens: Optional[int]

The maximum number of tokens the model can generate.

parallel_tool_calls: Optional[bool]

Whether to enable parallel tool calling.

provider_type: Optional[Literal["anthropic"]]

The type of the provider.

response_format: Optional[ResponseFormat]

The response format for the model.

One of the following:

class TextResponseFormat: …

Response format for plain text responses.

type: Optional[Literal["text"]]

The type of the response format.

class JsonSchemaResponseFormat: …

Response format for JSON schema-based responses.

json_schema: Dict[str, object]

The JSON schema of the response.

type: Optional[Literal["json_schema"]]

The type of the response format.

class JsonObjectResponseFormat: …

Response format for JSON object responses.

type: Optional[Literal["json_object"]]

The type of the response format.

strict: Optional[bool]

Enable strict mode for tool calling. When true, tool outputs are guaranteed to match JSON schemas.

temperature: Optional[float]

The temperature of the model.

thinking: Optional[Thinking]

The thinking configuration for the model.

budget_tokens: Optional[int]

The maximum number of tokens the model can use for extended thinking.

type: Optional[Literal["enabled", "disabled"]]

The type of thinking to use.

One of the following:

"enabled"

"disabled"

verbosity: Optional[Literal["low", "medium", "high"]]

Soft control for how verbose model output should be, used for GPT-5 models.

One of the following:

"low"

"medium"

"high"

class GoogleAIModelSettings: …

max_output_tokens: Optional[int]

The maximum number of tokens the model can generate.

parallel_tool_calls: Optional[bool]

Whether to enable parallel tool calling.

provider_type: Optional[Literal["google_ai"]]

The type of the provider.

response_schema: Optional[ResponseSchema]

The response schema for the model.

One of the following:

class TextResponseFormat: …

Response format for plain text responses.

type: Optional[Literal["text"]]

The type of the response format.

class JsonSchemaResponseFormat: …

Response format for JSON schema-based responses.

json_schema: Dict[str, object]

The JSON schema of the response.

type: Optional[Literal["json_schema"]]

The type of the response format.

class JsonObjectResponseFormat: …

Response format for JSON object responses.

type: Optional[Literal["json_object"]]

The type of the response format.

temperature: Optional[float]

The temperature of the model.

thinking_config: Optional[ThinkingConfig]

The thinking configuration for the model.

include_thoughts: Optional[bool]

Whether to include thoughts in the model’s response.

thinking_budget: Optional[int]

The thinking budget for the model.

class GoogleVertexModelSettings: …

max_output_tokens: Optional[int]

The maximum number of tokens the model can generate.

parallel_tool_calls: Optional[bool]

Whether to enable parallel tool calling.

provider_type: Optional[Literal["google_vertex"]]

The type of the provider.

response_schema: Optional[ResponseSchema]

The response schema for the model.

One of the following:

class TextResponseFormat: …

Response format for plain text responses.

type: Optional[Literal["text"]]

The type of the response format.

class JsonSchemaResponseFormat: …

Response format for JSON schema-based responses.

json_schema: Dict[str, object]

The JSON schema of the response.

type: Optional[Literal["json_schema"]]

The type of the response format.

class JsonObjectResponseFormat: …

Response format for JSON object responses.

type: Optional[Literal["json_object"]]

The type of the response format.

temperature: Optional[float]

The temperature of the model.

thinking_config: Optional[ThinkingConfig]

The thinking configuration for the model.

include_thoughts: Optional[bool]

Whether to include thoughts in the model’s response.

thinking_budget: Optional[int]

The thinking budget for the model.

class AzureModelSettings: …

Azure OpenAI model configuration (OpenAI-compatible).

max_output_tokens: Optional[int]

The maximum number of tokens the model can generate.

parallel_tool_calls: Optional[bool]

Whether to enable parallel tool calling.

provider_type: Optional[Literal["azure"]]

The type of the provider.

response_format: Optional[ResponseFormat]

The response format for the model.

One of the following:

class TextResponseFormat: …

Response format for plain text responses.

type: Optional[Literal["text"]]

The type of the response format.

class JsonSchemaResponseFormat: …

Response format for JSON schema-based responses.

json_schema: Dict[str, object]

The JSON schema of the response.

type: Optional[Literal["json_schema"]]

The type of the response format.

class JsonObjectResponseFormat: …

Response format for JSON object responses.

type: Optional[Literal["json_object"]]

The type of the response format.

temperature: Optional[float]

The temperature of the model.

class XaiModelSettings: …

xAI model configuration (OpenAI-compatible).

max_output_tokens: Optional[int]

The maximum number of tokens the model can generate.

parallel_tool_calls: Optional[bool]

Whether to enable parallel tool calling.

provider_type: Optional[Literal["xai"]]

The type of the provider.

response_format: Optional[ResponseFormat]

The response format for the model.

One of the following:

class TextResponseFormat: …

Response format for plain text responses.

type: Optional[Literal["text"]]

The type of the response format.

class JsonSchemaResponseFormat: …

Response format for JSON schema-based responses.

json_schema: Dict[str, object]

The JSON schema of the response.

type: Optional[Literal["json_schema"]]

The type of the response format.

class JsonObjectResponseFormat: …

Response format for JSON object responses.

type: Optional[Literal["json_object"]]

The type of the response format.

temperature: Optional[float]

The temperature of the model.

class CompactionSettingsModelSettingsMoonshotModelSettings: …

Moonshot/Kimi model configuration (OpenAI-compatible).

max_output_tokens: Optional[int]

The maximum number of tokens the model can generate.

parallel_tool_calls: Optional[bool]

Whether to enable parallel tool calling.

provider_type: Optional[Literal["moonshot"]]

The type of the provider.

response_format: Optional[CompactionSettingsModelSettingsMoonshotModelSettingsResponseFormat]

The response format for the model.

One of the following:

class TextResponseFormat: …

Response format for plain text responses.

type: Optional[Literal["text"]]

The type of the response format.

class JsonSchemaResponseFormat: …

Response format for JSON schema-based responses.

json_schema: Dict[str, object]

The JSON schema of the response.

type: Optional[Literal["json_schema"]]

The type of the response format.

class JsonObjectResponseFormat: …

Response format for JSON object responses.

type: Optional[Literal["json_object"]]

The type of the response format.

strict: Optional[bool]

Enable strict mode for tool calling. When true, tool outputs are guaranteed to match JSON schemas.

temperature: Optional[float]

The temperature of the model.

class CompactionSettingsModelSettingsZaiModelSettings: …

Z.ai (ZhipuAI) model configuration (OpenAI-compatible).

max_output_tokens: Optional[int]

The maximum number of tokens the model can generate.

parallel_tool_calls: Optional[bool]

Whether to enable parallel tool calling.

provider_type: Optional[Literal["zai"]]

The type of the provider.

response_format: Optional[CompactionSettingsModelSettingsZaiModelSettingsResponseFormat]

The response format for the model.

One of the following:

class TextResponseFormat: …

Response format for plain text responses.

type: Optional[Literal["text"]]

The type of the response format.

class JsonSchemaResponseFormat: …

Response format for JSON schema-based responses.

json_schema: Dict[str, object]

The JSON schema of the response.

type: Optional[Literal["json_schema"]]

The type of the response format.

class JsonObjectResponseFormat: …

Response format for JSON object responses.

type: Optional[Literal["json_object"]]

The type of the response format.

temperature: Optional[float]

The temperature of the model.

thinking: Optional[CompactionSettingsModelSettingsZaiModelSettingsThinking]

The thinking configuration for GLM-4.5+ models.

clear_thinking: Optional[bool]

If False, preserved thinking is used (recommended for agents).

type: Optional[Literal["enabled", "disabled"]]

Whether thinking is enabled or disabled.

One of the following:

"enabled"

"disabled"

class CompactionSettingsModelSettingsMoonshotCodingModelSettings: …

Kimi Code model configuration (Anthropic-compatible).

effort: Optional[Literal["low", "medium", "high", 2 more]]

Effort level for supported Anthropic models (controls token spending). ‘xhigh’ and ‘max’ are available on Opus 4.6+. Not setting this gives similar performance to ‘high’.

One of the following:

"low"

"medium"

"high"

"xhigh"

"max"

max_output_tokens: Optional[int]

The maximum number of tokens the model can generate.

parallel_tool_calls: Optional[bool]

Whether to enable parallel tool calling.

provider_type: Optional[Literal["moonshot_coding"]]

The type of the provider.

response_format: Optional[CompactionSettingsModelSettingsMoonshotCodingModelSettingsResponseFormat]

The response format for the model.

One of the following:

class TextResponseFormat: …

Response format for plain text responses.

type: Optional[Literal["text"]]

The type of the response format.

class JsonSchemaResponseFormat: …

Response format for JSON schema-based responses.

json_schema: Dict[str, object]

The JSON schema of the response.

type: Optional[Literal["json_schema"]]

The type of the response format.

class JsonObjectResponseFormat: …

Response format for JSON object responses.

type: Optional[Literal["json_object"]]

The type of the response format.

strict: Optional[bool]

Enable strict mode for tool calling. When true, tool outputs are guaranteed to match JSON schemas.

temperature: Optional[float]

The temperature of the model.

thinking: Optional[CompactionSettingsModelSettingsMoonshotCodingModelSettingsThinking]

The thinking configuration for the model.

budget_tokens: Optional[int]

The maximum number of tokens the model can use for extended thinking.

type: Optional[Literal["enabled", "disabled"]]

The type of thinking to use.

One of the following:

"enabled"

"disabled"

verbosity: Optional[Literal["low", "medium", "high"]]

Soft control for how verbose model output should be, used for GPT-5 models.

One of the following:

"low"

"medium"

"high"

class GroqModelSettings: …

Groq model configuration (OpenAI-compatible).

max_output_tokens: Optional[int]

The maximum number of tokens the model can generate.

parallel_tool_calls: Optional[bool]

Whether to enable parallel tool calling.

provider_type: Optional[Literal["groq"]]

The type of the provider.

response_format: Optional[ResponseFormat]

The response format for the model.

One of the following:

class TextResponseFormat: …

Response format for plain text responses.

type: Optional[Literal["text"]]

The type of the response format.

class JsonSchemaResponseFormat: …

Response format for JSON schema-based responses.

json_schema: Dict[str, object]

The JSON schema of the response.

type: Optional[Literal["json_schema"]]

The type of the response format.

class JsonObjectResponseFormat: …

Response format for JSON object responses.

type: Optional[Literal["json_object"]]

The type of the response format.

temperature: Optional[float]

The temperature of the model.

class DeepseekModelSettings: …

Deepseek model configuration (OpenAI-compatible).

max_output_tokens: Optional[int]

The maximum number of tokens the model can generate.

parallel_tool_calls: Optional[bool]

Whether to enable parallel tool calling.

provider_type: Optional[Literal["deepseek"]]

The type of the provider.

response_format: Optional[ResponseFormat]

The response format for the model.

One of the following:

class TextResponseFormat: …

Response format for plain text responses.

type: Optional[Literal["text"]]

The type of the response format.

class JsonSchemaResponseFormat: …

Response format for JSON schema-based responses.

json_schema: Dict[str, object]

The JSON schema of the response.

type: Optional[Literal["json_schema"]]

The type of the response format.

class JsonObjectResponseFormat: …

Response format for JSON object responses.

type: Optional[Literal["json_object"]]

The type of the response format.

temperature: Optional[float]

The temperature of the model.

class TogetherModelSettings: …

Together AI model configuration (OpenAI-compatible).

max_output_tokens: Optional[int]

The maximum number of tokens the model can generate.

parallel_tool_calls: Optional[bool]

Whether to enable parallel tool calling.

provider_type: Optional[Literal["together"]]

The type of the provider.

response_format: Optional[ResponseFormat]

The response format for the model.

One of the following:

class TextResponseFormat: …

Response format for plain text responses.

type: Optional[Literal["text"]]

The type of the response format.

class JsonSchemaResponseFormat: …

Response format for JSON schema-based responses.

json_schema: Dict[str, object]

The JSON schema of the response.

type: Optional[Literal["json_schema"]]

The type of the response format.

class JsonObjectResponseFormat: …

Response format for JSON object responses.

type: Optional[Literal["json_object"]]

The type of the response format.

temperature: Optional[float]

The temperature of the model.

class BedrockModelSettings: …

AWS Bedrock model configuration.

max_output_tokens: Optional[int]

The maximum number of tokens the model can generate.

parallel_tool_calls: Optional[bool]

Whether to enable parallel tool calling.

provider_type: Optional[Literal["bedrock"]]

The type of the provider.

response_format: Optional[ResponseFormat]

The response format for the model.

One of the following:

class TextResponseFormat: …

Response format for plain text responses.

type: Optional[Literal["text"]]

The type of the response format.

class JsonSchemaResponseFormat: …

Response format for JSON schema-based responses.

json_schema: Dict[str, object]

The JSON schema of the response.

type: Optional[Literal["json_schema"]]

The type of the response format.

class JsonObjectResponseFormat: …

Response format for JSON object responses.

type: Optional[Literal["json_object"]]

The type of the response format.

temperature: Optional[float]

The temperature of the model.

class CompactionSettingsModelSettingsBasetenModelSettings: …

Baseten model configuration (OpenAI-compatible).

max_output_tokens: Optional[int]

The maximum number of tokens the model can generate.

parallel_tool_calls: Optional[bool]

Whether to enable parallel tool calling.

provider_type: Optional[Literal["baseten"]]

The type of the provider.

temperature: Optional[float]

The temperature of the model.

class CompactionSettingsModelSettingsOpenRouterModelSettings: …

OpenRouter model configuration (OpenAI-compatible).

max_output_tokens: Optional[int]

The maximum number of tokens the model can generate.

parallel_tool_calls: Optional[bool]

Whether to enable parallel tool calling.

provider_type: Optional[Literal["openrouter"]]

The type of the provider.

response_format: Optional[CompactionSettingsModelSettingsOpenRouterModelSettingsResponseFormat]

The response format for the model.

One of the following:

class TextResponseFormat: …

Response format for plain text responses.

type: Optional[Literal["text"]]

The type of the response format.

class JsonSchemaResponseFormat: …

Response format for JSON schema-based responses.

json_schema: Dict[str, object]

The JSON schema of the response.

type: Optional[Literal["json_schema"]]

The type of the response format.

class JsonObjectResponseFormat: …

Response format for JSON object responses.

type: Optional[Literal["json_object"]]

The type of the response format.

temperature: Optional[float]

The temperature of the model.

class CompactionSettingsModelSettingsChatGptoAuthModelSettings: …

ChatGPT OAuth model configuration (uses ChatGPT backend API).

max_output_tokens: Optional[int]

The maximum number of tokens the model can generate.

parallel_tool_calls: Optional[bool]

Whether to enable parallel tool calling.

provider_type: Optional[Literal["chatgpt_oauth"]]

The type of the provider.

reasoning: Optional[CompactionSettingsModelSettingsChatGptoAuthModelSettingsReasoning]

The reasoning configuration for the model.

reasoning_effort: Optional[Literal["none", "low", "medium", 2 more]]

The reasoning effort level for GPT-5.x and o-series models.

One of the following:

"none"

"low"

"medium"

"high"

"xhigh"

temperature: Optional[float]

The temperature of the model.

prompt: Optional[str]

The prompt to use for summarization. If None, uses mode-specific default.

prompt_acknowledgement: Optional[bool]

Whether to include an acknowledgement post-prompt (helps prevent non-summary outputs).

sliding_window_percentage: Optional[float]

The percentage of the context window to keep post-summarization (only used in sliding window modes).

created_at: Optional[datetime]

The timestamp when the object was created.

formatdate-time

created_by_id: Optional[str]

The id of the user that made this object.

deployment_id: Optional[str]

The id of the deployment.

description: Optional[str]

The description of the agent.

embedding: Optional[str]

The embedding model handle used by the agent (format: provider/model-name).

Deprecatedembedding_config: Optional[EmbeddingConfig]

Configuration for embedding model connection and processing parameters.

embedding_dim: int

The dimension of the embedding.

embedding_endpoint_type: Literal["openai", "anthropic", "bedrock", 16 more]

The endpoint type for the model.

One of the following:

"openai"

"anthropic"

"bedrock"

"google_ai"

"google_vertex"

"azure"

"groq"

"ollama"

"webui"

"webui-legacy"

"lmstudio"

"lmstudio-legacy"

"llamacpp"

"koboldcpp"

"vllm"

"hugging-face"

"mistral"

"together"

"pinecone"

embedding_model: str

The model for the embedding.

azure_deployment: Optional[str]

The Azure deployment for the model.

azure_endpoint: Optional[str]

The Azure endpoint for the model.

azure_version: Optional[str]

The Azure version for the model.

batch_size: Optional[int]

The maximum batch size for processing embeddings.

embedding_chunk_size: Optional[int]

The chunk size of the embedding.

embedding_endpoint: Optional[str]

The endpoint for the model (None if local).

handle: Optional[str]

The handle for this config, in the format provider/model-name.

enable_sleeptime: Optional[bool]

If set to True, memory management will move to a background agent thread.

entity_id: Optional[str]

The id of the entity within the template.

hidden: Optional[bool]

If set to True, the agent will be hidden.

identities: Optional[List[Identity]]

The identities associated with this agent.

id: str

The human-friendly ID of the Identity

Deprecatedagent_ids: List[str]

The IDs of the agents associated with the identity.

Deprecatedblock_ids: List[str]

The IDs of the blocks associated with the identity.

identifier_key: str

External, user-generated identifier key of the identity.

identity_type: Literal["org", "user", "other"]

The type of the identity.

One of the following:

"org"

"user"

"other"

The name of the identity.

project_id: Optional[str]

The project id of the identity, if applicable.

properties: Optional[List[IdentityProperty]]

List of properties associated with the identity

key: str

The key of the property

type: Literal["string", "number", "boolean", "json"]

The type of the property

One of the following:

"string"

"number"

"boolean"

"json"

value: Union[str, float, bool, Dict[str, object]]

The value of the property

One of the following:

str

float

bool

Dict[str, object]

Deprecatedidentity_ids: Optional[List[str]]

Deprecated: Use identities field instead. The ids of the identities associated with this agent.

last_run_completion: Optional[datetime]

The timestamp when the agent last completed a run.

formatdate-time

last_run_duration_ms: Optional[int]

The duration in milliseconds of the agent’s last run.

last_stop_reason: Optional[StopReasonType]

The stop reason from the agent’s last run.

One of the following:

"end_turn"

"error"

"llm_api_error"

"invalid_llm_response"

"invalid_tool_call"

"max_steps"

"max_tokens_exceeded"

"no_tool_call"

"tool_rule"

"cancelled"

"insufficient_credits"

"requires_approval"

"context_window_overflow_in_system_prompt"

last_updated_by_id: Optional[str]

The id of the user that made this object.

managed_group: Optional[ManagedGroup]

The multi-agent group that this agent manages

id: str

The id of the group. Assigned by the database.

agent_ids: List[str]

description: str

manager_type: Literal["round_robin", "supervisor", "dynamic", 3 more]

One of the following:

"round_robin"

"supervisor"

"dynamic"

"sleeptime"

"voice_sleeptime"

"swarm"

base_template_id: Optional[str]

The base template id.

deployment_id: Optional[str]

The id of the deployment.

hidden: Optional[bool]

If set to True, the group will be hidden.

last_processed_message_id: Optional[str]

manager_agent_id: Optional[str]

max_message_buffer_length: Optional[int]

The desired maximum length of messages in the context window of the convo agent. This is a best effort, and may be off slightly due to user/assistant interleaving.

max_turns: Optional[int]

min_message_buffer_length: Optional[int]

The desired minimum length of messages in the context window of the convo agent. This is a best effort, and may be off-by-one due to user/assistant interleaving.

project_id: Optional[str]

The associated project id.

Deprecatedshared_block_ids: Optional[List[str]]

sleeptime_agent_frequency: Optional[int]

template_id: Optional[str]

The id of the template.

termination_token: Optional[str]

turns_counter: Optional[int]

max_files_open: Optional[int]

Maximum number of files that can be open at once for this agent. Setting this too high may exceed the context window, which will break the agent.

message_buffer_autoclear: Optional[bool]

If set to True, the agent will not remember previous messages (though the agent will still retain state via core memory blocks and archival/recall memory). Not recommended unless you have an advanced use case.

message_ids: Optional[List[str]]

The ids of the messages in the agent’s in-context memory.

metadata: Optional[Dict[str, object]]

The metadata of the agent.

model: Optional[str]

The model handle used by the agent (format: provider/model-name).

model_settings: Optional[ModelSettings]

The model settings used by the agent.

One of the following:

class OpenAIModelSettings: …

max_output_tokens: Optional[int]

The maximum number of tokens the model can generate.

parallel_tool_calls: Optional[bool]

Whether to enable parallel tool calling.

provider_type: Optional[Literal["openai"]]

The type of the provider.

reasoning: Optional[Reasoning]

The reasoning configuration for the model.

reasoning_effort: Optional[Literal["none", "minimal", "low", 3 more]]

The reasoning effort to use when generating text reasoning models

One of the following:

"none"

"minimal"

"low"

"medium"

"high"

"xhigh"

response_format: Optional[ResponseFormat]

The response format for the model.

One of the following:

class TextResponseFormat: …

Response format for plain text responses.

type: Optional[Literal["text"]]

The type of the response format.

class JsonSchemaResponseFormat: …

Response format for JSON schema-based responses.

json_schema: Dict[str, object]

The JSON schema of the response.

type: Optional[Literal["json_schema"]]

The type of the response format.

class JsonObjectResponseFormat: …

Response format for JSON object responses.

type: Optional[Literal["json_object"]]

The type of the response format.

strict: Optional[bool]

Enable strict mode for tool calling. When true, tool outputs are guaranteed to match JSON schemas.

temperature: Optional[float]

The temperature of the model.

class ModelSettingsSgLangModelSettings: …

SGLang model configuration (OpenAI-compatible runtime with SGLang-specific parsing).

max_output_tokens: Optional[int]

The maximum number of tokens the model can generate.

parallel_tool_calls: Optional[bool]

Whether to enable parallel tool calling.

provider_type: Optional[Literal["sglang"]]

The type of the provider.

reasoning: Optional[ModelSettingsSgLangModelSettingsReasoning]

The reasoning configuration for the model.

reasoning_effort: Optional[Literal["none", "minimal", "low", 3 more]]

The reasoning effort to use when generating text reasoning models

One of the following:

"none"

"minimal"

"low"

"medium"

"high"

"xhigh"

response_format: Optional[ModelSettingsSgLangModelSettingsResponseFormat]

The response format for the model.

One of the following:

class TextResponseFormat: …

Response format for plain text responses.

type: Optional[Literal["text"]]

The type of the response format.

class JsonSchemaResponseFormat: …

Response format for JSON schema-based responses.

json_schema: Dict[str, object]

The JSON schema of the response.

type: Optional[Literal["json_schema"]]

The type of the response format.

class JsonObjectResponseFormat: …

Response format for JSON object responses.

type: Optional[Literal["json_object"]]

The type of the response format.

strict: Optional[bool]

Enable strict mode for tool calling. When true, tool outputs are guaranteed to match JSON schemas.

temperature: Optional[float]

The temperature of the model.

tool_call_parser: Optional[str]

SGLang tool call parser name (for example ‘glm47’, ‘qwen25’, or ‘hermes’).

class AnthropicModelSettings: …

effort: Optional[Literal["low", "medium", "high", 2 more]]

Effort level for supported Anthropic models (controls token spending). ‘xhigh’ and ‘max’ are available on Opus 4.6+. Not setting this gives similar performance to ‘high’.

One of the following:

"low"

"medium"

"high"

"xhigh"

"max"

max_output_tokens: Optional[int]

The maximum number of tokens the model can generate.

parallel_tool_calls: Optional[bool]

Whether to enable parallel tool calling.

provider_type: Optional[Literal["anthropic"]]

The type of the provider.

response_format: Optional[ResponseFormat]

The response format for the model.

One of the following:

class TextResponseFormat: …

Response format for plain text responses.

type: Optional[Literal["text"]]

The type of the response format.

class JsonSchemaResponseFormat: …

Response format for JSON schema-based responses.

json_schema: Dict[str, object]

The JSON schema of the response.

type: Optional[Literal["json_schema"]]

The type of the response format.

class JsonObjectResponseFormat: …

Response format for JSON object responses.

type: Optional[Literal["json_object"]]

The type of the response format.

strict: Optional[bool]

Enable strict mode for tool calling. When true, tool outputs are guaranteed to match JSON schemas.

temperature: Optional[float]

The temperature of the model.

thinking: Optional[Thinking]

The thinking configuration for the model.

budget_tokens: Optional[int]

The maximum number of tokens the model can use for extended thinking.

type: Optional[Literal["enabled", "disabled"]]

The type of thinking to use.

One of the following:

"enabled"

"disabled"

verbosity: Optional[Literal["low", "medium", "high"]]

Soft control for how verbose model output should be, used for GPT-5 models.

One of the following:

"low"

"medium"

"high"

class GoogleAIModelSettings: …

max_output_tokens: Optional[int]

The maximum number of tokens the model can generate.

parallel_tool_calls: Optional[bool]

Whether to enable parallel tool calling.

provider_type: Optional[Literal["google_ai"]]

The type of the provider.

response_schema: Optional[ResponseSchema]

The response schema for the model.

One of the following:

class TextResponseFormat: …

Response format for plain text responses.

type: Optional[Literal["text"]]

The type of the response format.

class JsonSchemaResponseFormat: …

Response format for JSON schema-based responses.

json_schema: Dict[str, object]

The JSON schema of the response.

type: Optional[Literal["json_schema"]]

The type of the response format.

class JsonObjectResponseFormat: …

Response format for JSON object responses.

type: Optional[Literal["json_object"]]

The type of the response format.

temperature: Optional[float]

The temperature of the model.

thinking_config: Optional[ThinkingConfig]

The thinking configuration for the model.

include_thoughts: Optional[bool]

Whether to include thoughts in the model’s response.

thinking_budget: Optional[int]

The thinking budget for the model.

class GoogleVertexModelSettings: …

max_output_tokens: Optional[int]

The maximum number of tokens the model can generate.

parallel_tool_calls: Optional[bool]

Whether to enable parallel tool calling.

provider_type: Optional[Literal["google_vertex"]]

The type of the provider.

response_schema: Optional[ResponseSchema]

The response schema for the model.

One of the following:

class TextResponseFormat: …

Response format for plain text responses.

type: Optional[Literal["text"]]

The type of the response format.

class JsonSchemaResponseFormat: …

Response format for JSON schema-based responses.

json_schema: Dict[str, object]

The JSON schema of the response.

type: Optional[Literal["json_schema"]]

The type of the response format.

class JsonObjectResponseFormat: …

Response format for JSON object responses.

type: Optional[Literal["json_object"]]

The type of the response format.

temperature: Optional[float]

The temperature of the model.

thinking_config: Optional[ThinkingConfig]

The thinking configuration for the model.

include_thoughts: Optional[bool]

Whether to include thoughts in the model’s response.

thinking_budget: Optional[int]

The thinking budget for the model.

class AzureModelSettings: …

Azure OpenAI model configuration (OpenAI-compatible).

max_output_tokens: Optional[int]

The maximum number of tokens the model can generate.

parallel_tool_calls: Optional[bool]

Whether to enable parallel tool calling.

provider_type: Optional[Literal["azure"]]

The type of the provider.

response_format: Optional[ResponseFormat]

The response format for the model.

One of the following:

class TextResponseFormat: …

Response format for plain text responses.

type: Optional[Literal["text"]]

The type of the response format.

class JsonSchemaResponseFormat: …

Response format for JSON schema-based responses.

json_schema: Dict[str, object]

The JSON schema of the response.

type: Optional[Literal["json_schema"]]

The type of the response format.

class JsonObjectResponseFormat: …

Response format for JSON object responses.

type: Optional[Literal["json_object"]]

The type of the response format.

temperature: Optional[float]

The temperature of the model.

class XaiModelSettings: …

xAI model configuration (OpenAI-compatible).

max_output_tokens: Optional[int]

The maximum number of tokens the model can generate.

parallel_tool_calls: Optional[bool]

Whether to enable parallel tool calling.

provider_type: Optional[Literal["xai"]]

The type of the provider.

response_format: Optional[ResponseFormat]

The response format for the model.

One of the following:

class TextResponseFormat: …

Response format for plain text responses.

type: Optional[Literal["text"]]

The type of the response format.

class JsonSchemaResponseFormat: …

Response format for JSON schema-based responses.

json_schema: Dict[str, object]

The JSON schema of the response.

type: Optional[Literal["json_schema"]]

The type of the response format.

class JsonObjectResponseFormat: …

Response format for JSON object responses.

type: Optional[Literal["json_object"]]

The type of the response format.

temperature: Optional[float]

The temperature of the model.

class ModelSettingsMoonshotModelSettings: …

Moonshot/Kimi model configuration (OpenAI-compatible).

max_output_tokens: Optional[int]

The maximum number of tokens the model can generate.

parallel_tool_calls: Optional[bool]

Whether to enable parallel tool calling.

provider_type: Optional[Literal["moonshot"]]

The type of the provider.

response_format: Optional[ModelSettingsMoonshotModelSettingsResponseFormat]

The response format for the model.

One of the following:

class TextResponseFormat: …

Response format for plain text responses.

type: Optional[Literal["text"]]

The type of the response format.

class JsonSchemaResponseFormat: …

Response format for JSON schema-based responses.

json_schema: Dict[str, object]

The JSON schema of the response.

type: Optional[Literal["json_schema"]]

The type of the response format.

class JsonObjectResponseFormat: …

Response format for JSON object responses.

type: Optional[Literal["json_object"]]

The type of the response format.

strict: Optional[bool]

Enable strict mode for tool calling. When true, tool outputs are guaranteed to match JSON schemas.

temperature: Optional[float]

The temperature of the model.

class ModelSettingsZaiModelSettings: …

Z.ai (ZhipuAI) model configuration (OpenAI-compatible).

max_output_tokens: Optional[int]

The maximum number of tokens the model can generate.

parallel_tool_calls: Optional[bool]

Whether to enable parallel tool calling.

provider_type: Optional[Literal["zai"]]

The type of the provider.

response_format: Optional[ModelSettingsZaiModelSettingsResponseFormat]

The response format for the model.

One of the following:

class TextResponseFormat: …

Response format for plain text responses.

type: Optional[Literal["text"]]

The type of the response format.

class JsonSchemaResponseFormat: …

Response format for JSON schema-based responses.

json_schema: Dict[str, object]

The JSON schema of the response.

type: Optional[Literal["json_schema"]]

The type of the response format.

class JsonObjectResponseFormat: …

Response format for JSON object responses.

type: Optional[Literal["json_object"]]

The type of the response format.

temperature: Optional[float]

The temperature of the model.

thinking: Optional[ModelSettingsZaiModelSettingsThinking]

The thinking configuration for GLM-4.5+ models.

clear_thinking: Optional[bool]

If False, preserved thinking is used (recommended for agents).

type: Optional[Literal["enabled", "disabled"]]

Whether thinking is enabled or disabled.

One of the following:

"enabled"

"disabled"

class ModelSettingsMoonshotCodingModelSettings: …

Kimi Code model configuration (Anthropic-compatible).

effort: Optional[Literal["low", "medium", "high", 2 more]]

Effort level for supported Anthropic models (controls token spending). ‘xhigh’ and ‘max’ are available on Opus 4.6+. Not setting this gives similar performance to ‘high’.

One of the following:

"low"

"medium"

"high"

"xhigh"

"max"

max_output_tokens: Optional[int]

The maximum number of tokens the model can generate.

parallel_tool_calls: Optional[bool]

Whether to enable parallel tool calling.

provider_type: Optional[Literal["moonshot_coding"]]

The type of the provider.

response_format: Optional[ModelSettingsMoonshotCodingModelSettingsResponseFormat]

The response format for the model.

One of the following:

class TextResponseFormat: …

Response format for plain text responses.

type: Optional[Literal["text"]]

The type of the response format.

class JsonSchemaResponseFormat: …

Response format for JSON schema-based responses.

json_schema: Dict[str, object]

The JSON schema of the response.

type: Optional[Literal["json_schema"]]

The type of the response format.

class JsonObjectResponseFormat: …

Response format for JSON object responses.

type: Optional[Literal["json_object"]]

The type of the response format.

strict: Optional[bool]

Enable strict mode for tool calling. When true, tool outputs are guaranteed to match JSON schemas.

temperature: Optional[float]

The temperature of the model.

thinking: Optional[ModelSettingsMoonshotCodingModelSettingsThinking]

The thinking configuration for the model.

budget_tokens: Optional[int]

The maximum number of tokens the model can use for extended thinking.

type: Optional[Literal["enabled", "disabled"]]

The type of thinking to use.

One of the following:

"enabled"

"disabled"

verbosity: Optional[Literal["low", "medium", "high"]]

Soft control for how verbose model output should be, used for GPT-5 models.

One of the following:

"low"

"medium"

"high"

class GroqModelSettings: …

Groq model configuration (OpenAI-compatible).

max_output_tokens: Optional[int]

The maximum number of tokens the model can generate.

parallel_tool_calls: Optional[bool]

Whether to enable parallel tool calling.

provider_type: Optional[Literal["groq"]]

The type of the provider.

response_format: Optional[ResponseFormat]

The response format for the model.

One of the following:

class TextResponseFormat: …

Response format for plain text responses.

type: Optional[Literal["text"]]

The type of the response format.

class JsonSchemaResponseFormat: …

Response format for JSON schema-based responses.

json_schema: Dict[str, object]

The JSON schema of the response.

type: Optional[Literal["json_schema"]]

The type of the response format.

class JsonObjectResponseFormat: …

Response format for JSON object responses.

type: Optional[Literal["json_object"]]

The type of the response format.

temperature: Optional[float]

The temperature of the model.

class DeepseekModelSettings: …

Deepseek model configuration (OpenAI-compatible).

max_output_tokens: Optional[int]

The maximum number of tokens the model can generate.

parallel_tool_calls: Optional[bool]

Whether to enable parallel tool calling.

provider_type: Optional[Literal["deepseek"]]

The type of the provider.

response_format: Optional[ResponseFormat]

The response format for the model.

One of the following:

class TextResponseFormat: …

Response format for plain text responses.

type: Optional[Literal["text"]]

The type of the response format.

class JsonSchemaResponseFormat: …

Response format for JSON schema-based responses.

json_schema: Dict[str, object]

The JSON schema of the response.

type: Optional[Literal["json_schema"]]

The type of the response format.

class JsonObjectResponseFormat: …

Response format for JSON object responses.

type: Optional[Literal["json_object"]]

The type of the response format.

temperature: Optional[float]

The temperature of the model.

class TogetherModelSettings: …

Together AI model configuration (OpenAI-compatible).

max_output_tokens: Optional[int]

The maximum number of tokens the model can generate.

parallel_tool_calls: Optional[bool]

Whether to enable parallel tool calling.

provider_type: Optional[Literal["together"]]

The type of the provider.

response_format: Optional[ResponseFormat]

The response format for the model.

One of the following:

class TextResponseFormat: …

Response format for plain text responses.

type: Optional[Literal["text"]]

The type of the response format.

class JsonSchemaResponseFormat: …

Response format for JSON schema-based responses.

json_schema: Dict[str, object]

The JSON schema of the response.

type: Optional[Literal["json_schema"]]

The type of the response format.

class JsonObjectResponseFormat: …

Response format for JSON object responses.

type: Optional[Literal["json_object"]]

The type of the response format.

temperature: Optional[float]

The temperature of the model.

class BedrockModelSettings: …

AWS Bedrock model configuration.

max_output_tokens: Optional[int]

The maximum number of tokens the model can generate.

parallel_tool_calls: Optional[bool]

Whether to enable parallel tool calling.

provider_type: Optional[Literal["bedrock"]]

The type of the provider.

response_format: Optional[ResponseFormat]

The response format for the model.

One of the following:

class TextResponseFormat: …

Response format for plain text responses.

type: Optional[Literal["text"]]

The type of the response format.

class JsonSchemaResponseFormat: …

Response format for JSON schema-based responses.

json_schema: Dict[str, object]

The JSON schema of the response.

type: Optional[Literal["json_schema"]]

The type of the response format.

class JsonObjectResponseFormat: …

Response format for JSON object responses.

type: Optional[Literal["json_object"]]

The type of the response format.

temperature: Optional[float]

The temperature of the model.

class ModelSettingsBasetenModelSettings: …

Baseten model configuration (OpenAI-compatible).

max_output_tokens: Optional[int]

The maximum number of tokens the model can generate.

parallel_tool_calls: Optional[bool]

Whether to enable parallel tool calling.

provider_type: Optional[Literal["baseten"]]

The type of the provider.

temperature: Optional[float]

The temperature of the model.

class ModelSettingsOpenRouterModelSettings: …

OpenRouter model configuration (OpenAI-compatible).

max_output_tokens: Optional[int]

The maximum number of tokens the model can generate.

parallel_tool_calls: Optional[bool]

Whether to enable parallel tool calling.

provider_type: Optional[Literal["openrouter"]]

The type of the provider.

response_format: Optional[ModelSettingsOpenRouterModelSettingsResponseFormat]

The response format for the model.

One of the following:

class TextResponseFormat: …

Response format for plain text responses.

type: Optional[Literal["text"]]

The type of the response format.

class JsonSchemaResponseFormat: …

Response format for JSON schema-based responses.

json_schema: Dict[str, object]

The JSON schema of the response.

type: Optional[Literal["json_schema"]]

The type of the response format.

class JsonObjectResponseFormat: …

Response format for JSON object responses.

type: Optional[Literal["json_object"]]

The type of the response format.

temperature: Optional[float]

The temperature of the model.

class ModelSettingsChatGptoAuthModelSettings: …

ChatGPT OAuth model configuration (uses ChatGPT backend API).

max_output_tokens: Optional[int]

The maximum number of tokens the model can generate.

parallel_tool_calls: Optional[bool]

Whether to enable parallel tool calling.

provider_type: Optional[Literal["chatgpt_oauth"]]

The type of the provider.

reasoning: Optional[ModelSettingsChatGptoAuthModelSettingsReasoning]

The reasoning configuration for the model.

reasoning_effort: Optional[Literal["none", "low", "medium", 2 more]]

The reasoning effort level for GPT-5.x and o-series models.

One of the following:

"none"

"low"

"medium"

"high"

"xhigh"

temperature: Optional[float]

The temperature of the model.

Deprecatedmulti_agent_group: Optional[MultiAgentGroup]

Deprecated: Use managed_group field instead. The multi-agent group that this agent manages.

id: str

The id of the group. Assigned by the database.

agent_ids: List[str]

description: str

manager_type: Literal["round_robin", "supervisor", "dynamic", 3 more]

One of the following:

"round_robin"

"supervisor"

"dynamic"

"sleeptime"

"voice_sleeptime"

"swarm"

base_template_id: Optional[str]

The base template id.

deployment_id: Optional[str]

The id of the deployment.

hidden: Optional[bool]

If set to True, the group will be hidden.

last_processed_message_id: Optional[str]

manager_agent_id: Optional[str]

max_message_buffer_length: Optional[int]

The desired maximum length of messages in the context window of the convo agent. This is a best effort, and may be off slightly due to user/assistant interleaving.

max_turns: Optional[int]

min_message_buffer_length: Optional[int]

The desired minimum length of messages in the context window of the convo agent. This is a best effort, and may be off-by-one due to user/assistant interleaving.

project_id: Optional[str]

The associated project id.

Deprecatedshared_block_ids: Optional[List[str]]

sleeptime_agent_frequency: Optional[int]

template_id: Optional[str]

The id of the template.

termination_token: Optional[str]

turns_counter: Optional[int]

pending_approval: Optional[ApprovalRequestMessage]

A message representing a request for approval to call a tool (generated by the LLM to trigger tool execution).

Args: id (str): The ID of the message date (datetime): The date the message was created in ISO format name (Optional[str]): The name of the sender of the message tool_call (ToolCall): The tool call

id: str

date: datetime

Deprecatedtool_call: ToolCall

The tool call that has been requested by the llm to run

One of the following:

class ToolCall: …

arguments: str

tool_call_id: str

class ToolCallDelta: …

arguments: Optional[str]

tool_call_id: Optional[str]

is_err: Optional[bool]

message_type: Optional[Literal["approval_request_message"]]

The type of the message.

otid: Optional[str]

The offline threading id (OTID). Set by the client to deduplicate requests. Used for idempotency in background streaming mode — each message in a request must have a unique OTID. Retries of the same request should reuse the same OTIDs.

run_id: Optional[str]

sender_id: Optional[str]

seq_id: Optional[int]

step_id: Optional[str]

tool_calls: Optional[ToolCalls]

The tool calls that have been requested by the llm to run, which are pending approval

One of the following:

List[ToolCall]

arguments: str

tool_call_id: str

class ToolCallDelta: …

arguments: Optional[str]

tool_call_id: Optional[str]

per_file_view_window_char_limit: Optional[int]

The per-file view window character limit for this agent. Setting this too high may exceed the context window, which will break the agent.

project_id: Optional[str]

The id of the project the agent belongs to.

response_format: Optional[ResponseFormat]

The response format used by the agent

One of the following:

class TextResponseFormat: …

Response format for plain text responses.

type: Optional[Literal["text"]]

The type of the response format.

class JsonSchemaResponseFormat: …

Response format for JSON schema-based responses.

json_schema: Dict[str, object]

The JSON schema of the response.

type: Optional[Literal["json_schema"]]

The type of the response format.

class JsonObjectResponseFormat: …

Response format for JSON object responses.

type: Optional[Literal["json_object"]]

The type of the response format.

secrets: Optional[List[AgentEnvironmentVariable]]

The environment variables for tool execution specific to this agent.

agent_id: str

The ID of the agent this environment variable belongs to.

key: str

The name of the environment variable.

value: str

The value of the environment variable.

id: Optional[str]

The human-friendly ID of the Agent-env

created_at: Optional[datetime]

The timestamp when the object was created.

formatdate-time

created_by_id: Optional[str]

The id of the user that made this object.

description: Optional[str]

An optional description of the environment variable.

last_updated_by_id: Optional[str]

The id of the user that made this object.

updated_at: Optional[datetime]

The timestamp when the object was last updated.

formatdate-time

value_enc: Optional[str]

Encrypted secret value (stored as encrypted string)

template_id: Optional[str]

The id of the template the agent belongs to.

timezone: Optional[str]

The timezone of the agent (IANA format).

Deprecatedtool_exec_environment_variables: Optional[List[AgentEnvironmentVariable]]

Deprecated: use secrets field instead.

agent_id: str

The ID of the agent this environment variable belongs to.

key: str

The name of the environment variable.

value: str

The value of the environment variable.

id: Optional[str]

The human-friendly ID of the Agent-env

created_at: Optional[datetime]

The timestamp when the object was created.

formatdate-time

created_by_id: Optional[str]

The id of the user that made this object.

description: Optional[str]

An optional description of the environment variable.

last_updated_by_id: Optional[str]

The id of the user that made this object.

updated_at: Optional[datetime]

The timestamp when the object was last updated.

formatdate-time

value_enc: Optional[str]

Encrypted secret value (stored as encrypted string)

tool_rules: Optional[List[ToolRule]]

The list of tool rules.

One of the following:

class ChildToolRule: …

A ToolRule represents a tool that can be invoked by the agent.

children: List[str]

The children tools that can be invoked.

tool_name: str

The name of the tool. Must exist in the database for the user’s organization.

child_arg_nodes: Optional[List[ChildArgNode]]

Optional list of typed child argument overrides. Each node must reference a child in ‘children’.

The name of the child tool to invoke next.

args: Optional[Dict[str, object]]

Optional prefilled arguments for this child tool. Keys must match the tool’s parameter names and values must satisfy the tool’s JSON schema. Supports partial prefill; non-overlapping parameters are left to the model.

prompt_template: Optional[str]

Optional template string (ignored).

type: Optional[Literal["constrain_child_tools"]]

class InitToolRule: …

Represents the initial tool rule configuration.

tool_name: str

The name of the tool. Must exist in the database for the user’s organization.

args: Optional[Dict[str, object]]

Optional prefilled arguments for this tool. When present, these values will override any LLM-provided arguments with the same keys during invocation. Keys must match the tool’s parameter names and values must satisfy the tool’s JSON schema. Supports partial prefill; non-overlapping parameters are left to the model.

prompt_template: Optional[str]

Optional template string (ignored). Rendering uses fast built-in formatting for performance.

type: Optional[Literal["run_first"]]

class TerminalToolRule: …

Represents a terminal tool rule configuration where if this tool gets called, it must end the agent loop.

tool_name: str

The name of the tool. Must exist in the database for the user’s organization.

prompt_template: Optional[str]

Optional template string (ignored).

type: Optional[Literal["exit_loop"]]

class ConditionalToolRule: …

A ToolRule that conditionally maps to different child tools based on the output.

child_output_mapping: Dict[str, str]

The output case to check for mapping

tool_name: str

The name of the tool. Must exist in the database for the user’s organization.

default_child: Optional[str]

The default child tool to be called. If None, any tool can be called.

prompt_template: Optional[str]

Optional template string (ignored).

require_output_mapping: Optional[bool]

Whether to throw an error when output doesn’t match any case

type: Optional[Literal["conditional"]]

class ContinueToolRule: …

Represents a tool rule configuration where if this tool gets called, it must continue the agent loop.

tool_name: str

The name of the tool. Must exist in the database for the user’s organization.

prompt_template: Optional[str]

Optional template string (ignored).

type: Optional[Literal["continue_loop"]]

class RequiredBeforeExitToolRule: …

Represents a tool rule configuration where this tool must be called before the agent loop can exit.

tool_name: str

The name of the tool. Must exist in the database for the user’s organization.

prompt_template: Optional[str]

Optional template string (ignored).

type: Optional[Literal["required_before_exit"]]

class MaxCountPerStepToolRule: …

Represents a tool rule configuration which constrains the total number of times this tool can be invoked in a single step.

max_count_limit: int

The max limit for the total number of times this tool can be invoked in a single step.

tool_name: str

The name of the tool. Must exist in the database for the user’s organization.

prompt_template: Optional[str]

Optional template string (ignored).

type: Optional[Literal["max_count_per_step"]]

class ParentToolRule: …

A ToolRule that only allows a child tool to be called if the parent has been called.

children: List[str]

The children tools that can be invoked.

tool_name: str

The name of the tool. Must exist in the database for the user’s organization.

prompt_template: Optional[str]

Optional template string (ignored).

type: Optional[Literal["parent_last_tool"]]

class RequiresApprovalToolRule: …

Represents a tool rule configuration which requires approval before the tool can be invoked.

tool_name: str

The name of the tool. Must exist in the database for the user’s organization.

prompt_template: Optional[str]

Optional template string (ignored). Rendering uses fast built-in formatting for performance.

type: Optional[Literal["requires_approval"]]

updated_at: Optional[datetime]

The timestamp when the object was last updated.

formatdate-time

Literal["memgpt_agent", "memgpt_v2_agent", "letta_v1_agent", 6 more]

Enum to represent the type of agent.

One of the following:

"memgpt_agent"

"memgpt_v2_agent"

"letta_v1_agent"

"react_agent"

"workflow_agent"

"split_thread_agent"

"sleeptime_agent"

"voice_convo_agent"

"voice_sleeptime_agent"

class AnthropicModelSettings: …

effort: Optional[Literal["low", "medium", "high", 2 more]]

Effort level for supported Anthropic models (controls token spending). ‘xhigh’ and ‘max’ are available on Opus 4.6+. Not setting this gives similar performance to ‘high’.

One of the following:

"low"

"medium"

"high"

"xhigh"

"max"

max_output_tokens: Optional[int]

The maximum number of tokens the model can generate.

parallel_tool_calls: Optional[bool]

Whether to enable parallel tool calling.

provider_type: Optional[Literal["anthropic"]]

The type of the provider.

response_format: Optional[ResponseFormat]

The response format for the model.

One of the following:

class TextResponseFormat: …

Response format for plain text responses.

type: Optional[Literal["text"]]

The type of the response format.

class JsonSchemaResponseFormat: …

Response format for JSON schema-based responses.

json_schema: Dict[str, object]

The JSON schema of the response.

type: Optional[Literal["json_schema"]]

The type of the response format.

class JsonObjectResponseFormat: …

Response format for JSON object responses.

type: Optional[Literal["json_object"]]

The type of the response format.

strict: Optional[bool]

Enable strict mode for tool calling. When true, tool outputs are guaranteed to match JSON schemas.

temperature: Optional[float]

The temperature of the model.

thinking: Optional[Thinking]

The thinking configuration for the model.

budget_tokens: Optional[int]

The maximum number of tokens the model can use for extended thinking.

type: Optional[Literal["enabled", "disabled"]]

The type of thinking to use.

One of the following:

"enabled"

"disabled"

verbosity: Optional[Literal["low", "medium", "high"]]

Soft control for how verbose model output should be, used for GPT-5 models.

One of the following:

"low"

"medium"

"high"

class AzureModelSettings: …

Azure OpenAI model configuration (OpenAI-compatible).

max_output_tokens: Optional[int]

The maximum number of tokens the model can generate.

parallel_tool_calls: Optional[bool]

Whether to enable parallel tool calling.

provider_type: Optional[Literal["azure"]]

The type of the provider.

response_format: Optional[ResponseFormat]

The response format for the model.

One of the following:

class TextResponseFormat: …

Response format for plain text responses.

type: Optional[Literal["text"]]

The type of the response format.

class JsonSchemaResponseFormat: …

Response format for JSON schema-based responses.

json_schema: Dict[str, object]

The JSON schema of the response.

type: Optional[Literal["json_schema"]]

The type of the response format.

class JsonObjectResponseFormat: …

Response format for JSON object responses.

type: Optional[Literal["json_object"]]

The type of the response format.

temperature: Optional[float]

The temperature of the model.

class BedrockModelSettings: …

AWS Bedrock model configuration.

max_output_tokens: Optional[int]

The maximum number of tokens the model can generate.

parallel_tool_calls: Optional[bool]

Whether to enable parallel tool calling.

provider_type: Optional[Literal["bedrock"]]

The type of the provider.

response_format: Optional[ResponseFormat]

The response format for the model.

One of the following:

class TextResponseFormat: …

Response format for plain text responses.

type: Optional[Literal["text"]]

The type of the response format.

class JsonSchemaResponseFormat: …

Response format for JSON schema-based responses.

json_schema: Dict[str, object]

The JSON schema of the response.

type: Optional[Literal["json_schema"]]

The type of the response format.

class JsonObjectResponseFormat: …

Response format for JSON object responses.

type: Optional[Literal["json_object"]]

The type of the response format.

temperature: Optional[float]

The temperature of the model.

class ChildToolRule: …

A ToolRule represents a tool that can be invoked by the agent.

children: List[str]

The children tools that can be invoked.

tool_name: str

The name of the tool. Must exist in the database for the user’s organization.

child_arg_nodes: Optional[List[ChildArgNode]]

Optional list of typed child argument overrides. Each node must reference a child in ‘children’.

The name of the child tool to invoke next.

args: Optional[Dict[str, object]]

prompt_template: Optional[str]

Optional template string (ignored).

type: Optional[Literal["constrain_child_tools"]]

class ConditionalToolRule: …

A ToolRule that conditionally maps to different child tools based on the output.

child_output_mapping: Dict[str, str]

The output case to check for mapping

tool_name: str

The name of the tool. Must exist in the database for the user’s organization.

default_child: Optional[str]

The default child tool to be called. If None, any tool can be called.

prompt_template: Optional[str]

Optional template string (ignored).

require_output_mapping: Optional[bool]

Whether to throw an error when output doesn’t match any case

type: Optional[Literal["conditional"]]

class ContinueToolRule: …

Represents a tool rule configuration where if this tool gets called, it must continue the agent loop.

tool_name: str

The name of the tool. Must exist in the database for the user’s organization.

prompt_template: Optional[str]

Optional template string (ignored).

type: Optional[Literal["continue_loop"]]

class DeepseekModelSettings: …

Deepseek model configuration (OpenAI-compatible).

max_output_tokens: Optional[int]

The maximum number of tokens the model can generate.

parallel_tool_calls: Optional[bool]

Whether to enable parallel tool calling.

provider_type: Optional[Literal["deepseek"]]

The type of the provider.

response_format: Optional[ResponseFormat]

The response format for the model.

One of the following:

class TextResponseFormat: …

Response format for plain text responses.

type: Optional[Literal["text"]]

The type of the response format.

class JsonSchemaResponseFormat: …

Response format for JSON schema-based responses.

json_schema: Dict[str, object]

The JSON schema of the response.

type: Optional[Literal["json_schema"]]

The type of the response format.

class JsonObjectResponseFormat: …

Response format for JSON object responses.

type: Optional[Literal["json_object"]]

The type of the response format.

temperature: Optional[float]

The temperature of the model.

class GoogleAIModelSettings: …

max_output_tokens: Optional[int]

The maximum number of tokens the model can generate.

parallel_tool_calls: Optional[bool]

Whether to enable parallel tool calling.

provider_type: Optional[Literal["google_ai"]]

The type of the provider.

response_schema: Optional[ResponseSchema]

The response schema for the model.

One of the following:

class TextResponseFormat: …

Response format for plain text responses.

type: Optional[Literal["text"]]

The type of the response format.

class JsonSchemaResponseFormat: …

Response format for JSON schema-based responses.

json_schema: Dict[str, object]

The JSON schema of the response.

type: Optional[Literal["json_schema"]]

The type of the response format.

class JsonObjectResponseFormat: …

Response format for JSON object responses.

type: Optional[Literal["json_object"]]

The type of the response format.

temperature: Optional[float]

The temperature of the model.

thinking_config: Optional[ThinkingConfig]

The thinking configuration for the model.

include_thoughts: Optional[bool]

Whether to include thoughts in the model’s response.

thinking_budget: Optional[int]

The thinking budget for the model.

class GoogleVertexModelSettings: …

max_output_tokens: Optional[int]

The maximum number of tokens the model can generate.

parallel_tool_calls: Optional[bool]

Whether to enable parallel tool calling.

provider_type: Optional[Literal["google_vertex"]]

The type of the provider.

response_schema: Optional[ResponseSchema]

The response schema for the model.

One of the following:

class TextResponseFormat: …

Response format for plain text responses.

type: Optional[Literal["text"]]

The type of the response format.

class JsonSchemaResponseFormat: …

Response format for JSON schema-based responses.

json_schema: Dict[str, object]

The JSON schema of the response.

type: Optional[Literal["json_schema"]]

The type of the response format.

class JsonObjectResponseFormat: …

Response format for JSON object responses.

type: Optional[Literal["json_object"]]

The type of the response format.

temperature: Optional[float]

The temperature of the model.

thinking_config: Optional[ThinkingConfig]

The thinking configuration for the model.

include_thoughts: Optional[bool]

Whether to include thoughts in the model’s response.

thinking_budget: Optional[int]

The thinking budget for the model.

class GroqModelSettings: …

Groq model configuration (OpenAI-compatible).

max_output_tokens: Optional[int]

The maximum number of tokens the model can generate.

parallel_tool_calls: Optional[bool]

Whether to enable parallel tool calling.

provider_type: Optional[Literal["groq"]]

The type of the provider.

response_format: Optional[ResponseFormat]

The response format for the model.

One of the following:

class TextResponseFormat: …

Response format for plain text responses.

type: Optional[Literal["text"]]

The type of the response format.

class JsonSchemaResponseFormat: …

Response format for JSON schema-based responses.

json_schema: Dict[str, object]

The JSON schema of the response.

type: Optional[Literal["json_schema"]]

The type of the response format.

class JsonObjectResponseFormat: …

Response format for JSON object responses.

type: Optional[Literal["json_object"]]

The type of the response format.

temperature: Optional[float]

The temperature of the model.

class InitToolRule: …

Represents the initial tool rule configuration.

tool_name: str

The name of the tool. Must exist in the database for the user’s organization.

args: Optional[Dict[str, object]]

prompt_template: Optional[str]

Optional template string (ignored). Rendering uses fast built-in formatting for performance.

type: Optional[Literal["run_first"]]

class JsonObjectResponseFormat: …

Response format for JSON object responses.

type: Optional[Literal["json_object"]]

The type of the response format.

class JsonSchemaResponseFormat: …

Response format for JSON schema-based responses.

json_schema: Dict[str, object]

The JSON schema of the response.

type: Optional[Literal["json_schema"]]

The type of the response format.

LettaMessageContentUnion

Sent via the Anthropic Messages API

One of the following:

class TextContent: …

text: str

The text content of the message.

signature: Optional[str]

Stores a unique identifier for any reasoning associated with this text content.

type: Optional[Literal["text"]]

The type of the message.

class ImageContent: …

source: Source

The source of the image.

One of the following:

class SourceURLImage: …

url: str

The URL of the image.

type: Optional[Literal["url"]]

The source type for the image.

class SourceBase64Image: …

data: str

The base64 encoded image data.

media_type: str

The media type for the image.

detail: Optional[str]

What level of detail to use when processing and understanding the image (low, high, or auto to let the model decide)

type: Optional[Literal["base64"]]

The source type for the image.

class SourceLettaImage: …

file_id: str

The unique identifier of the image file persisted in storage.

data: Optional[str]

The base64 encoded image data.

detail: Optional[str]

What level of detail to use when processing and understanding the image (low, high, or auto to let the model decide)

media_type: Optional[str]

The media type for the image.

type: Optional[Literal["letta"]]

The source type for the image.

type: Optional[Literal["image"]]

The type of the message.

class ToolCallContent: …

id: str

A unique identifier for this specific tool call instance.

input: Dict[str, object]

The parameters being passed to the tool, structured as a dictionary of parameter names to values.

The name of the tool being called.

signature: Optional[str]

Stores a unique identifier for any reasoning associated with this tool call.

type: Optional[Literal["tool_call"]]

Indicates this content represents a tool call event.

class ToolReturnContent: …

content: str

The content returned by the tool execution.

is_error: bool

Indicates whether the tool execution resulted in an error.

tool_call_id: str

References the ID of the ToolCallContent that initiated this tool call.

type: Optional[Literal["tool_return"]]

Indicates this content represents a tool return event.

class ReasoningContent: …

Sent via the Anthropic Messages API

is_native: bool

Whether the reasoning content was generated by a reasoner model that processed this step.

reasoning: str

The intermediate reasoning or thought process content.

signature: Optional[str]

A unique identifier for this reasoning step.

type: Optional[Literal["reasoning"]]

Indicates this is a reasoning/intermediate step.

class RedactedReasoningContent: …

Sent via the Anthropic Messages API

data: str

The redacted or filtered intermediate reasoning content.

type: Optional[Literal["redacted_reasoning"]]

Indicates this is a redacted thinking step.

class OmittedReasoningContent: …

A placeholder for reasoning content we know is present, but isn’t returned by the provider (e.g. OpenAI GPT-5 on ChatCompletions)

signature: Optional[str]

A unique identifier for this reasoning step.

type: Optional[Literal["omitted_reasoning"]]

Indicates this is an omitted reasoning step.

class MaxCountPerStepToolRule: …

Represents a tool rule configuration which constrains the total number of times this tool can be invoked in a single step.

max_count_limit: int

The max limit for the total number of times this tool can be invoked in a single step.

tool_name: str

The name of the tool. Must exist in the database for the user’s organization.

prompt_template: Optional[str]

Optional template string (ignored).

type: Optional[Literal["max_count_per_step"]]

class MessageCreate: …

Request to create a message

content: Union[List[LettaMessageContentUnion], str]

The content of the message.

One of the following:

List[LettaMessageContentUnion]

One of the following:

class TextContent: …

text: str

The text content of the message.

signature: Optional[str]

Stores a unique identifier for any reasoning associated with this text content.

type: Optional[Literal["text"]]

The type of the message.

class ImageContent: …

source: Source

The source of the image.

One of the following:

class SourceURLImage: …

url: str

The URL of the image.

type: Optional[Literal["url"]]

The source type for the image.

class SourceBase64Image: …

data: str

The base64 encoded image data.

media_type: str

The media type for the image.

detail: Optional[str]

What level of detail to use when processing and understanding the image (low, high, or auto to let the model decide)

type: Optional[Literal["base64"]]

The source type for the image.

class SourceLettaImage: …

file_id: str

The unique identifier of the image file persisted in storage.

data: Optional[str]

The base64 encoded image data.

detail: Optional[str]

What level of detail to use when processing and understanding the image (low, high, or auto to let the model decide)

media_type: Optional[str]

The media type for the image.

type: Optional[Literal["letta"]]

The source type for the image.

type: Optional[Literal["image"]]

The type of the message.

class ToolCallContent: …

id: str

A unique identifier for this specific tool call instance.

input: Dict[str, object]

The parameters being passed to the tool, structured as a dictionary of parameter names to values.

The name of the tool being called.

signature: Optional[str]

Stores a unique identifier for any reasoning associated with this tool call.

type: Optional[Literal["tool_call"]]

Indicates this content represents a tool call event.

class ToolReturnContent: …

content: str

The content returned by the tool execution.

is_error: bool

Indicates whether the tool execution resulted in an error.

tool_call_id: str

References the ID of the ToolCallContent that initiated this tool call.

type: Optional[Literal["tool_return"]]

Indicates this content represents a tool return event.

class ReasoningContent: …

Sent via the Anthropic Messages API

is_native: bool

Whether the reasoning content was generated by a reasoner model that processed this step.

reasoning: str

The intermediate reasoning or thought process content.

signature: Optional[str]

A unique identifier for this reasoning step.

type: Optional[Literal["reasoning"]]

Indicates this is a reasoning/intermediate step.

class RedactedReasoningContent: …

Sent via the Anthropic Messages API

data: str

The redacted or filtered intermediate reasoning content.

type: Optional[Literal["redacted_reasoning"]]

Indicates this is a redacted thinking step.

class OmittedReasoningContent: …

A placeholder for reasoning content we know is present, but isn’t returned by the provider (e.g. OpenAI GPT-5 on ChatCompletions)

signature: Optional[str]

A unique identifier for this reasoning step.

type: Optional[Literal["omitted_reasoning"]]

Indicates this is an omitted reasoning step.

str

role: Literal["user", "system", "assistant"]

The role of the participant.

One of the following:

"user"

"system"

"assistant"

batch_item_id: Optional[str]

The id of the LLMBatchItem that this message is associated with

group_id: Optional[str]

The multi-agent group that the message was sent in

The name of the participant.

otid: Optional[str]

sender_id: Optional[str]

The id of the sender of the message, can be an identity id or agent id

type: Optional[Literal["message"]]

The message type to be created.

class OpenAIModelSettings: …

max_output_tokens: Optional[int]

The maximum number of tokens the model can generate.

parallel_tool_calls: Optional[bool]

Whether to enable parallel tool calling.

provider_type: Optional[Literal["openai"]]

The type of the provider.

reasoning: Optional[Reasoning]

The reasoning configuration for the model.

reasoning_effort: Optional[Literal["none", "minimal", "low", 3 more]]

The reasoning effort to use when generating text reasoning models

One of the following:

"none"

"minimal"

"low"

"medium"

"high"

"xhigh"

response_format: Optional[ResponseFormat]

The response format for the model.

One of the following:

class TextResponseFormat: …

Response format for plain text responses.

type: Optional[Literal["text"]]

The type of the response format.

class JsonSchemaResponseFormat: …

Response format for JSON schema-based responses.

json_schema: Dict[str, object]

The JSON schema of the response.

type: Optional[Literal["json_schema"]]

The type of the response format.

class JsonObjectResponseFormat: …

Response format for JSON object responses.

type: Optional[Literal["json_object"]]

The type of the response format.

strict: Optional[bool]

Enable strict mode for tool calling. When true, tool outputs are guaranteed to match JSON schemas.

temperature: Optional[float]

The temperature of the model.

class ParentToolRule: …

A ToolRule that only allows a child tool to be called if the parent has been called.

children: List[str]

The children tools that can be invoked.

tool_name: str

The name of the tool. Must exist in the database for the user’s organization.

prompt_template: Optional[str]

Optional template string (ignored).

type: Optional[Literal["parent_last_tool"]]

class RequiredBeforeExitToolRule: …

Represents a tool rule configuration where this tool must be called before the agent loop can exit.

tool_name: str

The name of the tool. Must exist in the database for the user’s organization.

prompt_template: Optional[str]

Optional template string (ignored).

type: Optional[Literal["required_before_exit"]]

class RequiresApprovalToolRule: …

Represents a tool rule configuration which requires approval before the tool can be invoked.

tool_name: str

The name of the tool. Must exist in the database for the user’s organization.

prompt_template: Optional[str]

Optional template string (ignored). Rendering uses fast built-in formatting for performance.

type: Optional[Literal["requires_approval"]]

class TerminalToolRule: …

Represents a terminal tool rule configuration where if this tool gets called, it must end the agent loop.

tool_name: str

The name of the tool. Must exist in the database for the user’s organization.

prompt_template: Optional[str]

Optional template string (ignored).

type: Optional[Literal["exit_loop"]]

class TextResponseFormat: …

Response format for plain text responses.

type: Optional[Literal["text"]]

The type of the response format.

class TogetherModelSettings: …

Together AI model configuration (OpenAI-compatible).

max_output_tokens: Optional[int]

The maximum number of tokens the model can generate.

parallel_tool_calls: Optional[bool]

Whether to enable parallel tool calling.

provider_type: Optional[Literal["together"]]

The type of the provider.

response_format: Optional[ResponseFormat]

The response format for the model.

One of the following:

class TextResponseFormat: …

Response format for plain text responses.

type: Optional[Literal["text"]]

The type of the response format.

class JsonSchemaResponseFormat: …

Response format for JSON schema-based responses.

json_schema: Dict[str, object]

The JSON schema of the response.

type: Optional[Literal["json_schema"]]

The type of the response format.

class JsonObjectResponseFormat: …

Response format for JSON object responses.

type: Optional[Literal["json_object"]]

The type of the response format.

temperature: Optional[float]

The temperature of the model.

class XaiModelSettings: …

xAI model configuration (OpenAI-compatible).

max_output_tokens: Optional[int]

The maximum number of tokens the model can generate.

parallel_tool_calls: Optional[bool]

Whether to enable parallel tool calling.

provider_type: Optional[Literal["xai"]]

The type of the provider.

response_format: Optional[ResponseFormat]

The response format for the model.

One of the following:

class TextResponseFormat: …

Response format for plain text responses.

type: Optional[Literal["text"]]

The type of the response format.

class JsonSchemaResponseFormat: …

Response format for JSON schema-based responses.

json_schema: Dict[str, object]

The JSON schema of the response.

type: Optional[Literal["json_schema"]]

The type of the response format.

class JsonObjectResponseFormat: …

Response format for JSON object responses.

type: Optional[Literal["json_object"]]

The type of the response format.

temperature: Optional[float]

The temperature of the model.

str

class AgentImportFileResponse: …

Response model for imported agents

agent_ids: List[str]

List of IDs of the imported agents

str

AgentsMessages

List Messages

agents.messages.list(, ) -> SyncArrayPage[Message]

GET/v1/agents/{agent_id}/messages

Create Message

agents.messages.create(, ) -> LettaResponse

POST/v1/agents/{agent_id}/messages

Create Message Streaming

Deprecated

agents.messages.stream(, ) -> LettaStreamingResponse

POST/v1/agents/{agent_id}/messages/stream

Cancel Message

agents.messages.cancel(, ) -> MessageCancelResponse

POST/v1/agents/{agent_id}/messages/cancel

Create Message Async

agents.messages.create_async(, ) -> Run

POST/v1/agents/{agent_id}/messages/async

Reset Messages

agents.messages.reset(, ) -> AgentState

PATCH/v1/agents/{agent_id}/reset-messages

Summarize Messages

agents.messages.compact(, ) -> CompactionResponse

POST/v1/agents/{agent_id}/summarize

ModelsExpand Collapse

class ApprovalCreate: …

Input to approve or deny a tool call request

Deprecatedapproval_request_id: Optional[str]

The message ID of the approval request

approvals: Optional[List[Approval]]

The list of approval responses

One of the following:

class ApprovalReturn: …

approve: bool

Whether the tool has been approved

tool_call_id: str

The ID of the tool call that corresponds to this approval

reason: Optional[str]

An optional explanation for the provided approval status

type: Optional[Literal["approval"]]

The message type to be created.

class ToolReturn: …

status: Literal["success", "error"]

One of the following:

"success"

"error"

tool_call_id: str

tool_return: Union[List[ToolReturnUnionMember0], str]

The tool return value - either a string or list of content parts (text/image)

One of the following:

List[ToolReturnUnionMember0]

One of the following:

class TextContent: …

text: str

The text content of the message.

signature: Optional[str]

Stores a unique identifier for any reasoning associated with this text content.

type: Optional[Literal["text"]]

The type of the message.

class ImageContent: …

source: Source

The source of the image.

One of the following:

class SourceURLImage: …

url: str

The URL of the image.

type: Optional[Literal["url"]]

The source type for the image.

class SourceBase64Image: …

data: str

The base64 encoded image data.

media_type: str

The media type for the image.

detail: Optional[str]

What level of detail to use when processing and understanding the image (low, high, or auto to let the model decide)

type: Optional[Literal["base64"]]

The source type for the image.

class SourceLettaImage: …

file_id: str

The unique identifier of the image file persisted in storage.

data: Optional[str]

The base64 encoded image data.

detail: Optional[str]

What level of detail to use when processing and understanding the image (low, high, or auto to let the model decide)

media_type: Optional[str]

The media type for the image.

type: Optional[Literal["letta"]]

The source type for the image.

type: Optional[Literal["image"]]

The type of the message.

str

stderr: Optional[List[str]]

stdout: Optional[List[str]]

type: Optional[Literal["tool"]]

The message type to be created.

Deprecatedapprove: Optional[bool]

Whether the tool has been approved

group_id: Optional[str]

The multi-agent group that the message was sent in

otid: Optional[str]

Deprecatedreason: Optional[str]

An optional explanation for the provided approval status

type: Optional[Literal["approval"]]

The message type to be created.

class ApprovalRequestMessage: …

A message representing a request for approval to call a tool (generated by the LLM to trigger tool execution).

Args: id (str): The ID of the message date (datetime): The date the message was created in ISO format name (Optional[str]): The name of the sender of the message tool_call (ToolCall): The tool call

id: str

date: datetime

Deprecatedtool_call: ToolCall

The tool call that has been requested by the llm to run

One of the following:

class ToolCall: …

arguments: str

tool_call_id: str

class ToolCallDelta: …

arguments: Optional[str]

tool_call_id: Optional[str]

is_err: Optional[bool]

message_type: Optional[Literal["approval_request_message"]]

The type of the message.

otid: Optional[str]

run_id: Optional[str]

sender_id: Optional[str]

seq_id: Optional[int]

step_id: Optional[str]

tool_calls: Optional[ToolCalls]

The tool calls that have been requested by the llm to run, which are pending approval

One of the following:

List[ToolCall]

arguments: str

tool_call_id: str

class ToolCallDelta: …

arguments: Optional[str]

tool_call_id: Optional[str]

class ApprovalResponseMessage: …

A message representing a response form the user indicating whether a tool has been approved to run.

Args: id (str): The ID of the message date (datetime): The date the message was created in ISO format name (Optional[str]): The name of the sender of the message approve: (bool) Whether the tool has been approved approval_request_id: The ID of the approval request reason: (Optional[str]) An optional explanation for the provided approval status

id: str

date: datetime

Deprecatedapproval_request_id: Optional[str]

The message ID of the approval request

approvals: Optional[List[Approval]]

The list of approval responses

One of the following:

class ApprovalReturn: …

approve: bool

Whether the tool has been approved

tool_call_id: str

The ID of the tool call that corresponds to this approval

reason: Optional[str]

An optional explanation for the provided approval status

type: Optional[Literal["approval"]]

The message type to be created.

class ToolReturn: …

status: Literal["success", "error"]

One of the following:

"success"

"error"

tool_call_id: str

tool_return: Union[List[ToolReturnUnionMember0], str]

The tool return value - either a string or list of content parts (text/image)

One of the following:

List[ToolReturnUnionMember0]

One of the following:

class TextContent: …

text: str

The text content of the message.

signature: Optional[str]

Stores a unique identifier for any reasoning associated with this text content.

type: Optional[Literal["text"]]

The type of the message.

class ImageContent: …

source: Source

The source of the image.

One of the following:

class SourceURLImage: …

url: str

The URL of the image.

type: Optional[Literal["url"]]

The source type for the image.

class SourceBase64Image: …

data: str

The base64 encoded image data.

media_type: str

The media type for the image.

detail: Optional[str]

What level of detail to use when processing and understanding the image (low, high, or auto to let the model decide)

type: Optional[Literal["base64"]]

The source type for the image.

class SourceLettaImage: …

file_id: str

The unique identifier of the image file persisted in storage.

data: Optional[str]

The base64 encoded image data.

detail: Optional[str]

What level of detail to use when processing and understanding the image (low, high, or auto to let the model decide)

media_type: Optional[str]

The media type for the image.

type: Optional[Literal["letta"]]

The source type for the image.

type: Optional[Literal["image"]]

The type of the message.

str

stderr: Optional[List[str]]

stdout: Optional[List[str]]

type: Optional[Literal["tool"]]

The message type to be created.

Deprecatedapprove: Optional[bool]

Whether the tool has been approved

is_err: Optional[bool]

message_type: Optional[Literal["approval_response_message"]]

The type of the message.

otid: Optional[str]

Deprecatedreason: Optional[str]

An optional explanation for the provided approval status

run_id: Optional[str]

sender_id: Optional[str]

seq_id: Optional[int]

step_id: Optional[str]

class ApprovalReturn: …

approve: bool

Whether the tool has been approved

tool_call_id: str

The ID of the tool call that corresponds to this approval

reason: Optional[str]

An optional explanation for the provided approval status

type: Optional[Literal["approval"]]

The message type to be created.

class AssistantMessage: …

A message sent by the LLM in response to user input. Used in the LLM context.

Args: id (str): The ID of the message date (datetime): The date the message was created in ISO format name (Optional[str]): The name of the sender of the message content (Union[str, List[LettaAssistantMessageContentUnion]]): The message content sent by the agent (can be a string or an array of content parts)

id: str

content: Union[List[LettaAssistantMessageContentUnion], str]

The message content sent by the agent (can be a string or an array of content parts)

One of the following:

List[LettaAssistantMessageContentUnion]

text: str

The text content of the message.

signature: Optional[str]

Stores a unique identifier for any reasoning associated with this text content.

type: Optional[Literal["text"]]

The type of the message.

str

date: datetime

is_err: Optional[bool]

message_type: Optional[Literal["assistant_message"]]

The type of the message.

otid: Optional[str]

run_id: Optional[str]

sender_id: Optional[str]

seq_id: Optional[int]

step_id: Optional[str]

class EventMessage: …

A message for notifying the developer that an event that has occured (e.g. a compaction). Events are NOT part of the context window.

id: str

date: datetime

event_data: Dict[str, object]

event_type: Literal["compaction"]

is_err: Optional[bool]

message_type: Optional[Literal["event_message"]]

otid: Optional[str]

run_id: Optional[str]

sender_id: Optional[str]

seq_id: Optional[int]

step_id: Optional[str]

class HiddenReasoningMessage: …

Representation of an agent’s internal reasoning where reasoning content has been hidden from the response.

Args: id (str): The ID of the message date (datetime): The date the message was created in ISO format name (Optional[str]): The name of the sender of the message state (Literal[“redacted”, “omitted”]): Whether the reasoning content was redacted by the provider or simply omitted by the API hidden_reasoning (Optional[str]): The internal reasoning of the agent

id: str

date: datetime

state: Literal["redacted", "omitted"]

One of the following:

"redacted"

"omitted"

hidden_reasoning: Optional[str]

is_err: Optional[bool]

message_type: Optional[Literal["hidden_reasoning_message"]]

The type of the message.

otid: Optional[str]

run_id: Optional[str]

sender_id: Optional[str]

seq_id: Optional[int]

step_id: Optional[str]

class ImageContent: …

source: Source

The source of the image.

One of the following:

class SourceURLImage: …

url: str

The URL of the image.

type: Optional[Literal["url"]]

The source type for the image.

class SourceBase64Image: …

data: str

The base64 encoded image data.

media_type: str

The media type for the image.

detail: Optional[str]

What level of detail to use when processing and understanding the image (low, high, or auto to let the model decide)

type: Optional[Literal["base64"]]

The source type for the image.

class SourceLettaImage: …

file_id: str

The unique identifier of the image file persisted in storage.

data: Optional[str]

The base64 encoded image data.

detail: Optional[str]

What level of detail to use when processing and understanding the image (low, high, or auto to let the model decide)

media_type: Optional[str]

The media type for the image.

type: Optional[Literal["letta"]]

The source type for the image.

type: Optional[Literal["image"]]

The type of the message.

class InternalMessage: …

Letta's internal representation of a message. Includes methods to convert to/from LLM provider formats.

Attributes:
    id (str): The unique identifier of the message.
    role (MessageRole): The role of the participant.
    text (str): The text of the message.
    user_id (str): The unique identifier of the user.
    agent_id (str): The unique identifier of the agent.
    model (str): The model used to make the function call.
    name (str): The name of the participant.
    created_at (datetime): The time the message was created.
    tool_calls (List[OpenAIToolCall,]): The list of tool calls requested.
    tool_call_id (str): The id of the tool call.
    step_id (str): The id of the step that this message was created in.
    otid (str): The offline threading id associated with this message.
    tool_returns (List[ToolReturn]): The list of tool returns requested.
    group_id (str): The multi-agent group that the message was sent in.
    sender_id (str): The id of the sender of the message, can be an identity id or agent id.
    conversation_id (str): The conversation this message belongs to.

id: str

The human-friendly ID of the Message

role: MessageRole

The role of the participant.

One of the following:

"assistant"

"user"

"tool"

"function"

"system"

"approval"

"summary"

agent_id: Optional[str]

The unique identifier of the agent.

approval_request_id: Optional[str]

The id of the approval request if this message is associated with a tool call request.

approvals: Optional[List[Approval]]

The list of approvals for this message.

One of the following:

class ApprovalReturn: …

approve: bool

Whether the tool has been approved

tool_call_id: str

The ID of the tool call that corresponds to this approval

reason: Optional[str]

An optional explanation for the provided approval status

type: Optional[Literal["approval"]]

The message type to be created.

class ApprovalLettaSchemasMessageToolReturnOutput: …

status: Literal["success", "error"]

The status of the tool call

One of the following:

"success"

"error"

func_response: Optional[Union[str, List[ApprovalLettaSchemasMessageToolReturnOutputFuncResponseUnionMember1], null]]

The function response - either a string or list of content parts (text/image)

One of the following:

str

List[ApprovalLettaSchemasMessageToolReturnOutputFuncResponseUnionMember1]

One of the following:

class TextContent: …

text: str

The text content of the message.

signature: Optional[str]

Stores a unique identifier for any reasoning associated with this text content.

type: Optional[Literal["text"]]

The type of the message.

class ImageContent: …

source: Source

The source of the image.

One of the following:

class SourceURLImage: …

url: str

The URL of the image.

type: Optional[Literal["url"]]

The source type for the image.

class SourceBase64Image: …

data: str

The base64 encoded image data.

media_type: str

The media type for the image.

detail: Optional[str]

What level of detail to use when processing and understanding the image (low, high, or auto to let the model decide)

type: Optional[Literal["base64"]]

The source type for the image.

class SourceLettaImage: …

file_id: str

The unique identifier of the image file persisted in storage.

data: Optional[str]

The base64 encoded image data.

detail: Optional[str]

What level of detail to use when processing and understanding the image (low, high, or auto to let the model decide)

media_type: Optional[str]

The media type for the image.

type: Optional[Literal["letta"]]

The source type for the image.

type: Optional[Literal["image"]]

The type of the message.

stderr: Optional[List[str]]

Captured stderr from the tool invocation

stdout: Optional[List[str]]

Captured stdout (e.g. prints, logs) from the tool invocation

tool_call_id: Optional[object]

The ID for the tool call

approve: Optional[bool]

Whether tool call is approved.

batch_item_id: Optional[str]

The id of the LLMBatchItem that this message is associated with

content: Optional[List[Content]]

The content of the message.

One of the following:

class TextContent: …

text: str

The text content of the message.

signature: Optional[str]

Stores a unique identifier for any reasoning associated with this text content.

type: Optional[Literal["text"]]

The type of the message.

class ImageContent: …

source: Source

The source of the image.

One of the following:

class SourceURLImage: …

url: str

The URL of the image.

type: Optional[Literal["url"]]

The source type for the image.

class SourceBase64Image: …

data: str

The base64 encoded image data.

media_type: str

The media type for the image.

detail: Optional[str]

What level of detail to use when processing and understanding the image (low, high, or auto to let the model decide)

type: Optional[Literal["base64"]]

The source type for the image.

class SourceLettaImage: …

file_id: str

The unique identifier of the image file persisted in storage.

data: Optional[str]

The base64 encoded image data.

detail: Optional[str]

What level of detail to use when processing and understanding the image (low, high, or auto to let the model decide)

media_type: Optional[str]

The media type for the image.

type: Optional[Literal["letta"]]

The source type for the image.

type: Optional[Literal["image"]]

The type of the message.

class ToolCallContent: …

id: str

A unique identifier for this specific tool call instance.

input: Dict[str, object]

The parameters being passed to the tool, structured as a dictionary of parameter names to values.

The name of the tool being called.

signature: Optional[str]

Stores a unique identifier for any reasoning associated with this tool call.

type: Optional[Literal["tool_call"]]

Indicates this content represents a tool call event.

class ToolReturnContent: …

content: str

The content returned by the tool execution.

is_error: bool

Indicates whether the tool execution resulted in an error.

tool_call_id: str

References the ID of the ToolCallContent that initiated this tool call.

type: Optional[Literal["tool_return"]]

Indicates this content represents a tool return event.

class ReasoningContent: …

Sent via the Anthropic Messages API

is_native: bool

Whether the reasoning content was generated by a reasoner model that processed this step.

reasoning: str

The intermediate reasoning or thought process content.

signature: Optional[str]

A unique identifier for this reasoning step.

type: Optional[Literal["reasoning"]]

Indicates this is a reasoning/intermediate step.

class RedactedReasoningContent: …

Sent via the Anthropic Messages API

data: str

The redacted or filtered intermediate reasoning content.

type: Optional[Literal["redacted_reasoning"]]

Indicates this is a redacted thinking step.

class OmittedReasoningContent: …

A placeholder for reasoning content we know is present, but isn’t returned by the provider (e.g. OpenAI GPT-5 on ChatCompletions)

signature: Optional[str]

A unique identifier for this reasoning step.

type: Optional[Literal["omitted_reasoning"]]

Indicates this is an omitted reasoning step.

class ContentSummarizedReasoningContent: …

The style of reasoning content returned by the OpenAI Responses API

id: str

The unique identifier for this reasoning step.

summary: List[ContentSummarizedReasoningContentSummary]

Summaries of the reasoning content.

index: int

The index of the summary part.

text: str

The text of the summary part.

encrypted_content: Optional[str]

The encrypted reasoning content.

type: Optional[Literal["summarized_reasoning"]]

Indicates this is a summarized reasoning step.

conversation_id: Optional[str]

The conversation this message belongs to

created_at: Optional[datetime]

The timestamp when the object was created.

formatdate-time

created_by_id: Optional[str]

The id of the user that made this object.

denial_reason: Optional[str]

The reason the tool call request was denied.

group_id: Optional[str]

The multi-agent group that the message was sent in

is_err: Optional[bool]

Whether this message is part of an error step. Used only for debugging purposes.

last_updated_by_id: Optional[str]

The id of the user that made this object.

model: Optional[str]

The model used to make the function call.

For role user/assistant: the (optional) name of the participant. For role tool/function: the name of the function called.

otid: Optional[str]

The offline threading id associated with this message

run_id: Optional[str]

The id of the run that this message was created in.

sender_id: Optional[str]

The id of the sender of the message, can be an identity id or agent id

step_id: Optional[str]

The id of the step that this message was created in.

tool_call_id: Optional[str]

The ID of the tool call. Only applicable for role tool.

tool_calls: Optional[List[ToolCall]]

The list of tool calls requested. Only applicable for role assistant.

id: str

function: ToolCallFunction

The function that the model called.

arguments: str

type: Literal["function"]

tool_returns: Optional[List[ToolReturn]]

Tool execution return information for prior tool calls

status: Literal["success", "error"]

The status of the tool call

One of the following:

"success"

"error"

func_response: Optional[Union[str, List[ToolReturnFuncResponseUnionMember1], null]]

The function response - either a string or list of content parts (text/image)

One of the following:

str

List[ToolReturnFuncResponseUnionMember1]

One of the following:

class TextContent: …

text: str

The text content of the message.

signature: Optional[str]

Stores a unique identifier for any reasoning associated with this text content.

type: Optional[Literal["text"]]

The type of the message.

class ImageContent: …

source: Source

The source of the image.

One of the following:

class SourceURLImage: …

url: str

The URL of the image.

type: Optional[Literal["url"]]

The source type for the image.

class SourceBase64Image: …

data: str

The base64 encoded image data.

media_type: str

The media type for the image.

detail: Optional[str]

What level of detail to use when processing and understanding the image (low, high, or auto to let the model decide)

type: Optional[Literal["base64"]]

The source type for the image.

class SourceLettaImage: …

file_id: str

The unique identifier of the image file persisted in storage.

data: Optional[str]

The base64 encoded image data.

detail: Optional[str]

What level of detail to use when processing and understanding the image (low, high, or auto to let the model decide)

media_type: Optional[str]

The media type for the image.

type: Optional[Literal["letta"]]

The source type for the image.

type: Optional[Literal["image"]]

The type of the message.

stderr: Optional[List[str]]

Captured stderr from the tool invocation

stdout: Optional[List[str]]

Captured stdout (e.g. prints, logs) from the tool invocation

tool_call_id: Optional[object]

The ID for the tool call

updated_at: Optional[datetime]

The timestamp when the object was last updated.

formatdate-time

Literal["created", "running", "completed", 4 more]

Status of the job.

One of the following:

"created"

"running"

"completed"

"failed"

"pending"

"cancelled"

"expired"

Literal["job", "run", "batch"]

One of the following:

"job"

"run"

"batch"

class LettaAssistantMessageContentUnion: …

text: str

The text content of the message.

signature: Optional[str]

Stores a unique identifier for any reasoning associated with this text content.

type: Optional[Literal["text"]]

The type of the message.

class LettaRequest: …

Deprecatedassistant_message_tool_kwarg: Optional[str]

The name of the message argument in the designated message tool. Still supported for legacy agent types, but deprecated for letta_v1_agent onward.

Deprecatedassistant_message_tool_name: Optional[str]

The name of the designated message tool. Still supported for legacy agent types, but deprecated for letta_v1_agent onward.

client_skills: Optional[List[ClientSkill]]

Client-side skills available in the environment. These are rendered in the system prompt’s available skills section alongside agent-scoped skills from MemFS.

description: str

Description of what the skill does

location: str

Path or location hint for the skill (e.g. skills/my-skill/SKILL.md)

The name of the skill

client_tools: Optional[List[ClientTool]]

Client-side tools that the agent can call. When the agent calls a client-side tool, execution pauses and returns control to the client to execute the tool and provide the result via a ToolReturn.

The name of the tool function

description: Optional[str]

Description of what the tool does

parameters: Optional[Dict[str, object]]

JSON Schema for the function parameters

Deprecatedenable_thinking: Optional[str]

If set to True, enables reasoning before responses or tool calls from the agent.

include_compaction_messages: Optional[bool]

If True, compaction events emit structured SummaryMessage and EventMessage types. If False (default), compaction messages are not included in the response.

include_return_message_types: Optional[List[MessageType]]

Only return specified message types in the response. If None (default) returns all messages.

One of the following:

"system_message"

"user_message"

"assistant_message"

"reasoning_message"

"hidden_reasoning_message"

"tool_call_message"

"tool_return_message"

"approval_request_message"

"approval_response_message"

"summary_message"

"event_message"

input: Optional[Union[str, List[InputUnionMember1], null]]

Syntactic sugar for a single user message. Equivalent to messages=[{‘role’: ‘user’, ‘content’: input}].

One of the following:

str

List[InputUnionMember1]

One of the following:

class TextContent: …

text: str

The text content of the message.

signature: Optional[str]

Stores a unique identifier for any reasoning associated with this text content.

type: Optional[Literal["text"]]

The type of the message.

class ImageContent: …

source: Source

The source of the image.

One of the following:

class SourceURLImage: …

url: str

The URL of the image.

type: Optional[Literal["url"]]

The source type for the image.

class SourceBase64Image: …

data: str

The base64 encoded image data.

media_type: str

The media type for the image.

detail: Optional[str]

What level of detail to use when processing and understanding the image (low, high, or auto to let the model decide)

type: Optional[Literal["base64"]]

The source type for the image.

class SourceLettaImage: …

file_id: str

The unique identifier of the image file persisted in storage.

data: Optional[str]

The base64 encoded image data.

detail: Optional[str]

What level of detail to use when processing and understanding the image (low, high, or auto to let the model decide)

media_type: Optional[str]

The media type for the image.

type: Optional[Literal["letta"]]

The source type for the image.

type: Optional[Literal["image"]]

The type of the message.

class ToolCallContent: …

id: str

A unique identifier for this specific tool call instance.

input: Dict[str, object]

The parameters being passed to the tool, structured as a dictionary of parameter names to values.

The name of the tool being called.

signature: Optional[str]

Stores a unique identifier for any reasoning associated with this tool call.

type: Optional[Literal["tool_call"]]

Indicates this content represents a tool call event.

class ToolReturnContent: …

content: str

The content returned by the tool execution.

is_error: bool

Indicates whether the tool execution resulted in an error.

tool_call_id: str

References the ID of the ToolCallContent that initiated this tool call.

type: Optional[Literal["tool_return"]]

Indicates this content represents a tool return event.

class ReasoningContent: …

Sent via the Anthropic Messages API

is_native: bool

Whether the reasoning content was generated by a reasoner model that processed this step.

reasoning: str

The intermediate reasoning or thought process content.

signature: Optional[str]

A unique identifier for this reasoning step.

type: Optional[Literal["reasoning"]]

Indicates this is a reasoning/intermediate step.

class RedactedReasoningContent: …

Sent via the Anthropic Messages API

data: str

The redacted or filtered intermediate reasoning content.

type: Optional[Literal["redacted_reasoning"]]

Indicates this is a redacted thinking step.

class OmittedReasoningContent: …

A placeholder for reasoning content we know is present, but isn’t returned by the provider (e.g. OpenAI GPT-5 on ChatCompletions)

signature: Optional[str]

A unique identifier for this reasoning step.

type: Optional[Literal["omitted_reasoning"]]

Indicates this is an omitted reasoning step.

class InputUnionMember1SummarizedReasoningContent: …

The style of reasoning content returned by the OpenAI Responses API

id: str

The unique identifier for this reasoning step.

summary: List[InputUnionMember1SummarizedReasoningContentSummary]

Summaries of the reasoning content.

index: int

The index of the summary part.

text: str

The text of the summary part.

encrypted_content: Optional[str]

The encrypted reasoning content.

type: Optional[Literal["summarized_reasoning"]]

Indicates this is a summarized reasoning step.

max_steps: Optional[int]

Maximum number of steps the agent should take to process the request.

messages: Optional[List[Message]]

The messages to be sent to the agent.

One of the following:

class MessageCreate: …

Request to create a message

content: Union[List[LettaMessageContentUnion], str]

The content of the message.

One of the following:

List[LettaMessageContentUnion]

One of the following:

class TextContent: …

text: str

The text content of the message.

signature: Optional[str]

Stores a unique identifier for any reasoning associated with this text content.

type: Optional[Literal["text"]]

The type of the message.

class ImageContent: …

source: Source

The source of the image.

One of the following:

class SourceURLImage: …

url: str

The URL of the image.

type: Optional[Literal["url"]]

The source type for the image.

class SourceBase64Image: …

data: str

The base64 encoded image data.

media_type: str

The media type for the image.

detail: Optional[str]

What level of detail to use when processing and understanding the image (low, high, or auto to let the model decide)

type: Optional[Literal["base64"]]

The source type for the image.

class SourceLettaImage: …

file_id: str

The unique identifier of the image file persisted in storage.

data: Optional[str]

The base64 encoded image data.

detail: Optional[str]

What level of detail to use when processing and understanding the image (low, high, or auto to let the model decide)

media_type: Optional[str]

The media type for the image.

type: Optional[Literal["letta"]]

The source type for the image.

type: Optional[Literal["image"]]

The type of the message.

class ToolCallContent: …

id: str

A unique identifier for this specific tool call instance.

input: Dict[str, object]

The parameters being passed to the tool, structured as a dictionary of parameter names to values.

The name of the tool being called.

signature: Optional[str]

Stores a unique identifier for any reasoning associated with this tool call.

type: Optional[Literal["tool_call"]]

Indicates this content represents a tool call event.

class ToolReturnContent: …

content: str

The content returned by the tool execution.

is_error: bool

Indicates whether the tool execution resulted in an error.

tool_call_id: str

References the ID of the ToolCallContent that initiated this tool call.

type: Optional[Literal["tool_return"]]

Indicates this content represents a tool return event.

class ReasoningContent: …

Sent via the Anthropic Messages API

is_native: bool

Whether the reasoning content was generated by a reasoner model that processed this step.

reasoning: str

The intermediate reasoning or thought process content.

signature: Optional[str]

A unique identifier for this reasoning step.

type: Optional[Literal["reasoning"]]

Indicates this is a reasoning/intermediate step.

class RedactedReasoningContent: …

Sent via the Anthropic Messages API

data: str

The redacted or filtered intermediate reasoning content.

type: Optional[Literal["redacted_reasoning"]]

Indicates this is a redacted thinking step.

class OmittedReasoningContent: …

A placeholder for reasoning content we know is present, but isn’t returned by the provider (e.g. OpenAI GPT-5 on ChatCompletions)

signature: Optional[str]

A unique identifier for this reasoning step.

type: Optional[Literal["omitted_reasoning"]]

Indicates this is an omitted reasoning step.

str

role: Literal["user", "system", "assistant"]

The role of the participant.

One of the following:

"user"

"system"

"assistant"

batch_item_id: Optional[str]

The id of the LLMBatchItem that this message is associated with

group_id: Optional[str]

The multi-agent group that the message was sent in

The name of the participant.

otid: Optional[str]

sender_id: Optional[str]

The id of the sender of the message, can be an identity id or agent id

type: Optional[Literal["message"]]

The message type to be created.

class ApprovalCreate: …

Input to approve or deny a tool call request

Deprecatedapproval_request_id: Optional[str]

The message ID of the approval request

approvals: Optional[List[Approval]]

The list of approval responses

One of the following:

class ApprovalReturn: …

approve: bool

Whether the tool has been approved

tool_call_id: str

The ID of the tool call that corresponds to this approval

reason: Optional[str]

An optional explanation for the provided approval status

type: Optional[Literal["approval"]]

The message type to be created.

class ToolReturn: …

status: Literal["success", "error"]

One of the following:

"success"

"error"

tool_call_id: str

tool_return: Union[List[ToolReturnUnionMember0], str]

The tool return value - either a string or list of content parts (text/image)

One of the following:

List[ToolReturnUnionMember0]

One of the following:

class TextContent: …

text: str

The text content of the message.

signature: Optional[str]

Stores a unique identifier for any reasoning associated with this text content.

type: Optional[Literal["text"]]

The type of the message.

class ImageContent: …

source: Source

The source of the image.

One of the following:

class SourceURLImage: …

url: str

The URL of the image.

type: Optional[Literal["url"]]

The source type for the image.

class SourceBase64Image: …

data: str

The base64 encoded image data.

media_type: str

The media type for the image.

detail: Optional[str]

What level of detail to use when processing and understanding the image (low, high, or auto to let the model decide)

type: Optional[Literal["base64"]]

The source type for the image.

class SourceLettaImage: …

file_id: str

The unique identifier of the image file persisted in storage.

data: Optional[str]

The base64 encoded image data.

detail: Optional[str]

What level of detail to use when processing and understanding the image (low, high, or auto to let the model decide)

media_type: Optional[str]

The media type for the image.

type: Optional[Literal["letta"]]

The source type for the image.

type: Optional[Literal["image"]]

The type of the message.

str

stderr: Optional[List[str]]

stdout: Optional[List[str]]

type: Optional[Literal["tool"]]

The message type to be created.

Deprecatedapprove: Optional[bool]

Whether the tool has been approved

group_id: Optional[str]

The multi-agent group that the message was sent in

otid: Optional[str]

Deprecatedreason: Optional[str]

An optional explanation for the provided approval status

type: Optional[Literal["approval"]]

The message type to be created.

class MessageToolReturnCreate: …

Submit tool return(s) from client-side tool execution.

This is the preferred way to send tool results back to the agent after client-side tool execution. It is equivalent to sending an ApprovalCreate with tool return approvals, but provides a cleaner API for the common case.

tool_returns: List[ToolReturn]

List of tool returns from client-side execution

status: Literal["success", "error"]

One of the following:

"success"

"error"

tool_call_id: str

tool_return: Union[List[ToolReturnUnionMember0], str]

The tool return value - either a string or list of content parts (text/image)

One of the following:

List[ToolReturnUnionMember0]

One of the following:

class TextContent: …

text: str

The text content of the message.

signature: Optional[str]

Stores a unique identifier for any reasoning associated with this text content.

type: Optional[Literal["text"]]

The type of the message.

class ImageContent: …

source: Source

The source of the image.

One of the following:

class SourceURLImage: …

url: str

The URL of the image.

type: Optional[Literal["url"]]

The source type for the image.

class SourceBase64Image: …

data: str

The base64 encoded image data.

media_type: str

The media type for the image.

detail: Optional[str]

What level of detail to use when processing and understanding the image (low, high, or auto to let the model decide)

type: Optional[Literal["base64"]]

The source type for the image.

class SourceLettaImage: …

file_id: str

The unique identifier of the image file persisted in storage.

data: Optional[str]

The base64 encoded image data.

detail: Optional[str]

What level of detail to use when processing and understanding the image (low, high, or auto to let the model decide)

media_type: Optional[str]

The media type for the image.

type: Optional[Literal["letta"]]

The source type for the image.

type: Optional[Literal["image"]]

The type of the message.

str

stderr: Optional[List[str]]

stdout: Optional[List[str]]

type: Optional[Literal["tool"]]

The message type to be created.

group_id: Optional[str]

The multi-agent group that the message was sent in

otid: Optional[str]

type: Optional[Literal["tool_return"]]

The message type to be created.

override_model: Optional[str]

Model handle to use for this request instead of the agent’s default model. This allows sending a message to a different model without changing the agent’s configuration.

override_system: Optional[str]

Optional per-request system prompt override. When set, this is passed directly to the underlying LLM request and bypasses the persisted/compiled system message for that request.

return_logprobs: Optional[bool]

If True, returns log probabilities of the output tokens in the response. Useful for RL training. Only supported for OpenAI-compatible providers (including SGLang).

return_token_ids: Optional[bool]

If True, returns token IDs and logprobs for ALL LLM generations in the agent step, not just the last one. Uses SGLang native /generate endpoint. Returns ‘turns’ field with TurnTokenData for each assistant/tool turn. Required for proper multi-turn RL training with loss masking.

top_logprobs: Optional[int]

Number of most likely tokens to return at each position (0-20). Requires return_logprobs=True.

Deprecateduse_assistant_message: Optional[bool]

Whether the server should parse specific tool call arguments (default send_message) as AssistantMessage objects. Still supported for legacy agent types, but deprecated for letta_v1_agent onward.

class LettaResponse: …

Response object from an agent interaction, consisting of the new messages generated by the agent and usage statistics. The type of the returned messages can be either Message or LettaMessage, depending on what was specified in the request.

Attributes: messages (List[Union[Message, LettaMessage]]): The messages returned by the agent. usage (LettaUsageStatistics): The usage statistics

messages: List[Message]

The messages returned by the agent.

One of the following:

class SystemMessage: …

A message generated by the system. Never streamed back on a response, only used for cursor pagination.

Args: id (str): The ID of the message date (datetime): The date the message was created in ISO format name (Optional[str]): The name of the sender of the message content (str): The message content sent by the system

id: str

content: str

The message content sent by the system

date: datetime

is_err: Optional[bool]

message_type: Optional[Literal["system_message"]]

The type of the message.

otid: Optional[str]

run_id: Optional[str]

sender_id: Optional[str]

seq_id: Optional[int]

step_id: Optional[str]

class UserMessage: …

A message sent by the user. Never streamed back on a response, only used for cursor pagination.

Args: id (str): The ID of the message date (datetime): The date the message was created in ISO format name (Optional[str]): The name of the sender of the message content (Union[str, List[LettaUserMessageContentUnion]]): The message content sent by the user (can be a string or an array of multi-modal content parts)

id: str

content: Union[List[LettaUserMessageContentUnion], str]

The message content sent by the user (can be a string or an array of multi-modal content parts)

One of the following:

List[LettaUserMessageContentUnion]

One of the following:

class TextContent: …

text: str

The text content of the message.

signature: Optional[str]

Stores a unique identifier for any reasoning associated with this text content.

type: Optional[Literal["text"]]

The type of the message.

class ImageContent: …

source: Source

The source of the image.

One of the following:

class SourceURLImage: …

url: str

The URL of the image.

type: Optional[Literal["url"]]

The source type for the image.

class SourceBase64Image: …

data: str

The base64 encoded image data.

media_type: str

The media type for the image.

detail: Optional[str]

What level of detail to use when processing and understanding the image (low, high, or auto to let the model decide)

type: Optional[Literal["base64"]]

The source type for the image.

class SourceLettaImage: …

file_id: str

The unique identifier of the image file persisted in storage.

data: Optional[str]

The base64 encoded image data.

detail: Optional[str]

What level of detail to use when processing and understanding the image (low, high, or auto to let the model decide)

media_type: Optional[str]

The media type for the image.

type: Optional[Literal["letta"]]

The source type for the image.

type: Optional[Literal["image"]]

The type of the message.

str

date: datetime

is_err: Optional[bool]

message_type: Optional[Literal["user_message"]]

The type of the message.

otid: Optional[str]

run_id: Optional[str]

sender_id: Optional[str]

seq_id: Optional[int]

step_id: Optional[str]

class ReasoningMessage: …

Representation of an agent’s internal reasoning.

Args: id (str): The ID of the message date (datetime): The date the message was created in ISO format name (Optional[str]): The name of the sender of the message source (Literal[“reasoner_model”, “non_reasoner_model”]): Whether the reasoning content was generated natively by a reasoner model or derived via prompting reasoning (str): The internal reasoning of the agent signature (Optional[str]): The model-generated signature of the reasoning step

id: str

date: datetime

reasoning: str

is_err: Optional[bool]

message_type: Optional[Literal["reasoning_message"]]

The type of the message.

otid: Optional[str]

run_id: Optional[str]

sender_id: Optional[str]

seq_id: Optional[int]

signature: Optional[str]

source: Optional[Literal["reasoner_model", "non_reasoner_model"]]

One of the following:

"reasoner_model"

"non_reasoner_model"

step_id: Optional[str]

class HiddenReasoningMessage: …

Representation of an agent’s internal reasoning where reasoning content has been hidden from the response.

id: str

date: datetime

state: Literal["redacted", "omitted"]

One of the following:

"redacted"

"omitted"

hidden_reasoning: Optional[str]

is_err: Optional[bool]

message_type: Optional[Literal["hidden_reasoning_message"]]

The type of the message.

otid: Optional[str]

run_id: Optional[str]

sender_id: Optional[str]

seq_id: Optional[int]

step_id: Optional[str]

class ToolCallMessage: …

A message representing a request to call a tool (generated by the LLM to trigger tool execution).

Args: id (str): The ID of the message date (datetime): The date the message was created in ISO format name (Optional[str]): The name of the sender of the message tool_call (Union[ToolCall, ToolCallDelta]): The tool call

id: str

date: datetime

Deprecatedtool_call: ToolCall

One of the following:

class ToolCall: …

arguments: str

tool_call_id: str

class ToolCallDelta: …

arguments: Optional[str]

tool_call_id: Optional[str]

is_err: Optional[bool]

message_type: Optional[Literal["tool_call_message"]]

The type of the message.

otid: Optional[str]

run_id: Optional[str]

sender_id: Optional[str]

seq_id: Optional[int]

step_id: Optional[str]

tool_calls: Optional[ToolCalls]

One of the following:

List[ToolCall]

arguments: str

tool_call_id: str

class ToolCallDelta: …

arguments: Optional[str]

tool_call_id: Optional[str]

class ToolReturnMessage: …

A message representing the return value of a tool call (generated by Letta executing the requested tool).

Args: id (str): The ID of the message date (datetime): The date the message was created in ISO format name (Optional[str]): The name of the sender of the message tool_return (str): The return value of the tool (deprecated, use tool_returns) status (Literal[“success”, “error”]): The status of the tool call (deprecated, use tool_returns) tool_call_id (str): A unique identifier for the tool call that generated this message (deprecated, use tool_returns) stdout (Optional[List(str)]): Captured stdout (e.g. prints, logs) from the tool invocation (deprecated, use tool_returns) stderr (Optional[List(str)]): Captured stderr from the tool invocation (deprecated, use tool_returns) tool_returns (Optional[List[ToolReturn]]): List of tool returns for multi-tool support

id: str

date: datetime

Deprecatedstatus: Literal["success", "error"]

One of the following:

"success"

"error"

Deprecatedtool_call_id: str

Deprecatedtool_return: str

is_err: Optional[bool]

message_type: Optional[Literal["tool_return_message"]]

The type of the message.

otid: Optional[str]

run_id: Optional[str]

sender_id: Optional[str]

seq_id: Optional[int]

Deprecatedstderr: Optional[List[str]]

Deprecatedstdout: Optional[List[str]]

step_id: Optional[str]

tool_returns: Optional[List[ToolReturn]]

status: Literal["success", "error"]

One of the following:

"success"

"error"

tool_call_id: str

tool_return: Union[List[ToolReturnUnionMember0], str]

The tool return value - either a string or list of content parts (text/image)

One of the following:

List[ToolReturnUnionMember0]

One of the following:

class TextContent: …

text: str

The text content of the message.

signature: Optional[str]

Stores a unique identifier for any reasoning associated with this text content.

type: Optional[Literal["text"]]

The type of the message.

class ImageContent: …

source: Source

The source of the image.

One of the following:

class SourceURLImage: …

url: str

The URL of the image.

type: Optional[Literal["url"]]

The source type for the image.

class SourceBase64Image: …

data: str

The base64 encoded image data.

media_type: str

The media type for the image.

detail: Optional[str]

What level of detail to use when processing and understanding the image (low, high, or auto to let the model decide)

type: Optional[Literal["base64"]]

The source type for the image.

class SourceLettaImage: …

file_id: str

The unique identifier of the image file persisted in storage.

data: Optional[str]

The base64 encoded image data.

detail: Optional[str]

What level of detail to use when processing and understanding the image (low, high, or auto to let the model decide)

media_type: Optional[str]

The media type for the image.

type: Optional[Literal["letta"]]

The source type for the image.

type: Optional[Literal["image"]]

The type of the message.

str

stderr: Optional[List[str]]

stdout: Optional[List[str]]

type: Optional[Literal["tool"]]

The message type to be created.

class AssistantMessage: …

A message sent by the LLM in response to user input. Used in the LLM context.

id: str

content: Union[List[LettaAssistantMessageContentUnion], str]

The message content sent by the agent (can be a string or an array of content parts)

One of the following:

List[LettaAssistantMessageContentUnion]

text: str

The text content of the message.

signature: Optional[str]

Stores a unique identifier for any reasoning associated with this text content.

type: Optional[Literal["text"]]

The type of the message.

str

date: datetime

is_err: Optional[bool]

message_type: Optional[Literal["assistant_message"]]

The type of the message.

otid: Optional[str]

run_id: Optional[str]

sender_id: Optional[str]

seq_id: Optional[int]

step_id: Optional[str]

class ApprovalRequestMessage: …

A message representing a request for approval to call a tool (generated by the LLM to trigger tool execution).

Args: id (str): The ID of the message date (datetime): The date the message was created in ISO format name (Optional[str]): The name of the sender of the message tool_call (ToolCall): The tool call

id: str

date: datetime

Deprecatedtool_call: ToolCall

The tool call that has been requested by the llm to run

One of the following:

class ToolCall: …

arguments: str

tool_call_id: str

class ToolCallDelta: …

arguments: Optional[str]

tool_call_id: Optional[str]

is_err: Optional[bool]

message_type: Optional[Literal["approval_request_message"]]

The type of the message.

otid: Optional[str]

run_id: Optional[str]

sender_id: Optional[str]

seq_id: Optional[int]

step_id: Optional[str]

tool_calls: Optional[ToolCalls]

The tool calls that have been requested by the llm to run, which are pending approval

One of the following:

List[ToolCall]

arguments: str

tool_call_id: str

class ToolCallDelta: …

arguments: Optional[str]

tool_call_id: Optional[str]

class ApprovalResponseMessage: …

A message representing a response form the user indicating whether a tool has been approved to run.

id: str

date: datetime

Deprecatedapproval_request_id: Optional[str]

The message ID of the approval request

approvals: Optional[List[Approval]]

The list of approval responses

One of the following:

class ApprovalReturn: …

approve: bool

Whether the tool has been approved

tool_call_id: str

The ID of the tool call that corresponds to this approval

reason: Optional[str]

An optional explanation for the provided approval status

type: Optional[Literal["approval"]]

The message type to be created.

class ToolReturn: …

status: Literal["success", "error"]

One of the following:

"success"

"error"

tool_call_id: str

tool_return: Union[List[ToolReturnUnionMember0], str]

The tool return value - either a string or list of content parts (text/image)

One of the following:

List[ToolReturnUnionMember0]

One of the following:

class TextContent: …

text: str

The text content of the message.

signature: Optional[str]

Stores a unique identifier for any reasoning associated with this text content.

type: Optional[Literal["text"]]

The type of the message.

class ImageContent: …

source: Source

The source of the image.

One of the following:

class SourceURLImage: …

url: str

The URL of the image.

type: Optional[Literal["url"]]

The source type for the image.

class SourceBase64Image: …

data: str

The base64 encoded image data.

media_type: str

The media type for the image.

detail: Optional[str]

What level of detail to use when processing and understanding the image (low, high, or auto to let the model decide)

type: Optional[Literal["base64"]]

The source type for the image.

class SourceLettaImage: …

file_id: str

The unique identifier of the image file persisted in storage.

data: Optional[str]

The base64 encoded image data.

detail: Optional[str]

What level of detail to use when processing and understanding the image (low, high, or auto to let the model decide)

media_type: Optional[str]

The media type for the image.

type: Optional[Literal["letta"]]

The source type for the image.

type: Optional[Literal["image"]]

The type of the message.

str

stderr: Optional[List[str]]

stdout: Optional[List[str]]

type: Optional[Literal["tool"]]

The message type to be created.

Deprecatedapprove: Optional[bool]

Whether the tool has been approved

is_err: Optional[bool]

message_type: Optional[Literal["approval_response_message"]]

The type of the message.

otid: Optional[str]

Deprecatedreason: Optional[str]

An optional explanation for the provided approval status

run_id: Optional[str]

sender_id: Optional[str]

seq_id: Optional[int]

step_id: Optional[str]

class SummaryMessage: …

A message representing a summary of the conversation. Sent to the LLM as a user or system message depending on the provider.

id: str

date: datetime

summary: str

compaction_stats: Optional[CompactionStats]

Statistics about a memory compaction operation.

context_window: int

The model’s context window size

messages_count_after: int

Number of messages after compaction

messages_count_before: int

Number of messages before compaction

trigger: str

What triggered the compaction (e.g., ‘context_window_exceeded’, ‘post_step_context_check’)

context_tokens_after: Optional[int]

Token count after compaction (message tokens only, does not include tool definitions)

context_tokens_before: Optional[int]

Token count before compaction (from LLM usage stats, includes full context sent to LLM)

is_err: Optional[bool]

message_type: Optional[Literal["summary_message"]]

otid: Optional[str]

run_id: Optional[str]

sender_id: Optional[str]

seq_id: Optional[int]

step_id: Optional[str]

class EventMessage: …

A message for notifying the developer that an event that has occured (e.g. a compaction). Events are NOT part of the context window.

id: str

date: datetime

event_data: Dict[str, object]

event_type: Literal["compaction"]

is_err: Optional[bool]

message_type: Optional[Literal["event_message"]]

otid: Optional[str]

run_id: Optional[str]

sender_id: Optional[str]

seq_id: Optional[int]

step_id: Optional[str]

stop_reason: StopReason

The stop reason from Letta indicating why agent loop stopped execution.

stop_reason: StopReasonType

The reason why execution stopped.

One of the following:

"end_turn"

"error"

"llm_api_error"

"invalid_llm_response"

"invalid_tool_call"

"max_steps"

"max_tokens_exceeded"

"no_tool_call"

"tool_rule"

"cancelled"

"insufficient_credits"

"requires_approval"

"context_window_overflow_in_system_prompt"

message_type: Optional[Literal["stop_reason"]]

The type of the message.

usage: Usage

The usage statistics of the agent.

cache_write_tokens: Optional[int]

The number of input tokens written to cache (Anthropic only). None if not reported by provider.

cached_input_tokens: Optional[int]

The number of input tokens served from cache. None if not reported by provider.

completion_tokens: Optional[int]

The number of tokens generated by the agent.

context_tokens: Optional[int]

Estimate of tokens currently in the context window.

message_type: Optional[Literal["usage_statistics"]]

prompt_tokens: Optional[int]

The number of tokens in the prompt.

reasoning_tokens: Optional[int]

The number of reasoning/thinking tokens generated. None if not reported by provider.

run_ids: Optional[List[str]]

The background task run IDs associated with the agent interaction

step_count: Optional[int]

The number of steps taken by the agent.

total_tokens: Optional[int]

The total number of tokens processed by the agent.

logprobs: Optional[Logprobs]

Log probabilities of the output tokens from the last LLM call. Only present if return_logprobs was enabled.

content: Optional[List[LogprobsContent]]

token: str

logprob: float

top_logprobs: List[LogprobsContentTopLogprob]

token: str

logprob: float

bytes: Optional[List[int]]

refusal: Optional[List[LogprobsRefusal]]

token: str

logprob: float

top_logprobs: List[LogprobsRefusalTopLogprob]

token: str

logprob: float

bytes: Optional[List[int]]

turns: Optional[List[Turn]]

Token data for all LLM generations in multi-turn agent interaction. Includes token IDs and logprobs for each assistant turn, plus tool result content. Only present if return_token_ids was enabled. Used for RL training with loss masking.

role: Literal["assistant", "tool"]

Role of this turn: ‘assistant’ for LLM generations (trainable), ‘tool’ for tool results (non-trainable).

One of the following:

"assistant"

"tool"

content: Optional[str]

Text content. For tool turns, client tokenizes this with loss_mask=0.

output_ids: Optional[List[int]]

Token IDs from SGLang native endpoint. Only present for assistant turns.

output_token_logprobs: Optional[List[List[object]]]

Logprobs from SGLang: [[logprob, token_id, top_logprob_or_null], …]. Only present for assistant turns.

tool_name: Optional[str]

Name of the tool called. Only present for tool turns.

class LettaStreamingRequest: …

Deprecatedassistant_message_tool_kwarg: Optional[str]

The name of the message argument in the designated message tool. Still supported for legacy agent types, but deprecated for letta_v1_agent onward.

Deprecatedassistant_message_tool_name: Optional[str]

The name of the designated message tool. Still supported for legacy agent types, but deprecated for letta_v1_agent onward.

background: Optional[bool]

Whether to process the request in the background (only used when streaming=true).

client_skills: Optional[List[ClientSkill]]

Client-side skills available in the environment. These are rendered in the system prompt’s available skills section alongside agent-scoped skills from MemFS.

description: str

Description of what the skill does

location: str

Path or location hint for the skill (e.g. skills/my-skill/SKILL.md)

The name of the skill

client_tools: Optional[List[ClientTool]]

Client-side tools that the agent can call. When the agent calls a client-side tool, execution pauses and returns control to the client to execute the tool and provide the result via a ToolReturn.

The name of the tool function

description: Optional[str]

Description of what the tool does

parameters: Optional[Dict[str, object]]

JSON Schema for the function parameters

Deprecatedenable_thinking: Optional[str]

If set to True, enables reasoning before responses or tool calls from the agent.

include_compaction_messages: Optional[bool]

If True, compaction events emit structured SummaryMessage and EventMessage types. If False (default), compaction messages are not included in the response.

include_pings: Optional[bool]

Whether to include periodic keepalive ping messages in the stream to prevent connection timeouts (only used when streaming=true).

include_return_message_types: Optional[List[MessageType]]

Only return specified message types in the response. If None (default) returns all messages.

One of the following:

"system_message"

"user_message"

"assistant_message"

"reasoning_message"

"hidden_reasoning_message"

"tool_call_message"

"tool_return_message"

"approval_request_message"

"approval_response_message"

"summary_message"

"event_message"

input: Optional[Union[str, List[InputUnionMember1], null]]

Syntactic sugar for a single user message. Equivalent to messages=[{‘role’: ‘user’, ‘content’: input}].

One of the following:

str

List[InputUnionMember1]

One of the following:

class TextContent: …

text: str

The text content of the message.

signature: Optional[str]

Stores a unique identifier for any reasoning associated with this text content.

type: Optional[Literal["text"]]

The type of the message.

class ImageContent: …

source: Source

The source of the image.

One of the following:

class SourceURLImage: …

url: str

The URL of the image.

type: Optional[Literal["url"]]

The source type for the image.

class SourceBase64Image: …

data: str

The base64 encoded image data.

media_type: str

The media type for the image.

detail: Optional[str]

What level of detail to use when processing and understanding the image (low, high, or auto to let the model decide)

type: Optional[Literal["base64"]]

The source type for the image.

class SourceLettaImage: …

file_id: str

The unique identifier of the image file persisted in storage.

data: Optional[str]

The base64 encoded image data.

detail: Optional[str]

What level of detail to use when processing and understanding the image (low, high, or auto to let the model decide)

media_type: Optional[str]

The media type for the image.

type: Optional[Literal["letta"]]

The source type for the image.

type: Optional[Literal["image"]]

The type of the message.

class ToolCallContent: …

id: str

A unique identifier for this specific tool call instance.

input: Dict[str, object]

The parameters being passed to the tool, structured as a dictionary of parameter names to values.

The name of the tool being called.

signature: Optional[str]

Stores a unique identifier for any reasoning associated with this tool call.

type: Optional[Literal["tool_call"]]

Indicates this content represents a tool call event.

class ToolReturnContent: …

content: str

The content returned by the tool execution.

is_error: bool

Indicates whether the tool execution resulted in an error.

tool_call_id: str

References the ID of the ToolCallContent that initiated this tool call.

type: Optional[Literal["tool_return"]]

Indicates this content represents a tool return event.

class ReasoningContent: …

Sent via the Anthropic Messages API

is_native: bool

Whether the reasoning content was generated by a reasoner model that processed this step.

reasoning: str

The intermediate reasoning or thought process content.

signature: Optional[str]

A unique identifier for this reasoning step.

type: Optional[Literal["reasoning"]]

Indicates this is a reasoning/intermediate step.

class RedactedReasoningContent: …

Sent via the Anthropic Messages API

data: str

The redacted or filtered intermediate reasoning content.

type: Optional[Literal["redacted_reasoning"]]

Indicates this is a redacted thinking step.

class OmittedReasoningContent: …

A placeholder for reasoning content we know is present, but isn’t returned by the provider (e.g. OpenAI GPT-5 on ChatCompletions)

signature: Optional[str]

A unique identifier for this reasoning step.

type: Optional[Literal["omitted_reasoning"]]

Indicates this is an omitted reasoning step.

class InputUnionMember1SummarizedReasoningContent: …

The style of reasoning content returned by the OpenAI Responses API

id: str

The unique identifier for this reasoning step.

summary: List[InputUnionMember1SummarizedReasoningContentSummary]

Summaries of the reasoning content.

index: int

The index of the summary part.

text: str

The text of the summary part.

encrypted_content: Optional[str]

The encrypted reasoning content.

type: Optional[Literal["summarized_reasoning"]]

Indicates this is a summarized reasoning step.

max_steps: Optional[int]

Maximum number of steps the agent should take to process the request.

messages: Optional[List[Message]]

The messages to be sent to the agent.

One of the following:

class MessageCreate: …

Request to create a message

content: Union[List[LettaMessageContentUnion], str]

The content of the message.

One of the following:

List[LettaMessageContentUnion]

One of the following:

class TextContent: …

text: str

The text content of the message.

signature: Optional[str]

Stores a unique identifier for any reasoning associated with this text content.

type: Optional[Literal["text"]]

The type of the message.

class ImageContent: …

source: Source

The source of the image.

One of the following:

class SourceURLImage: …

url: str

The URL of the image.

type: Optional[Literal["url"]]

The source type for the image.

class SourceBase64Image: …

data: str

The base64 encoded image data.

media_type: str

The media type for the image.

detail: Optional[str]

What level of detail to use when processing and understanding the image (low, high, or auto to let the model decide)

type: Optional[Literal["base64"]]

The source type for the image.

class SourceLettaImage: …

file_id: str

The unique identifier of the image file persisted in storage.

data: Optional[str]

The base64 encoded image data.

detail: Optional[str]

What level of detail to use when processing and understanding the image (low, high, or auto to let the model decide)

media_type: Optional[str]

The media type for the image.

type: Optional[Literal["letta"]]

The source type for the image.

type: Optional[Literal["image"]]

The type of the message.

class ToolCallContent: …

id: str

A unique identifier for this specific tool call instance.

input: Dict[str, object]

The parameters being passed to the tool, structured as a dictionary of parameter names to values.

The name of the tool being called.

signature: Optional[str]

Stores a unique identifier for any reasoning associated with this tool call.

type: Optional[Literal["tool_call"]]

Indicates this content represents a tool call event.

class ToolReturnContent: …

content: str

The content returned by the tool execution.

is_error: bool

Indicates whether the tool execution resulted in an error.

tool_call_id: str

References the ID of the ToolCallContent that initiated this tool call.

type: Optional[Literal["tool_return"]]

Indicates this content represents a tool return event.

class ReasoningContent: …

Sent via the Anthropic Messages API

is_native: bool

Whether the reasoning content was generated by a reasoner model that processed this step.

reasoning: str

The intermediate reasoning or thought process content.

signature: Optional[str]

A unique identifier for this reasoning step.

type: Optional[Literal["reasoning"]]

Indicates this is a reasoning/intermediate step.

class RedactedReasoningContent: …

Sent via the Anthropic Messages API

data: str

The redacted or filtered intermediate reasoning content.

type: Optional[Literal["redacted_reasoning"]]

Indicates this is a redacted thinking step.

class OmittedReasoningContent: …

A placeholder for reasoning content we know is present, but isn’t returned by the provider (e.g. OpenAI GPT-5 on ChatCompletions)

signature: Optional[str]

A unique identifier for this reasoning step.

type: Optional[Literal["omitted_reasoning"]]

Indicates this is an omitted reasoning step.

str

role: Literal["user", "system", "assistant"]

The role of the participant.

One of the following:

"user"

"system"

"assistant"

batch_item_id: Optional[str]

The id of the LLMBatchItem that this message is associated with

group_id: Optional[str]

The multi-agent group that the message was sent in

The name of the participant.

otid: Optional[str]

sender_id: Optional[str]

The id of the sender of the message, can be an identity id or agent id

type: Optional[Literal["message"]]

The message type to be created.

class ApprovalCreate: …

Input to approve or deny a tool call request

Deprecatedapproval_request_id: Optional[str]

The message ID of the approval request

approvals: Optional[List[Approval]]

The list of approval responses

One of the following:

class ApprovalReturn: …

approve: bool

Whether the tool has been approved

tool_call_id: str

The ID of the tool call that corresponds to this approval

reason: Optional[str]

An optional explanation for the provided approval status

type: Optional[Literal["approval"]]

The message type to be created.

class ToolReturn: …

status: Literal["success", "error"]

One of the following:

"success"

"error"

tool_call_id: str

tool_return: Union[List[ToolReturnUnionMember0], str]

The tool return value - either a string or list of content parts (text/image)

One of the following:

List[ToolReturnUnionMember0]

One of the following:

class TextContent: …

text: str

The text content of the message.

signature: Optional[str]

Stores a unique identifier for any reasoning associated with this text content.

type: Optional[Literal["text"]]

The type of the message.

class ImageContent: …

source: Source

The source of the image.

One of the following:

class SourceURLImage: …

url: str

The URL of the image.

type: Optional[Literal["url"]]

The source type for the image.

class SourceBase64Image: …

data: str

The base64 encoded image data.

media_type: str

The media type for the image.

detail: Optional[str]

What level of detail to use when processing and understanding the image (low, high, or auto to let the model decide)

type: Optional[Literal["base64"]]

The source type for the image.

class SourceLettaImage: …

file_id: str

The unique identifier of the image file persisted in storage.

data: Optional[str]

The base64 encoded image data.

detail: Optional[str]

What level of detail to use when processing and understanding the image (low, high, or auto to let the model decide)

media_type: Optional[str]

The media type for the image.

type: Optional[Literal["letta"]]

The source type for the image.

type: Optional[Literal["image"]]

The type of the message.

str

stderr: Optional[List[str]]

stdout: Optional[List[str]]

type: Optional[Literal["tool"]]

The message type to be created.

Deprecatedapprove: Optional[bool]

Whether the tool has been approved

group_id: Optional[str]

The multi-agent group that the message was sent in

otid: Optional[str]

Deprecatedreason: Optional[str]

An optional explanation for the provided approval status

type: Optional[Literal["approval"]]

The message type to be created.

class MessageToolReturnCreate: …

Submit tool return(s) from client-side tool execution.

tool_returns: List[ToolReturn]

List of tool returns from client-side execution

status: Literal["success", "error"]

One of the following:

"success"

"error"

tool_call_id: str

tool_return: Union[List[ToolReturnUnionMember0], str]

The tool return value - either a string or list of content parts (text/image)

One of the following:

List[ToolReturnUnionMember0]

One of the following:

class TextContent: …

text: str

The text content of the message.

signature: Optional[str]

Stores a unique identifier for any reasoning associated with this text content.

type: Optional[Literal["text"]]

The type of the message.

class ImageContent: …

source: Source

The source of the image.

One of the following:

class SourceURLImage: …

url: str

The URL of the image.

type: Optional[Literal["url"]]

The source type for the image.

class SourceBase64Image: …

data: str

The base64 encoded image data.

media_type: str

The media type for the image.

detail: Optional[str]

What level of detail to use when processing and understanding the image (low, high, or auto to let the model decide)

type: Optional[Literal["base64"]]

The source type for the image.

class SourceLettaImage: …

file_id: str

The unique identifier of the image file persisted in storage.

data: Optional[str]

The base64 encoded image data.

detail: Optional[str]

What level of detail to use when processing and understanding the image (low, high, or auto to let the model decide)

media_type: Optional[str]

The media type for the image.

type: Optional[Literal["letta"]]

The source type for the image.

type: Optional[Literal["image"]]

The type of the message.

str

stderr: Optional[List[str]]

stdout: Optional[List[str]]

type: Optional[Literal["tool"]]

The message type to be created.

group_id: Optional[str]

The multi-agent group that the message was sent in

otid: Optional[str]

type: Optional[Literal["tool_return"]]

The message type to be created.

override_model: Optional[str]

Model handle to use for this request instead of the agent’s default model. This allows sending a message to a different model without changing the agent’s configuration.

override_system: Optional[str]

Optional per-request system prompt override. When set, this is passed directly to the underlying LLM request and bypasses the persisted/compiled system message for that request.

return_logprobs: Optional[bool]

If True, returns log probabilities of the output tokens in the response. Useful for RL training. Only supported for OpenAI-compatible providers (including SGLang).

return_token_ids: Optional[bool]

stream_tokens: Optional[bool]

Flag to determine if individual tokens should be streamed, rather than streaming per step (only used when streaming=true).

streaming: Optional[bool]

If True, returns a streaming response (Server-Sent Events). If False (default), returns a complete response.

top_logprobs: Optional[int]

Number of most likely tokens to return at each position (0-20). Requires return_logprobs=True.

Deprecateduse_assistant_message: Optional[bool]

Whether the server should parse specific tool call arguments (default send_message) as AssistantMessage objects. Still supported for legacy agent types, but deprecated for letta_v1_agent onward.

LettaStreamingResponse

Streaming response type for Server-Sent Events (SSE) endpoints. Each event in the stream will be one of these types.

One of the following:

class SystemMessage: …

A message generated by the system. Never streamed back on a response, only used for cursor pagination.

id: str

content: str

The message content sent by the system

date: datetime

is_err: Optional[bool]

message_type: Optional[Literal["system_message"]]

The type of the message.

otid: Optional[str]

run_id: Optional[str]

sender_id: Optional[str]

seq_id: Optional[int]

step_id: Optional[str]

class UserMessage: …

A message sent by the user. Never streamed back on a response, only used for cursor pagination.

Args: id (str): The ID of the message date (datetime): The date the message was created in ISO format name (Optional[str]): The name of the sender of the message content (Union[str, List[LettaUserMessageContentUnion]]): The message content sent by the user (can be a string or an array of multi-modal content parts)

id: str

content: Union[List[LettaUserMessageContentUnion], str]

The message content sent by the user (can be a string or an array of multi-modal content parts)

One of the following:

List[LettaUserMessageContentUnion]

One of the following:

class TextContent: …

text: str

The text content of the message.

signature: Optional[str]

Stores a unique identifier for any reasoning associated with this text content.

type: Optional[Literal["text"]]

The type of the message.

class ImageContent: …

source: Source

The source of the image.

One of the following:

class SourceURLImage: …

url: str

The URL of the image.

type: Optional[Literal["url"]]

The source type for the image.

class SourceBase64Image: …

data: str

The base64 encoded image data.

media_type: str

The media type for the image.

detail: Optional[str]

What level of detail to use when processing and understanding the image (low, high, or auto to let the model decide)

type: Optional[Literal["base64"]]

The source type for the image.

class SourceLettaImage: …

file_id: str

The unique identifier of the image file persisted in storage.

data: Optional[str]

The base64 encoded image data.

detail: Optional[str]

What level of detail to use when processing and understanding the image (low, high, or auto to let the model decide)

media_type: Optional[str]

The media type for the image.

type: Optional[Literal["letta"]]

The source type for the image.

type: Optional[Literal["image"]]

The type of the message.

str

date: datetime

is_err: Optional[bool]

message_type: Optional[Literal["user_message"]]

The type of the message.

otid: Optional[str]

run_id: Optional[str]

sender_id: Optional[str]

seq_id: Optional[int]

step_id: Optional[str]

class ReasoningMessage: …

Representation of an agent’s internal reasoning.

id: str

date: datetime

reasoning: str

is_err: Optional[bool]

message_type: Optional[Literal["reasoning_message"]]

The type of the message.

otid: Optional[str]

run_id: Optional[str]

sender_id: Optional[str]

seq_id: Optional[int]

signature: Optional[str]

source: Optional[Literal["reasoner_model", "non_reasoner_model"]]

One of the following:

"reasoner_model"

"non_reasoner_model"

step_id: Optional[str]

class HiddenReasoningMessage: …

Representation of an agent’s internal reasoning where reasoning content has been hidden from the response.

id: str

date: datetime

state: Literal["redacted", "omitted"]

One of the following:

"redacted"

"omitted"

hidden_reasoning: Optional[str]

is_err: Optional[bool]

message_type: Optional[Literal["hidden_reasoning_message"]]

The type of the message.

otid: Optional[str]

run_id: Optional[str]

sender_id: Optional[str]

seq_id: Optional[int]

step_id: Optional[str]

class ToolCallMessage: …

A message representing a request to call a tool (generated by the LLM to trigger tool execution).

id: str

date: datetime

Deprecatedtool_call: ToolCall

One of the following:

class ToolCall: …

arguments: str

tool_call_id: str

class ToolCallDelta: …

arguments: Optional[str]

tool_call_id: Optional[str]

is_err: Optional[bool]

message_type: Optional[Literal["tool_call_message"]]

The type of the message.

otid: Optional[str]

run_id: Optional[str]

sender_id: Optional[str]

seq_id: Optional[int]

step_id: Optional[str]

tool_calls: Optional[ToolCalls]

One of the following:

List[ToolCall]

arguments: str

tool_call_id: str

class ToolCallDelta: …

arguments: Optional[str]

tool_call_id: Optional[str]

class ToolReturnMessage: …

A message representing the return value of a tool call (generated by Letta executing the requested tool).

id: str

date: datetime

Deprecatedstatus: Literal["success", "error"]

One of the following:

"success"

"error"

Deprecatedtool_call_id: str

Deprecatedtool_return: str

is_err: Optional[bool]

message_type: Optional[Literal["tool_return_message"]]

The type of the message.

otid: Optional[str]

run_id: Optional[str]

sender_id: Optional[str]

seq_id: Optional[int]

Deprecatedstderr: Optional[List[str]]

Deprecatedstdout: Optional[List[str]]

step_id: Optional[str]

tool_returns: Optional[List[ToolReturn]]

status: Literal["success", "error"]

One of the following:

"success"

"error"

tool_call_id: str

tool_return: Union[List[ToolReturnUnionMember0], str]

The tool return value - either a string or list of content parts (text/image)

One of the following:

List[ToolReturnUnionMember0]

One of the following:

class TextContent: …

text: str

The text content of the message.

signature: Optional[str]

Stores a unique identifier for any reasoning associated with this text content.

type: Optional[Literal["text"]]

The type of the message.

class ImageContent: …

source: Source

The source of the image.

One of the following:

class SourceURLImage: …

url: str

The URL of the image.

type: Optional[Literal["url"]]

The source type for the image.

class SourceBase64Image: …

data: str

The base64 encoded image data.

media_type: str

The media type for the image.

detail: Optional[str]

What level of detail to use when processing and understanding the image (low, high, or auto to let the model decide)

type: Optional[Literal["base64"]]

The source type for the image.

class SourceLettaImage: …

file_id: str

The unique identifier of the image file persisted in storage.

data: Optional[str]

The base64 encoded image data.

detail: Optional[str]

What level of detail to use when processing and understanding the image (low, high, or auto to let the model decide)

media_type: Optional[str]

The media type for the image.

type: Optional[Literal["letta"]]

The source type for the image.

type: Optional[Literal["image"]]

The type of the message.

str

stderr: Optional[List[str]]

stdout: Optional[List[str]]

type: Optional[Literal["tool"]]

The message type to be created.

class AssistantMessage: …

A message sent by the LLM in response to user input. Used in the LLM context.

id: str

content: Union[List[LettaAssistantMessageContentUnion], str]

The message content sent by the agent (can be a string or an array of content parts)

One of the following:

List[LettaAssistantMessageContentUnion]

text: str

The text content of the message.

signature: Optional[str]

Stores a unique identifier for any reasoning associated with this text content.

type: Optional[Literal["text"]]

The type of the message.

str

date: datetime

is_err: Optional[bool]

message_type: Optional[Literal["assistant_message"]]

The type of the message.

otid: Optional[str]

run_id: Optional[str]

sender_id: Optional[str]

seq_id: Optional[int]

step_id: Optional[str]

class ApprovalRequestMessage: …

A message representing a request for approval to call a tool (generated by the LLM to trigger tool execution).

Args: id (str): The ID of the message date (datetime): The date the message was created in ISO format name (Optional[str]): The name of the sender of the message tool_call (ToolCall): The tool call

id: str

date: datetime

Deprecatedtool_call: ToolCall

The tool call that has been requested by the llm to run

One of the following:

class ToolCall: …

arguments: str

tool_call_id: str

class ToolCallDelta: …

arguments: Optional[str]

tool_call_id: Optional[str]

is_err: Optional[bool]

message_type: Optional[Literal["approval_request_message"]]

The type of the message.

otid: Optional[str]

run_id: Optional[str]

sender_id: Optional[str]

seq_id: Optional[int]

step_id: Optional[str]

tool_calls: Optional[ToolCalls]

The tool calls that have been requested by the llm to run, which are pending approval

One of the following:

List[ToolCall]

arguments: str

tool_call_id: str

class ToolCallDelta: …

arguments: Optional[str]

tool_call_id: Optional[str]

class ApprovalResponseMessage: …

A message representing a response form the user indicating whether a tool has been approved to run.

id: str

date: datetime

Deprecatedapproval_request_id: Optional[str]

The message ID of the approval request

approvals: Optional[List[Approval]]

The list of approval responses

One of the following:

class ApprovalReturn: …

approve: bool

Whether the tool has been approved

tool_call_id: str

The ID of the tool call that corresponds to this approval

reason: Optional[str]

An optional explanation for the provided approval status

type: Optional[Literal["approval"]]

The message type to be created.

class ToolReturn: …

status: Literal["success", "error"]

One of the following:

"success"

"error"

tool_call_id: str

tool_return: Union[List[ToolReturnUnionMember0], str]

The tool return value - either a string or list of content parts (text/image)

One of the following:

List[ToolReturnUnionMember0]

One of the following:

class TextContent: …

text: str

The text content of the message.

signature: Optional[str]

Stores a unique identifier for any reasoning associated with this text content.

type: Optional[Literal["text"]]

The type of the message.

class ImageContent: …

source: Source

The source of the image.

One of the following:

class SourceURLImage: …

url: str

The URL of the image.

type: Optional[Literal["url"]]

The source type for the image.

class SourceBase64Image: …

data: str

The base64 encoded image data.

media_type: str

The media type for the image.

detail: Optional[str]

What level of detail to use when processing and understanding the image (low, high, or auto to let the model decide)

type: Optional[Literal["base64"]]

The source type for the image.

class SourceLettaImage: …

file_id: str

The unique identifier of the image file persisted in storage.

data: Optional[str]

The base64 encoded image data.

detail: Optional[str]

What level of detail to use when processing and understanding the image (low, high, or auto to let the model decide)

media_type: Optional[str]

The media type for the image.

type: Optional[Literal["letta"]]

The source type for the image.

type: Optional[Literal["image"]]

The type of the message.

str

stderr: Optional[List[str]]

stdout: Optional[List[str]]

type: Optional[Literal["tool"]]

The message type to be created.

Deprecatedapprove: Optional[bool]

Whether the tool has been approved

is_err: Optional[bool]

message_type: Optional[Literal["approval_response_message"]]

The type of the message.

otid: Optional[str]

Deprecatedreason: Optional[str]

An optional explanation for the provided approval status

run_id: Optional[str]

sender_id: Optional[str]

seq_id: Optional[int]

step_id: Optional[str]

class LettaPing: …

A ping message used as a keepalive to prevent SSE streams from timing out during long running requests.

Args: id (str): The ID of the message date (datetime): The date the message was created in ISO format

id: str

date: datetime

is_err: Optional[bool]

message_type: Optional[Literal["ping"]]

The type of the message. Ping messages are a keep-alive to prevent SSE streams from timing out during long running requests.

otid: Optional[str]

run_id: Optional[str]

sender_id: Optional[str]

seq_id: Optional[int]

step_id: Optional[str]

class LettaErrorMessage: …

Error messages are used to notify the client of an error that occurred during the agent’s execution.

error_type: str

The type of error.

message: str

The error message.

message_type: Literal["error_message"]

The type of the message.

run_id: str

The ID of the run.

detail: Optional[str]

An optional error detail.

seq_id: Optional[int]

The sequence ID for cursor-based pagination.

class LettaStopReason: …

The stop reason from Letta indicating why agent loop stopped execution.

stop_reason: StopReasonType

The reason why execution stopped.

One of the following:

"end_turn"

"error"

"llm_api_error"

"invalid_llm_response"

"invalid_tool_call"

"max_steps"

"max_tokens_exceeded"

"no_tool_call"

"tool_rule"

"cancelled"

"insufficient_credits"

"requires_approval"

"context_window_overflow_in_system_prompt"

message_type: Optional[Literal["stop_reason"]]

The type of the message.

class LettaUsageStatistics: …

Usage statistics for the agent interaction.

Attributes: completion_tokens (int): The number of tokens generated by the agent. prompt_tokens (int): The number of tokens in the prompt. total_tokens (int): The total number of tokens processed by the agent. step_count (int): The number of steps taken by the agent. cached_input_tokens (Optional[int]): The number of input tokens served from cache. None if not reported. cache_write_tokens (Optional[int]): The number of input tokens written to cache. None if not reported. reasoning_tokens (Optional[int]): The number of reasoning/thinking tokens generated. None if not reported.

cache_write_tokens: Optional[int]

The number of input tokens written to cache (Anthropic only). None if not reported by provider.

cached_input_tokens: Optional[int]

The number of input tokens served from cache. None if not reported by provider.

completion_tokens: Optional[int]

The number of tokens generated by the agent.

context_tokens: Optional[int]

Estimate of tokens currently in the context window.

message_type: Optional[Literal["usage_statistics"]]

prompt_tokens: Optional[int]

The number of tokens in the prompt.

reasoning_tokens: Optional[int]

The number of reasoning/thinking tokens generated. None if not reported by provider.

run_ids: Optional[List[str]]

The background task run IDs associated with the agent interaction

step_count: Optional[int]

The number of steps taken by the agent.

total_tokens: Optional[int]

The total number of tokens processed by the agent.

LettaUserMessageContentUnion

One of the following:

class TextContent: …

text: str

The text content of the message.

signature: Optional[str]

Stores a unique identifier for any reasoning associated with this text content.

type: Optional[Literal["text"]]

The type of the message.

class ImageContent: …

source: Source

The source of the image.

One of the following:

class SourceURLImage: …

url: str

The URL of the image.

type: Optional[Literal["url"]]

The source type for the image.

class SourceBase64Image: …

data: str

The base64 encoded image data.

media_type: str

The media type for the image.

detail: Optional[str]

What level of detail to use when processing and understanding the image (low, high, or auto to let the model decide)

type: Optional[Literal["base64"]]

The source type for the image.

class SourceLettaImage: …

file_id: str

The unique identifier of the image file persisted in storage.

data: Optional[str]

The base64 encoded image data.

detail: Optional[str]

What level of detail to use when processing and understanding the image (low, high, or auto to let the model decide)

media_type: Optional[str]

The media type for the image.

type: Optional[Literal["letta"]]

The source type for the image.

type: Optional[Literal["image"]]

The type of the message.

Message

A message generated by the system. Never streamed back on a response, only used for cursor pagination.

One of the following:

class SystemMessage: …

A message generated by the system. Never streamed back on a response, only used for cursor pagination.

id: str

content: str

The message content sent by the system

date: datetime

is_err: Optional[bool]

message_type: Optional[Literal["system_message"]]

The type of the message.

otid: Optional[str]

run_id: Optional[str]

sender_id: Optional[str]

seq_id: Optional[int]

step_id: Optional[str]

class UserMessage: …

A message sent by the user. Never streamed back on a response, only used for cursor pagination.

Args: id (str): The ID of the message date (datetime): The date the message was created in ISO format name (Optional[str]): The name of the sender of the message content (Union[str, List[LettaUserMessageContentUnion]]): The message content sent by the user (can be a string or an array of multi-modal content parts)

id: str

content: Union[List[LettaUserMessageContentUnion], str]

The message content sent by the user (can be a string or an array of multi-modal content parts)

One of the following:

List[LettaUserMessageContentUnion]

One of the following:

class TextContent: …

text: str

The text content of the message.

signature: Optional[str]

Stores a unique identifier for any reasoning associated with this text content.

type: Optional[Literal["text"]]

The type of the message.

class ImageContent: …

source: Source

The source of the image.

One of the following:

class SourceURLImage: …

url: str

The URL of the image.

type: Optional[Literal["url"]]

The source type for the image.

class SourceBase64Image: …

data: str

The base64 encoded image data.

media_type: str

The media type for the image.

detail: Optional[str]

What level of detail to use when processing and understanding the image (low, high, or auto to let the model decide)

type: Optional[Literal["base64"]]

The source type for the image.

class SourceLettaImage: …

file_id: str

The unique identifier of the image file persisted in storage.

data: Optional[str]

The base64 encoded image data.

detail: Optional[str]

What level of detail to use when processing and understanding the image (low, high, or auto to let the model decide)

media_type: Optional[str]

The media type for the image.

type: Optional[Literal["letta"]]

The source type for the image.

type: Optional[Literal["image"]]

The type of the message.

str

date: datetime

is_err: Optional[bool]

message_type: Optional[Literal["user_message"]]

The type of the message.

otid: Optional[str]

run_id: Optional[str]

sender_id: Optional[str]

seq_id: Optional[int]

step_id: Optional[str]

class ReasoningMessage: …

Representation of an agent’s internal reasoning.

id: str

date: datetime

reasoning: str

is_err: Optional[bool]

message_type: Optional[Literal["reasoning_message"]]

The type of the message.

otid: Optional[str]

run_id: Optional[str]

sender_id: Optional[str]

seq_id: Optional[int]

signature: Optional[str]

source: Optional[Literal["reasoner_model", "non_reasoner_model"]]

One of the following:

"reasoner_model"

"non_reasoner_model"

step_id: Optional[str]

class HiddenReasoningMessage: …

Representation of an agent’s internal reasoning where reasoning content has been hidden from the response.

id: str

date: datetime

state: Literal["redacted", "omitted"]

One of the following:

"redacted"

"omitted"

hidden_reasoning: Optional[str]

is_err: Optional[bool]

message_type: Optional[Literal["hidden_reasoning_message"]]

The type of the message.

otid: Optional[str]

run_id: Optional[str]

sender_id: Optional[str]

seq_id: Optional[int]

step_id: Optional[str]

class ToolCallMessage: …

A message representing a request to call a tool (generated by the LLM to trigger tool execution).

id: str

date: datetime

Deprecatedtool_call: ToolCall

One of the following:

class ToolCall: …

arguments: str

tool_call_id: str

class ToolCallDelta: …

arguments: Optional[str]

tool_call_id: Optional[str]

is_err: Optional[bool]

message_type: Optional[Literal["tool_call_message"]]

The type of the message.

otid: Optional[str]

run_id: Optional[str]

sender_id: Optional[str]

seq_id: Optional[int]

step_id: Optional[str]

tool_calls: Optional[ToolCalls]

One of the following:

List[ToolCall]

arguments: str

tool_call_id: str

class ToolCallDelta: …

arguments: Optional[str]

tool_call_id: Optional[str]

class ToolReturnMessage: …

A message representing the return value of a tool call (generated by Letta executing the requested tool).

id: str

date: datetime

Deprecatedstatus: Literal["success", "error"]

One of the following:

"success"

"error"

Deprecatedtool_call_id: str

Deprecatedtool_return: str

is_err: Optional[bool]

message_type: Optional[Literal["tool_return_message"]]

The type of the message.

otid: Optional[str]

run_id: Optional[str]

sender_id: Optional[str]

seq_id: Optional[int]

Deprecatedstderr: Optional[List[str]]

Deprecatedstdout: Optional[List[str]]

step_id: Optional[str]

tool_returns: Optional[List[ToolReturn]]

status: Literal["success", "error"]

One of the following:

"success"

"error"

tool_call_id: str

tool_return: Union[List[ToolReturnUnionMember0], str]

The tool return value - either a string or list of content parts (text/image)

One of the following:

List[ToolReturnUnionMember0]

One of the following:

class TextContent: …

text: str

The text content of the message.

signature: Optional[str]

Stores a unique identifier for any reasoning associated with this text content.

type: Optional[Literal["text"]]

The type of the message.

class ImageContent: …

source: Source

The source of the image.

One of the following:

class SourceURLImage: …

url: str

The URL of the image.

type: Optional[Literal["url"]]

The source type for the image.

class SourceBase64Image: …

data: str

The base64 encoded image data.

media_type: str

The media type for the image.

detail: Optional[str]

What level of detail to use when processing and understanding the image (low, high, or auto to let the model decide)

type: Optional[Literal["base64"]]

The source type for the image.

class SourceLettaImage: …

file_id: str

The unique identifier of the image file persisted in storage.

data: Optional[str]

The base64 encoded image data.

detail: Optional[str]

What level of detail to use when processing and understanding the image (low, high, or auto to let the model decide)

media_type: Optional[str]

The media type for the image.

type: Optional[Literal["letta"]]

The source type for the image.

type: Optional[Literal["image"]]

The type of the message.

str

stderr: Optional[List[str]]

stdout: Optional[List[str]]

type: Optional[Literal["tool"]]

The message type to be created.

class AssistantMessage: …

A message sent by the LLM in response to user input. Used in the LLM context.

id: str

content: Union[List[LettaAssistantMessageContentUnion], str]

The message content sent by the agent (can be a string or an array of content parts)

One of the following:

List[LettaAssistantMessageContentUnion]

text: str

The text content of the message.

signature: Optional[str]

Stores a unique identifier for any reasoning associated with this text content.

type: Optional[Literal["text"]]

The type of the message.

str

date: datetime

is_err: Optional[bool]

message_type: Optional[Literal["assistant_message"]]

The type of the message.

otid: Optional[str]

run_id: Optional[str]

sender_id: Optional[str]

seq_id: Optional[int]

step_id: Optional[str]

class ApprovalRequestMessage: …

A message representing a request for approval to call a tool (generated by the LLM to trigger tool execution).

Args: id (str): The ID of the message date (datetime): The date the message was created in ISO format name (Optional[str]): The name of the sender of the message tool_call (ToolCall): The tool call

id: str

date: datetime

Deprecatedtool_call: ToolCall

The tool call that has been requested by the llm to run

One of the following:

class ToolCall: …

arguments: str

tool_call_id: str

class ToolCallDelta: …

arguments: Optional[str]

tool_call_id: Optional[str]

is_err: Optional[bool]

message_type: Optional[Literal["approval_request_message"]]

The type of the message.

otid: Optional[str]

run_id: Optional[str]

sender_id: Optional[str]

seq_id: Optional[int]

step_id: Optional[str]

tool_calls: Optional[ToolCalls]

The tool calls that have been requested by the llm to run, which are pending approval

One of the following:

List[ToolCall]

arguments: str

tool_call_id: str

class ToolCallDelta: …

arguments: Optional[str]

tool_call_id: Optional[str]

class ApprovalResponseMessage: …

A message representing a response form the user indicating whether a tool has been approved to run.

id: str

date: datetime

Deprecatedapproval_request_id: Optional[str]

The message ID of the approval request

approvals: Optional[List[Approval]]

The list of approval responses

One of the following:

class ApprovalReturn: …

approve: bool

Whether the tool has been approved

tool_call_id: str

The ID of the tool call that corresponds to this approval

reason: Optional[str]

An optional explanation for the provided approval status

type: Optional[Literal["approval"]]

The message type to be created.

class ToolReturn: …

status: Literal["success", "error"]

One of the following:

"success"

"error"

tool_call_id: str

tool_return: Union[List[ToolReturnUnionMember0], str]

The tool return value - either a string or list of content parts (text/image)

One of the following:

List[ToolReturnUnionMember0]

One of the following:

class TextContent: …

text: str

The text content of the message.

signature: Optional[str]

Stores a unique identifier for any reasoning associated with this text content.

type: Optional[Literal["text"]]

The type of the message.

class ImageContent: …

source: Source

The source of the image.

One of the following:

class SourceURLImage: …

url: str

The URL of the image.

type: Optional[Literal["url"]]

The source type for the image.

class SourceBase64Image: …

data: str

The base64 encoded image data.

media_type: str

The media type for the image.

detail: Optional[str]

What level of detail to use when processing and understanding the image (low, high, or auto to let the model decide)

type: Optional[Literal["base64"]]

The source type for the image.

class SourceLettaImage: …

file_id: str

The unique identifier of the image file persisted in storage.

data: Optional[str]

The base64 encoded image data.

detail: Optional[str]

What level of detail to use when processing and understanding the image (low, high, or auto to let the model decide)

media_type: Optional[str]

The media type for the image.

type: Optional[Literal["letta"]]

The source type for the image.

type: Optional[Literal["image"]]

The type of the message.

str

stderr: Optional[List[str]]

stdout: Optional[List[str]]

type: Optional[Literal["tool"]]

The message type to be created.

Deprecatedapprove: Optional[bool]

Whether the tool has been approved

is_err: Optional[bool]

message_type: Optional[Literal["approval_response_message"]]

The type of the message.

otid: Optional[str]

Deprecatedreason: Optional[str]

An optional explanation for the provided approval status

run_id: Optional[str]

sender_id: Optional[str]

seq_id: Optional[int]

step_id: Optional[str]

class SummaryMessage: …

A message representing a summary of the conversation. Sent to the LLM as a user or system message depending on the provider.

id: str

date: datetime

summary: str

compaction_stats: Optional[CompactionStats]

Statistics about a memory compaction operation.

context_window: int

The model’s context window size

messages_count_after: int

Number of messages after compaction

messages_count_before: int

Number of messages before compaction

trigger: str

What triggered the compaction (e.g., ‘context_window_exceeded’, ‘post_step_context_check’)

context_tokens_after: Optional[int]

Token count after compaction (message tokens only, does not include tool definitions)

context_tokens_before: Optional[int]

Token count before compaction (from LLM usage stats, includes full context sent to LLM)

is_err: Optional[bool]

message_type: Optional[Literal["summary_message"]]

otid: Optional[str]

run_id: Optional[str]

sender_id: Optional[str]

seq_id: Optional[int]

step_id: Optional[str]

class EventMessage: …

A message for notifying the developer that an event that has occured (e.g. a compaction). Events are NOT part of the context window.

id: str

date: datetime

event_data: Dict[str, object]

event_type: Literal["compaction"]

is_err: Optional[bool]

message_type: Optional[Literal["event_message"]]

otid: Optional[str]

run_id: Optional[str]

sender_id: Optional[str]

seq_id: Optional[int]

step_id: Optional[str]

Literal["assistant", "user", "tool", 4 more]

One of the following:

"assistant"

"user"

"tool"

"function"

"system"

"approval"

"summary"

Literal["system_message", "user_message", "assistant_message", 8 more]

One of the following:

"system_message"

"user_message"

"assistant_message"

"reasoning_message"

"hidden_reasoning_message"

"tool_call_message"

"tool_return_message"

"approval_request_message"

"approval_response_message"

"summary_message"

"event_message"

class OmittedReasoningContent: …

A placeholder for reasoning content we know is present, but isn’t returned by the provider (e.g. OpenAI GPT-5 on ChatCompletions)

signature: Optional[str]

A unique identifier for this reasoning step.

type: Optional[Literal["omitted_reasoning"]]

Indicates this is an omitted reasoning step.

class ReasoningContent: …

Sent via the Anthropic Messages API

is_native: bool

Whether the reasoning content was generated by a reasoner model that processed this step.

reasoning: str

The intermediate reasoning or thought process content.

signature: Optional[str]

A unique identifier for this reasoning step.

type: Optional[Literal["reasoning"]]

Indicates this is a reasoning/intermediate step.

class ReasoningMessage: …

Representation of an agent’s internal reasoning.

id: str

date: datetime

reasoning: str

is_err: Optional[bool]

message_type: Optional[Literal["reasoning_message"]]

The type of the message.

otid: Optional[str]

run_id: Optional[str]

sender_id: Optional[str]

seq_id: Optional[int]

signature: Optional[str]

source: Optional[Literal["reasoner_model", "non_reasoner_model"]]

One of the following:

"reasoner_model"

"non_reasoner_model"

step_id: Optional[str]

class RedactedReasoningContent: …

Sent via the Anthropic Messages API

data: str

The redacted or filtered intermediate reasoning content.

type: Optional[Literal["redacted_reasoning"]]

Indicates this is a redacted thinking step.

class Run: …

Representation of a run - a conversation or processing session for an agent. Runs track when agents process messages and maintain the relationship between agents, steps, and messages.

id: str

The human-friendly ID of the Run

agent_id: str

The unique identifier of the agent associated with the run.

background: Optional[bool]

Whether the run was created in background mode.

base_template_id: Optional[str]

The base template ID that the run belongs to.

callback_error: Optional[str]

Optional error message from attempting to POST the callback endpoint.

callback_sent_at: Optional[datetime]

Timestamp when the callback was last attempted.

formatdate-time

callback_status_code: Optional[int]

HTTP status code returned by the callback endpoint.

callback_url: Optional[str]

If set, POST to this URL when the run completes.

completed_at: Optional[datetime]

The timestamp when the run was completed.

formatdate-time

conversation_id: Optional[str]

The unique identifier of the conversation associated with the run.

created_at: Optional[datetime]

The timestamp when the run was created.

formatdate-time

metadata: Optional[Dict[str, object]]

Additional metadata for the run.

request_config: Optional[RequestConfig]

The request configuration for the run.

assistant_message_tool_kwarg: Optional[str]

The name of the message argument in the designated message tool.

assistant_message_tool_name: Optional[str]

The name of the designated message tool.

include_return_message_types: Optional[List[MessageType]]

Only return specified message types in the response. If None (default) returns all messages.

One of the following:

"system_message"

"user_message"

"assistant_message"

"reasoning_message"

"hidden_reasoning_message"

"tool_call_message"

"tool_return_message"

"approval_request_message"

"approval_response_message"

"summary_message"

"event_message"

use_assistant_message: Optional[bool]

Whether the server should parse specific tool call arguments (default send_message) as AssistantMessage objects.

status: Optional[Literal["created", "running", "completed", 2 more]]

The current status of the run.

One of the following:

"created"

"running"

"completed"

"failed"

"cancelled"

stop_reason: Optional[StopReasonType]

The reason why the run was stopped.

One of the following:

"end_turn"

"error"

"llm_api_error"

"invalid_llm_response"

"invalid_tool_call"

"max_steps"

"max_tokens_exceeded"

"no_tool_call"

"tool_rule"

"cancelled"

"insufficient_credits"

"requires_approval"

"context_window_overflow_in_system_prompt"

total_duration_ns: Optional[int]

Total run duration in nanoseconds

ttft_ns: Optional[int]

Time to first token for a run in nanoseconds

class SummaryMessage: …

A message representing a summary of the conversation. Sent to the LLM as a user or system message depending on the provider.

id: str

date: datetime

summary: str

compaction_stats: Optional[CompactionStats]

Statistics about a memory compaction operation.

context_window: int

The model’s context window size

messages_count_after: int

Number of messages after compaction

messages_count_before: int

Number of messages before compaction

trigger: str

What triggered the compaction (e.g., ‘context_window_exceeded’, ‘post_step_context_check’)

context_tokens_after: Optional[int]

Token count after compaction (message tokens only, does not include tool definitions)

context_tokens_before: Optional[int]

Token count before compaction (from LLM usage stats, includes full context sent to LLM)

is_err: Optional[bool]

message_type: Optional[Literal["summary_message"]]

otid: Optional[str]

run_id: Optional[str]

sender_id: Optional[str]

seq_id: Optional[int]

step_id: Optional[str]

class SystemMessage: …

A message generated by the system. Never streamed back on a response, only used for cursor pagination.

id: str

content: str

The message content sent by the system

date: datetime

is_err: Optional[bool]

message_type: Optional[Literal["system_message"]]

The type of the message.

otid: Optional[str]

run_id: Optional[str]

sender_id: Optional[str]

seq_id: Optional[int]

step_id: Optional[str]

class TextContent: …

text: str

The text content of the message.

signature: Optional[str]

Stores a unique identifier for any reasoning associated with this text content.

type: Optional[Literal["text"]]

The type of the message.

class ToolCall: …

arguments: str

tool_call_id: str

class ToolCallContent: …

id: str

A unique identifier for this specific tool call instance.

input: Dict[str, object]

The parameters being passed to the tool, structured as a dictionary of parameter names to values.

The name of the tool being called.

signature: Optional[str]

Stores a unique identifier for any reasoning associated with this tool call.

type: Optional[Literal["tool_call"]]

Indicates this content represents a tool call event.

class ToolCallDelta: …

arguments: Optional[str]

tool_call_id: Optional[str]

class ToolCallMessage: …

A message representing a request to call a tool (generated by the LLM to trigger tool execution).

id: str

date: datetime

Deprecatedtool_call: ToolCall

One of the following:

class ToolCall: …

arguments: str

tool_call_id: str

class ToolCallDelta: …

arguments: Optional[str]

tool_call_id: Optional[str]

is_err: Optional[bool]

message_type: Optional[Literal["tool_call_message"]]

The type of the message.

otid: Optional[str]

run_id: Optional[str]

sender_id: Optional[str]

seq_id: Optional[int]

step_id: Optional[str]

tool_calls: Optional[ToolCalls]

One of the following:

List[ToolCall]

arguments: str

tool_call_id: str

class ToolCallDelta: …

arguments: Optional[str]

tool_call_id: Optional[str]

class ToolReturn: …

status: Literal["success", "error"]

One of the following:

"success"

"error"

tool_call_id: str

tool_return: Union[List[ToolReturnUnionMember0], str]

The tool return value - either a string or list of content parts (text/image)

One of the following:

List[ToolReturnUnionMember0]

One of the following:

class TextContent: …

text: str

The text content of the message.

signature: Optional[str]

Stores a unique identifier for any reasoning associated with this text content.

type: Optional[Literal["text"]]

The type of the message.

class ImageContent: …

source: Source

The source of the image.

One of the following:

class SourceURLImage: …

url: str

The URL of the image.

type: Optional[Literal["url"]]

The source type for the image.

class SourceBase64Image: …

data: str

The base64 encoded image data.

media_type: str

The media type for the image.

detail: Optional[str]

What level of detail to use when processing and understanding the image (low, high, or auto to let the model decide)

type: Optional[Literal["base64"]]

The source type for the image.

class SourceLettaImage: …

file_id: str

The unique identifier of the image file persisted in storage.

data: Optional[str]

The base64 encoded image data.

detail: Optional[str]

What level of detail to use when processing and understanding the image (low, high, or auto to let the model decide)

media_type: Optional[str]

The media type for the image.

type: Optional[Literal["letta"]]

The source type for the image.

type: Optional[Literal["image"]]

The type of the message.

str

stderr: Optional[List[str]]

stdout: Optional[List[str]]

type: Optional[Literal["tool"]]

The message type to be created.

class ToolReturnContent: …

content: str

The content returned by the tool execution.

is_error: bool

Indicates whether the tool execution resulted in an error.

tool_call_id: str

References the ID of the ToolCallContent that initiated this tool call.

type: Optional[Literal["tool_return"]]

Indicates this content represents a tool return event.

class UpdateAssistantMessage: …

content: Union[List[LettaAssistantMessageContentUnion], str]

The message content sent by the assistant (can be a string or an array of content parts)

One of the following:

List[LettaAssistantMessageContentUnion]

text: str

The text content of the message.

signature: Optional[str]

Stores a unique identifier for any reasoning associated with this text content.

type: Optional[Literal["text"]]

The type of the message.

str

message_type: Optional[Literal["assistant_message"]]

class UpdateReasoningMessage: …

reasoning: str

message_type: Optional[Literal["reasoning_message"]]

class UpdateSystemMessage: …

content: str

The message content sent by the system (can be a string or an array of multi-modal content parts)

message_type: Optional[Literal["system_message"]]

class UpdateUserMessage: …

content: Union[List[LettaUserMessageContentUnion], str]

The message content sent by the user (can be a string or an array of multi-modal content parts)

One of the following:

List[LettaUserMessageContentUnion]

One of the following:

class TextContent: …

text: str

The text content of the message.

signature: Optional[str]

Stores a unique identifier for any reasoning associated with this text content.

type: Optional[Literal["text"]]

The type of the message.

class ImageContent: …

source: Source

The source of the image.

One of the following:

class SourceURLImage: …

url: str

The URL of the image.

type: Optional[Literal["url"]]

The source type for the image.

class SourceBase64Image: …

data: str

The base64 encoded image data.

media_type: str

The media type for the image.

detail: Optional[str]

What level of detail to use when processing and understanding the image (low, high, or auto to let the model decide)

type: Optional[Literal["base64"]]

The source type for the image.

class SourceLettaImage: …

file_id: str

The unique identifier of the image file persisted in storage.

data: Optional[str]

The base64 encoded image data.

detail: Optional[str]

What level of detail to use when processing and understanding the image (low, high, or auto to let the model decide)

media_type: Optional[str]

The media type for the image.

type: Optional[Literal["letta"]]

The source type for the image.

type: Optional[Literal["image"]]

The type of the message.

str

message_type: Optional[Literal["user_message"]]

class UserMessage: …

A message sent by the user. Never streamed back on a response, only used for cursor pagination.

Args: id (str): The ID of the message date (datetime): The date the message was created in ISO format name (Optional[str]): The name of the sender of the message content (Union[str, List[LettaUserMessageContentUnion]]): The message content sent by the user (can be a string or an array of multi-modal content parts)

id: str

content: Union[List[LettaUserMessageContentUnion], str]

The message content sent by the user (can be a string or an array of multi-modal content parts)

One of the following:

List[LettaUserMessageContentUnion]

One of the following:

class TextContent: …

text: str

The text content of the message.

signature: Optional[str]

Stores a unique identifier for any reasoning associated with this text content.

type: Optional[Literal["text"]]

The type of the message.

class ImageContent: …

source: Source

The source of the image.

One of the following:

class SourceURLImage: …

url: str

The URL of the image.

type: Optional[Literal["url"]]

The source type for the image.

class SourceBase64Image: …

data: str

The base64 encoded image data.

media_type: str

The media type for the image.

detail: Optional[str]

What level of detail to use when processing and understanding the image (low, high, or auto to let the model decide)

type: Optional[Literal["base64"]]

The source type for the image.

class SourceLettaImage: …

file_id: str

The unique identifier of the image file persisted in storage.

data: Optional[str]

The base64 encoded image data.

detail: Optional[str]

What level of detail to use when processing and understanding the image (low, high, or auto to let the model decide)

media_type: Optional[str]

The media type for the image.

type: Optional[Literal["letta"]]

The source type for the image.

type: Optional[Literal["image"]]

The type of the message.

str

date: datetime

is_err: Optional[bool]

message_type: Optional[Literal["user_message"]]

The type of the message.

otid: Optional[str]

run_id: Optional[str]

sender_id: Optional[str]

seq_id: Optional[int]

step_id: Optional[str]

Dict[str, object]

AgentsSchedule

Schedule Agent Message

agents.schedule.create(, ) -> ScheduleCreateResponse

POST/v1/agents/{agent_id}/schedule

List Scheduled Agent Messages

agents.schedule.list(, ) -> ScheduleListResponse

GET/v1/agents/{agent_id}/schedule

Retrieve Scheduled Agent Message

agents.schedule.retrieve(, ) -> ScheduleRetrieveResponse

GET/v1/agents/{agent_id}/schedule/{scheduled_message_id}

Delete Scheduled Agent Message

agents.schedule.delete(, ) -> ScheduleDeleteResponse

DELETE/v1/agents/{agent_id}/schedule/{scheduled_message_id}

ModelsExpand Collapse

class ScheduleCreateResponse: …

id: str

next_scheduled_at: Optional[str]

class ScheduleListResponse: …

has_next_page: bool

scheduled_messages: List[ScheduledMessage]

id: str

agent_id: str

message: ScheduledMessageMessage

messages: List[ScheduledMessageMessageMessage]

content: Union[List[ScheduledMessageMessageMessageContentUnionMember0], str]

One of the following:

List[ScheduledMessageMessageMessageContentUnionMember0]

One of the following:

class ScheduledMessageMessageMessageContentUnionMember0UnionMember0: …

text: str

signature: Optional[str]

type: Optional[Literal["text"]]

class ScheduledMessageMessageMessageContentUnionMember0UnionMember1: …

source: ScheduledMessageMessageMessageContentUnionMember0UnionMember1Source

data: str

media_type: str

detail: Optional[str]

type: Optional[Literal["base64"]]

type: Literal["image"]

str

role: Literal["user", "assistant", "system"]

One of the following:

"user"

"assistant"

"system"

otid: Optional[str]

sender_id: Optional[str]

type: Optional[Literal["message"]]

callback_url: Optional[str]

include_return_message_types: Optional[List[Literal["system_message", "user_message", "assistant_message", 6 more]]]

One of the following:

"system_message"

"user_message"

"assistant_message"

"reasoning_message"

"hidden_reasoning_message"

"tool_call_message"

"tool_return_message"

"approval_request_message"

"approval_response_message"

max_steps: Optional[float]

next_scheduled_time: Optional[str]

schedule: ScheduledMessageSchedule

One of the following:

class ScheduledMessageScheduleUnionMember0: …

scheduled_at: float

type: Optional[Literal["one-time"]]

class ScheduledMessageScheduleUnionMember1: …

cron_expression: str

type: Literal["recurring"]

class ScheduleRetrieveResponse: …

id: str

agent_id: str

message: Message

messages: List[MessageMessage]

content: Union[List[MessageMessageContentUnionMember0], str]

One of the following:

List[MessageMessageContentUnionMember0]

One of the following:

class MessageMessageContentUnionMember0UnionMember0: …

text: str

signature: Optional[str]

type: Optional[Literal["text"]]

class MessageMessageContentUnionMember0UnionMember1: …

source: MessageMessageContentUnionMember0UnionMember1Source

data: str

media_type: str

detail: Optional[str]

type: Optional[Literal["base64"]]

type: Literal["image"]

str

role: Literal["user", "assistant", "system"]

One of the following:

"user"

"assistant"

"system"

otid: Optional[str]

sender_id: Optional[str]

type: Optional[Literal["message"]]

callback_url: Optional[str]

include_return_message_types: Optional[List[Literal["system_message", "user_message", "assistant_message", 6 more]]]

One of the following:

"system_message"

"user_message"

"assistant_message"

"reasoning_message"

"hidden_reasoning_message"

"tool_call_message"

"tool_return_message"

"approval_request_message"

"approval_response_message"

max_steps: Optional[float]

next_scheduled_time: Optional[str]

schedule: Schedule

One of the following:

class ScheduleUnionMember0: …

scheduled_at: float

type: Optional[Literal["one-time"]]

class ScheduleUnionMember1: …

cron_expression: str

type: Literal["recurring"]

class ScheduleDeleteResponse: …

success: Literal[true]

AgentsBlocks

Retrieve Block For Agent

agents.blocks.retrieve(, ) -> BlockResponse

GET/v1/agents/{agent_id}/core-memory/blocks/{block_label}

Update Block For Agent

agents.blocks.update(, ) -> BlockResponse

PATCH/v1/agents/{agent_id}/core-memory/blocks/{block_label}

List Blocks For Agent

agents.blocks.list(, ) -> SyncArrayPage[BlockResponse]

GET/v1/agents/{agent_id}/core-memory/blocks

Attach Block To Agent

agents.blocks.attach(, ) -> AgentState

PATCH/v1/agents/{agent_id}/core-memory/blocks/attach/{block_id}

Detach Block From Agent

agents.blocks.detach(, ) -> AgentState

PATCH/v1/agents/{agent_id}/core-memory/blocks/detach/{block_id}

ModelsExpand Collapse

class Block: …

A Block represents a reserved section of the LLM’s context window.

value: str

Value of the block.

id: Optional[str]

The human-friendly ID of the Block

base_template_id: Optional[str]

The base template id of the block.

created_by_id: Optional[str]

The id of the user that made this Block.

deployment_id: Optional[str]

The id of the deployment.

description: Optional[str]

Description of the block.

entity_id: Optional[str]

The id of the entity within the template.

hidden: Optional[bool]

If set to True, the block will be hidden.

is_template: Optional[bool]

Whether the block is a template (e.g. saved human/persona options).

label: Optional[str]

Label of the block (e.g. ‘human’, ‘persona’) in the context window.

last_updated_by_id: Optional[str]

The id of the user that last updated this Block.

limit: Optional[int]

Character limit of the block.

metadata: Optional[Dict[str, object]]

Metadata of the block.

preserve_on_migration: Optional[bool]

Preserve the block on template migration.

project_id: Optional[str]

The associated project id.

read_only: Optional[bool]

Whether the agent has read-only access to the block.

tags: Optional[List[str]]

The tags associated with the block.

template_id: Optional[str]

The id of the template.

template_name: Optional[str]

Name of the block if it is a template.

class BlockUpdate: …

Update a block

base_template_id: Optional[str]

The base template id of the block.

deployment_id: Optional[str]

The id of the deployment.

description: Optional[str]

Description of the block.

entity_id: Optional[str]

The id of the entity within the template.

hidden: Optional[bool]

If set to True, the block will be hidden.

is_template: Optional[bool]

Whether the block is a template (e.g. saved human/persona options).

label: Optional[str]

Label of the block (e.g. ‘human’, ‘persona’) in the context window.

limit: Optional[int]

Character limit of the block.

metadata: Optional[Dict[str, object]]

Metadata of the block.

preserve_on_migration: Optional[bool]

Preserve the block on template migration.

project_id: Optional[str]

The associated project id.

read_only: Optional[bool]

Whether the agent has read-only access to the block.

tags: Optional[List[str]]

The tags to associate with the block.

template_id: Optional[str]

The id of the template.

template_name: Optional[str]

Name of the block if it is a template.

value: Optional[str]

Value of the block.

AgentsTools

List Tools For Agent

agents.tools.list(, ) -> SyncArrayPage[Tool]

GET/v1/agents/{agent_id}/tools

Attach Tool To Agent

agents.tools.attach(, ) -> AgentState

PATCH/v1/agents/{agent_id}/tools/attach/{tool_id}

Detach Tool From Agent

agents.tools.detach(, ) -> AgentState

PATCH/v1/agents/{agent_id}/tools/detach/{tool_id}

Update Approval For Tool

agents.tools.update_approval(, ) -> AgentState

PATCH/v1/agents/{agent_id}/tools/approval/{tool_name}

Run Tool For Agent

agents.tools.run(, ) -> ToolExecutionResult

POST/v1/agents/{agent_id}/tools/{tool_name}/run

ModelsExpand Collapse

class ToolExecuteRequest: …

Request to execute a tool.

args: Optional[Dict[str, object]]

Arguments to pass to the tool

class ToolExecutionResult: …

status: Literal["success", "error"]

The status of the tool execution and return object

One of the following:

"success"

"error"

Deprecatedagent_state: Optional[AgentState]

Representation of an agent’s state. This is the state of the agent at a given time, and is persisted in the DB backend. The state has all the information needed to recreate a persisted agent.

id: str

The id of the agent. Assigned by the database.

agent_type: AgentType

The type of agent.

One of the following:

"memgpt_agent"

"memgpt_v2_agent"

"letta_v1_agent"

"react_agent"

"workflow_agent"

"split_thread_agent"

"sleeptime_agent"

"voice_convo_agent"

"voice_sleeptime_agent"

blocks: List[Block]

The memory blocks used by the agent.

value: str

Value of the block.

id: Optional[str]

The human-friendly ID of the Block

base_template_id: Optional[str]

The base template id of the block.

created_by_id: Optional[str]

The id of the user that made this Block.

deployment_id: Optional[str]

The id of the deployment.

description: Optional[str]

Description of the block.

entity_id: Optional[str]

The id of the entity within the template.

hidden: Optional[bool]

If set to True, the block will be hidden.

is_template: Optional[bool]

Whether the block is a template (e.g. saved human/persona options).

label: Optional[str]

Label of the block (e.g. ‘human’, ‘persona’) in the context window.

last_updated_by_id: Optional[str]

The id of the user that last updated this Block.

limit: Optional[int]

Character limit of the block.

metadata: Optional[Dict[str, object]]

Metadata of the block.

preserve_on_migration: Optional[bool]

Preserve the block on template migration.

project_id: Optional[str]

The associated project id.

read_only: Optional[bool]

Whether the agent has read-only access to the block.

tags: Optional[List[str]]

The tags associated with the block.

template_id: Optional[str]

The id of the template.

template_name: Optional[str]

Name of the block if it is a template.

Deprecatedllm_config: LlmConfig

Deprecated: Use model field instead. The LLM configuration used by the agent.

context_window: int

The context window size for the model.

model: str

LLM model name.

model_endpoint_type: Literal["openai", "anthropic", "google_ai", 27 more]

The endpoint type for the model.

One of the following:

"openai"

"anthropic"

"google_ai"

"google_vertex"

"azure"

"groq"

"ollama"

"webui"

"webui-legacy"

"lmstudio"

"lmstudio-legacy"

"lmstudio-chatcompletions"

"llamacpp"

"koboldcpp"

"vllm"

"hugging-face"

"minimax"

"moonshot"

"moonshot_coding"

"mistral"

"together"

"bedrock"

"deepseek"

"xai"

"zai"

"zai_coding"

"baseten"

"fireworks"

"openrouter"

"chatgpt_oauth"

compatibility_type: Optional[Literal["gguf", "mlx"]]

The framework compatibility type for the model.

One of the following:

"gguf"

"mlx"

display_name: Optional[str]

A human-friendly display name for the model.

effort: Optional[Literal["low", "medium", "high", 2 more]]

The effort level for Anthropic models that support it (Opus 4.5+). Controls token spending and thinking behavior. Not setting this gives similar performance to ‘high’.

One of the following:

"low"

"medium"

"high"

"xhigh"

"max"

enable_reasoner: Optional[bool]

Whether or not the model should use extended thinking if it is a ‘reasoning’ style model

frequency_penalty: Optional[float]

handle: Optional[str]

The handle for this config, in the format provider/model-name.

max_reasoning_tokens: Optional[int]

Configurable thinking budget for extended thinking. Used for enable_reasoner and also for Google Vertex models like Gemini 2.5 Flash. Minimum value is 1024 when used with enable_reasoner.

max_tokens: Optional[int]

The maximum number of tokens to generate. If not set, the model will use its default value.

model_endpoint: Optional[str]

The endpoint for the model.

model_wrapper: Optional[str]

The wrapper for the model.

Deprecatedparallel_tool_calls: Optional[bool]

Deprecated: Use model_settings to configure parallel tool calls instead. If set to True, enables parallel tool calling. Defaults to False.

provider_category: Optional[ProviderCategory]

The provider category for the model.

One of the following:

"base"

"byok"

provider_name: Optional[str]

The provider name for the model.

put_inner_thoughts_in_kwargs: Optional[bool]

Puts ‘inner_thoughts’ as a kwarg in the function call if this is set to True. This helps with function calling performance and also the generation of inner thoughts.

reasoning_effort: Optional[Literal["none", "minimal", "low", 3 more]]

The reasoning effort to use when generating text reasoning models

One of the following:

"none"

"minimal"

"low"

"medium"

"high"

"xhigh"

response_format: Optional[ResponseFormat]

The response format for the model’s output. Supports text, json_object, and json_schema (structured outputs). Can be set via model_settings.

One of the following:

class TextResponseFormat: …

Response format for plain text responses.

type: Optional[Literal["text"]]

The type of the response format.

class JsonSchemaResponseFormat: …

Response format for JSON schema-based responses.

json_schema: Dict[str, object]

The JSON schema of the response.

type: Optional[Literal["json_schema"]]

The type of the response format.

class JsonObjectResponseFormat: …

Response format for JSON object responses.

type: Optional[Literal["json_object"]]

The type of the response format.

return_logprobs: Optional[bool]

Whether to return log probabilities of the output tokens. Useful for RL training.

return_token_ids: Optional[bool]

Whether to return token IDs for all LLM generations via SGLang native endpoint. Required for multi-turn RL training with loss masking. Only works with SGLang provider.

strict: Optional[bool]

Enable strict mode for tool calling. When true, tool schemas include strict: true and additionalProperties: false, guaranteeing tool outputs match JSON schemas.

temperature: Optional[float]

The temperature to use when generating text with the model. A higher temperature will result in more random text.

tier: Optional[str]

The cost tier for the model (cloud only).

tool_call_parser: Optional[str]

SGLang tool call parser name (e.g. ‘glm47’, ‘qwen25’, ‘hermes’). Used by the SGLang native adapter to parse tool calls from raw model output.

top_logprobs: Optional[int]

Number of most likely tokens to return at each position (0-20). Requires return_logprobs=True.

verbosity: Optional[Literal["low", "medium", "high"]]

Soft control for how verbose model output should be, used for GPT-5 models.

One of the following:

"low"

"medium"

"high"

Deprecatedmemory: Memory

Deprecated: Use blocks field instead. The in-context memory of the agent.

blocks: List[Block]

Memory blocks contained in the agent’s in-context memory

value: str

Value of the block.

id: Optional[str]

The human-friendly ID of the Block

base_template_id: Optional[str]

The base template id of the block.

created_by_id: Optional[str]

The id of the user that made this Block.

deployment_id: Optional[str]

The id of the deployment.

description: Optional[str]

Description of the block.

entity_id: Optional[str]

The id of the entity within the template.

hidden: Optional[bool]

If set to True, the block will be hidden.

is_template: Optional[bool]

Whether the block is a template (e.g. saved human/persona options).

label: Optional[str]

Label of the block (e.g. ‘human’, ‘persona’) in the context window.

last_updated_by_id: Optional[str]

The id of the user that last updated this Block.

limit: Optional[int]

Character limit of the block.

metadata: Optional[Dict[str, object]]

Metadata of the block.

preserve_on_migration: Optional[bool]

Preserve the block on template migration.

project_id: Optional[str]

The associated project id.

read_only: Optional[bool]

Whether the agent has read-only access to the block.

tags: Optional[List[str]]

The tags associated with the block.

template_id: Optional[str]

The id of the template.

template_name: Optional[str]

Name of the block if it is a template.

agent_type: Optional[Union[AgentType, str, null]]

Agent type controlling prompt rendering.

One of the following:

Literal["memgpt_agent", "memgpt_v2_agent", "letta_v1_agent", 6 more]

One of the following:

"memgpt_agent"

"memgpt_v2_agent"

"letta_v1_agent"

"react_agent"

"workflow_agent"

"split_thread_agent"

"sleeptime_agent"

"voice_convo_agent"

"voice_sleeptime_agent"

str

file_blocks: Optional[List[MemoryFileBlock]]

Special blocks representing the agent’s in-context memory of an attached file

file_id: str

Unique identifier of the file.

is_open: bool

True if the agent currently has the file open.

Deprecatedsource_id: str

Deprecated: Use folder_id field instead. Unique identifier of the source.

value: str

Value of the block.

id: Optional[str]

The human-friendly ID of the Block

base_template_id: Optional[str]

The base template id of the block.

created_by_id: Optional[str]

The id of the user that made this Block.

deployment_id: Optional[str]

The id of the deployment.

description: Optional[str]

Description of the block.

entity_id: Optional[str]

The id of the entity within the template.

hidden: Optional[bool]

If set to True, the block will be hidden.

is_template: Optional[bool]

Whether the block is a template (e.g. saved human/persona options).

label: Optional[str]

Label of the block (e.g. ‘human’, ‘persona’) in the context window.

last_accessed_at: Optional[datetime]

UTC timestamp of the agent’s most recent access to this file. Any operations from the open, close, or search tools will update this field.

formatdate-time

last_updated_by_id: Optional[str]

The id of the user that last updated this Block.

limit: Optional[int]

Character limit of the block.

metadata: Optional[Dict[str, object]]

Metadata of the block.

preserve_on_migration: Optional[bool]

Preserve the block on template migration.

project_id: Optional[str]

The associated project id.

read_only: Optional[bool]

Whether the agent has read-only access to the block.

tags: Optional[List[str]]

The tags associated with the block.

template_id: Optional[str]

The id of the template.

template_name: Optional[str]

Name of the block if it is a template.

git_enabled: Optional[bool]

Whether this agent uses git-backed memory with structured labels.

prompt_template: Optional[str]

Deprecated. Ignored for performance.

The name of the agent.

Deprecatedsources: List[Source]

Deprecated: Use folders field instead. The sources used by the agent.

id: str

The human-friendly ID of the Source

embedding_config: EmbeddingConfig

The embedding configuration used by the source.

embedding_dim: int

The dimension of the embedding.

embedding_endpoint_type: Literal["openai", "anthropic", "bedrock", 16 more]

The endpoint type for the model.

One of the following:

"openai"

"anthropic"

"bedrock"

"google_ai"

"google_vertex"

"azure"

"groq"

"ollama"

"webui"

"webui-legacy"

"lmstudio"

"lmstudio-legacy"

"llamacpp"

"koboldcpp"

"vllm"

"hugging-face"

"mistral"

"together"

"pinecone"

embedding_model: str

The model for the embedding.

azure_deployment: Optional[str]

The Azure deployment for the model.

azure_endpoint: Optional[str]

The Azure endpoint for the model.

azure_version: Optional[str]

The Azure version for the model.

batch_size: Optional[int]

The maximum batch size for processing embeddings.

embedding_chunk_size: Optional[int]

The chunk size of the embedding.

embedding_endpoint: Optional[str]

The endpoint for the model (None if local).

handle: Optional[str]

The handle for this config, in the format provider/model-name.

The name of the source.

created_at: Optional[datetime]

The timestamp when the source was created.

formatdate-time

created_by_id: Optional[str]

The id of the user that made this Tool.

description: Optional[str]

The description of the source.

instructions: Optional[str]

Instructions for how to use the source.

last_updated_by_id: Optional[str]

The id of the user that made this Tool.

metadata: Optional[Dict[str, object]]

Metadata associated with the source.

updated_at: Optional[datetime]

The timestamp when the source was last updated.

formatdate-time

vector_db_provider: Optional[VectorDBProvider]

The vector database provider used for this source’s passages

One of the following:

"native"

"tpuf"

"pinecone"

system: str

The system prompt used by the agent.

tags: List[str]

The tags associated with the agent.

tools: List[Tool]

The tools used by the agent.

id: str

The human-friendly ID of the Tool

args_json_schema: Optional[Dict[str, object]]

The args JSON schema of the function.

created_by_id: Optional[str]

The id of the user that made this Tool.

default_requires_approval: Optional[bool]

Default value for whether or not executing this tool requires approval.

description: Optional[str]

The description of the tool.

enable_parallel_execution: Optional[bool]

If set to True, then this tool will potentially be executed concurrently with other tools. Default False.

json_schema: Optional[Dict[str, object]]

The JSON schema of the function.

last_updated_by_id: Optional[str]

The id of the user that made this Tool.

metadata: Optional[Dict[str, object]]

A dictionary of additional metadata for the tool.

The name of the function.

npm_requirements: Optional[List[NpmRequirement]]

Optional list of npm packages required by this tool.

Name of the npm package.

minLength1

version: Optional[str]

Optional version of the package, following semantic versioning.

pip_requirements: Optional[List[PipRequirement]]

Optional list of pip packages required by this tool.

Name of the pip package.

minLength1

version: Optional[str]

Optional version of the package, following semantic versioning.

project_id: Optional[str]

The project id of the tool.

return_char_limit: Optional[int]

The maximum number of characters in the response.

maximum1000000

minimum1

source_code: Optional[str]

The source code of the function.

source_type: Optional[str]

The type of the source code.

tags: Optional[List[str]]

Metadata tags.

tool_type: Optional[ToolType]

The type of the tool.

One of the following:

"custom"

"letta_core"

"letta_memory_core"

"letta_multi_agent_core"

"letta_sleeptime_core"

"letta_voice_sleeptime_core"

"letta_builtin"

"letta_files_core"

"external_langchain"

"external_composio"

"external_mcp"

base_template_id: Optional[str]

The base template id of the agent.

compaction_settings: Optional[CompactionSettings]

Configuration for conversation compaction / summarization.

Per-model settings (temperature, max tokens, etc.) are derived from the default configuration for that handle.

clip_chars: Optional[int]

The maximum length of the summary in characters. If none, no clipping is performed.

mode: Optional[Literal["all", "sliding_window", "self_compact_all", "self_compact_sliding_window"]]

The type of summarization technique use.

One of the following:

"all"

"sliding_window"

"self_compact_all"

"self_compact_sliding_window"

model: Optional[str]

Model handle to use for sliding_window/all summarization (format: provider/model-name). If None, uses lightweight provider-specific defaults.

model_settings: Optional[CompactionSettingsModelSettings]

Optional model settings used to override defaults for the summarizer model.

One of the following:

class OpenAIModelSettings: …

max_output_tokens: Optional[int]

The maximum number of tokens the model can generate.

parallel_tool_calls: Optional[bool]

Whether to enable parallel tool calling.

provider_type: Optional[Literal["openai"]]

The type of the provider.

reasoning: Optional[Reasoning]

The reasoning configuration for the model.

reasoning_effort: Optional[Literal["none", "minimal", "low", 3 more]]

The reasoning effort to use when generating text reasoning models

One of the following:

"none"

"minimal"

"low"

"medium"

"high"

"xhigh"

response_format: Optional[ResponseFormat]

The response format for the model.

One of the following:

class TextResponseFormat: …

Response format for plain text responses.

type: Optional[Literal["text"]]

The type of the response format.

class JsonSchemaResponseFormat: …

Response format for JSON schema-based responses.

json_schema: Dict[str, object]

The JSON schema of the response.

type: Optional[Literal["json_schema"]]

The type of the response format.

class JsonObjectResponseFormat: …

Response format for JSON object responses.

type: Optional[Literal["json_object"]]

The type of the response format.

strict: Optional[bool]

Enable strict mode for tool calling. When true, tool outputs are guaranteed to match JSON schemas.

temperature: Optional[float]

The temperature of the model.

class CompactionSettingsModelSettingsSgLangModelSettings: …

SGLang model configuration (OpenAI-compatible runtime with SGLang-specific parsing).

max_output_tokens: Optional[int]

The maximum number of tokens the model can generate.

parallel_tool_calls: Optional[bool]

Whether to enable parallel tool calling.

provider_type: Optional[Literal["sglang"]]

The type of the provider.

reasoning: Optional[CompactionSettingsModelSettingsSgLangModelSettingsReasoning]

The reasoning configuration for the model.

reasoning_effort: Optional[Literal["none", "minimal", "low", 3 more]]

The reasoning effort to use when generating text reasoning models

One of the following:

"none"

"minimal"

"low"

"medium"

"high"

"xhigh"

response_format: Optional[CompactionSettingsModelSettingsSgLangModelSettingsResponseFormat]

The response format for the model.

One of the following:

class TextResponseFormat: …

Response format for plain text responses.

type: Optional[Literal["text"]]

The type of the response format.

class JsonSchemaResponseFormat: …

Response format for JSON schema-based responses.

json_schema: Dict[str, object]

The JSON schema of the response.

type: Optional[Literal["json_schema"]]

The type of the response format.

class JsonObjectResponseFormat: …

Response format for JSON object responses.

type: Optional[Literal["json_object"]]

The type of the response format.

strict: Optional[bool]

Enable strict mode for tool calling. When true, tool outputs are guaranteed to match JSON schemas.

temperature: Optional[float]

The temperature of the model.

tool_call_parser: Optional[str]

SGLang tool call parser name (for example ‘glm47’, ‘qwen25’, or ‘hermes’).

class AnthropicModelSettings: …

effort: Optional[Literal["low", "medium", "high", 2 more]]

Effort level for supported Anthropic models (controls token spending). ‘xhigh’ and ‘max’ are available on Opus 4.6+. Not setting this gives similar performance to ‘high’.

One of the following:

"low"

"medium"

"high"

"xhigh"

"max"

max_output_tokens: Optional[int]

The maximum number of tokens the model can generate.

parallel_tool_calls: Optional[bool]

Whether to enable parallel tool calling.

provider_type: Optional[Literal["anthropic"]]

The type of the provider.

response_format: Optional[ResponseFormat]

The response format for the model.

One of the following:

class TextResponseFormat: …

Response format for plain text responses.

type: Optional[Literal["text"]]

The type of the response format.

class JsonSchemaResponseFormat: …

Response format for JSON schema-based responses.

json_schema: Dict[str, object]

The JSON schema of the response.

type: Optional[Literal["json_schema"]]

The type of the response format.

class JsonObjectResponseFormat: …

Response format for JSON object responses.

type: Optional[Literal["json_object"]]

The type of the response format.

strict: Optional[bool]

Enable strict mode for tool calling. When true, tool outputs are guaranteed to match JSON schemas.

temperature: Optional[float]

The temperature of the model.

thinking: Optional[Thinking]

The thinking configuration for the model.

budget_tokens: Optional[int]

The maximum number of tokens the model can use for extended thinking.

type: Optional[Literal["enabled", "disabled"]]

The type of thinking to use.

One of the following:

"enabled"

"disabled"

verbosity: Optional[Literal["low", "medium", "high"]]

Soft control for how verbose model output should be, used for GPT-5 models.

One of the following:

"low"

"medium"

"high"

class GoogleAIModelSettings: …

max_output_tokens: Optional[int]

The maximum number of tokens the model can generate.

parallel_tool_calls: Optional[bool]

Whether to enable parallel tool calling.

provider_type: Optional[Literal["google_ai"]]

The type of the provider.

response_schema: Optional[ResponseSchema]

The response schema for the model.

One of the following:

class TextResponseFormat: …

Response format for plain text responses.

type: Optional[Literal["text"]]

The type of the response format.

class JsonSchemaResponseFormat: …

Response format for JSON schema-based responses.

json_schema: Dict[str, object]

The JSON schema of the response.

type: Optional[Literal["json_schema"]]

The type of the response format.

class JsonObjectResponseFormat: …

Response format for JSON object responses.

type: Optional[Literal["json_object"]]

The type of the response format.

temperature: Optional[float]

The temperature of the model.

thinking_config: Optional[ThinkingConfig]

The thinking configuration for the model.

include_thoughts: Optional[bool]

Whether to include thoughts in the model’s response.

thinking_budget: Optional[int]

The thinking budget for the model.

class GoogleVertexModelSettings: …

max_output_tokens: Optional[int]

The maximum number of tokens the model can generate.

parallel_tool_calls: Optional[bool]

Whether to enable parallel tool calling.

provider_type: Optional[Literal["google_vertex"]]

The type of the provider.

response_schema: Optional[ResponseSchema]

The response schema for the model.

One of the following:

class TextResponseFormat: …

Response format for plain text responses.

type: Optional[Literal["text"]]

The type of the response format.

class JsonSchemaResponseFormat: …

Response format for JSON schema-based responses.

json_schema: Dict[str, object]

The JSON schema of the response.

type: Optional[Literal["json_schema"]]

The type of the response format.

class JsonObjectResponseFormat: …

Response format for JSON object responses.

type: Optional[Literal["json_object"]]

The type of the response format.

temperature: Optional[float]

The temperature of the model.

thinking_config: Optional[ThinkingConfig]

The thinking configuration for the model.

include_thoughts: Optional[bool]

Whether to include thoughts in the model’s response.

thinking_budget: Optional[int]

The thinking budget for the model.

class AzureModelSettings: …

Azure OpenAI model configuration (OpenAI-compatible).

max_output_tokens: Optional[int]

The maximum number of tokens the model can generate.

parallel_tool_calls: Optional[bool]

Whether to enable parallel tool calling.

provider_type: Optional[Literal["azure"]]

The type of the provider.

response_format: Optional[ResponseFormat]

The response format for the model.

One of the following:

class TextResponseFormat: …

Response format for plain text responses.

type: Optional[Literal["text"]]

The type of the response format.

class JsonSchemaResponseFormat: …

Response format for JSON schema-based responses.

json_schema: Dict[str, object]

The JSON schema of the response.

type: Optional[Literal["json_schema"]]

The type of the response format.

class JsonObjectResponseFormat: …

Response format for JSON object responses.

type: Optional[Literal["json_object"]]

The type of the response format.

temperature: Optional[float]

The temperature of the model.

class XaiModelSettings: …

xAI model configuration (OpenAI-compatible).

max_output_tokens: Optional[int]

The maximum number of tokens the model can generate.

parallel_tool_calls: Optional[bool]

Whether to enable parallel tool calling.

provider_type: Optional[Literal["xai"]]

The type of the provider.

response_format: Optional[ResponseFormat]

The response format for the model.

One of the following:

class TextResponseFormat: …

Response format for plain text responses.

type: Optional[Literal["text"]]

The type of the response format.

class JsonSchemaResponseFormat: …

Response format for JSON schema-based responses.

json_schema: Dict[str, object]

The JSON schema of the response.

type: Optional[Literal["json_schema"]]

The type of the response format.

class JsonObjectResponseFormat: …

Response format for JSON object responses.

type: Optional[Literal["json_object"]]

The type of the response format.

temperature: Optional[float]

The temperature of the model.

class CompactionSettingsModelSettingsMoonshotModelSettings: …

Moonshot/Kimi model configuration (OpenAI-compatible).

max_output_tokens: Optional[int]

The maximum number of tokens the model can generate.

parallel_tool_calls: Optional[bool]

Whether to enable parallel tool calling.

provider_type: Optional[Literal["moonshot"]]

The type of the provider.

response_format: Optional[CompactionSettingsModelSettingsMoonshotModelSettingsResponseFormat]

The response format for the model.

One of the following:

class TextResponseFormat: …

Response format for plain text responses.

type: Optional[Literal["text"]]

The type of the response format.

class JsonSchemaResponseFormat: …

Response format for JSON schema-based responses.

json_schema: Dict[str, object]

The JSON schema of the response.

type: Optional[Literal["json_schema"]]

The type of the response format.

class JsonObjectResponseFormat: …

Response format for JSON object responses.

type: Optional[Literal["json_object"]]

The type of the response format.

strict: Optional[bool]

Enable strict mode for tool calling. When true, tool outputs are guaranteed to match JSON schemas.

temperature: Optional[float]

The temperature of the model.

class CompactionSettingsModelSettingsZaiModelSettings: …

Z.ai (ZhipuAI) model configuration (OpenAI-compatible).

max_output_tokens: Optional[int]

The maximum number of tokens the model can generate.

parallel_tool_calls: Optional[bool]

Whether to enable parallel tool calling.

provider_type: Optional[Literal["zai"]]

The type of the provider.

response_format: Optional[CompactionSettingsModelSettingsZaiModelSettingsResponseFormat]

The response format for the model.

One of the following:

class TextResponseFormat: …

Response format for plain text responses.

type: Optional[Literal["text"]]

The type of the response format.

class JsonSchemaResponseFormat: …

Response format for JSON schema-based responses.

json_schema: Dict[str, object]

The JSON schema of the response.

type: Optional[Literal["json_schema"]]

The type of the response format.

class JsonObjectResponseFormat: …

Response format for JSON object responses.

type: Optional[Literal["json_object"]]

The type of the response format.

temperature: Optional[float]

The temperature of the model.

thinking: Optional[CompactionSettingsModelSettingsZaiModelSettingsThinking]

The thinking configuration for GLM-4.5+ models.

clear_thinking: Optional[bool]

If False, preserved thinking is used (recommended for agents).

type: Optional[Literal["enabled", "disabled"]]

Whether thinking is enabled or disabled.

One of the following:

"enabled"

"disabled"

class CompactionSettingsModelSettingsMoonshotCodingModelSettings: …

Kimi Code model configuration (Anthropic-compatible).

effort: Optional[Literal["low", "medium", "high", 2 more]]

Effort level for supported Anthropic models (controls token spending). ‘xhigh’ and ‘max’ are available on Opus 4.6+. Not setting this gives similar performance to ‘high’.

One of the following:

"low"

"medium"

"high"

"xhigh"

"max"

max_output_tokens: Optional[int]

The maximum number of tokens the model can generate.

parallel_tool_calls: Optional[bool]

Whether to enable parallel tool calling.

provider_type: Optional[Literal["moonshot_coding"]]

The type of the provider.

response_format: Optional[CompactionSettingsModelSettingsMoonshotCodingModelSettingsResponseFormat]

The response format for the model.

One of the following:

class TextResponseFormat: …

Response format for plain text responses.

type: Optional[Literal["text"]]

The type of the response format.

class JsonSchemaResponseFormat: …

Response format for JSON schema-based responses.

json_schema: Dict[str, object]

The JSON schema of the response.

type: Optional[Literal["json_schema"]]

The type of the response format.

class JsonObjectResponseFormat: …

Response format for JSON object responses.

type: Optional[Literal["json_object"]]

The type of the response format.

strict: Optional[bool]

Enable strict mode for tool calling. When true, tool outputs are guaranteed to match JSON schemas.

temperature: Optional[float]

The temperature of the model.

thinking: Optional[CompactionSettingsModelSettingsMoonshotCodingModelSettingsThinking]

The thinking configuration for the model.

budget_tokens: Optional[int]

The maximum number of tokens the model can use for extended thinking.

type: Optional[Literal["enabled", "disabled"]]

The type of thinking to use.

One of the following:

"enabled"

"disabled"

verbosity: Optional[Literal["low", "medium", "high"]]

Soft control for how verbose model output should be, used for GPT-5 models.

One of the following:

"low"

"medium"

"high"

class GroqModelSettings: …

Groq model configuration (OpenAI-compatible).

max_output_tokens: Optional[int]

The maximum number of tokens the model can generate.

parallel_tool_calls: Optional[bool]

Whether to enable parallel tool calling.

provider_type: Optional[Literal["groq"]]

The type of the provider.

response_format: Optional[ResponseFormat]

The response format for the model.

One of the following:

class TextResponseFormat: …

Response format for plain text responses.

type: Optional[Literal["text"]]

The type of the response format.

class JsonSchemaResponseFormat: …

Response format for JSON schema-based responses.

json_schema: Dict[str, object]

The JSON schema of the response.

type: Optional[Literal["json_schema"]]

The type of the response format.

class JsonObjectResponseFormat: …

Response format for JSON object responses.

type: Optional[Literal["json_object"]]

The type of the response format.

temperature: Optional[float]

The temperature of the model.

class DeepseekModelSettings: …

Deepseek model configuration (OpenAI-compatible).

max_output_tokens: Optional[int]

The maximum number of tokens the model can generate.

parallel_tool_calls: Optional[bool]

Whether to enable parallel tool calling.

provider_type: Optional[Literal["deepseek"]]

The type of the provider.

response_format: Optional[ResponseFormat]

The response format for the model.

One of the following:

class TextResponseFormat: …

Response format for plain text responses.

type: Optional[Literal["text"]]

The type of the response format.

class JsonSchemaResponseFormat: …

Response format for JSON schema-based responses.

json_schema: Dict[str, object]

The JSON schema of the response.

type: Optional[Literal["json_schema"]]

The type of the response format.

class JsonObjectResponseFormat: …

Response format for JSON object responses.

type: Optional[Literal["json_object"]]

The type of the response format.

temperature: Optional[float]

The temperature of the model.

class TogetherModelSettings: …

Together AI model configuration (OpenAI-compatible).

max_output_tokens: Optional[int]

The maximum number of tokens the model can generate.

parallel_tool_calls: Optional[bool]

Whether to enable parallel tool calling.

provider_type: Optional[Literal["together"]]

The type of the provider.

response_format: Optional[ResponseFormat]

The response format for the model.

One of the following:

class TextResponseFormat: …

Response format for plain text responses.

type: Optional[Literal["text"]]

The type of the response format.

class JsonSchemaResponseFormat: …

Response format for JSON schema-based responses.

json_schema: Dict[str, object]

The JSON schema of the response.

type: Optional[Literal["json_schema"]]

The type of the response format.

class JsonObjectResponseFormat: …

Response format for JSON object responses.

type: Optional[Literal["json_object"]]

The type of the response format.

temperature: Optional[float]

The temperature of the model.

class BedrockModelSettings: …

AWS Bedrock model configuration.

max_output_tokens: Optional[int]

The maximum number of tokens the model can generate.

parallel_tool_calls: Optional[bool]

Whether to enable parallel tool calling.

provider_type: Optional[Literal["bedrock"]]

The type of the provider.

response_format: Optional[ResponseFormat]

The response format for the model.

One of the following:

class TextResponseFormat: …

Response format for plain text responses.

type: Optional[Literal["text"]]

The type of the response format.

class JsonSchemaResponseFormat: …

Response format for JSON schema-based responses.

json_schema: Dict[str, object]

The JSON schema of the response.

type: Optional[Literal["json_schema"]]

The type of the response format.

class JsonObjectResponseFormat: …

Response format for JSON object responses.

type: Optional[Literal["json_object"]]

The type of the response format.

temperature: Optional[float]

The temperature of the model.

class CompactionSettingsModelSettingsBasetenModelSettings: …

Baseten model configuration (OpenAI-compatible).

max_output_tokens: Optional[int]

The maximum number of tokens the model can generate.

parallel_tool_calls: Optional[bool]

Whether to enable parallel tool calling.

provider_type: Optional[Literal["baseten"]]

The type of the provider.

temperature: Optional[float]

The temperature of the model.

class CompactionSettingsModelSettingsOpenRouterModelSettings: …

OpenRouter model configuration (OpenAI-compatible).

max_output_tokens: Optional[int]

The maximum number of tokens the model can generate.

parallel_tool_calls: Optional[bool]

Whether to enable parallel tool calling.

provider_type: Optional[Literal["openrouter"]]

The type of the provider.

response_format: Optional[CompactionSettingsModelSettingsOpenRouterModelSettingsResponseFormat]

The response format for the model.

One of the following:

class TextResponseFormat: …

Response format for plain text responses.

type: Optional[Literal["text"]]

The type of the response format.

class JsonSchemaResponseFormat: …

Response format for JSON schema-based responses.

json_schema: Dict[str, object]

The JSON schema of the response.

type: Optional[Literal["json_schema"]]

The type of the response format.

class JsonObjectResponseFormat: …

Response format for JSON object responses.

type: Optional[Literal["json_object"]]

The type of the response format.

temperature: Optional[float]

The temperature of the model.

class CompactionSettingsModelSettingsChatGptoAuthModelSettings: …

ChatGPT OAuth model configuration (uses ChatGPT backend API).

max_output_tokens: Optional[int]

The maximum number of tokens the model can generate.

parallel_tool_calls: Optional[bool]

Whether to enable parallel tool calling.

provider_type: Optional[Literal["chatgpt_oauth"]]

The type of the provider.

reasoning: Optional[CompactionSettingsModelSettingsChatGptoAuthModelSettingsReasoning]

The reasoning configuration for the model.

reasoning_effort: Optional[Literal["none", "low", "medium", 2 more]]

The reasoning effort level for GPT-5.x and o-series models.

One of the following:

"none"

"low"

"medium"

"high"

"xhigh"

temperature: Optional[float]

The temperature of the model.

prompt: Optional[str]

The prompt to use for summarization. If None, uses mode-specific default.

prompt_acknowledgement: Optional[bool]

Whether to include an acknowledgement post-prompt (helps prevent non-summary outputs).

sliding_window_percentage: Optional[float]

The percentage of the context window to keep post-summarization (only used in sliding window modes).

created_at: Optional[datetime]

The timestamp when the object was created.

formatdate-time

created_by_id: Optional[str]

The id of the user that made this object.

deployment_id: Optional[str]

The id of the deployment.

description: Optional[str]

The description of the agent.

embedding: Optional[str]

The embedding model handle used by the agent (format: provider/model-name).

Deprecatedembedding_config: Optional[EmbeddingConfig]

Configuration for embedding model connection and processing parameters.

embedding_dim: int

The dimension of the embedding.

embedding_endpoint_type: Literal["openai", "anthropic", "bedrock", 16 more]

The endpoint type for the model.

One of the following:

"openai"

"anthropic"

"bedrock"

"google_ai"

"google_vertex"

"azure"

"groq"

"ollama"

"webui"

"webui-legacy"

"lmstudio"

"lmstudio-legacy"

"llamacpp"

"koboldcpp"

"vllm"

"hugging-face"

"mistral"

"together"

"pinecone"

embedding_model: str

The model for the embedding.

azure_deployment: Optional[str]

The Azure deployment for the model.

azure_endpoint: Optional[str]

The Azure endpoint for the model.

azure_version: Optional[str]

The Azure version for the model.

batch_size: Optional[int]

The maximum batch size for processing embeddings.

embedding_chunk_size: Optional[int]

The chunk size of the embedding.

embedding_endpoint: Optional[str]

The endpoint for the model (None if local).

handle: Optional[str]

The handle for this config, in the format provider/model-name.

enable_sleeptime: Optional[bool]

If set to True, memory management will move to a background agent thread.

entity_id: Optional[str]

The id of the entity within the template.

hidden: Optional[bool]

If set to True, the agent will be hidden.

identities: Optional[List[Identity]]

The identities associated with this agent.

id: str

The human-friendly ID of the Identity

Deprecatedagent_ids: List[str]

The IDs of the agents associated with the identity.

Deprecatedblock_ids: List[str]

The IDs of the blocks associated with the identity.

identifier_key: str

External, user-generated identifier key of the identity.

identity_type: Literal["org", "user", "other"]

The type of the identity.

One of the following:

"org"

"user"

"other"

The name of the identity.

project_id: Optional[str]

The project id of the identity, if applicable.

properties: Optional[List[IdentityProperty]]

List of properties associated with the identity

key: str

The key of the property

type: Literal["string", "number", "boolean", "json"]

The type of the property

One of the following:

"string"

"number"

"boolean"

"json"

value: Union[str, float, bool, Dict[str, object]]

The value of the property

One of the following:

str

float

bool

Dict[str, object]

Deprecatedidentity_ids: Optional[List[str]]

Deprecated: Use identities field instead. The ids of the identities associated with this agent.

last_run_completion: Optional[datetime]

The timestamp when the agent last completed a run.

formatdate-time

last_run_duration_ms: Optional[int]

The duration in milliseconds of the agent’s last run.

last_stop_reason: Optional[StopReasonType]

The stop reason from the agent’s last run.

One of the following:

"end_turn"

"error"

"llm_api_error"

"invalid_llm_response"

"invalid_tool_call"

"max_steps"

"max_tokens_exceeded"

"no_tool_call"

"tool_rule"

"cancelled"

"insufficient_credits"

"requires_approval"

"context_window_overflow_in_system_prompt"

last_updated_by_id: Optional[str]

The id of the user that made this object.

managed_group: Optional[ManagedGroup]

The multi-agent group that this agent manages

id: str

The id of the group. Assigned by the database.

agent_ids: List[str]

description: str

manager_type: Literal["round_robin", "supervisor", "dynamic", 3 more]

One of the following:

"round_robin"

"supervisor"

"dynamic"

"sleeptime"

"voice_sleeptime"

"swarm"

base_template_id: Optional[str]

The base template id.

deployment_id: Optional[str]

The id of the deployment.

hidden: Optional[bool]

If set to True, the group will be hidden.

last_processed_message_id: Optional[str]

manager_agent_id: Optional[str]

max_message_buffer_length: Optional[int]

The desired maximum length of messages in the context window of the convo agent. This is a best effort, and may be off slightly due to user/assistant interleaving.

max_turns: Optional[int]

min_message_buffer_length: Optional[int]

The desired minimum length of messages in the context window of the convo agent. This is a best effort, and may be off-by-one due to user/assistant interleaving.

project_id: Optional[str]

The associated project id.

Deprecatedshared_block_ids: Optional[List[str]]

sleeptime_agent_frequency: Optional[int]

template_id: Optional[str]

The id of the template.

termination_token: Optional[str]

turns_counter: Optional[int]

max_files_open: Optional[int]

Maximum number of files that can be open at once for this agent. Setting this too high may exceed the context window, which will break the agent.

message_buffer_autoclear: Optional[bool]

message_ids: Optional[List[str]]

The ids of the messages in the agent’s in-context memory.

metadata: Optional[Dict[str, object]]

The metadata of the agent.

model: Optional[str]

The model handle used by the agent (format: provider/model-name).

model_settings: Optional[ModelSettings]

The model settings used by the agent.

One of the following:

class OpenAIModelSettings: …

max_output_tokens: Optional[int]

The maximum number of tokens the model can generate.

parallel_tool_calls: Optional[bool]

Whether to enable parallel tool calling.

provider_type: Optional[Literal["openai"]]

The type of the provider.

reasoning: Optional[Reasoning]

The reasoning configuration for the model.

reasoning_effort: Optional[Literal["none", "minimal", "low", 3 more]]

The reasoning effort to use when generating text reasoning models

One of the following:

"none"

"minimal"

"low"

"medium"

"high"

"xhigh"

response_format: Optional[ResponseFormat]

The response format for the model.

One of the following:

class TextResponseFormat: …

Response format for plain text responses.

type: Optional[Literal["text"]]

The type of the response format.

class JsonSchemaResponseFormat: …

Response format for JSON schema-based responses.

json_schema: Dict[str, object]

The JSON schema of the response.

type: Optional[Literal["json_schema"]]

The type of the response format.

class JsonObjectResponseFormat: …

Response format for JSON object responses.

type: Optional[Literal["json_object"]]

The type of the response format.

strict: Optional[bool]

Enable strict mode for tool calling. When true, tool outputs are guaranteed to match JSON schemas.

temperature: Optional[float]

The temperature of the model.

class ModelSettingsSgLangModelSettings: …

SGLang model configuration (OpenAI-compatible runtime with SGLang-specific parsing).

max_output_tokens: Optional[int]

The maximum number of tokens the model can generate.

parallel_tool_calls: Optional[bool]

Whether to enable parallel tool calling.

provider_type: Optional[Literal["sglang"]]

The type of the provider.

reasoning: Optional[ModelSettingsSgLangModelSettingsReasoning]

The reasoning configuration for the model.

reasoning_effort: Optional[Literal["none", "minimal", "low", 3 more]]

The reasoning effort to use when generating text reasoning models

One of the following:

"none"

"minimal"

"low"

"medium"

"high"

"xhigh"

response_format: Optional[ModelSettingsSgLangModelSettingsResponseFormat]

The response format for the model.

One of the following:

class TextResponseFormat: …

Response format for plain text responses.

type: Optional[Literal["text"]]

The type of the response format.

class JsonSchemaResponseFormat: …

Response format for JSON schema-based responses.

json_schema: Dict[str, object]

The JSON schema of the response.

type: Optional[Literal["json_schema"]]

The type of the response format.

class JsonObjectResponseFormat: …

Response format for JSON object responses.

type: Optional[Literal["json_object"]]

The type of the response format.

strict: Optional[bool]

Enable strict mode for tool calling. When true, tool outputs are guaranteed to match JSON schemas.

temperature: Optional[float]

The temperature of the model.

tool_call_parser: Optional[str]

SGLang tool call parser name (for example ‘glm47’, ‘qwen25’, or ‘hermes’).

class AnthropicModelSettings: …

effort: Optional[Literal["low", "medium", "high", 2 more]]

Effort level for supported Anthropic models (controls token spending). ‘xhigh’ and ‘max’ are available on Opus 4.6+. Not setting this gives similar performance to ‘high’.

One of the following:

"low"

"medium"

"high"

"xhigh"

"max"

max_output_tokens: Optional[int]

The maximum number of tokens the model can generate.

parallel_tool_calls: Optional[bool]

Whether to enable parallel tool calling.

provider_type: Optional[Literal["anthropic"]]

The type of the provider.

response_format: Optional[ResponseFormat]

The response format for the model.

One of the following:

class TextResponseFormat: …

Response format for plain text responses.

type: Optional[Literal["text"]]

The type of the response format.

class JsonSchemaResponseFormat: …

Response format for JSON schema-based responses.

json_schema: Dict[str, object]

The JSON schema of the response.

type: Optional[Literal["json_schema"]]

The type of the response format.

class JsonObjectResponseFormat: …

Response format for JSON object responses.

type: Optional[Literal["json_object"]]

The type of the response format.

strict: Optional[bool]

Enable strict mode for tool calling. When true, tool outputs are guaranteed to match JSON schemas.

temperature: Optional[float]

The temperature of the model.

thinking: Optional[Thinking]

The thinking configuration for the model.

budget_tokens: Optional[int]

The maximum number of tokens the model can use for extended thinking.

type: Optional[Literal["enabled", "disabled"]]

The type of thinking to use.

One of the following:

"enabled"

"disabled"

verbosity: Optional[Literal["low", "medium", "high"]]

Soft control for how verbose model output should be, used for GPT-5 models.

One of the following:

"low"

"medium"

"high"

class GoogleAIModelSettings: …

max_output_tokens: Optional[int]

The maximum number of tokens the model can generate.

parallel_tool_calls: Optional[bool]

Whether to enable parallel tool calling.

provider_type: Optional[Literal["google_ai"]]

The type of the provider.

response_schema: Optional[ResponseSchema]

The response schema for the model.

One of the following:

class TextResponseFormat: …

Response format for plain text responses.

type: Optional[Literal["text"]]

The type of the response format.

class JsonSchemaResponseFormat: …

Response format for JSON schema-based responses.

json_schema: Dict[str, object]

The JSON schema of the response.

type: Optional[Literal["json_schema"]]

The type of the response format.

class JsonObjectResponseFormat: …

Response format for JSON object responses.

type: Optional[Literal["json_object"]]

The type of the response format.

temperature: Optional[float]

The temperature of the model.

thinking_config: Optional[ThinkingConfig]

The thinking configuration for the model.

include_thoughts: Optional[bool]

Whether to include thoughts in the model’s response.

thinking_budget: Optional[int]

The thinking budget for the model.

class GoogleVertexModelSettings: …

max_output_tokens: Optional[int]

The maximum number of tokens the model can generate.

parallel_tool_calls: Optional[bool]

Whether to enable parallel tool calling.

provider_type: Optional[Literal["google_vertex"]]

The type of the provider.

response_schema: Optional[ResponseSchema]

The response schema for the model.

One of the following:

class TextResponseFormat: …

Response format for plain text responses.

type: Optional[Literal["text"]]

The type of the response format.

class JsonSchemaResponseFormat: …

Response format for JSON schema-based responses.

json_schema: Dict[str, object]

The JSON schema of the response.

type: Optional[Literal["json_schema"]]

The type of the response format.

class JsonObjectResponseFormat: …

Response format for JSON object responses.

type: Optional[Literal["json_object"]]

The type of the response format.

temperature: Optional[float]

The temperature of the model.

thinking_config: Optional[ThinkingConfig]

The thinking configuration for the model.

include_thoughts: Optional[bool]

Whether to include thoughts in the model’s response.

thinking_budget: Optional[int]

The thinking budget for the model.

class AzureModelSettings: …

Azure OpenAI model configuration (OpenAI-compatible).

max_output_tokens: Optional[int]

The maximum number of tokens the model can generate.

parallel_tool_calls: Optional[bool]

Whether to enable parallel tool calling.

provider_type: Optional[Literal["azure"]]

The type of the provider.

response_format: Optional[ResponseFormat]

The response format for the model.

One of the following:

class TextResponseFormat: …

Response format for plain text responses.

type: Optional[Literal["text"]]

The type of the response format.

class JsonSchemaResponseFormat: …

Response format for JSON schema-based responses.

json_schema: Dict[str, object]

The JSON schema of the response.

type: Optional[Literal["json_schema"]]

The type of the response format.

class JsonObjectResponseFormat: …

Response format for JSON object responses.

type: Optional[Literal["json_object"]]

The type of the response format.

temperature: Optional[float]

The temperature of the model.

class XaiModelSettings: …

xAI model configuration (OpenAI-compatible).

max_output_tokens: Optional[int]

The maximum number of tokens the model can generate.

parallel_tool_calls: Optional[bool]

Whether to enable parallel tool calling.

provider_type: Optional[Literal["xai"]]

The type of the provider.

response_format: Optional[ResponseFormat]

The response format for the model.

One of the following:

class TextResponseFormat: …

Response format for plain text responses.

type: Optional[Literal["text"]]

The type of the response format.

class JsonSchemaResponseFormat: …

Response format for JSON schema-based responses.

json_schema: Dict[str, object]

The JSON schema of the response.

type: Optional[Literal["json_schema"]]

The type of the response format.

class JsonObjectResponseFormat: …

Response format for JSON object responses.

type: Optional[Literal["json_object"]]

The type of the response format.

temperature: Optional[float]

The temperature of the model.

class ModelSettingsMoonshotModelSettings: …

Moonshot/Kimi model configuration (OpenAI-compatible).

max_output_tokens: Optional[int]

The maximum number of tokens the model can generate.

parallel_tool_calls: Optional[bool]

Whether to enable parallel tool calling.

provider_type: Optional[Literal["moonshot"]]

The type of the provider.

response_format: Optional[ModelSettingsMoonshotModelSettingsResponseFormat]

The response format for the model.

One of the following:

class TextResponseFormat: …

Response format for plain text responses.

type: Optional[Literal["text"]]

The type of the response format.

class JsonSchemaResponseFormat: …

Response format for JSON schema-based responses.

json_schema: Dict[str, object]

The JSON schema of the response.

type: Optional[Literal["json_schema"]]

The type of the response format.

class JsonObjectResponseFormat: …

Response format for JSON object responses.

type: Optional[Literal["json_object"]]

The type of the response format.

strict: Optional[bool]

Enable strict mode for tool calling. When true, tool outputs are guaranteed to match JSON schemas.

temperature: Optional[float]

The temperature of the model.

class ModelSettingsZaiModelSettings: …

Z.ai (ZhipuAI) model configuration (OpenAI-compatible).

max_output_tokens: Optional[int]

The maximum number of tokens the model can generate.

parallel_tool_calls: Optional[bool]

Whether to enable parallel tool calling.

provider_type: Optional[Literal["zai"]]

The type of the provider.

response_format: Optional[ModelSettingsZaiModelSettingsResponseFormat]

The response format for the model.

One of the following:

class TextResponseFormat: …

Response format for plain text responses.

type: Optional[Literal["text"]]

The type of the response format.

class JsonSchemaResponseFormat: …

Response format for JSON schema-based responses.

json_schema: Dict[str, object]

The JSON schema of the response.

type: Optional[Literal["json_schema"]]

The type of the response format.

class JsonObjectResponseFormat: …

Response format for JSON object responses.

type: Optional[Literal["json_object"]]

The type of the response format.

temperature: Optional[float]

The temperature of the model.

thinking: Optional[ModelSettingsZaiModelSettingsThinking]

The thinking configuration for GLM-4.5+ models.

clear_thinking: Optional[bool]

If False, preserved thinking is used (recommended for agents).

type: Optional[Literal["enabled", "disabled"]]

Whether thinking is enabled or disabled.

One of the following:

"enabled"

"disabled"

class ModelSettingsMoonshotCodingModelSettings: …

Kimi Code model configuration (Anthropic-compatible).

effort: Optional[Literal["low", "medium", "high", 2 more]]

Effort level for supported Anthropic models (controls token spending). ‘xhigh’ and ‘max’ are available on Opus 4.6+. Not setting this gives similar performance to ‘high’.

One of the following:

"low"

"medium"

"high"

"xhigh"

"max"

max_output_tokens: Optional[int]

The maximum number of tokens the model can generate.

parallel_tool_calls: Optional[bool]

Whether to enable parallel tool calling.

provider_type: Optional[Literal["moonshot_coding"]]

The type of the provider.

response_format: Optional[ModelSettingsMoonshotCodingModelSettingsResponseFormat]

The response format for the model.

One of the following:

class TextResponseFormat: …

Response format for plain text responses.

type: Optional[Literal["text"]]

The type of the response format.

class JsonSchemaResponseFormat: …

Response format for JSON schema-based responses.

json_schema: Dict[str, object]

The JSON schema of the response.

type: Optional[Literal["json_schema"]]

The type of the response format.

class JsonObjectResponseFormat: …

Response format for JSON object responses.

type: Optional[Literal["json_object"]]

The type of the response format.

strict: Optional[bool]

Enable strict mode for tool calling. When true, tool outputs are guaranteed to match JSON schemas.

temperature: Optional[float]

The temperature of the model.

thinking: Optional[ModelSettingsMoonshotCodingModelSettingsThinking]

The thinking configuration for the model.

budget_tokens: Optional[int]

The maximum number of tokens the model can use for extended thinking.

type: Optional[Literal["enabled", "disabled"]]

The type of thinking to use.

One of the following:

"enabled"

"disabled"

verbosity: Optional[Literal["low", "medium", "high"]]

Soft control for how verbose model output should be, used for GPT-5 models.

One of the following:

"low"

"medium"

"high"

class GroqModelSettings: …

Groq model configuration (OpenAI-compatible).

max_output_tokens: Optional[int]

The maximum number of tokens the model can generate.

parallel_tool_calls: Optional[bool]

Whether to enable parallel tool calling.

provider_type: Optional[Literal["groq"]]

The type of the provider.

response_format: Optional[ResponseFormat]

The response format for the model.

One of the following:

class TextResponseFormat: …

Response format for plain text responses.

type: Optional[Literal["text"]]

The type of the response format.

class JsonSchemaResponseFormat: …

Response format for JSON schema-based responses.

json_schema: Dict[str, object]

The JSON schema of the response.

type: Optional[Literal["json_schema"]]

The type of the response format.

class JsonObjectResponseFormat: …

Response format for JSON object responses.

type: Optional[Literal["json_object"]]

The type of the response format.

temperature: Optional[float]

The temperature of the model.

class DeepseekModelSettings: …

Deepseek model configuration (OpenAI-compatible).

max_output_tokens: Optional[int]

The maximum number of tokens the model can generate.

parallel_tool_calls: Optional[bool]

Whether to enable parallel tool calling.

provider_type: Optional[Literal["deepseek"]]

The type of the provider.

response_format: Optional[ResponseFormat]

The response format for the model.

One of the following:

class TextResponseFormat: …

Response format for plain text responses.

type: Optional[Literal["text"]]

The type of the response format.

class JsonSchemaResponseFormat: …

Response format for JSON schema-based responses.

json_schema: Dict[str, object]

The JSON schema of the response.

type: Optional[Literal["json_schema"]]

The type of the response format.

class JsonObjectResponseFormat: …

Response format for JSON object responses.

type: Optional[Literal["json_object"]]

The type of the response format.

temperature: Optional[float]

The temperature of the model.

class TogetherModelSettings: …

Together AI model configuration (OpenAI-compatible).

max_output_tokens: Optional[int]

The maximum number of tokens the model can generate.

parallel_tool_calls: Optional[bool]

Whether to enable parallel tool calling.

provider_type: Optional[Literal["together"]]

The type of the provider.

response_format: Optional[ResponseFormat]

The response format for the model.

One of the following:

class TextResponseFormat: …

Response format for plain text responses.

type: Optional[Literal["text"]]

The type of the response format.

class JsonSchemaResponseFormat: …

Response format for JSON schema-based responses.

json_schema: Dict[str, object]

The JSON schema of the response.

type: Optional[Literal["json_schema"]]

The type of the response format.

class JsonObjectResponseFormat: …

Response format for JSON object responses.

type: Optional[Literal["json_object"]]

The type of the response format.

temperature: Optional[float]

The temperature of the model.

class BedrockModelSettings: …

AWS Bedrock model configuration.

max_output_tokens: Optional[int]

The maximum number of tokens the model can generate.

parallel_tool_calls: Optional[bool]

Whether to enable parallel tool calling.

provider_type: Optional[Literal["bedrock"]]

The type of the provider.

response_format: Optional[ResponseFormat]

The response format for the model.

One of the following:

class TextResponseFormat: …

Response format for plain text responses.

type: Optional[Literal["text"]]

The type of the response format.

class JsonSchemaResponseFormat: …

Response format for JSON schema-based responses.

json_schema: Dict[str, object]

The JSON schema of the response.

type: Optional[Literal["json_schema"]]

The type of the response format.

class JsonObjectResponseFormat: …

Response format for JSON object responses.

type: Optional[Literal["json_object"]]

The type of the response format.

temperature: Optional[float]

The temperature of the model.

class ModelSettingsBasetenModelSettings: …

Baseten model configuration (OpenAI-compatible).

max_output_tokens: Optional[int]

The maximum number of tokens the model can generate.

parallel_tool_calls: Optional[bool]

Whether to enable parallel tool calling.

provider_type: Optional[Literal["baseten"]]

The type of the provider.

temperature: Optional[float]

The temperature of the model.

class ModelSettingsOpenRouterModelSettings: …

OpenRouter model configuration (OpenAI-compatible).

max_output_tokens: Optional[int]

The maximum number of tokens the model can generate.

parallel_tool_calls: Optional[bool]

Whether to enable parallel tool calling.

provider_type: Optional[Literal["openrouter"]]

The type of the provider.

response_format: Optional[ModelSettingsOpenRouterModelSettingsResponseFormat]

The response format for the model.

One of the following:

class TextResponseFormat: …

Response format for plain text responses.

type: Optional[Literal["text"]]

The type of the response format.

class JsonSchemaResponseFormat: …

Response format for JSON schema-based responses.

json_schema: Dict[str, object]

The JSON schema of the response.

type: Optional[Literal["json_schema"]]

The type of the response format.

class JsonObjectResponseFormat: …

Response format for JSON object responses.

type: Optional[Literal["json_object"]]

The type of the response format.

temperature: Optional[float]

The temperature of the model.

class ModelSettingsChatGptoAuthModelSettings: …

ChatGPT OAuth model configuration (uses ChatGPT backend API).

max_output_tokens: Optional[int]

The maximum number of tokens the model can generate.

parallel_tool_calls: Optional[bool]

Whether to enable parallel tool calling.

provider_type: Optional[Literal["chatgpt_oauth"]]

The type of the provider.

reasoning: Optional[ModelSettingsChatGptoAuthModelSettingsReasoning]

The reasoning configuration for the model.

reasoning_effort: Optional[Literal["none", "low", "medium", 2 more]]

The reasoning effort level for GPT-5.x and o-series models.

One of the following:

"none"

"low"

"medium"

"high"

"xhigh"

temperature: Optional[float]

The temperature of the model.

Deprecatedmulti_agent_group: Optional[MultiAgentGroup]

Deprecated: Use managed_group field instead. The multi-agent group that this agent manages.

id: str

The id of the group. Assigned by the database.

agent_ids: List[str]

description: str

manager_type: Literal["round_robin", "supervisor", "dynamic", 3 more]

One of the following:

"round_robin"

"supervisor"

"dynamic"

"sleeptime"

"voice_sleeptime"

"swarm"

base_template_id: Optional[str]

The base template id.

deployment_id: Optional[str]

The id of the deployment.

hidden: Optional[bool]

If set to True, the group will be hidden.

last_processed_message_id: Optional[str]

manager_agent_id: Optional[str]

max_message_buffer_length: Optional[int]

The desired maximum length of messages in the context window of the convo agent. This is a best effort, and may be off slightly due to user/assistant interleaving.

max_turns: Optional[int]

min_message_buffer_length: Optional[int]

The desired minimum length of messages in the context window of the convo agent. This is a best effort, and may be off-by-one due to user/assistant interleaving.

project_id: Optional[str]

The associated project id.

Deprecatedshared_block_ids: Optional[List[str]]

sleeptime_agent_frequency: Optional[int]

template_id: Optional[str]

The id of the template.

termination_token: Optional[str]

turns_counter: Optional[int]

pending_approval: Optional[ApprovalRequestMessage]

A message representing a request for approval to call a tool (generated by the LLM to trigger tool execution).

Args: id (str): The ID of the message date (datetime): The date the message was created in ISO format name (Optional[str]): The name of the sender of the message tool_call (ToolCall): The tool call

id: str

date: datetime

Deprecatedtool_call: ToolCall

The tool call that has been requested by the llm to run

One of the following:

class ToolCall: …

arguments: str

tool_call_id: str

class ToolCallDelta: …

arguments: Optional[str]

tool_call_id: Optional[str]

is_err: Optional[bool]

message_type: Optional[Literal["approval_request_message"]]

The type of the message.

otid: Optional[str]

run_id: Optional[str]

sender_id: Optional[str]

seq_id: Optional[int]

step_id: Optional[str]

tool_calls: Optional[ToolCalls]

The tool calls that have been requested by the llm to run, which are pending approval

One of the following:

List[ToolCall]

arguments: str

tool_call_id: str

class ToolCallDelta: …

arguments: Optional[str]

tool_call_id: Optional[str]

per_file_view_window_char_limit: Optional[int]

The per-file view window character limit for this agent. Setting this too high may exceed the context window, which will break the agent.

project_id: Optional[str]

The id of the project the agent belongs to.

response_format: Optional[ResponseFormat]

The response format used by the agent

One of the following:

class TextResponseFormat: …

Response format for plain text responses.

type: Optional[Literal["text"]]

The type of the response format.

class JsonSchemaResponseFormat: …

Response format for JSON schema-based responses.

json_schema: Dict[str, object]

The JSON schema of the response.

type: Optional[Literal["json_schema"]]

The type of the response format.

class JsonObjectResponseFormat: …

Response format for JSON object responses.

type: Optional[Literal["json_object"]]

The type of the response format.

secrets: Optional[List[AgentEnvironmentVariable]]

The environment variables for tool execution specific to this agent.

agent_id: str

The ID of the agent this environment variable belongs to.

key: str

The name of the environment variable.

value: str

The value of the environment variable.

id: Optional[str]

The human-friendly ID of the Agent-env

created_at: Optional[datetime]

The timestamp when the object was created.

formatdate-time

created_by_id: Optional[str]

The id of the user that made this object.

description: Optional[str]

An optional description of the environment variable.

last_updated_by_id: Optional[str]

The id of the user that made this object.

updated_at: Optional[datetime]

The timestamp when the object was last updated.

formatdate-time

value_enc: Optional[str]

Encrypted secret value (stored as encrypted string)

template_id: Optional[str]

The id of the template the agent belongs to.

timezone: Optional[str]

The timezone of the agent (IANA format).

Deprecatedtool_exec_environment_variables: Optional[List[AgentEnvironmentVariable]]

Deprecated: use secrets field instead.

agent_id: str

The ID of the agent this environment variable belongs to.

key: str

The name of the environment variable.

value: str

The value of the environment variable.

id: Optional[str]

The human-friendly ID of the Agent-env

created_at: Optional[datetime]

The timestamp when the object was created.

formatdate-time

created_by_id: Optional[str]

The id of the user that made this object.

description: Optional[str]

An optional description of the environment variable.

last_updated_by_id: Optional[str]

The id of the user that made this object.

updated_at: Optional[datetime]

The timestamp when the object was last updated.

formatdate-time

value_enc: Optional[str]

Encrypted secret value (stored as encrypted string)

tool_rules: Optional[List[ToolRule]]

The list of tool rules.

One of the following:

class ChildToolRule: …

A ToolRule represents a tool that can be invoked by the agent.

children: List[str]

The children tools that can be invoked.

tool_name: str

The name of the tool. Must exist in the database for the user’s organization.

child_arg_nodes: Optional[List[ChildArgNode]]

Optional list of typed child argument overrides. Each node must reference a child in ‘children’.

The name of the child tool to invoke next.

args: Optional[Dict[str, object]]

prompt_template: Optional[str]

Optional template string (ignored).

type: Optional[Literal["constrain_child_tools"]]

class InitToolRule: …

Represents the initial tool rule configuration.

tool_name: str

The name of the tool. Must exist in the database for the user’s organization.

args: Optional[Dict[str, object]]

prompt_template: Optional[str]

Optional template string (ignored). Rendering uses fast built-in formatting for performance.

type: Optional[Literal["run_first"]]

class TerminalToolRule: …

Represents a terminal tool rule configuration where if this tool gets called, it must end the agent loop.

tool_name: str

The name of the tool. Must exist in the database for the user’s organization.

prompt_template: Optional[str]

Optional template string (ignored).

type: Optional[Literal["exit_loop"]]

class ConditionalToolRule: …

A ToolRule that conditionally maps to different child tools based on the output.

child_output_mapping: Dict[str, str]

The output case to check for mapping

tool_name: str

The name of the tool. Must exist in the database for the user’s organization.

default_child: Optional[str]

The default child tool to be called. If None, any tool can be called.

prompt_template: Optional[str]

Optional template string (ignored).

require_output_mapping: Optional[bool]

Whether to throw an error when output doesn’t match any case

type: Optional[Literal["conditional"]]

class ContinueToolRule: …

Represents a tool rule configuration where if this tool gets called, it must continue the agent loop.

tool_name: str

The name of the tool. Must exist in the database for the user’s organization.

prompt_template: Optional[str]

Optional template string (ignored).

type: Optional[Literal["continue_loop"]]

class RequiredBeforeExitToolRule: …

Represents a tool rule configuration where this tool must be called before the agent loop can exit.

tool_name: str

The name of the tool. Must exist in the database for the user’s organization.

prompt_template: Optional[str]

Optional template string (ignored).

type: Optional[Literal["required_before_exit"]]

class MaxCountPerStepToolRule: …

Represents a tool rule configuration which constrains the total number of times this tool can be invoked in a single step.

max_count_limit: int

The max limit for the total number of times this tool can be invoked in a single step.

tool_name: str

The name of the tool. Must exist in the database for the user’s organization.

prompt_template: Optional[str]

Optional template string (ignored).

type: Optional[Literal["max_count_per_step"]]

class ParentToolRule: …

A ToolRule that only allows a child tool to be called if the parent has been called.

children: List[str]

The children tools that can be invoked.

tool_name: str

The name of the tool. Must exist in the database for the user’s organization.

prompt_template: Optional[str]

Optional template string (ignored).

type: Optional[Literal["parent_last_tool"]]

class RequiresApprovalToolRule: …

Represents a tool rule configuration which requires approval before the tool can be invoked.

tool_name: str

The name of the tool. Must exist in the database for the user’s organization.

prompt_template: Optional[str]

Optional template string (ignored). Rendering uses fast built-in formatting for performance.

type: Optional[Literal["requires_approval"]]

updated_at: Optional[datetime]

The timestamp when the object was last updated.

formatdate-time

func_return: Optional[object]

The function return object

sandbox_config_fingerprint: Optional[str]

The fingerprint of the config for the sandbox

stderr: Optional[List[str]]

Captured stderr from the function invocation

stdout: Optional[List[str]]

Captured stdout (prints, logs) from function invocation

ModelsExpand Collapse

class FolderListResponse: …

(Deprecated: Use Folder) Representation of a source, which is a collection of files and passages.

id: str

The human-friendly ID of the Source

embedding_config: EmbeddingConfig

The embedding configuration used by the source.

embedding_dim: int

The dimension of the embedding.

embedding_endpoint_type: Literal["openai", "anthropic", "bedrock", 16 more]

The endpoint type for the model.

One of the following:

"openai"

"anthropic"

"bedrock"

"google_ai"

"google_vertex"

"azure"

"groq"

"ollama"

"webui"

"webui-legacy"

"lmstudio"

"lmstudio-legacy"

"llamacpp"

"koboldcpp"

"vllm"

"hugging-face"

"mistral"

"together"

"pinecone"

embedding_model: str

The model for the embedding.

azure_deployment: Optional[str]

The Azure deployment for the model.

azure_endpoint: Optional[str]

The Azure endpoint for the model.

azure_version: Optional[str]

The Azure version for the model.

batch_size: Optional[int]

The maximum batch size for processing embeddings.

embedding_chunk_size: Optional[int]

The chunk size of the embedding.

embedding_endpoint: Optional[str]

The endpoint for the model (None if local).

handle: Optional[str]

The handle for this config, in the format provider/model-name.

The name of the source.

created_at: Optional[datetime]

The timestamp when the source was created.

formatdate-time

created_by_id: Optional[str]

The id of the user that made this Tool.

description: Optional[str]

The description of the source.

instructions: Optional[str]

Instructions for how to use the source.

last_updated_by_id: Optional[str]

The id of the user that made this Tool.

metadata: Optional[Dict[str, object]]

Metadata associated with the source.

updated_at: Optional[datetime]

The timestamp when the source was last updated.

formatdate-time

vector_db_provider: Optional[VectorDBProvider]

The vector database provider used for this source’s passages

One of the following:

"native"

"tpuf"

"pinecone"

ModelsExpand Collapse

List[str]

class FileListResponse: …

Response model for agent file attachments showing file status in agent context

id: str

Unique identifier of the file-agent relationship

file_id: str

Unique identifier of the file

file_name: str

Name of the file

folder_id: str

Unique identifier of the folder/source

folder_name: str

Name of the folder/source

is_open: bool

Whether the file is currently open in the agent’s context

end_line: Optional[int]

Ending line number if file was opened with line range

last_accessed_at: Optional[datetime]

Timestamp of last access by the agent

formatdate-time

start_line: Optional[int]

Starting line number if file was opened with line range

visible_content: Optional[str]

Portion of the file visible to the agent if open

AgentsArchives

Attach Archive To Agent

agents.archives.attach(, ) -> object

PATCH/v1/agents/{agent_id}/archives/attach/{archive_id}

Detach Archive From Agent

agents.archives.detach(, ) -> object

PATCH/v1/agents/{agent_id}/archives/detach/{archive_id}

ModelsExpand Collapse

List[Passage]

embedding: Optional[List[float]]

The embedding of the passage.

embedding_config: Optional[EmbeddingConfig]

Configuration for embedding model connection and processing parameters.

embedding_dim: int

The dimension of the embedding.

embedding_endpoint_type: Literal["openai", "anthropic", "bedrock", 16 more]

The endpoint type for the model.

One of the following:

"openai"

"anthropic"

"bedrock"

"google_ai"

"google_vertex"

"azure"

"groq"

"ollama"

"webui"

"webui-legacy"

"lmstudio"

"lmstudio-legacy"

"llamacpp"

"koboldcpp"

"vllm"

"hugging-face"

"mistral"

"together"

"pinecone"

embedding_model: str

The model for the embedding.

azure_deployment: Optional[str]

The Azure deployment for the model.

azure_endpoint: Optional[str]

The Azure endpoint for the model.

azure_version: Optional[str]

The Azure version for the model.

batch_size: Optional[int]

The maximum batch size for processing embeddings.

embedding_chunk_size: Optional[int]

The chunk size of the embedding.

embedding_endpoint: Optional[str]

The endpoint for the model (None if local).

handle: Optional[str]

The handle for this config, in the format provider/model-name.

text: str

The text of the passage.

id: Optional[str]

The human-friendly ID of the Passage

archive_id: Optional[str]

The unique identifier of the archive containing this passage.

created_at: Optional[datetime]

The creation date of the passage.

formatdate-time

created_by_id: Optional[str]

The id of the user that made this object.

file_id: Optional[str]

The unique identifier of the file associated with the passage.

file_name: Optional[str]

The name of the file (only for source passages).

is_deleted: Optional[bool]

Whether this passage is deleted or not.

last_updated_by_id: Optional[str]

The id of the user that made this object.

metadata: Optional[Dict[str, object]]

The metadata of the passage.

Deprecatedsource_id: Optional[str]

Deprecated: Use folder_id field instead. The data source of the passage.

tags: Optional[List[str]]

Tags associated with this passage.

updated_at: Optional[datetime]

The timestamp when the object was last updated.

formatdate-time

List[Passage]

embedding: Optional[List[float]]

The embedding of the passage.

embedding_config: Optional[EmbeddingConfig]

Configuration for embedding model connection and processing parameters.

embedding_dim: int

The dimension of the embedding.

embedding_endpoint_type: Literal["openai", "anthropic", "bedrock", 16 more]

The endpoint type for the model.

One of the following:

"openai"

"anthropic"

"bedrock"

"google_ai"

"google_vertex"

"azure"

"groq"

"ollama"

"webui"

"webui-legacy"

"lmstudio"

"lmstudio-legacy"

"llamacpp"

"koboldcpp"

"vllm"

"hugging-face"

"mistral"

"together"

"pinecone"

embedding_model: str

The model for the embedding.

azure_deployment: Optional[str]

The Azure deployment for the model.

azure_endpoint: Optional[str]

The Azure endpoint for the model.

azure_version: Optional[str]

The Azure version for the model.

batch_size: Optional[int]

The maximum batch size for processing embeddings.

embedding_chunk_size: Optional[int]

The chunk size of the embedding.

embedding_endpoint: Optional[str]

The endpoint for the model (None if local).

handle: Optional[str]

The handle for this config, in the format provider/model-name.

text: str

The text of the passage.

id: Optional[str]

The human-friendly ID of the Passage

archive_id: Optional[str]

The unique identifier of the archive containing this passage.

created_at: Optional[datetime]

The creation date of the passage.

formatdate-time

created_by_id: Optional[str]

The id of the user that made this object.

file_id: Optional[str]

The unique identifier of the file associated with the passage.

file_name: Optional[str]

The name of the file (only for source passages).

is_deleted: Optional[bool]

Whether this passage is deleted or not.

last_updated_by_id: Optional[str]

The id of the user that made this object.

metadata: Optional[Dict[str, object]]

The metadata of the passage.

Deprecatedsource_id: Optional[str]

Deprecated: Use folder_id field instead. The data source of the passage.

tags: Optional[List[str]]

Tags associated with this passage.

updated_at: Optional[datetime]

The timestamp when the object was last updated.

formatdate-time

class PassageSearchResponse: …

Total number of results returned

results: List[Result]

List of search results matching the query

id: str

Unique identifier of the archival memory passage

content: str

Text content of the archival memory passage

timestamp: str

Timestamp of when the memory was created, formatted in agent’s timezone

tags: Optional[List[str]]

List of tags associated with this memory

AgentsIdentities

Attach Identity To Agent

agents.identities.attach(, ) -> object

PATCH/v1/agents/{agent_id}/identities/attach/{identity_id}

Detach Identity From Agent

agents.identities.detach(, ) -> object

PATCH/v1/agents/{agent_id}/identities/detach/{identity_id}

Agents

ModelsExpand Collapse

AgentsMessages

ModelsExpand Collapse

AgentsSchedule

ModelsExpand Collapse

AgentsBlocks

ModelsExpand Collapse

AgentsTools

ModelsExpand Collapse

AgentsFolders

ModelsExpand Collapse

AgentsFiles

ModelsExpand Collapse

AgentsArchives

AgentsPassages

ModelsExpand Collapse

AgentsIdentities