List LLM Models

List available LLM models using the asynchronous implementation for improved performance

Query parameters

provider_categorylist of enums or nullOptional

Allowed values:

provider_namestring or nullOptional

provider_typeenum or nullOptional

Response

Successful Response

modelstring

LLM model name.

model_endpoint_typeenum

The endpoint type for the model.

context_windowinteger

The context window size for the model.

model_endpointstring or null

The endpoint for the model.

provider_namestring or null

The provider name for the model.

provider_categoryenum or null

The provider category for the model.

Allowed values:

model_wrapperstring or null

The wrapper for the model.

put_inner_thoughts_in_kwargsboolean or nullDefaults to true

Puts ‘inner_thoughts’ as a kwarg in the function call if this is set to True. This helps with function calling performance and also the generation of inner thoughts.

handlestring or null

The handle for this config, in the format provider/model-name.

temperaturedouble or nullDefaults to 0.7

The temperature to use when generating text with the model. A higher temperature will result in more random text.

max_tokensinteger or nullDefaults to 4096

The maximum number of tokens to generate. If not set, the model will use its default value.

enable_reasonerboolean or nullDefaults to false

Whether or not the model should use extended thinking if it is a 'reasoning' style model

reasoning_effortenum or null

The reasoning effort to use when generating text reasoning models

Allowed values:

max_reasoning_tokensinteger or nullDefaults to 0

Configurable thinking budget for extended thinking, only used if enable_reasoner is True. Minimum value is 1024.

frequency_penaltydouble or null

Positive values penalize new tokens based on their existing frequency in the text so far, decreasing the model’s likelihood to repeat the same line verbatim. From OpenAI: Number between -2.0 and 2.0.

compatibility_typeenum or null

The framework compatibility type for the model.

Allowed values:

1	from letta_client import Letta
2
3	client = Letta(
4	project="YOUR_PROJECT",
5	token="YOUR_TOKEN",
6	)
7	client.models.list()

1	[
2	{
3	"model": "string",
4	"model_endpoint_type": "openai",
5	"context_window": 1,
6	"model_endpoint": "string",
7	"provider_name": "string",
8	"provider_category": "base",
9	"model_wrapper": "string",
10	"put_inner_thoughts_in_kwargs": true,
11	"handle": "string",
12	"temperature": 0.7,
13	"max_tokens": 1,
14	"enable_reasoner": false,
15	"reasoning_effort": "low",
16	"max_reasoning_tokens": 0,
17	"frequency_penalty": 1.1,
18	"compatibility_type": "gguf"
19	}
20	]

List LLM Models

Headers

Query parameters

Response

Errors