Create message (LiteLLM - Anthropic format)

{
  "id": "<string>",
  "type": "message",
  "role": "assistant",
  "content": [
    {
      "text": "<string>",
      "thinking": "<string>",
      "signature": "<string>",
      "data": "<string>",
      "tool_use_id": "<string>",
      "id": "<string>",
      "name": "<string>",
      "input": {},
      "server_name": "<string>",
      "content": "<string>",
      "source": {
        "media_type": "<string>",
        "data": "<string>",
        "url": "<string>"
      },
      "cache_control": {
        "ttl": "<string>"
      },
      "citations": {
        "enabled": true
      },
      "context": "<string>",
      "title": "<string>"
    }
  ],
  "model": "<string>",
  "stop_sequence": "<string>",
  "usage": {
    "input_tokens": 123,
    "output_tokens": 123,
    "cache_creation_input_tokens": 123,
    "cache_read_input_tokens": 123,
    "cache_creation": {
      "ephemeral_5m_input_tokens": 123,
      "ephemeral_1h_input_tokens": 123
    }
  }
}

Authorizations

Authorization

string

header

required

Bearer token authentication. Use your provider API key or Bifrost authentication token. Virtual keys (prefixed with sk-bf-) can also be passed here.
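
As an illustration of the header described above, the following sketch builds the Authorization header from a token and checks for the sk-bf- virtual-key prefix. The helper names and the placeholder token are hypothetical and not part of this API; the Content-Type value is taken from the Body section.

```python
def build_auth_headers(token: str) -> dict:
    """Return HTTP headers carrying the token as a Bearer credential."""
    if not token:
        raise ValueError("token must be a non-empty string")
    return {
        "Authorization": f"Bearer {token}",
        "Content-Type": "application/json",  # the body is sent as JSON
    }


def is_virtual_key(token: str) -> bool:
    """Virtual keys are described as being prefixed with 'sk-bf-'."""
    return token.startswith("sk-bf-")


# Placeholder token for illustration only; substitute a real key.
headers = build_auth_headers("sk-bf-example")
```

The same header shape applies whether the token is a provider API key, a Bifrost authentication token, or a virtual key.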

Body

application/json

model

string

required

Model identifier (e.g., claude-3-opus-20240229)

Example:

"claude-3-opus-20240229"

max_tokens

integer

required

Maximum tokens to generate

messages

object[]

required

List of messages in the conversation
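
A minimal sketch of a request body containing only the three required fields. The child attributes of messages are not listed on this page, so the role/content shape below is an assumption based on the Anthropic Messages convention; build_body is a hypothetical helper, and no endpoint URL is shown because this section does not state one.

```python
import json


def build_body(model: str, max_tokens: int, user_text: str) -> dict:
    """Assemble the required fields: model, max_tokens, messages."""
    return {
        "model": model,
        "max_tokens": max_tokens,
        # Assumed message shape (Anthropic Messages convention).
        "messages": [{"role": "user", "content": user_text}],
    }


body = build_body("claude-3-opus-20240229", 256, "Hello")
payload = json.dumps(body)  # JSON string to send as application/json
```

Optional fields such as system, temperature, or stop_sequences would be added as further top-level keys of the same object.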

system

System prompt

cache_control

object

Automatic caching directives for the whole request

metadata

object

stream

boolean

Whether to stream the response

temperature

number

Required range: 0 <= x <= 1
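
A client can check the documented range before sending a request. This is a sketch only; validate_temperature is a hypothetical helper, and the server may report out-of-range values in its own way.

```python
def validate_temperature(value: float) -> float:
    """Return value unchanged if it lies in the documented range [0, 1]."""
    if not 0 <= value <= 1:
        raise ValueError(f"temperature must satisfy 0 <= x <= 1, got {value}")
    return value


temperature = validate_temperature(0.7)
```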

top_p

number

top_k

integer

stop_sequences

string[]

tools

object[]

tool_choice

object

mcp_servers

object[]

MCP servers configuration (requires beta header)

thinking

object

output_format

object

Structured output format (requires beta header)

fallbacks

string[]

Response

Successful response

id

string

type

string

default: message

role

string

default: assistant

content

object[]
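
The sketch below shows one way to gather the generated text from the content array. It assumes that text-bearing blocks carry a string under the "text" key, as in the sample response at the top of this page; blocks without that key (for example tool-use or thinking blocks) are skipped. collect_text is a hypothetical helper.

```python
def collect_text(response: dict) -> str:
    """Concatenate the "text" values of all content blocks that have one."""
    parts = []
    for block in response.get("content", []):
        text = block.get("text")
        if isinstance(text, str):
            parts.append(text)
    return "".join(parts)


# Hand-written response fragment for illustration only.
example = {
    "content": [
        {"text": "Hello, "},
        {"name": "lookup", "input": {}},  # no "text" key, so it is skipped
        {"text": "world."},
    ]
}
joined = collect_text(example)
```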

model

string

stop_reason

enum<string>

Available options: end_turn, max_tokens, stop_sequence, tool_use, pause_turn, refusal, model_context_window_exceeded, null
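
Client code usually branches on stop_reason. The grouping below is one illustrative interpretation of the listed values, not something this page defines; the category names and classify_stop_reason are hypothetical.

```python
# Illustrative grouping of the documented stop_reason values.
_CATEGORIES = {
    "end_turn": "complete",
    "stop_sequence": "complete",
    "max_tokens": "truncated",
    "model_context_window_exceeded": "truncated",
    "tool_use": "continue",
    "pause_turn": "continue",
    "refusal": "refused",
}


def classify_stop_reason(stop_reason):
    """Map a stop_reason value (or None for null) to a coarse category."""
    if stop_reason is None:
        return "unset"
    try:
        return _CATEGORIES[stop_reason]
    except KeyError:
        raise ValueError(f"unknown stop_reason: {stop_reason!r}") from None


category = classify_stop_reason("max_tokens")
```

A caller might retry with a larger max_tokens on "truncated" and send a follow-up request on "continue"; how each case is handled is left to the application.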

stop_sequence

string | null

usage

object
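
A small sketch for reading the token counters out of usage, using the field names from the sample response at the top of this page. Missing or null counters are treated as 0, which is an assumption. Because this page does not say whether input_tokens already includes the cache counters, the sketch reports them separately instead of adding them to the total. summarize_usage is a hypothetical helper.

```python
def summarize_usage(usage: dict) -> dict:
    """Extract token counters, treating missing or null values as 0."""
    def count(key: str) -> int:
        return usage.get(key) or 0

    input_tokens = count("input_tokens")
    output_tokens = count("output_tokens")
    return {
        "input": input_tokens,
        "output": output_tokens,
        "cache_write": count("cache_creation_input_tokens"),
        "cache_read": count("cache_read_input_tokens"),
        "input_plus_output": input_tokens + output_tokens,
    }


# Values copied from the sample response.
summary = summarize_usage({
    "input_tokens": 123,
    "output_tokens": 123,
    "cache_creation_input_tokens": 123,
    "cache_read_input_tokens": 123,
})
```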
