A valid request URL is required to generate request examples{
"id": "<string>",
"type": "message",
"role": "assistant",
"content": [
{
"text": "<string>",
"thinking": "<string>",
"signature": "<string>",
"data": "<string>",
"tool_use_id": "<string>",
"id": "<string>",
"name": "<string>",
"input": {},
"server_name": "<string>",
"content": "<string>",
"source": {
"media_type": "<string>",
"data": "<string>",
"url": "<string>"
},
"cache_control": {
"ttl": "<string>"
},
"citations": {
"enabled": true
},
"context": "<string>",
"title": "<string>"
}
],
"model": "<string>",
"stop_sequence": "<string>",
"usage": {
"input_tokens": 123,
"output_tokens": 123,
"cache_creation_input_tokens": 123,
"cache_read_input_tokens": 123,
"cache_creation": {
"ephemeral_5m_input_tokens": 123,
"ephemeral_1h_input_tokens": 123
}
}
}Create message (LiteLLM - Anthropic format)
Creates a message using Anthropic-compatible format via LiteLLM.
A valid request URL is required to generate request examples{
"id": "<string>",
"type": "message",
"role": "assistant",
"content": [
{
"text": "<string>",
"thinking": "<string>",
"signature": "<string>",
"data": "<string>",
"tool_use_id": "<string>",
"id": "<string>",
"name": "<string>",
"input": {},
"server_name": "<string>",
"content": "<string>",
"source": {
"media_type": "<string>",
"data": "<string>",
"url": "<string>"
},
"cache_control": {
"ttl": "<string>"
},
"citations": {
"enabled": true
},
"context": "<string>",
"title": "<string>"
}
],
"model": "<string>",
"stop_sequence": "<string>",
"usage": {
"input_tokens": 123,
"output_tokens": 123,
"cache_creation_input_tokens": 123,
"cache_read_input_tokens": 123,
"cache_creation": {
"ephemeral_5m_input_tokens": 123,
"ephemeral_1h_input_tokens": 123
}
}
}Authorizations
Bearer token authentication. Use your provider API key or Bifrost authentication token.
Virtual keys (prefixed with sk-bf-) can also be passed here.
Body
Model identifier (e.g., claude-3-opus-20240229)
"claude-3-opus-20240229"
Maximum tokens to generate
List of messages in the conversation
Show child attributes
Show child attributes
System prompt
Automatic caching directives for the whole request
Show child attributes
Show child attributes
Show child attributes
Show child attributes
Whether to stream the response
0 <= x <= 1Show child attributes
Show child attributes
Show child attributes
Show child attributes
MCP servers configuration (requires beta header)
Show child attributes
Show child attributes
Show child attributes
Show child attributes
Structured output format (requires beta header)
Response
Successful response
Show child attributes
Show child attributes
end_turn, max_tokens, stop_sequence, tool_use, pause_turn, refusal, model_context_window_exceeded, null Show child attributes
Show child attributes
Was this page helpful?

