{
"id": "<string>",
"embeddings": {
"float": [
[
123
]
],
"int8": [
[
123
]
],
"uint8": [
[
123
]
],
"binary": [
[
123
]
],
"ubinary": [
[
123
]
],
"base64": [
"<string>"
]
},
"response_type": "<string>",
"texts": [
"<string>"
],
"images": [
{
"width": 123,
"height": 123,
"format": "<string>",
"bit_depth": 123
}
],
"meta": {
"api_version": {
"version": "<string>",
"is_deprecated": true,
"is_experimental": true
},
"billed_units": {
"input_tokens": 123,
"output_tokens": 123,
"search_units": 123,
"classifications": 123
},
"tokens": {
"input_tokens": 123,
"output_tokens": 123
},
"warnings": [
"<string>"
]
}
}
Create embeddings (LiteLLM - Cohere format)
Creates embeddings using Cohere-compatible format via LiteLLM.
Authorizations
Bearer token authentication. Use your provider API key or Bifrost authentication token.
Virtual keys (prefixed with sk-bf-) can also be passed here.
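As a minimal sketch of the authorization described above, the helper below builds the bearer header for a request. The function names `bearer_headers` and `is_virtual_key` are hypothetical and used only for illustration; the `sk-bf-` prefix comes from the text above.

```python
def bearer_headers(token: str) -> dict[str, str]:
    """Build HTTP headers carrying the token as a Bearer credential."""
    token = token.strip()
    if not token:
        raise ValueError("token must not be empty")
    return {
        "Authorization": f"Bearer {token}",
        "Content-Type": "application/json",
    }


def is_virtual_key(token: str) -> bool:
    """Return True when the token carries the virtual-key prefix sk-bf-."""
    return token.startswith("sk-bf-")
```

The same header shape applies whether the token is a provider API key, a Bifrost authentication token, or a virtual key.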
Body
ID of an available embedding model.
Example: "embed-english-v3.0"
Specifies the type of input passed to the model. Required for embedding models v3 and higher.
Array of strings to embed. Maximum 96 texts per call. At least one of texts, images, or inputs is required.
Array of image data URIs for multimodal embedding. Maximum 1 image per call. Supports JPEG, PNG, WebP, and GIF up to 5MB.
Array of mixed text/image components for embedding. Maximum 96 per call.
Specifies the return format types (float, int8, uint8, binary, ubinary, base64). Defaults to float if unspecified.
Number of dimensions for output embeddings (256, 512, 1024, 1536). Available only for embed-v4 and newer models.
Maximum tokens to embed per input before truncation.
Handling for inputs exceeding token limits. Defaults to END.
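The body constraints above can be checked client-side before sending a request. The sketch below is illustrative only: the field names `model`, `input_type`, `texts`, `images`, `inputs`, `embedding_types`, and `output_dimension` are assumptions based on Cohere's embed request format, since this page does not list them explicitly, and `build_embed_body` is a hypothetical helper.

```python
MAX_TEXTS = 96    # "Maximum 96 texts per call"
MAX_IMAGES = 1    # "Maximum 1 image per call"
MAX_INPUTS = 96   # "Maximum 96 per call" for mixed inputs
EMBEDDING_TYPES = {"float", "int8", "uint8", "binary", "ubinary", "base64"}
OUTPUT_DIMENSIONS = {256, 512, 1024, 1536}


def build_embed_body(model, input_type, texts=None, images=None, inputs=None,
                     embedding_types=None, output_dimension=None):
    """Assemble a request body dict, enforcing the limits documented above.

    Field names are assumed (Cohere-style); they are not confirmed by this page.
    """
    texts, images, inputs = texts or [], images or [], inputs or []
    if not (texts or images or inputs):
        raise ValueError("at least one of texts, images, or inputs is required")
    if len(texts) > MAX_TEXTS:
        raise ValueError(f"at most {MAX_TEXTS} texts per call")
    if len(images) > MAX_IMAGES:
        raise ValueError(f"at most {MAX_IMAGES} image per call")
    if len(inputs) > MAX_INPUTS:
        raise ValueError(f"at most {MAX_INPUTS} inputs per call")

    body = {"model": model, "input_type": input_type}
    if texts:
        body["texts"] = list(texts)
    if images:
        body["images"] = list(images)
    if inputs:
        body["inputs"] = list(inputs)
    if embedding_types is not None:
        unknown = set(embedding_types) - EMBEDDING_TYPES
        if unknown:
            raise ValueError(f"unsupported embedding types: {sorted(unknown)}")
        body["embedding_types"] = list(embedding_types)
    if output_dimension is not None:
        if output_dimension not in OUTPUT_DIMENSIONS:
            raise ValueError("output_dimension must be 256, 512, 1024, or 1536")
        body["output_dimension"] = output_dimension
    return body
```

Leaving `embedding_types` unset relies on the documented default of float. The helper does not check whether the chosen model supports `output_dimension`; per the text above, that option applies only to embed-v4 and newer models.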
Response
Successful response
Response ID
Embedding data object with different types
Response type (embeddings_floats, embeddings_by_type)
Original text entries
Original image entries
Metadata in embedding response
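To illustrate how the response fields fit together, the sketch below reads float vectors and billed input tokens from a parsed response. It assumes the object-shaped `embeddings` field shown in the example response on this page; the helper names and the sample values are hypothetical.

```python
def float_vectors(response: dict) -> list:
    """Return the list of float embedding vectors, or [] if none are present."""
    embeddings = response.get("embeddings") or {}
    return embeddings.get("float") or []


def billed_input_tokens(response: dict) -> int:
    """Return meta.billed_units.input_tokens, defaulting to 0 when absent."""
    meta = response.get("meta") or {}
    billed = meta.get("billed_units") or {}
    return billed.get("input_tokens", 0)


# Hypothetical response, shaped like the example response on this page.
sample = {
    "id": "example-id",
    "embeddings": {"float": [[0.1, 0.2, 0.3]]},
    "texts": ["hello"],
    "meta": {"billed_units": {"input_tokens": 1}},
}

vectors = float_vectors(sample)
tokens = billed_input_tokens(sample)
```

Missing keys fall back to empty values rather than raising, which keeps the helpers usable when only some embedding types were requested.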