- Cohere: Provider: schemas.Cohere, Model: "rerank-v3.5"
- vLLM: Provider: schemas.VLLM, Model: "BAAI/bge-reranker-v2-m3"
Basic Example
Parameters
- Provider, Model: provider/model to use for rerank
- Query: query text
- Documents: documents to score (text, optional id, meta)
- Params.TopN: max result count
- Params.MaxTokensPerDoc: provider-dependent token cap
- Params.Priority: provider-dependent priority hint
- Params.ReturnDocuments: include source document in each result
- Fallbacks: fallback provider/model choices
For vLLM, set Provider to schemas.VLLM and use the upstream model ID as Model (without the vllm/ prefix that is used in Gateway HTTP requests).
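The prefix rule for vLLM can be sketched as a small helper. This is an illustration only: `upstreamModelID` and `gatewayPrefix` are hypothetical names and not part of the Bifrost SDK. It shows how a Gateway-style model string could be converted into the value passed as Model.

```go
package main

import (
	"fmt"
	"strings"
)

// gatewayPrefix is the prefix the text above says Gateway HTTP requests use
// for vLLM models. The constant name is an assumption made for this sketch.
const gatewayPrefix = "vllm/"

// upstreamModelID (hypothetical helper) removes the Gateway prefix, if present,
// so the result can be used as Model together with Provider: schemas.VLLM.
func upstreamModelID(gatewayModel string) string {
	return strings.TrimPrefix(gatewayModel, gatewayPrefix)
}

func main() {
	// A Gateway-style string loses its prefix.
	fmt.Println(upstreamModelID("vllm/BAAI/bge-reranker-v2-m3"))
	// A string that is already an upstream model ID is returned unchanged.
	fmt.Println(upstreamModelID("BAAI/bge-reranker-v2-m3"))
}
```

Only the leading prefix is removed, so slashes inside the upstream model ID (such as the one in BAAI/bge-reranker-v2-m3) are preserved.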
Response
BifrostRerankResponse includes:
- Results []RerankResult (index, relevance_score, optional document)
- Model
- optional Usage
- ExtraFields metadata (provider, latency, request_type, etc.)
Next Steps
- Streaming Responses - Real-time response processing
- Tool Calling - Enable AI to use external functions
- Multimodal AI - Process images and multimedia content
- Core Features - Advanced Bifrost capabilities

