Bifrost integrates with GraySwan Cygnal Monitor to provide AI safety monitoring with natural language rule definitions and advanced threat detection capabilities. This page covers the configuration and capabilities of the GraySwan Cygnal guardrail provider.

Capabilities

  • Violation Scoring: Continuous 0-1 scale violation detection with configurable thresholds
  • Custom Natural Language Rules: Define safety rules in plain English without code
  • Policy Management: Use pre-built policies from GraySwan platform or create custom ones
  • Indirect Prompt Injection (IPI) Detection: Identify hidden instructions in user inputs
  • Mutation Detection: Detect attempts to manipulate or alter content
  • Reasoning Modes: Choose from fast (“off”), balanced (“hybrid”), or thorough (“thinking”) analysis

Configuration Fields

| Field | Type | Required | Default | Description |
| --- | --- | --- | --- | --- |
| api_key | string | Yes | - | GraySwan API key |
| violation_threshold | number | No | 0.5 | Score threshold (0-1) for triggering intervention. Lower values are more strict. |
| reasoning_mode | enum | No | "off" | Analysis depth: off (fastest), hybrid (balanced), or thinking (most thorough) |
| policy_id | string | No | - | Single custom policy ID from GraySwan platform |
| policy_ids | array | No | - | Multiple policy IDs for aggregated rule evaluation |
| rules | object | No | - | Custom natural language rules as key-value pairs |
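
As a minimal sketch of how the fields above fit together, the following Python models the configuration and a threshold check. The names GraySwanConfig, validate, and should_intervene are hypothetical and are not part of Bifrost or GraySwan. Treating a score equal to the threshold as a violation is also an assumption; the table only says the threshold triggers intervention.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional

# Allowed reasoning_mode values, taken from the table above.
REASONING_MODES = {"off", "hybrid", "thinking"}


@dataclass
class GraySwanConfig:
    """Hypothetical container mirroring the documented fields and defaults."""

    api_key: str
    violation_threshold: float = 0.5
    reasoning_mode: str = "off"
    policy_id: Optional[str] = None
    policy_ids: List[str] = field(default_factory=list)
    rules: Dict[str, str] = field(default_factory=dict)

    def validate(self) -> None:
        # api_key is the only required field.
        if not self.api_key:
            raise ValueError("api_key is required")
        # The threshold is documented as a value on a 0-1 scale.
        if not 0.0 <= self.violation_threshold <= 1.0:
            raise ValueError("violation_threshold must be between 0 and 1")
        if self.reasoning_mode not in REASONING_MODES:
            raise ValueError(
                "reasoning_mode must be one of: " + ", ".join(sorted(REASONING_MODES))
            )


def should_intervene(score: float, config: GraySwanConfig) -> bool:
    # Assumption: a score at or above the threshold counts as a violation.
    return score >= config.violation_threshold
```

With the default threshold of 0.5, a score of 0.7 would trigger intervention and a score of 0.2 would not. Lowering the threshold to 0.1 makes the same 0.2 score trigger, which is what "lower values are more strict" means in practice.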

Request Header Metadata

For each GraySwan monitor call, Bifrost includes sanitized incoming request headers in GraySwan metadata.headers. This gives GraySwan request context for correlation and policy analysis, such as x-request-id, x-correlation-id, traceparent, x-tenant-id, x-org-id, content-type, and content-length.

Credential-bearing headers are excluded. Bifrost does not send authorization, proxy-authorization, x-api-key, api-key, x-goog-api-key, x-bf-vk, x-bf-api-key, x-bf-api-key-id, cookie, set-cookie, or grayswan-api-key in GraySwan metadata.

This is metadata only: these values are added to the JSON body sent to GraySwan, not forwarded as outbound HTTP headers, and they cannot override the configured GraySwan API key.
{
  "metadata": {
    "headers": {
      "x-request-id": "req-123",
      "traceparent": "00-...",
      "x-tenant-id": "tenant-123",
      "content-type": "application/json"
    }
  }
}
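
The header filtering described above can be sketched in Python as a simple denylist. This is an illustration only: the function names sanitize_headers and build_metadata are hypothetical, and Bifrost's actual implementation may filter differently (for example, with additional rules). The excluded names are the ones listed in this section. Matching is done case-insensitively because HTTP header names are case-insensitive; emitting lowercase names is an assumption.

```python
from typing import Dict

# Credential-bearing headers that this page says are never sent to GraySwan.
EXCLUDED_HEADERS = frozenset({
    "authorization",
    "proxy-authorization",
    "x-api-key",
    "api-key",
    "x-goog-api-key",
    "x-bf-vk",
    "x-bf-api-key",
    "x-bf-api-key-id",
    "cookie",
    "set-cookie",
    "grayswan-api-key",
})


def sanitize_headers(headers: Dict[str, str]) -> Dict[str, str]:
    """Drop excluded headers, comparing names case-insensitively."""
    return {
        name.lower(): value
        for name, value in headers.items()
        if name.lower() not in EXCLUDED_HEADERS
    }


def build_metadata(headers: Dict[str, str]) -> Dict[str, Dict[str, Dict[str, str]]]:
    """Wrap the sanitized headers in the metadata.headers shape shown above."""
    return {"metadata": {"headers": sanitize_headers(headers)}}
```

Because the result is placed in the JSON body rather than sent as HTTP headers, nothing in it can replace the credentials used for the outbound GraySwan call.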

Custom Rules Example

Rules are defined as key-value pairs where the key is the rule name and the value is a natural language description:
{
  "rules": {
    "no_profanity": "Do not allow profanity or vulgar language",
    "no_pii": "Do not allow personally identifiable information",
    "professional_tone": "Ensure all responses maintain a professional tone"
  }
}

Detection Features

  • Real-time violation scoring
  • Multi-rule evaluation
  • IPI attack detection
  • Content mutation monitoring
  • Detailed violation descriptions with rule attribution
For provider comparison and information on configuring guardrail rules and profiles, see Guardrails.