Models

Relai provides access to frontier models from leading providers through private cloud deployments on Amazon Bedrock and Azure OpenAI. All models are available in both EU and US regions.

Private Cloud Inference

Unlike public API endpoints, Relai routes your requests through private cloud deployments on Amazon Bedrock and Azure:

  • Zero training on your data — Frontier labs (Anthropic, OpenAI, Google) never see or train on your prompts
  • No logging by model providers — Your conversations are not stored or analyzed by the AI companies
  • Enterprise-grade isolation — Dedicated compute with SOC2 and HIPAA compliance
  • Contractual guarantees — Data processing agreements with AWS and Microsoft, not frontier labs

Pricing

Transparent pricing with no markup. You pay the same rates as direct Bedrock/Azure access.

ModelProviderInput (1M tokens)Output (1M tokens)Context
claude-opus-4.7Anthropic$15.00$75.00200k
claude-sonnet-4.6Anthropic$3.00$15.00200k
claude-haiku-4Anthropic$0.80$4.00200k
gpt-5.5OpenAI$5.00$15.00256k
gpt-5.5-miniOpenAI$0.60$2.40256k
o4-miniOpenAI$1.10$4.40200k
gemini-2.5-proGoogle$2.50$10.002M
gemini-2.5-flashGoogle$0.15$0.601M
llama-4-maverickMeta$0.20$0.601M
llama-4-scoutMeta$0.17$0.85512k
mistral-large-3Mistral$2.00$6.00128k

Model IDs

Use the full model ID in your API requests:

{
  "model": "claude-sonnet-4.6",
  "messages": [...]
}

Provider-Prefixed Models

You can also use provider-prefixed model IDs:

{
  "model": "anthropic/claude-sonnet-4.6",
  "messages": [...]
}

This is useful when a model name exists across multiple providers.

Model Capabilities

ModelVisionToolsJSON ModeStreaming
claude-opus-4.7
claude-sonnet-4.6
claude-haiku-4
gpt-5.5
gpt-5.5-mini
o4-mini
gemini-2.5-pro
gemini-2.5-flash
llama-4-maverick
llama-4-scout
mistral-large-3

Infrastructure

All inference runs on:

  • Amazon Bedrock — Claude models (Anthropic), Llama models (Meta), Mistral models
  • Azure OpenAI — GPT and O-series models (OpenAI)

This ensures your data stays within enterprise cloud boundaries and is never exposed to frontier lab training pipelines.

EU-Safe Models

All models accessible through the EU region (eu.api.llmrelai.com) route through EU-based Bedrock and Azure deployments. Your prompts and completions never leave the EU.

For a complete list of models, see the pricing page.