Unified API

One contract across models and providers

Access 200+ models through a single integration instead of a patchwork of SDKs

Request
{
  "model": "google/gemma-4-31b-it",
  "messages": [
    {
      "role": "system",
      "content": "You are a concise assistant."
    },
    {
      "role": "user",
      "content": "Summarize this ticket in one sentence."
    }
  ],
  "max_tokens": 256
}
Response
{
  "model": "google/gemma-4-31b-it",
  "choices": [
    {
      "index": 0,
      "message": {
        "role": "assistant",
        "content": "Customer reports intermittent timeouts after the last deploy and wants a rollback window."
      },
      "finish_reason": "stop"
    }
  ],
  "usage": {
    "prompt_tokens": 42,
    "completion_tokens": 28,
    "total_tokens": 70
  }
}
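
As a rough illustration of reading this response shape, here is a minimal Python sketch using only the standard library. The helper name `summarize_response` is hypothetical and not part of any SDK; the payload is the example response shown above.

```python
import json

# The example response shown above, as a JSON string.
RAW_RESPONSE = """
{
  "model": "google/gemma-4-31b-it",
  "choices": [
    {
      "index": 0,
      "message": {
        "role": "assistant",
        "content": "Customer reports intermittent timeouts after the last deploy and wants a rollback window."
      },
      "finish_reason": "stop"
    }
  ],
  "usage": {"prompt_tokens": 42, "completion_tokens": 28, "total_tokens": 70}
}
"""


def summarize_response(body: str) -> dict:
    """Pull the assistant text, finish reason, and token total out of a response.

    Hypothetical helper for illustration only.
    """
    data = json.loads(body)
    first_choice = data["choices"][0]
    return {
        "text": first_choice["message"]["content"],
        "finish_reason": first_choice["finish_reason"],
        "total_tokens": data["usage"]["total_tokens"],
    }


result = summarize_response(RAW_RESPONSE)
print(result["text"])
```

Because the schema is the same for every model, a helper like this should not need per-provider branches.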
One integration for models, tools, and safety

Models: 200+
Providers: 30+
Uptime: 99.9%

Integration

Swap the URL. Keep your code.

The gateway is OpenAI-compatible. Change the base URL and get the same schema for messages, tool calls, and streaming across 200+ models from 30+ providers.
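
To make the base-URL swap concrete, here is a minimal sketch that builds an OpenAI-style chat request with Python's standard library. The gateway URL, the `/chat/completions` path, and the bearer-token header are assumptions based on the OpenAI convention; this page does not state them, so check the product documentation for the real values.

```python
import json
import urllib.request

# Assumed placeholders: the real gateway URL and key are not given on this page.
BASE_URL = "https://gateway.example.com/v1"
API_KEY = "YOUR_API_KEY"


def build_chat_request(base_url, api_key, model, messages, max_tokens=256):
    """Build (but do not send) an OpenAI-style chat completion request."""
    payload = {"model": model, "messages": messages, "max_tokens": max_tokens}
    return urllib.request.Request(
        url=f"{base_url.rstrip('/')}/chat/completions",  # assumed OpenAI-style path
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",  # assumed auth scheme
            "Content-Type": "application/json",
        },
        method="POST",
    )


request = build_chat_request(
    BASE_URL,
    API_KEY,
    model="google/gemma-4-31b-it",
    messages=[
        {"role": "system", "content": "You are a concise assistant."},
        {"role": "user", "content": "Summarize this ticket in one sentence."},
    ],
)

# To actually send it:
# with urllib.request.urlopen(request) as resp:
#     reply = json.load(resp)
```

Only `BASE_URL` differs from a direct provider call; the payload is the same one shown in the request example.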

Reliability

Automatic failover. Zero downtime.

If a provider goes down or rate-limits you, Sansa automatically routes the request to the same model on a different provider. Uptime stays high, and you write no retry logic.
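
For intuition only, the failover idea can be sketched as a loop over providers that serve the same model. This is not Sansa's actual implementation, which this page does not describe. The `ProviderUnavailable` exception, the provider names, and the stub functions are all hypothetical.

```python
from typing import Callable, Sequence, Tuple


class ProviderUnavailable(Exception):
    """Hypothetical error for an outage or a rate limit at one provider."""


def route_with_failover(
    providers: Sequence[Tuple[str, Callable[[dict], dict]]],
    request: dict,
) -> Tuple[str, dict]:
    """Try each provider in order and return the first successful reply."""
    errors = []
    for name, call in providers:
        try:
            return name, call(request)
        except ProviderUnavailable as exc:
            # Record the failure and fall through to the next provider.
            errors.append(f"{name}: {exc}")
    raise RuntimeError("all providers failed: " + "; ".join(errors))


def rate_limited_provider(request: dict) -> dict:
    # Stub that always fails, standing in for a provider that is rate-limiting.
    raise ProviderUnavailable("rate limited")


def healthy_provider(request: dict) -> dict:
    # Stub that always succeeds and echoes the requested model.
    return {"model": request["model"], "ok": True}


served_by, reply = route_with_failover(
    [("provider-a", rate_limited_provider), ("provider-b", healthy_provider)],
    {"model": "google/gemma-4-31b-it"},
)
print(served_by)
```

With a gateway doing this server-side, the client sends one request and receives one response, whichever provider ends up serving it.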

Extensibility

Every feature, one request object

Memory, search, compression, and input guard are all activated by adding fields to your request object. Opt in per request, with no new code.
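
A minimal sketch of per-request opt-in might look like the following. The field names (`memory`, `search`, `compression`, `input_guard`) and their boolean values are guesses made for illustration; this page names the features but not the request keys, so treat every key here as hypothetical.

```python
import copy

# Hypothetical field names: the page lists the features, not their request keys.
OPTIONAL_FEATURES = {"memory", "search", "compression", "input_guard"}


def with_features(base_request: dict, **features) -> dict:
    """Return a copy of the request with the chosen feature fields added."""
    unknown = set(features) - OPTIONAL_FEATURES
    if unknown:
        raise ValueError(f"unknown feature fields: {sorted(unknown)}")
    request = copy.deepcopy(base_request)  # leave the caller's dict untouched
    request.update(features)
    return request


base = {
    "model": "google/gemma-4-31b-it",
    "messages": [{"role": "user", "content": "Summarize this ticket in one sentence."}],
    "max_tokens": 256,
}

plain = with_features(base)  # no features: identical to the base request
guarded = with_features(base, input_guard=True, search=True)
```

The base request stays unchanged, so each call can choose its own set of features.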


Your all-in-one AI backend.

Get started for free.