Smart Model Routing

Frontier quality. Half the cost.

Sansa routes every request to the model most likely to answer it correctly at the lowest cost.

Your app calls the API with sansa-auto as the model.

HOW ROUTING WORKS

No single model wins every query

Same test,
different strengths

  1. Models have different strengths: one is great at math, another at code, another at writing.

  2. Give them all the same test and they each miss the questions outside their strengths.

  3. If you take only the answers each one got right and combine them into one test.

  4. That combined test outscores every individual model.

  5. That is routing. Requests go to the model most likely to get it right, so the system as a whole beats any single model.

Gemini

Claude

ChatGPT

Accounting
Coding
Writing
Health
Algorithms
Law

Score

33%

Score

33%

Score

33%

Routed test

Beats every model

Accounting
Coding
Writing
Health
Algorithms
Law

100%

MODEL ROUTING

Self Improving Agents

Frontier performance, that gets better over time.

Cost vs. quality

Grok 4.3
Kimi K2.6
Gemini 3.1
Opus 4.7
GPT 5.5

Sansa Auto

93.3% accuracy

3× lower cost

Better agents every day

Your eval results feed back into the router, so your agents improve and your metrics climb while you sleep.

Eval Pass Rate (90d)

+31%
Cost Decreasing

Route

Respond

Evaluate

Improve

Know exactly what ran, and why

Every model match is logged, so you can debug issues and track performance over time.

Routing decisions

Search Filter
2026-02-1...
TimestampRequestRouted toLatencyCost
Feb 15, 10:33 PMreq_8f2a
claude-4.8-opus
1.4s$0.09
Feb 15, 10:32 PMreq_7c1b
gpt-5.5
2.1s$0.14
Feb 15, 10:31 PMreq_6d9e
gemini-3.1-pro
1.8s$0.06
Feb 15, 10:30 PMreq_5a4c
kimi-k2.6
980ms$0.03
Feb 15, 10:29 PMreq_4b2f
claude-4.6
1.1s$0.05
Feb 15, 10:28 PMreq_3e8d
deepseek-v4
640ms$0.01

Match score

94%Strong match
Candidates considered12 models
Selected modelclaude-4.8-opus
Routing latency48 ms
MMLU-Pro Benchmark

Routing that wins on benchmarks

Sansa routing beats all frontier models on MMLU-Pro accuracy.

Run this benchmark yourself

Reproduce every number against sansa-auto with the open-source evaluation harness.

sansa-mmlu-pro

Get frontier performance for less