OpenAI compatible API. Attested gateway. Public status.
Anthropic: Claude Sonnet 4.5 Benchmarks
Benchmark and measurement links for Anthropic: Claude Sonnet 4.5, with TrustedRouter route data first.
1 URLbase_url migration
100smodels and routes
0prompt logs by default
anthropic/claude-sonnet-4.5
Benchmarks
Published benchmark scores
First-party model-card and open-leaderboard scores for Anthropic: Claude Sonnet 4.5 — every row links to its primary source. TrustedRouter does not run these evals; we cite them, and only attach a score to the exact checkpoint it was measured on.
| Benchmark | Category | Score | Source |
|---|---|---|---|
| OSWorld computer use |
Agentic | 61.4% | Anthropic — Claude Sonnet 4.5 2025-09-29 |
| SWE-bench Verified avg of 10 trials, 200K thinking budget; no test-time compute |
Coding | 77.2% | Anthropic — Claude Sonnet 4.5 2025-09-29 |
TrustedRouter measurements
TrustedRouter publishes route and status measurements without storing prompt or output content. Provider latency and uptime are exposed through the model performance and uptime pages.
External benchmark references
- TrustedRouter performance pageTrustedRouter measurement
- TrustedRouter uptime pageTrustedRouter measurement
- Anthropic model docsOfficial model information
- LMArena leaderboardIndependent benchmark index
- LiveBenchIndependent benchmark index
- Artificial Analysis modelsIndependent benchmark index
- HELMIndependent benchmark index