OpenAI-compatible API. Attested gateway. Public status.
LLM Provider Latency Benchmarks
Measured time-to-first-token, time-to-first-byte, throughput, and success rate for LLM providers routed through TrustedRouter.
1 URL: base_url migration
100s: models and routes
0: prompt logs by default
Measured provider latency
Provider speed data from real routed requests.
TrustedRouter publishes metadata-only measurements for time-to-first-token, time-to-first-byte, throughput, uptime, and excluded probe-configuration rows. The goal is to show what the router actually sees, not what a provider claims in a launch post.
- ✓ Provider and model leaderboards
- ✓ Per-provider performance pages when enough samples exist
- ✓ Per-model performance pages when enough samples exist
- ✓ Prompt and output content never stored for these rollups
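As a rough illustration of how time-to-first-token and throughput could be derived from a streamed response, here is a minimal Python sketch. It is not TrustedRouter's probe code: the function name `measure_stream`, the injected `clock`, and the choice to count chunks rather than tokens are all assumptions made for this example. Time-to-first-byte is not covered, because it depends on the transport layer rather than on the decoded chunks.

```python
import time
from typing import Callable, Iterable


def measure_stream(
    chunks: Iterable[str],
    clock: Callable[[], float] = time.perf_counter,
) -> dict:
    """Time a stream of text chunks (hypothetical helper, for illustration).

    Only timings and counts are kept; the chunk text itself is discarded.
    """
    start = clock()
    first_token_at = None
    pieces = 0
    for chunk in chunks:
        if not chunk:
            continue  # ignore empty keep-alive chunks
        if first_token_at is None:
            first_token_at = clock()  # first non-empty chunk marks TTFT
        pieces += 1
    end = clock()

    if first_token_at is None:
        # The stream ended without any content: count it as a failure.
        return {"success": False, "ttft_ms": None, "chunks_per_s": None}

    generation_s = end - first_token_at
    return {
        "success": True,
        "ttft_ms": round((first_token_at - start) * 1000),
        # Assumption: chunks stand in for tokens in this sketch.
        "chunks_per_s": pieces / generation_s if generation_s > 0 else None,
    }
```

In practice `chunks` would be the text deltas of a streaming completion; passing a fake `clock` makes the function easy to check without a network call.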
Signals: metadata only
{
  "provider": "tinfoil",
  "model": "moonshotai/kimi-k2.6",
  "p50_ttft_ms": 1192,
  "uptime": 0.999,
  "sample_count": 42
}
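A record of this shape could be produced by rolling up individual probe samples. The Python sketch below shows one way to do that under stated assumptions: the per-sample keys `ok` and `ttft_ms` are hypothetical, `uptime` is taken to be the fraction of successful samples, and the median is computed over successful samples only. The page does not define these fields, so treat this as an illustration rather than the actual pipeline.

```python
from statistics import median


def rollup(provider: str, model: str, samples: list) -> dict:
    """Aggregate probe samples into one metadata-only record.

    Each sample is assumed to be a dict with 'ok' (bool) and
    'ttft_ms' (int, or None when the request failed).
    """
    ok_ttfts = [
        s["ttft_ms"] for s in samples if s["ok"] and s["ttft_ms"] is not None
    ]
    ok_count = sum(1 for s in samples if s["ok"])
    return {
        "provider": provider,
        "model": model,
        # p50 is the median TTFT across successful samples (assumption).
        "p50_ttft_ms": round(median(ok_ttfts)) if ok_ttfts else None,
        # Uptime as a success fraction (assumption).
        "uptime": round(ok_count / len(samples), 3) if samples else None,
        "sample_count": len(samples),
    }
```

Reporting `sample_count` alongside the median lets a reader judge how much weight a given row deserves.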
Provider pages
- Tinfoil performance: Confidential and E2EE route samples
- Anthropic performance: Claude route samples
- Gemini performance: Google model route samples
Model pages
- Kimi K2.6 performance: Provider-specific route metrics
- Gemini Flash performance: Fast multimodal route metrics
- GPT Nano performance: Small-model latency metrics
Questions
Are these vendor claims?
No. The leaderboard is generated from TrustedRouter synthetic probes and runtime metadata, not provider marketing claims.
Do latency probes store prompts or outputs?
No. Status and leaderboard records store provider, model, latency, token, route, cost, and outcome metadata only.
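One common way to enforce a metadata-only policy like this is an allowlist applied before anything is persisted. The Python sketch below is an assumption about how such a filter might look, not a description of TrustedRouter's storage code; the field names in `ALLOWED_FIELDS` and the function `to_metadata_record` are hypothetical.

```python
# Hypothetical field names; only these keys survive into a stored record.
ALLOWED_FIELDS = frozenset(
    {"provider", "model", "route", "latency_ms", "tokens", "cost_usd", "outcome"}
)


def to_metadata_record(event: dict) -> dict:
    """Keep allowlisted metadata and drop everything else.

    Because unknown keys are dropped by default, prompt or output text
    cannot reach storage even if a new field is added upstream.
    """
    return {key: value for key, value in event.items() if key in ALLOWED_FIELDS}
```

An allowlist is the safer default here: a denylist would need updating every time a new content-bearing field appears.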