OpenAI-compatible API · Attested · Public status

LLM Provider Latency Benchmarks

Measured time-to-first-token, time-to-first-byte, throughput, and success rate for LLM providers routed through TrustedRouter.

One base URL to migrate

100s of models and routes

0 prompt or output logs. Always.

Measured provider latency

Provider speed data from real routed requests.

TrustedRouter publishes metadata-only measurements of time-to-first-token, time-to-first-byte, throughput, and uptime, and also lists the probe-configuration rows that were excluded. The goal is to show what the router actually sees, not what a provider claims in a launch post.

✓ Provider and model leaderboards
✓ Per-provider performance pages when enough samples exist
✓ Per-model performance pages when enough samples exist
✓ Prompt and output content never stored for these rollups
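
As a rough illustration of how the metrics named above can be derived from timestamps alone, here is a minimal Python sketch. `StreamTiming` and `summarize` are hypothetical names, and the choice to measure throughput over the window after the first token is an assumption; the page does not describe TrustedRouter's actual method.

```python
from dataclasses import dataclass
from typing import Optional, Sequence


@dataclass(frozen=True)
class StreamTiming:
    """Timestamps (seconds) for one streamed response; no prompt or output text."""
    request_sent_s: float
    first_byte_s: float
    token_arrivals_s: Sequence[float]  # arrival time of each output token


def summarize(timing: StreamTiming) -> dict:
    """Derive TTFB, TTFT, and throughput from timestamps alone."""
    if not timing.token_arrivals_s:
        # No tokens arrived: count the request as a failure.
        return {"success": False}
    first = timing.token_arrivals_s[0]
    last = timing.token_arrivals_s[-1]
    # Assumption: throughput is measured over the window after the first token.
    window_s = last - first
    intervals = len(timing.token_arrivals_s) - 1
    tokens_per_s: Optional[float] = intervals / window_s if window_s > 0 else None
    return {
        "success": True,
        "ttfb_ms": round((timing.first_byte_s - timing.request_sent_s) * 1000),
        "ttft_ms": round((first - timing.request_sent_s) * 1000),
        "tokens_per_s": tokens_per_s,
    }
```

Because the function only reads timestamps, a caller could discard the streamed text as soon as each token's arrival time is noted.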

Signals: metadata only

{
  "provider": "tinfoil",
  "model": "moonshotai/kimi-k2.6",
  "p50_ttft_ms": 1192,
  "uptime": 0.999,
  "sample_count": 42
}
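
A row shaped like the one above could plausibly be produced by rolling up individual probe samples. The sketch below is an illustration, not TrustedRouter's implementation: `rollup` is a hypothetical helper, and it assumes that `uptime` is the fraction of successful samples and that `sample_count` includes failed ones, neither of which the page states.

```python
from statistics import median
from typing import Optional, Sequence, Tuple

# One sample: (time-to-first-token in ms, or None on failure; whether it succeeded)
Sample = Tuple[Optional[float], bool]


def rollup(provider: str, model: str, samples: Sequence[Sample]) -> dict:
    """Aggregate per-request samples into one metadata-only leaderboard row."""
    # Only successful samples with a measured TTFT contribute to the median.
    ttfts = [ttft for ttft, ok in samples if ok and ttft is not None]
    successes = sum(1 for _, ok in samples if ok)
    return {
        "provider": provider,
        "model": model,
        "p50_ttft_ms": round(median(ttfts)) if ttfts else None,
        # Assumption: uptime is the success fraction over all samples.
        "uptime": round(successes / len(samples), 3) if samples else None,
        "sample_count": len(samples),
    }
```

For example, three successful samples at 1000, 1200, and 1400 ms plus one failure would give a `p50_ttft_ms` of 1200, an `uptime` of 0.75, and a `sample_count` of 4 under these assumptions.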

Provider pages

Tinfoil performance: Confidential and E2EE route samples
Anthropic performance: Claude route samples
Google Vertex performance: Managed Google Cloud route samples
Google AI Studio performance: Gemini Developer API route samples

Model pages

Kimi K2.6 performance: Provider-specific route metrics
Gemini Flash performance: Fast multimodal route metrics
GPT Nano performance: Small-model latency metrics

Questions

Are these vendor claims?

No. The leaderboard is generated from TrustedRouter synthetic probes and runtime metadata, not provider marketing claims.

Do latency probes store prompts or outputs?

No. Status and leaderboard records store provider, model, latency, token, route, cost, and outcome metadata only.
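
One way to enforce a metadata-only rule like this is an allowlist applied before anything is written. The following is a minimal sketch under stated assumptions: the field names in `ALLOWED_FIELDS` and the helper `to_metadata_record` are hypothetical, chosen to mirror the categories listed above rather than taken from TrustedRouter's schema.

```python
# Hypothetical field names mirroring the metadata categories listed above.
ALLOWED_FIELDS = frozenset({
    "provider",
    "model",
    "latency_ms",
    "token_count",
    "route",
    "cost_usd",
    "outcome",
})


def to_metadata_record(event: dict) -> dict:
    """Keep only allowlisted metadata keys; anything else is dropped."""
    return {key: value for key, value in event.items() if key in ALLOWED_FIELDS}
```

An allowlist fails closed: a new field carrying prompt or output text would be dropped unless someone adds it to the set deliberately.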