OpenAI compatible API. Attested gateway. Public status.

DeepInfra performance

Measured TTFT, TTFB, throughput, uptime, and sampled model routes for DeepInfra.

Verify gateway
1 URLbase_url migration
100smodels and routes
0prompt logs by default

deepinfra

320 samples

Provider overview

Continuously sampled provider performance. TrustedRouter reports unsupported route and probe-configuration rows separately from provider downtime. Prompt and output content is not stored.

p50 TTFT827 ms
p95 TTFT ms
p50 TTFB ms
Throughput
Uptime99.69%

Measured model routes

Modelp50 TTFTp50 TTFBThroughputUptimeConfig excludedSamples
google/gemma-4-26b-a4b-it 708 ms 667 ms 100.00% 45
qwen/qwen3.5-27b 754 ms 652 ms 100.00% 48
meta-llama/llama-3.1-70b-instruct 779 ms 700 ms 100.00% 52
google/gemma-4-31b-it 827 ms 735 ms 100.00% 46
google/gemma-3-12b-it 831 ms 738 ms 100.00% 52
google/gemma-3-4b-it 884 ms 783 ms 96.88% 32
google/gemma-3-27b-it 959 ms 885 ms 100.00% 45

Sign in

Choose a sign in method.