OpenAI compatible API. Attested gateway. Public status.
DeepInfra performance
Measured TTFT, TTFB, throughput, uptime, and sampled model routes for DeepInfra.
1 URLbase_url migration
100smodels and routes
0prompt logs by default
deepinfra
320 samples
Continuously sampled provider performance. TrustedRouter reports unsupported route and probe-configuration rows separately from provider downtime. Prompt and output content is not stored.
| p50 TTFT | 827 ms |
|---|---|
| p95 TTFT | ms |
| p50 TTFB | ms |
| Throughput | — |
| Uptime | 99.69% |
Measured model routes
| Model | p50 TTFT | p50 TTFB | Throughput | Uptime | Config excluded | Samples |
|---|---|---|---|---|---|---|
| google/gemma-4-26b-a4b-it | 708 ms | 667 ms | — | 100.00% | — | 45 |
| qwen/qwen3.5-27b | 754 ms | 652 ms | — | 100.00% | — | 48 |
| meta-llama/llama-3.1-70b-instruct | 779 ms | 700 ms | — | 100.00% | — | 52 |
| google/gemma-4-31b-it | 827 ms | 735 ms | — | 100.00% | — | 46 |
| google/gemma-3-12b-it | 831 ms | 738 ms | — | 100.00% | — | 52 |
| google/gemma-3-4b-it | 884 ms | 783 ms | — | 96.88% | — | 32 |
| google/gemma-3-27b-it | 959 ms | 885 ms | — | 100.00% | — | 45 |