OpenAI compatible API. Attested gateway. Public status.
DeepInfra
DeepInfra models on TrustedRouter with prices, routes, policy notes, and source links.
1 URLbase_url migration
100smodels and routes
0prompt logs by default
deepinfra
No logs
| Provider | DeepInfra |
|---|---|
| Models | 8 public models |
| Prepaid routes | 8 |
| BYOK routes | 8 |
| Zero data retention | yes |
| Confidential compute | not claimed |
| Provider E2EE | not claimed |
| Policy note | Tracked as provider ZDR — DeepInfra documents memory-only handling with no storage of API content and no training on submitted API data. (Exception: requests to Google/Anthropic-backed models inherit those vendors' policies.) Policy source |
Measured performance
320 samplesContinuously sampled across DeepInfra's routed models — p50 TTFT, throughput, and success rate. Unsupported route and probe-configuration rows are separated from provider downtime. No prompt or output content stored.
| p50 TTFT | 923 ms |
|---|---|
| Throughput | — |
| Uptime | 98.12% |
| Model | p50 TTFT | p50 TTFB | Throughput | Uptime | Config excluded | Samples |
|---|---|---|---|---|---|---|
| qwen/qwen3.5-27b | 811 ms | 718 ms | — | 100.00% | — | 49 |
| google/gemma-3-12b-it | 900 ms | 800 ms | — | 100.00% | — | 49 |
| google/gemma-4-26b-a4b-it | 915 ms | 820 ms | — | 100.00% | — | 37 |
| google/gemma-3-27b-it | 923 ms | 845 ms | — | 98.25% | — | 57 |
| google/gemma-3-4b-it | 937 ms | 837 ms | — | 98.15% | — | 54 |
| google/gemma-4-31b-it | 958 ms | 863 ms | — | 87.88% | — | 33 |
| meta-llama/llama-3.1-70b-instruct | 975 ms | 874 ms | — | 100.00% | — | 41 |
Provider models
Models served by DeepInfra.
Each row links to pricing, provider, benchmark, and API pages for the model.
| Model | Context | Endpoints | Prompt | Completion | Routes |
|---|---|---|---|---|---|
Qwen/Qwen3-Embedding-8BQwen3 Embedding 8B |
32,000 | 2 | $0.011/1M | selected route | prepaid BYOK |
google/gemma-3-12b-itGoogle: Gemma 3 12B |
131,072 | 2 | $0.055/1M | $0.165/1M | prepaid BYOK |
google/gemma-3-27b-itGoogle: Gemma 3 27B |
131,072 | 2 | $0.088/1M | $0.176/1M | prepaid BYOK |
google/gemma-3-4b-itGoogle: Gemma 3 4B |
131,072 | 2 | $0.055/1M | $0.11/1M | prepaid BYOK |
google/gemma-4-26b-a4b-itGoogle: Gemma 4 26B A4B |
262,144 | 2 | $0.077/1M | $0.374/1M | prepaid BYOK |
google/gemma-4-31b-itGoogle: Gemma 4 31B |
262,144 | 2 | $0.143/1M | $0.418/1M | prepaid BYOK |
meta-llama/llama-3.1-70b-instructMeta: Llama 3.1 70B Instruct |
131,072 | 2 | $0.44/1M | $0.44/1M | prepaid BYOK |
qwen/qwen3.5-27bQwen: Qwen3.5-27B |
262,144 | 2 | $0.286/1M | $2.86/1M | prepaid BYOK |