OpenAI compatible API. Attested gateway. Public status.
Venice performance
Measured TTFT, TTFB, throughput, uptime, and sampled model routes for Venice.
1 URLbase_url migration
100smodels and routes
0prompt logs by default
venice
320 samples
Continuously sampled provider performance. TrustedRouter reports unsupported route and probe-configuration rows separately from provider downtime. Prompt and output content is not stored.
| p50 TTFT | 1287 ms |
|---|---|
| p95 TTFT | ms |
| p50 TTFB | ms |
| Throughput | — |
| Uptime | 87.19% |
Measured model routes
| Model | p50 TTFT | p50 TTFB | Throughput | Uptime | Config excluded | Samples |
|---|---|---|---|---|---|---|
| qwen/qwen3-235b-a22b-thinking-2507 | 816 ms | 815 ms | — | 100.00% | — | 29 |
| qwen/qwen3.6-27b | 920 ms | 878 ms | — | 100.00% | — | 36 |
| qwen/qwen3.5-9b | 987 ms | 884 ms | — | 100.00% | — | 33 |
| z-ai/glm-4.7-flash | 1115 ms | 1011 ms | — | 93.94% | — | 33 |
| z-ai/glm-5.1 | 1287 ms | 1195 ms | — | 100.00% | — | 29 |
| z-ai/glm-4.6 | 1327 ms | 1224 ms | — | 100.00% | — | 22 |
| z-ai/glm-5 | 1406 ms | 1301 ms | — | 100.00% | — | 23 |
| qwen/qwen3.5-397b-a17b | 1410 ms | 1328 ms | — | 100.00% | — | 22 |
| z-ai/glm-4.7 | 1414 ms | 1326 ms | — | 100.00% | — | 24 |
| z-ai/glm-5-turbo | 1631 ms | 1613 ms | — | 93.75% | — | 32 |
| z-ai/glm-5v-turbo | — | — | — | 0.00% | — | 37 |