OpenAI compatible API. Attested gateway. Public status.
Venice
Venice models on TrustedRouter with prices, routes, policy notes, and source links.
1 URLbase_url migration
100smodels and routes
0prompt logs by default
venice
Confidential
| Provider | Venice |
|---|---|
| Models | 11 public models |
| Prepaid routes | 11 |
| BYOK routes | 11 |
| Zero data retention | yes |
| Confidential compute | yes |
| Provider E2EE | yes |
| Policy note | Tracked as confidential — Venice documents no logging or storage of prompts/responses plus TEE-isolated, end-to-end-encrypted inference. (Caveat: requests Venice proxies to external frontier models inherit those providers' policies; TR routes Venice-native open models here.) Policy source |
Measured performance
320 samplesContinuously sampled across Venice's routed models — p50 TTFT, throughput, and success rate. Unsupported route and probe-configuration rows are separated from provider downtime. No prompt or output content stored.
| p50 TTFT | 1379 ms |
|---|---|
| Throughput | — |
| Uptime | 97.50% |
| Model | p50 TTFT | p50 TTFB | Throughput | Uptime | Config excluded | Samples |
|---|---|---|---|---|---|---|
| qwen/qwen3-235b-a22b-thinking-2507 | 969 ms | 896 ms | — | 100.00% | — | 32 |
| qwen/qwen3.5-9b | 1007 ms | 904 ms | — | 93.10% | — | 29 |
| z-ai/glm-4.7-flash | 1061 ms | 1059 ms | — | 100.00% | — | 28 |
| qwen/qwen3.6-27b | 1085 ms | 1024 ms | — | 93.33% | — | 30 |
| z-ai/glm-5.1 | 1134 ms | 1109 ms | — | 100.00% | — | 34 |
| z-ai/glm-4.6 | 1379 ms | 1284 ms | — | 100.00% | — | 22 |
| z-ai/glm-4.7 | 1443 ms | 1364 ms | — | 100.00% | — | 37 |
| z-ai/glm-5 | 1476 ms | 1381 ms | — | 100.00% | — | 32 |
| z-ai/glm-5-turbo | 1842 ms | 1739 ms | — | 92.31% | — | 26 |
| qwen/qwen3.5-397b-a17b | 1852 ms | 1748 ms | — | 100.00% | — | 21 |
| z-ai/glm-5v-turbo | 2690 ms | 2689 ms | — | 93.10% | — | 29 |
Provider models
Models served by Venice.
Each row links to pricing, provider, benchmark, and API pages for the model.
| Model | Context | Endpoints | Prompt | Completion | Routes |
|---|---|---|---|---|---|
qwen/qwen3-235b-a22b-thinking-2507Qwen: Qwen3 235B A22B Thinking 2507 |
262,144 | 2 | $0.495/1M | $3.85/1M | prepaid BYOK |
qwen/qwen3.5-397b-a17bQwen: Qwen3.5 397B A17B |
262,144 | 2 | $0.825/1M | $4.95/1M | prepaid BYOK |
qwen/qwen3.5-9bQwen: Qwen3.5-9B |
262,144 | 2 | $0.11/1M | $0.165/1M | prepaid BYOK |
qwen/qwen3.6-27bQwen: Qwen3.6 27B |
262,144 | 2 | $0.363/1M | $3.575/1M | prepaid BYOK |
z-ai/glm-4.6Z.ai: GLM 4.6 |
202,752 | 2 | $0.935/1M | $3.025/1M | prepaid BYOK |
z-ai/glm-4.7Z.ai: GLM 4.7 |
202,752 | 2 | $0.605/1M | $2.915/1M | prepaid BYOK |
z-ai/glm-4.7-flashZ.ai: GLM 4.7 Flash |
202,752 | 2 | $0.143/1M | $0.55/1M | prepaid BYOK |
z-ai/glm-5Z.ai: GLM 5 |
204,800 | 2 | $1.1/1M | $3.52/1M | prepaid BYOK |
z-ai/glm-5-turboZ.ai: GLM 5 Turbo |
202,752 | 2 | $1.32/1M | $4.4/1M | prepaid BYOK |
z-ai/glm-5.1Z.ai: GLM 5.1 |
202,752 | 2 | $1.925/1M | $6.05/1M | prepaid BYOK |
z-ai/glm-5v-turboZ.ai: GLM 5V Turbo |
202,752 | 2 | $1.65/1M | $5.5/1M | prepaid BYOK |