OpenAI compatible API. Attested gateway. Public status.

Venice performance

Measured TTFT, TTFB, throughput, uptime, and sampled model routes for Venice.

Verify gateway
1 URLbase_url migration
100smodels and routes
0prompt logs by default

venice

320 samples

Provider overview

Continuously sampled provider performance. TrustedRouter reports unsupported route and probe-configuration rows separately from provider downtime. Prompt and output content is not stored.

p50 TTFT1287 ms
p95 TTFT ms
p50 TTFB ms
Throughput
Uptime87.19%

Measured model routes

Modelp50 TTFTp50 TTFBThroughputUptimeConfig excludedSamples
qwen/qwen3-235b-a22b-thinking-2507 816 ms 815 ms 100.00% 29
qwen/qwen3.6-27b 920 ms 878 ms 100.00% 36
qwen/qwen3.5-9b 987 ms 884 ms 100.00% 33
z-ai/glm-4.7-flash 1115 ms 1011 ms 93.94% 33
z-ai/glm-5.1 1287 ms 1195 ms 100.00% 29
z-ai/glm-4.6 1327 ms 1224 ms 100.00% 22
z-ai/glm-5 1406 ms 1301 ms 100.00% 23
qwen/qwen3.5-397b-a17b 1410 ms 1328 ms 100.00% 22
z-ai/glm-4.7 1414 ms 1326 ms 100.00% 24
z-ai/glm-5-turbo 1631 ms 1613 ms 93.75% 32
z-ai/glm-5v-turbo 0.00% 37

Sign in

Choose a sign in method.