OpenAI compatible API · Attested · Public status

Venice

Venice models on TrustedRouter with prices, routes, policy notes, and source links.

Verify gateway

Onebase URL to migrate

100sof models and routes

Noneprompt logs by default

`venice`

Confidential

All providers

Provider	Venice
Models	12 public models
Prepaid routes	12
BYOK routes	12
Zero data retention	yes
Confidential compute	yes
Provider E2EE	yes
Policy note	Tracked as confidential — Venice documents no logging or storage of prompts/responses plus TEE-isolated, end-to-end-encrypted inference. (Caveat: requests Venice proxies to external frontier models inherit those providers' policies; TR routes Venice-native open models here.) Policy source

Measured performance

191 samples

Continuously sampled across Venice's routed models — p50 TTFT, throughput, and success rate. Unsupported route and probe-configuration rows are separated from provider downtime. No prompt or output content stored.

p50 TTFT	5545 ms
Throughput	61 tok/s
Uptime	99.48%

Model	p50 TTFT	p50 TTFB	Throughput	Uptime	Config excluded	Samples
qwen/qwen3.6-27b	2214 ms	2214 ms	—	100.00%	—	20
z-ai/glm-5.2	2315 ms	3699 ms	61 tok/s	100.00%	—	16
z-ai/glm-4.6	3323 ms	3323 ms	—	100.00%	—	15
qwen/qwen3-235b-a22b-thinking-2507	4013 ms	4012 ms	—	100.00%	—	16
z-ai/glm-5	5313 ms	5313 ms	—	100.00%	—	21
qwen/qwen3.5-397b-a17b	5545 ms	5545 ms	—	100.00%	—	16
qwen/qwen3.5-9b	6707 ms	6706 ms	—	100.00%	—	14
z-ai/glm-4.7-flash	6808 ms	6808 ms	—	100.00%	—	15
z-ai/glm-5-turbo	8336 ms	8336 ms	—	93.33%	—	15
z-ai/glm-4.7	8397 ms	8397 ms	—	100.00%	—	17
z-ai/glm-5v-turbo	9251 ms	9251 ms	—	100.00%	—	16
z-ai/glm-5.1	9797 ms	9796 ms	—	100.00%	—	10

Venice performance history · Full provider & model leaderboard.

Provider models

Models served by Venice.

Each row links to pricing, provider, benchmark, and API pages for the model.

Model	AI IQ	Context	Endpoints	Prompt	Completion	Routes
`qwen/qwen3-235b-a22b-thinking-2507` Qwen: Qwen3 235B A22B Thinking 2507 providers pricing	—	262,144	2	$0.4725/1M	$3.675/1M	prepaid BYOK
`qwen/qwen3.5-397b-a17b` Qwen: Qwen3.5 397B A17B providers pricing	—	262,144	2	$0.7875/1M	$4.725/1M	prepaid BYOK
`qwen/qwen3.5-9b` Qwen: Qwen3.5-9B providers pricing	IQ 93#88	262,144	2	$0.105/1M	$0.1575/1M	prepaid BYOK
`qwen/qwen3.6-27b` Qwen: Qwen3.6 27B providers pricing	IQ 112#38	262,144	2	$0.3465/1M	$3.4125/1M	prepaid BYOK
`z-ai/glm-4.6` Z.ai: GLM 4.6 providers pricing	—	204,800	2	$0.4515/1M	$1.8375/1M	prepaid BYOK
`z-ai/glm-4.7` Z.ai: GLM 4.7 providers pricing	IQ 103#56	204,800	2	$0.5775/1M	$2.7825/1M	prepaid BYOK
`z-ai/glm-4.7-flash` Z.ai: GLM 4.7 Flash providers pricing	—	202,752	2	$0.1365/1M	$0.525/1M	prepaid BYOK
`z-ai/glm-5` Z.ai: GLM 5 benchmarks providers pricing	IQ 106#50	204,800	2	$1.05/1M	$3.36/1M	prepaid BYOK
`z-ai/glm-5-turbo` Z.ai: GLM 5 Turbo providers pricing	—	202,752	2	$1.26/1M	$4.2/1M	prepaid BYOK
`z-ai/glm-5.1` Z.ai: GLM 5.1 benchmarks providers pricing	IQ 113#30	204,800	2	$1.617/1M	$5.082/1M	prepaid BYOK
`z-ai/glm-5.2` Z.ai: GLM 5.2 benchmarks providers pricing	IQ 120#16	1,048,576	2	$1.47/1M	$4.62/1M	prepaid BYOK
`z-ai/glm-5v-turbo` Z.ai: GLM 5V Turbo providers pricing	—	202,752	2	$1.575/1M	$5.25/1M	prepaid BYOK