OpenAI-compatible API · Attested · Public status

Z.ai: GLM 5 Performance

Name: Z.ai: GLM 5 TrustedRouter performance measurements
Creator: TrustedRouter
License: https://www.apache.org/licenses/LICENSE-2.0

TrustedRouter performance signals and provider route posture for Z.ai: GLM 5.

Verify gateway

One base URL to migrate

100s of models and routes

No prompt logs by default

`z-ai/glm-5`

Open weights · Performance

AI IQ: 106 (public AI IQ rank #50 for glm-5)

Measured performance

Continuously sampled p50/p95 time-to-first-token (TTFT), time-to-first-byte (TTFB), throughput, and success rate for Z.ai: GLM 5. Rows for unsupported routes and probe-configuration problems are separated from provider downtime, and no prompt or output content is stored.

Provider	p50 TTFT	p95 TTFT	p50 TTFB	Throughput	Uptime	Config excluded	Samples
deepinfra	1762 ms	13071 ms	1762 ms	—	100.00%	—	3
baseten	2141 ms	12932 ms	2140 ms	—	100.00%	—	14
atlas-cloud	2528 ms	6519 ms	2528 ms	—	100.00%	—	6
chutes	2717 ms	8308 ms	2717 ms	—	100.00%	—	6
digitalocean	2834 ms	16982 ms	2834 ms	—	100.00%	—	9
gmi	3757 ms	14236 ms	3757 ms	—	100.00%	—	47
zai	4487 ms	16943 ms	4487 ms	—	100.00%	—	16
venice	5313 ms	17589 ms	5313 ms	—	100.00%	—	21
siliconflow	5662 ms	15035 ms	5661 ms	—	100.00%	—	7
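
As a rough illustration of how p50 and p95 columns like the ones above can be derived from raw latency samples, here is a minimal Python sketch. It assumes the nearest-rank percentile method; the page does not say which method TrustedRouter uses, and the function names `percentile` and `summarize` are hypothetical.

```python
import math


def percentile(samples, pct):
    """Nearest-rank percentile (an assumed method, not confirmed by the page).

    Returns the smallest sample such that at least pct percent of the
    samples are less than or equal to it. pct is an integer in 1..100.
    """
    if not samples:
        raise ValueError("need at least one sample")
    if not 1 <= pct <= 100:
        raise ValueError("pct must be between 1 and 100")
    ordered = sorted(samples)
    # 1-based rank; pct * n is an integer, so the division is well behaved.
    rank = math.ceil(pct * len(ordered) / 100)
    return ordered[rank - 1]


def summarize(ttft_ms):
    """Collapse raw TTFT samples (milliseconds) into one table-style row."""
    return {
        "p50": percentile(ttft_ms, 50),
        "p95": percentile(ttft_ms, 95),
        "samples": len(ttft_ms),
    }
```

With only a few samples, nearest-rank p95 is simply the slowest observation, so rows with low sample counts (for example 3 or 6) should be read as rough indications rather than stable estimates.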

Full provider & model leaderboard.

Provider diversity

16 routes.

More routes give the auto router more room to fail over around provider 429 and 5xx responses.
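
The failover idea can be sketched as follows. This is not TrustedRouter's implementation: `call_with_failover`, `is_retryable`, and the `send` callback are hypothetical names, and the only behavior taken from the page is that provider 429 and 5xx responses are the ones to route around.

```python
def is_retryable(status):
    """Treat rate limiting (429) and server errors (5xx) as reasons to fail over."""
    return status == 429 or 500 <= status <= 599


def call_with_failover(routes, send):
    """Try each provider route in order until one returns a non-retryable status.

    routes: ordered list of provider names (hypothetical identifiers).
    send:   callable taking a provider name and returning (status, body).
    Returns (provider, status, body, attempts), where attempts records every
    (provider, status) pair that was tried.
    """
    attempts = []
    for provider in routes:
        status, body = send(provider)
        attempts.append((provider, status))
        if not is_retryable(status):
            return provider, status, body, attempts
    raise RuntimeError(f"all routes returned retryable errors: {attempts}")
```

Each extra route adds one more iteration to the loop before the request has to fail, which is why a longer route list leaves more room to recover.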

Streaming

Gateway overhead is measured separately.

Public status separates TLS/health overhead from full model latency so slow LLMs do not inflate the router metric.
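
One way to picture this separation is to tag each sample with the kind of probe that produced it and aggregate the two kinds independently. The sketch below assumes two hypothetical tags, `"gateway"` and `"model"`, and uses the median as the summary; neither detail comes from the page.

```python
from statistics import median


def split_metrics(samples):
    """Aggregate gateway-overhead and full-model latencies separately.

    samples: list of (kind, latency_ms) pairs, where kind is "gateway" or
    "model" (assumed labels). Any other kind raises KeyError.
    Returns the median latency per kind, or None when a kind has no samples.
    """
    by_kind = {"gateway": [], "model": []}
    for kind, latency_ms in samples:
        by_kind[kind].append(latency_ms)
    return {
        kind: (median(values) if values else None)
        for kind, values in by_kind.items()
    }
```

Because model samples never enter the gateway bucket, a slow model response changes only the model figure.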

Status

Metadata rollups.

Status samples store latency, outcome, provider, model, route, cost, and region metadata only.
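
A record limited to the fields listed above might look like the following sketch. The class name `StatusSample`, the field names, the types, and the units (milliseconds, US dollars) are assumptions for illustration; the page only names the categories of metadata.

```python
from dataclasses import dataclass, asdict


@dataclass(frozen=True)
class StatusSample:
    """One status sample: metadata only, with no prompt or output fields."""
    latency_ms: float  # assumed unit: milliseconds
    outcome: str       # e.g. "success" or "error" (assumed values)
    provider: str
    model: str
    route: str
    cost_usd: float    # assumed unit: US dollars
    region: str


# Illustrative values only; route and region strings are made up.
sample = StatusSample(
    latency_ms=1762.0,
    outcome="success",
    provider="deepinfra",
    model="z-ai/glm-5",
    route="example-route",
    cost_usd=0.0,
    region="example-region",
)
record = asdict(sample)
```

Keeping the record type closed to these seven fields makes it structurally impossible to attach prompt or completion text to a sample.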