OpenAI compatible API. Attested gateway. Public status.

GMI Cloud performance

Measured TTFT, TTFB, throughput, uptime, and sampled model routes for GMI Cloud.

Verify gateway
1 URLbase_url migration
100smodels and routes
0prompt logs by default

gmi

320 samples

Provider overview

Continuously sampled provider performance. TrustedRouter reports unsupported route and probe-configuration rows separately from provider downtime. Prompt and output content is not stored.

p50 TTFT5268 ms
p95 TTFT ms
p50 TTFB ms
Throughput
Uptime61.88%

Measured model routes

Modelp50 TTFTp50 TTFBThroughputUptimeConfig excludedSamples
deepseek/deepseek-v4-pro 3205 ms 3102 ms 98.21% 56
google/gemma-4-31b-it 3401 ms 3400 ms 63.93% 61
z-ai/glm-5.1 5268 ms 5164 ms 21.54% 65
google/gemma-4-26b-a4b-it 5668 ms 5564 ms 53.57% 56
z-ai/glm-5 7725 ms 7666 ms 73.17% 82

Sign in

Choose a sign in method.