OpenAI compatible API. Attested gateway. Public status.
GMI Cloud performance
Measured TTFT, TTFB, throughput, uptime, and sampled model routes for GMI Cloud.
1 URLbase_url migration
100smodels and routes
0prompt logs by default
gmi
320 samples
Continuously sampled provider performance. TrustedRouter reports unsupported route and probe-configuration rows separately from provider downtime. Prompt and output content is not stored.
| p50 TTFT | 5268 ms |
|---|---|
| p95 TTFT | ms |
| p50 TTFB | ms |
| Throughput | — |
| Uptime | 61.88% |
Measured model routes
| Model | p50 TTFT | p50 TTFB | Throughput | Uptime | Config excluded | Samples |
|---|---|---|---|---|---|---|
| deepseek/deepseek-v4-pro | 3205 ms | 3102 ms | — | 98.21% | — | 56 |
| google/gemma-4-31b-it | 3401 ms | 3400 ms | — | 63.93% | — | 61 |
| z-ai/glm-5.1 | 5268 ms | 5164 ms | — | 21.54% | — | 65 |
| google/gemma-4-26b-a4b-it | 5668 ms | 5564 ms | — | 53.57% | — | 56 |
| z-ai/glm-5 | 7725 ms | 7666 ms | — | 73.17% | — | 82 |