OpenAI compatible API. Attested gateway. Public status.
Phala
Phala models on TrustedRouter with prices, routes, policy notes, and source links.
1 URLbase_url migration
100smodels and routes
0prompt logs by default
phala
Confidential
| Provider | Phala |
|---|---|
| Models | 18 public models |
| Prepaid routes | 18 |
| BYOK routes | 18 |
| Zero data retention | yes |
| Confidential compute | yes |
| Provider E2EE | yes |
| Policy note | Tracked as a confidential AI provider with provider-side attestation and encrypted prompt transport. Policy source |
Measured performance
320 samplesContinuously sampled across Phala's routed models — p50 TTFT, throughput, and success rate. Unsupported route and probe-configuration rows are separated from provider downtime. No prompt or output content stored.
| p50 TTFT | 1592 ms |
|---|---|
| Throughput | — |
| Uptime | 95.00% |
| Model | p50 TTFT | p50 TTFB | Throughput | Uptime | Config excluded | Samples |
|---|---|---|---|---|---|---|
| qwen/qwen2.5-vl-72b-instruct | 780 ms | 778 ms | — | 100.00% | — | 14 |
| qwen/qwen-2.5-7b-instruct | 875 ms | 771 ms | — | 91.30% | — | 23 |
| google/gemma-3-27b-it | 900 ms | 853 ms | — | 100.00% | — | 23 |
| openai/gpt-oss-20b | 935 ms | 911 ms | — | 100.00% | — | 17 |
| qwen/qwen3.5-27b | 1181 ms | 1078 ms | — | 100.00% | — | 16 |
| openai/gpt-oss-120b | 1184 ms | 1119 ms | — | 100.00% | — | 26 |
| z-ai/glm-4.7-flash | 1189 ms | 1126 ms | — | 100.00% | — | 10 |
| qwen/qwen3-vl-30b-a3b-instruct | 1211 ms | 1178 ms | — | 100.00% | — | 17 |
| deepseek/deepseek-chat-v3.1 | 1358 ms | 1255 ms | — | 100.00% | — | 10 |
| qwen/qwen3-30b-a3b-instruct-2507 | 1592 ms | 1590 ms | — | 100.00% | — | 18 |
| moonshotai/kimi-k2.6 | 1651 ms | 1547 ms | — | 100.00% | — | 20 |
| moonshotai/kimi-k2.5 | 1977 ms | 1874 ms | — | 93.33% | — | 15 |
| z-ai/glm-4.7 | 1993 ms | 1991 ms | — | 100.00% | — | 17 |
| minimax/minimax-m2.5 | 2026 ms | 1922 ms | — | 83.33% | — | 18 |
| deepseek/deepseek-v3.2 | 2071 ms | 2070 ms | — | 72.73% | — | 22 |
| qwen/qwen3.5-397b-a17b | 2100 ms | 2098 ms | — | 100.00% | — | 14 |
| z-ai/glm-5.1 | 3028 ms | 2925 ms | — | 88.89% | — | 18 |
| z-ai/glm-5 | 3229 ms | 3124 ms | — | 90.91% | — | 22 |
Provider models
Models served by Phala.
Each row links to pricing, provider, benchmark, and API pages for the model.
| Model | Context | Endpoints | Prompt | Completion | Routes |
|---|---|---|---|---|---|
deepseek/deepseek-chat-v3.1DeepSeek: DeepSeek V3.1 |
163,840 | 2 | $1.155/1M | $3.41/1M | prepaid BYOK |
deepseek/deepseek-v3.2DeepSeek: DeepSeek V3.2 |
163,840 | 2 | $0.352/1M | $0.528/1M | prepaid BYOK |
google/gemma-3-27b-itGoogle: Gemma 3 27B |
131,072 | 2 | $0.121/1M | $0.44/1M | prepaid BYOK |
minimax/minimax-m2.5MiniMax: MiniMax M2.5 |
204,800 | 2 | $0.22/1M | $1.518/1M | prepaid BYOK |
moonshotai/kimi-k2.5MoonshotAI: Kimi K2.5 |
262,144 | 2 | $0.66/1M | $3.3/1M | prepaid BYOK |
moonshotai/kimi-k2.6MoonshotAI: Kimi K2.6 |
262,144 | 2 | $1.199/1M | $5.06/1M | prepaid BYOK |
openai/gpt-oss-120bOpenAI: gpt-oss-120b |
131,072 | 2 | $0.165/1M | $0.66/1M | prepaid BYOK |
openai/gpt-oss-20bOpenAI: gpt-oss-20b |
131,072 | 2 | $0.044/1M | $0.165/1M | prepaid BYOK |
qwen/qwen-2.5-7b-instructQwen: Qwen2.5 7B Instruct |
131,072 | 2 | $0.044/1M | $0.11/1M | prepaid BYOK |
qwen/qwen2.5-vl-72b-instructQwen: Qwen2.5 VL 72B Instruct |
131,072 | 2 | $0.22/1M | $0.77/1M | prepaid BYOK |
qwen/qwen3-30b-a3b-instruct-2507Qwen: Qwen3 30B A3B Instruct 2507 |
131,072 | 2 | $0.165/1M | $0.605/1M | prepaid BYOK |
qwen/qwen3-vl-30b-a3b-instructQwen: Qwen3 VL 30B A3B Instruct |
262,144 | 2 | $0.22/1M | $0.77/1M | prepaid BYOK |
qwen/qwen3.5-27bQwen: Qwen3.5-27B |
262,144 | 2 | $0.33/1M | $2.64/1M | prepaid BYOK |
qwen/qwen3.5-397b-a17bQwen: Qwen3.5 397B A17B |
262,144 | 2 | $0.605/1M | $3.85/1M | prepaid BYOK |
z-ai/glm-4.7Z.ai: GLM 4.7 |
202,752 | 2 | $0.935/1M | $3.63/1M | prepaid BYOK |
z-ai/glm-4.7-flashZ.ai: GLM 4.7 Flash |
202,752 | 2 | $0.11/1M | $0.473/1M | prepaid BYOK |
z-ai/glm-5Z.ai: GLM 5 |
204,800 | 2 | $1.32/1M | $3.85/1M | prepaid BYOK |
z-ai/glm-5.1Z.ai: GLM 5.1 |
202,752 | 2 | $1.331/1M | $4.62/1M | prepaid BYOK |