OpenAI compatible API. Attested gateway. Public status.

Phala

Phala models on TrustedRouter with prices, routes, policy notes, and source links.

Verify gateway
1 URLbase_url migration
100smodels and routes
0prompt logs by default

phala

Confidential

All providers

ProviderPhala
Models18 public models
Prepaid routes18
BYOK routes18
Zero data retentionyes
Confidential computeyes
Provider E2EEyes
Policy noteTracked as a confidential AI provider with provider-side attestation and encrypted prompt transport.
Policy source

Measured performance

320 samples

Continuously sampled across Phala's routed models — p50 TTFT, throughput, and success rate. Unsupported route and probe-configuration rows are separated from provider downtime. No prompt or output content stored.

p50 TTFT1592 ms
Throughput
Uptime95.00%
Modelp50 TTFTp50 TTFBThroughputUptimeConfig excludedSamples
qwen/qwen2.5-vl-72b-instruct 780 ms 778 ms 100.00% 14
qwen/qwen-2.5-7b-instruct 875 ms 771 ms 91.30% 23
google/gemma-3-27b-it 900 ms 853 ms 100.00% 23
openai/gpt-oss-20b 935 ms 911 ms 100.00% 17
qwen/qwen3.5-27b 1181 ms 1078 ms 100.00% 16
openai/gpt-oss-120b 1184 ms 1119 ms 100.00% 26
z-ai/glm-4.7-flash 1189 ms 1126 ms 100.00% 10
qwen/qwen3-vl-30b-a3b-instruct 1211 ms 1178 ms 100.00% 17
deepseek/deepseek-chat-v3.1 1358 ms 1255 ms 100.00% 10
qwen/qwen3-30b-a3b-instruct-2507 1592 ms 1590 ms 100.00% 18
moonshotai/kimi-k2.6 1651 ms 1547 ms 100.00% 20
moonshotai/kimi-k2.5 1977 ms 1874 ms 93.33% 15
z-ai/glm-4.7 1993 ms 1991 ms 100.00% 17
minimax/minimax-m2.5 2026 ms 1922 ms 83.33% 18
deepseek/deepseek-v3.2 2071 ms 2070 ms 72.73% 22
qwen/qwen3.5-397b-a17b 2100 ms 2098 ms 100.00% 14
z-ai/glm-5.1 3028 ms 2925 ms 88.89% 18
z-ai/glm-5 3229 ms 3124 ms 90.91% 22

Full provider & model leaderboard.

Provider models

Models served by Phala.

Each row links to pricing, provider, benchmark, and API pages for the model.

Model Context Endpoints Prompt Completion Routes
deepseek/deepseek-chat-v3.1
DeepSeek: DeepSeek V3.1
163,840 2 $1.155/1M $3.41/1M prepaid BYOK
deepseek/deepseek-v3.2
DeepSeek: DeepSeek V3.2
163,840 2 $0.352/1M $0.528/1M prepaid BYOK
google/gemma-3-27b-it
Google: Gemma 3 27B
131,072 2 $0.121/1M $0.44/1M prepaid BYOK
minimax/minimax-m2.5
MiniMax: MiniMax M2.5
204,800 2 $0.22/1M $1.518/1M prepaid BYOK
moonshotai/kimi-k2.5
MoonshotAI: Kimi K2.5
262,144 2 $0.66/1M $3.3/1M prepaid BYOK
moonshotai/kimi-k2.6
MoonshotAI: Kimi K2.6
262,144 2 $1.199/1M $5.06/1M prepaid BYOK
openai/gpt-oss-120b
OpenAI: gpt-oss-120b
131,072 2 $0.165/1M $0.66/1M prepaid BYOK
openai/gpt-oss-20b
OpenAI: gpt-oss-20b
131,072 2 $0.044/1M $0.165/1M prepaid BYOK
qwen/qwen-2.5-7b-instruct
Qwen: Qwen2.5 7B Instruct
131,072 2 $0.044/1M $0.11/1M prepaid BYOK
qwen/qwen2.5-vl-72b-instruct
Qwen: Qwen2.5 VL 72B Instruct
131,072 2 $0.22/1M $0.77/1M prepaid BYOK
qwen/qwen3-30b-a3b-instruct-2507
Qwen: Qwen3 30B A3B Instruct 2507
131,072 2 $0.165/1M $0.605/1M prepaid BYOK
qwen/qwen3-vl-30b-a3b-instruct
Qwen: Qwen3 VL 30B A3B Instruct
262,144 2 $0.22/1M $0.77/1M prepaid BYOK
qwen/qwen3.5-27b
Qwen: Qwen3.5-27B
262,144 2 $0.33/1M $2.64/1M prepaid BYOK
qwen/qwen3.5-397b-a17b
Qwen: Qwen3.5 397B A17B
262,144 2 $0.605/1M $3.85/1M prepaid BYOK
z-ai/glm-4.7
Z.ai: GLM 4.7
202,752 2 $0.935/1M $3.63/1M prepaid BYOK
z-ai/glm-4.7-flash
Z.ai: GLM 4.7 Flash
202,752 2 $0.11/1M $0.473/1M prepaid BYOK
z-ai/glm-5
Z.ai: GLM 5
204,800 2 $1.32/1M $3.85/1M prepaid BYOK
z-ai/glm-5.1
Z.ai: GLM 5.1
202,752 2 $1.331/1M $4.62/1M prepaid BYOK

Sign in

Choose a sign in method.