OpenAI compatible API. Attested gateway. Public status.

Novita AI performance

Measured TTFT, TTFB, throughput, uptime, and sampled model routes for Novita AI.

Verify gateway
1 URLbase_url migration
100smodels and routes
0prompt logs by default

novita

320 samples

Provider overview

Continuously sampled provider performance. TrustedRouter reports unsupported route and probe-configuration rows separately from provider downtime. Prompt and output content is not stored.

p50 TTFT1382 ms
p95 TTFT ms
p50 TTFB ms
Throughput43 tok/s
Uptime95.94%

Measured model routes

Modelp50 TTFTp50 TTFBThroughputUptimeConfig excludedSamples
sao10k/l3-8b-lunaris 441 ms 439 ms 100.00% 3
deepseek/deepseek-ocr-2 656 ms 656 ms 100.00% 2
meta-llama/llama-3.3-70b-instruct 662 ms 661 ms 100.00% 2
qwen/qwen3-vl-235b-a22b-instruct 687 ms 684 ms 100.00% 4
qwen/qwen-2.5-72b-instruct 703 ms 702 ms 100.00% 2
qwen/qwen3.5-35b-a3b 732 ms 731 ms 100.00% 2
qwen/qwen-mt-plus 818 ms 817 ms 100.00% 1
meta-llama/llama-4-scout-17b-16e-instruct 830 ms 728 ms 100.00% 3
sao10k/l31-70b-euryale-v2.2 849 ms 846 ms 100.00% 2
zai-org/glm-4.5 869 ms 868 ms 100.00% 2
moonshotai/kimi-k2.5 896 ms 895 ms 100.00% 2
qwen/qwen3-next-80b-a3b-instruct 906 ms 904 ms 100.00% 2
zai-org/glm-4.5-air 907 ms 906 ms 100.00% 4
zai-org/autoglm-phone-9b-multilingual 928 ms 927 ms 100.00% 5
openai/gpt-oss-20b 930 ms 927 ms 100.00% 5
deepseek/deepseek-v4-pro 934 ms 932 ms 100.00% 4
qwen/qwen3-coder-480b-a35b-instruct 942 ms 941 ms 100.00% 4
qwen/qwen3-vl-8b-instruct 966 ms 862 ms 100.00% 3
zai-org/glm-4.5v 1016 ms 913 ms 100.00% 4
qwen/qwen3.6-27b 1024 ms 1023 ms 100.00% 3
meta-llama/llama-4-maverick-17b-128e-instruct-fp8 1050 ms 946 ms 100.00% 3
zai-org/glm-4.6v 1063 ms 1019 ms 100.00% 4
qwen/qwen3-coder-30b-a3b-instruct 1072 ms 1071 ms 100.00% 4
qwen/qwen3-omni-30b-a3b-instruct 1077 ms 1076 ms 100.00% 2
google/gemma-4-31b-it 1087 ms 981 ms 100.00% 6
qwen/qwen3-omni-30b-a3b-thinking 1108 ms 1107 ms 100.00% 1
minimax/minimax-m2.1 1109 ms 1108 ms 100.00% 2
qwen/qwen3-vl-30b-a3b-thinking 1118 ms 1117 ms 100.00% 3
qwen/qwen3-next-80b-a3b-thinking 1124 ms 1122 ms 100.00% 6
meta-llama/llama-3.1-8b-instruct 1127 ms 1048 ms 100.00% 7
qwen/qwen3-235b-a22b-fp8 1209 ms 1209 ms 100.00% 5
qwen/qwen3.6-35b-a3b 1211 ms 1108 ms 100.00% 6
mistralai/mistral-nemo 1217 ms 1215 ms 100.00% 1
deepseek/deepseek-v4-flash 1241 ms 1187 ms 100.00% 5
meta-llama/llama-3-70b-instruct 1259 ms 1257 ms 100.00% 2
qwen/qwen3-235b-a22b-thinking-2507 1264 ms 1263 ms 100.00% 3
qwen/qwen3.5-122b-a10b 1265 ms 1160 ms 100.00% 1
google/gemma-4-26b-a4b-it 1268 ms 1165 ms 100.00% 5
inclusionai/ling-2.6-flash 1284 ms 1194 ms 100.00% 6
google/gemma-3-27b-it 1304 ms 1202 ms 50.00% 4
Sao10K/L3-8B-Stheno-v3.2 1330 ms 1227 ms 100.00% 2
qwen/qwen3-coder-next 1345 ms 1243 ms 100.00% 3
minimax/minimax-m2 1355 ms 1278 ms 100.00% 4
kwaipilot/kat-coder-pro 1358 ms 1302 ms 100.00% 4
qwen/qwen3-vl-30b-a3b-instruct 1373 ms 1372 ms 100.00% 5
qwen/qwen3.5-27b 1382 ms 1279 ms 100.00% 2
microsoft/wizardlm-2-8x22b 1407 ms 1304 ms 100.00% 3
qwen/qwen3-vl-235b-a22b-thinking 1414 ms 1311 ms 100.00% 4
deepseek/deepseek-ocr 1417 ms 1416 ms 100.00% 6
moonshotai/kimi-k2.6 1452 ms 1349 ms 32 tok/s 100.00% 4
minimaxai/minimax-m1-80k 1462 ms 1461 ms 100.00% 1
qwen/qwen3.5-397b-a17b 1502 ms 1398 ms 100.00% 5
openai/gpt-oss-120b 1517 ms 1413 ms 100.00% 3
moonshotai/kimi-k2-thinking 1523 ms 1419 ms 100.00% 7
deepseek/deepseek-v3-turbo 1524 ms 1523 ms 100.00% 7
qwen/qwen3-max 1590 ms 1488 ms 100.00% 3
zai-org/glm-4.7 1615 ms 1614 ms 100.00% 4
minimax/minimax-m2.7 1625 ms 1536 ms 100.00% 5
deepseek/deepseek-v3.2 1632 ms 1630 ms 100.00% 3
qwen/qwen3-235b-a22b-instruct-2507 1659 ms 1600 ms 100.00% 7
deepseek/deepseek-v3-0324 1669 ms 1566 ms 100.00% 3
inclusionai/ring-2.6-1t 1723 ms 1677 ms 100.00% 3
zai-org/glm-5 1723 ms 1722 ms 100.00% 3
deepseek/deepseek-v3.1-terminus 1757 ms 1655 ms 100.00% 3
zai-org/glm-4.6 1757 ms 1653 ms 100.00% 1
xiaomimimo/mimo-v2.5-pro 1761 ms 1759 ms 100.00% 3
deepseek/deepseek-prover-v2-671b 1767 ms 1664 ms 100.00% 3
inclusionai/ling-2.6-1t 1770 ms 1666 ms 100.00% 3
deepseek/deepseek-r1-distill-llama-70b 1771 ms 1666 ms 100.00% 1
deepseek/deepseek-v3.2-exp 1778 ms 1676 ms 100.00% 9
moonshotai/kimi-k2-instruct 1928 ms 1909 ms 100.00% 5
deepseek/deepseek-v3.1 1931 ms 1920 ms 100.00% 4
moonshotai/kimi-k2.7-code 1954 ms 1380 ms 43 tok/s 100.00% 30
deepseek/deepseek-r1-turbo 1974 ms 1973 ms 100.00% 2
baidu/ernie-4.5-vl-424b-a47b 1993 ms 1888 ms 100.00% 5
moonshotai/kimi-k2-0905 2031 ms 1927 ms 100.00% 2
minimax/minimax-m2.5-highspeed 2070 ms 1966 ms 100.00% 3
zai-org/glm-5.1 2102 ms 2000 ms 100.00% 5
deepseek/deepseek-r1-0528 2267 ms 2164 ms 100.00% 3
zai-org/glm-4.7-flash 7732 ms 7730 ms 50.00% 2
google/gemma-3-12b-it 0.00% 2
baidu/ernie-4.5-vl-28b-a3b 0.00% 3
elephant 0.00% 1
baidu/ernie-4.5-21B-a3b 0.00% 2
baichuan/baichuan-m2-32b 0.00% 2

Sign in

Choose a sign in method.