OpenAI compatible API. Attested gateway. Public status.
Novita AI performance
Measured TTFT, TTFB, throughput, uptime, and sampled model routes for Novita AI.
1 URLbase_url migration
100smodels and routes
0prompt logs by default
novita
320 samples
Continuously sampled provider performance. TrustedRouter reports unsupported route and probe-configuration rows separately from provider downtime. Prompt and output content is not stored.
| p50 TTFT | 1382 ms |
|---|---|
| p95 TTFT | ms |
| p50 TTFB | ms |
| Throughput | 43 tok/s |
| Uptime | 95.94% |
Measured model routes
| Model | p50 TTFT | p50 TTFB | Throughput | Uptime | Config excluded | Samples |
|---|---|---|---|---|---|---|
| sao10k/l3-8b-lunaris | 441 ms | 439 ms | — | 100.00% | — | 3 |
| deepseek/deepseek-ocr-2 | 656 ms | 656 ms | — | 100.00% | — | 2 |
| meta-llama/llama-3.3-70b-instruct | 662 ms | 661 ms | — | 100.00% | — | 2 |
| qwen/qwen3-vl-235b-a22b-instruct | 687 ms | 684 ms | — | 100.00% | — | 4 |
| qwen/qwen-2.5-72b-instruct | 703 ms | 702 ms | — | 100.00% | — | 2 |
| qwen/qwen3.5-35b-a3b | 732 ms | 731 ms | — | 100.00% | — | 2 |
| qwen/qwen-mt-plus | 818 ms | 817 ms | — | 100.00% | — | 1 |
| meta-llama/llama-4-scout-17b-16e-instruct | 830 ms | 728 ms | — | 100.00% | — | 3 |
| sao10k/l31-70b-euryale-v2.2 | 849 ms | 846 ms | — | 100.00% | — | 2 |
| zai-org/glm-4.5 | 869 ms | 868 ms | — | 100.00% | — | 2 |
| moonshotai/kimi-k2.5 | 896 ms | 895 ms | — | 100.00% | — | 2 |
| qwen/qwen3-next-80b-a3b-instruct | 906 ms | 904 ms | — | 100.00% | — | 2 |
| zai-org/glm-4.5-air | 907 ms | 906 ms | — | 100.00% | — | 4 |
| zai-org/autoglm-phone-9b-multilingual | 928 ms | 927 ms | — | 100.00% | — | 5 |
| openai/gpt-oss-20b | 930 ms | 927 ms | — | 100.00% | — | 5 |
| deepseek/deepseek-v4-pro | 934 ms | 932 ms | — | 100.00% | — | 4 |
| qwen/qwen3-coder-480b-a35b-instruct | 942 ms | 941 ms | — | 100.00% | — | 4 |
| qwen/qwen3-vl-8b-instruct | 966 ms | 862 ms | — | 100.00% | — | 3 |
| zai-org/glm-4.5v | 1016 ms | 913 ms | — | 100.00% | — | 4 |
| qwen/qwen3.6-27b | 1024 ms | 1023 ms | — | 100.00% | — | 3 |
| meta-llama/llama-4-maverick-17b-128e-instruct-fp8 | 1050 ms | 946 ms | — | 100.00% | — | 3 |
| zai-org/glm-4.6v | 1063 ms | 1019 ms | — | 100.00% | — | 4 |
| qwen/qwen3-coder-30b-a3b-instruct | 1072 ms | 1071 ms | — | 100.00% | — | 4 |
| qwen/qwen3-omni-30b-a3b-instruct | 1077 ms | 1076 ms | — | 100.00% | — | 2 |
| google/gemma-4-31b-it | 1087 ms | 981 ms | — | 100.00% | — | 6 |
| qwen/qwen3-omni-30b-a3b-thinking | 1108 ms | 1107 ms | — | 100.00% | — | 1 |
| minimax/minimax-m2.1 | 1109 ms | 1108 ms | — | 100.00% | — | 2 |
| qwen/qwen3-vl-30b-a3b-thinking | 1118 ms | 1117 ms | — | 100.00% | — | 3 |
| qwen/qwen3-next-80b-a3b-thinking | 1124 ms | 1122 ms | — | 100.00% | — | 6 |
| meta-llama/llama-3.1-8b-instruct | 1127 ms | 1048 ms | — | 100.00% | — | 7 |
| qwen/qwen3-235b-a22b-fp8 | 1209 ms | 1209 ms | — | 100.00% | — | 5 |
| qwen/qwen3.6-35b-a3b | 1211 ms | 1108 ms | — | 100.00% | — | 6 |
| mistralai/mistral-nemo | 1217 ms | 1215 ms | — | 100.00% | — | 1 |
| deepseek/deepseek-v4-flash | 1241 ms | 1187 ms | — | 100.00% | — | 5 |
| meta-llama/llama-3-70b-instruct | 1259 ms | 1257 ms | — | 100.00% | — | 2 |
| qwen/qwen3-235b-a22b-thinking-2507 | 1264 ms | 1263 ms | — | 100.00% | — | 3 |
| qwen/qwen3.5-122b-a10b | 1265 ms | 1160 ms | — | 100.00% | — | 1 |
| google/gemma-4-26b-a4b-it | 1268 ms | 1165 ms | — | 100.00% | — | 5 |
| inclusionai/ling-2.6-flash | 1284 ms | 1194 ms | — | 100.00% | — | 6 |
| google/gemma-3-27b-it | 1304 ms | 1202 ms | — | 50.00% | — | 4 |
| Sao10K/L3-8B-Stheno-v3.2 | 1330 ms | 1227 ms | — | 100.00% | — | 2 |
| qwen/qwen3-coder-next | 1345 ms | 1243 ms | — | 100.00% | — | 3 |
| minimax/minimax-m2 | 1355 ms | 1278 ms | — | 100.00% | — | 4 |
| kwaipilot/kat-coder-pro | 1358 ms | 1302 ms | — | 100.00% | — | 4 |
| qwen/qwen3-vl-30b-a3b-instruct | 1373 ms | 1372 ms | — | 100.00% | — | 5 |
| qwen/qwen3.5-27b | 1382 ms | 1279 ms | — | 100.00% | — | 2 |
| microsoft/wizardlm-2-8x22b | 1407 ms | 1304 ms | — | 100.00% | — | 3 |
| qwen/qwen3-vl-235b-a22b-thinking | 1414 ms | 1311 ms | — | 100.00% | — | 4 |
| deepseek/deepseek-ocr | 1417 ms | 1416 ms | — | 100.00% | — | 6 |
| moonshotai/kimi-k2.6 | 1452 ms | 1349 ms | 32 tok/s | 100.00% | — | 4 |
| minimaxai/minimax-m1-80k | 1462 ms | 1461 ms | — | 100.00% | — | 1 |
| qwen/qwen3.5-397b-a17b | 1502 ms | 1398 ms | — | 100.00% | — | 5 |
| openai/gpt-oss-120b | 1517 ms | 1413 ms | — | 100.00% | — | 3 |
| moonshotai/kimi-k2-thinking | 1523 ms | 1419 ms | — | 100.00% | — | 7 |
| deepseek/deepseek-v3-turbo | 1524 ms | 1523 ms | — | 100.00% | — | 7 |
| qwen/qwen3-max | 1590 ms | 1488 ms | — | 100.00% | — | 3 |
| zai-org/glm-4.7 | 1615 ms | 1614 ms | — | 100.00% | — | 4 |
| minimax/minimax-m2.7 | 1625 ms | 1536 ms | — | 100.00% | — | 5 |
| deepseek/deepseek-v3.2 | 1632 ms | 1630 ms | — | 100.00% | — | 3 |
| qwen/qwen3-235b-a22b-instruct-2507 | 1659 ms | 1600 ms | — | 100.00% | — | 7 |
| deepseek/deepseek-v3-0324 | 1669 ms | 1566 ms | — | 100.00% | — | 3 |
| inclusionai/ring-2.6-1t | 1723 ms | 1677 ms | — | 100.00% | — | 3 |
| zai-org/glm-5 | 1723 ms | 1722 ms | — | 100.00% | — | 3 |
| deepseek/deepseek-v3.1-terminus | 1757 ms | 1655 ms | — | 100.00% | — | 3 |
| zai-org/glm-4.6 | 1757 ms | 1653 ms | — | 100.00% | — | 1 |
| xiaomimimo/mimo-v2.5-pro | 1761 ms | 1759 ms | — | 100.00% | — | 3 |
| deepseek/deepseek-prover-v2-671b | 1767 ms | 1664 ms | — | 100.00% | — | 3 |
| inclusionai/ling-2.6-1t | 1770 ms | 1666 ms | — | 100.00% | — | 3 |
| deepseek/deepseek-r1-distill-llama-70b | 1771 ms | 1666 ms | — | 100.00% | — | 1 |
| deepseek/deepseek-v3.2-exp | 1778 ms | 1676 ms | — | 100.00% | — | 9 |
| moonshotai/kimi-k2-instruct | 1928 ms | 1909 ms | — | 100.00% | — | 5 |
| deepseek/deepseek-v3.1 | 1931 ms | 1920 ms | — | 100.00% | — | 4 |
| moonshotai/kimi-k2.7-code | 1954 ms | 1380 ms | 43 tok/s | 100.00% | — | 30 |
| deepseek/deepseek-r1-turbo | 1974 ms | 1973 ms | — | 100.00% | — | 2 |
| baidu/ernie-4.5-vl-424b-a47b | 1993 ms | 1888 ms | — | 100.00% | — | 5 |
| moonshotai/kimi-k2-0905 | 2031 ms | 1927 ms | — | 100.00% | — | 2 |
| minimax/minimax-m2.5-highspeed | 2070 ms | 1966 ms | — | 100.00% | — | 3 |
| zai-org/glm-5.1 | 2102 ms | 2000 ms | — | 100.00% | — | 5 |
| deepseek/deepseek-r1-0528 | 2267 ms | 2164 ms | — | 100.00% | — | 3 |
| zai-org/glm-4.7-flash | 7732 ms | 7730 ms | — | 50.00% | — | 2 |
| google/gemma-3-12b-it | — | — | — | 0.00% | — | 2 |
| baidu/ernie-4.5-vl-28b-a3b | — | — | — | 0.00% | — | 3 |
| elephant | — | — | — | 0.00% | — | 1 |
| baidu/ernie-4.5-21B-a3b | — | — | — | 0.00% | — | 2 |
| baichuan/baichuan-m2-32b | — | — | — | 0.00% | — | 2 |