Measured performance

Provider & model performance

Measured time-to-first-token, time-to-first-byte, throughput, and uptime for every LLM provider and model TrustedRouter routes to — continuously sampled, not vendor-claimed.

Last updated 2026-06-09T13:26:14Z
Continuously sampled from TrustedRouter's monitor regions over the 5,000-sample benchmark set — time-to-first-token (TTFT), time-to-first-byte (TTFB), throughput, and success rate measured on real streaming requests, not vendor-claimed. Unsupported route and probe-configuration rows are reported separately and do not count as provider downtime. No prompt or output content is ever stored.

Providers

Ranked by measured p50 time-to-first-token across all of a provider's models in the 5,000-sample benchmark set (22 providers · 2770 samples).

#ProviderModels p50 TTFTThroughputUptimeErrorsConfig excludedSamples
1 mistral 8 811 ms 100.00% 131
2 lightning 1 885 ms 100.00% 125
3 deepinfra 7 945 ms 99.28% provider_error 1% 139
4 grok 2 985 ms 100.00% 144
5 cerebras 4 992 ms 91.06% provider_error 9% 123
6 parasail 24 1029 ms 85.19% provider_error 15% 3 probe_config_error 108
7 gemini 5 1060 ms 100.00% 12 probe_config_error 73
8 together 3 1178 ms 98.45% provider_error 2% 129
9 phala 18 1206 ms 96.77% provider_error 3% 124
10 openai 11 1221 ms 100.00% 135
11 tinfoil 5 1258 ms 84.06% provider_error 16% 138
12 venice 11 1354 ms 98.55% provider_error 1% 138
13 zai 11 1494 ms 96.80% provider_error 3% 125
14 novita 71 1504 ms 92.48% provider_error 8% 133
15 deepseek 2 1540 ms 100.00% 130
16 nebius 24 1562 ms 100.00% 1 probe_config_error 107
17 minimax 6 1662 ms 100.00% 137
18 kimi 2 1691 ms 93.10% provider_error 7% 116
19 anthropic 10 1717 ms 45 tok/s 100.00% 126
20 siliconflow 7 1882 ms 90.98% provider_error 9% 122
21 xiaomi 5 2360 ms 378 tok/s 100.00% 140
22 gmi 5 3126 ms 84.25% empty_stream 16% 127

Models

Models sampled in the 5,000-sample benchmark set, fastest measured TTFT first. Rows with few samples are marked — more data sharpens the numbers.

#ModelProvider p50 TTFTp95 TTFTp50 TTFB ThroughputUptimeConfig excludedSamples
1 nvidia/Llama-3_1-Nemotron-Ultra-253B-v1 limited data nebius 458 ms 575 ms 457 ms 100.00% 2
2 qwen/qwen3.5-27b deepinfra 470 ms 1021 ms 469 ms 100.00% 21
3 qwen/qwen3-next-80b-a3b-instruct limited data parasail 636 ms 652 ms 634 ms 100.00% 3
4 mistralai/mistral-small-2603 limited data mistral 661 ms 1472 ms 659 ms 100.00% 12
5 meta-llama/llama-3-70b-instruct limited data novita 695 ms 1303 ms 693 ms 100.00% 2
6 qwen/qwen3-vl-30b-a3b-instruct limited data phala 695 ms 8274 ms 693 ms 100.00% 6
7 qwen/qwen2.5-vl-72b-instruct limited data phala 711 ms 5230 ms 709 ms 100.00% 9
8 mistralai/mistral-small-3.2-24b-instruct limited data parasail 715 ms 1161 ms 714 ms 100.00% 4
9 thedrummer/cydonia-24b-v4.1 limited data parasail 715 ms 875 ms 713 ms 100.00% 3
10 mistralai/ministral-8b-2512 limited data mistral 721 ms 1157 ms 720 ms 100.00% 19
11 Sao10K/L3-8B-Stheno-v3.2 limited data novita 728 ms 728 ms 726 ms 100.00% 1
12 mistralai/mistral-medium-3-5 limited data mistral 771 ms 1060 ms 769 ms 100.00% 19
13 google/gemma-3-27b-it limited data phala 800 ms 1312 ms 788 ms 100.00% 7
14 mistralai/ministral-3b-2512 limited data mistral 811 ms 1286 ms 769 ms 100.00% 17
15 openai/gpt-oss-120b limited data parasail 814 ms 1289 ms 811 ms 100.00% 4
16 meta-llama/llama-4-maverick limited data parasail 815 ms 1060 ms 712 ms 100.00% 4
17 google/gemma-3-27b-it limited data novita 828 ms 828 ms 826 ms 100.00% 1
18 mistralai/mistral-large limited data mistral 830 ms 1909 ms 828 ms 100.00% 12
19 meta-llama/Llama-3.3-70B-Instruct limited data nebius 830 ms 11370 ms 829 ms 100.00% 4
20 cerebras/gpt-oss-120b cerebras 839 ms 2480 ms 759 ms 80.77% 26
21 qwen/qwen-2.5-7b-instruct limited data phala 845 ms 1485 ms 742 ms 83.33% 12
22 deepseek/deepseek-v3.2 limited data parasail 846 ms 3370 ms 844 ms 100.00% 5
23 mistralai/mistral-small-3.2-24b-instruct limited data mistral 847 ms 1107 ms 768 ms 100.00% 18
24 mistralai/ministral-14b-2512 limited data mistral 854 ms 1687 ms 751 ms 100.00% 19
25 Qwen/Qwen2.5-VL-72B-Instruct limited data nebius 859 ms 7011 ms 858 ms 100.00% 5
26 qwen/qwen-2.5-7b-instruct together 861 ms 1224 ms 763 ms 100.00% 38
27 openai/gpt-4o limited data openai 864 ms 6080 ms 780 ms 100.00% 15
28 z-ai/glm-5.1 limited data parasail 867 ms 1486 ms 864 ms 100.00% 3
29 bytedance/ui-tars-1.5-7b limited data parasail 869 ms 1294 ms 765 ms 100.00% 5
30 google/gemma-4-26b-a4b-it limited data deepinfra 876 ms 7029 ms 820 ms 100.00% 13
31 openai/gpt-4.1-mini limited data openai 883 ms 7254 ms 880 ms 100.00% 8
32 Qwen/Qwen3-32B limited data nebius 883 ms 1796 ms 881 ms 100.00% 4
33 google/gemma-4-31b-it lightning 885 ms 1408 ms 783 ms 100.00% 125
34 thedrummer/skyfall-36b-v2 limited data parasail 888 ms 899 ms 788 ms 100.00% 3
35 z-ai/glm-4.7-flash limited data venice 893 ms 2757 ms 891 ms 100.00% 8
36 mistralai/mistral-nemo limited data mistral 895 ms 1737 ms 791 ms 100.00% 15
37 google/gemini-3.1-flash-lite-preview gemini 899 ms 1333 ms 897 ms 100.00% 20
38 NousResearch/Hermes-4-70B limited data nebius 905 ms 1211 ms 902 ms 100.00% 4
39 openai/gpt-oss-120b cerebras 912 ms 10113 ms 808 ms 73.91% 23
40 sao10k/l3-8b-lunaris limited data novita 913 ms 913 ms 808 ms 100.00% 1
41 meta-llama/llama-3.3-70b-instruct tinfoil 921 ms 1337 ms 818 ms 100.00% 23
42 qwen/qwen3-30b-a3b-instruct-2507 limited data phala 930 ms 1792 ms 827 ms 100.00% 4
43 google/gemma-3-4b-it deepinfra 933 ms 6841 ms 830 ms 100.00% 25
44 openai/gpt-4.1-nano limited data openai 936 ms 6064 ms 833 ms 100.00% 15
45 Qwen/Qwen3-235B-A22B-Instruct-2507 limited data nebius 944 ms 1160 ms 943 ms 100.00% 4
46 google/gemma-4-31b-it limited data deepinfra 945 ms 2366 ms 843 ms 93.75% 16
47 meta-llama/llama-3.1-8b-instruct limited data novita 972 ms 1202 ms 971 ms 100.00% 2
48 openai/gpt-oss-120b tinfoil 976 ms 1336 ms 974 ms 100.00% 24
49 meta-llama/llama-3.1-70b-instruct deepinfra 977 ms 1253 ms 874 ms 100.00% 23
50 google/gemma-3-27b-it deepinfra 978 ms 2105 ms 889 ms 100.00% 24
51 qwen/qwen3.5-9b limited data venice 979 ms 1412 ms 876 ms 85.71% 14
52 meta-llama/llama-3.3-70b-instruct limited data novita 982 ms 1258 ms 980 ms 100.00% 2
53 x-ai/grok-4.20 grok 985 ms 3208 ms 887 ms 100.00% 73
54 qwen/qwen3-coder-next limited data parasail 986 ms 1082 ms 883 ms 100.00% 3
55 meta-llama/llama-4-scout-17b-16e-instruct limited data novita 988 ms 988 ms 885 ms 100.00% 1
56 cerebras/zai-glm-4.7 cerebras 992 ms 2284 ms 975 ms 100.00% 33
57 google/gemma-3-12b-it limited data deepinfra 992 ms 1435 ms 890 ms 100.00% 17
58 google/gemma-4-26b-a4b-it limited data parasail 1000 ms 3413 ms 897 ms 100.00% 6
59 qwen/qwen3-vl-8b-instruct limited data parasail 1001 ms 1443 ms 898 ms 100.00% 5
60 openai/gpt-oss-120b limited data nebius 1015 ms 1439 ms 1014 ms 100.00% 4
61 qwen/qwen3-vl-235b-a22b-instruct limited data parasail 1029 ms 2340 ms 1027 ms 100.00% 4
62 meta-llama/llama-3.3-70b-instruct limited data parasail 1041 ms 1473 ms 1039 ms 100.00% 4
63 inclusionai/ling-2.6-flash limited data novita 1045 ms 1607 ms 1044 ms 100.00% 2
64 z-ai/glm-4.7 cerebras 1052 ms 2824 ms 986 ms 100.00% 41
65 google/gemini-2.5-flash-lite limited data gemini 1055 ms 1242 ms 954 ms 100.00% 13
66 zai-org/autoglm-phone-9b-multilingual limited data novita 1057 ms 1057 ms 1056 ms 100.00% 1
67 google/gemini-2.5-flash limited data gemini 1060 ms 1493 ms 1026 ms 100.00% 16
68 openai/gpt-oss-120b limited data phala 1070 ms 5625 ms 965 ms 100.00% 13
69 google/gemma-3-27b-it limited data parasail 1081 ms 1278 ms 976 ms 100.00% 3
70 NousResearch/Hermes-4-405B limited data nebius 1089 ms 1214 ms 1087 ms 100.00% 2
71 qwen/qwen3-vl-30b-a3b-instruct limited data novita 1093 ms 1093 ms 989 ms 100.00% 1
72 openai/gpt-4.1 limited data openai 1097 ms 7417 ms 994 ms 100.00% 14
73 qwen/qwen3-235b-a22b-thinking-2507 limited data venice 1102 ms 2494 ms 998 ms 100.00% 13
74 openai/gpt-oss-120b limited data novita 1109 ms 1537 ms 1006 ms 100.00% 2
75 openai/gpt-oss-20b limited data parasail 1110 ms 1255 ms 1007 ms 100.00% 2
76 qwen/qwen3.6-27b limited data venice 1114 ms 2727 ms 1012 ms 100.00% 12
77 anthropic/claude-haiku-4.5 limited data anthropic 1119 ms 1426 ms 1015 ms 100.00% 9
78 minimaxai/minimax-m1-80k limited data novita 1132 ms 2188 ms 1131 ms 100.00% 2
79 qwen/qwen3.6-27b limited data novita 1137 ms 1732 ms 1135 ms 100.00% 2
80 inclusionai/ling-2.6-1t limited data novita 1156 ms 8026 ms 1155 ms 100.00% 4
81 google/gemini-3.1-flash-lite limited data gemini 1163 ms 1646 ms 1060 ms 100.00% 13
82 qwen/qwen3-vl-30b-a3b-thinking limited data novita 1172 ms 1997 ms 1170 ms 100.00% 3
83 google/gemma-4-26b-a4b-it limited data novita 1174 ms 1864 ms 1072 ms 100.00% 2
84 moonshotai/kimi-k2.6 together 1178 ms 8385 ms 1176 ms 95.74% 47
85 z-ai/glm-5-turbo limited data zai 1178 ms 2481 ms 1177 ms 100.00% 6
86 google/gemma-3-27b-it limited data nebius 1181 ms 1285 ms 1107 ms 100.00% 5
87 meta-llama/llama-3.3-70b-instruct together 1188 ms 2377 ms 1084 ms 100.00% 44
88 anthropic/claude-opus-4.7 limited data anthropic 1190 ms 1534 ms 1188 ms 45 tok/s 100.00% 16
89 openai/gpt-5.4-mini limited data openai 1202 ms 3606 ms 1114 ms 100.00% 13
90 qwen/qwen3.5-27b limited data phala 1206 ms 4082 ms 1132 ms 100.00% 11
91 x-ai/grok-4.3 grok 1207 ms 2256 ms 1105 ms 100.00% 71
92 qwen/qwen3-vl-235b-a22b-thinking limited data novita 1212 ms 1977 ms 1108 ms 100.00% 3
93 z-ai/glm-4.5 limited data zai 1213 ms 2619 ms 1211 ms 100.00% 12
94 moonshotai/Kimi-K2.5-fast limited data nebius 1213 ms 1213 ms 1212 ms 100.00% 1
95 openai/gpt-4o-mini limited data openai 1221 ms 3383 ms 1117 ms 100.00% 13
96 qwen/qwen2.5-vl-72b-instruct limited data parasail 1225 ms 11062 ms 1123 ms 70.00% 10
97 qwen/qwen-mt-plus limited data novita 1232 ms 1232 ms 1129 ms 100.00% 1
98 qwen/qwen3.6-35b-a3b limited data novita 1235 ms 1543 ms 1232 ms 100.00% 2
99 Qwen/Qwen3-30B-A3B-Instruct-2507 limited data nebius 1236 ms 1498 ms 1134 ms 100.00% 5
100 deepseek/deepseek-prover-v2-671b limited data novita 1254 ms 1254 ms 1252 ms 100.00% 1
101 deepseek/deepseek-v4-pro tinfoil 1258 ms 5748 ms 1195 ms 100.00% 31
102 z-ai/glm-4.5-air:free limited data zai 1267 ms 8345 ms 1265 ms 100.00% 9
103 z-ai/glm-5.1 limited data venice 1285 ms 5733 ms 1182 ms 100.00% 15
104 z-ai/glm-4.5-air limited data zai 1288 ms 4443 ms 1286 ms 100.00% 16
105 moonshotai/kimi-k2.5 limited data novita 1296 ms 5895 ms 1295 ms 100.00% 2
106 zai-org/glm-4.5-air limited data novita 1303 ms 1756 ms 1201 ms 100.00% 3
107 deepseek/deepseek-v4-flash deepseek 1337 ms 2053 ms 1256 ms 100.00% 62
108 minimax/minimax-m2 limited data minimax 1339 ms 2165 ms 1265 ms 100.00% 18
109 z-ai/glm-4.6 limited data venice 1354 ms 1968 ms 1340 ms 100.00% 11
110 openai/o3 limited data openai 1366 ms 1953 ms 1270 ms 100.00% 13
111 google/gemma-4-31b-it limited data novita 1394 ms 3149 ms 1289 ms 100.00% 3
112 z-ai/glm-4.7 limited data venice 1396 ms 3133 ms 1388 ms 100.00% 16
113 qwen/qwen3-omni-30b-a3b-instruct limited data novita 1398 ms 1398 ms 1295 ms 100.00% 1
114 minimax/minimax-m2.5 limited data novita 1401 ms 1401 ms 1399 ms 100.00% 1
115 xiaomi/mimo-v2-flash xiaomi 1416 ms 2598 ms 1415 ms 100.00% 24
116 deepseek/deepseek-chat-v3.1 limited data phala 1417 ms 2524 ms 1415 ms 100.00% 5
117 minimax/minimax-m2 limited data novita 1423 ms 1423 ms 1319 ms 100.00% 1
118 google/gemma-4-31b-it limited data parasail 1426 ms 19795 ms 1425 ms 75.00% 8
119 arcee-ai/trinity-large-thinking limited data parasail 1428 ms 1590 ms 1325 ms 100.00% 6
120 moonshotai/kimi-k2.6 tinfoil 1433 ms 2900 ms 1382 ms 100.00% 25
121 google/gemma-4-26b-a4b-it limited data gmi 1435 ms 14298 ms 1434 ms 63.16% 19
122 qwen/qwen3-coder-480b-a35b-instruct limited data novita 1439 ms 1879 ms 1438 ms 100.00% 2
123 moonshotai/kimi-k2.6 limited data phala 1445 ms 4619 ms 1392 ms 100.00% 6
124 z-ai/glm-5 limited data venice 1448 ms 1633 ms 1344 ms 100.00% 15
125 deepseek-ai/DeepSeek-V3.2 limited data nebius 1449 ms 1747 ms 1447 ms 100.00% 3
126 z-ai/glm-4.7-flash limited data phala 1450 ms 7565 ms 1346 ms 100.00% 7
127 z-ai/glm-4.5v limited data zai 1464 ms 10717 ms 1362 ms 82.35% 17
128 anthropic/claude-opus-4.8 limited data anthropic 1465 ms 2365 ms 1464 ms 100.00% 9
129 qwen/qwen3-next-80b-a3b-instruct limited data novita 1477 ms 1850 ms 1374 ms 100.00% 4
130 microsoft/wizardlm-2-8x22b limited data novita 1478 ms 1664 ms 1476 ms 100.00% 2
131 xiaomi/mimo-v2.5-pro-ultraspeed xiaomi 1482 ms 2795 ms 1662 ms 378 tok/s 100.00% 38
132 qwen/qwen3-235b-a22b-instruct-2507 limited data novita 1482 ms 2366 ms 1480 ms 100.00% 2
133 openai/gpt-5.5 limited data openai 1486 ms 2095 ms 1383 ms 100.00% 11
134 z-ai/glm-4.6v limited data zai 1494 ms 8278 ms 1400 ms 100.00% 15
135 deepseek/deepseek-ocr-2 limited data novita 1495 ms 2138 ms 1391 ms 100.00% 4
136 minimax/minimax-m2.7 minimax 1495 ms 3621 ms 1392 ms 100.00% 22
137 z-ai/glm-5v-turbo limited data zai 1499 ms 1835 ms 1397 ms 100.00% 10
138 deepseek/deepseek-v3.2-exp limited data novita 1504 ms 1504 ms 1502 ms 100.00% 1
139 mistralai/mistral-nemo limited data novita 1527 ms 1527 ms 1524 ms 100.00% 1
140 qwen/qwen3-next-80b-a3b-thinking limited data novita 1534 ms 1812 ms 1431 ms 100.00% 2
141 openai/o4-mini limited data openai 1536 ms 2083 ms 1534 ms 100.00% 9
142 deepseek/deepseek-v4-pro deepseek 1540 ms 2316 ms 1473 ms 100.00% 68
143 deepseek/deepseek-v4-pro limited data siliconflow 1543 ms 2147 ms 1439 ms 100.00% 11
144 deepseek-ai/DeepSeek-V4-Pro limited data nebius 1544 ms 4096 ms 1485 ms 100.00% 9
145 openai/gpt-oss-20b limited data phala 1551 ms 1580 ms 1448 ms 100.00% 3
146 z-ai/glm-5 limited data siliconflow 1554 ms 2317 ms 1472 ms 100.00% 15
147 nvidia/nemotron-3-super-120b-a12b limited data nebius 1562 ms 1825 ms 1560 ms 100.00% 3
148 qwen/qwen3-235b-a22b-thinking-2507 limited data novita 1574 ms 1574 ms 1471 ms 100.00% 1
149 qwen/qwen3.5-397b-a17b limited data novita 1581 ms 1597 ms 1495 ms 100.00% 2
150 qwen/qwen3-coder-next limited data novita 1586 ms 2176 ms 1584 ms 100.00% 2
151 anthropic/claude-opus-4.5 limited data anthropic 1604 ms 2710 ms 1603 ms 100.00% 13
152 deepseek/deepseek-v3-0324 limited data novita 1605 ms 2336 ms 1604 ms 100.00% 4
153 minimax/minimax-m2.1 limited data novita 1629 ms 1629 ms 1525 ms 100.00% 1
154 moonshotai/kimi-k2-thinking limited data novita 1635 ms 1852 ms 1531 ms 100.00% 2
155 minimax/minimax-m2.1-highspeed minimax 1639 ms 3343 ms 1620 ms 100.00% 26
156 qwen/qwen3.5-397b-a17b limited data venice 1641 ms 8745 ms 1614 ms 100.00% 16
157 deepseek/deepseek-v4-flash limited data siliconflow 1653 ms 14180 ms 1549 ms 100.00% 17
158 minimax/minimax-m2.7-highspeed limited data minimax 1662 ms 8083 ms 1633 ms 100.00% 19
159 Qwen/Qwen3-235B-A22B-Thinking-2507-fast limited data nebius 1671 ms 1827 ms 1568 ms 100.00% 5
160 qwen/qwen3.5-122b-a10b limited data novita 1685 ms 1685 ms 1582 ms 100.00% 1
161 minimax/minimax-m2.5-highspeed minimax 1689 ms 3988 ms 1568 ms 100.00% 30
162 moonshotai/kimi-k2.5 kimi 1691 ms 3441 ms 1632 ms 95.45% 66
163 qwen/qwen3-vl-8b-instruct limited data novita 1706 ms 1706 ms 1602 ms 100.00% 1
164 minimax/minimax-m2.5 limited data phala 1709 ms 2231 ms 1707 ms 100.00% 3
165 anthropic/claude-sonnet-4 limited data anthropic 1717 ms 2331 ms 1668 ms 100.00% 17
166 deepseek/deepseek-v4-flash limited data novita 1725 ms 2559 ms 1621 ms 100.00% 2
167 z-ai/glm-5-turbo limited data venice 1733 ms 8245 ms 1629 ms 100.00% 8
168 openai/gpt-oss-120b-fast limited data nebius 1734 ms 1899 ms 1631 ms 100.00% 3
169 Qwen/Qwen3-Next-80B-A3B-Thinking-fast limited data nebius 1745 ms 1856 ms 1642 ms 100.00% 3
170 MiniMaxAI/MiniMax-M2.5-fast limited data nebius 1760 ms 1995 ms 1757 ms 100.00% 2
171 deepseek-ai/DeepSeek-V3.2-fast limited data nebius 1766 ms 2497 ms 1662 ms 100.00% 8
172 qwen/qwen3-omni-30b-a3b-thinking limited data novita 1771 ms 1986 ms 1667 ms 100.00% 3
173 minimax/minimax-m2.7 limited data novita 1778 ms 1909 ms 1675 ms 100.00% 2
174 Qwen/Qwen3-Next-80B-A3B-Thinking limited data nebius 1784 ms 2076 ms 1774 ms 100.00% 9
175 moonshotai/Kimi-K2.5 limited data nebius 1786 ms 2623 ms 1755 ms 100.00% 1 probe_config_error 5
176 deepseek/deepseek-v4-pro limited data novita 1791 ms 1791 ms 1687 ms 100.00% 1
177 z-ai/glm-5.1 limited data phala 1798 ms 4624 ms 1797 ms 100.00% 4
178 deepseek/deepseek-v3.1-terminus limited data novita 1803 ms 2207 ms 1802 ms 100.00% 2
179 Qwen/Qwen3.5-397B-A17B-fast limited data nebius 1804 ms 2034 ms 1699 ms 100.00% 6
180 moonshotai/kimi-k2-0905 limited data novita 1834 ms 2375 ms 1832 ms 100.00% 4
181 z-ai/glm-5v-turbo limited data siliconflow 1882 ms 3596 ms 1780 ms 84.21% 19
182 qwen/qwen3.5-27b limited data novita 1897 ms 4113 ms 1795 ms 100.00% 2
183 anthropic/claude-sonnet-4.6 limited data anthropic 1925 ms 3056 ms 1822 ms 100.00% 12
184 z-ai/glm-4.7 limited data zai 1950 ms 2531 ms 1948 ms 100.00% 11
185 deepseek/deepseek-v3-turbo limited data novita 1953 ms 1953 ms 1849 ms 100.00% 1
186 xiaomimimo/mimo-v2.5-pro limited data novita 1955 ms 2860 ms 1953 ms 100.00% 2
187 anthropic/claude-sonnet-4.5 limited data anthropic 1969 ms 3721 ms 1885 ms 100.00% 15
188 minimax/minimax-m2.5-highspeed limited data novita 1975 ms 2132 ms 1871 ms 100.00% 2
189 moonshotai/kimi-k2.6 kimi 1988 ms 2589 ms 1897 ms 90.00% 50
190 deepseek/deepseek-v3.1 limited data novita 2007 ms 2369 ms 2006 ms 100.00% 3
191 openai/o3-mini limited data openai 2015 ms 2467 ms 2012 ms 100.00% 12
192 baidu/ernie-4.5-vl-424b-a47b limited data novita 2036 ms 2036 ms 1933 ms 100.00% 1
193 z-ai/glm-4.6 limited data zai 2074 ms 2740 ms 2044 ms 92.86% 14
194 moonshotai/kimi-k2.6 limited data novita 2097 ms 2097 ms 1993 ms 100.00% 1
195 zai-org/GLM-5 limited data nebius 2100 ms 2581 ms 1997 ms 100.00% 5
196 tencent/hunyuan-a13b-instruct limited data siliconflow 2134 ms 3382 ms 2031 ms 100.00% 17
197 minimax/minimax-m3 siliconflow 2135 ms 5025 ms 2134 ms 61.90% 21
198 deepseek/deepseek-r1-0528 limited data novita 2138 ms 2138 ms 2136 ms 100.00% 1
199 anthropic/claude-opus-4 limited data anthropic 2161 ms 4938 ms 2058 ms 100.00% 14
200 zai-org/glm-4.7 limited data novita 2180 ms 2180 ms 2178 ms 100.00% 1
201 moonshotai/kimi-k2-instruct limited data novita 2187 ms 2187 ms 2084 ms 100.00% 1
202 zai-org/glm-4.6v limited data novita 2209 ms 2209 ms 2107 ms 100.00% 1
203 deepseek/deepseek-ocr limited data novita 2227 ms 2227 ms 2123 ms 100.00% 1
204 anthropic/claude-opus-4.6 limited data anthropic 2255 ms 2747 ms 2152 ms 100.00% 12
205 google/gemini-3.1-pro-preview limited data gemini 2255 ms 3989 ms 2254 ms 100.00% 12 probe_config_error 11
206 moonshotai/kimi-k2.5 limited data phala 2264 ms 4817 ms 2160 ms 100.00% 8
207 tencent/hy3-preview siliconflow 2274 ms 3280 ms 2171 ms 100.00% 22
208 deepseek/deepseek-r1-turbo limited data novita 2284 ms 2400 ms 2180 ms 100.00% 3
209 deepseek/deepseek-v3.2 limited data novita 2301 ms 2811 ms 2199 ms 100.00% 3
210 z-ai/glm-4.7 limited data phala 2311 ms 2900 ms 2288 ms 100.00% 7
211 z-ai/glm-5 limited data phala 2322 ms 16618 ms 2217 ms 66.67% 6
212 xiaomi/mimo-v2.5-pro xiaomi 2360 ms 4561 ms 2257 ms 100.00% 31
213 moonshotai/kimi-k2.6 limited data parasail 2361 ms 7775 ms 2257 ms 66.67% 6
214 minimax/minimax-m3 minimax 2363 ms 4595 ms 2350 ms 100.00% 22
215 zai-org/glm-4.6 limited data novita 2374 ms 2759 ms 2371 ms 100.00% 4
216 z-ai/glm-5v-turbo limited data venice 2385 ms 6973 ms 2383 ms 100.00% 10
217 anthropic/claude-opus-4.1 limited data anthropic 2407 ms 3439 ms 2304 ms 100.00% 9
218 qwen/qwen3.5-397b-a17b limited data phala 2443 ms 3305 ms 2340 ms 100.00% 9
219 xiaomi/mimo-v2-pro xiaomi 2472 ms 3635 ms 2368 ms 100.00% 20
220 deepseek/deepseek-v3.2 limited data phala 2530 ms 3700 ms 2529 ms 100.00% 4
221 xiaomi/mimo-v2.5 xiaomi 2706 ms 4143 ms 2696 ms 100.00% 27
222 z-ai/glm-5 limited data zai 2729 ms 3942 ms 2627 ms 100.00% 9
223 z-ai/glm-5.1 limited data zai 2781 ms 4838 ms 2679 ms 100.00% 6
224 zai-org/glm-5 limited data novita 2952 ms 2952 ms 2849 ms 100.00% 1
225 deepseek/deepseek-v4-pro gmi 2981 ms 5094 ms 2958 ms 96.67% 30
226 google/gemma-4-31b-it gmi 3126 ms 11709 ms 3072 ms 69.57% 23
227 openai/o1 limited data openai 3194 ms 4254 ms 3092 ms 100.00% 12
228 moonshotai/kimi-k2.5 limited data parasail 3422 ms 7893 ms 3421 ms 100.00% 5
229 z-ai/glm-5 gmi 3688 ms 9982 ms 3585 ms 84.62% 26
230 z-ai/glm-5.1 tinfoil 3705 ms 15106 ms 3680 ms 37.14% 35
231 zai-org/GLM-5.1 limited data nebius 3899 ms 11386 ms 3796 ms 100.00% 6
232 qwen/qwen3-max limited data novita 4759 ms 4759 ms 4657 ms 100.00% 1
233 z-ai/glm-5.1 gmi 5402 ms 13482 ms 5298 ms 96.55% 29
234 z-ai/glm-4.7 limited data parasail 6293 ms 7384 ms 6190 ms 75.00% 4
235 zai-org/glm-4.7-flash limited data novita 9738 ms 9738 ms 9737 ms 100.00% 1
236 baidu/ernie-4.5-vl-28b-a3b limited data novita 0.00% 2
237 stepfun/step-3.5-flash limited data parasail 0.00% 6
238 google/gemma-3-12b-it limited data novita 0.00% 3
239 elephant limited data novita 0.00% 3
240 deepseek/deepseek-v4-pro limited data parasail 0.00% 3 probe_config_error 2
241 kwaipilot/kat-coder-pro limited data novita 0.00% 1
242 sao10k/l31-70b-euryale-v2.2 limited data novita 0.00% 1

Sign in

Choose a sign in method.